Regular expressions for text editing
Regular expressions (regexes for short) are special rules designed to find a text and/or numbers meeting certain criteria and to do something useful with the search results. An example of „something useful” may include finding all occurrences of dates in any given format (e.g. 03/24/2013) and converting these to a different format (e.g. 24.03.2013), conversion of number formats, removal of multiple spaces from the whole document in one go or joining incorrectly split paragraphs.
Regular expressions are extremely useful in processing text and numbers when preparing text for translation, for example for cleaning up text extracted from PDF files, and in the translation process itself – one can use regexes in SDL Studio and memoQ to perform a variety of actions. And while it is usually relatively simple to create a regex matching the text one needs, quite often the trick is to write a regex which will match only that, and nothing more.
The problem most people have when it comes to regexes is that they look somewhat scary and mysterious. In reality, once you know the meaning of symbols used and some basic rules, most of the time regexes are quite simple and logical. The workshop is designed to introduce regular expressions to anyone with no prior knowledge of regex and provide help and inspiration for people with basic to intermediate knowledge. Starting from very basic up to relatively complex rules with emphasis on translation-related applications, based on the real-life problems and files. After the workshop you should be able to use regexes for efficient text processing and create or modify rules to match complex text strings. We will cover usage for:
- Text editing in MS Word,
- Editing a range of text formats in Notepad++,
- Converting text into tags,
- Using auto-translatable elements,
- Creating and editing segmentation rules,
- Using regular expressions for filtering in CAT tools and Find and Replace,
- Defining custom QA rules in CAT tools and QA software,
- Defining filters to import non-standard files into memoQ, SDL Studio, WordFast and open-source tools.
Participants will receive handouts with regular expression vocabulary and detailed description of all rules created and used during the training.
Who should attend?
Beginner and advanced translators and editors.
What are the benefits?
Learn how to make the most of memoQ.
Improve the quality of your work.
Make your work more satisfying.
Date: Regular Expressions on 10th and 11th of June 2019
Time: 10 to 15:30
Location: Utrecht, Ravellaan 96 at Kantien
Price: 349 euro + VAT (includes light lunch, tea/coffee and free hugs).
Student discount: 20%
Early bird price: 299 euro + VAT (includes light lunch, tea/coffee and free hugs). Valid until the 20th of May.
Please register by filling in the form below or sending us an e-mail stating which workshop you would like to attend. We will send an invoice shortly after.
The workshop is for a maximum of 12 attendees.