Is there a way to flag overuse of a word/phrase? Thread poster: Brent Sørensen
| Brent Sørensen Germany Local time: 10:42 Member (2016) German to English + ...
I’m sure there’s a regex that shows if the same word is used more than once within a segment. Is there a way to figure out if you’ve inadvertently used the same word too much within a text? I’m talking more about functional words. | | | Natalie Poland Local time: 10:42 Member (2002) English to Russian + ... Moderator of this forum SITE LOCALIZER
will find repeated words during verification: you don't need to do anything additionally, just run file verification. | | |
By reading through your finished translation like you would surely do anyway? | | | Brent Sørensen Germany Local time: 10:42 Member (2016) German to English + ... TOPIC STARTER Of course I read it through, but sometimes things still go unnoticed… | May 29, 2021 |
Ice Scream wrote: By reading through your finished translation like you would surely do anyway? | |
Roy Oestensen Denmark Local time: 10:42 Member (2010) English to Norwegian (Bokmal) + ... The utility xBench may be of help. | May 29, 2021 |
I cannot guarantee that xBench will find all occurrences of duplicate words in a sentence, but you may want to try it out to see if it helps you. It will often flag problems that the Verify function in Studio does not flag. Roy | | | You might want to try a program like "Repetition Detector" | May 30, 2021 |
Brent Sørensen wrote: I’m sure there’s a regex that shows if the same word is used more than once within a segment. Is there a way to figure out if you’ve inadvertently used the same word too much within a text? I’m talking more about functional words. A program I suggest is Repetition Detector: you run your exported translation through it, and it flags and lists the words you used most often.
[Edited at 2021-05-30 07:24 GMT] | | | Brent Sørensen Germany Local time: 10:42 Member (2016) German to English + ... TOPIC STARTER Thanks :) This is exactly the sort of thing I was looking for | May 30, 2021 |
Riccardo Schiaffino wrote: Brent Sørensen wrote: I’m sure there’s a regex that shows if the same word is used more than once within a segment. Is there a way to figure out if you’ve inadvertently used the same word too much within a text? I’m talking more about functional words. A program I suggest is Repetition Detector: you run your exported translation through it, and it flags and lists the words you used most often. [Edited at 2021-05-30 07:24 GMT] | | | Stepan Konev Russian Federation Local time: 11:42 English to Russian Odd selection | May 30, 2021 |
Riccardo Schiaffino wrote: A program I suggest is Repetition Detector: you run your exported translation through it, and it flags and lists the words you used most often. I got very interested in this program. I installed a trial version. Looks nice. However, the way it selects words seems odd. It selects correctly at the beginning of a document, but after a couple of pages all selections shift away, and the selection refers to the wrong words in the Top-100 column. At first I suspected my 150% scaling was the culprit. But then I switched to 100%, and it still behaves this way. Any idea why this happens? | |
No idea... try the previous version | May 30, 2021 |
Stepan Konev wrote: Riccardo Schiaffino wrote: A program I suggest is Repetition Detector: you run your exported translation through it, and it flags and lists the words you used most often. I got very interested in this program. I installed a trial version. Looks nice. However, the way it selects words seems odd. It selects correctly at the beginning of a document but after a couple of pages all selections just shift away, and selection refers to wrong words in the Top-100 column. First I suspected my 150% scaling was the culprit. But then I switched to 100%, and it still behaves this way. Any idea why this happens? I haven't installed version 2... I have occasionally used version 1 (which is still available for download through their website), and never noticed anything like what you describe. You could try version 1, to see if the problem was already there. Have you tried with different texts? The issue could be a one-off triggered by something in particular in the text you used. Another suggestion: a concordance program, like Laurence Anthony's AntConc, could also be of use. P.S. I've just tried version 2 of Repetition Detector on a couple of long texts (one in English, the other in Italian), and the program seemed to work correctly. Having looked at your screenshot, I was wondering if the reason for the incorrect results could be that your text is written in the Cyrillic alphabet, in a language not listed among those mentioned in Repetition Detector's help file: "Repetition Detector 2 is working with text in the main European languages: English, French, German, Spanish, Portuguese and Italian." or, according to their web site: "The software is available for Windows in English and French but works equally well with texts in Spanish, Portuguese, Italian, German, Dutch, Danish, Norwegian, Swedish, Finnish and Icelandic."
[Edited at 2021-05-30 19:08 GMT] | | | Stepan Konev Russian Federation Local time: 11:42 English to Russian
Riccardo Schiaffino wrote: Repetition Detector 2 is working with text in the main European languages: English, French, German, Spanish, Portuguese and Italian. The same source file in English gives the same result, so it probably relates to the file structure somehow. Those two files (the Russian target in the screenshot and the English source) were in MS Word. However, I tried another 238-page PDF file in Russian, and the program worked correctly in the sense that it captured and highlighted all Russian words with different endings but a common stem. Looks promising. Overusing the same words is my weak spot. Thank you for this useful resource. | | | Another free program that can flag repeated words and phrases | May 30, 2021 |
Another program that (among other things) should be able to flag repeated words and phrases is SmartEdit Writer (free, though a more powerful version is also available for purchase). What exactly does SmartEdit Writer help with and identify: word and phrase repetition; all adverbs used in your work; words and phrases that you choose to highlight, such as common typos or characters you might want to keep an eye on.
[Edited at 2021-05-30 21:01 GMT] | | | Word overuse | May 31, 2021 |
Regexes can also be used for style checking, such as flagging the overuse of a word or suffix in a sentence. I have been told that it is bad style to overuse the “mente” suffix in Portuguese. Multiple occurrences within a segment are easily detected with the following regex. Regex: ^.*?mente\b.*?mente\b.*$ The same technique can be used for the overuse of "of the" in English, which often results from translating from German.
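For anyone who would rather run this kind of check outside a CAT tool, the regex in the post above can be generalised with a few lines of Python. This is only a rough sketch under some assumptions: the segments are plain single-line strings (for example, an exported target text split into sentences), and the helper names `overuse_pattern` and `flag_segments` are made up for illustration, not part of any tool mentioned in this thread. The fragment you pass in is itself a regex, so `mente\b` checks a suffix and `\bof the\b` checks a whole phrase.

```python
import re


def overuse_pattern(fragment: str, min_count: int = 2) -> re.Pattern:
    """Build a regex matching a segment that contains `fragment`
    at least `min_count` times (hypothetical helper).

    With fragment=r"mente\\b" and min_count=2 this produces exactly
    ^.*?mente\\b.*?mente\\b.*$ as quoted in the thread.
    """
    if min_count < 1:
        raise ValueError("min_count must be at least 1")
    # One lazy ".*?fragment" unit per required occurrence.
    unit = r".*?" + fragment
    return re.compile(r"^" + unit * min_count + r".*$", re.IGNORECASE)


def flag_segments(segments, fragment, min_count=2):
    """Return the segments in which `fragment` occurs min_count times or more."""
    pattern = overuse_pattern(fragment, min_count)
    return [seg for seg in segments if pattern.match(seg)]


if __name__ == "__main__":
    # Made-up sample segments, purely for demonstration.
    sample = [
        "Ele falou claramente e rapidamente.",
        "Ele falou claramente.",
        "The roof of the house of the mayor of the town.",
    ]
    print(flag_segments(sample, r"mente\b"))
    print(flag_segments(sample, r"\bof the\b", min_count=3))
```

Raising `min_count` makes the check stricter, which may suit functional words that legitimately appear twice in a long sentence. The pattern works segment by segment, so it will not tell you how often a word occurs across the whole text; a frequency list from one of the programs mentioned earlier is better for that.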