Poll: If you use a CAT-tool, how do you prefer the text to be segmented?
Thread poster: ProZ.com Staff
ProZ.com Staff
ProZ.com Staff
SITE STAFF
Jun 26, 2021

This forum topic is for the discussion of the poll question "If you use a CAT-tool, how do you prefer the text to be segmented?".

View the poll results »



 
Mario Freitas
Mario Freitas  Identity Verified
Brazil
Local time: 19:19
Member (2014)
English to Portuguese
+ ...
The shortest, the better Jun 26, 2021

One thing I learned fast with CAT tools is that the shorter the segment, the more useful it will be in the future. Smaller segments are the ones that give you 101% matches and autofill. The longer the segment, the less likely it is for this to happen. Configuring your CAT to segment by paragraph is wasting a big part of its functionalities.

Anton Konashenok
Ventnai
Ana Vozone
neilmac
Thayenga
Elena Kharlamova
MollyRose
 
Vasaporn Chaiyakul
Vasaporn Chaiyakul  Identity Verified
Thailand
Local time: 05:19
Member (2021)
English to Thai
+ ...
Sentence preferred, but.. Jun 27, 2021

In many Asian languages, there's no easy way (or perhaps no way in my pair, for example) for the tool to separate these continuous characters into sentences.

[Edited at 2021-06-27 08:41 GMT]


 
Christine Andersen
Christine Andersen  Identity Verified
Denmark
Local time: 00:19
Member (2003)
Danish to English
+ ...
Sentences, as long as you can override the automatic segmentation manually Jun 27, 2021

Quite often, I find sentences in the source language do not correspond exactly to sentences in the target.
Sometimes I simply split one into two, but it is useful to be able to merge two or three sentences or divide them differently.

What really is impossible to work with is when the text has not been properly formatted, and the CAT segments it into lines instead of sentences, or peppers the source with irrelevant tags...


 
Maria Teresa Borges de Almeida
Maria Teresa Borges de Almeida  Identity Verified
Portugal
Local time: 23:19
Member (2007)
English to Portuguese
+ ...
I don't use a CAT tool Jun 27, 2021

Dinosaurs don't like cats...

dinosaur


 
Samuel Murray
Samuel Murray  Identity Verified
Netherlands
Local time: 00:19
Member (2006)
English to Afrikaans
+ ...
Long for intermediary tools, short for matching tools Jun 27, 2021

I prefer long segments for the intermediary formats, and short segments for matching tool. In other words, if the end-client starts off with an IDML file, and the first-level agency converts it to SDLXLIFF, and the second-level agency then distributes it via a MemoQ server, from where I would then export it for translation in e.g. OmegaT, then I would prefer OmegaT to segment by sentence, but MemoQ and Trados to segment by paragraph.

 
Gibril Koroma
Gibril Koroma  Identity Verified
Canada
Local time: 18:19
French to English
+ ...
By sentence Jun 27, 2021

It's easier for me.

MollyRose
 


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Jared Tabor[Call to this topic]

You can also contact site staff by submitting a support request »

Poll: If you use a CAT-tool, how do you prefer the text to be segmented?






Protemos translation business management system
Create your account in minutes, and start working! 3-month trial for agencies, and free for freelancers!

The system lets you keep client/vendor database, with contacts and rates, manage projects and assign jobs to vendors, issue invoices, track payments, store and manage project files, generate business reports on turnover profit per client/manager etc.

More info »
Trados Business Manager Lite
Create customer quotes and invoices from within Trados Studio

Trados Business Manager Lite helps to simplify and speed up some of the daily tasks, such as invoicing and reporting, associated with running your freelance translation business.

More info »