Pages in topic:   < [1 2 3 4] >
Tips for PDF translation
Thread poster: Mauricio Coitiño
Anna Villegas
Anna Villegas
Mexico
Local time: 03:28
English to Spanish
Here, Yaotl. Aug 24, 2009

One more of your own. Click on the link below:

You have your own OCR on your PC


 
Pablo Bouvier
Pablo Bouvier  Identity Verified
Local time: 11:28
German to Spanish
+ ...
Get a PDF translated in 5 steps Aug 24, 2009

Yaotl Altan wrote:

Pablo:
I've downloaded a Solid Converter PDF trial version and it works really fine!


Yes, SolidConverter PDF does most of the job, if you use the real page convertig method. The problem is that MS-Word neither some MS-Word based CAT tools does not see the text in textboxes. Did you try the method that I suggested with the werecat macro?

BR


 
Jande
Jande  Identity Verified
Australia
Local time: 19:28
Danish to English
+ ...
Tricks for translating pdf's Aug 24, 2009

First method:
Print out the pdf.

or

Arrange your screen to see both the source pdf and final (word) file.

Start translating.


Second method:
You can take a snapshot of the pdf or do a print screen (Print Scrn) and then paste it into a word document.

With two word files (if you managed to get your pdf into a word document)

Open both documents source and final document and use the "window" "compare side
... See more
First method:
Print out the pdf.

or

Arrange your screen to see both the source pdf and final (word) file.

Start translating.


Second method:
You can take a snapshot of the pdf or do a print screen (Print Scrn) and then paste it into a word document.

With two word files (if you managed to get your pdf into a word document)

Open both documents source and final document and use the "window" "compare side by side with" feature in word for windows.

I usually find that my pdf translations are as good as my word ones. I love to copy the pictures and edit them in bitmap, so the final document is a carbon copy of the original, except in another langauage.

It is easier to remember to translate headers, footers and diagrams, if you do pdf's.

As the final document should look like the first, you can check to see if you've missed any sentances by comparing paragraphs, tables etc. Especially if you use the side by side feature.
Collapse


 
Sergei Leshchinsky
Sergei Leshchinsky  Identity Verified
Ukraine
Local time: 12:28
Member (2008)
English to Russian
+ ...
My way of dealing with PDFs Feb 12, 2010

Hopefully, I have few now. But I used to process many, so I have an algorithm.
Here we go:

1) Take a PDF file (the easiest step).
2) Remove password protection, if any, using tools available on the Net.
3) Feed it to some OCR tool (I use FineReader 8 and 9 -- there is difference*).
4) Recognize and save as RTF.
5) Open in WordPad and save as DOC (thus you remove all unnecessary formatting
... See more
Hopefully, I have few now. But I used to process many, so I have an algorithm.
Here we go:

1) Take a PDF file (the easiest step).
2) Remove password protection, if any, using tools available on the Net.
3) Feed it to some OCR tool (I use FineReader 8 and 9 -- there is difference*).
4) Recognize and save as RTF.
5) Open in WordPad and save as DOC (thus you remove all unnecessary formatting).
6) Open in Word and format the source to match the original (to eliminate the bugs that might affect segmentation).
7) Translate using a CAT tool.
8) Check.
9) Double check.
10) Print to PDF using, say DoPDF driver and...

... let the client be happy with PDF!

* In FR8, unlike FR9, you can disable "formatting using frames" (so skip Step 5). FR9 is good at recognizing very small fonts, dirty paper and wavy lines (but you might get frames).

[Редактировалось 2010-02-12 22:20 GMT]
Collapse


 
Brian Young
Brian Young  Identity Verified
United States
Local time: 02:28
Danish to English
PDF conversion Feb 13, 2010

My outsourcers now use a program called Able2docpro to convert PDF's to word. It works extremely well, and retains all of the original formatting.
If I have to work with a PDF I just open it in a window, and then open a word doc in a new window. I size the windows so that they both appear on the same screen; the PDF on the top, the word doc on the bottom. This has really worked well for me.
I agree that converting PDF to word is not what I would call DTP.
I always try to get t
... See more
My outsourcers now use a program called Able2docpro to convert PDF's to word. It works extremely well, and retains all of the original formatting.
If I have to work with a PDF I just open it in a window, and then open a word doc in a new window. I size the windows so that they both appear on the same screen; the PDF on the top, the word doc on the bottom. This has really worked well for me.
I agree that converting PDF to word is not what I would call DTP.
I always try to get the original documents, but sometimes they are simply not available. In those cases I translate the language, and let someone else do the DTP, if that is required to replicate complicated formatting, or the insertion of graphics. I don't know how to do that type of work, and I don't even care to learn.
As far as OCR'ing the PDF, then you need both documents, because very often the OCR either misses too many things, or it does not recognize certain characters. You need the PDF to check the accuracy of the OCR'ed version.
I work on a mac, using Office for mac. I don't think Able2docpro will run on a mac, but I have not looked into that. It works great on a PC.
Collapse


 
Yaotl Altan
Yaotl Altan  Identity Verified
Mexico
Local time: 03:28
Member (2006)
English to Spanish
+ ...
Thank you. Feb 13, 2010

Thank you, Tadzio.

 
sebdea
sebdea
Romania
Local time: 12:28
My recommandation Apr 28, 2010

Yaotl Altan wrote:

Thank you, efreitag, but which is better? Abbyy Fine Reader 9 or Abbyy PDF Transfomer? The first costs 400 USD and the second 60 USD. Am I missing something?

[Edited at 2009-08-22 21:50 GMT]


I kindly recommend you to buy Fine Reader 9. it is more powerfull then Transformer. Still, Fine Reader has its own limitations. I have trouble when I have images inserted in the middle of text, or tables with a complex structure.

Seba


 
sebdea
sebdea
Romania
Local time: 12:28
PDF conversion is DTP (in my opinion) Apr 28, 2010

Hi,

I work as DTP operator for a translation company based in Timisoara. Clients usually are sending PDF files to be translated. If the clients need back only a PDF file with translation I use InFix PDF Editor. Great tool. But most of the clients are sending low res PDF files and want back an editable file (mostly ask for MS Word). In this case I do DTP job. I use Abby Fine Reader 9. Convert files into MS Word and do layout to match exactly with the original.

My real pr
... See more
Hi,

I work as DTP operator for a translation company based in Timisoara. Clients usually are sending PDF files to be translated. If the clients need back only a PDF file with translation I use InFix PDF Editor. Great tool. But most of the clients are sending low res PDF files and want back an editable file (mostly ask for MS Word). In this case I do DTP job. I use Abby Fine Reader 9. Convert files into MS Word and do layout to match exactly with the original.

My real problem appears when the client need an estimation for translation and for DTP. In this case the translators ask me for an editable file (they get a low res PDF from client) to import it into CAT Tools for word-counting. This is the challenge! What shall be done? I usually convert PDF into MS Word without formatting the doc. But this procedure is time consuming. And there are cases when clients don't work with us after they receive the estimation.

When PDF files are at a high resolution and are clean I use InFix PDF Editor to export PDF into XML file. The translators import the file into CAT Tools and do word counting for price estimation to the clients. But not always works so fine.

So, my question to you all is as follows. What has to be done to avoid working several hours for nothing?

Thank you, in advance.

Seba

[Edited at 2010-04-28 10:51 GMT]
Collapse


 
Mauricio Coitiño
Mauricio Coitiño  Identity Verified
Uruguay
Local time: 06:28
English to Spanish
+ ...
TOPIC STARTER
No need to convert PDF for wordcount Apr 28, 2010

There are tools for PDF wordcount, and you don't even need to download them. Just google them. Online Wordcount and Abracadabra are just examples.
Even if you don't use this tools, you just need to convert the file (it iakes a few seconds, unless it's a hundred-page doc) and then use any wordcount tool for Word files. You can leave the program working while you do something else, right?
Regards,

Mauricio


 
Vito Smolej
Vito Smolej
Germany
Local time: 11:28
Member (2004)
English to Slovenian
+ ...
SITE LOCALIZER
trick #N Apr 28, 2010

dont fall for it.

Or, in dire straits, quote a reasonable bozo price - pi as a blot factor is realistic.


 
sebdea
sebdea
Romania
Local time: 12:28
You are very wrong - No offence Apr 29, 2010

Mauricio Coitiño wrote:

There are tools for PDF wordcount, and you don't even need to download them. Just google them. Online Wordcount and Abracadabra are just examples.
Even if you don't use this tools, you just need to convert the file (it iakes a few seconds, unless it's a hundred-page doc) and then use any wordcount tool for Word files. You can leave the program working while you do something else, right?
Regards,

Mauricio


They need a more detailed word count (e.g. Repetitions found - Pretranslated - Partially Translated - 100-95% - 94-85% - 84-75% - 74-50% - Remaining not translated units - Total)

I do believe that kind of word count you will never get it with an online so called word count software.

Best regards,
Seba


 
Mauricio Coitiño
Mauricio Coitiño  Identity Verified
Uruguay
Local time: 06:28
English to Spanish
+ ...
TOPIC STARTER
No offence: just a bit of accuracy Apr 29, 2010

Dear Seba,

Let's not confuse word count with text analysis. My post dealt with the former, and that may be enough for certain projects.
Like you, I haven't used these tools yet, but I wouldn't dare to say they can't analyse. Just give them a try.
Regards,

Mauricio


 
sebdea
sebdea
Romania
Local time: 12:28
A minor misunderstanding May 4, 2010

Mauricio Coitiño wrote:

Dear Seba,

Let's not confuse word count with text analysis. My post dealt with the former, and that may be enough for certain projects.
Like you, I haven't used these tools yet, but I wouldn't dare to say they can't analyse. Just give them a try.
Regards,

Mauricio


My mistake. There is a problem with our PM. They always ask for wordcount. As you said is text analysis. Anyway. I will try use those tools but I am concerned about low resolution PDF. Those files are giving me a headache.


 
Catherine Muir
Catherine Muir  Identity Verified
Australia
Local time: 19:28
Indonesian to English
+ ...
In memoriam
Any current advice on converting image-based PDFs to MS Word? Aug 15, 2010

I have just stumbled upon this thread and, because almost all my jobs arrive as image-based PDFs, I am interested to know which conversion software gets the general nod of approval these days. I downloaded the trial version of ABBYY Fine Reader V.10, on the advice of a client, and it seems to work well, but it's expensive to buy. Is this still considered to be the best software for converting image-based PDFs, or is there something better out there now?

 
José Henrique Lamensdorf
José Henrique Lamensdorf  Identity Verified
Brazil
Local time: 06:28
English to Portuguese
+ ...
In memoriam
Update in InFix Aug 15, 2010

It all depends whether one actually want to convert a PDF into a DOC just to translate it and rebuild the original PDF pub translated, or if the text from the PDF is needed for other purposes, i.e. a newly designed pub.

For translation of a "distilled" PDF, InFix Pro is almost there. Apart from a few minor fixes it still needs, it's working! I delivered a translated PDF last week done
... See more
It all depends whether one actually want to convert a PDF into a DOC just to translate it and rebuild the original PDF pub translated, or if the text from the PDF is needed for other purposes, i.e. a newly designed pub.

For translation of a "distilled" PDF, InFix Pro is almost there. Apart from a few minor fixes it still needs, it's working! I delivered a translated PDF last week done with it. A short demo video is available at http://www.iceni.com/infixTutorials/Translation/translate.htm

Most - possibly all - of the features I'd want are already there. For the first time I finally read through the whole 150-page manual, and discovered them.
Collapse


 
Pages in topic:   < [1 2 3 4] >


To report site rules violations or get help, contact a site moderator:

Moderator(s) of this forum
Laureana Pavon[Call to this topic]

You can also contact site staff by submitting a support request »

Tips for PDF translation






Wordfast Pro
Translation Memory Software for Any Platform

Exclusive discount for ProZ.com users! Save over 13% when purchasing Wordfast Pro through ProZ.com. Wordfast is the world's #1 provider of platform-independent Translation Memory software. Consistently ranked the most user-friendly and highest value

Buy now! »
TM-Town
Manage your TMs and Terms ... and boost your translation business

Are you ready for something fresh in the industry? TM-Town is a unique new site for you -- the freelance translator -- to store, manage and share translation memories (TMs) and glossaries...and potentially meet new clients on the basis of your prior work.

More info »