Howto Make scanned PDFs searchable (OCR) using pdfocr: "


What pdfocr is for


Suppose you have a PDF document that was made using a scanner, or that otherwise consists of image data with no text data. PDF readers and desktop search applications can’t search such a PDF. pdfocr is a simple utility I made that takes a PDF file and generates a new one with a text layer added, so your PDF reader can search it and your desktop search application can index it, while it still looks identical when printed.
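
As a rough illustration of that workflow, here is a minimal Python sketch that wraps the tool for one or more scanned files. It rests on assumptions the text above does not confirm: that pdfocr is installed and on your PATH, and that it takes the input with -i and the output with -o (check the tool's own help or man page before relying on this). The helper names and the "-ocr" output suffix are invented for this example.

```python
from __future__ import annotations

import shutil
import subprocess
import sys
from pathlib import Path


def searchable_name(source: Path) -> Path:
    """Return a sibling path, e.g. scan.pdf -> scan-ocr.pdf.

    The "-ocr" suffix is this sketch's own convention, not something pdfocr requires.
    """
    return source.with_name(f"{source.stem}-ocr{source.suffix}")


def build_pdfocr_command(source: Path, target: Path) -> list[str]:
    """Build the argument list for one pdfocr run.

    ASSUMPTION: pdfocr accepts "-i <input>" and "-o <output>".
    Verify against your installed version before use.
    """
    return ["pdfocr", "-i", str(source), "-o", str(target)]


def make_searchable(source: Path) -> Path:
    """Run pdfocr on one file and return the path of the new, searchable PDF."""
    if shutil.which("pdfocr") is None:
        raise FileNotFoundError("pdfocr was not found on PATH")
    target = searchable_name(source)
    # check=True raises CalledProcessError if pdfocr exits with an error.
    subprocess.run(build_pdfocr_command(source, target), check=True)
    return target


if __name__ == "__main__":
    # Usage: python make_searchable.py scan1.pdf scan2.pdf ...
    for name in sys.argv[1:]:
        print(make_searchable(Path(name)))
```

The original scan is left untouched; the searchable copy is written next to it, so you can compare the two before deleting anything.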

(...)

© admin for Ubuntu Geek, 2010.

"