Produced by iText Group NV
We are proud to introduce iText pdfOCR, a new iText 7 add-on that provides Optical Character Recognition (OCR) functionality. It converts printed text in scanned documents and images into fully searchable, PDF/A-3u compliant files (PDF version 1.7), making that text easier and faster to access. The iText pdfOCR add-on is built on the Tesseract 4 OCR engine and, like Tesseract, is open source (Java and .NET GitHub repositories). Tesseract supports over 100 languages and is widely considered one of the most popular and accurate open-source OCR engines.