pdfOCR

Produced by iText

We are proud to introduce iText pdfOCR, a new iText 7 add-on that provides Optical Character Recognition (OCR) functionality. It converts printed text in scanned documents and images into fully searchable, PDF/A-3u-compliant files (PDF version 1.7), making that text easier and faster to access. The add-on is built on the Tesseract 4 OCR engine and, like Tesseract, is open source, with Java and .NET repositories on GitHub. Tesseract supports over 100 languages and is widely regarded as one of the most popular and accurate open-source OCR engines.

Still have questions?

If you would like to learn more about pdfOCR or have additional questions, contact us.