Featured articles

How to read ISO publications about PDF

PDF versions

Game-changing new free specification enables interoperable reuse and accessibility for PDF

Discover pdfa.org

Review publications from the PDF Association and ISO
The technical index lists critical resources for developers
Learn which companies are PDF Association members
Review hundreds of presentations from our events

Key resources

Try the new VS Code extension for PDF syntax
Check out our cheat sheets for PDF developers
Get PDF’s latest specification, ISO 32000-2 at no cost
Add ISO 32000-2 errata via our public GitHub repo, and check out the resolutions

Get involved

Discover the benefits of PDF Association membership
Join the PDF Association!
Review the PDF technical community’s working groups

How do you find the right PDF technology vendor?
Use the Solution Agent to ask the entire PDF communuity!

The PDF Association celebrates its members’ public statements
of support for ISO-standardized PDF technology.

Explore membership benefits

Get ISO 32000-2 at no cost

Become a Member!

Deriving HTML from PDF

Produced by Normex

Implementation of an algorithm that converts well-tagged pdfs into HTML.

Since 2017 we have been actively participating in PDF Association Technical Working Group with the aim to address needs of industry for changing the way PDF files are consumed on mobile devices. The main concern was whether or not the traditional fixed-layout pdf contains enough information to be safely and unambiguously interpreted as html - therefore responsive and reusable in different environments.

The output of the work is the Derivation algorithm - document that describes how the process of conversion could be done.

As a part of the work we came up with referential set of pdf documents and implementation. These should provide enough insights into the whole concept.

If you are interested in the work, please follow us on github: https://github.com/Normex/PDF-Derivation

WordPress Cookie Notice by Real Cookie Banner