Search Results for: safedocs
October 2018, Peter Wyatt
Peter Wyatt is the PDF Association’s CTO and an independent technology consultant with deep file format and parsing expertise. A developer and … Read more
February 2020, PDF Association, Inc.
In 2019 the PDF Association created a Massachusetts-based 501(c)(6) non-profit organization, PDF Association, Inc. Originally created to serve as the US government … Read more
June 2021, Stressful PDF Corpus
The “Issue Tracker” corpus of stressful PDF files was originally developed under the DARPA-funded “SafeDocs” program as discussed on pdfa.org. If a … Read more
April 2022, Peter Wyatt
Non-rectangular links are coming to PDF 2.0! This article offers some background to link technology in PDF and discusses the forthcoming Extension … Read more
June 2023, Arlington PDF Model
The Arlington PDF Model is a free and open-source machine-readable data model of all PDF objects.
January 2024, PDF Association staff
LibreOffice now supports PDF/UA! AI LLMs’ support for PDF is substandard, and is retarding innovation. Yet more misinformation about malware. PDFacademicBot for … Read more
February 2021, Stressful PDF Corpus
Understanding the problems faced by diverse parsers can be a great learning experience. The “Issue Tracker” corpus of stressful PDF files was … Read more
February 2019, PDF Association staff
The Electronic Document Conference is a do-not-miss event for developers, product managers and technical users creating and leveraging electronic document technology. Learn … Read more
April 2019, PDF Association staff
The sessions at #EDCSEATTLE2019 may be interesting and relevant, but you still have to convince your boss to let you go! A … Read more
November 2020, Peter Wyatt
The original PDF Issue Tracker corpus generated a lot of interest from the PDF technical community; now version 2 of the “Issue … Read more
November 2020, PDF Association staff
OctoberPDFest recordings are now available! 31 videos offer a wide variety of perspectives on our favorite format.
April 2021, Peter Wyatt
This PDF file tests parsing valid combinations of adjacent PDF tokens both in the body of a PDF as well as in … Read more
PDFs in the wild offer a bewildering amount of variation in syntax, features and structure. For those building parsers or evaluating parsers, … Read more
June 2021, PDF Association staff
pdfa.org resources now include a new technical index, list of GitHub repos and more.
July 2021, PDF Association staff
We preview some of what’s coming at PDF Days Europe 2021 with sessions in accessibility, production, security and core PDF technology.
October 2021, Duff Johnson
The CEO of the PDF Association reflects on the steady expansion of the organization’s mission and the future.
September 2022, Making more sense of PDF structures in the wild at scale
This is a follow-on talk from our 2021 PDF Days presentation on the File Observatory. Our team built the File Observatory to support … Read more
October 2022, PDF Association staff
In a new podcast episode, SE Radio’s Gavin Henry interviews PDF Association CTO Wyatt and CEO Johnson about a wide range of … Read more
January 2023, Peter Wyatt
CTO Peter Wyatt summarizes key takeaways from the first two years of the PDF specification’s public errata process.
August 2023, Peter Wyatt
PDF is a large, complex specification! Developers often look for first steps in its implementation. This article offers some suggestions.