At the PDF Europe 2018 Joris Schelekens from iText will hosting a presentation titled “Structure Recognition for Information Retrieval and Layout” – what’s that about?”. In this interview he gives some background information about his presentation.Five reasons developers should participate in PDF Days Europe
PDF Days Europe, the annual PDF technology education event, will take place from 14 to 16 May 2018 in Berlin at the Hotel Steglitz International. Of the many good reasons for developers to participate, here are five of the best.5 reasons why those implementing electronic document technologies should attend PDF Days Europe
PDF Days Europe, the annual PDF technology education event, will take place from 14 to 16 May 2018 in Berlin at the Hotel Steglitz International. Of the many good reasons for users to participate, here are five of the best.2018 PDF 2.0 Interop Workshop
Following the success of our previous interop workshops in Cambridge, England and Boston, Massachusetts, the 3rd PDF 2.0 Interop Workshop takes place on May 16, 2018 as part of the post-conference program immediately following this year’s PDF Days Euro …Post-Conference of PDF Days Europe 2018 in Berlin
On Wednesday, May 16, 2018, directly following PDF Days Europe, the PDF Days Post-Conference offers a variety of workshops on PDF 2.0 Interop or PDF/UA.
Over the next few weeks we would like to introduce our speakers on the Technical Conference Europe 2013, their presentation’s topics and their personal view on the world of PDF and related ISO standards. So we prepared some little interviews with the speakers for the conference.
Francois Fernandès studied computer science and economics at the Reutlingen University. Then he started as a developer and consultant at levigo solutions GmbH where he had his first contact with PDF in 2005 some months before the first PDF/A release.
Instead of using the typical Hello World program, he learned the Java programming language by implementing a PDF parser.
The ISO committee around the PDF standards is about to search for a formal representation of the allowed elements within PDF documents. And there is a huge number of legal elements. Were talking about all possible dictionaries with numerous kinds of values within. All those dictionaries and their contents are described in the standards (ISO 32000-1, ISO 19005-1, etc.) but there is no technical (or formal) representation of these definitions.
This is what Adobes DVA and our PDF Formal Syntax is all about: Describing all possible types in both a machine and human consumable language. Hopefully, one day, there will be such a language including some default sets that formally describe what a PDF-1.7, PDF-2.0 or PDF/A-1 may contain. If there are such sets and a well-defined formal language, there will be many possibilities to profit: Generating code for loading and/or writing, validating PDFs, or even generating customizable documentation like this dictionary is represented as the C++-Class XYZ. Building and visualizing differences between PDF-versions can be automated, allowing an easy overview of what has technically changed between two versions of the PDF standard.
I will definitely attend Introduction to PDF 2.0? as Im curious to see what the next big version step in PDF will be and Colour Management for PDF developers as colour management has always been a major pain for my PDF development work.
My personal history with PDF started in 2005, some months before the official release of ISO 19005-1 (PDF/A-1). I started working for levigo solutions GmbH and my task was to write PDF document viewing logic in pure Java. The only problem: I didnt have any knowledge about Java, nor about the PDF standard itself besides using PDFs. So my first Java program wasnt the typical Hello World! program, instead it was a PDF lexer/parser. And I still keep the very first screenshot of my first successfully parsed PDF document.
As I have been working in the area of long term preservation, Id love to see further adoption of PDF/A. PDF/A is the only real option for having future-proof documents that are safe and worth archiving. The adoption is already pretty good, which is also a result of the PDF Associations good work, but it could still be improved.