It’s a question that vexes vendors of web-based solutions everywhere: why do people still insist on PDF files? And why does PDF’s mindshare keep going up? “PDF is such antediluvian technology!” they say. “It’s pre-web, are you kidding me? It’s so old-f …PDF Association technical resources: an overview
PDF is PDF because files produced with one vendor’s software can be read using a different vendor’s software with no loss of fidelity. Interoperability is key to our industry. The PDF Association is a international membership organization dedicated to …2022: The last year of paper for records-keeping
NARA (The National Archives and Records Administration) is the final depository for the long-term records generated by all other agencies of the U.S. Federal Government. The agency has a key role in preserving the cultural history of the republic as we …PDF 2.0 examples now available
The PDF Association is proud to present the first PDF 2.0 example files made available to the public. Created and donated to the PDF Association by Datalogics, this initial set of PDF 2.0 examples were crafted by hand and intentionally made simple in construction to serve as teaching tools for learning PDF file structure and syntax.PDF 2.0 interops help vendors
The PDF 2.0 interop workshops included many vendors with products for creating, editing and processing PDF files. They came together in Boston, Massachusetts for a couple of days to test their own software against 3rd party files.
All business leaders today know that a large portion of their organizations value lies in its documents. Documents also form the basis for demonstrating proof in legal disputes. The flood of emails in businesses is now requiring archivists to confront the special demands of retaining digital documents. This challenge is too often left up to the suppliers of archiving systems alone.
In addition, most manufacturers of archiving systems will suggest that creating audit-proof documents can be achieved automatically. Before this can be accomplished though, a whole line of measures has to be taken. For example the archiving format, as well as organizational measures like consolidating emails together with other documents into dossiers, are pivotal. Also, the lifecycle of a retained email is generally considerably longer than the life expectancy of the electronic archiving system. Emails may have to be traceable, faultlessly reproducible and legible after ten, twenty or more years depending on statutory provisions. The long-term retention of emails remains therefore a difficult challenge.
Emails are depending on their content business records in digital format. The retention provisions are regulated through legislation; in many countries there also exist additional directives for digital dossiers. As a further measure, records management is standardized in ISO 15489 as international best practice. There are numerous collective requirements, independent of specific legislation in a respective country, that have to be taken into consideration, especially for companies that are internationally active:
The Functionality of an Archiving System
Archive systems save files (objects), relationships between objects (relations) and descriptions of the files (metadata). As opposed to operational DMS systems, archive systems should support retention periods, especially the managed deletion of files, or better said of dossiers, once the retention period has expired. They should additionally guarantee protection from unauthorized access and mutation. Even more: the system should support access supervision through means of a register. Finally, the archive system should ensure safety of the data, for example through a redundant storage of the data and regular copying of the data to a new data medium.
The Choice of Data Formats is Essential
Most manufacturers of archiving systems will claim that the file system used for archiving is irrelevant. Even more: they allege that the requirement of audit-proof documentation is fulfilled solely upon the fact that their archiving system is being used. This assumption is not fully correct. The archiving format plays a much larger role than the archive manufacturer would like to admit. The customer should be aware that the life cycle of a document is generally considerably longer than that of the archiving system. As well, in the customers best interest, the role of the document and that of the archiving system should be clearly separated. For example, proof of a documents integrity, as well as the traceability of changes, belong to the characteristics of the document format.
Based on this awareness, archivists have selected PDF/A as their preferred file format. PDF/A was developed specifically for the long-term retention of digital documents and is the format of choice for most archives in Europe (and predominantly in Germany and Switzerland). The archiving system should ensure that only documents which adhere to this above mentioned format are stored. This can be ensured through use of a so-called Gate Keeper in the form of suitable validation software (validator).
Electronic Signature Identifies Changes to a Document
Electronic signatures fulfill four essential functions:
It is important to understand that neither the electronic signature nor a specific document format (whether PDF or TIFF) can prevent a digital document from being either optically or quantitatively changed via technical means. However, the digital signature can ensure that the change is recognizable and reproducible at all times.
The Technical Solution From a technical point of view, emails display several unique characteristics. They contain attachments in all imaginable file formats, like office documents, CAD drawings and many more. The files are often contained in structured containers (ZIP and RAR), which themselves can be repackaged countless times over. And more: emails often contain several variations of the same information, for example as pure text as well as HTML files, and can therefore be ambiguous. In addition, emails contain an abundance of information that is not immediately visible, for example the name of the sending and receiving email servers, date and time of reception and further information about the file.
Figure: Converting emails with attachments for long-term archiving
Emails cannot be saved in an archive under these conditions. The hardware devices available on the market (archive appliances) that write emails from a server directly into an archive can at best be classified as simple backup devices. Due to the long-term legibility and explicit reproducibility requirements, emails and their attachments have to be converted to PDF/A format, annotated with metadata, and integrated with other digital documents into dossiers and records. When converting to PDF/A, it is vital to ensure that the reproduction of the content in the PDF/A document is carried out in the same manner as the user sees it in the email. This is, due to the above mentioned ambiguous information formats and the multitude of email and office applications and versions in use, not something that can be taken for granted. But it is technically resolvable.
Even though emails have their unique technical characteristics, they can and should be treated like all other digital documents. Company-wide conversion solutions with product designations like Document Converter Service or Rendition Service can be referred to for the implementation. Solutions that are dedicated to email archiving alone are in the contrary not recommended: they generally either inadequately or do not at all support the conversion to PDF/A and the consolidation of digital documents to dossiers.
Practically every organization must give thought to archiving emails, whether now or in the future. The legal, organizational, economic and technical aspects of an implementation project must be adequately highlighted and adapted to match the organizations needs. For this reason it is recommended to reserve sufficient funds for consulting services in the preliminary phases of an email archiving project. This groundwork will provide the decision makers in an organization with the right balance between archiving project expenses and the risk of legal actions, giving the project a suitable guideline for the implementation.
Source: www.dokmagazin.de, December 2011 (translation)
Author: Dr. Hans Bärfuss (PDF Tools AG)