Facebook
Twitter
YOUTUBE
LINKEDIN
XING
About the contributor
PDF Association

Mission Statement: To promote Open Standards-based electronic document implementations using PDF technology through education, expertise and shared experience for stakeholders worldwide.
More contributions
Interview with Ulrich Isermeyer, Sr. Business Development Manager, Adobe Systems GmbH, about PDF Days Europe 2018

Ulrich Isermeyer, Sr. Business Development Manager, Adobe Systems GmbH, will be hosting a presentation titled “The power of 3D PDF” at the PDF Days Europe 2018. In this Interview he gives some background information about it.

Six Questions to Duff Johnson, Executive Director at PDF Association

Our Executive Director Duff Johnson gives some personally information about his experience with the “PDF universe” and much more.

Interview with Kevin Willems, Software Engineer at iText Software, about PDF Days Europe 2018

Kevin Willems, Software Engineer at iText Software, will be hosting a presentation titled “Redaction in electronic documents” at the PDF Days Europe 2018. In this interview Kevin gives some background information about it.

Interview with Vel Genov, Senior Product Manager at Datalogics, about the PDF Days Europe 2018

Vel Genov, Senior Product Manager at Datalogics, will be hosting a presentation titled “Common PDF Interoperability Concerns” at the PDF Days Europe 2018. In this interview Vel gives some background information about it.

Interview with Matt Kuznicki, CPO of Datalogics, about the PDF Days Europe 2018

Matt Kuznicki, CPO of Datalogics, will be hosting a presentation titled “PDF 2.0 Updates to Rendering and Color Processing” at the PDF Days Europe 2018. In this interview Matt gives some background information about it.

What ECM professionals must know about PDF


PDF behind the ECM maskAlthough PDF represents the bulk of content in ECM systems the majority of such implementations do not handle PDF documents much differently than they way they’ve handled TIFF images for the past 25 years.

Here are a few lesser-known facts about PDF that are essential information for ECM and information management professionals responsible for document management.

Not all PDF creation software is equal

The inherent flexibility of the PDF format means that there are an extraordinary variety of ways to package text, images, vectors, fonts and more together in order to achieve a reliable, sharable page. But that doesn’t mean that every PDF is equal. Some PDF production software makes files that can’t be searched. Others produce documents that open or render to the screen very slowly. Still others display incorrect colors, or screw up encoding the source document’s fonts so that text turns into little blocks.

What to do: Don’t compromise on quality. Use established, reputable, PDF libraries, APIs and end-user software. There are many choices available to suit every need and budget.

Exclude software that’s dangerous to your documents

Be aware that many commonplace software can destroy important aspects of PDF documents. Apple’s Preview will destroy digital signatures, tags and more. Instead of adding proper PDF annotations (which can easily be removed), Evernote’s popular tool trashes PDF pages by permanently obscuring page-content with its “notes”.

What to do: Disallow software that risks your documents and does not respect the methods for annotations and other features available in PDF.

Removing PII without risk

Ensuring PDF documents do not contain Personally Identifiable Information (PII) and other privacy or security-related content is a critical aspect of releasing sensitive documents to 3rd parties or into the public domain. From professional-strength search software that finds phrases and text-strings to redaction tools that properly remove content as well as allowing for markup and categorization, there are many options for getting it right… and some avoidable ways to get it wrong.

What to do: Be sure your search software can find all the information you need to remove. One easy check; compare the number of search-terms found by different tools. There will often be a difference. For redaction tools: educate your users that a black highlighter does not “remove” anything, and is in reality a simple way to get fired for leaking sensitive information.

Use ECM software that understands PDF

PDF documents can include far more information than most ECM applications are capable of recognizing. From document metadata to digital signatures, from attached files to archival-grade PDF, most of today’s ECM systems just don’t understand the PDF format beyond the minimum of what’s necessary for rendering. Outcomes include content missing from search results, loss of metadata and other vital workflow information, as well as failures to meet regulatory requirements.

What to do: Ask your ECM vendor to detail their support for PDF. Do they support ISO-standardized PDF (ISO 32000)? Do they support any of the subset specifications such as PDF/A (for archiving) or PDF/UA (for accessibility)? Do they support high-quality PDF, not only for document creation, but also in processing (rendering, searching, indexing, collating, etc.)

Summary

Your systems and users transact PDF files more than any other type of electronic document. PDF deserves your attention in order to maximize its utility… and minimize risks from misuse.

 

© Rolffimages | Dreamstime.com – Man Removes Face To Reveal Mask Underneath Photo


Tags:
Categories: Document Management, ECM, Search