
Status: Partner Member
Country: US
Sector: All industries
Joined at: Feb 08
Website: http://www.datalogics.com/

Linked User
Maryanne Pavlin
Matt Kuznicki
Nicki Bullock
Emma Kaschke
Leonard Ho

Identify PDF Problems Early with PDF CHECKER

Introduced in 2018, Datalogics PDF CHECKER is a free, easy-to-configure scriptable server tool designed to detect many different types of PDF problems. Combined with Datalogics PDF OPTIMIZER, PDF CHECKER can help to automate document workflows.


PDF is a robust document format that is used worldwide to store and transport information. That robustness and market ubiquity also bring challenges for workflows that require consistent and reliable PDFs. For example, a glossy magazine publishing workflow will often use PDFs with very high resolution images and complex transparency interactions. By contrast, a distributed printing workflow, where PDFs are transmitted worldwide for remote printing on low-cost printers, needs compact PDFs to minimize bandwidth usage. Early detection and correction of the PDF issues that affect your specific workflow is therefore crucial.

How can PDF CHECKER help?

Built on the Adobe PDF Library, PDF CHECKER is an ideal early warning solution for flagging potentially problematic PDF files before they make it into your document management system or workflow. With this knowledge of your PDF documents, you can then take the necessary steps to apply appropriate changes to them. Let’s take a quick look. There are six distinct categories of checks that can be performed:

  • General – checks PDFs for digital signatures, PDF/A compliance claims, or password requirements, as well as for syntax issues that prevent a PDF from opening
  • Cleanup – checks for suboptimal compression
  • Fonts – checks for fonts that are not embedded, fonts that are fully embedded but could be made smaller by subsetting, and several common font descriptor error conditions
  • Objects – checks for JavaScript and thumbnails
  • Userdata – checks for a variety of object types that may be unnecessary or restricted for your intended usage, such as annotations, OCGs (layers), transparency, embedded files, and more
  • Images – checks for images whose resolution is too high or too low, or that use specific compression types, with distinct settings for color, grayscale, and monochrome images
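
To make the image resolution check more concrete, here is a minimal Python sketch of the arithmetic behind such a check. It is not PDF CHECKER's implementation or configuration format. The function names and the 150/300 ppi thresholds are hypothetical, chosen only for illustration. The sketch relies on one fact from the PDF specification: the default user space unit is 1/72 inch, so an image's effective resolution follows from its pixel dimensions and the size at which it is placed on the page.

```python
def effective_ppi(pixel_width, pixel_height, placed_width_pt, placed_height_pt):
    """Return the (horizontal, vertical) effective resolution in pixels per inch.

    placed_*_pt is the size of the image as drawn on the page, in PDF
    default user space units (1 unit = 1/72 inch).
    """
    if placed_width_pt <= 0 or placed_height_pt <= 0:
        raise ValueError("placed size must be positive")
    return (pixel_width * 72.0 / placed_width_pt,
            pixel_height * 72.0 / placed_height_pt)


def classify_resolution(ppi, low=150.0, high=300.0):
    """Classify a resolution against hypothetical low/high thresholds.

    The default thresholds are illustrative assumptions, not values
    taken from PDF CHECKER.
    """
    if ppi < low:
        return "too low"
    if ppi > high:
        return "too high"
    return "ok"


# A 1200 x 600 pixel image placed at 4 x 2 inches (288 x 144 points).
horizontal, vertical = effective_ppi(1200, 600, 288, 144)
print(horizontal, vertical, classify_resolution(horizontal))
```

A real workflow would typically use different thresholds for color, grayscale, and monochrome images, which is why the check above takes the limits as parameters rather than hard-coding them.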

Once you have identified potential problems in your PDF documents, the next step, of course, is to address them by processing your documents with a PDF editing or optimization tool. Datalogics PDF OPTIMIZER is an ideal complement, containing profile options that match each of PDF CHECKER’s profile options.

We encourage you to try PDF CHECKER, available as a free download here.

Related Products
Adobe PDF Library

The Adobe PDF Library SDK is a low-level PDF library that contains a powerful set of native C/C++ APIs, with interfaces for .NET and Java. Systems integrators, independent software vendors (ISVs), enterprise IT developers, and others can integrate Adobe PDF functionality within custom applications in a client and/or server environment.

PDF Java Toolkit

Datalogics PDF Java Toolkit is a native Java library that provides high-level APIs for automating PDF workflows like processing PDF forms, verifying digital signatures, and extracting text. It also offers low-level APIs for working directly with the structure of the PDF for those times you need it.

Adobe PDF Converter

Adobe Normalizer is an API that allows developers to quickly and easily convert Encapsulated PostScript (EPS) and PostScript (PS) files to Adobe’s Portable Document Format (PDF). The industry-standard Adobe Distiller and Distiller Server are themselves built upon the PDF Converter SDK, and this API is now available separately to application developers.

Adobe PDF Print Engine

The Adobe PDF Print Engine is a common rendering engine technology, packaged as a software development kit (SDK). It can be the basis for a variety of products for previewing and printing Adobe Portable Document Format (PDF) documents at different stages of the professional print workflow.


PDF2IMG

Datalogics PDF2IMG is a command-line utility that converts PDF files to a variety of image formats including PNG, JPG, TIFF, BMP, and more. It is built upon the Adobe PDF Library and uses Adobe technology for unrivaled color management during the PDF conversion process.

PDF Alchemist

Datalogics PDF Alchemist is a new C/C++ SDK for intelligently extracting text and images from PDFs and exporting them to HTML5 or EPUB. It employs sophisticated techniques to identify and reconstruct “text flows” within the PDF.