Facebook
Twitter
YOUTUBE
LINKEDIN
XING
PDFlib GmbH
Status: Full Member
Country: DE
Sector: All industries
Contact:
Joined at: Sep 06
Website: http://www.pdflib.com/

Linked User
Rainer Plöckl
Stephan Mühlstrasser
Thomas Merz

PDFlib TET PDF IFilter – Enterprise PDF Search for Windows



TET PDF IFilter extracts text and metadata from PDF documents and makes it available to search and retrieval software on Windows. This allows PDF documents to be searched on the local desktop, a corporate server, or the Web. TET PDF IFilter is based on the patented PDFlib Text and Image Extraction Toolkit (TET), which is an established developer product for reliably extracting text from PDF documents.

TET PDF IFilter is a robust implementation of Microsoft’s IFilter indexing interface. It works with all search and retrieval products which support the IFilter interface, e.g. SharePoint and SQL Server. Such products use format-specific filter programs – called IFilters – for particular file formats, e.g. HTML. TET PDF IFilter is such a program, aimed at PDF documents. The user interface for searching the documents may be the Windows Explorer, a Web or database frontend, a query script, or a custom application. As an alternative to interactive searches, queries can also be submitted programmatically without any user interface.

Based on patented TET technology

PDFlib TET, the basis of TET PDF IFilter, was first released in 2002, and has been used by customers worldwide in server and desktop environments. As an alternative to extracting PDF page contents and metadata as raw text, TET can supply the document contents in XML format. TET is also available as a free plugin for Adobe Acrobat; this plugin allows interactive test and evaluation of TET’s superior text and image extraction.

Unique advantages

TET PDF IFilter offers the following advantages:

  • Supports Western text, Chinese, Japanese, and Korean (CJK) text and right-to-left languages such as Arabic and Hebrew
  • Indexes protected documents and extracts text even from PDFs where Acrobat fails
  • Supports Unicode folding, decomposition, and normalization
  • Deployment: thread-safe, fast and robust, 32- and 64-bit versions
  • Automatic script and language detection for improved search

Enterprise PDF search

TET PDF IFilter is available in fully thread-safe native 32- and 64-bit versions. You can implement enterprise PDF search solutions with TET PDF IFilter and the following products:

  • Microsoft SharePoint Server 2013 and earlier
  • Microsoft Search Server
  • Microsoft SQL Server
  • Microsoft Exchange Server
  • Mirosoft Site Server

TET PDF IFilter can be used with all other Microsoft and third-party products which support the IFilter interface.

Desktop PDF search

TET PDF IFilter can also be used to implement desktop PDF search, e.g. with Windows Search, which is integrated in Windows.

TET PDF IFilter is freely available for non-commercial use on desktop operating systems, which provides a convenient basis for test and evaluation.

Location
Franziska-Bilek-Weg 9, 80339 München, Deutschland



Related Products
PDFlib FontReporter


PDFlib FontReporter is a free plugin for analyzing fonts in PDF documents.

PDFlib Products for Mobile Devices and Embedded Platforms
PDFlib products for generating and processing PDF documents on smartphones and tablets are available for mobile devices and embedded platforms

PDFlib pCOS – PDF Information Retrieval Tool


PDFlib pCOS provides a simple and elegant facility for retrieving any information from a PDF document which is not part of the page contents.

PDFlib PLOP DS - PDF Linearization, Optimization, Protection, Digital Signature


PLOP DS (Digital Signature) a versatile tool for linearizing, optimizing, repairing, analyzing, encrypting and decrypting and digitally signing PDF documents.

PDFlib PLOP - PDF Linearization, Optimization, Protection



PDFlib TET Plugin


The free TET Plugin provides easy access to the PDFlib Text Extraction Toolkit (TET).

PDFlib TET PDF IFilter - Enterprise PDF Search for Windows



PDFlib TET


PDFlib TET (Text and Image Extraction Toolkit) reliably extracts text, images and metadata from PDF documents. TET makes available the text contents of a PDF as Unicode strings, plus detailed colour, glyph and font information as well as the position on the page.

PDFlib Personalization Server (PPS)


The PDFlib Personalization Server (PPS) includes PDFlib+PDI plus additional functions for variable data processing using PDFlib Blocks.

PDFlib+PDI


PDFlib+PDI includes all PDFlib functions, plus the PDF Import Library (PDI).

PDFlib


PDFlib is the leading developer toolbox for generating and manipulating files in the Portable Document Format (PDF).