PDF Association logo

Discover pdfa.org

Key resources

Get involved

How do you find the right PDF technology vendor?
Use the Solution Agent to ask the entire PDF communuity!
The PDF Association celebrates its members’ public statements
of support
for ISO-standardized PDF technology.

Member Area

Working together at PDF Days Europe 2022; picture of Xiong Longfei

Presented at PDF Days Europe 2022
( 2022, Sep )

Deformed table restoration in scanned PDF

Best practice: Better table reconstruction results for scanned PDFs or pictures under complex scenes.

Excerpt: As one of the most common data-representation formats, the table has taken a significant place in documents because of its capability of representing the wealth of information. However, in reality, a vast of tables are published in an uneditable format such as scanned PDF or photo. Therefore, the table reconstruction project aims to detecting the table regions and recognizing the contents from the uneditable files. Then, it would extract the valid information and recognize the table structure fr … Read more
About the presenter(s)

Xiong Longfei is a deputy technical director of Kingsoft Office. He works in artificial intelligence for several years and is a committee member of the CSIG(China Society of Image and … Read more


Xiong Longfei
Kingsoft Office

Description

As one of the most common data-representation formats, the table has taken a significant place in documents because of its capability of representing the wealth of information. However, in reality, a vast of tables are published in an uneditable format such as scanned PDF or photo. Therefore, the table reconstruction project aims to detecting the table regions and recognizing the contents from the uneditable files. Then, it would extract the valid information and recognize the table structure from the extracted table. After all, it would reconstruct an editable table by combining the obtained table contents and structure for the user’s re-creation. The difficulties of the project are the table recognition and reconstruction from the unnormal scheme such as deformed tables, contaminated tables and tables that are photographed with different illumination. To solve the problems, we provide a bunch of deep learning methods to improve table reconstruction performance. sample1 sample2


WordPress Cookie Notice by Real Cookie Banner