Automatic extraction of table metadata from digital documents

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

25 Scopus citations

Abstract

Tables are used to present, list, summarize, and structure important data in documents. In scholarly articles, they are often used to present the relationships among data and highlight a collection of results obtained from experiments and scientific analysis. In digital libraries, extracting this data automatically and understanding the structure and content of tables are very important to many applications. Automatic identification, extraction, and search of table contents can be made more precise with the help of metadata. In this paper, we propose a set of medium-independent table metadata to facilitate table indexing, searching, and exchange. To extract the contents of tables and their metadata, an automatic table metadata extraction algorithm is designed and tested on PDF documents.

Original language: English (US)
Title of host publication: 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006
Subtitle of host publication: Opening Information Horizons, JCDL '06
Pages: 339-340
Number of pages: 2
DOIs: https://doi.org/10.1145/1141753.1141835
State: Published - Dec 1 2006
Event: 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06 - Chapel Hill, NC, United States
Duration: Jun 11 2006 to Jun 15 2006

Publication series

Name: Proceedings of the ACM/IEEE Joint Conference on Digital Libraries
Volume: 2006
ISSN (Print): 1552-5996

Other

Other: 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06
Country: United States
City: Chapel Hill, NC
Period: 6/11/06 to 6/15/06

All Science Journal Classification (ASJC) codes

  • Engineering (all)

Cite this

Liu, Y., Mitra, P., Giles, C. L., & Bai, K. (2006). Automatic extraction of table metadata from digital documents. In 6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06 (pp. 339-340). (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries; Vol. 2006). https://doi.org/10.1145/1141753.1141835