Automatic extraction of data from 2-D plots in documents

Research output: Chapter in Book/Report/Conference proceedingConference contribution

11 Citations (Scopus)

Abstract

Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.

Original languageEnglish (US)
Title of host publicationProceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007
Pages188-192
Number of pages5
DOIs
StatePublished - Dec 1 2007
Event9th International Conference on Document Analysis and Recognition, ICDAR 2007 - Curitiba, Brazil
Duration: Sep 23 2007Sep 26 2007

Publication series

NameProceedings of the International Conference on Document Analysis and Recognition, ICDAR
Volume1
ISSN (Print)1520-5363

Other

Other9th International Conference on Document Analysis and Recognition, ICDAR 2007
CountryBrazil
CityCuritiba
Period9/23/079/26/07

Fingerprint

Search engines
Industry
Experiments

All Science Journal Classification (ASJC) codes

  • Computer Vision and Pattern Recognition

Cite this

Lu, X., Wang, J. Z., Mitra, P., & Giles, C. L. (2007). Automatic extraction of data from 2-D plots in documents. In Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007 (pp. 188-192). [4378701] (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR; Vol. 1). https://doi.org/10.1109/ICDAR.2007.4378701
Lu, Xiaonan ; Wang, James Z. ; Mitra, Prasenjit ; Giles, C. Lee. / Automatic extraction of data from 2-D plots in documents. Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007. 2007. pp. 188-192 (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR).
@inproceedings{ce3b83172ab648ffafe5e4eecae4be73,
title = "Automatic extraction of data from 2-D plots in documents",
abstract = "Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.",
author = "Xiaonan Lu and Wang, {James Z.} and Prasenjit Mitra and Giles, {C. Lee}",
year = "2007",
month = "12",
day = "1",
doi = "10.1109/ICDAR.2007.4378701",
language = "English (US)",
isbn = "0769528228",
series = "Proceedings of the International Conference on Document Analysis and Recognition, ICDAR",
pages = "188--192",
booktitle = "Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007",

}

Lu, X, Wang, JZ, Mitra, P & Giles, CL 2007, Automatic extraction of data from 2-D plots in documents. in Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007., 4378701, Proceedings of the International Conference on Document Analysis and Recognition, ICDAR, vol. 1, pp. 188-192, 9th International Conference on Document Analysis and Recognition, ICDAR 2007, Curitiba, Brazil, 9/23/07. https://doi.org/10.1109/ICDAR.2007.4378701

Automatic extraction of data from 2-D plots in documents. / Lu, Xiaonan; Wang, James Z.; Mitra, Prasenjit; Giles, C. Lee.

Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007. 2007. p. 188-192 4378701 (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR; Vol. 1).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Automatic extraction of data from 2-D plots in documents

AU - Lu, Xiaonan

AU - Wang, James Z.

AU - Mitra, Prasenjit

AU - Giles, C. Lee

PY - 2007/12/1

Y1 - 2007/12/1

N2 - Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.

AB - Two-dimensional (2-D) plots in digital documents contain important information. Often, the results of scientific experiments and performance of businesses are summarized using plots. Although 2-D plots are easily understood by human users, current search engines rarely utilize the information contained in the plots to enhance the results returned in response to queries posed by endusers. We propose an automated algorithm for extracting information from line curves in 2-D plots. The extracted information can be stored in a database and indexed to answer end-user queries and enhance search results. We have collected 2-D plot images from a variety of resources and tested our extraction algorithms. Experimental evaluation has demonstrated that our method can produce results suitable for real world use.

UR - http://www.scopus.com/inward/record.url?scp=51149119523&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=51149119523&partnerID=8YFLogxK

U2 - 10.1109/ICDAR.2007.4378701

DO - 10.1109/ICDAR.2007.4378701

M3 - Conference contribution

AN - SCOPUS:51149119523

SN - 0769528228

SN - 9780769528229

T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR

SP - 188

EP - 192

BT - Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007

ER -

Lu X, Wang JZ, Mitra P, Giles CL. Automatic extraction of data from 2-D plots in documents. In Proceedings - 9th International Conference on Document Analysis and Recognition, ICDAR 2007. 2007. p. 188-192. 4378701. (Proceedings of the International Conference on Document Analysis and Recognition, ICDAR). https://doi.org/10.1109/ICDAR.2007.4378701