Automatically Generating a Concept Hierarchy with Graphs

Pucktada Treeratpituk, Madian Khabsa, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To automatically generate topic-dependent taxonomies from a large text corpus, we first extracts topical terms and their relationships from the corpus. The algorithm then constructs a weighted graph representing topics and their associations. A graph partitioning algorithm is then used to recursively partition the topic graph into a taxonomy. For evaluation, we apply our approach to articles, primarily computer science, in the CiteSeerX digital library and search engine.

Original languageEnglish (US)
Title of host publicationJCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages265-266
Number of pages2
ISBN (Electronic)9781450335942
DOIs
StatePublished - Jun 21 2015
Event15th ACM/IEEE-CE Joint Conference on Digital Libraries, JCDL 2015 - Knoxville, United States
Duration: Jun 21 2015Jun 25 2015

Publication series

NameProceedings of the ACM/IEEE Joint Conference on Digital Libraries
Volume2015-June
ISSN (Print)1552-5996

Other

Other15th ACM/IEEE-CE Joint Conference on Digital Libraries, JCDL 2015
CountryUnited States
CityKnoxville
Period6/21/156/25/15

Fingerprint

Taxonomies
Digital libraries
Search engines
Computer science

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Treeratpituk, P., Khabsa, M., & Giles, C. L. (2015). Automatically Generating a Concept Hierarchy with Graphs. In JCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries (pp. 265-266). (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries; Vol. 2015-June). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1145/2756406.2756967
Treeratpituk, Pucktada ; Khabsa, Madian ; Giles, C. Lee. / Automatically Generating a Concept Hierarchy with Graphs. JCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries. Institute of Electrical and Electronics Engineers Inc., 2015. pp. 265-266 (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries).
@inproceedings{878486596bf1448a8d1683f11db35df2,
title = "Automatically Generating a Concept Hierarchy with Graphs",
abstract = "We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To automatically generate topic-dependent taxonomies from a large text corpus, we first extracts topical terms and their relationships from the corpus. The algorithm then constructs a weighted graph representing topics and their associations. A graph partitioning algorithm is then used to recursively partition the topic graph into a taxonomy. For evaluation, we apply our approach to articles, primarily computer science, in the CiteSeerX digital library and search engine.",
author = "Pucktada Treeratpituk and Madian Khabsa and Giles, {C. Lee}",
year = "2015",
month = "6",
day = "21",
doi = "10.1145/2756406.2756967",
language = "English (US)",
series = "Proceedings of the ACM/IEEE Joint Conference on Digital Libraries",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "265--266",
booktitle = "JCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries",
address = "United States",

}

Treeratpituk, P, Khabsa, M & Giles, CL 2015, Automatically Generating a Concept Hierarchy with Graphs. in JCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries. Proceedings of the ACM/IEEE Joint Conference on Digital Libraries, vol. 2015-June, Institute of Electrical and Electronics Engineers Inc., pp. 265-266, 15th ACM/IEEE-CE Joint Conference on Digital Libraries, JCDL 2015, Knoxville, United States, 6/21/15. https://doi.org/10.1145/2756406.2756967

Automatically Generating a Concept Hierarchy with Graphs. / Treeratpituk, Pucktada; Khabsa, Madian; Giles, C. Lee.

JCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries. Institute of Electrical and Electronics Engineers Inc., 2015. p. 265-266 (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries; Vol. 2015-June).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Automatically Generating a Concept Hierarchy with Graphs

AU - Treeratpituk, Pucktada

AU - Khabsa, Madian

AU - Giles, C. Lee

PY - 2015/6/21

Y1 - 2015/6/21

N2 - We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To automatically generate topic-dependent taxonomies from a large text corpus, we first extracts topical terms and their relationships from the corpus. The algorithm then constructs a weighted graph representing topics and their associations. A graph partitioning algorithm is then used to recursively partition the topic graph into a taxonomy. For evaluation, we apply our approach to articles, primarily computer science, in the CiteSeerX digital library and search engine.

AB - We propose a novel graph-based approach for constructing concept hierarchy from a large text corpus. Our algorithm incorporates both statistical co-occurrences and lexical similarity in optimizing the structure of the taxonomy. To automatically generate topic-dependent taxonomies from a large text corpus, we first extracts topical terms and their relationships from the corpus. The algorithm then constructs a weighted graph representing topics and their associations. A graph partitioning algorithm is then used to recursively partition the topic graph into a taxonomy. For evaluation, we apply our approach to articles, primarily computer science, in the CiteSeerX digital library and search engine.

UR - http://www.scopus.com/inward/record.url?scp=84951972499&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84951972499&partnerID=8YFLogxK

U2 - 10.1145/2756406.2756967

DO - 10.1145/2756406.2756967

M3 - Conference contribution

AN - SCOPUS:84951972499

T3 - Proceedings of the ACM/IEEE Joint Conference on Digital Libraries

SP - 265

EP - 266

BT - JCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries

PB - Institute of Electrical and Electronics Engineers Inc.

ER -

Treeratpituk P, Khabsa M, Giles CL. Automatically Generating a Concept Hierarchy with Graphs. In JCDL 2015 - Proceedings of the 15th ACM/IEEE-CE Joint Conference on Digital Libraries. Institute of Electrical and Electronics Engineers Inc. 2015. p. 265-266. (Proceedings of the ACM/IEEE Joint Conference on Digital Libraries). https://doi.org/10.1145/2756406.2756967