Search engine driven author disambiguation

Yee Fan Tan, Min Yen Kan, Dongwon Lee

Research output: Chapter in Book/Report/Conference proceedingConference contribution

68 Scopus citations

Abstract

In scholarly digital libraries, author disambiguation is an important task that attributes a scholarly work with specific authors. This is critical when individuals share the same name. We present an approach to this task that analyzes the results of automatically-crafted web searches. A key observation is that pages from rare web sites are stronger source of evidence than pages from common web sites, which we model as Inverse Host Frequency (IHF). Our system is able to achieve an average accuracy of 0.836.

Original languageEnglish (US)
Title of host publication6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006
Subtitle of host publicationOpening Information Horizons, JCDL '06
Pages314-315
Number of pages2
DOIs
StatePublished - Dec 1 2006
Event6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06 - Chapel Hill, NC, United States
Duration: Jun 11 2006Jun 15 2006

Publication series

NameProceedings of the ACM/IEEE Joint Conference on Digital Libraries
Volume2006
ISSN (Print)1552-5996

Other

Other6th ACM/IEEE-CS Joint Conference on Digital Libraries 2006: Opening Information Horizons, JCDL '06
CountryUnited States
CityChapel Hill, NC
Period6/11/066/15/06

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Fingerprint Dive into the research topics of 'Search engine driven author disambiguation'. Together they form a unique fingerprint.

Cite this