Abstract

Automatic recommendation of citations for a manuscript is highly valuable for scholarly activities since it can substantially improve the efficiency and quality of literature search. The prior techniques placed a considerable burden on users, who were required to provide a representative bibliography or to mark passages where citations are needed. In this paper we present a system that considerably reduces this burden: a user simply inputs a query manuscript (without a bibliography) and our system automatically finds locations where citations are needed. We show that naïve approaches do not work well due to massive noise in the document corpus. We produce a successful approach by carefully examining the relevance between segments in a query manuscript and the representative segments extracted from a document corpus. An extensive empirical evaluation using the CiteSeerX data set shows that our approach is effective.

Original languageEnglish (US)
Title of host publicationProceedings of the 4th ACM International Conference on Web Search and Data Mining, WSDM 2011
Pages755-764
Number of pages10
DOIs
StatePublished - 2011
Event4th ACM International Conference on Web Search and Data Mining, WSDM 2011 - Hong Kong, China
Duration: Feb 9 2011Feb 12 2011

Publication series

NameProceedings of the 4th ACM International Conference on Web Search and Data Mining, WSDM 2011

Other

Other4th ACM International Conference on Web Search and Data Mining, WSDM 2011
Country/TerritoryChina
CityHong Kong
Period2/9/112/12/11

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Computer Science Applications
  • Software

Fingerprint

Dive into the research topics of 'Citation recommendation without author supervision'. Together they form a unique fingerprint.

Cite this