Next generation CiteSeer

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

CiteSeer, a computer and information science search engine and digital library, has been a radical departure for scientific document access and analysis. With nearly 700,000 documents, it has sometimes two million page views a day making it one of the most popular document access engines in science. CiteSeer is also portable, having been extended to ebusiness (eBizSearch) and more recently to academic business documents (SMEALSearch). CiteSeer is based on two features: actively acquiring new documents and automatic tagging and linking of metadata information inherent in an academic document's syntactic structure. Why is CiteSeer so popular? We discuss this and methods for providing new tagged metadata such as institutions and acknowledgements, new data resources and services and the issues in automation. We then discuss the next generation of CiteSeer.

Original languageEnglish (US)
Title of host publicationWIDM 2004: Proceedings of the Sixth ACM International Workshop on Web Information and Data Management
EditorsA.H.F. Laender, D. Lee, M. Ronthaler
Pages47
Number of pages1
StatePublished - 2004
EventWIDM 2004: Proceedings of the Sixth ACM International Workshop on Web Information and Data Management - Washington, DC, United States
Duration: Nov 12 2004Nov 13 2004

Other

OtherWIDM 2004: Proceedings of the Sixth ACM International Workshop on Web Information and Data Management
CountryUnited States
CityWashington, DC
Period11/12/0411/13/04

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Information Systems

Fingerprint Dive into the research topics of 'Next generation CiteSeer'. Together they form a unique fingerprint.

  • Cite this

    Giles, C. L. (2004). Next generation CiteSeer. In A. H. F. Laender, D. Lee, & M. Ronthaler (Eds.), WIDM 2004: Proceedings of the Sixth ACM International Workshop on Web Information and Data Management (pp. 47)