Indexing and retrieval of scientific literature

Steve Lawrence, Kurt Bollacker, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

64 Citations (Scopus)

Abstract

The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher home-pages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.

Original languageEnglish (US)
Title of host publicationInternational Conference on Information and Knowledge Management, Proceedings
PublisherACM
Pages139-146
Number of pages8
ISBN (Print)1581131461
StatePublished - 1999
EventProceedings of the 1999 8th International Conference on Information Knowledge Management (CIKM'99) - Kansas City, MO, USA
Duration: Nov 2 1999Nov 6 1999

Other

OtherProceedings of the 1999 8th International Conference on Information Knowledge Management (CIKM'99)
CityKansas City, MO, USA
Period11/2/9911/6/99

Fingerprint

Indexing
World Wide Web
Citations
Web search
Overlapping
Error correction
Authority
Costs
Graph
Hub
Search engine
Query
Software
Profiling
Digital libraries
Information extraction

All Science Journal Classification (ASJC) codes

  • Business, Management and Accounting(all)

Cite this

Lawrence, S., Bollacker, K., & Giles, C. L. (1999). Indexing and retrieval of scientific literature. In International Conference on Information and Knowledge Management, Proceedings (pp. 139-146). ACM.
Lawrence, Steve ; Bollacker, Kurt ; Giles, C. Lee. / Indexing and retrieval of scientific literature. International Conference on Information and Knowledge Management, Proceedings. ACM, 1999. pp. 139-146
@inproceedings{8a23633947894630b5d7b101dc0005f3,
title = "Indexing and retrieval of scientific literature",
abstract = "The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher home-pages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.",
author = "Steve Lawrence and Kurt Bollacker and Giles, {C. Lee}",
year = "1999",
language = "English (US)",
isbn = "1581131461",
pages = "139--146",
booktitle = "International Conference on Information and Knowledge Management, Proceedings",
publisher = "ACM",

}

Lawrence, S, Bollacker, K & Giles, CL 1999, Indexing and retrieval of scientific literature. in International Conference on Information and Knowledge Management, Proceedings. ACM, pp. 139-146, Proceedings of the 1999 8th International Conference on Information Knowledge Management (CIKM'99), Kansas City, MO, USA, 11/2/99.

Indexing and retrieval of scientific literature. / Lawrence, Steve; Bollacker, Kurt; Giles, C. Lee.

International Conference on Information and Knowledge Management, Proceedings. ACM, 1999. p. 139-146.

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Indexing and retrieval of scientific literature

AU - Lawrence, Steve

AU - Bollacker, Kurt

AU - Giles, C. Lee

PY - 1999

Y1 - 1999

N2 - The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher home-pages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.

AB - The web has greatly improved access to scientific literature. However, scientific articles on the web are largely disorganized, with research articles being spread across archive sites, institution sites, journal sites, and researcher home-pages. No index covers all of the available literature, and the major web search engines typically do not index the content of Postscript/PDF documents at all. This paper discusses the creation of digital libraries of scientific literature on the web, including the efficient location of articles, full-text indexing of the articles, autonomous citation indexing, information extraction, display of query-sensitive summaries and citation context, hubs and authorities computation, similar document detection, user profiling, distributed error correction, graph analysis, and detection of overlapping documents. The software for the system is available at no cost for non-commercial use.

UR - http://www.scopus.com/inward/record.url?scp=0033279067&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0033279067&partnerID=8YFLogxK

M3 - Conference contribution

SN - 1581131461

SP - 139

EP - 146

BT - International Conference on Information and Knowledge Management, Proceedings

PB - ACM

ER -

Lawrence S, Bollacker K, Giles CL. Indexing and retrieval of scientific literature. In International Conference on Information and Knowledge Management, Proceedings. ACM. 1999. p. 139-146