Modeling the author bias between two on-line computer science citation databases

Vaclav Petricek, Ingemar J. Cox, Hui Han, Isaac G. Councill, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

We examine the difference and similarities between two on-line computer science citation databases DBLP and CiteSeer. The database entries in DBLP are inserted manually while the CiteSeer entries are obtained autonomously. We show that the CiteSeer database contains considerably fewer single author papers. This bias can be modeled by an exponential process with intuitive explanation. The model permits us to predict that the DBLP database covers approximately 30% of the entire literature of Computer Science.

Original languageEnglish (US)
Title of host publication14th International World Wide Web Conference, WWW2005
Pages1062-1063
Number of pages2
DOIs
StatePublished - Dec 1 2005
Event14th International World Wide Web Conference, WWW2005 - Chiba, Japan
Duration: May 10 2005May 14 2005

Publication series

Name14th International World Wide Web Conference, WWW2005

Other

Other14th International World Wide Web Conference, WWW2005
CountryJapan
CityChiba
Period5/10/055/14/05

Fingerprint

Computer science

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Software

Cite this

Petricek, V., Cox, I. J., Han, H., Councill, I. G., & Giles, C. L. (2005). Modeling the author bias between two on-line computer science citation databases. In 14th International World Wide Web Conference, WWW2005 (pp. 1062-1063). (14th International World Wide Web Conference, WWW2005). https://doi.org/10.1145/1062745.1062869
Petricek, Vaclav ; Cox, Ingemar J. ; Han, Hui ; Councill, Isaac G. ; Giles, C. Lee. / Modeling the author bias between two on-line computer science citation databases. 14th International World Wide Web Conference, WWW2005. 2005. pp. 1062-1063 (14th International World Wide Web Conference, WWW2005).
@inproceedings{cfcf357ddefd44acad181e809a8ffd72,
title = "Modeling the author bias between two on-line computer science citation databases",
abstract = "We examine the difference and similarities between two on-line computer science citation databases DBLP and CiteSeer. The database entries in DBLP are inserted manually while the CiteSeer entries are obtained autonomously. We show that the CiteSeer database contains considerably fewer single author papers. This bias can be modeled by an exponential process with intuitive explanation. The model permits us to predict that the DBLP database covers approximately 30{\%} of the entire literature of Computer Science.",
author = "Vaclav Petricek and Cox, {Ingemar J.} and Hui Han and Councill, {Isaac G.} and Giles, {C. Lee}",
year = "2005",
month = "12",
day = "1",
doi = "10.1145/1062745.1062869",
language = "English (US)",
isbn = "1595930515",
series = "14th International World Wide Web Conference, WWW2005",
pages = "1062--1063",
booktitle = "14th International World Wide Web Conference, WWW2005",

}

Petricek, V, Cox, IJ, Han, H, Councill, IG & Giles, CL 2005, Modeling the author bias between two on-line computer science citation databases. in 14th International World Wide Web Conference, WWW2005. 14th International World Wide Web Conference, WWW2005, pp. 1062-1063, 14th International World Wide Web Conference, WWW2005, Chiba, Japan, 5/10/05. https://doi.org/10.1145/1062745.1062869

Modeling the author bias between two on-line computer science citation databases. / Petricek, Vaclav; Cox, Ingemar J.; Han, Hui; Councill, Isaac G.; Giles, C. Lee.

14th International World Wide Web Conference, WWW2005. 2005. p. 1062-1063 (14th International World Wide Web Conference, WWW2005).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Modeling the author bias between two on-line computer science citation databases

AU - Petricek, Vaclav

AU - Cox, Ingemar J.

AU - Han, Hui

AU - Councill, Isaac G.

AU - Giles, C. Lee

PY - 2005/12/1

Y1 - 2005/12/1

N2 - We examine the difference and similarities between two on-line computer science citation databases DBLP and CiteSeer. The database entries in DBLP are inserted manually while the CiteSeer entries are obtained autonomously. We show that the CiteSeer database contains considerably fewer single author papers. This bias can be modeled by an exponential process with intuitive explanation. The model permits us to predict that the DBLP database covers approximately 30% of the entire literature of Computer Science.

AB - We examine the difference and similarities between two on-line computer science citation databases DBLP and CiteSeer. The database entries in DBLP are inserted manually while the CiteSeer entries are obtained autonomously. We show that the CiteSeer database contains considerably fewer single author papers. This bias can be modeled by an exponential process with intuitive explanation. The model permits us to predict that the DBLP database covers approximately 30% of the entire literature of Computer Science.

UR - http://www.scopus.com/inward/record.url?scp=33845414545&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33845414545&partnerID=8YFLogxK

U2 - 10.1145/1062745.1062869

DO - 10.1145/1062745.1062869

M3 - Conference contribution

AN - SCOPUS:33845414545

SN - 1595930515

SN - 9781595930514

T3 - 14th International World Wide Web Conference, WWW2005

SP - 1062

EP - 1063

BT - 14th International World Wide Web Conference, WWW2005

ER -

Petricek V, Cox IJ, Han H, Councill IG, Giles CL. Modeling the author bias between two on-line computer science citation databases. In 14th International World Wide Web Conference, WWW2005. 2005. p. 1062-1063. (14th International World Wide Web Conference, WWW2005). https://doi.org/10.1145/1062745.1062869