Big scholarly data in citeseerx: Information extraction from the web

Alexander G. Ororbia, Jian Wu, Madian Khabsa, Kyle Williams, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Scopus citations

Abstract

We examine CiteSeerX, an intelligent system designed with the goal of automatically acquiring and organizing large- scale collections of scholarly documents from the world wide web. From the perspective of automatic information extrac- tion and modes of alternative search, we examine various functional aspects of this complex system in order to in- vestigate and explore ongoing and future research develop- ments1.

Original languageEnglish (US)
Title of host publicationWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web
PublisherAssociation for Computing Machinery, Inc
Pages597-602
Number of pages6
ISBN (Electronic)9781450334730
DOIs
StatePublished - May 18 2015
Event24th International Conference on World Wide Web, WWW 2015 - Florence, Italy
Duration: May 18 2015May 22 2015

Publication series

NameWWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web

Other

Other24th International Conference on World Wide Web, WWW 2015
CountryItaly
CityFlorence
Period5/18/155/22/15

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Software

Fingerprint Dive into the research topics of 'Big scholarly data in citeseerx: Information extraction from the web'. Together they form a unique fingerprint.

  • Cite this

    Ororbia, A. G., Wu, J., Khabsa, M., Williams, K., & Giles, C. L. (2015). Big scholarly data in citeseerx: Information extraction from the web. In WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web (pp. 597-602). (WWW 2015 Companion - Proceedings of the 24th International Conference on World Wide Web). Association for Computing Machinery, Inc. https://doi.org/10.1145/2740908.2741736