Scaling SeerSuite in the cloud

Pradeep Teregowda, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

The SeerSuite digital library search engine framework is used to build tools such as CiteSeerx. It includes a complex metadata extraction system capable of extracting elements, such as author name, title, citations and citation contexts that are crucial bibliometric data and for building a citation graph. The workload faced by the exractor is dynamic in nature and this variability makes CiteSeerx attractive for hosting in a cloud computing environment. Given its application binary dependencies and its reliance on a specialized infrastructure, the current extractor has several limitations. These limitations motivated the design and implementation of the metadata extraction system proposed in this study. A message oriented middleware architecture is used with a publish/subscribe pattern to build a scalable, flexible system that can be deployed across a range of cloud infrastructure. To demonstrate the broad applicability of the proposed system, we evaluate it in terms of its reference implementation across different scenarios of deployment and in regard to its scalability.

Original languageEnglish (US)
Title of host publicationProceedings of the IEEE International Conference on Cloud Engineering, IC2E 2013
Pages146-155
Number of pages10
DOIs
StatePublished - 2013
Event1st IEEE International Conference on Cloud Engineering, IC2E 2013 - San Francisco, CA, United States
Duration: Mar 25 2013Mar 28 2013

Publication series

NameProceedings of the IEEE International Conference on Cloud Engineering, IC2E 2013

Other

Other1st IEEE International Conference on Cloud Engineering, IC2E 2013
CountryUnited States
CitySan Francisco, CA
Period3/25/133/28/13

All Science Journal Classification (ASJC) codes

  • Software

Fingerprint Dive into the research topics of 'Scaling SeerSuite in the cloud'. Together they form a unique fingerprint.

Cite this