Modeling journal bibliometrics to predict downloads and inform purchase decisions at university research libraries

Daniel M. Coughlin, Bernard J. Jansen

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

University libraries provide access to thousands of online journals and other content, spending millions of dollars annually on these electronic resources. Providing access to these online resources is costly, and it is difficult both to analyze the value of this content to the institution and to discern those journals that comparatively provide more value. In this research, we examine 1,510 journals from a large research university library, representing more than 40% of the university's annual subscription cost for electronic resources at the time of the study. We utilize a web analytics approach for the creation of a linear regression model to predict usage among these journals. We categorize metrics into two classes: global (journal focused) and local (institution dependent). Using 275 journals for our training set, our analysis shows that a combination of global and local metrics creates the strongest model for predicting full-text downloads. Our linear regression model has an accuracy of more than 80% in predicting downloads for the 1,235 journals in our test set. The implications of the findings are that university libraries that use local metrics have better insight into the value of a journal and therefore more efficient cost content management.

Original languageEnglish (US)
Pages (from-to)2263-2273
Number of pages11
JournalJournal of the Association for Information Science and Technology
Volume67
Issue number9
DOIs
StatePublished - Sep 1 2016

Fingerprint

university research
purchase
Linear regression
resources
electronics
Values
regression
university
subscription
costs
dollar
Costs
management
Modeling
University research
Bibliometrics
Purchase decision
Electronic resources
Linear regression model

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Computer Networks and Communications
  • Information Systems and Management
  • Library and Information Sciences

Cite this

@article{9f6e02ae25da4170ac8173f1f0c44e0b,
title = "Modeling journal bibliometrics to predict downloads and inform purchase decisions at university research libraries",
abstract = "University libraries provide access to thousands of online journals and other content, spending millions of dollars annually on these electronic resources. Providing access to these online resources is costly, and it is difficult both to analyze the value of this content to the institution and to discern those journals that comparatively provide more value. In this research, we examine 1,510 journals from a large research university library, representing more than 40{\%} of the university's annual subscription cost for electronic resources at the time of the study. We utilize a web analytics approach for the creation of a linear regression model to predict usage among these journals. We categorize metrics into two classes: global (journal focused) and local (institution dependent). Using 275 journals for our training set, our analysis shows that a combination of global and local metrics creates the strongest model for predicting full-text downloads. Our linear regression model has an accuracy of more than 80{\%} in predicting downloads for the 1,235 journals in our test set. The implications of the findings are that university libraries that use local metrics have better insight into the value of a journal and therefore more efficient cost content management.",
author = "Coughlin, {Daniel M.} and Jansen, {Bernard J.}",
year = "2016",
month = "9",
day = "1",
doi = "10.1002/asi.23549",
language = "English (US)",
volume = "67",
pages = "2263--2273",
journal = "Journal of the Association for Information Science and Technology",
issn = "2330-1635",
publisher = "John Wiley and Sons Ltd",
number = "9",

}

TY - JOUR

T1 - Modeling journal bibliometrics to predict downloads and inform purchase decisions at university research libraries

AU - Coughlin, Daniel M.

AU - Jansen, Bernard J.

PY - 2016/9/1

Y1 - 2016/9/1

N2 - University libraries provide access to thousands of online journals and other content, spending millions of dollars annually on these electronic resources. Providing access to these online resources is costly, and it is difficult both to analyze the value of this content to the institution and to discern those journals that comparatively provide more value. In this research, we examine 1,510 journals from a large research university library, representing more than 40% of the university's annual subscription cost for electronic resources at the time of the study. We utilize a web analytics approach for the creation of a linear regression model to predict usage among these journals. We categorize metrics into two classes: global (journal focused) and local (institution dependent). Using 275 journals for our training set, our analysis shows that a combination of global and local metrics creates the strongest model for predicting full-text downloads. Our linear regression model has an accuracy of more than 80% in predicting downloads for the 1,235 journals in our test set. The implications of the findings are that university libraries that use local metrics have better insight into the value of a journal and therefore more efficient cost content management.

AB - University libraries provide access to thousands of online journals and other content, spending millions of dollars annually on these electronic resources. Providing access to these online resources is costly, and it is difficult both to analyze the value of this content to the institution and to discern those journals that comparatively provide more value. In this research, we examine 1,510 journals from a large research university library, representing more than 40% of the university's annual subscription cost for electronic resources at the time of the study. We utilize a web analytics approach for the creation of a linear regression model to predict usage among these journals. We categorize metrics into two classes: global (journal focused) and local (institution dependent). Using 275 journals for our training set, our analysis shows that a combination of global and local metrics creates the strongest model for predicting full-text downloads. Our linear regression model has an accuracy of more than 80% in predicting downloads for the 1,235 journals in our test set. The implications of the findings are that university libraries that use local metrics have better insight into the value of a journal and therefore more efficient cost content management.

UR - http://www.scopus.com/inward/record.url?scp=84983262034&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84983262034&partnerID=8YFLogxK

U2 - 10.1002/asi.23549

DO - 10.1002/asi.23549

M3 - Article

VL - 67

SP - 2263

EP - 2273

JO - Journal of the Association for Information Science and Technology

JF - Journal of the Association for Information Science and Technology

SN - 2330-1635

IS - 9

ER -