Real time search user behavior

Bernard J. Jansen, Gerry Campbell, Matthew Gregg

Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

14 Citations (Scopus)

Abstract

Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60% of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30% were unique (used only once in the entire dataset). The most frequent query accounted for 0.003% of the query set. Less than 8% of the terms were unique. The most frequently used terms accounted for only 0.03% of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user base. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries over time, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the mainstream.

Original language: English (US)
Title of host publication: CHI 2010 - The 28th Annual CHI Conference on Human Factors in Computing Systems, Conference Proceedings and Extended Abstracts
Pages: 3961-3966
Number of pages: 6
DOIs: https://doi.org/10.1145/1753846.1754086
State: Published - Jun 9 2010
Event: 28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010 - Atlanta, GA, United States
Duration: Apr 10 2010 → Apr 15 2010

Publication series

Name: Conference on Human Factors in Computing Systems - Proceedings

Other

Other: 28th Annual CHI Conference on Human Factors in Computing Systems, CHI 2010
Country: United States
City: Atlanta, GA
Period: 4/10/10 → 4/15/10

Fingerprint

Search engines
Application programs
World Wide Web
Interfaces (computer)
Engines

All Science Journal Classification (ASJC) codes

  • Software
  • Human-Computer Interaction
  • Computer Graphics and Computer-Aided Design

Cite this

Jansen, B. J., Campbell, G., & Gregg, M. (2010). Real time search user behavior. In CHI 2010 - The 28th Annual CHI Conference on Human Factors in Computing Systems, Conference Proceedings and Extended Abstracts (pp. 3961-3966). (Conference on Human Factors in Computing Systems - Proceedings). https://doi.org/10.1145/1753846.1754086
@inproceedings{50b45ff7c6d24c39bd235be525f48af3,
title = "Real time search user behavior",
abstract = "Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60{\%} of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30{\%} were unique (used only once in the entire dataset). The most frequent query accounted for 0.003{\%} of the query set. Less than 8{\%} of the terms were unique. The most frequently used terms accounted for only 0.03{\%} of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user base. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries over time, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the mainstream.",
author = "Jansen, {Bernard J.} and Gerry Campbell and Matthew Gregg",
year = "2010",
month = "6",
day = "9",
doi = "10.1145/1753846.1754086",
language = "English (US)",
isbn = "9781605589312",
series = "Conference on Human Factors in Computing Systems - Proceedings",
pages = "3961--3966",
booktitle = "CHI 2010 - The 28th Annual CHI Conference on Human Factors in Computing Systems, Conference Proceedings and Extended Abstracts",

}

TY - GEN

T1 - Real time search user behavior

AU - Jansen, Bernard J.

AU - Campbell, Gerry

AU - Gregg, Matthew

PY - 2010/6/9

Y1 - 2010/6/9

AB - Real time search is an increasingly important area of information seeking on the Web. In this research, we analyze 1,005,296 user interactions with a real time search engine over a 190 day period. We investigate aggregate usage of the search engine, such as number of users, queries, and terms. We also investigate the structure of queries and terms submitted by these users. The results are compared to Web searching on traditional search engines. Results show that 60% of the traffic comes from the engine's application program interface, indicating that real time search is heavily leveraged by other applications. Of the queries, 30% were unique (used only once in the entire dataset). The most frequent query accounted for 0.003% of the query set. Less than 8% of the terms were unique. The most frequently used terms accounted for only 0.03% of the total terms. Concerning search topics, the most used terms dealt with technology, entertainment, and politics, reflecting both the temporal nature of the queries and, perhaps, an early adopter user base. Sexual queries were quite low, relative to traditional Web search. Searchers of real time content often repeat queries over time, perhaps indicating long term interest in a topic. We discuss the implications for search engines and information providers as real time content increasingly enters the mainstream.

UR - http://www.scopus.com/inward/record.url?scp=77953100508&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=77953100508&partnerID=8YFLogxK

U2 - 10.1145/1753846.1754086

DO - 10.1145/1753846.1754086

M3 - Conference contribution

AN - SCOPUS:77953100508

SN - 9781605589312

T3 - Conference on Human Factors in Computing Systems - Proceedings

SP - 3961

EP - 3966

BT - CHI 2010 - The 28th Annual CHI Conference on Human Factors in Computing Systems, Conference Proceedings and Extended Abstracts

ER -
