Exploring Social Annotations for Information Retrieval

Ding Zhou, Jiang Bian, Shuyi Zheng, Hongyuan Zha, C. Lee Giles

Research output: Chapter in Book/Report/Conference proceeding > Conference contribution

138 Citations (Scopus)

Abstract

Social annotation has gained increasing popularity in many Web-based applications, leading to an emerging research area in text analysis and information retrieval. This paper is concerned with developing probabilistic models and computational algorithms for social annotations. We propose a unified framework to combine the modeling of social annotations with the language modeling-based methods for information retrieval. The proposed approach consists of two steps: (1) discovering topics in the contents and annotations of documents while categorizing the users by domains; and (2) enhancing document and query language models by incorporating user domain interests as well as topical background models. In particular, we propose a new general generative model for social annotations, which is then simplified to a computationally tractable hierarchical Bayesian network. Then we apply smoothing techniques in a risk minimization framework to incorporate the topical information to language models. Experiments are carried out on a real-world annotation data set sampled from del.icio.us. Our results demonstrate significant improvements over traditional approaches.
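
The second step in the abstract, enriching a document language model with topical information, can be illustrated with a small sketch. This is not the paper's model: the paper infers topics and user domains with a hierarchical Bayesian network and applies smoothing within a risk minimization framework, none of which is reproduced here. The sketch assumes the topic-word distributions and per-document topic mixtures are already given, and uses a plain linear interpolation. The weights `lam` and `mu`, the toy data, and all function names are illustrative assumptions.

```python
import math
from collections import Counter


def mle(tokens):
    """Maximum-likelihood unigram model: relative word frequencies."""
    counts = Counter(tokens)
    total = sum(counts.values())
    return {w: c / total for w, c in counts.items()}


def smoothed_prob(word, doc_tokens, topic_mix, topic_word, collection,
                  lam=0.6, mu=0.3):
    """P(w|d) = lam*P_ml(w|d) + mu*sum_z P(z|d)*P(w|z) + (1-lam-mu)*P(w|C).

    lam and mu are hypothetical interpolation weights, not values from the paper.
    """
    p_doc = mle(doc_tokens).get(word, 0.0)
    p_topic = sum(pz * topic_word[z].get(word, 0.0) for z, pz in topic_mix.items())
    p_coll = collection.get(word, 0.0)
    return lam * p_doc + mu * p_topic + (1.0 - lam - mu) * p_coll


def rank(query_tokens, docs, topic_mixes, topic_word):
    """Rank document ids by query log-likelihood under the smoothed models."""
    collection = mle([w for tokens in docs.values() for w in tokens])
    scores = {}
    for doc_id, tokens in docs.items():
        score = 0.0
        for w in query_tokens:
            p = smoothed_prob(w, tokens, topic_mixes[doc_id], topic_word, collection)
            score += math.log(max(p, 1e-12))  # floor avoids log(0) for unseen words
        scores[doc_id] = score
    return sorted(scores, key=scores.get, reverse=True)


# Toy data (invented for illustration only).
docs = {
    "d1": "python tutorial code".split(),
    "d2": "jazz music album".split(),
}
topic_word = {
    "programming": {"python": 0.4, "code": 0.3, "software": 0.3},
    "music": {"jazz": 0.5, "music": 0.3, "album": 0.2},
}
topic_mixes = {
    "d1": {"programming": 0.9, "music": 0.1},
    "d2": {"programming": 0.1, "music": 0.9},
}

if __name__ == "__main__":
    # "software" occurs in neither document; only the topical term separates them.
    print(rank(["software"], docs, topic_mixes, topic_word))
```

In this toy setting the query word "software" appears in no document, so an unsmoothed model scores both documents identically. The topical term is what lets the programming-oriented document rank first, which is the kind of effect topical smoothing is meant to provide.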

Original language: English (US)
Title of host publication: Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08
Pages: 715-724
Number of pages: 10
DOI: https://doi.org/10.1145/1367497.1367594
State: Published - Dec 15 2008
Event: 17th International Conference on World Wide Web 2008, WWW'08 - Beijing, China
Duration: Apr 21 2008 to Apr 25 2008

Publication series

Name: Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08

Other

Other: 17th International Conference on World Wide Web 2008, WWW'08
Country: China
City: Beijing
Period: 4/21/08 to 4/25/08

Fingerprint

Information retrieval
Query languages
Bayesian networks
Experiments

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications

Cite this

Zhou, D., Bian, J., Zheng, S., Zha, H., & Giles, C. L. (2008). Exploring Social Annotations for Information Retrieval. In Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08 (pp. 715-724). https://doi.org/10.1145/1367497.1367594
@inproceedings{ea9b6d6c615147ed9fa3c184ad681c77,
title = "Exploring Social Annotations for Information Retrieval",
abstract = "Social annotation has gained increasing popularity in many Web-based applications, leading to an emerging research area in text analysis and information retrieval. This paper is concerned with developing probabilistic models and computational algorithms for social annotations. We propose a unified framework to combine the modeling of social annotations with the language modeling-based methods for information retrieval. The proposed approach consists of two steps: (1) discovering topics in the contents and annotations of documents while categorizing the users by domains; and (2) enhancing document and query language models by incorporating user domain interests as well as topical background models. In particular, we propose a new general generative model for social annotations, which is then simplified to a computationally tractable hierarchical Bayesian network. Then we apply smoothing techniques in a risk minimization framework to incorporate the topical information to language models. Experiments are carried out on a real-world annotation data set sampled from del.icio.us. Our results demonstrate significant improvements over traditional approaches.",
author = "Ding Zhou and Jiang Bian and Shuyi Zheng and Hongyuan Zha and C. Lee Giles",
year = "2008",
month = "12",
day = "15",
doi = "10.1145/1367497.1367594",
language = "English (US)",
isbn = "9781605580852",
series = "Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08",
pages = "715--724",
booktitle = "Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08",

}

TY - GEN

T1 - Exploring Social Annotations for Information Retrieval

AU - Zhou, Ding

AU - Bian, Jiang

AU - Zheng, Shuyi

AU - Zha, Hongyuan

AU - Giles, C. Lee

PY - 2008/12/15

Y1 - 2008/12/15

AB - Social annotation has gained increasing popularity in many Web-based applications, leading to an emerging research area in text analysis and information retrieval. This paper is concerned with developing probabilistic models and computational algorithms for social annotations. We propose a unified framework to combine the modeling of social annotations with the language modeling-based methods for information retrieval. The proposed approach consists of two steps: (1) discovering topics in the contents and annotations of documents while categorizing the users by domains; and (2) enhancing document and query language models by incorporating user domain interests as well as topical background models. In particular, we propose a new general generative model for social annotations, which is then simplified to a computationally tractable hierarchical Bayesian network. Then we apply smoothing techniques in a risk minimization framework to incorporate the topical information to language models. Experiments are carried out on a real-world annotation data set sampled from del.icio.us. Our results demonstrate significant improvements over traditional approaches.

UR - http://www.scopus.com/inward/record.url?scp=57349185892&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=57349185892&partnerID=8YFLogxK

U2 - 10.1145/1367497.1367594

DO - 10.1145/1367497.1367594

M3 - Conference contribution

AN - SCOPUS:57349185892

SN - 9781605580852

T3 - Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08

SP - 715

EP - 724

BT - Proceeding of the 17th International Conference on World Wide Web 2008, WWW'08

ER -
