Probabilistic models for discovering e-communities

Ding Zhou, Eren Manavoglu, Jia Li, C. Lee Giles, Hongyuan Zha

Research output: Chapter in Book/Report/Conference proceedingConference contribution

163 Citations (Scopus)

Abstract

The increasing amount of communication between individuals in e-formats (e.g. email, Instant messaging and the Web) has motivated computational research in social network analysis (SNA). Previous work in SNA has emphasized the social network (SN) topology measured by communication frequencies while ignoring the semantic information in SNs. In this paper, we propose two generative Bayesian models for semantic community discovery in SNs, combining probabilistic modeling with community detection in SNs. To simulate the generative models, an EnF-Gibbs sampling algorithm is proposed to address the efficiency and performance problems of traditional methods. Experimental studies on Enron email corpus show that our approach successfully detects the communities of individuals and in addition provides semantic topic descriptions of these communities.

Original languageEnglish (US)
Title of host publicationProceedings of the 15th International Conference on World Wide Web
Pages173-182
Number of pages10
DOIs
StatePublished - Dec 1 2006
Event15th International Conference on World Wide Web - Edinburgh, Scotland, United Kingdom
Duration: May 23 2006May 26 2006

Publication series

NameProceedings of the 15th International Conference on World Wide Web

Other

Other15th International Conference on World Wide Web
CountryUnited Kingdom
CityEdinburgh, Scotland
Period5/23/065/26/06

Fingerprint

Semantics
Electronic mail
Electric network analysis
Communication
Topology
Sampling
Statistical Models

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications
  • Software

Cite this

Zhou, D., Manavoglu, E., Li, J., Giles, C. L., & Zha, H. (2006). Probabilistic models for discovering e-communities. In Proceedings of the 15th International Conference on World Wide Web (pp. 173-182). (Proceedings of the 15th International Conference on World Wide Web). https://doi.org/10.1145/1135777.1135807
Zhou, Ding ; Manavoglu, Eren ; Li, Jia ; Giles, C. Lee ; Zha, Hongyuan. / Probabilistic models for discovering e-communities. Proceedings of the 15th International Conference on World Wide Web. 2006. pp. 173-182 (Proceedings of the 15th International Conference on World Wide Web).
@inproceedings{f09b28188758425baa90f6b78643acc8,
title = "Probabilistic models for discovering e-communities",
abstract = "The increasing amount of communication between individuals in e-formats (e.g. email, Instant messaging and the Web) has motivated computational research in social network analysis (SNA). Previous work in SNA has emphasized the social network (SN) topology measured by communication frequencies while ignoring the semantic information in SNs. In this paper, we propose two generative Bayesian models for semantic community discovery in SNs, combining probabilistic modeling with community detection in SNs. To simulate the generative models, an EnF-Gibbs sampling algorithm is proposed to address the efficiency and performance problems of traditional methods. Experimental studies on Enron email corpus show that our approach successfully detects the communities of individuals and in addition provides semantic topic descriptions of these communities.",
author = "Ding Zhou and Eren Manavoglu and Jia Li and Giles, {C. Lee} and Hongyuan Zha",
year = "2006",
month = "12",
day = "1",
doi = "10.1145/1135777.1135807",
language = "English (US)",
isbn = "1595933239",
series = "Proceedings of the 15th International Conference on World Wide Web",
pages = "173--182",
booktitle = "Proceedings of the 15th International Conference on World Wide Web",

}

Zhou, D, Manavoglu, E, Li, J, Giles, CL & Zha, H 2006, Probabilistic models for discovering e-communities. in Proceedings of the 15th International Conference on World Wide Web. Proceedings of the 15th International Conference on World Wide Web, pp. 173-182, 15th International Conference on World Wide Web, Edinburgh, Scotland, United Kingdom, 5/23/06. https://doi.org/10.1145/1135777.1135807

Probabilistic models for discovering e-communities. / Zhou, Ding; Manavoglu, Eren; Li, Jia; Giles, C. Lee; Zha, Hongyuan.

Proceedings of the 15th International Conference on World Wide Web. 2006. p. 173-182 (Proceedings of the 15th International Conference on World Wide Web).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Probabilistic models for discovering e-communities

AU - Zhou, Ding

AU - Manavoglu, Eren

AU - Li, Jia

AU - Giles, C. Lee

AU - Zha, Hongyuan

PY - 2006/12/1

Y1 - 2006/12/1

N2 - The increasing amount of communication between individuals in e-formats (e.g. email, Instant messaging and the Web) has motivated computational research in social network analysis (SNA). Previous work in SNA has emphasized the social network (SN) topology measured by communication frequencies while ignoring the semantic information in SNs. In this paper, we propose two generative Bayesian models for semantic community discovery in SNs, combining probabilistic modeling with community detection in SNs. To simulate the generative models, an EnF-Gibbs sampling algorithm is proposed to address the efficiency and performance problems of traditional methods. Experimental studies on Enron email corpus show that our approach successfully detects the communities of individuals and in addition provides semantic topic descriptions of these communities.

AB - The increasing amount of communication between individuals in e-formats (e.g. email, Instant messaging and the Web) has motivated computational research in social network analysis (SNA). Previous work in SNA has emphasized the social network (SN) topology measured by communication frequencies while ignoring the semantic information in SNs. In this paper, we propose two generative Bayesian models for semantic community discovery in SNs, combining probabilistic modeling with community detection in SNs. To simulate the generative models, an EnF-Gibbs sampling algorithm is proposed to address the efficiency and performance problems of traditional methods. Experimental studies on Enron email corpus show that our approach successfully detects the communities of individuals and in addition provides semantic topic descriptions of these communities.

UR - http://www.scopus.com/inward/record.url?scp=34250694541&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=34250694541&partnerID=8YFLogxK

U2 - 10.1145/1135777.1135807

DO - 10.1145/1135777.1135807

M3 - Conference contribution

AN - SCOPUS:34250694541

SN - 1595933239

SN - 9781595933232

T3 - Proceedings of the 15th International Conference on World Wide Web

SP - 173

EP - 182

BT - Proceedings of the 15th International Conference on World Wide Web

ER -

Zhou D, Manavoglu E, Li J, Giles CL, Zha H. Probabilistic models for discovering e-communities. In Proceedings of the 15th International Conference on World Wide Web. 2006. p. 173-182. (Proceedings of the 15th International Conference on World Wide Web). https://doi.org/10.1145/1135777.1135807