Sampling large-scale social networks: Insights from simulated networks

Peter Ebbes, Zan Huang, Arvind Rangaswamy, Hari P. Thadakamalla

Research output: Contribution to conferencePaper

8 Citations (Scopus)

Abstract

We conduct a detailed simulation study to assess how well various sampling techniques recover network characteristics such as degree, clustering coefficient, and path length distributions of several simulated population networks that have the high clustering tendency characteristic of social networks but vary in terms of degree distribution and density. We consider several alternative sampling procedures tailored to the context of social network sampling, including random-node and random-edge sampling, egocentric sampling, and several variations of graph-exploration-based sampling methods (random walk, forest fire, and snowball methods). Our main findings are that for networks with Poisson degree distribution the snowball method is overall the best while for networks of power-law degree distribution random walk is the best when the network is sparse and the forest fire method is the best when the network is dense.

Original languageEnglish (US)
Pages49-54
Number of pages6
StatePublished - Jan 1 2008
Event2008 Workshop on Information Technologies and Systems, WITS 2008 - Paris, France
Duration: Dec 13 2008Dec 14 2008

Other

Other2008 Workshop on Information Technologies and Systems, WITS 2008
CountryFrance
CityParis
Period12/13/0812/14/08

Fingerprint

Sampling
Fires

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Control and Systems Engineering

Cite this

Ebbes, P., Huang, Z., Rangaswamy, A., & Thadakamalla, H. P. (2008). Sampling large-scale social networks: Insights from simulated networks. 49-54. Paper presented at 2008 Workshop on Information Technologies and Systems, WITS 2008, Paris, France.
Ebbes, Peter ; Huang, Zan ; Rangaswamy, Arvind ; Thadakamalla, Hari P. / Sampling large-scale social networks : Insights from simulated networks. Paper presented at 2008 Workshop on Information Technologies and Systems, WITS 2008, Paris, France.6 p.
@conference{3c3e2306265c45acbd2e8f850d43ac9e,
title = "Sampling large-scale social networks: Insights from simulated networks",
abstract = "We conduct a detailed simulation study to assess how well various sampling techniques recover network characteristics such as degree, clustering coefficient, and path length distributions of several simulated population networks that have the high clustering tendency characteristic of social networks but vary in terms of degree distribution and density. We consider several alternative sampling procedures tailored to the context of social network sampling, including random-node and random-edge sampling, egocentric sampling, and several variations of graph-exploration-based sampling methods (random walk, forest fire, and snowball methods). Our main findings are that for networks with Poisson degree distribution the snowball method is overall the best while for networks of power-law degree distribution random walk is the best when the network is sparse and the forest fire method is the best when the network is dense.",
author = "Peter Ebbes and Zan Huang and Arvind Rangaswamy and Thadakamalla, {Hari P.}",
year = "2008",
month = "1",
day = "1",
language = "English (US)",
pages = "49--54",
note = "2008 Workshop on Information Technologies and Systems, WITS 2008 ; Conference date: 13-12-2008 Through 14-12-2008",

}

Ebbes, P, Huang, Z, Rangaswamy, A & Thadakamalla, HP 2008, 'Sampling large-scale social networks: Insights from simulated networks' Paper presented at 2008 Workshop on Information Technologies and Systems, WITS 2008, Paris, France, 12/13/08 - 12/14/08, pp. 49-54.

Sampling large-scale social networks : Insights from simulated networks. / Ebbes, Peter; Huang, Zan; Rangaswamy, Arvind; Thadakamalla, Hari P.

2008. 49-54 Paper presented at 2008 Workshop on Information Technologies and Systems, WITS 2008, Paris, France.

Research output: Contribution to conferencePaper

TY - CONF

T1 - Sampling large-scale social networks

T2 - Insights from simulated networks

AU - Ebbes, Peter

AU - Huang, Zan

AU - Rangaswamy, Arvind

AU - Thadakamalla, Hari P.

PY - 2008/1/1

Y1 - 2008/1/1

N2 - We conduct a detailed simulation study to assess how well various sampling techniques recover network characteristics such as degree, clustering coefficient, and path length distributions of several simulated population networks that have the high clustering tendency characteristic of social networks but vary in terms of degree distribution and density. We consider several alternative sampling procedures tailored to the context of social network sampling, including random-node and random-edge sampling, egocentric sampling, and several variations of graph-exploration-based sampling methods (random walk, forest fire, and snowball methods). Our main findings are that for networks with Poisson degree distribution the snowball method is overall the best while for networks of power-law degree distribution random walk is the best when the network is sparse and the forest fire method is the best when the network is dense.

AB - We conduct a detailed simulation study to assess how well various sampling techniques recover network characteristics such as degree, clustering coefficient, and path length distributions of several simulated population networks that have the high clustering tendency characteristic of social networks but vary in terms of degree distribution and density. We consider several alternative sampling procedures tailored to the context of social network sampling, including random-node and random-edge sampling, egocentric sampling, and several variations of graph-exploration-based sampling methods (random walk, forest fire, and snowball methods). Our main findings are that for networks with Poisson degree distribution the snowball method is overall the best while for networks of power-law degree distribution random walk is the best when the network is sparse and the forest fire method is the best when the network is dense.

UR - http://www.scopus.com/inward/record.url?scp=84902165692&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84902165692&partnerID=8YFLogxK

M3 - Paper

AN - SCOPUS:84902165692

SP - 49

EP - 54

ER -

Ebbes P, Huang Z, Rangaswamy A, Thadakamalla HP. Sampling large-scale social networks: Insights from simulated networks. 2008. Paper presented at 2008 Workshop on Information Technologies and Systems, WITS 2008, Paris, France.