Crowdsky: Skyline computation with crowdsourcing

Jongwuk Lee, Dongwon Lee, Sang Wook Kim

Research output: Chapter in Book/Report/Conference proceedingConference contribution

3 Citations (Scopus)

Abstract

In this paper, we propose a crowdsourcing-based approach to solving skyline queries with incomplete data. Our main idea is to leverage crowds to infer the pair-wise preferences between tuples when the values of tuples in some attributes are unknown. Specifically, our proposed solution considers three key factors used in existing crowd-enabled algorithms: (1) minimizing a monetary cost in identifying a crowdsourced skyline by using a dominating set, (2) reducing the number of rounds for latency by parallelizing the questions asked to crowds, and (3) improving the accuracy of a crowdsourced skyline by dynamically assigning the number of crowd workers per question. We evaluate our solution over both simulated and real crowdsourcing using the Amazon Mechanical Turk. Compared to a sort-based baseline method, our solution significantly minimizes the monetary cost, and reduces the number of rounds up to two orders of magnitude. In addition, our dynamic majority voting method shows higher accuracy than both static majority voting method and the existing solution using unary questions.

Original languageEnglish (US)
Title of host publicationAdvances in Database Technology - EDBT 2016
Subtitle of host publication19th International Conference on Extending Database Technology, Proceedings
EditorsIoana Manolescu, Evaggelia Pitoura, Amelie Marian, Sofian Maabout, Letizia Tanca, Georgia Koutrika, Kostas Stefanidis
PublisherOpenProceedings.org
Pages125-136
Number of pages12
ISBN (Electronic)9783893180707
DOIs
StatePublished - Jan 1 2016
Event19th International Conference on Extending Database Technology, EDBT 2016 - Bordeaux, France
Duration: Mar 15 2016Mar 18 2016

Publication series

NameAdvances in Database Technology - EDBT
Volume2016-March
ISSN (Electronic)2367-2005

Other

Other19th International Conference on Extending Database Technology, EDBT 2016
CountryFrance
CityBordeaux
Period3/15/163/18/16

Fingerprint

Costs

All Science Journal Classification (ASJC) codes

  • Information Systems
  • Software
  • Computer Science Applications

Cite this

Lee, J., Lee, D., & Kim, S. W. (2016). Crowdsky: Skyline computation with crowdsourcing. In I. Manolescu, E. Pitoura, A. Marian, S. Maabout, L. Tanca, G. Koutrika, & K. Stefanidis (Eds.), Advances in Database Technology - EDBT 2016: 19th International Conference on Extending Database Technology, Proceedings (pp. 125-136). (Advances in Database Technology - EDBT; Vol. 2016-March). OpenProceedings.org. https://doi.org/10.5441/002/edbt.2016.14
Lee, Jongwuk ; Lee, Dongwon ; Kim, Sang Wook. / Crowdsky : Skyline computation with crowdsourcing. Advances in Database Technology - EDBT 2016: 19th International Conference on Extending Database Technology, Proceedings. editor / Ioana Manolescu ; Evaggelia Pitoura ; Amelie Marian ; Sofian Maabout ; Letizia Tanca ; Georgia Koutrika ; Kostas Stefanidis. OpenProceedings.org, 2016. pp. 125-136 (Advances in Database Technology - EDBT).
@inproceedings{3726d9244bf04b5ca3b3bb01075612be,
title = "Crowdsky: Skyline computation with crowdsourcing",
abstract = "In this paper, we propose a crowdsourcing-based approach to solving skyline queries with incomplete data. Our main idea is to leverage crowds to infer the pair-wise preferences between tuples when the values of tuples in some attributes are unknown. Specifically, our proposed solution considers three key factors used in existing crowd-enabled algorithms: (1) minimizing a monetary cost in identifying a crowdsourced skyline by using a dominating set, (2) reducing the number of rounds for latency by parallelizing the questions asked to crowds, and (3) improving the accuracy of a crowdsourced skyline by dynamically assigning the number of crowd workers per question. We evaluate our solution over both simulated and real crowdsourcing using the Amazon Mechanical Turk. Compared to a sort-based baseline method, our solution significantly minimizes the monetary cost, and reduces the number of rounds up to two orders of magnitude. In addition, our dynamic majority voting method shows higher accuracy than both static majority voting method and the existing solution using unary questions.",
author = "Jongwuk Lee and Dongwon Lee and Kim, {Sang Wook}",
year = "2016",
month = "1",
day = "1",
doi = "10.5441/002/edbt.2016.14",
language = "English (US)",
series = "Advances in Database Technology - EDBT",
publisher = "OpenProceedings.org",
pages = "125--136",
editor = "Ioana Manolescu and Evaggelia Pitoura and Amelie Marian and Sofian Maabout and Letizia Tanca and Georgia Koutrika and Kostas Stefanidis",
booktitle = "Advances in Database Technology - EDBT 2016",

}

Lee, J, Lee, D & Kim, SW 2016, Crowdsky: Skyline computation with crowdsourcing. in I Manolescu, E Pitoura, A Marian, S Maabout, L Tanca, G Koutrika & K Stefanidis (eds), Advances in Database Technology - EDBT 2016: 19th International Conference on Extending Database Technology, Proceedings. Advances in Database Technology - EDBT, vol. 2016-March, OpenProceedings.org, pp. 125-136, 19th International Conference on Extending Database Technology, EDBT 2016, Bordeaux, France, 3/15/16. https://doi.org/10.5441/002/edbt.2016.14

Crowdsky : Skyline computation with crowdsourcing. / Lee, Jongwuk; Lee, Dongwon; Kim, Sang Wook.

Advances in Database Technology - EDBT 2016: 19th International Conference on Extending Database Technology, Proceedings. ed. / Ioana Manolescu; Evaggelia Pitoura; Amelie Marian; Sofian Maabout; Letizia Tanca; Georgia Koutrika; Kostas Stefanidis. OpenProceedings.org, 2016. p. 125-136 (Advances in Database Technology - EDBT; Vol. 2016-March).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Crowdsky

T2 - Skyline computation with crowdsourcing

AU - Lee, Jongwuk

AU - Lee, Dongwon

AU - Kim, Sang Wook

PY - 2016/1/1

Y1 - 2016/1/1

N2 - In this paper, we propose a crowdsourcing-based approach to solving skyline queries with incomplete data. Our main idea is to leverage crowds to infer the pair-wise preferences between tuples when the values of tuples in some attributes are unknown. Specifically, our proposed solution considers three key factors used in existing crowd-enabled algorithms: (1) minimizing a monetary cost in identifying a crowdsourced skyline by using a dominating set, (2) reducing the number of rounds for latency by parallelizing the questions asked to crowds, and (3) improving the accuracy of a crowdsourced skyline by dynamically assigning the number of crowd workers per question. We evaluate our solution over both simulated and real crowdsourcing using the Amazon Mechanical Turk. Compared to a sort-based baseline method, our solution significantly minimizes the monetary cost, and reduces the number of rounds up to two orders of magnitude. In addition, our dynamic majority voting method shows higher accuracy than both static majority voting method and the existing solution using unary questions.

AB - In this paper, we propose a crowdsourcing-based approach to solving skyline queries with incomplete data. Our main idea is to leverage crowds to infer the pair-wise preferences between tuples when the values of tuples in some attributes are unknown. Specifically, our proposed solution considers three key factors used in existing crowd-enabled algorithms: (1) minimizing a monetary cost in identifying a crowdsourced skyline by using a dominating set, (2) reducing the number of rounds for latency by parallelizing the questions asked to crowds, and (3) improving the accuracy of a crowdsourced skyline by dynamically assigning the number of crowd workers per question. We evaluate our solution over both simulated and real crowdsourcing using the Amazon Mechanical Turk. Compared to a sort-based baseline method, our solution significantly minimizes the monetary cost, and reduces the number of rounds up to two orders of magnitude. In addition, our dynamic majority voting method shows higher accuracy than both static majority voting method and the existing solution using unary questions.

UR - http://www.scopus.com/inward/record.url?scp=85046636383&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85046636383&partnerID=8YFLogxK

U2 - 10.5441/002/edbt.2016.14

DO - 10.5441/002/edbt.2016.14

M3 - Conference contribution

AN - SCOPUS:85046636383

T3 - Advances in Database Technology - EDBT

SP - 125

EP - 136

BT - Advances in Database Technology - EDBT 2016

A2 - Manolescu, Ioana

A2 - Pitoura, Evaggelia

A2 - Marian, Amelie

A2 - Maabout, Sofian

A2 - Tanca, Letizia

A2 - Koutrika, Georgia

A2 - Stefanidis, Kostas

PB - OpenProceedings.org

ER -

Lee J, Lee D, Kim SW. Crowdsky: Skyline computation with crowdsourcing. In Manolescu I, Pitoura E, Marian A, Maabout S, Tanca L, Koutrika G, Stefanidis K, editors, Advances in Database Technology - EDBT 2016: 19th International Conference on Extending Database Technology, Proceedings. OpenProceedings.org. 2016. p. 125-136. (Advances in Database Technology - EDBT). https://doi.org/10.5441/002/edbt.2016.14