Clustering remote RDF data using SPARQL update queries

Letao Qi, Harris T. Lin, Vasant Honavar

Research output: Chapter in Book/Report/Conference proceedingConference contribution

7 Citations (Scopus)

Abstract

The emergence of large and distributed RDF data in the Linked Open Data cloud calls for approaches to extract useful knowledge using machine learning techniques such as clustering. However, the massive size and remote nature of RDF data hinder traditional approaches that gather the datasets onto a centralized location for analysis. In this work, we show how to implement two representative clustering algorithms using update queries against the SPARQL endpoint of the RDF store. We compare the time complexity and the communication complexity of our algorithms with of those that require direct centralized access to the data and hence have to retrieve the entire RDF dataset from the remote location. We conduct experiments on a real social network dataset and report our preliminary findings.

Original languageEnglish (US)
Title of host publication2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013
Pages236-242
Number of pages7
DOIs
StatePublished - Aug 19 2013
Event2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013 - Brisbane, QLD, Australia
Duration: Apr 8 2013Apr 11 2013

Publication series

NameProceedings - International Conference on Data Engineering
ISSN (Print)1084-4627

Other

Other2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013
CountryAustralia
CityBrisbane, QLD
Period4/8/134/11/13

Fingerprint

Clustering algorithms
Learning systems
Communication
Experiments

All Science Journal Classification (ASJC) codes

  • Software
  • Signal Processing
  • Information Systems

Cite this

Qi, L., Lin, H. T., & Honavar, V. (2013). Clustering remote RDF data using SPARQL update queries. In 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013 (pp. 236-242). [6547456] (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDEW.2013.6547456
Qi, Letao ; Lin, Harris T. ; Honavar, Vasant. / Clustering remote RDF data using SPARQL update queries. 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013. 2013. pp. 236-242 (Proceedings - International Conference on Data Engineering).
@inproceedings{fde68d70a8804995b1288220da4a6986,
title = "Clustering remote RDF data using SPARQL update queries",
abstract = "The emergence of large and distributed RDF data in the Linked Open Data cloud calls for approaches to extract useful knowledge using machine learning techniques such as clustering. However, the massive size and remote nature of RDF data hinder traditional approaches that gather the datasets onto a centralized location for analysis. In this work, we show how to implement two representative clustering algorithms using update queries against the SPARQL endpoint of the RDF store. We compare the time complexity and the communication complexity of our algorithms with of those that require direct centralized access to the data and hence have to retrieve the entire RDF dataset from the remote location. We conduct experiments on a real social network dataset and report our preliminary findings.",
author = "Letao Qi and Lin, {Harris T.} and Vasant Honavar",
year = "2013",
month = "8",
day = "19",
doi = "10.1109/ICDEW.2013.6547456",
language = "English (US)",
isbn = "9781467353021",
series = "Proceedings - International Conference on Data Engineering",
pages = "236--242",
booktitle = "2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013",

}

Qi, L, Lin, HT & Honavar, V 2013, Clustering remote RDF data using SPARQL update queries. in 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013., 6547456, Proceedings - International Conference on Data Engineering, pp. 236-242, 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013, Brisbane, QLD, Australia, 4/8/13. https://doi.org/10.1109/ICDEW.2013.6547456

Clustering remote RDF data using SPARQL update queries. / Qi, Letao; Lin, Harris T.; Honavar, Vasant.

2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013. 2013. p. 236-242 6547456 (Proceedings - International Conference on Data Engineering).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Clustering remote RDF data using SPARQL update queries

AU - Qi, Letao

AU - Lin, Harris T.

AU - Honavar, Vasant

PY - 2013/8/19

Y1 - 2013/8/19

N2 - The emergence of large and distributed RDF data in the Linked Open Data cloud calls for approaches to extract useful knowledge using machine learning techniques such as clustering. However, the massive size and remote nature of RDF data hinder traditional approaches that gather the datasets onto a centralized location for analysis. In this work, we show how to implement two representative clustering algorithms using update queries against the SPARQL endpoint of the RDF store. We compare the time complexity and the communication complexity of our algorithms with of those that require direct centralized access to the data and hence have to retrieve the entire RDF dataset from the remote location. We conduct experiments on a real social network dataset and report our preliminary findings.

AB - The emergence of large and distributed RDF data in the Linked Open Data cloud calls for approaches to extract useful knowledge using machine learning techniques such as clustering. However, the massive size and remote nature of RDF data hinder traditional approaches that gather the datasets onto a centralized location for analysis. In this work, we show how to implement two representative clustering algorithms using update queries against the SPARQL endpoint of the RDF store. We compare the time complexity and the communication complexity of our algorithms with of those that require direct centralized access to the data and hence have to retrieve the entire RDF dataset from the remote location. We conduct experiments on a real social network dataset and report our preliminary findings.

UR - http://www.scopus.com/inward/record.url?scp=84881411773&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84881411773&partnerID=8YFLogxK

U2 - 10.1109/ICDEW.2013.6547456

DO - 10.1109/ICDEW.2013.6547456

M3 - Conference contribution

AN - SCOPUS:84881411773

SN - 9781467353021

T3 - Proceedings - International Conference on Data Engineering

SP - 236

EP - 242

BT - 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013

ER -

Qi L, Lin HT, Honavar V. Clustering remote RDF data using SPARQL update queries. In 2013 IEEE 29th International Conference on Data Engineering Workshops, ICDEW 2013. 2013. p. 236-242. 6547456. (Proceedings - International Conference on Data Engineering). https://doi.org/10.1109/ICDEW.2013.6547456