Real-time effective framework for unstructured data mining

Richard Kwadzo Lomotey, Ralph Deters

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Citations (Scopus)

Abstract

Today, the enterprise landscape faces voluminous amount of data. The information gathered from these data sources are useful for improving on product and services delivery. However, it is challenging to perform knowledge discovery in database (KDD) activities on these data sources because of its unstructured nature. Previous studies have proposed the hierarchical clustering methodology since it enhances human readability and provides clear dependency structure through topics, term and document organization. But, the methodology can be resource intensive and time consuming. In order to improve on the terms extraction process, we propose a tool called RSenter that searches through interconnected Hyperlinks and NoSQL database (specifically, CouchDB). We evaluate the tool based on search algorithms such as parallelization, random walk (or linear search), pessimistic search, and optimistic search. The tool shows high accuracy and optimality in view of the search time.

Original languageEnglish (US)
Title of host publicationProceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013
Pages1081-1088
Number of pages8
DOIs
StatePublished - Dec 1 2013
Event12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013 - Melbourne, VIC, Australia
Duration: Jul 16 2013Jul 18 2013

Publication series

NameProceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013

Other

Other12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013
CountryAustralia
CityMelbourne, VIC
Period7/16/137/18/13

Fingerprint

Data mining
Industry

All Science Journal Classification (ASJC) codes

  • Computer Networks and Communications

Cite this

Lomotey, R. K., & Deters, R. (2013). Real-time effective framework for unstructured data mining. In Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013 (pp. 1081-1088). [6680952] (Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013). https://doi.org/10.1109/TrustCom.2013.131
Lomotey, Richard Kwadzo ; Deters, Ralph. / Real-time effective framework for unstructured data mining. Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013. 2013. pp. 1081-1088 (Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013).
@inproceedings{61f61f50990a4c04815c0cf5bd331705,
title = "Real-time effective framework for unstructured data mining",
abstract = "Today, the enterprise landscape faces voluminous amount of data. The information gathered from these data sources are useful for improving on product and services delivery. However, it is challenging to perform knowledge discovery in database (KDD) activities on these data sources because of its unstructured nature. Previous studies have proposed the hierarchical clustering methodology since it enhances human readability and provides clear dependency structure through topics, term and document organization. But, the methodology can be resource intensive and time consuming. In order to improve on the terms extraction process, we propose a tool called RSenter that searches through interconnected Hyperlinks and NoSQL database (specifically, CouchDB). We evaluate the tool based on search algorithms such as parallelization, random walk (or linear search), pessimistic search, and optimistic search. The tool shows high accuracy and optimality in view of the search time.",
author = "Lomotey, {Richard Kwadzo} and Ralph Deters",
year = "2013",
month = "12",
day = "1",
doi = "10.1109/TrustCom.2013.131",
language = "English (US)",
isbn = "9780769550220",
series = "Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013",
pages = "1081--1088",
booktitle = "Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013",

}

Lomotey, RK & Deters, R 2013, Real-time effective framework for unstructured data mining. in Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013., 6680952, Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013, pp. 1081-1088, 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013, Melbourne, VIC, Australia, 7/16/13. https://doi.org/10.1109/TrustCom.2013.131

Real-time effective framework for unstructured data mining. / Lomotey, Richard Kwadzo; Deters, Ralph.

Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013. 2013. p. 1081-1088 6680952 (Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Real-time effective framework for unstructured data mining

AU - Lomotey, Richard Kwadzo

AU - Deters, Ralph

PY - 2013/12/1

Y1 - 2013/12/1

N2 - Today, the enterprise landscape faces voluminous amount of data. The information gathered from these data sources are useful for improving on product and services delivery. However, it is challenging to perform knowledge discovery in database (KDD) activities on these data sources because of its unstructured nature. Previous studies have proposed the hierarchical clustering methodology since it enhances human readability and provides clear dependency structure through topics, term and document organization. But, the methodology can be resource intensive and time consuming. In order to improve on the terms extraction process, we propose a tool called RSenter that searches through interconnected Hyperlinks and NoSQL database (specifically, CouchDB). We evaluate the tool based on search algorithms such as parallelization, random walk (or linear search), pessimistic search, and optimistic search. The tool shows high accuracy and optimality in view of the search time.

AB - Today, the enterprise landscape faces voluminous amount of data. The information gathered from these data sources are useful for improving on product and services delivery. However, it is challenging to perform knowledge discovery in database (KDD) activities on these data sources because of its unstructured nature. Previous studies have proposed the hierarchical clustering methodology since it enhances human readability and provides clear dependency structure through topics, term and document organization. But, the methodology can be resource intensive and time consuming. In order to improve on the terms extraction process, we propose a tool called RSenter that searches through interconnected Hyperlinks and NoSQL database (specifically, CouchDB). We evaluate the tool based on search algorithms such as parallelization, random walk (or linear search), pessimistic search, and optimistic search. The tool shows high accuracy and optimality in view of the search time.

UR - http://www.scopus.com/inward/record.url?scp=84893472210&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=84893472210&partnerID=8YFLogxK

U2 - 10.1109/TrustCom.2013.131

DO - 10.1109/TrustCom.2013.131

M3 - Conference contribution

SN - 9780769550220

T3 - Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013

SP - 1081

EP - 1088

BT - Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013

ER -

Lomotey RK, Deters R. Real-time effective framework for unstructured data mining. In Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013. 2013. p. 1081-1088. 6680952. (Proceedings - 12th IEEE International Conference on Trust, Security and Privacy in Computing and Communications, TrustCom 2013). https://doi.org/10.1109/TrustCom.2013.131