Automated medical diagnosis by ranking clusters across the symptom-disease network

Jingchao Ni, Hongliang Fei, Wei Fan, Xiang Zhang

Research output: Chapter in Book/Report/Conference proceedingConference contribution

5 Citations (Scopus)

Abstract

The rapid growth of medical recording data has increased the demand for automated analysis. An important problem in recent medical research is automated medical diagnosis, which is to infer likely diseases for the observed symptoms. Existing approaches typically perform the inference on a sparse bipartite graph with two sets of nodes representing diseases and symptoms, respectively. By using this graph, existing methods basically assume no direct dependency exists between diseases (or symptoms), which may not be true in practice. To address this limitation, we propose to integrate two domain networks encoding similarities between diseases and those between symptoms to avoid information loss as well as to alleviate the sparsity problem of the bipartite graph. Another limitation of the existing methods is that they usually output a ranked list of diseases mixed from very different etiologies which greatly limits their practical usefulness. An ideal method should allow a clustered structure in the disease ranking list so that both similar and different diseases can be easily identified. Therefore, we formulate automated diagnosis as a novel cross-domain cluster ranking problem, which identifies and ranks the disease clusters simultaneously in the symptom-disease network. Our formulation employs a joint learning scheme in which the dual procedures of cluster finding and cluster ranking are coupled and mutually reinforced. Experimental results on real-world datasets demonstrate the effectiveness of our method.

Original languageEnglish (US)
Title of host publicationProceedings - 17th IEEE International Conference on Data Mining, ICDM 2017
EditorsGeorge Karypis, Srinivas Alu, Vijay Raghavan, Xindong Wu, Lucio Miele
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages1009-1014
Number of pages6
ISBN (Electronic)9781538638347
DOIs
StatePublished - Dec 15 2017
Event17th IEEE International Conference on Data Mining, ICDM 2017 - New Orleans, United States
Duration: Nov 18 2017Nov 21 2017

Publication series

NameProceedings - IEEE International Conference on Data Mining, ICDM
Volume2017-November
ISSN (Print)1550-4786

Other

Other17th IEEE International Conference on Data Mining, ICDM 2017
CountryUnited States
CityNew Orleans
Period11/18/1711/21/17

Fingerprint

Data recording

All Science Journal Classification (ASJC) codes

  • Engineering(all)

Cite this

Ni, J., Fei, H., Fan, W., & Zhang, X. (2017). Automated medical diagnosis by ranking clusters across the symptom-disease network. In G. Karypis, S. Alu, V. Raghavan, X. Wu, & L. Miele (Eds.), Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017 (pp. 1009-1014). (Proceedings - IEEE International Conference on Data Mining, ICDM; Vol. 2017-November). Institute of Electrical and Electronics Engineers Inc.. https://doi.org/10.1109/ICDM.2017.130
Ni, Jingchao ; Fei, Hongliang ; Fan, Wei ; Zhang, Xiang. / Automated medical diagnosis by ranking clusters across the symptom-disease network. Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017. editor / George Karypis ; Srinivas Alu ; Vijay Raghavan ; Xindong Wu ; Lucio Miele. Institute of Electrical and Electronics Engineers Inc., 2017. pp. 1009-1014 (Proceedings - IEEE International Conference on Data Mining, ICDM).
@inproceedings{c9e0f4db190f4b488e8bff07682b3fba,
title = "Automated medical diagnosis by ranking clusters across the symptom-disease network",
abstract = "The rapid growth of medical recording data has increased the demand for automated analysis. An important problem in recent medical research is automated medical diagnosis, which is to infer likely diseases for the observed symptoms. Existing approaches typically perform the inference on a sparse bipartite graph with two sets of nodes representing diseases and symptoms, respectively. By using this graph, existing methods basically assume no direct dependency exists between diseases (or symptoms), which may not be true in practice. To address this limitation, we propose to integrate two domain networks encoding similarities between diseases and those between symptoms to avoid information loss as well as to alleviate the sparsity problem of the bipartite graph. Another limitation of the existing methods is that they usually output a ranked list of diseases mixed from very different etiologies which greatly limits their practical usefulness. An ideal method should allow a clustered structure in the disease ranking list so that both similar and different diseases can be easily identified. Therefore, we formulate automated diagnosis as a novel cross-domain cluster ranking problem, which identifies and ranks the disease clusters simultaneously in the symptom-disease network. Our formulation employs a joint learning scheme in which the dual procedures of cluster finding and cluster ranking are coupled and mutually reinforced. Experimental results on real-world datasets demonstrate the effectiveness of our method.",
author = "Jingchao Ni and Hongliang Fei and Wei Fan and Xiang Zhang",
year = "2017",
month = "12",
day = "15",
doi = "10.1109/ICDM.2017.130",
language = "English (US)",
series = "Proceedings - IEEE International Conference on Data Mining, ICDM",
publisher = "Institute of Electrical and Electronics Engineers Inc.",
pages = "1009--1014",
editor = "George Karypis and Srinivas Alu and Vijay Raghavan and Xindong Wu and Lucio Miele",
booktitle = "Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017",
address = "United States",

}

Ni, J, Fei, H, Fan, W & Zhang, X 2017, Automated medical diagnosis by ranking clusters across the symptom-disease network. in G Karypis, S Alu, V Raghavan, X Wu & L Miele (eds), Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017. Proceedings - IEEE International Conference on Data Mining, ICDM, vol. 2017-November, Institute of Electrical and Electronics Engineers Inc., pp. 1009-1014, 17th IEEE International Conference on Data Mining, ICDM 2017, New Orleans, United States, 11/18/17. https://doi.org/10.1109/ICDM.2017.130

Automated medical diagnosis by ranking clusters across the symptom-disease network. / Ni, Jingchao; Fei, Hongliang; Fan, Wei; Zhang, Xiang.

Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017. ed. / George Karypis; Srinivas Alu; Vijay Raghavan; Xindong Wu; Lucio Miele. Institute of Electrical and Electronics Engineers Inc., 2017. p. 1009-1014 (Proceedings - IEEE International Conference on Data Mining, ICDM; Vol. 2017-November).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - Automated medical diagnosis by ranking clusters across the symptom-disease network

AU - Ni, Jingchao

AU - Fei, Hongliang

AU - Fan, Wei

AU - Zhang, Xiang

PY - 2017/12/15

Y1 - 2017/12/15

N2 - The rapid growth of medical recording data has increased the demand for automated analysis. An important problem in recent medical research is automated medical diagnosis, which is to infer likely diseases for the observed symptoms. Existing approaches typically perform the inference on a sparse bipartite graph with two sets of nodes representing diseases and symptoms, respectively. By using this graph, existing methods basically assume no direct dependency exists between diseases (or symptoms), which may not be true in practice. To address this limitation, we propose to integrate two domain networks encoding similarities between diseases and those between symptoms to avoid information loss as well as to alleviate the sparsity problem of the bipartite graph. Another limitation of the existing methods is that they usually output a ranked list of diseases mixed from very different etiologies which greatly limits their practical usefulness. An ideal method should allow a clustered structure in the disease ranking list so that both similar and different diseases can be easily identified. Therefore, we formulate automated diagnosis as a novel cross-domain cluster ranking problem, which identifies and ranks the disease clusters simultaneously in the symptom-disease network. Our formulation employs a joint learning scheme in which the dual procedures of cluster finding and cluster ranking are coupled and mutually reinforced. Experimental results on real-world datasets demonstrate the effectiveness of our method.

AB - The rapid growth of medical recording data has increased the demand for automated analysis. An important problem in recent medical research is automated medical diagnosis, which is to infer likely diseases for the observed symptoms. Existing approaches typically perform the inference on a sparse bipartite graph with two sets of nodes representing diseases and symptoms, respectively. By using this graph, existing methods basically assume no direct dependency exists between diseases (or symptoms), which may not be true in practice. To address this limitation, we propose to integrate two domain networks encoding similarities between diseases and those between symptoms to avoid information loss as well as to alleviate the sparsity problem of the bipartite graph. Another limitation of the existing methods is that they usually output a ranked list of diseases mixed from very different etiologies which greatly limits their practical usefulness. An ideal method should allow a clustered structure in the disease ranking list so that both similar and different diseases can be easily identified. Therefore, we formulate automated diagnosis as a novel cross-domain cluster ranking problem, which identifies and ranks the disease clusters simultaneously in the symptom-disease network. Our formulation employs a joint learning scheme in which the dual procedures of cluster finding and cluster ranking are coupled and mutually reinforced. Experimental results on real-world datasets demonstrate the effectiveness of our method.

UR - http://www.scopus.com/inward/record.url?scp=85043974492&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85043974492&partnerID=8YFLogxK

U2 - 10.1109/ICDM.2017.130

DO - 10.1109/ICDM.2017.130

M3 - Conference contribution

AN - SCOPUS:85043974492

T3 - Proceedings - IEEE International Conference on Data Mining, ICDM

SP - 1009

EP - 1014

BT - Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017

A2 - Karypis, George

A2 - Alu, Srinivas

A2 - Raghavan, Vijay

A2 - Wu, Xindong

A2 - Miele, Lucio

PB - Institute of Electrical and Electronics Engineers Inc.

ER -

Ni J, Fei H, Fan W, Zhang X. Automated medical diagnosis by ranking clusters across the symptom-disease network. In Karypis G, Alu S, Raghavan V, Wu X, Miele L, editors, Proceedings - 17th IEEE International Conference on Data Mining, ICDM 2017. Institute of Electrical and Electronics Engineers Inc. 2017. p. 1009-1014. (Proceedings - IEEE International Conference on Data Mining, ICDM). https://doi.org/10.1109/ICDM.2017.130