Individuals suffer from chronic diseases without being identified in time, which brings lots of burden of disease to the society. This paper presents a multiple disease risk prediction method to systematically assess future disease risks for patients based on their longitudinal medical records. In this study, medical diagnoses based on International Classification of Diseases (ICD) are aggregated into different levels for prediction to meet the needs of different stakeholders. The proposed approach gets validated using two independent hospital medical datasets, which includes 7105 patients with 18, 893 patients and 4170 patients with 13, 124 visits, respectively. The initial analysis reveals a high variation in patients' characteristics. The study demonstrates that recurrent neural network with long-short time memory units performs well in different levels of diagnosis aggregation. Especially, the results show that the developed model can be well applied to predicting future disease risks for patients, with the exact-match score of 98.90% and 95.12% using 3-digit ICD code aggregation, while 96.60% and 96.83% using 4-digit ICD code aggregation for these two datasets, respectively. Moreover, the approach can be developed as a reference tool for hospital information systems, enhancing patients' healthcare management over time.
All Science Journal Classification (ASJC) codes
- Computer Science Applications
- Electrical and Electronic Engineering
- Health Information Management