TY - JOUR
T1 - Benchmarking Machine Learning Algorithms on Blood Glucose Prediction for Type i Diabetes in Comparison with Classical Time-Series Models
AU - Xie, Jinyu
AU - Wang, Qian
N1 - Funding Information:
The authors would like to thank Dr. Cindy Marling and Dr. Razvan Bunescu from the Ohio University for providing the patient data through a Data Use Agreement (DUA no. D201804) between the Ohio University and the Pennsylvania State University. The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Wang would like to thank the support from Penn State University Social Science Research Institute (SSRI). This work received no other financial support from any agencies, foundations, or companies. The authors had full access to all of the data, devices, and materials used in this study and take complete responsibility for the integrity of the data and the accuracy of the data analysis and interpretation of outcomes.
Publisher Copyright:
© 1964-2012 IEEE.
PY - 2020/11
Y1 - 2020/11
N2 - Objective: This paper aims to compare the performance of several commonly known machine-learning (ML) models versus a classic Autoregression with Exogenous inputs (ARX) model in the prediction of blood glucose (BG) levels using time-series data of patients with Type 1 diabetes (T1D). Methods: The ML algorithms include ML-based regression models and deep learning models such as a vanilla Long-Short-Term-Memory (LSTM) Network and a Temporal Convolution Network (TCN). Evaluations have been conducted with respect to different input features, regression model orders, as well as using the recursive method or direct method for multi-step prediction of BG levels. Prediction performance metrics include the average Root Mean Square Error (RMSE), temporal gain (TG) for early prediction, and the normalized energy of the second-order differences (ESOD) of the predicted time series to reflect risk of false alerts on hypo/hyper glycemia events. Results: The ARX model achieved the lowest average RMSE for both recursive and direct methods, the second highest average TG under the direct method, but with a higher average normalized ESOD than some other models. Conclusion: There was no significant advantage observed from the ML models compared to the classic ARX model in predicting BG levels for T1D, except that TCN's performance was more robust with respect to BG trajectories with spurious oscillations, for which ARX tended to over-predict peak BG values and under-predict valley BG values. Significance: Insight learned from this study could help researchers and clinical practitioners to select appropriate models for BG prediction.
AB - Objective: This paper aims to compare the performance of several commonly known machine-learning (ML) models versus a classic Autoregression with Exogenous inputs (ARX) model in the prediction of blood glucose (BG) levels using time-series data of patients with Type 1 diabetes (T1D). Methods: The ML algorithms include ML-based regression models and deep learning models such as a vanilla Long-Short-Term-Memory (LSTM) Network and a Temporal Convolution Network (TCN). Evaluations have been conducted with respect to different input features, regression model orders, as well as using the recursive method or direct method for multi-step prediction of BG levels. Prediction performance metrics include the average Root Mean Square Error (RMSE), temporal gain (TG) for early prediction, and the normalized energy of the second-order differences (ESOD) of the predicted time series to reflect risk of false alerts on hypo/hyper glycemia events. Results: The ARX model achieved the lowest average RMSE for both recursive and direct methods, the second highest average TG under the direct method, but with a higher average normalized ESOD than some other models. Conclusion: There was no significant advantage observed from the ML models compared to the classic ARX model in predicting BG levels for T1D, except that TCN's performance was more robust with respect to BG trajectories with spurious oscillations, for which ARX tended to over-predict peak BG values and under-predict valley BG values. Significance: Insight learned from this study could help researchers and clinical practitioners to select appropriate models for BG prediction.
UR - http://www.scopus.com/inward/record.url?scp=85092426708&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85092426708&partnerID=8YFLogxK
U2 - 10.1109/TBME.2020.2975959
DO - 10.1109/TBME.2020.2975959
M3 - Article
C2 - 32091990
AN - SCOPUS:85092426708
SN - 0018-9294
VL - 67
SP - 3101
EP - 3124
JO - IRE transactions on medical electronics
JF - IRE transactions on medical electronics
IS - 11
M1 - 9007528
ER -