TY - JOUR
T1 - Model-based and model-free machine learning techniques for diagnostic prediction and classification of clinical outcomes in Parkinson's disease
AU - Gao, Chao
AU - Sun, Hanbo
AU - Wang, Tuo
AU - Tang, Ming
AU - Bohnen, Nicolaas I.
AU - Müller, Martijn L.T.M.
AU - Herman, Talia
AU - Giladi, Nir
AU - Kalinin, Alexandr
AU - Spino, Cathie
AU - Dauer, William
AU - Hausdorff, Jeffrey M.
AU - Dinov, Ivo D.
N1 - Funding Information:
This work was partially supported by NSF grants 17348531, 1636840, 1416953, NIH grants P01 NS015655, RO1 NS070856, P20 NR015331, P50 NS091856, P30 DK089503, U54 EB020406, and P30 AG053760, the Elsie Andresen Fiske Research Fund, and the Michael J Fox Foundation for Parkinson’s Research. Colleagues at the Statistics Online Computational Resource (SOCR), Tel Aviv Sourasky Medical Center for the Study of Movement, Cognition, and Mobility, the Big Data Discovery Science (BDDS), and the Michigan Institute for Data Science (MIDAS) provided vital support and advice. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Funding Information:
This work was partially supported by NSF grants 17348531, 1636840, 1416953, NIH grants P01 NS015655, RO1 NS070856, P20 NR015331, P50 NS091856, P30 DK089503, U54 EB020406, and P30 AG053760, the Elsie Andresen Fiske Research Fund, and the Michael J Fox Foundation for Parkinson's Research. Colleagues at the Statistics Online Computational Resource (SOCR), Tel Aviv Sourasky Medical Center for the Study of Movement, Cognition, and Mobility, the Big Data Discovery Science (BDDS), and the Michigan Institute for Data Science (MIDAS) provided vital support and advice. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Publisher Copyright:
© 2018 The Author(s).
PY - 2018/12/1
Y1 - 2018/12/1
N2 - In this study, we apply a multidisciplinary approach to investigate falls in PD patients using clinical, demographic and neuroimaging data from two independent initiatives (University of Michigan and Tel Aviv Sourasky Medical Center). Using machine learning techniques, we construct predictive models to discriminate fallers and non-fallers. Through controlled feature selection, we identified the most salient predictors of patient falls including gait speed, Hoehn and Yahr stage, postural instability and gait difficulty-related measurements. The model-based and model-free analytical methods we employed included logistic regression, random forests, support vector machines, and XGboost. The reliability of the forecasts was assessed by internal statistical (5-fold) cross validation as well as by external out-of-bag validation. Four specific challenges were addressed in the study: Challenge 1, develop a protocol for harmonizing and aggregating complex, multisource, and multi-site Parkinson's disease data; Challenge 2, identify salient predictive features associated with specific clinical traits, e.g., patient falls; Challenge 3, forecast patient falls and evaluate the classification performance; and Challenge 4, predict tremor dominance (TD) vs. posture instability and gait difficulty (PIGD). Our findings suggest that, compared to other approaches, model-free machine learning based techniques provide a more reliable clinical outcome forecasting of falls in Parkinson's patients, for example, with a classification accuracy of about 70-80%.
AB - In this study, we apply a multidisciplinary approach to investigate falls in PD patients using clinical, demographic and neuroimaging data from two independent initiatives (University of Michigan and Tel Aviv Sourasky Medical Center). Using machine learning techniques, we construct predictive models to discriminate fallers and non-fallers. Through controlled feature selection, we identified the most salient predictors of patient falls including gait speed, Hoehn and Yahr stage, postural instability and gait difficulty-related measurements. The model-based and model-free analytical methods we employed included logistic regression, random forests, support vector machines, and XGboost. The reliability of the forecasts was assessed by internal statistical (5-fold) cross validation as well as by external out-of-bag validation. Four specific challenges were addressed in the study: Challenge 1, develop a protocol for harmonizing and aggregating complex, multisource, and multi-site Parkinson's disease data; Challenge 2, identify salient predictive features associated with specific clinical traits, e.g., patient falls; Challenge 3, forecast patient falls and evaluate the classification performance; and Challenge 4, predict tremor dominance (TD) vs. posture instability and gait difficulty (PIGD). Our findings suggest that, compared to other approaches, model-free machine learning based techniques provide a more reliable clinical outcome forecasting of falls in Parkinson's patients, for example, with a classification accuracy of about 70-80%.
UR - http://www.scopus.com/inward/record.url?scp=85046866518&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85046866518&partnerID=8YFLogxK
U2 - 10.1038/s41598-018-24783-4
DO - 10.1038/s41598-018-24783-4
M3 - Article
C2 - 29740058
AN - SCOPUS:85046866518
SN - 2045-2322
VL - 8
JO - Scientific reports
JF - Scientific reports
IS - 1
M1 - 7129
ER -