TY - JOUR
T1 - GWAS-based machine learning approach to predict duloxetine response in major depressive disorder
AU - Maciukiewicz, Malgorzata
AU - Marshe, Victoria S.
AU - Hauschild, Anne Christin
AU - Foster, Jane A.
AU - Rotzinger, Susan
AU - Kennedy, James L.
AU - Kennedy, Sidney H.
AU - Müller, Daniel J.
AU - Geraci, Joseph
N1 - Funding Information:
CAN-BIND is an Integrated Discovery Program carried out in partnership with, and financial support from, the Ontario Brain Institute , an independent non-profit corporation , funded partially by the Ontario government .
Funding Information:
MM was given funding from Canadian Institutes of Health Research (CIHR, application no: 81655 ) to continue working on ML models. VM is funded by Ontario Mental Health Foundation (OMHF) and doctoral fellowship from CIHR Stage.
Funding Information:
MM was given funding from Canadian Institutes of Health Research (CIHR, application no: 81655) to continue working on ML models. VM is funded by Ontario Mental Health Foundation (OMHF) and doctoral fellowship from CIHR Stage.
Publisher Copyright:
© 2017
PY - 2018/4
Y1 - 2018/4
N2 - Major depressive disorder (MDD) is one of the most prevalent psychiatric disorders and is commonly treated with antidepressant drugs. However, large variability is observed in terms of response to antidepressants. Machine learning (ML) models may be useful to predict treatment outcomes. A sample of 186 MDD patients received treatment with duloxetine for up to 8 weeks were categorized as “responders” based on a MADRS change >50% from baseline; or “remitters” based on a MADRS score ≤10 at end point. The initial dataset (N = 186) was randomly divided into training and test sets in a nested 5-fold cross-validation, where 80% was used as a training set and 20% made up five independent test sets. We performed genome-wide logistic regression to identify potentially significant variants related to duloxetine response/remission and extracted the most promising predictors using LASSO regression. Subsequently, classification-regression trees (CRT) and support vector machines (SVM) were applied to construct models, using ten-fold cross-validation. With regards to response, none of the pairs performed significantly better than chance (accuracy p >.1). For remission, SVM achieved moderate performance with an accuracy = 0.52, a sensitivity = 0.58, and a specificity = 0.46, and 0.51 for all coefficients for CRT. The best performing SVM fold was characterized by an accuracy = 0.66 (p =.071), sensitivity = 0.70 and a sensitivity = 0.61. In this study, the potential of using GWAS data to predict duloxetine outcomes was examined using ML models. The models were characterized by a promising sensitivity, but specificity remained moderate at best. The inclusion of additional non-genetic variables to create integrated models may improve prediction.
AB - Major depressive disorder (MDD) is one of the most prevalent psychiatric disorders and is commonly treated with antidepressant drugs. However, large variability is observed in terms of response to antidepressants. Machine learning (ML) models may be useful to predict treatment outcomes. A sample of 186 MDD patients received treatment with duloxetine for up to 8 weeks were categorized as “responders” based on a MADRS change >50% from baseline; or “remitters” based on a MADRS score ≤10 at end point. The initial dataset (N = 186) was randomly divided into training and test sets in a nested 5-fold cross-validation, where 80% was used as a training set and 20% made up five independent test sets. We performed genome-wide logistic regression to identify potentially significant variants related to duloxetine response/remission and extracted the most promising predictors using LASSO regression. Subsequently, classification-regression trees (CRT) and support vector machines (SVM) were applied to construct models, using ten-fold cross-validation. With regards to response, none of the pairs performed significantly better than chance (accuracy p >.1). For remission, SVM achieved moderate performance with an accuracy = 0.52, a sensitivity = 0.58, and a specificity = 0.46, and 0.51 for all coefficients for CRT. The best performing SVM fold was characterized by an accuracy = 0.66 (p =.071), sensitivity = 0.70 and a sensitivity = 0.61. In this study, the potential of using GWAS data to predict duloxetine outcomes was examined using ML models. The models were characterized by a promising sensitivity, but specificity remained moderate at best. The inclusion of additional non-genetic variables to create integrated models may improve prediction.
UR - http://www.scopus.com/inward/record.url?scp=85041460311&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85041460311&partnerID=8YFLogxK
U2 - 10.1016/j.jpsychires.2017.12.009
DO - 10.1016/j.jpsychires.2017.12.009
M3 - Article
C2 - 29407288
AN - SCOPUS:85041460311
SN - 0022-3956
VL - 99
SP - 62
EP - 68
JO - Journal of Psychiatric Research
JF - Journal of Psychiatric Research
ER -