Moving beyond regression techniques in cardiovascular risk prediction: Applying machine learning to address analytic challenges

Benjamin A. Goldstein; Ann Marie Navar; Rickey E. Carter

doi:10.1093/eurheartj/ehw302

Moving beyond regression techniques in cardiovascular risk prediction: Applying machine learning to address analytic challenges

Benjamin A. Goldstein, Ann Marie Navar, Rickey E. Carter

Research output: Contribution to journal › Review article › peer-review

323 Scopus citations

Abstract

Risk prediction plays an important role in clinical cardiology research. Traditionally, most risk models have been based on regression models. While useful and robust, these statistical methods are limited to using a small number of predictors which operate in the sameway on everyone, and uniformly throughout their range. The purpose of this review is to illustrate the use of machine-learning methods for development of risk prediction models. Typically presented as black box approaches, most machine-learning methods are aimed at solving particular challenges that arise in data analysis that are not well addressed by typical regression approaches. To illustrate these challenges, as well as how different methods can address them, we consider trying to predicting mortality after diagnosis of acute myocardial infarction. We use data derived from our institution's electronic health record and abstract data on 13 regularly measured laboratory markers. We walk through different challenges that arise in modelling these data and then introduce different machine-learning approaches. Finally, we discuss general issues in the application of machine-learning methods including tuning parameters, loss functions, variable importance, and missing data. Overall, this review serves as an introduction for those working on risk modelling to approach the diffuse field of machine learning.

Original language	English (US)
Pages (from-to)	1805-1814
Number of pages	10
Journal	European heart journal
Volume	38
Issue number	23
DOIs	https://doi.org/10.1093/eurheartj/ehw302
State	Published - Jun 14 2017
Externally published	Yes

Keywords

Electronic health records
Personalized medicine
Precision medicine
Risk prediction

ASJC Scopus subject areas

Cardiology and Cardiovascular Medicine

Access to Document

10.1093/eurheartj/ehw302

Cite this

@article{3e13049a336a4b389597eb8ee86197e0,

title = "Moving beyond regression techniques in cardiovascular risk prediction: Applying machine learning to address analytic challenges",

abstract = "Risk prediction plays an important role in clinical cardiology research. Traditionally, most risk models have been based on regression models. While useful and robust, these statistical methods are limited to using a small number of predictors which operate in the sameway on everyone, and uniformly throughout their range. The purpose of this review is to illustrate the use of machine-learning methods for development of risk prediction models. Typically presented as black box approaches, most machine-learning methods are aimed at solving particular challenges that arise in data analysis that are not well addressed by typical regression approaches. To illustrate these challenges, as well as how different methods can address them, we consider trying to predicting mortality after diagnosis of acute myocardial infarction. We use data derived from our institution's electronic health record and abstract data on 13 regularly measured laboratory markers. We walk through different challenges that arise in modelling these data and then introduce different machine-learning approaches. Finally, we discuss general issues in the application of machine-learning methods including tuning parameters, loss functions, variable importance, and missing data. Overall, this review serves as an introduction for those working on risk modelling to approach the diffuse field of machine learning.",

keywords = "Electronic health records, Personalized medicine, Precision medicine, Risk prediction",

author = "Goldstein, {Benjamin A.} and Navar, {Ann Marie} and Carter, {Rickey E.}",

note = "Publisher Copyright: {\textcopyright} The Author 2016. Published by Oxford University Press on behalf of the European Society of Cardiology.",

year = "2017",

month = jun,

day = "14",

doi = "10.1093/eurheartj/ehw302",

language = "English (US)",

volume = "38",

pages = "1805--1814",

journal = "European heart journal",

issn = "0195-668X",

publisher = "Oxford University Press",

number = "23",

}

TY - JOUR

T1 - Moving beyond regression techniques in cardiovascular risk prediction

T2 - Applying machine learning to address analytic challenges

AU - Goldstein, Benjamin A.

AU - Navar, Ann Marie

AU - Carter, Rickey E.

N1 - Publisher Copyright: © The Author 2016. Published by Oxford University Press on behalf of the European Society of Cardiology.

PY - 2017/6/14

Y1 - 2017/6/14

N2 - Risk prediction plays an important role in clinical cardiology research. Traditionally, most risk models have been based on regression models. While useful and robust, these statistical methods are limited to using a small number of predictors which operate in the sameway on everyone, and uniformly throughout their range. The purpose of this review is to illustrate the use of machine-learning methods for development of risk prediction models. Typically presented as black box approaches, most machine-learning methods are aimed at solving particular challenges that arise in data analysis that are not well addressed by typical regression approaches. To illustrate these challenges, as well as how different methods can address them, we consider trying to predicting mortality after diagnosis of acute myocardial infarction. We use data derived from our institution's electronic health record and abstract data on 13 regularly measured laboratory markers. We walk through different challenges that arise in modelling these data and then introduce different machine-learning approaches. Finally, we discuss general issues in the application of machine-learning methods including tuning parameters, loss functions, variable importance, and missing data. Overall, this review serves as an introduction for those working on risk modelling to approach the diffuse field of machine learning.

AB - Risk prediction plays an important role in clinical cardiology research. Traditionally, most risk models have been based on regression models. While useful and robust, these statistical methods are limited to using a small number of predictors which operate in the sameway on everyone, and uniformly throughout their range. The purpose of this review is to illustrate the use of machine-learning methods for development of risk prediction models. Typically presented as black box approaches, most machine-learning methods are aimed at solving particular challenges that arise in data analysis that are not well addressed by typical regression approaches. To illustrate these challenges, as well as how different methods can address them, we consider trying to predicting mortality after diagnosis of acute myocardial infarction. We use data derived from our institution's electronic health record and abstract data on 13 regularly measured laboratory markers. We walk through different challenges that arise in modelling these data and then introduce different machine-learning approaches. Finally, we discuss general issues in the application of machine-learning methods including tuning parameters, loss functions, variable importance, and missing data. Overall, this review serves as an introduction for those working on risk modelling to approach the diffuse field of machine learning.

KW - Electronic health records

KW - Personalized medicine

KW - Precision medicine

KW - Risk prediction

UR - http://www.scopus.com/inward/record.url?scp=85021953748&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85021953748&partnerID=8YFLogxK

U2 - 10.1093/eurheartj/ehw302

DO - 10.1093/eurheartj/ehw302

M3 - Review article

C2 - 27436868

AN - SCOPUS:85021953748

SN - 0195-668X

VL - 38

SP - 1805

EP - 1814

JO - European heart journal

JF - European heart journal

IS - 23

ER -

Moving beyond regression techniques in cardiovascular risk prediction: Applying machine learning to address analytic challenges

Abstract

Keywords

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this