Collaborative targeted maximum likelihood estimation for variable importance measure: Illustration for functional outcome prediction in mild traumatic brain injuries

Romain Pirracchio, John K. Yue, Geoffrey T. Manley, Mark J. Van Der Laan, Alan E. Hubbard, Hester F. Lingsma, Andrew I.R. Maas, Pratik Mukherjee, David O. Okonkwo, David M. Schnyer, Alex B. Valadka, Esther L. Yuh

Research output: Contribution to journalArticlepeer-review

7 Scopus citations

Abstract

Standard statistical practice used for determining the relative importance of competing causes of disease typically relies on ad hoc methods, often byproducts of machine learning procedures (stepwise regression, random forest, etc.). Causal inference framework and data-adaptive methods may help to tailor parameters to match the clinical question and free one from arbitrary modeling assumptions. Our focus is on implementations of such semiparametric methods for a variable importance measure (VIM). We propose a fully automated procedure for VIM based on collaborative targeted maximum likelihood estimation (cTMLE), a method that optimizes the estimate of an association in the presence of potentially numerous competing causes. We applied the approach to data collected from traumatic brain injury patients, specifically a prospective, observational study including three US Level-1 trauma centers. The primary outcome was a disability score (Glasgow Outcome Scale - Extended (GOSE)) collected three months post-injury. We identified clinically important predictors among a set of risk factors using a variable importance analysis based on targeted maximum likelihood estimators (TMLE) and on cTMLE. Via a parametric bootstrap, we demonstrate that the latter procedure has the potential for robust automated estimation of variable importance measures based upon machine-learning algorithms. The cTMLE estimator was associated with substantially less positivity bias as compared to TMLE and larger coverage of the 95% CI. This study confirms the power of an automated cTMLE procedure that can target model selection via machine learning to estimate VIMs in complicated, high-dimensional data.

Original languageEnglish (US)
Pages (from-to)286-297
Number of pages12
JournalStatistical Methods in Medical Research
Volume27
Issue number1
DOIs
StatePublished - Jan 1 2018
Externally publishedYes

Keywords

  • Causal inference
  • Collaborative targeted maximum likelihood
  • High-dimensional data
  • Positivity
  • Semi-parametric
  • Variable importance measure

ASJC Scopus subject areas

  • Epidemiology
  • Statistics and Probability
  • Health Information Management

Fingerprint

Dive into the research topics of 'Collaborative targeted maximum likelihood estimation for variable importance measure: Illustration for functional outcome prediction in mild traumatic brain injuries'. Together they form a unique fingerprint.

Cite this