TY - JOUR
T1 - Integration of Face-to-Face Screening with Real-time Machine Learning to Predict Risk of Suicide among Adults
AU - Wilimitis, Drew
AU - Turer, Robert W.
AU - Ripperger, Michael
AU - McCoy, Allison B.
AU - Sperry, Sarah H.
AU - Fielstein, Elliot M.
AU - Kurz, Troy
AU - Walsh, Colin G.
N1 - Funding Information:
Funding/Support: The study was funded by Evelyn Selby Stead Fund for Innovation, Vanderbilt University Medical Center (grant R01 MH121455: Distinguishing clinical and genetic risk of suicidal ideation from attempts to inform prevention and grant R01 MH116269: Leveraging Electronic Health Records for Pharmacogenomics of Psychiatric Disorders) and grant W81XWH-10-2-0181 (Optimized Suicide Risk Detection and Management in Military Primary Care) from the Military Suicide Research Consortium. Funding for the Research Derivative and BioVU Synthetic Derivative is through UL1 RR024975/RR/NCRR from the National Center for Research Resources.
Funding Information:
Conflict of Interest Disclosures: Dr Walsh reported receiving grants from the National Institutes of Health, the US Food and Drug Administration, the Military Suicide Research Consortium, Wellcome Leap, the Selby Stead Fund, Vanderbilt University Medical Center, and the Tennessee Department of Health; receiving personal fees from Southeastern Home Office Underwriters Association and Hannover Re; and holding equity in Sage AI outside the submitted work. No other disclosures were reported.
Publisher Copyright:
© 2022 American Medical Association. All rights reserved.
PY - 2022/5/13
Y1 - 2022/5/13
N2 - Importance: Understanding the differences and potential synergies between traditional clinician assessment and automated machine learning might enable more accurate and useful suicide risk detection. Objective: To evaluate the respective and combined abilities of a real-time machine learning model and the Columbia Suicide Severity Rating Scale (C-SSRS) to predict suicide attempt (SA) and suicidal ideation (SI). Design, Setting, and Participants: This cohort study included encounters with adult patients (aged ≥18 years) at a major academic medical center. The C-SSRS was administered during routine care, and a Vanderbilt Suicide Attempt and Ideation Likelihood (VSAIL) prediction was generated in the electronic health record. Encounters took place in the inpatient, ambulatory surgical, and emergency department settings. Data were collected from June 2019 to September 2020. Main Outcomes and Measures: Primary outcomes were the incidence of SA and SI, encoded as International Classification of Diseases codes, occurring within various time periods after an index visit. We evaluated the retrospective validity of the C-SSRS, VSAIL, and ensemble models combining both. Discrimination metrics included area under the receiver operating curve (AUROC), area under the precision-recall curve (AUPR), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Results: The cohort included 120398 unique index visits for 83394 patients (mean [SD] age, 51.2 [20.6] years; 38107 [46%] men; 45273 [54%] women; 13644 [16%] Black; 63869 [77%] White). Within 30 days of an index visit, the combined models had higher AUROC (SA: 0.874-0.887; SI: 0.869-0.879) than both the VSAIL (SA: 0.729; SI: 0.773) and C-SSRS (SA: 0.823; SI: 0.777) models. In the highest risk-decile, ensemble methods had PPV of 1.3% to 1.4% for SA and 8.3% to 8.7% for SI and sensitivity of 77.6% to 79.5% for SA and 67.4% to 70.1% for SI, outperforming VSAIL (PPV for SA: 0.4%; PPV for SI: 3.9%; sensitivity for SA: 28.8%; sensitivity for SI: 35.1%) and C-SSRS (PPV for SA: 0.5%; PPV for SI: 3.5%; sensitivity for SA: 76.6%; sensitivity for SI: 68.8%). Conclusions and Relevance: In this study, suicide risk prediction was optimal when leveraging both in-person screening (for acute measures of risk in patient-reported suicidality) and historical EHR data (for underlying clinical factors that can quantify a patient's passive risk level). To improve suicide risk classification, prediction systems could combine pretrained machine learning with structured clinician assessment without needing to retrain the original model..
AB - Importance: Understanding the differences and potential synergies between traditional clinician assessment and automated machine learning might enable more accurate and useful suicide risk detection. Objective: To evaluate the respective and combined abilities of a real-time machine learning model and the Columbia Suicide Severity Rating Scale (C-SSRS) to predict suicide attempt (SA) and suicidal ideation (SI). Design, Setting, and Participants: This cohort study included encounters with adult patients (aged ≥18 years) at a major academic medical center. The C-SSRS was administered during routine care, and a Vanderbilt Suicide Attempt and Ideation Likelihood (VSAIL) prediction was generated in the electronic health record. Encounters took place in the inpatient, ambulatory surgical, and emergency department settings. Data were collected from June 2019 to September 2020. Main Outcomes and Measures: Primary outcomes were the incidence of SA and SI, encoded as International Classification of Diseases codes, occurring within various time periods after an index visit. We evaluated the retrospective validity of the C-SSRS, VSAIL, and ensemble models combining both. Discrimination metrics included area under the receiver operating curve (AUROC), area under the precision-recall curve (AUPR), sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV). Results: The cohort included 120398 unique index visits for 83394 patients (mean [SD] age, 51.2 [20.6] years; 38107 [46%] men; 45273 [54%] women; 13644 [16%] Black; 63869 [77%] White). Within 30 days of an index visit, the combined models had higher AUROC (SA: 0.874-0.887; SI: 0.869-0.879) than both the VSAIL (SA: 0.729; SI: 0.773) and C-SSRS (SA: 0.823; SI: 0.777) models. In the highest risk-decile, ensemble methods had PPV of 1.3% to 1.4% for SA and 8.3% to 8.7% for SI and sensitivity of 77.6% to 79.5% for SA and 67.4% to 70.1% for SI, outperforming VSAIL (PPV for SA: 0.4%; PPV for SI: 3.9%; sensitivity for SA: 28.8%; sensitivity for SI: 35.1%) and C-SSRS (PPV for SA: 0.5%; PPV for SI: 3.5%; sensitivity for SA: 76.6%; sensitivity for SI: 68.8%). Conclusions and Relevance: In this study, suicide risk prediction was optimal when leveraging both in-person screening (for acute measures of risk in patient-reported suicidality) and historical EHR data (for underlying clinical factors that can quantify a patient's passive risk level). To improve suicide risk classification, prediction systems could combine pretrained machine learning with structured clinician assessment without needing to retrain the original model..
UR - http://www.scopus.com/inward/record.url?scp=85130026403&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85130026403&partnerID=8YFLogxK
U2 - 10.1001/jamanetworkopen.2022.12095
DO - 10.1001/jamanetworkopen.2022.12095
M3 - Article
C2 - 35560048
AN - SCOPUS:85130026403
SN - 2574-3805
VL - 5
SP - E2212095
JO - JAMA network open
JF - JAMA network open
IS - 5
ER -