Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients with Hepatitis C Cirrhosis

George N. Ioannou; Weijing Tang; Lauren A. Beste; Monica A. Tincopa; Grace L. Su; Tony Van; Elliot B. Tapper; Amit G. Singal; Ji Zhu; Akbar K. Waljee

doi:10.1001/jamanetworkopen.2020.15626

Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients with Hepatitis C Cirrhosis

George N. Ioannou, Weijing Tang, Lauren A. Beste, Monica A. Tincopa, Grace L. Su, Tony Van, Elliot B. Tapper, Amit G. Singal, Ji Zhu, Akbar K. Waljee

Research output: Contribution to journal › Article › peer-review

61 Scopus citations

Abstract

Importance: Deep learning, a family of machine learning models that use artificial neural networks, has achieved great success at predicting outcomes in nonmedical domains. Objective: To examine whether deep learning recurrent neural network (RNN) models that use raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC). Design, Setting, and Participants: This prognostic study included 48151 patients with hepatitis C virus (HCV)-related cirrhosis in the national Veterans Health Administration who had at least 3 years of follow-up after the diagnosis of cirrhosis. Patients were identified by having at least 1 positive HCV RNA test between January 1, 2000, to January 1, 2016, and were followed up from the diagnosis of cirrhosis to January 1, 2019, for the development of incident HCC. A total of 3 models predicting HCC during a 3-year period were developed and compared, as follows: (1) logistic regression (LR) with cross-sectional inputs (cross-sectional LR); (2) LR with longitudinal inputs (longitudinal LR); and (3) RNN with longitudinal inputs. Data analysis was conducted from April 2018 to August 2020. Exposures: Development of HCC. Main Outcomes and Measures: Area under the receiver operating characteristic curve, area under the precision-recall curve, and Brier score. Results: During a mean (SD) follow-up of 11.6 (5.0) years, 10741 of 48151 patients (22.3%) developed HCC (annual incidence, 3.1%), and a total of 52983 samples (51948 [98.0%] from men) were collected. Patients who developed HCC within 3 years were older than patients who did not (mean [SD] age, 58.2 [6.6] years vs 56.9 [6.9] years). RNN models had superior mean (SD) area under the receiver operating characteristic curve (0.759 [0.009]) and mean (SD) Brier score (0.136 [0.003]) than cross-sectional LR (0.689 [0.009] and 0.149 [0.003], respectively) and longitudinal LR (0.682 [0.007] and 0.150 [0.003], respectively) models. Using the RNN model, the samples with the mean (SD) highest 51% (1.5%) of HCC risk, in which 80% of all HCCs occurred, or the mean (SD) highest 66% (1.2%) of HCC risk, in which 90% of all HCCs occurred, could potentially be targeted. Among samples from patients who achieved sustained virologic response, the performance of the RNN models was even better (mean [SD] area under receiver operating characteristic curve, 0.806 [0.025]; mean [SD] Brier score, 0.117 [0.007]). Conclusions and Relevance: In this study, deep learning RNN models outperformed conventional LR models, suggesting that RNN models could be used to identify patients with HCV-related cirrhosis with a high risk of developing HCC for risk-based HCC outreach and surveillance strategies..

Original language	English (US)
Article number	e2015626
Journal	JAMA Network Open
Volume	3
Issue number	9
DOIs	https://doi.org/10.1001/jamanetworkopen.2020.15626
State	Published - Sep 1 2020
Externally published	Yes

ASJC Scopus subject areas

General Medicine

Access to Document

10.1001/jamanetworkopen.2020.15626

Cite this

@article{6db6a799030a4f81a1b4b1c8645aaea4,

title = "Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients with Hepatitis C Cirrhosis",

abstract = "Importance: Deep learning, a family of machine learning models that use artificial neural networks, has achieved great success at predicting outcomes in nonmedical domains. Objective: To examine whether deep learning recurrent neural network (RNN) models that use raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC). Design, Setting, and Participants: This prognostic study included 48151 patients with hepatitis C virus (HCV)-related cirrhosis in the national Veterans Health Administration who had at least 3 years of follow-up after the diagnosis of cirrhosis. Patients were identified by having at least 1 positive HCV RNA test between January 1, 2000, to January 1, 2016, and were followed up from the diagnosis of cirrhosis to January 1, 2019, for the development of incident HCC. A total of 3 models predicting HCC during a 3-year period were developed and compared, as follows: (1) logistic regression (LR) with cross-sectional inputs (cross-sectional LR); (2) LR with longitudinal inputs (longitudinal LR); and (3) RNN with longitudinal inputs. Data analysis was conducted from April 2018 to August 2020. Exposures: Development of HCC. Main Outcomes and Measures: Area under the receiver operating characteristic curve, area under the precision-recall curve, and Brier score. Results: During a mean (SD) follow-up of 11.6 (5.0) years, 10741 of 48151 patients (22.3%) developed HCC (annual incidence, 3.1%), and a total of 52983 samples (51948 [98.0%] from men) were collected. Patients who developed HCC within 3 years were older than patients who did not (mean [SD] age, 58.2 [6.6] years vs 56.9 [6.9] years). RNN models had superior mean (SD) area under the receiver operating characteristic curve (0.759 [0.009]) and mean (SD) Brier score (0.136 [0.003]) than cross-sectional LR (0.689 [0.009] and 0.149 [0.003], respectively) and longitudinal LR (0.682 [0.007] and 0.150 [0.003], respectively) models. Using the RNN model, the samples with the mean (SD) highest 51% (1.5%) of HCC risk, in which 80% of all HCCs occurred, or the mean (SD) highest 66% (1.2%) of HCC risk, in which 90% of all HCCs occurred, could potentially be targeted. Among samples from patients who achieved sustained virologic response, the performance of the RNN models was even better (mean [SD] area under receiver operating characteristic curve, 0.806 [0.025]; mean [SD] Brier score, 0.117 [0.007]). Conclusions and Relevance: In this study, deep learning RNN models outperformed conventional LR models, suggesting that RNN models could be used to identify patients with HCV-related cirrhosis with a high risk of developing HCC for risk-based HCC outreach and surveillance strategies..",

author = "Ioannou, {George N.} and Weijing Tang and Beste, {Lauren A.} and Tincopa, {Monica A.} and Su, {Grace L.} and Tony Van and Tapper, {Elliot B.} and Singal, {Amit G.} and Ji Zhu and Waljee, {Akbar K.}",

year = "2020",

month = sep,

day = "1",

doi = "10.1001/jamanetworkopen.2020.15626",

language = "English (US)",

volume = "3",

journal = "JAMA Network Open",

issn = "2574-3805",

publisher = "American Medical Association",

number = "9",

}

TY - JOUR

T1 - Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients with Hepatitis C Cirrhosis

AU - Ioannou, George N.

AU - Tang, Weijing

AU - Beste, Lauren A.

AU - Tincopa, Monica A.

AU - Su, Grace L.

AU - Van, Tony

AU - Tapper, Elliot B.

AU - Singal, Amit G.

AU - Zhu, Ji

AU - Waljee, Akbar K.

PY - 2020/9/1

Y1 - 2020/9/1

N2 - Importance: Deep learning, a family of machine learning models that use artificial neural networks, has achieved great success at predicting outcomes in nonmedical domains. Objective: To examine whether deep learning recurrent neural network (RNN) models that use raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC). Design, Setting, and Participants: This prognostic study included 48151 patients with hepatitis C virus (HCV)-related cirrhosis in the national Veterans Health Administration who had at least 3 years of follow-up after the diagnosis of cirrhosis. Patients were identified by having at least 1 positive HCV RNA test between January 1, 2000, to January 1, 2016, and were followed up from the diagnosis of cirrhosis to January 1, 2019, for the development of incident HCC. A total of 3 models predicting HCC during a 3-year period were developed and compared, as follows: (1) logistic regression (LR) with cross-sectional inputs (cross-sectional LR); (2) LR with longitudinal inputs (longitudinal LR); and (3) RNN with longitudinal inputs. Data analysis was conducted from April 2018 to August 2020. Exposures: Development of HCC. Main Outcomes and Measures: Area under the receiver operating characteristic curve, area under the precision-recall curve, and Brier score. Results: During a mean (SD) follow-up of 11.6 (5.0) years, 10741 of 48151 patients (22.3%) developed HCC (annual incidence, 3.1%), and a total of 52983 samples (51948 [98.0%] from men) were collected. Patients who developed HCC within 3 years were older than patients who did not (mean [SD] age, 58.2 [6.6] years vs 56.9 [6.9] years). RNN models had superior mean (SD) area under the receiver operating characteristic curve (0.759 [0.009]) and mean (SD) Brier score (0.136 [0.003]) than cross-sectional LR (0.689 [0.009] and 0.149 [0.003], respectively) and longitudinal LR (0.682 [0.007] and 0.150 [0.003], respectively) models. Using the RNN model, the samples with the mean (SD) highest 51% (1.5%) of HCC risk, in which 80% of all HCCs occurred, or the mean (SD) highest 66% (1.2%) of HCC risk, in which 90% of all HCCs occurred, could potentially be targeted. Among samples from patients who achieved sustained virologic response, the performance of the RNN models was even better (mean [SD] area under receiver operating characteristic curve, 0.806 [0.025]; mean [SD] Brier score, 0.117 [0.007]). Conclusions and Relevance: In this study, deep learning RNN models outperformed conventional LR models, suggesting that RNN models could be used to identify patients with HCV-related cirrhosis with a high risk of developing HCC for risk-based HCC outreach and surveillance strategies..

AB - Importance: Deep learning, a family of machine learning models that use artificial neural networks, has achieved great success at predicting outcomes in nonmedical domains. Objective: To examine whether deep learning recurrent neural network (RNN) models that use raw longitudinal data extracted directly from electronic health records outperform conventional regression models in predicting the risk of developing hepatocellular carcinoma (HCC). Design, Setting, and Participants: This prognostic study included 48151 patients with hepatitis C virus (HCV)-related cirrhosis in the national Veterans Health Administration who had at least 3 years of follow-up after the diagnosis of cirrhosis. Patients were identified by having at least 1 positive HCV RNA test between January 1, 2000, to January 1, 2016, and were followed up from the diagnosis of cirrhosis to January 1, 2019, for the development of incident HCC. A total of 3 models predicting HCC during a 3-year period were developed and compared, as follows: (1) logistic regression (LR) with cross-sectional inputs (cross-sectional LR); (2) LR with longitudinal inputs (longitudinal LR); and (3) RNN with longitudinal inputs. Data analysis was conducted from April 2018 to August 2020. Exposures: Development of HCC. Main Outcomes and Measures: Area under the receiver operating characteristic curve, area under the precision-recall curve, and Brier score. Results: During a mean (SD) follow-up of 11.6 (5.0) years, 10741 of 48151 patients (22.3%) developed HCC (annual incidence, 3.1%), and a total of 52983 samples (51948 [98.0%] from men) were collected. Patients who developed HCC within 3 years were older than patients who did not (mean [SD] age, 58.2 [6.6] years vs 56.9 [6.9] years). RNN models had superior mean (SD) area under the receiver operating characteristic curve (0.759 [0.009]) and mean (SD) Brier score (0.136 [0.003]) than cross-sectional LR (0.689 [0.009] and 0.149 [0.003], respectively) and longitudinal LR (0.682 [0.007] and 0.150 [0.003], respectively) models. Using the RNN model, the samples with the mean (SD) highest 51% (1.5%) of HCC risk, in which 80% of all HCCs occurred, or the mean (SD) highest 66% (1.2%) of HCC risk, in which 90% of all HCCs occurred, could potentially be targeted. Among samples from patients who achieved sustained virologic response, the performance of the RNN models was even better (mean [SD] area under receiver operating characteristic curve, 0.806 [0.025]; mean [SD] Brier score, 0.117 [0.007]). Conclusions and Relevance: In this study, deep learning RNN models outperformed conventional LR models, suggesting that RNN models could be used to identify patients with HCV-related cirrhosis with a high risk of developing HCC for risk-based HCC outreach and surveillance strategies..

UR - http://www.scopus.com/inward/record.url?scp=85090180783&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85090180783&partnerID=8YFLogxK

U2 - 10.1001/jamanetworkopen.2020.15626

DO - 10.1001/jamanetworkopen.2020.15626

M3 - Article

C2 - 32870314

AN - SCOPUS:85090180783

SN - 2574-3805

VL - 3

JO - JAMA Network Open

JF - JAMA Network Open

IS - 9

M1 - e2015626

ER -

Assessment of a Deep Learning Model to Predict Hepatocellular Carcinoma in Patients with Hepatitis C Cirrhosis

Abstract

ASJC Scopus subject areas

Access to Document

Other files and links

Fingerprint

Cite this