Repeatability of diagnostic features and scoring systems for hepatocellular carcinoma by using MR imaging

Matthew S. Davenport, Shokoufeh Khalatbari, Peter S.C. Liu, Katherine E. Maturen, Ravi K. Kaza, Ashish P. Wasnik, Mahmoud M. Al-Hawary, Daniel I. Glazer, Erica B. Stein, Jeet Patel, Deepak K. Somashekar, Benjamin L. Viglianti, Hero K. Hussain

Research output: Contribution to journal › Article › peer-review

126 Scopus citations


Purpose: To determine, for expert and novice radiologists, the repeatability of major diagnostic features and scoring systems (ie, Liver Imaging Reporting and Data System [LI-RADS], Organ Procurement and Transplantation Network [OPTN], and American Association for the Study of Liver Diseases [AASLD]) for hepatocellular carcinoma (HCC) by using magnetic resonance (MR) imaging.

Materials and Methods: Institutional review board approval was obtained and patient consent was waived for this HIPAA-compliant, retrospective study. The LI-RADS discussed in this article refers to version 2013.1. Ten blinded readers reviewed 100 liver MR imaging studies that demonstrated observations preliminarily assigned LI-RADS scores of LR1-LR5. Diameter and major HCC features (arterial hyperenhancement, washout appearance, pseudocapsule) were recorded for each observation. LI-RADS, OPTN, and AASLD scores were assigned. Interreader agreement was assessed by using intraclass correlation coefficients and κ statistics. Scoring rates were compared by using the McNemar test.

Results: Overall interreader agreement was substantial for arterial hyperenhancement (0.67 [95% confidence interval {CI}: 0.65, 0.69]), moderate for washout appearance (0.48 [95% CI: 0.46, 0.50]), moderate for pseudocapsule (0.52 [95% CI: 0.50, 0.54]), fair for LI-RADS (0.35 [95% CI: 0.34, 0.37]), fair for AASLD (0.39 [95% CI: 0.37, 0.42]), and moderate for OPTN (0.53 [95% CI: 0.51, 0.56]). Agreement for measured diameter was almost perfect (range, 0.95-0.97). There was substantial agreement for most scores consistent with HCC. Experts agreed significantly more than did novices and were significantly more likely than were novices to assign a diagnosis of HCC (P < .001).

Conclusion: Two of three major features for HCC (washout appearance and pseudocapsule) have only moderate interreader agreement. Experts and novices who assigned scores consistent with HCC had substantial but not perfect agreement. Expert agreement is substantial for OPTN, but moderate for LI-RADS and AASLD. Novices were less consistent and less likely to diagnose HCC than were experts.
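The abstract reports κ statistics for ten readers and describes them with labels such as "fair", "moderate", and "substantial". As a rough illustration only, the sketch below computes a multi-reader κ and maps it to a qualitative label. It assumes Fleiss' κ and the commonly used Landis and Koch bands; the abstract names neither, although its labels are consistent with those bands. The function names `fleiss_kappa` and `agreement_label` are hypothetical, and the input data are made up.

```python
from collections import Counter


def fleiss_kappa(ratings):
    """Fleiss' kappa for a list of observations, each a list with one
    categorical score per reader (same number of readers per observation).
    Assumption: the study's exact kappa variant is not stated in the abstract."""
    n_items = len(ratings)
    n_raters = len(ratings[0])
    if any(len(row) != n_raters for row in ratings):
        raise ValueError("every observation needs the same number of readers")

    categories = sorted({score for row in ratings for score in row})
    counts = [Counter(row) for row in ratings]

    # Observed agreement: fraction of agreeing reader pairs per observation.
    per_item = [
        (sum(c[k] ** 2 for k in categories) - n_raters)
        / (n_raters * (n_raters - 1))
        for c in counts
    ]
    p_observed = sum(per_item) / n_items

    # Chance agreement from the overall share of each category.
    shares = [sum(c[k] for c in counts) / (n_items * n_raters) for k in categories]
    p_chance = sum(s * s for s in shares)

    if p_chance == 1.0:
        return float("nan")  # kappa is undefined when only one category is used
    return (p_observed - p_chance) / (1.0 - p_chance)


def agreement_label(kappa):
    """Qualitative label using the Landis and Koch bands (an assumption here)."""
    if kappa < 0:
        return "poor"
    if kappa <= 0.20:
        return "slight"
    if kappa <= 0.40:
        return "fair"
    if kappa <= 0.60:
        return "moderate"
    if kappa <= 0.80:
        return "substantial"
    return "almost perfect"


# Made-up example: 4 observations, 3 readers, scores 1 = feature present, 0 = absent.
example = [[1, 1, 1], [0, 0, 0], [1, 1, 0], [0, 0, 1]]
k = fleiss_kappa(example)
print(round(k, 3), agreement_label(k))
```

Under these assumed bands, the reported values map to the abstract's wording: 0.67 falls in "substantial", 0.48 and 0.52 in "moderate", and 0.35 and 0.39 in "fair".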

Original language: English (US)
Pages (from-to): 132-142
Number of pages: 11
Issue number: 1
State: Published - Jul 2014
Externally published: Yes

ASJC Scopus subject areas

  • Radiology, Nuclear Medicine and Imaging

