Interrater Reliability in Assessing Quality of Diagnostic Accuracy Studies Using the QUADAS Tool. A Preliminary Assessment

William Hollingworth, L. Santiago Medina, Robert E. Lenkinski, Dean K. Shibata, Byron Bernal, David Zurakowski, Bryan Comstock, Jeffrey G. Jarvik

Research output: Contribution to journal, Article, peer-review

46 Scopus citations


Rationale and Objectives: Quality Assessment of Diagnostic Accuracy Studies (QUADAS) is a new tool to measure the methodological quality of diagnostic accuracy studies in systematic reviews. We used data from a systematic review of magnetic resonance spectroscopy (MRS) in the characterization of suspected brain tumors to provide a preliminary evaluation of the inter-rater reliability of QUADAS.

Materials and Methods: A structured literature search identified 19 diagnostic accuracy studies. These publications were distributed randomly to primary and secondary reviewers for dual independent assessment. Reviewers recorded methodological quality by using QUADAS on a custom-designed spreadsheet. We calculated correlation, percentage of agreement, and κ statistic to assess inter-rater reliability.

Results: Most studies in our review were judged to have used an accurate reference standard. Conversely, the MRS literature frequently failed to specify the length of time between index and reference tests or that the clinicians were unaware of the index test findings when reporting the reference standard. There was good correlation (ρ = 0.78) between reviewers in assessment of the overall number of quality criteria met. However, mean agreement for individual QUADAS questions was only fair (κ = 0.22) and ranged from no agreement beyond chance (κ < 0) to moderate agreement (κ = 0.58).

Conclusion: Inter-rater reliability in our study was relatively low. Nevertheless, we believe that QUADAS potentially is a useful tool for highlighting the strengths and weaknesses of existing diagnostic accuracy studies. Low reliability suggests that different reviewers will reach different conclusions if QUADAS is used to exclude "low-quality" articles from meta-analyses. We discuss methods for improving the validity and reliability of QUADAS.
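The percentage of agreement and κ statistic named in the Materials and Methods can be illustrated with a minimal sketch. It assumes two reviewers scoring a single QUADAS question per study, and it assumes the unweighted Cohen's κ, since the abstract does not state which κ variant was used. The function names and the ratings below are hypothetical and are not data from the study.

```python
import math
from collections import Counter


def percent_agreement(rater_a, rater_b):
    """Fraction of studies on which both reviewers gave the same rating."""
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("ratings must be non-empty and of equal length")
    matches = sum(a == b for a, b in zip(rater_a, rater_b))
    return matches / len(rater_a)


def cohen_kappa(rater_a, rater_b):
    """Unweighted Cohen's kappa: agreement beyond what chance would produce."""
    n = len(rater_a)
    p_observed = percent_agreement(rater_a, rater_b)
    counts_a, counts_b = Counter(rater_a), Counter(rater_b)
    # Chance agreement: product of the two reviewers' marginal proportions,
    # summed over every rating category either reviewer used.
    categories = set(counts_a) | set(counts_b)
    p_expected = sum(counts_a[c] * counts_b[c] for c in categories) / (n * n)
    if p_expected == 1.0:
        # Both reviewers used one identical category throughout; kappa is undefined.
        return math.nan
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical ratings for one QUADAS question across ten studies.
reviewer_1 = ["yes", "yes", "no", "yes", "unclear", "yes", "no", "yes", "yes", "no"]
reviewer_2 = ["yes", "no", "no", "yes", "yes", "yes", "unclear", "yes", "no", "no"]

print("percent agreement:", percent_agreement(reviewer_1, reviewer_2))
print("Cohen's kappa:", round(cohen_kappa(reviewer_1, reviewer_2), 3))
```

Because κ subtracts the agreement expected by chance, it can be low, or even negative, when raw percentage agreement looks acceptable, which is consistent with the range of κ values reported in the Results.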

Original language: English (US)
Pages (from-to): 803-810
Number of pages: 8
Journal: Academic Radiology
Issue number: 7
State: Published - Jul 2006


Keywords

  • Sensitivity and specificity
  • evidence-based medicine
  • methods
  • radiology
  • review, systematic

ASJC Scopus subject areas

  • Radiology, Nuclear Medicine and Imaging

