Confidence score: a data-driven measure for inclusive systematic reviews considering unpublished preprints

Jiayi Tong, Chongliang Luo, Yifei Sun, Rui Duan, M. Elle Saine, Lifeng Lin, Yifan Peng, Yiwen Lu, Anchita Batra, Anni Pan, Olivia Wang, Ruowang Li, Arielle Marks-Anglin, Yuchen Yang, Xu Zuo, Yulun Liu, Jiang Bian, Stephen E. Kimmel, Keith Hamilton, Adam CukerRebecca A. Hubbard, Hua Xu, Yong Chen

Research output: Contribution to journalArticlepeer-review

Abstract

Objectives: COVID-19, since its emergence in December 2019, has globally impacted research. Over 360 000 COVID-19-related manuscripts have been published on PubMed and preprint servers like medRxiv and bioRxiv, with preprints comprising about 15% of all manuscripts. Yet, the role and impact of preprints on COVID-19 research and evidence synthesis remain uncertain. Materials and Methods: We propose a novel data-driven method for assigning weights to individual preprints in systematic reviews and meta-analyses. This weight termed the "confidence score"is obtained using the survival cure model, also known as the survival mixture model, which takes into account the time elapsed between posting and publication of a preprint, as well as metadata such as the number of first 2-week citations, sample size, and study type. Results: Using 146 preprints on COVID-19 therapeutics posted from the beginning of the pandemic through April 30, 2021, we validated the confidence scores, showing an area under the curve of 0.95 (95% CI, 0.92-0.98). Through a use case on the effectiveness of hydroxychloroquine, we demonstrated how these scores can be incorporated practically into meta-analyses to properly weigh preprints. Discussion: It is important to note that our method does not aim to replace existing measures of study quality but rather serves as a supplementary measure that overcomes some limitations of current approaches. Conclusion: Our proposed confidence score has the potential to improve systematic reviews of evidence related to COVID-19 and other clinical conditions by providing a data-driven approach to including unpublished manuscripts.

Original languageEnglish (US)
Pages (from-to)809-819
Number of pages11
JournalJournal of the American Medical Informatics Association
Volume31
Issue number4
DOIs
StatePublished - Apr 1 2024

Keywords

  • data-driven modeling
  • evidence synthesis
  • preprint
  • systematic review

ASJC Scopus subject areas

  • Health Informatics

Fingerprint

Dive into the research topics of 'Confidence score: a data-driven measure for inclusive systematic reviews considering unpublished preprints'. Together they form a unique fingerprint.

Cite this