TY - JOUR
T1 - Confidence score
T2 - a data-driven measure for inclusive systematic reviews considering unpublished preprints
AU - Tong, Jiayi
AU - Luo, Chongliang
AU - Sun, Yifei
AU - Duan, Rui
AU - Saine, M. Elle
AU - Lin, Lifeng
AU - Peng, Yifan
AU - Lu, Yiwen
AU - Batra, Anchita
AU - Pan, Anni
AU - Wang, Olivia
AU - Li, Ruowang
AU - Marks-Anglin, Arielle
AU - Yang, Yuchen
AU - Zuo, Xu
AU - Liu, Yulun
AU - Bian, Jiang
AU - Kimmel, Stephen E.
AU - Hamilton, Keith
AU - Cuker, Adam
AU - Hubbard, Rebecca A.
AU - Xu, Hua
AU - Chen, Yong
N1 - Publisher Copyright:
© 2023 The Author(s). Published by Oxford University Press on behalf of the American Medical Informatics Association. All rights reserved.
PY - 2024/4/1
Y1 - 2024/4/1
N2 - Objectives: COVID-19, since its emergence in December 2019, has globally impacted research. Over 360 000 COVID-19-related manuscripts have been published on PubMed and preprint servers like medRxiv and bioRxiv, with preprints comprising about 15% of all manuscripts. Yet, the role and impact of preprints on COVID-19 research and evidence synthesis remain uncertain. Materials and Methods: We propose a novel data-driven method for assigning weights to individual preprints in systematic reviews and meta-analyses. This weight termed the "confidence score"is obtained using the survival cure model, also known as the survival mixture model, which takes into account the time elapsed between posting and publication of a preprint, as well as metadata such as the number of first 2-week citations, sample size, and study type. Results: Using 146 preprints on COVID-19 therapeutics posted from the beginning of the pandemic through April 30, 2021, we validated the confidence scores, showing an area under the curve of 0.95 (95% CI, 0.92-0.98). Through a use case on the effectiveness of hydroxychloroquine, we demonstrated how these scores can be incorporated practically into meta-analyses to properly weigh preprints. Discussion: It is important to note that our method does not aim to replace existing measures of study quality but rather serves as a supplementary measure that overcomes some limitations of current approaches. Conclusion: Our proposed confidence score has the potential to improve systematic reviews of evidence related to COVID-19 and other clinical conditions by providing a data-driven approach to including unpublished manuscripts.
AB - Objectives: COVID-19, since its emergence in December 2019, has globally impacted research. Over 360 000 COVID-19-related manuscripts have been published on PubMed and preprint servers like medRxiv and bioRxiv, with preprints comprising about 15% of all manuscripts. Yet, the role and impact of preprints on COVID-19 research and evidence synthesis remain uncertain. Materials and Methods: We propose a novel data-driven method for assigning weights to individual preprints in systematic reviews and meta-analyses. This weight termed the "confidence score"is obtained using the survival cure model, also known as the survival mixture model, which takes into account the time elapsed between posting and publication of a preprint, as well as metadata such as the number of first 2-week citations, sample size, and study type. Results: Using 146 preprints on COVID-19 therapeutics posted from the beginning of the pandemic through April 30, 2021, we validated the confidence scores, showing an area under the curve of 0.95 (95% CI, 0.92-0.98). Through a use case on the effectiveness of hydroxychloroquine, we demonstrated how these scores can be incorporated practically into meta-analyses to properly weigh preprints. Discussion: It is important to note that our method does not aim to replace existing measures of study quality but rather serves as a supplementary measure that overcomes some limitations of current approaches. Conclusion: Our proposed confidence score has the potential to improve systematic reviews of evidence related to COVID-19 and other clinical conditions by providing a data-driven approach to including unpublished manuscripts.
KW - data-driven modeling
KW - evidence synthesis
KW - preprint
KW - systematic review
UR - http://www.scopus.com/inward/record.url?scp=85189675131&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85189675131&partnerID=8YFLogxK
U2 - 10.1093/jamia/ocad248
DO - 10.1093/jamia/ocad248
M3 - Article
C2 - 38065694
AN - SCOPUS:85189675131
SN - 1067-5027
VL - 31
SP - 809
EP - 819
JO - Journal of the American Medical Informatics Association
JF - Journal of the American Medical Informatics Association
IS - 4
ER -