TY - CHAP
T1 - Data Sharing and Reuse
T2 - A Method by the AIRR Community
AU - on behalf of the AIRR Community
AU - Corrie, Brian D.
AU - Christley, Scott
AU - Busse, Christian E.
AU - Cowell, Lindsay G.
AU - Neller, Kira C.M.
AU - Rubelt, Florian
AU - Schwab, Nicholas
N1 - Funding Information:
We would like to thank our colleagues from the AIRR Community, who have dedicated many hours to the development of the community and the standards and initiatives on which this chapter is based. In particular, we would like to thank the authors of the other AIRR Community chapters in this volume, with a special thanks to Susanna Marquez, William Lees, and Ulrik Stervbo who assisted with content and editing of this chapter.
Publisher Copyright:
© 2022, The Author(s).
PY - 2022
Y1 - 2022
N2 - High-throughput sequencing of adaptive immune receptor repertoires (AIRR, i.e., IG and TR) has revolutionized the ability to study the adaptive immune response via large-scale experiments. Since 2009, AIRR sequencing (AIRR-seq) has been widely applied to survey the immune state of individuals (see “The AIRR Community Guide to Repertoire Analysis” chapter for details). One of the goals of the AIRR Community is to make the resulting AIRR-seq data FAIR (Findable, Accessible, Interoperable, and Reusable) (Wilkinson et al. Sci Data 3:1–9, 2016), with a primary goal of making it easy for the research community to reuse AIRR-seq data (Breden et al. Front Immunol 8:1418, 2017; Scott and Breden. Curr Opin Syst Biol 24:71–77, 2020). The basis for this is the MiAIRR data standard (Rubelt et al. Nat Immunol 18:1274–1278, 2017). For long-term preservation, it is recommended that researchers store their sequence read data in an INSDC repository. At the same time, the AIRR Community has established the AIRR Data Commons (Christley et al. Front Big Data 3:22, 2020), a distributed set of AIRR-compliant repositories that store the critically important annotated AIRR-seq data based on the MiAIRR standard, making the data findable, interoperable, and, because the data are annotated, more valuable in its reuse. Here, we build on the other AIRR Community chapters and illustrate how these principles and standards can be incorporated into AIRR-seq data analysis workflows. We discuss the importance of careful curation of metadata to ensure reproducibility and facilitate data sharing and reuse, and we illustrate how data can be shared via the AIRR Data Commons.
AB - High-throughput sequencing of adaptive immune receptor repertoires (AIRR, i.e., IG and TR) has revolutionized the ability to study the adaptive immune response via large-scale experiments. Since 2009, AIRR sequencing (AIRR-seq) has been widely applied to survey the immune state of individuals (see “The AIRR Community Guide to Repertoire Analysis” chapter for details). One of the goals of the AIRR Community is to make the resulting AIRR-seq data FAIR (Findable, Accessible, Interoperable, and Reusable) (Wilkinson et al. Sci Data 3:1–9, 2016), with a primary goal of making it easy for the research community to reuse AIRR-seq data (Breden et al. Front Immunol 8:1418, 2017; Scott and Breden. Curr Opin Syst Biol 24:71–77, 2020). The basis for this is the MiAIRR data standard (Rubelt et al. Nat Immunol 18:1274–1278, 2017). For long-term preservation, it is recommended that researchers store their sequence read data in an INSDC repository. At the same time, the AIRR Community has established the AIRR Data Commons (Christley et al. Front Big Data 3:22, 2020), a distributed set of AIRR-compliant repositories that store the critically important annotated AIRR-seq data based on the MiAIRR standard, making the data findable, interoperable, and, because the data are annotated, more valuable in its reuse. Here, we build on the other AIRR Community chapters and illustrate how these principles and standards can be incorporated into AIRR-seq data analysis workflows. We discuss the importance of careful curation of metadata to ensure reproducibility and facilitate data sharing and reuse, and we illustrate how data can be shared via the AIRR Data Commons.
KW - AIRR-seq
KW - B-cell receptor
KW - Data reuse
KW - Data sharing
KW - FAIR data
KW - Immunoglobulin
KW - T-cell receptor
UR - http://www.scopus.com/inward/record.url?scp=85131108726&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85131108726&partnerID=8YFLogxK
U2 - 10.1007/978-1-0716-2115-8_23
DO - 10.1007/978-1-0716-2115-8_23
M3 - Chapter
C2 - 35622339
AN - SCOPUS:85131108726
T3 - Methods in Molecular Biology
SP - 447
EP - 476
BT - Methods in Molecular Biology
PB - Humana Press Inc.
ER -