TY - JOUR
T1 - Global characterization of copy number variants in epilepsy patients from whole genome sequencing
AU - Monlong, Jean
AU - Girard, Simon L.
AU - Meloche, Caroline
AU - Cadieux-Dion, Maxime
AU - Andrade, Danielle M.
AU - Lafreniere, Ron G.
AU - Gravel, Micheline
AU - Spiegelman, Dan
AU - Dionne-Laporte, Alexandre
AU - Boelman, Cyrus
AU - Hamdan, Fadi F.
AU - Michaud, Jacques L.
AU - Rouleau, Guy
AU - Minassian, Berge A.
AU - Bourque, Guillaume
AU - Cossette, Patrick
N1 - Funding Information:
This work was supported by a grant from Genome Canada/Genome Quebec, the Canadian Foundation for Innovation (CFI-32462), the National Sciences and Engineering Research Council (NSERC-448167-2013) and a grant from the Canadian Institute for Health Research (CIHR-MOP-115090). SLG and GB are supported by the Fonds de Recherche Quebec Sante (FRQS-29493 and FRQS-25348). SLG is supported by a grant from the Reseau de Medecine Genetique Appliquee, a research network from the FRQS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. Data analyses were enabled by compute and storage resources provided by Compute Canada and Calcul Québec. We would like to thank Pascale Marquis at the Canadian Centre for Computational Genomics for processing the raw sequencing data to genomic variant calls and for her active participation in various quality assessment steps. The Canadian Centre for Computational Genomics (C3G) is a Node of the Canadian Genomic Innovation Network and is supported by the Canadian Government through Genome Canada. We are grateful to the team of the Québec Study of Newborn Twins who provided the twin dataset and the Cagekid consortium who provided the renal cancer dataset. We would like to thank Sylvia Dobrzeniecka for sample handling and lab work. We are grateful to Dr. Ledia Brunga for her work on the epileptic cohort and to Brianna Goldenstein and Claudia Moreau for revising this manuscript. Finally, we would like to thank Simon Gravel, Mathieu Blanchette, Mathieu Bourgey, Toby Dylan Hocking and Claudia Moreau for helpful discussions.
Publisher Copyright:
© 2018 Monlong et al.
PY - 2018/4
Y1 - 2018/4
N2 - Epilepsy will affect nearly 3% of people at some point during their lifetime. Previous copy number variants (CNVs) studies of epilepsy have used array-based technology and were restricted to the detection of large or exonic events. In contrast, whole-genome sequencing (WGS) has the potential to more comprehensively profile CNVs but existing analytic methods suffer from limited accuracy. We show that this is in part due to the non-uniformity of read coverage, even after intra-sample normalization. To improve on this, we developed PopSV, an algorithm that uses multiple samples to control for technical variation and enables the robust detection of CNVs. Using WGS and PopSV, we performed a comprehensive characterization of CNVs in 198 individuals affected with epilepsy and 301 controls. For both large and small variants, we found an enrichment of rare exonic events in epilepsy patients, especially in genes with predicted loss-of-function intolerance. Notably, this genome-wide survey also revealed an enrichment of rare non-coding CNVs near previously known epilepsy genes. This enrichment was strongest for non-coding CNVs located within 100 Kbp of an epilepsy gene and in regions associated with changes in the gene expression, such as expression QTLs or DNase I hypersensitive sites. Finally, we report on 21 potentially damaging events that could be associated with known or new candidate epilepsy genes. Our results suggest that comprehensive sequence-based profiling of CNVs could help explain a larger fraction of epilepsy cases.
AB - Epilepsy will affect nearly 3% of people at some point during their lifetime. Previous copy number variants (CNVs) studies of epilepsy have used array-based technology and were restricted to the detection of large or exonic events. In contrast, whole-genome sequencing (WGS) has the potential to more comprehensively profile CNVs but existing analytic methods suffer from limited accuracy. We show that this is in part due to the non-uniformity of read coverage, even after intra-sample normalization. To improve on this, we developed PopSV, an algorithm that uses multiple samples to control for technical variation and enables the robust detection of CNVs. Using WGS and PopSV, we performed a comprehensive characterization of CNVs in 198 individuals affected with epilepsy and 301 controls. For both large and small variants, we found an enrichment of rare exonic events in epilepsy patients, especially in genes with predicted loss-of-function intolerance. Notably, this genome-wide survey also revealed an enrichment of rare non-coding CNVs near previously known epilepsy genes. This enrichment was strongest for non-coding CNVs located within 100 Kbp of an epilepsy gene and in regions associated with changes in the gene expression, such as expression QTLs or DNase I hypersensitive sites. Finally, we report on 21 potentially damaging events that could be associated with known or new candidate epilepsy genes. Our results suggest that comprehensive sequence-based profiling of CNVs could help explain a larger fraction of epilepsy cases.
UR - http://www.scopus.com/inward/record.url?scp=85046430387&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85046430387&partnerID=8YFLogxK
U2 - 10.1371/journal.pgen.1007285
DO - 10.1371/journal.pgen.1007285
M3 - Article
C2 - 29649218
AN - SCOPUS:85046430387
SN - 1553-7390
VL - 14
JO - PLoS genetics
JF - PLoS genetics
IS - 4
M1 - e1007285
ER -