Global characterization of copy number variants in epilepsy patients from whole genome sequencing

PLoS Genet. 2018 Apr 12;14(4):e1007285. doi: 10.1371/journal.pgen.1007285. eCollection 2018 Apr.

Abstract

Epilepsy will affect nearly 3% of people at some point during their lifetime. Previous copy number variants (CNVs) studies of epilepsy have used array-based technology and were restricted to the detection of large or exonic events. In contrast, whole-genome sequencing (WGS) has the potential to more comprehensively profile CNVs but existing analytic methods suffer from limited accuracy. We show that this is in part due to the non-uniformity of read coverage, even after intra-sample normalization. To improve on this, we developed PopSV, an algorithm that uses multiple samples to control for technical variation and enables the robust detection of CNVs. Using WGS and PopSV, we performed a comprehensive characterization of CNVs in 198 individuals affected with epilepsy and 301 controls. For both large and small variants, we found an enrichment of rare exonic events in epilepsy patients, especially in genes with predicted loss-of-function intolerance. Notably, this genome-wide survey also revealed an enrichment of rare non-coding CNVs near previously known epilepsy genes. This enrichment was strongest for non-coding CNVs located within 100 Kbp of an epilepsy gene and in regions associated with changes in the gene expression, such as expression QTLs or DNase I hypersensitive sites. Finally, we report on 21 potentially damaging events that could be associated with known or new candidate epilepsy genes. Our results suggest that comprehensive sequence-based profiling of CNVs could help explain a larger fraction of epilepsy cases.

Publication types

  • Research Support, Non-U.S. Gov't
  • Validation Study

MeSH terms

  • Case-Control Studies
  • Cohort Studies
  • DNA Copy Number Variations*
  • Epilepsy / genetics*
  • Humans
  • Quantitative Trait Loci
  • Whole Genome Sequencing

Grants and funding

This work was supported by a grant from Genome Canada/Genome Quebec, the Canadian Foundation for Innovation (CFI-32462), the National Sciences and Engineering Research Council (NSERC-448167-2013) and a grant from the Canadian Institute for Health Research (CIHR-MOP-115090). SLG and GB are supported by the Fonds de Recherche Quebec Sante (FRQS-29493 and FRQS-25348). SLG is supported by a grant from the Reseau de Medecine Genetique Appliquee, a research network from the FRQS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.