Incorporating Functional Information in Tests of Excess De Novo Mutational Load

Am J Hum Genet. 2015 Aug 6;97(2):272-83. doi: 10.1016/j.ajhg.2015.06.013. Epub 2015 Jul 30.

Abstract

A number of recent studies have investigated the role of de novo mutations in various neurodevelopmental and neuropsychiatric disorders. These studies attempt to implicate causal genes by looking for an excess load of de novo mutations within those genes. Current statistical methods for assessing this excess are based on the implicit assumption that all qualifying mutations in a gene contribute equally to disease. However, it is well established that different mutations can have radically different effects on the ultimate protein product and, as a result, on disease risk. Here, we propose a method, fitDNM, that incorporates functional information in a test of excess de novo mutational load. Specifically, we derive score statistics from a retrospective likelihood that incorporates the probability of a mutation being damaging to gene function. We show that, under the null, the resulting test statistic is distributed as a weighted sum of Poisson random variables and we implement a saddlepoint approximation of this distribution to obtain accurate p values. Using simulation, we have shown that our method outperforms current methods in terms of statistical power while maintaining validity. We have applied this approach to four de novo mutation datasets of neurodevelopmental and neuropsychiatric disorders: autism spectrum disorder, epileptic encephalopathy, schizophrenia, and severe intellectual disability. Our approach also implicates genes that have been implicated by existing methods. Furthermore, our approach provides strong statistical evidence supporting two potentially causal genes: SUV420H1 in autism spectrum disorder and TRIO in a combined analysis of the four neurodevelopmental and neuropsychiatric disorders investigated here.

Publication types

  • Research Support, N.I.H., Extramural

MeSH terms

  • Computer Simulation
  • Genetic Diseases, Inborn / genetics*
  • Guanine Nucleotide Exchange Factors / genetics
  • Histone-Lysine N-Methyltransferase / genetics
  • Humans
  • Likelihood Functions
  • Models, Genetic*
  • Mutation / genetics*
  • Nervous System Diseases / genetics*
  • Poisson Distribution
  • Protein Serine-Threonine Kinases / genetics
  • Retrospective Studies

Substances

  • Guanine Nucleotide Exchange Factors
  • Histone-Lysine N-Methyltransferase
  • KMT5C protein, human
  • Protein Serine-Threonine Kinases
  • TRIO protein, human