Prevalence and architecture of de novo mutations in developmental disorders

Nature. 2017 Feb 23;542(7642):433-438. doi: 10.1038/nature21062. Epub 2017 Jan 25.

Abstract

The genomes of individuals with severe, undiagnosed developmental disorders are enriched in damaging de novo mutations (DNMs) in developmentally important genes. Here we have sequenced the exomes of 4,293 families containing individuals with developmental disorders, and meta-analysed these data with data from another 3,287 individuals with similar disorders. We show that the most important factors influencing the diagnostic yield of DNMs are the sex of the affected individual, the relatedness of their parents, whether close relatives are affected and the parental ages. We identified 94 genes enriched in damaging DNMs, including 14 that previously lacked compelling evidence of involvement in developmental disorders. We have also characterized the phenotypic diversity among these disorders. We estimate that 42% of our cohort carry pathogenic DNMs in coding sequences; approximately half of these DNMs disrupt gene function and the remainder result in altered protein function. We estimate that developmental disorders caused by DNMs have an average prevalence of 1 in 213 to 1 in 448 births, depending on parental age. Given current global demographics, this equates to almost 400,000 children born per year.

Publication types

  • Meta-Analysis
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adolescent
  • Adult
  • Autoantigens / genetics
  • CDC2 Protein Kinase / genetics
  • Casein Kinase II / genetics
  • Child
  • Chromosomal Proteins, Non-Histone
  • Cohort Studies
  • DEAD-box RNA Helicases / genetics
  • DNA-Binding Proteins / genetics
  • Developmental Disabilities / genetics*
  • Exome / genetics
  • Female
  • Heredity / genetics
  • Histone-Lysine N-Methyltransferase / genetics
  • Homeodomain Proteins / genetics
  • Humans
  • Male
  • Mi-2 Nucleosome Remodeling and Deacetylase Complex / genetics
  • Middle Aged
  • Mutation / genetics*
  • Myeloid-Lymphoid Leukemia Protein / genetics
  • Nerve Tissue Proteins / genetics
  • Parents
  • Phenotype
  • Prevalence
  • Protein Phosphatase 2C / genetics
  • Repressor Proteins / genetics
  • Sequence Analysis, DNA
  • Sex Characteristics
  • Transcription Factors / genetics
  • Young Adult
  • ras GTPase-Activating Proteins / genetics

Substances

  • ADNP protein, human
  • ANKRD11 protein, human
  • ARID1B protein, human
  • Autoantigens
  • CHD4 protein, human
  • Chromosomal Proteins, Non-Histone
  • DNA-Binding Proteins
  • Homeodomain Proteins
  • KMT2A protein, human
  • MSL3 protein, human
  • Nerve Tissue Proteins
  • Repressor Proteins
  • SYNGAP1 protein, human
  • TCF20 protein, human
  • Transcription Factors
  • ZBTB18 protein, human
  • ras GTPase-Activating Proteins
  • Myeloid-Lymphoid Leukemia Protein
  • Histone-Lysine N-Methyltransferase
  • CSNK2A1 protein, human
  • Casein Kinase II
  • CDC2 Protein Kinase
  • CDK13 protein, human
  • PPM1D protein, human
  • Protein Phosphatase 2C
  • Mi-2 Nucleosome Remodeling and Deacetylase Complex
  • DDX3X protein, human
  • DEAD-box RNA Helicases