Genome-Wide Analyses of Vocabulary Size in Infancy and Toddlerhood: Associations With Attention-Deficit/Hyperactivity Disorder, Literacy, and Cognition-Related Traits

Biol Psychiatry. 2024 May 1;95(9):859-869. doi: 10.1016/j.biopsych.2023.11.025. Epub 2023 Dec 7.

Abstract

Background: The number of words children produce (expressive vocabulary) and understand (receptive vocabulary) changes rapidly during early development, partially due to genetic factors. Here, we performed a meta-genome-wide association study of vocabulary acquisition and investigated polygenic overlap with literacy, cognition, developmental phenotypes, and neurodevelopmental conditions, including attention-deficit/hyperactivity disorder (ADHD).

Methods: We studied 37,913 parent-reported vocabulary size measures (English, Dutch, Danish) for 17,298 children of European descent. Meta-analyses were performed for early-phase expressive (infancy, 15-18 months), late-phase expressive (toddlerhood, 24-38 months), and late-phase receptive (toddlerhood, 24-38 months) vocabulary. Subsequently, we estimated single nucleotide polymorphism-based heritability (SNP-h2) and genetic correlations (rg) and modeled underlying factor structures with multivariate models.

Results: Early-life vocabulary size was modestly heritable (SNP-h2 = 0.08-0.24). Genetic overlap between infant expressive and toddler receptive vocabulary was negligible (rg = 0.07), although each measure was moderately related to toddler expressive vocabulary (rg = 0.69 and rg = 0.67, respectively), suggesting a multifactorial genetic architecture. Both infant and toddler expressive vocabulary were genetically linked to literacy (e.g., spelling: rg = 0.58 and rg = 0.79, respectively), underlining genetic similarity. However, a genetic association of early-life vocabulary with educational attainment and intelligence emerged only during toddlerhood (e.g., receptive vocabulary and intelligence: rg = 0.36). Increased ADHD risk was genetically associated with larger infant expressive vocabulary (rg = 0.23). Multivariate genetic models in the ALSPAC (Avon Longitudinal Study of Parents and Children) cohort confirmed this finding for ADHD symptoms (e.g., at age 13; rg = 0.54) but showed that the association effect reversed for toddler receptive vocabulary (rg = -0.74), highlighting developmental heterogeneity.

Conclusions: The genetic architecture of early-life vocabulary changes during development, shaping polygenic association patterns with later-life ADHD, literacy, and cognition-related traits.

Keywords: ADHD; Cognition; Development; GWAS; SEM; Vocabulary.

Publication types

  • Meta-Analysis

MeSH terms

  • Adolescent
  • Attention Deficit Disorder with Hyperactivity* / diagnosis
  • Attention Deficit Disorder with Hyperactivity* / genetics
  • Cognition
  • Genome-Wide Association Study
  • Humans
  • Infant
  • Literacy*
  • Longitudinal Studies
  • Phenotype
  • Vocabulary