Fine-mapping of 150 breast cancer risk regions identifies 191 likely target genes

Nat Genet. 2020 Jan;52(1):56-73. doi: 10.1038/s41588-019-0537-1. Epub 2020 Jan 7.

Abstract

Genome-wide association studies have identified breast cancer risk variants in over 150 genomic regions, but the mechanisms underlying risk remain largely unknown. These regions were explored by combining association analysis with in silico genomic feature annotations. We defined 205 independent risk-associated signals with the set of credible causal variants in each one. In parallel, we used a Bayesian approach (PAINTOR) that combines genetic association, linkage disequilibrium and enriched genomic features to determine variants with high posterior probabilities of being causal. Potentially causal variants were significantly over-represented in active gene regulatory regions and transcription factor binding sites. We applied our INQUSIT pipeline for prioritizing genes as targets of those potentially causal variants, using gene expression (expression quantitative trait loci), chromatin interaction and functional annotations. Known cancer drivers, transcription factors and genes in the developmental, apoptosis, immune system and DNA integrity checkpoint gene ontology pathways were over-represented among the highest-confidence target genes.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Bayes Theorem
  • Biomarkers, Tumor / genetics*
  • Breast Neoplasms / genetics*
  • Chromosome Mapping / methods*
  • Female
  • Genetic Predisposition to Disease*
  • Genome-Wide Association Study*
  • Humans
  • Linkage Disequilibrium
  • Polymorphism, Single Nucleotide*
  • Quantitative Trait Loci*
  • Regulatory Sequences, Nucleic Acid
  • Risk Factors

Substances

  • Biomarkers, Tumor