Inter- and intra-breed genome-wide copy number diversity in a large cohort of European equine breeds

BMC Genomics. 2019 Oct 22;20(1):759. doi: 10.1186/s12864-019-6141-z.

Abstract

Background: Copy Number Variation (CNV) is a common form of genetic variation underlying animal evolution and phenotypic diversity across a wide range of species. In the mammalian genome, high frequency of CNV differentiation between breeds may be candidates for population-specific selection. However, CNV differentiation, selection and its population genetics have been poorly explored in horses.

Results: We investigated the patterns, population variation and gene annotation of CNV using the Axiom® Equine Genotyping Array (670,796 SNPs) from a large cohort of individuals (N = 1755) belonging to eight European horse breeds, varying from draught horses to several warmblood populations. After quality control, 152,640 SNP CNVs (individual markers), 18,800 segment CNVs (consecutive SNP CNVs of same gain/loss state or both) and 939 CNV regions (CNVRs; overlapping segment CNVs by at least 1 bp) compared to the average signal of the reference (Belgian draught horse) were identified. Our analyses showed that Equus caballus chromosome 12 (ECA12) was the most enriched in segment CNV gains and losses (~ 3% average proportion of the genome covered), but the highest number of segment CNVs were detected on ECA1 and ECA20 (regardless of size). The Friesian horses showed private SNP CNV gains (> 20% of the samples) on ECA1 and Exmoor ponies displayed private SNP CNV losses on ECA25 (> 20% of the samples). The Warmblood cluster showed private SNP CNV gains located in ECA9 and Draught cluster showed private SNP CNV losses located in ECA7. The length of the CNVRs ranged from 1 kb to 21.3 Mb. A total of 10,612 genes were annotated within the CNVRs. The PANTHER annotation of these genes showed significantly under- and overrepresented gene ontology biological terms related to cellular processes and immunity (Bonferroni P-value < 0.05). We identified 80 CNVRs overlapping with known QTL for fertility, coat colour, conformation and temperament. We also report 67 novel CNVRs.

Conclusions: This work revealed that CNV patterns, in the genome of some European horse breeds, occurred in specific genomic regions. The results provide support to the hypothesis that high frequency private CNVs residing in genes may potentially be responsible for the diverse phenotypes seen between horse breeds.

Keywords: Copy number variation; Horse; SNP genotyping array; Structural variation.

MeSH terms

  • Animals
  • Breeding
  • Comparative Genomic Hybridization
  • DNA Copy Number Variations / genetics*
  • Europe
  • Evolution, Molecular
  • Genetic Variation*
  • Genetics, Population
  • Genome / genetics*
  • Genotype
  • Horses / genetics*
  • Phenotype
  • Selection, Genetic