Accurate determination of microsatellite allele frequencies in pooled DNA samples

Eur J Hum Genet. 2004 Nov;12(11):925-34. doi: 10.1038/sj.ejhg.5201234.

Abstract

Pooling of DNA samples instead of individual genotyping can speed up genetic association studies. However, for microsatellite markers, the electrophoretic pattern of DNA pools can be complex, and procedures for deriving allele frequencies are often confounded by PCR-induced stutter artefacts. We have developed a mathematical procedure to remove stutter noise and accurately determine allele frequencies in pools. A stutter correction model can be reliably derived from one standard 'training set' of the same 10 individual DNA samples for each marker, which can also include heterozygous patterns with partially overlapping peaks. Compared with earlier methods, this reduces the number of genotypes needed in the training set considerably, and allows standardization of analyses for different markers. Moreover, the use of a procedure that fits all data simultaneously makes the method less sensitive to aberrant data. The model was tested with 34 markers, 18 of which were newly defined from human sequence data. Allele frequencies derived from stutter-corrected DNA pool patterns were compared with the summed individual genotyping results of all the individuals in the pools (n = 109 and n = 64). We show that the model is robust and accurately extracts allele frequencies from pooled DNA samples for 32 of the 34 microsatellite markers tested. Finally, we performed a case-control study in celiac disease and found that weakly associated disease alleles, identified by individual genotyping, were only detectable in pools after stutter correction. This efficient method for correcting stutter artefacts in microsatellite markers enables large-scale genetic association studies using DNA pools to be performed.

Publication types

  • Validation Study

MeSH terms

  • Case-Control Studies
  • Celiac Disease / genetics
  • DNA / genetics*
  • Gene Frequency*
  • Heterozygote
  • Homozygote
  • Humans
  • Microsatellite Repeats*
  • Models, Genetic

Substances

  • DNA