Assessing the fidelity of ancient DNA sequences amplified from nuclear genes

Genetics. 2006 Feb;172(2):733-41. doi: 10.1534/genetics.105.049718. Epub 2005 Nov 19.

Abstract

To date, the field of ancient DNA has relied almost exclusively on mitochondrial DNA (mtDNA) sequences. However, a number of recent studies have reported the successful recovery of ancient nuclear DNA (nuDNA) sequences, thereby allowing the characterization of genetic loci directly involved in phenotypic traits of extinct taxa. It is well documented that postmortem damage in ancient mtDNA can lead to the generation of artifactual sequences. However, as yet no one has thoroughly investigated the damage spectrum in ancient nuDNA. By comparing clone sequences from 23 fossil specimens, recovered from environments ranging from permafrost to desert, we demonstrate the presence of miscoding lesion damage in both the mtDNA and nuDNA, resulting in insertion of erroneous bases during amplification. Interestingly, no significant differences in the frequency of miscoding lesion damage are recorded between mtDNA and nuDNA despite great differences in cellular copy numbers. For both mtDNA and nuDNA, we find significant positive correlations between total sequence heterogeneity and the rates of type 1 transitions (adenine → guanine and thymine → cytosine) and type 2 transitions (cytosine → thymine and guanine → adenine), respectively. Type 2 transitions are by far the most dominant and increase relative to those of type 1 with damage load. The results suggest that the deamination of cytosine (and 5-methyl cytosine) to uracil (and thymine) is the main cause of miscoding lesions in both ancient mtDNA and nuDNA sequences. We argue that the problems presented by postmortem damage, as well as problems with contamination from exogenous sources of conserved nuclear genes, allelic variation, and the reliance on single nucleotide polymorphisms, call for great caution in studies relying on ancient nuDNA sequences.
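The type 1 / type 2 classification described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline, which the abstract does not describe. It assumes the clone sequences are already aligned and of equal length, and, as a further assumption, it uses the per-column majority base across clones as the reference when none is supplied. The function names `consensus` and `classify_substitutions` and the example clones are hypothetical.

```python
from collections import Counter

# Transition classes as defined in the abstract (reference base, clone base).
TYPE1 = {("A", "G"), ("T", "C")}
TYPE2 = {("C", "T"), ("G", "A")}


def consensus(clones):
    """Majority base at each column of equal-length, pre-aligned clones.

    Assumption: the majority base stands in for the undamaged sequence.
    """
    return "".join(Counter(col).most_common(1)[0][0] for col in zip(*clones))


def classify_substitutions(clones, reference=None):
    """Count type 1 transitions, type 2 transitions, and transversions."""
    ref = reference or consensus(clones)
    counts = Counter()
    for clone in clones:
        for r, c in zip(ref, clone):
            # Skip matches and any non-ACGT symbol (gaps, N, ambiguity codes).
            if r == c or r not in "ACGT" or c not in "ACGT":
                continue
            if (r, c) in TYPE1:
                counts["type1"] += 1
            elif (r, c) in TYPE2:
                counts["type2"] += 1
            else:
                counts["transversion"] += 1
    return counts


# Made-up clones for illustration only; not data from the study.
clones = ["ACGTACGT", "ATGTACGT", "ACATACGT", "ACGTACGC", "ACGTACGT"]
counts = classify_substitutions(clones)
print(counts["type1"], counts["type2"], counts["transversion"])  # prints: 1 2 0
```

Each class pairs a substitution with its complement (for example, cytosine → thymine with guanine → adenine) because a lesion on one strand appears as the complementary change when the opposite strand is sequenced. Dividing the counts by the total number of compared bases would give per-site rates of the kind that could be correlated with overall sequence heterogeneity.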

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Animals
  • Cell Nucleus / genetics*
  • DNA, Mitochondrial / chemistry*
  • DNA, Mitochondrial / genetics*
  • Fossils*
  • Mammals / genetics
  • Molecular Sequence Data
  • Palaeognathae / genetics
  • Polymerase Chain Reaction / methods*
  • Sequence Analysis, DNA*

Substances

  • DNA, Mitochondrial

Associated data

  • GENBANK/DQ318533
  • GENBANK/DQ318534
  • GENBANK/DQ318535
  • GENBANK/DQ318536
  • GENBANK/DQ318537
  • GENBANK/DQ318538
  • GENBANK/DQ318539
  • GENBANK/DQ318540
  • GENBANK/DQ318541
  • GENBANK/DQ318542
  • GENBANK/DQ318543
  • GENBANK/DQ318544
  • GENBANK/DQ318545
  • GENBANK/DQ318546
  • GENBANK/DQ318547
  • GENBANK/DQ318548
  • GENBANK/DQ318549
  • GENBANK/DQ318550
  • GENBANK/DQ318551
  • GENBANK/DQ318552
  • GENBANK/DQ318553
  • GENBANK/DQ318554
  • GENBANK/DQ318555
  • GENBANK/DQ318556
  • GENBANK/DQ318557
  • GENBANK/DQ318558
  • GENBANK/DQ318559
  • GENBANK/DQ318560
  • GENBANK/DQ318561
  • GENBANK/DQ318562