A method for multiplexed full-length single-molecule sequencing of the human mitochondrial genome

Nat Commun. 2022 Oct 6;13(1):5902. doi: 10.1038/s41467-022-33530-3.

Abstract

Methods that reconstruct the mitochondrial DNA (mtDNA) sequence from short-read sequencing carry an inherent bias due to amplification and mapping. They can fail to determine the phase of variants, to capture multiple deletions and to cover the mitochondrial genome evenly. Here we describe a method to target, multiplex and sequence full-length human mitochondrial genomes at high coverage as native single molecules, utilizing the RNA-guided DNA endonuclease Cas9. By using the Cas9-induced breaks, which define the mtDNA beginning and end of the sequencing reads, as barcodes, we achieve high demultiplexing specificity and delineation of the full length of the mtDNA, regardless of the structural variant pattern. The long-read sequencing data is analysed with a pipeline in which our custom-developed software, baldur, efficiently detects single-nucleotide heteroplasmy to below 1%, physically determines phase and can accurately disentangle complex deletions. Our workflow is a tool for studying mtDNA variation and will accelerate mitochondrial research.
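
The idea of treating Cas9 cut sites as barcodes can be illustrated with a minimal sketch. It is not the published pipeline and not how baldur works internally. It assumes that each sample in a pool was linearised at a different, known position on the circular mtDNA, so a full-length read should begin and end next to its own sample's cut site. The function names, the example cut positions and the 20 bp tolerance are hypothetical; only the 16,569 bp length of the rCRS reference is a known value. The last function shows the naive definition of a heteroplasmy fraction as the share of reads carrying an alternative base, without the error modelling a real variant caller would need.

```python
MT_LENGTH = 16569  # length of the rCRS human mitochondrial reference (bp)


def circular_distance(a, b, length=MT_LENGTH):
    """Shortest distance between two positions on a circular genome."""
    d = abs(a - b) % length
    return min(d, length - d)


def assign_sample(read_start, read_end, cut_sites, tolerance=20):
    """Assign a read to the sample whose cut site flanks both read ends.

    cut_sites maps a sample name to its (hypothetical) Cas9 cut position.
    Returns None when zero or several samples match, so truncated or
    ambiguous reads are left unassigned rather than guessed.
    """
    matches = [
        sample
        for sample, cut in cut_sites.items()
        if circular_distance(read_start, cut) <= tolerance
        and circular_distance(read_end, cut) <= tolerance
    ]
    return matches[0] if len(matches) == 1 else None


def heteroplasmy_fraction(bases, alt):
    """Naive fraction of reads carrying the alt base at one position.

    bases is a string with one base call per read; calls other than
    A, C, G or T (for example 'N' or '-') are ignored.
    """
    called = [b for b in bases if b in "ACGT"]
    if not called:
        return None
    return sum(b == alt for b in called) / len(called)


if __name__ == "__main__":
    cuts = {"sampleA": 1000, "sampleB": 8000}  # hypothetical cut positions
    print(assign_sample(1003, 998, cuts))    # both ends near 1000
    print(assign_sample(1003, 5000, cuts))   # truncated read, unassigned
    print(heteroplasmy_fraction("AAAAAAAAAG", "G"))
```

Requiring both read ends to match the same cut site is what lets this kind of scheme discard fragmented molecules while keeping full-length ones, including those that carry deletions, because the ends stay fixed even when the read length changes.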

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • DNA, Mitochondrial / genetics
  • Deoxyribonuclease I / genetics
  • Genome, Human / genetics
  • Genome, Mitochondrial* / genetics
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Nucleotides
  • RNA
  • Sequence Analysis, DNA / methods

Substances

  • DNA, Mitochondrial
  • Nucleotides
  • RNA
  • Deoxyribonuclease I