MHConstructor: a high-throughput, haplotype-informed solution to the MHC assembly challenge

Genome Biol. 2024 Oct 17;25(1):274. doi: 10.1186/s13059-024-03412-6.

Abstract

The extremely high levels of genetic polymorphism within the human major histocompatibility complex (MHC) limit the usefulness of reference-based alignment methods for sequence assembly. We incorporate a short-read, de novo assembly algorithm into a workflow for novel application to the MHC. MHConstructor is a containerized pipeline designed for high-throughput, haplotype-informed, reproducible assembly of both whole genome sequencing and target capture short-read data in large, population cohorts. To-date, no other self-contained tool exists for the generation of de novo MHC assemblies from short-read data. MHConstructor facilitates wide-spread access to high-quality, alignment-free MHC sequence analysis.

Keywords: De novo assembly; Haplotype; Human leukocyte antigen genes; Major histocompatibility complex; Short-read sequencing.

MeSH terms

  • Algorithms
  • Haplotypes*
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Major Histocompatibility Complex* / genetics
  • Software