metaMIC: reference-free misassembly identification and correction of de novo metagenomic assemblies

Genome Biol. 2022 Nov 14;23(1):242. doi: 10.1186/s13059-022-02810-y.

Abstract

Evaluating the quality of metagenomic assemblies is important for constructing reliable metagenome-assembled genomes and downstream analyses. Here, we present metaMIC (https://github.com/ZhaoXM-Lab/metaMIC), a machine learning-based tool for identifying and correcting misassemblies in metagenomic assemblies. Benchmarking results on both simulated and real datasets demonstrate that metaMIC outperforms existing tools when identifying misassembled contigs. Furthermore, metaMIC is able to localize the misassembly breakpoints, and the correction of misassemblies by splitting at misassembly breakpoints can improve downstream scaffolding and binning results.
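
The correction step described above, splitting a contig at its predicted breakpoints, can be illustrated with a minimal sketch. This is not metaMIC's implementation: the function name `split_contig`, the `_partN` naming scheme, the `min_length` parameter, and the convention that a breakpoint is a 0-based cut index are all assumptions made for illustration. The sketch also assumes the breakpoint positions have already been predicted by some other step.

```python
def split_contig(name, sequence, breakpoints, min_length=1):
    """Split one contig into fragments at the given breakpoints.

    Hypothetical helper, not part of metaMIC. A breakpoint b is treated
    as a 0-based cut index: the contig is cut between sequence[b-1] and
    sequence[b]. Breakpoints outside the open interval (0, len) are ignored.
    """
    # Deduplicate, drop out-of-range positions, and sort the cut points.
    cuts = sorted({b for b in breakpoints if 0 < b < len(sequence)})
    bounds = [0] + cuts + [len(sequence)]

    pieces = []
    for index, (start, end) in enumerate(zip(bounds, bounds[1:]), start=1):
        fragment = sequence[start:end]
        # Optionally discard fragments too short to be useful downstream.
        if len(fragment) >= min_length:
            pieces.append((f"{name}_part{index}", fragment))
    return pieces


if __name__ == "__main__":
    # Toy contig with two assumed breakpoints.
    for piece_name, piece_seq in split_contig("contig_1", "AAAACCCCGGGG", [4, 8]):
        print(f">{piece_name}\n{piece_seq}")
```

Each resulting fragment can then be passed to scaffolding or binning as an independent contig, which is the scenario in which the abstract reports improved downstream results.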

Keywords: Binning; Metagenome-assembled genomes; Metagenomic assemblies; Misassembled contigs; Misassembly breakpoints.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Algorithms
  • Benchmarking
  • Machine Learning
  • Metagenome*
  • Metagenomics* / methods
  • Sequence Analysis, DNA / methods
  • Software