Enhancement of de novo sequencing, assembly and annotation of the Mongolian gerbil genome with transcriptome sequencing and assembly from several different tissues

BMC Genomics. 2019 Nov 27;20(1):903. doi: 10.1186/s12864-019-6276-y.

Abstract

Background: The Mongolian gerbil (Meriones unguiculatus) has been used as a model organism for research on the auditory and visual systems, stroke/ischemia, epilepsy and aging since 1935, when laboratory gerbils were separated from their wild counterparts. In this study we report genome sequencing, assembly, and annotation, further supported by transcriptome sequencing and assembly from 27 different tissue samples.

Results: The genome was sequenced using Illumina HiSeq 2000, and the final assembly has a size of 2.54 Gbp with contig and scaffold N50 values of 31.4 Kbp and 500.0 Kbp, respectively. Based on the k-mer estimated genome size of 2.48 Gbp, the assembly appears to be complete. The genome annotation was supported by transcriptome data that identified 31,769 (> 2000 bp) predicted protein-coding genes across 27 tissue samples. A BUSCO search of 3023 mammalian groups found 86% of curated single-copy orthologs among the predicted genes, indicating a high level of completeness of the genome.
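
For readers unfamiliar with the N50 statistic quoted in the Results, the following is a minimal sketch in Python, assuming the common definition: sort the sequence lengths from longest to shortest and report the length at which the running total first reaches half of the total assembly length. The function name `n50` and the toy contig lengths are illustrative assumptions and are not taken from the study. The only figures drawn from the abstract are the assembly size (2.54 Gbp) and the k-mer estimate (2.48 Gbp).

```python
def n50(lengths):
    """Return the N50 of a collection of sequence lengths (illustrative helper)."""
    if not lengths:
        raise ValueError("need at least one sequence length")
    ordered = sorted(lengths, reverse=True)  # longest first
    half_total = sum(ordered) / 2
    running = 0
    for length in ordered:
        running += length
        if running >= half_total:
            # First length at which the longest sequences cover half the assembly.
            return length


# Toy contig lengths (hypothetical, not from the gerbil assembly).
toy_contigs = [80, 70, 50, 40, 30, 20, 10]
toy_n50 = n50(toy_contigs)
print("toy N50:", toy_n50)

# Ratio of assembled size to k-mer estimated size, using the abstract's figures.
assembly_gbp = 2.54
kmer_estimate_gbp = 2.48
size_ratio = assembly_gbp / kmer_estimate_gbp
print("assembly / k-mer estimate:", round(size_ratio, 3))
```

In the toy case the total is 300, so the running sum reaches the halfway point of 150 at the second-longest contig. The size ratio of roughly 1.02 is one simple way to read the statement that the assembly appears to be complete relative to the k-mer estimate.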

Conclusions: We report the first de novo assembly of the Mongolian gerbil genome, enhanced by assembly of transcriptome data from several tissues. Sequencing of this genome and transcriptome increases the utility of the gerbil as a model organism by making now widely used genetic tools available for it.

Keywords: Gerbil genome; Meriones unguiculatus; Model organism; Transcriptome.

MeSH terms

  • Animals
  • Computational Biology* / methods
  • Gene Expression Profiling / methods
  • Genome*
  • Genomics* / methods
  • Gerbillinae / genetics*
  • High-Throughput Nucleotide Sequencing*
  • Molecular Sequence Annotation
  • Organ Specificity
  • Transcriptome*