STITCH: algorithm to splice, trim, identify, track, and capture the uniqueness of 16S rRNAs sequence pairs using public or in-house database

Microb Ecol. 2011 Apr;61(3):669-75. doi: 10.1007/s00248-010-9779-2. Epub 2010 Nov 27.

Abstract

A comparison of variable regions within the 16S rRNA gene is widely used to characterize relationships between bacteria and to identify phylogenetic affiliation of unknown bacteria. In environmental studies, polymerase chain reaction amplification of 16S rRNA followed by cloning and sequencing of numerous individual clones is an extensively used molecular method for elucidating microbial diversity. The sequencing process typically utilizes a forward and reverse primer pair to produce two partial reads (~700 to 800 base pairs each) that overlap and in total cover a large region of the full 16S rRNA sequence (~1.5 k base). In a typical application, this approach rapidly generates very large numbers of 16S rRNA datasets that can overwhelm manual processing efforts leading to both delays and errors. In particular, the approach presents two computational challenges: (1) the assembly of a composite sequence from the two partial reads and (2) the subsequent appropriate identification of the organism represented by the newly sequenced clones. Herein, we describe a software package, search, trim, identify, track, and capture the uniqueness of 16S rRNAs using public and in-house database (STITCH), which offers automated sequence pair splicing and genetic identification, thus simplifying the computationally intensive analysis of large sequencing libraries. The STITCH software is freely accessible over the Internet at: http://prion.bchs.uh.edu/stitch/.

Publication types

  • Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

  • Algorithms*
  • Bacteria / classification
  • Bacteria / genetics
  • Computational Biology / methods*
  • Databases, Nucleic Acid*
  • RNA Splicing
  • RNA, Ribosomal, 16S / genetics*
  • Sequence Analysis, RNA / methods*
  • Software
  • User-Computer Interface

Substances

  • RNA, Ribosomal, 16S