Discovering non-coding RNA elements in Drosophila 3' untranslated regions

Int J Bioinform Res Appl. 2014;10(4-5):479-97. doi: 10.1504/IJBRA.2014.062996.

Abstract

The Non-Coding RNA (ncRNA) elements in the 3' Untranslated Regions (3'-UTRs) are known to participate in the genes' post-transcriptional regulations. Inferring co-expression patterns of the genes through clustering these 3'-UTR ncRNA elements will provide invaluable insights for studying their biological functions. In this paper, we propose an improved RNA structural clustering pipeline. Benchmark of the new pipeline on Rfam data demonstrates over 10% performance improvements compared to the traditional hierarchical clustering pipeline. By applying the new clustering pipeline to 3'-UTRs of Drosophila melanogaster's genome, we have successfully identified 184 ncRNA clusters with 91.3% accuracy. One of these clusters corresponds to genes that are preferentially expressed in male Drosophila. Another cluster contains genes that are responsible for the functions of septate junction in epithelial cells. These discoveries encourage more studies on novel post-transcriptional regulation mechanisms.

Keywords: 3' Drosophila genome; RNA secondary structure; bioinformatics; clustering; co–expression patterns; gene expression; ncRNA clusters; non–coding RNA; post–transcriptional regulation; untranslated region.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 3' Untranslated Regions*
  • Algorithms
  • Animals
  • Cluster Analysis
  • Computational Biology / methods
  • Drosophila Proteins / chemistry
  • Drosophila Proteins / genetics
  • Drosophila melanogaster / genetics*
  • Gene Expression Profiling / methods
  • Male
  • Models, Statistical
  • Nucleic Acid Conformation
  • RNA / chemistry
  • RNA Processing, Post-Transcriptional
  • RNA, Untranslated*
  • Sequence Alignment
  • Sequence Analysis, RNA

Substances

  • 3' Untranslated Regions
  • Drosophila Proteins
  • RNA, Untranslated
  • RNA