Noncoding RNA: Current Deep Sequencing Data Analysis Approaches and Challenges

Hum Mutat. 2016 Dec;37(12):1283-1298. doi: 10.1002/humu.23066. Epub 2016 Sep 5.

Abstract

One of the most significant biological discoveries of the last decade is represented by the reality that the vast majority of the transcribed genomic output comprises diverse classes of noncoding RNAs (ncRNAs) that may play key roles and/or be affected by many biochemical cellular processes (i.e., RNA editing), with implications in human health and disease. With 90% of the human genome being transcribed and novel classes of ncRNA emerging (tRNA-derived small RNAs and circular RNAs among others), the great majority of the human transcriptome suggests that many important ncRNA functions/processes are yet to be discovered. An approach to filling such vast void of knowledge has been recently provided by the increasing application of next-generation sequencing (NGS), offering the unprecedented opportunity to obtain a more accurate profiling with higher resolution, increased throughput, sequencing depth, and low experimental complexity, concurrently posing an increasing challenge in terms of efficiency, accuracy, and usability of data analysis software. This review provides an overview of ncRNAs, NGS technology, and the most recent/popular computational approaches and the challenges they attempt to solve, which are essential to a more sensitive and comprehensive ncRNA annotation capable of furthering our understanding of this still vastly uncharted genomic territory.

Keywords: NGS; RNA editing; circRNA; computational approaches; lncRNA; ncRNA; small ncRNA; tRF.

Publication types

  • Review

MeSH terms

  • Gene Expression Profiling / methods
  • Genome, Human
  • High-Throughput Nucleotide Sequencing / methods*
  • Humans
  • Molecular Sequence Annotation
  • RNA, Untranslated / genetics*
  • Sequence Analysis, RNA / methods*
  • Software

Substances

  • RNA, Untranslated