Functionally Significant Features in the 5' Untranslated Region of the ABCA1 Gene and Their Comparison in Vertebrates

Cells. 2019 Jun 21;8(6):623. doi: 10.3390/cells8060623.

Abstract

Single nucleotide polymorphisms located in 5' untranslated regions (5'UTRs) can regulate gene expression and have clinical impact. Recognition of functionally significant sequences within 5'UTRs is crucial in next-generation sequencing applications. Furthermore, information about the behavior of 5'UTRs during gene evolution is scarce. Using the example of the ATP-binding cassette transporter A1 (ABCA1) gene (Tangier disease), we describe our algorithm for functionally significant sequence finding. 5'UTR features (upstream start and stop codons, open reading frames (ORFs), GC content, motifs, and secondary structures) were studied using freely available bioinformatics tools in 55 vertebrate orthologous genes obtained from Ensembl and UCSC. The most conserved sequences were suggested as hot spots. Exon and intron enhancers and silencers (sc35, ighg2 cgamma2, ctnt, gh-1, and fibronectin eda exon), transcription factors (TFIIA, TATA, NFAT1, NFAT4, and HOXA13), some of them cancer related, and microRNA (hsa-miR-4474-3p) were localized to these regions. An upstream ORF, overlapping with the main ORF in primates and possibly coding for a small bioactive peptide, was also detected. Moreover, we showed several features of 5'UTRs, such as GC content variation, hairpin structure conservation or 5'UTR segmentation, which are interesting from a phylogenetic point of view and can stimulate further evolutionary oriented research.

Keywords: 5′ untranslated region; ABCA1; bioinformatics; gene regulation; single nucleotide polymorphism.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • 5' Untranslated Regions / genetics*
  • ATP Binding Cassette Transporter 1 / chemistry
  • ATP Binding Cassette Transporter 1 / genetics*
  • Amino Acid Sequence
  • Animals
  • Base Composition / genetics
  • Base Sequence
  • Conserved Sequence / genetics
  • Enhancer Elements, Genetic / genetics
  • Humans
  • Introns / genetics
  • Mammals / genetics
  • Molecular Sequence Annotation
  • Nucleic Acid Conformation
  • Nucleotide Motifs / genetics
  • Open Reading Frames / genetics
  • Phylogeny
  • RNA Splicing / genetics
  • RNA, Messenger / genetics
  • RNA, Messenger / metabolism
  • Vertebrates / genetics*

Substances

  • 5' Untranslated Regions
  • ATP Binding Cassette Transporter 1
  • RNA, Messenger