Identification of Drosophila MicroRNA targets

PLoS Biol. 2003 Dec;1(3):E60. doi: 10.1371/journal.pbio.0000060. Epub 2003 Oct 13.

Abstract

MicroRNAs (miRNAs) are short RNA molecules that regulate gene expression by binding to target messenger RNAs and by controlling protein production or causing RNA cleavage. To date, functions have been assigned to only a few of the hundreds of identified miRNAs, in part because of the difficulty in identifying their targets. The short length of miRNAs and the fact that their complementarity to target sequences is imperfect mean that target identification in animal genomes is not possible by standard sequence comparison methods. Here we screen conserved 3' UTR sequences from the Drosophila melanogaster genome for potential miRNA targets. The screening procedure combines a sequence search with an evaluation of the predicted miRNA-target heteroduplex structures and energies. We show that this approach successfully identifies the five previously validated let-7, lin-4, and bantam targets from a large database and predict new targets for Drosophila miRNAs. Our target predictions reveal striking clusters of functionally related targets among the top predictions for specific miRNAs. These include Notch target genes for miR-7, proapoptotic genes for the miR-2 family, and enzymes from a metabolic pathway for miR-277. We experimentally verified three predicted targets each for miR-7 and the miR-2 family, doubling the number of validated targets for animal miRNAs. Statistical analysis indicates that the best single predicted target sites are at the border of significance; thus, target predictions should be considered as tentative until experimentally validated. We identify features shared by all validated targets that can be used to evaluate target predictions for animal miRNAs. Our initial evaluation and experimental validation of target predictions suggest functions for two miRNAs. For others, the screen suggests plausible functions, such as a role for miR-277 as a metabolic switch controlling amino acid catabolism. Cross-genome comparison proved essential, as it allows reduction of the sequence search space. Improvements in genome annotation and increased availability of cDNA sequences from other genomes will allow more sensitive screens. An increase in the number of confirmed targets is expected to reveal general structural features that can be used to improve their detection. While the screen is likely to miss some targets, our study shows that valid targets can be identified from sequence alone.

MeSH terms

  • 3' Untranslated Regions
  • Animals
  • Computational Biology
  • DNA, Complementary / metabolism
  • Databases, Genetic
  • Drosophila Proteins / chemistry
  • Drosophila melanogaster / genetics*
  • Gene Expression Regulation
  • Genetic Techniques*
  • Genome
  • MicroRNAs* / genetics
  • MicroRNAs* / metabolism
  • Models, Genetic*
  • Molecular Sequence Data
  • Multigene Family
  • RNA Interference
  • RNA, Messenger / metabolism
  • Software
  • Species Specificity

Substances

  • 3' Untranslated Regions
  • DNA, Complementary
  • Drosophila Proteins
  • MicroRNAs
  • RNA, Messenger

Associated data

  • GENBANK/AF460844
  • GENBANK/AJ421757
  • GENBANK/AJ421762
  • GENBANK/AJ421767
  • GENBANK/AJ421769
  • GENBANK/AJ421771
  • GENBANK/AJ421773
  • GENBANK/AJ457831
  • GENBANK/AJ550546
  • GENBANK/AY055843
  • RefSeq/NM_059880
  • RefSeq/NM_060087
  • RefSeq/NM_076575
  • RefSeq/NM_077516
  • RefSeq/NM_078672
  • RefSeq/NM_079253
  • RefSeq/NM_079349
  • RefSeq/NM_079412
  • RefSeq/NM_079413
  • RefSeq/NM_079414
  • RefSeq/NM_079785
  • RefSeq/NM_079786
  • RefSeq/NM_079827
  • RefSeq/NM_080033
  • RefSeq/NM_080079
  • RefSeq/NM_130489
  • RefSeq/NM_132656
  • RefSeq/NM_132772
  • RefSeq/NM_141648
  • RefSeq/NM_166306
  • RefSeq/NM_167928
  • RefSeq/NR_000799
  • RefSeq/NR_000938