Abstract
Many animal and plant genomes are transcribed much more extensively than current annotations predict. However, the biological function of these unannotated transcribed regions is largely unknown. Approximately 7% and 23% of the detected transcribed nucleotides during D. melanogaster embryogenesis map to unannotated intergenic and intronic regions, respectively. Based on computational analysis of coordinated transcription, we conservatively estimate that 29% of all unannotated transcribed sequences function as missed or alternative exons of well-characterized protein-coding genes. We estimate that 15.6% of intergenic transcribed regions function as missed or alternative transcription start sites (TSS) used by 11.4% of the expressed protein-coding genes. Identification of P element mutations within or near newly identified 5' exons provides a strategy for mapping previously uncharacterized mutations to their respective genes. Collectively, these data indicate that at least 85% of the fly genome is transcribed and processed into mature transcripts representing at least 30% of the fly genome.
Publication types
-
Research Support, N.I.H., Extramural
-
Research Support, Non-U.S. Gov't
MeSH terms
-
Amino Acid Sequence
-
Animals
-
DNA, Intergenic
-
Drosophila Proteins / genetics
-
Drosophila melanogaster / embryology*
-
Drosophila melanogaster / genetics*
-
Embryo, Nonmammalian
-
Exons
-
Gene Expression Regulation, Developmental*
-
Genome, Insect
-
Molecular Sequence Data
-
Mutation
-
Oligonucleotide Array Sequence Analysis
-
Transcription Initiation Site
-
Transcription, Genetic*
Substances
-
DNA, Intergenic
-
Drosophila Proteins
Associated data
-
GENBANK/DQ327735
-
GENBANK/DQ327736
-
GENBANK/DQ327737
-
GENBANK/DQ327738
-
GENBANK/DQ327739
-
GENBANK/DQ327740
-
GENBANK/DQ327741
-
GENBANK/DQ327742
-
GENBANK/DQ327743
-
GENBANK/DQ327744
-
GENBANK/DQ327745
-
GENBANK/DQ327746
-
GENBANK/DQ327747
-
GENBANK/DQ327748
-
GENBANK/DQ327749
-
GENBANK/DQ327750
-
GENBANK/DQ327751
-
GENBANK/DQ327752
-
GENBANK/DQ327753
-
GENBANK/DQ327754
-
GENBANK/DQ327755
-
GENBANK/DQ327756
-
GENBANK/DQ327757
-
GENBANK/DQ327758
-
GENBANK/DQ327759
-
GENBANK/DQ327760
-
GENBANK/DQ327761
-
GENBANK/DQ327762
-
GENBANK/DQ327763
-
GENBANK/DQ327764
-
GENBANK/DQ327765
-
GENBANK/DQ327766
-
GENBANK/DQ327767
-
GENBANK/DQ327768
-
GENBANK/DQ327769
-
GENBANK/DQ327770
-
GENBANK/DQ327771
-
GENBANK/DQ327772
-
GENBANK/DQ327773
-
GENBANK/DQ327774
-
GENBANK/DQ327775
-
GENBANK/DQ327776
-
GENBANK/DQ327777
-
GENBANK/DQ327778
-
GENBANK/DQ327779
-
GENBANK/DQ327780
-
GENBANK/DQ327781
-
GENBANK/DQ327782
-
GENBANK/DQ327783
-
GENBANK/DQ327784
-
GENBANK/DQ327785
-
GENBANK/DQ327786
-
GENBANK/DQ327787
-
GENBANK/DQ327788
-
GENBANK/DQ327789
-
GENBANK/DQ327790
-
GENBANK/DQ327791
-
GENBANK/DQ327792
-
GENBANK/DQ327793
-
GENBANK/DQ327794
-
GENBANK/DQ327795
-
GENBANK/DQ327796
-
GENBANK/DQ327797
-
GENBANK/DQ327798
-
GENBANK/DQ327799
-
GENBANK/DQ327800
-
GENBANK/DQ327801
-
GENBANK/DQ327802
-
GENBANK/DQ327803
-
GENBANK/DQ327804
-
GENBANK/DQ327805
-
GENBANK/DQ327806
-
GENBANK/DQ327807
-
GEO/GSE5514