Genome DNA sequencing around the EF-1 alpha multigene locus of Arabidopsis thaliana indicates a high gene density and a shuffling of noncoding regions

Genome Res. 1997 Mar;7(3):198-209. doi: 10.1101/gr.7.3.198.

Abstract

In Arabidopsis thaliana, EF-1 alpha proteins are encoded by a multigene family of four members. Three of them are clustered at the same locus, which was positioned 24 cM from the top of chromosome 1. A region of DNA spanning 63 kb around these locus was sequenced and analyzed. One main characteristic of the locus is the mosaic organization of both genes and intergenic regions. Fourteen genes were identified, among which only four were already described, and other unidentified are most likely present. Functionally diverse genes are found at close intervals. Exon and intron distribution is highly variable at this locus, one gene being split into at least 20 introns. Several duplications were found within the sequenced segment both in coding and noncoding regions, including two gene families. Moreover, a sequence corresponding to the 5' noncoding region of the EF-1 alpha genes and harboring a 5' intervening sequence is duplicated and found upstream of several genes, suggesting that noncoding regions can be shuffled during evolution.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Arabidopsis / genetics*
  • Base Sequence
  • Chromosome Mapping
  • Chromosomes, Artificial, Yeast
  • Genes, Plant / genetics*
  • Glutaredoxins
  • Molecular Sequence Data
  • Multigene Family / genetics*
  • Oxidoreductases*
  • Peptide Elongation Factor 1
  • Peptide Elongation Factors / genetics*
  • Plant Proteins / genetics*
  • Plants, Toxic
  • Proteins / genetics
  • Ricinus / genetics
  • Sequence Analysis, DNA
  • Sequence Homology, Nucleic Acid

Substances

  • Glutaredoxins
  • Peptide Elongation Factor 1
  • Peptide Elongation Factors
  • Plant Proteins
  • Proteins
  • Oxidoreductases

Associated data

  • GENBANK/U63815