Lateral gene transfer and ancient paralogy of operons containing redundant copies of tryptophan-pathway genes in Xylella species and in heterocystous cyanobacteria

Genome Biol. 2003;4(2):R14. doi: 10.1186/gb-2003-4-2-r14. Epub 2003 Jan 29.

Abstract

Background: Tryptophan-pathway genes that exist within an apparent operon-like organization were evaluated as examples of multi-genic genomic regions that contain phylogenetically incongruous genes and coexist with genes outside the operon that are congruous. A seven-gene cluster in Xylella fastidiosa includes genes encoding the two subunits of anthranilate synthase, an aryl-CoA synthetase, and trpR. A second gene block, present in the Anabaena/Nostoc lineage, but not in other cyanobacteria, contains a near-complete tryptophan operon nested within an apparent supraoperon containing other aromatic-pathway genes.

Results: The gene block in X. fastidiosa exhibits a sharply delineated low-GC content. This, as well as bias of codon usage and 3:1 dinucleotide analysis, strongly implicates lateral gene transfer (LGT). In contrast, parametric studies and protein tree phylogenies did not support the origination of the Anabaena/Nostoc gene block by LGT.

Conclusions: Judging from the apparent minimal amelioration, the low-GC gene block in X. fastidiosa probably originated by LGT at a relatively recent time. The surprising inability to pinpoint a donor lineage still leaves room for alternative, albeit less likely, explanations other than LGT. On the other hand, the large Anabaena/Nostoc gene block does not seem to have arisen by LGT. We suggest that the contemporary Anabaena/Nostoc array of divergent paralogs represents an ancient ancestral state of paralog divergence, with extensive streamlining by gene loss occurring in the lineage of descent representing other (unicellular) cyanobacteria.

Publication types

  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Amino Acid Sequence
  • Anabaena / genetics
  • Bacterial Proteins / genetics
  • Base Sequence
  • Codon / genetics
  • Cyanobacteria / genetics*
  • Cyanobacteria / metabolism
  • Gammaproteobacteria / genetics*
  • Gammaproteobacteria / metabolism
  • Gene Transfer, Horizontal*
  • Molecular Sequence Data
  • Operon / genetics*
  • Phylogeny
  • Repressor Proteins / genetics
  • Sequence Homology, Amino Acid
  • Tryptophan / biosynthesis*

Substances

  • Bacterial Proteins
  • Codon
  • Repressor Proteins
  • TRPR protein, E coli
  • Tryptophan