The genome sequence of Caenorhabditis briggsae: a platform for comparative genomics

PLoS Biol. 2003 Nov;1(2):E45. doi: 10.1371/journal.pbio.0000045. Epub 2003 Nov 17.

Abstract

The soil nematodes Caenorhabditis briggsae and Caenorhabditis elegans diverged from a common ancestor roughly 100 million years ago and yet are almost indistinguishable by eye. They have the same chromosome number and genome sizes, and they occupy the same ecological niche. To explore the basis for this striking conservation of structure and function, we have sequenced the C. briggsae genome to a high-quality draft stage and compared it to the finished C. elegans sequence. We predict approximately 19,500 protein-coding genes in the C. briggsae genome, roughly the same as in C. elegans. Of these, 12,200 have clear C. elegans orthologs, a further 6,500 have one or more clearly detectable C. elegans homologs, and approximately 800 C. briggsae genes have no detectable matches in C. elegans. Almost all of the noncoding RNAs (ncRNAs) known are shared between the two species. The two genomes exhibit extensive colinearity, and the rate of divergence appears to be higher in the chromosomal arms than in the centers. Operons, a distinctive feature of C. elegans, are highly conserved in C. briggsae, with the arrangement of genes being preserved in 96% of cases. The difference in size between the C. briggsae (estimated at approximately 104 Mbp) and C. elegans (100.3 Mbp) genomes is almost entirely due to repetitive sequence, which accounts for 22.4% of the C. briggsae genome in contrast to 16.5% of the C. elegans genome. Few, if any, repeat families are shared, suggesting that most were acquired after the two species diverged or are undergoing rapid evolution. Coclustering the C. elegans and C. briggsae proteins reveals 2,169 protein families of two or more members. Most of these are shared between the two species, but some appear to be expanding or contracting, and there seem to be as many as several hundred novel C. briggsae gene families. The C. briggsae draft sequence will greatly improve the annotation of the C. elegans genome. Based on similarity to C. briggsae, we found strong evidence for 1,300 new C. elegans genes. In addition, comparisons of the two genomes will help to understand the evolutionary forces that mold nematode genomes.

Publication types

  • Comparative Study
  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't
  • Research Support, U.S. Gov't, Non-P.H.S.
  • Research Support, U.S. Gov't, P.H.S.

MeSH terms

  • Animals
  • Biological Evolution
  • Caenorhabditis / genetics*
  • Caenorhabditis elegans / genetics*
  • Chromosome Mapping
  • Chromosomes, Artificial, Bacterial
  • Cluster Analysis
  • Codon
  • Conserved Sequence
  • Evolution, Molecular
  • Exons
  • Gene Library
  • Genome*
  • Genomics / methods*
  • Interspersed Repetitive Sequences
  • Introns
  • MicroRNAs / genetics
  • Models, Genetic
  • Models, Statistical
  • Molecular Sequence Data
  • Multigene Family
  • Open Reading Frames
  • Physical Chromosome Mapping
  • Plasmids / metabolism
  • Protein Structure, Tertiary
  • Proteins / chemistry
  • RNA / chemistry
  • RNA, Ribosomal / genetics
  • RNA, Spliced Leader
  • RNA, Transfer / genetics
  • Sequence Analysis, DNA
  • Species Specificity

Substances

  • Codon
  • MicroRNAs
  • Proteins
  • RNA, Ribosomal
  • RNA, Spliced Leader
  • RNA
  • RNA, Transfer