The genome sequence of the orchid Phalaenopsis equestris

Nat Genet. 2015 Jan;47(1):65-72. doi: 10.1038/ng.3149. Epub 2014 Nov 24.

Abstract

Orchidaceae, renowned for its spectacular flowers and other reproductive and ecological adaptations, is one of the most diverse plant families. Here we present the genome sequence of the tropical epiphytic orchid Phalaenopsis equestris, a frequently used parent species for orchid breeding. P. equestris is the first plant with crassulacean acid metabolism (CAM) for which the genome has been sequenced. Our assembled genome contains 29,431 predicted protein-coding genes. We find that contigs likely to be underassembled, owing to heterozygosity, are enriched for genes that might be involved in self-incompatibility pathways. We find evidence for an orchid-specific paleopolyploidy event that preceded the radiation of most orchid clades, and our results suggest that gene duplication might have contributed to the evolution of CAM photosynthesis in P. equestris. Finally, we find expanded and diversified families of MADS-box C/D-class, B-class AP3 and AGL6-class genes, which might contribute to the highly specialized morphology of orchid flowers.

Publication types

  • Comparative Study
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Evolution, Molecular
  • Gene Expression Profiling
  • Gene Expression Regulation, Developmental
  • Gene Expression Regulation, Plant
  • Genes, Plant
  • Genome, Plant*
  • Introns / genetics
  • MADS Domain Proteins
  • Mutation Rate
  • Orchidaceae / classification
  • Orchidaceae / genetics*
  • Orchidaceae / metabolism
  • Photosynthesis / genetics
  • Phylogeny
  • Plant Proteins / genetics
  • Plant Proteins / metabolism
  • RNA, Messenger / biosynthesis
  • RNA, Messenger / genetics
  • RNA, Plant / biosynthesis
  • RNA, Plant / genetics
  • Sequence Alignment
  • Species Specificity

Substances

  • MADS Domain Proteins
  • Plant Proteins
  • RNA, Messenger
  • RNA, Plant

Associated data

  • BioProject/PRJNA192198