Transcriptome innovations in primates revealed by single-molecule long-read sequencing

Genome Res. 2022 Aug 25;32(8):1448-1462. doi: 10.1101/gr.276395.121.

Abstract

Transcriptomic diversity greatly contributes to the fundamentals of disease, lineage-specific biology, and environmental adaptation. However, much of the actual isoform repertoire contributing to shaping primate evolution remains unknown. Here, we combined deep long- and short-read sequencing complemented with mass spectrometry proteomics in a panel of lymphoblastoid cell lines (LCLs) from human, three other great apes, and rhesus macaque, producing the largest full-length isoform catalog in primates to date. Around half of the captured isoforms are not annotated in their reference genomes, significantly expanding the gene models in primates. Furthermore, our comparative analyses unveil hundreds of transcriptomic innovations and isoform usage changes related to immune function and immunological disorders. The confluence of these evolutionary innovations with signals of positive selection and their limited impact in the proteome points to changes in alternative splicing in genes involved in immune response as an important target of recent regulatory divergence in primates.

MeSH terms

  • Alternative Splicing
  • Animals
  • Evolution, Molecular
  • High-Throughput Nucleotide Sequencing / methods
  • Humans
  • Macaca mulatta / genetics
  • Primates / genetics
  • Protein Isoforms / genetics
  • Transcriptome*

Substances

  • Protein Isoforms