Quantification and discovery of sequence determinants of protein-per-mRNA amount in 29 human tissues

Mol Syst Biol. 2019 Feb 18;15(2):e8513. doi: 10.15252/msb.20188513.

Abstract

Despite the importance of sequence features in determining protein abundance, a comprehensive catalogue of the features controlling protein-to-mRNA (PTR) ratios and a quantification of their effects are still lacking. Here, we quantified PTR ratios for 11,575 proteins across 29 human tissues using matched transcriptomes and proteomes. By regression, we estimated the contribution of known sequence determinants of protein synthesis and degradation, in addition to 45 mRNA and 3 protein sequence motifs that we found by association testing. While PTR ratios span more than 2 orders of magnitude, our integrative model predicts PTR ratios at a median precision of 3.2-fold. A reporter assay provided functional support for two novel UTR motifs, and an immobilized mRNA affinity competition-binding assay identified motif-specific bound proteins for one motif. Moreover, our integrative model led to a new metric of codon optimality that captures the effects of codon frequency on protein synthesis and degradation. Altogether, this study shows that a large fraction of PTR ratio variation in human tissues can be predicted from sequence, and it identifies many new candidate post-transcriptional regulatory elements.
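
To make the quantities in the abstract concrete, the following minimal Python sketch computes a log10 PTR ratio from a protein and an mRNA abundance, and summarizes prediction accuracy as a median fold error. The abstract does not define how "median precision" was calculated, so the median of 10^|observed - predicted| used here is an assumption. The function names `log10_ptr` and `median_fold_error` are hypothetical, and the numbers are toy values, not data from the study.

```python
import math
from statistics import median


def log10_ptr(protein, mrna):
    """Return the log10 protein-to-mRNA ratio for one gene in one tissue."""
    return math.log10(protein / mrna)


def median_fold_error(observed_log10, predicted_log10):
    """Return the median multiplicative error between observed and predicted ratios.

    Assumed definition (not taken from the paper): median of 10**|obs - pred|.
    """
    return median(
        10 ** abs(obs - pred)
        for obs, pred in zip(observed_log10, predicted_log10)
    )


# Toy abundances invented purely for illustration (arbitrary units).
protein = [1e6, 5e4, 2e5]
mrna = [10.0, 50.0, 4.0]

observed = [log10_ptr(p, m) for p, m in zip(protein, mrna)]
predicted = [4.5, 3.0, 5.0]  # hypothetical model output on the log10 scale

print(median_fold_error(observed, predicted))
```

Working on the log10 scale turns fold differences into additive errors, which is why a fold-error summary pairs naturally with a regression model fitted to log-transformed ratios.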

Keywords: codon usage; mRNA sequence motifs; proteomics; transcriptomics; translational control.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Gene Expression Regulation / genetics
  • Genome, Human / genetics
  • Humans
  • Mass Spectrometry / methods
  • Proteins / genetics*
  • Proteome / genetics*
  • Proteomics / methods
  • RNA, Messenger / genetics
  • Sequence Analysis, RNA / methods
  • Tissue Distribution / genetics*
  • Transcriptome / genetics*

Substances

  • Proteins
  • Proteome
  • RNA, Messenger