A draft map of the human proteome

Nature. 2014 May 29;509(7502):575-81. doi: 10.1038/nature13302.

Abstract

The availability of human genome sequence has transformed biomedical research over the past decade. However, an equivalent map for the human proteome with direct measurements of proteins and peptides does not exist yet. Here we present a draft map of the human proteome using high-resolution Fourier-transform mass spectrometry. In-depth proteomic profiling of 30 histologically normal human samples, including 17 adult tissues, 7 fetal tissues and 6 purified primary haematopoietic cells, resulted in identification of proteins encoded by 17,294 genes accounting for approximately 84% of the total annotated protein-coding genes in humans. A unique and comprehensive strategy for proteogenomic analysis enabled us to discover a number of novel protein-coding regions, which includes translated pseudogenes, non-coding RNAs and upstream open reading frames. This large human proteome catalogue (available as an interactive web-based resource at http://www.humanproteomemap.org) will complement available human genome and transcriptome data to accelerate biomedical research in health and disease.

Publication types

  • Research Support, N.I.H., Extramural
  • Research Support, Non-U.S. Gov't

MeSH terms

  • Adult
  • Cells, Cultured
  • Databases, Protein
  • Fetus / metabolism
  • Fourier Analysis
  • Gene Expression Profiling
  • Genome, Human / genetics
  • Hematopoietic Stem Cells / cytology
  • Hematopoietic Stem Cells / metabolism
  • Humans
  • Internet
  • Mass Spectrometry
  • Molecular Sequence Annotation
  • Open Reading Frames / genetics
  • Organ Specificity
  • Protein Biosynthesis
  • Protein Isoforms / analysis
  • Protein Isoforms / genetics
  • Protein Isoforms / metabolism
  • Protein Sorting Signals
  • Protein Transport
  • Proteome / analysis
  • Proteome / chemistry
  • Proteome / genetics
  • Proteome / metabolism*
  • Proteomics*
  • Pseudogenes / genetics
  • RNA, Untranslated / genetics
  • Reproducibility of Results
  • Untranslated Regions / genetics

Substances

  • Protein Isoforms
  • Protein Sorting Signals
  • Proteome
  • RNA, Untranslated
  • Untranslated Regions