PEPPeR, a platform for experimental proteomic pattern recognition

Jacob D Jaffe; D R Mani; Kyriacos C Leptos; George M Church; Michael A Gillette; Steven A Carr

doi:10.1074/mcp.M600222-MCP200

PEPPeR, a platform for experimental proteomic pattern recognition

Mol Cell Proteomics. 2006 Oct;5(10):1927-41. doi: 10.1074/mcp.M600222-MCP200. Epub 2006 Jul 19.

Authors

Jacob D Jaffe¹, D R Mani, Kyriacos C Leptos, George M Church, Michael A Gillette, Steven A Carr

Affiliation

¹ The Broad Institute of Harvard and the Massachusetts Institute of Technology, Cambridge, 02142, USA.

Abstract

Quantitative proteomics holds considerable promise for elucidation of basic biology and for clinical biomarker discovery. However, it has been difficult to fulfill this promise due to over-reliance on identification-based quantitative methods and problems associated with chromatographic separation reproducibility. Here we describe new algorithms termed "Landmark Matching" and "Peak Matching" that greatly reduce these problems. Landmark Matching performs time base-independent propagation of peptide identities onto accurate mass LC-MS features in a way that leverages historical data derived from disparate data acquisition strategies. Peak Matching builds upon Landmark Matching by recognizing identical molecular species across multiple LC-MS experiments in an identity-independent fashion by clustering. We have bundled these algorithms together with other algorithms, data acquisition strategies, and experimental designs to create a Platform for Experimental Proteomic Pattern Recognition (PEPPeR). These developments enable use of established statistical tools previously limited to microarray analysis for treatment of proteomics data. We demonstrate that the proposed platform can be calibrated across 2.5 orders of magnitude and can perform robust quantification of ratios in both simple and complex mixtures with good precision and error characteristics across multiple sample preparations. We also demonstrate de novo marker discovery based on statistical significance of unidentified accurate mass components that changed between two mixtures. These markers were subsequently identified by accurate mass-driven MS/MS acquisition and demonstrated to be contaminant proteins associated with known proteins whose concentrations were designed to change between the two mixtures. These results have provided a real world validation of the platform for marker discovery.

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

MeSH terms

Algorithms*
Animals
Biomarkers
Calibration
Mice
Mice, Inbred C57BL
Models, Theoretical
Normal Distribution
Pattern Recognition, Automated*
Peptides / chemistry
Proteomics / methods*

Substances

Biomarkers
Peptides

Grants and funding

R01 CA126219/CA/NCI NIH HHS/United States