TermineR: Extracting information on endogenous proteolytic processing from shotgun proteomics data

Miguel Cosenza-Contreras; Adrianna Seredynska; Daniel Vogele; Niko Pinter; Eva Brombacher; Ruth Fiestas Cueto; Thien-Ly Julia Dinh; Patrick Bernhard; Manuel Rogg; Junwei Liu; Patrick Willems; Simon Stael; Pitter F Huesgen; E Wolfgang Kuehn; Clemens Kreutz; Christoph Schell; Oliver Schilling

doi:10.1002/pmic.202300491

TermineR: Extracting information on endogenous proteolytic processing from shotgun proteomics data

Proteomics. 2024 Oct;24(19):e2300491. doi: 10.1002/pmic.202300491. Epub 2024 Aug 10.

Authors

Miguel Cosenza-Contreras^{1

2}, Adrianna Seredynska^{1

2}, Daniel Vogele^{1

2

3}, Niko Pinter², Eva Brombacher^{1

4

5

6}, Ruth Fiestas Cueto^{1

2}, Thien-Ly Julia Dinh^{1

2}, Patrick Bernhard², Manuel Rogg², Junwei Liu⁷, Patrick Willems^{8

9

10

11}, Simon Stael^{8

9

12}, Pitter F Huesgen^{1

4}, E Wolfgang Kuehn⁷, Clemens Kreutz^{4

5}, Christoph Schell^{2

13}, Oliver Schilling^{2

14

13}

Affiliations

¹ Faculty of Biology, University of Freiburg, Freiburg, Germany.
² Faculty of Medicine, Institute for Surgical Pathology Medical Center-University of Freiburg, Freiburg, Germany.
³ ProtPath Research Training Group, University of Freiburg, Freiburg, Germany.
⁴ Centre for Integrative Biological Signaling Studies (CIBSS), Freiburg, Germany.
⁵ Faculty of Medicine and Medical Center, Institute of Medical Biometry and Statistics, University of Freiburg, Freiburg, Germany.
⁶ Spemann Graduate School of Biology and Medicine (SGBM), University of Freiburg, Freiburg, Germany.
⁷ Department of Medicine IV, Faculty of Medicine, Medical Center-University of Freiburg, Freiburg, Germany.
⁸ Department of Plant Biotechnology and Bioinformatics, Ghent University, Ghent, Belgium.
⁹ VIB-UGent Center for Plant Systems Biology, Ghent, Belgium.
¹⁰ Department of Biomolecular Medicine, Ghent University, Ghent, Belgium.
¹¹ Center for Medical Biotechnology, VIB, Ghent, Belgium.
¹² Department of Molecular Sciences, Uppsala BioCenter, Swedish University of Agricultural Sciences and Linnean Center for Plant Biology, Uppsala, Sweden.
¹³ Freiburg Institute for Advanced Studies (FRIAS), University of Freiburg, Freiburg, Germany.
¹⁴ German Cancer Consortium (DKTK) and German Cancer Research Center (DKFZ), Heidelberg, Germany.

PMID: 39126236
DOI: 10.1002/pmic.202300491

Abstract

State-of-the-art mass spectrometers combined with modern bioinformatics algorithms for peptide-to-spectrum matching (PSM) with robust statistical scoring allow for more variable features (i.e., post-translational modifications) being reliably identified from (tandem-) mass spectrometry data, often without the need for biochemical enrichment. Semi-specific proteome searches, that enforce a theoretical enzymatic digestion to solely the N- or C-terminal end, allow to identify of native protein termini or those arising from endogenous proteolytic activity (also referred to as "neo-N-termini" analysis or "N-terminomics"). Nevertheless, deriving biological meaning from these search outputs can be challenging in terms of data mining and analysis. Thus, we introduce TermineR, a data analysis approach for the (1) annotation of peptides according to their enzymatic cleavage specificity and known protein processing features, (2) differential abundance and enrichment analysis of N-terminal sequence patterns, and (3) visualization of neo-N-termini location. We illustrate the use of TermineR by applying it to tandem mass tag (TMT)-based proteomics data of a mouse model of polycystic kidney disease, and assess the semi-specific searches for biological interpretation of cleavage events and the variable contribution of proteolytic products to general protein abundance. The TermineR approach and example data are available as an R package at https://github.com/MiguelCos/TermineR.

Keywords: data processing; polycystic kidney disease; proteolysis; terminomics.

MeSH terms

Algorithms
Animals
Databases, Protein
Mice
Peptides / analysis
Peptides / chemistry
Peptides / metabolism
Polycystic Kidney Diseases / metabolism
Protein Processing, Post-Translational
Proteolysis*
Proteome / analysis
Proteome / metabolism
Proteomics* / methods
Software
Tandem Mass Spectrometry* / methods

Substances

Proteome
Peptides

Abstract

MeSH terms

Substances

Grants and funding