Considerations for constructing a protein sequence database for metaproteomics

J Alfredo Blakeley-Ruiz; Manuel Kleiner

doi:10.1016/j.csbj.2022.01.018

Considerations for constructing a protein sequence database for metaproteomics

Comput Struct Biotechnol J. 2022 Jan 21:20:937-952. doi: 10.1016/j.csbj.2022.01.018. eCollection 2022.

Authors

J Alfredo Blakeley-Ruiz^{1

2}, Manuel Kleiner²

Affiliations

¹ Center for Gastrointestinal Biology and Disease, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
² Department of Plant and Microbial Biology, North Carolina State University, Raleigh, NC, USA.

Abstract

Mass spectrometry-based metaproteomics has emerged as a prominent technique for interrogating the functions of specific organisms in microbial communities, in addition to total community function. Identifying proteins by mass spectrometry requires matching mass spectra of fragmented peptide ions to a database of protein sequences corresponding to the proteins in the sample. This sequence database determines which protein sequences can be identified from the measurement, and as such the taxonomic and functional information that can be inferred from a metaproteomics measurement. Thus, the construction of the protein sequence database directly impacts the outcome of any metaproteomics study. Several factors, such as source of sequence information and database curation, need to be considered during database construction to maximize accurate protein identifications traceable to the species of origin. In this review, we provide an overview of existing strategies for database construction and the relevant studies that have sought to test and validate these strategies. Based on this review of the literature and our experience we provide a decision tree and best practices for choosing and implementing database construction strategies.

Keywords: Metagenomics; Metaproteome; Microbial community; Microbial ecology; Microbiome; Microbiota; Multi-omics.

Publication types

Review

Abstract

Publication types

Grants and funding