Computational prediction of protein aggregation: Advances in proteomics, conformation-specific algorithms and biotechnological applications

Comput Struct Biotechnol J. 2020 Jun 10:18:1403-1413. doi: 10.1016/j.csbj.2020.05.026. eCollection 2020.

Abstract

Protein aggregation is a widespread phenomenon that stems from the establishment of non-native intermolecular contacts resulting in protein precipitation. Despite its deleterious impact on fitness, protein aggregation is a generic property of polypeptide chains, indissociable from protein structure and function. Protein aggregation is behind the onset of neurodegenerative disorders and one of the serious obstacles in the production of protein-based therapeutics. The development of computational tools opened a new avenue to rationalize this phenomenon, enabling prediction of the aggregation propensity of individual proteins as well as proteome-wide analysis. These studies spotted aggregation as a major force driving protein evolution. Actual algorithms work on both protein sequences and structures, some of them accounting also for conformational fluctuations around the native state and the protein microenvironment. This toolbox allows to delineate conformation-specific routines to assist in the identification of aggregation-prone regions and to guide the optimization of more soluble and stable biotherapeutics. Here we review how the advent of predictive tools has change the way we think and address protein aggregation.

Keywords: A3D, AGGRESCAN3D; APRs, Aggregation-prone regions; Amyloid; Bioinformatics; DI, Developability index; Evolution; IAPP, Islet amyloid polypeptide; IDPs, Intrinsically disordered proteins; Protein aggregation; Protein production; Protein structure; Proteomics; SAP, Spatial aggregation propensity; STAP, STructural Aggregation-Prone region; mAbs, Monoclonal antibodies.

Publication types

  • Review