Assessment of the assessment: evaluation of the model quality estimates in CASP10

Andriy Kryshtafovych; Alessandro Barbato; Krzysztof Fidelis; Bohdan Monastyrskyy; Torsten Schwede; Anna Tramontano

doi:10.1002/prot.24347

Proteins. 2014 Feb;82 Suppl 2(0 2):112-26. doi: 10.1002/prot.24347. Epub 2013 Aug 31.

Authors

Andriy Kryshtafovych¹, Alessandro Barbato, Krzysztof Fidelis, Bohdan Monastyrskyy, Torsten Schwede, Anna Tramontano

Affiliation

¹ Genome Center, University of California, Davis, 95616 California, USA.

Abstract

The article presents an assessment of the ability of the 37 model quality assessment (MQA) methods participating in CASP10 to provide an a priori estimation of the quality of structural models, and of the 67 tertiary structure prediction groups to provide confidence estimates for their predicted coordinates. The assessment of MQA predictors is based on the methods used in previous CASPs, such as the correlation between the predicted and observed quality of the models (at both the global and local levels), the accuracy of methods in distinguishing between good and bad models as well as good and bad regions within them, and the ability to identify the best models in the decoy sets. Several numerical evaluations were used in our analysis for the first time, such as a comparison of global and local quality predictors with reference (baseline) predictors and a ROC analysis of the predictors' ability to differentiate between well and poorly modeled regions. To evaluate the reliability of self-assessment of the coordinate errors, we used the correlation between the predicted and observed deviations of the coordinates and a ROC analysis of correctly identified errors in the models. A modified two-stage procedure for testing MQA methods in CASP10, whereby a small number of models spanning the whole range of model accuracy was released first, followed by a larger number of models of more uniform quality, allowed a more thorough analysis of the strengths and weaknesses of different types of methods. Clustering methods were shown to have an advantage over the single- and quasi-single-model methods on the larger datasets. At the same time, the evaluation revealed that the size of the dataset has a smaller influence on the global quality assessment scores (for both clustering and nonclustering methods) than its diversity. Narrowing the quality range of the assessed models caused a significant decrease in the accuracy of ranking for global quality predictors but left the results for local predictors essentially unchanged. Self-assessment error estimates submitted by the majority of groups were poor overall, with two research groups showing significantly better results than the remaining ones.

Keywords: CASP; QA; model quality assessment; protein structure modeling; protein structure prediction.
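
The two kinds of measure named in the abstract, correlation between predicted and observed values and ROC analysis of well versus poorly modeled regions, can be illustrated with a minimal sketch. This is not the assessors' code: the per-residue numbers are invented for illustration, the function names `pearson` and `roc_auc` are hypothetical, and the 3.8 Å cutoff separating well from poorly modeled residues is an assumption that the abstract does not state.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


def roc_auc(scores, labels):
    """Area under the ROC curve via the rank (Mann-Whitney) formulation.

    Returns the fraction of (positive, negative) pairs in which the
    positive item has the higher score; ties count as half.
    """
    pos = [s for s, is_pos in zip(scores, labels) if is_pos]
    neg = [s for s, is_pos in zip(scores, labels) if not is_pos]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative label")
    wins = sum(
        1.0 if p > q else 0.5 if p == q else 0.0
        for p in pos
        for q in neg
    )
    return wins / (len(pos) * len(neg))


# Hypothetical per-residue data for one model (values in angstroms).
predicted_error = [0.8, 1.5, 2.0, 6.0, 9.5, 3.0]
observed_error = [1.0, 1.2, 4.5, 7.0, 8.0, 2.5]

# Assumed cutoff: a residue counts as poorly modeled above this deviation.
CUTOFF = 3.8
labels = [obs > CUTOFF for obs in observed_error]

r = pearson(predicted_error, observed_error)
auc = roc_auc(predicted_error, labels)
print(f"Pearson r = {r:.3f}, ROC AUC = {auc:.3f}")
```

An AUC of 0.5 corresponds to a predictor that cannot separate the two classes, and 1.0 to perfect separation. The same two functions apply to global scores by replacing per-residue values with per-model ones.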

Publication types

Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't

MeSH terms

Computational Biology / methods*
Models, Molecular
Models, Statistical*
Protein Conformation*
Proteins / chemistry*
ROC Curve
Sequence Analysis, Protein

Substances

Proteins
