A novel approach to the recognition of protein architecture from sequence using Fourier analysis and neural networks

Adrian J Shepherd; Denise Gorse; Janet M Thornton

doi:10.1002/prot.10290

A novel approach to the recognition of protein architecture from sequence using Fourier analysis and neural networks

Proteins. 2003 Feb 1;50(2):290-302. doi: 10.1002/prot.10290.

Authors

Adrian J Shepherd¹, Denise Gorse, Janet M Thornton

Affiliation

¹ Department of Biochemistry and Molecular Biology, University College London, London, United Kingdom. a.shepherd@biochem.ucl.ac.uk

PMID: 12486723
DOI: 10.1002/prot.10290

Abstract

A novel method is presented for the prediction of protein architecture from sequence using neural networks. The method involves the preprocessing of protein sequence data by numerically encoding it and then applying a Fourier transform. The encoded and transformed data are then used to train a neural network to recognize a number of different protein architectures. The method proved significantly better than comparable alternative strategies such as percentage dipeptide frequency, but is still limited by the size of the data set and the input demands of a neural network. Its main potential is as a complement to existing fold recognition techniques, with its ability to identify global symmetries within protein structures its greatest strength.

MeSH terms

Algorithms
Amino Acid Sequence
Benchmarking
Computational Biology / methods*
Computer Simulation*
Databases, Protein
Fourier Analysis*
Neural Networks, Computer*
Protein Folding
Protein Structure, Secondary
Protein Structure, Tertiary
Proteins / chemistry*

Substances

Proteins