Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutations
- 1 January 2001
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 45 (S5), 157-162
- https://doi.org/10.1002/prot.1173
Abstract
This article presents recent progress in predicting inter‐residue contacts of proteins with a neural network‐based method. Improvement over the results obtained at the previous CASP3 competition is attained by using as input to the network a complex code, which includes evolutionary information, sequence conservation, correlated mutations, and predicted secondary structures. The predictor was trained and cross‐validated on a data set comprising the contact maps of 173 non‐homologous proteins as computed from their well‐resolved three‐dimensional structures. The method could assign protein contacts with an average accuracy of 0.21 and with an improvement over a random predictor of a factor greater than 6, which is higher than that previously obtained with methods only based either on neural networks or on correlated mutations. Although far from being ideal, these scores are the highest reported so far for predicting protein contact maps. On 29 targets automatically predicted by the server (CORNET) the average accuracy is 0.14. The predictor is poorly performing on all‐α proteins, not represented in the training set. On all‐β and mixed proteins (22 targets) the average accuracy is 0.16. This set comprises proteins of different complexity and different chain length, suggesting that the predictor is capable of generalization over a broad number of features. Proteins 2001;Suppl 5:157–162.Keywords
This publication has 14 references indexed in Scilit:
- Predictions of protein segments with the same aminoacid sequence and different secondary structure: A benchmark for predictive methodsProteins-Structure Function and Bioinformatics, 2000
- A neural network based predictor of residue contacts in proteinsProtein Engineering, Design and Selection, 1999
- The HSSP database of protein structure-sequence alignments and family profilesNucleic Acids Research, 1998
- Recovery of protein structure from contact mapsFolding and Design, 1997
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programsNucleic Acids Research, 1997
- Correlated mutations contain information about protein-protein interaction 1 1Edited by A. R. FershtJournal of Molecular Biology, 1997
- Improving contact predictions by the combination of correlated mutations and other sources of sequence informationFolding and Design, 1997
- The prediction of protein contacts from multiple sequence alignmentsProtein Engineering, Design and Selection, 1996
- Correlated mutations and residue contacts in proteinsProteins-Structure Function and Bioinformatics, 1994
- Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c551Journal of Molecular Biology, 1971