Progress in predicting inter-residue contacts of proteins with neural networks and correlated mutations

1 January 2001

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 45 (S5), 157-162
https://doi.org/10.1002/prot.1173

Abstract

This article presents recent progress in predicting inter‐residue contacts of proteins with a neural network‐based method. Improvement over the results obtained at the previous CASP3 competition is attained by using as input to the network a complex code, which includes evolutionary information, sequence conservation, correlated mutations, and predicted secondary structures. The predictor was trained and cross‐validated on a data set comprising the contact maps of 173 non‐homologous proteins as computed from their well‐resolved three‐dimensional structures. The method could assign protein contacts with an average accuracy of 0.21 and with an improvement over a random predictor of a factor greater than 6, which is higher than that previously obtained with methods only based either on neural networks or on correlated mutations. Although far from being ideal, these scores are the highest reported so far for predicting protein contact maps. On 29 targets automatically predicted by the server (CORNET) the average accuracy is 0.14. The predictor is poorly performing on all‐α proteins, not represented in the training set. On all‐β and mixed proteins (22 targets) the average accuracy is 0.16. This set comprises proteins of different complexity and different chain length, suggesting that the predictor is capable of generalization over a broad number of features. Proteins 2001;Suppl 5:157–162.

Keywords

This publication has 14 references indexed in Scilit:

Predictions of protein segments with the same aminoacid sequence and different secondary structure: A benchmark for predictive methods
Proteins-Structure Function and Bioinformatics, 2000
A neural network based predictor of residue contacts in proteins
Protein Engineering, Design and Selection, 1999
The HSSP database of protein structure-sequence alignments and family profiles
Nucleic Acids Research, 1998
Recovery of protein structure from contact maps
Folding and Design, 1997
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucleic Acids Research, 1997
Correlated mutations contain information about protein-protein interaction 1 1Edited by A. R. Fersht
Journal of Molecular Biology, 1997
Improving contact predictions by the combination of correlated mutations and other sources of sequence information
Folding and Design, 1997
The prediction of protein contacts from multiple sequence alignments
Protein Engineering, Design and Selection, 1996
Correlated mutations and residue contacts in proteins
Proteins-Structure Function and Bioinformatics, 1994
Tests for comparing related amino-acid sequences. Cytochrome c and cytochrome c551
Journal of Molecular Biology, 1971

Cited by 79 articles