Protein distance constraints predicted by neural networks and probability density functions
Open Access
- 1 November 1997
- journal article
- research article
- Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection
- Vol. 10 (11), 1241-1248
- https://doi.org/10.1093/protein/10.11.1241
Abstract
We predict interatomic Calpha distances by two independent data driven methods. The first method uses statistically derived probability distributions of the pairwise distance between two amino acids, whilst the latter method consists of a neural network prediction approach equipped with windows taking the context of the two residues into account. These two methods are used to predict whether distances in independent test sets were above or below given thresholds. We investigate which distance thresholds produce the most information-rich constraints and, in turn, the optimal performance of the two methods. The predictions are based on a data set derived using a new threshold which defines when sequence similarity implies structural similarity. We show that distances in proteins are predicted more accurately by neural networks than by probability density functions. We show that the accuracy of the predictions can be further increased by using sequence profiles. A threading method based on the predicted distances is presented. A homepage with software, predictions and data related to this paper is available at http://www.cbs.dtu.dk/services/CPHmodels/.Keywords
This publication has 9 references indexed in Scilit:
- Protein structure prediction by threading methods: Evaluation of current techniquesProteins-Structure Function and Bioinformatics, 1995
- An Assessment of Amino Acid Exchange Matrices in Aligning Protein Sequences: The Twilight Zone RevisitedJournal of Molecular Biology, 1995
- Predicting the helix packing of globular proteins by self‐correcting distance geometryProtein Science, 1995
- Protein Structure Prediction: Recognition of Primary, Secondary, and Tertiary Structural Features from Amino Acid SequenceCritical Reviews in Biochemistry and Molecular Biology, 1995
- Compensating changes in protein multiple sequence alignmentsProtein Engineering, Design and Selection, 1994
- A Structural Basis for Sequence ComparisonsJournal of Molecular Biology, 1993
- Database of homology‐derived protein structures and the structural meaning of sequence alignmentProteins-Structure Function and Bioinformatics, 1991
- Knowledge-based prediction of protein structures and the design of novel moleculesNature, 1987
- Distance-constraint approach to protein folding. II. Prediction of three-dimensional structure of bovine pancreatic trypsin inhibitorProtein Journal, 1982