Prediction of protein folding class from amino acid composition
- 1 May 1993
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 16 (1), 79-91
- https://doi.org/10.1002/prot.340160109
Abstract
An empirical relation between the amino acid composition and three-dimensional folding pattern of several classes of proteins has been determined. Computer simulated neural networks have been used to assign proteins to one of the following classes based on their amino acid composition and size: (1) 4α-helical bundles, (2) parallel (α/β)8 barrels, (3) nucleotide binding fold, (4) immunoglobulin fold, or (5) none of these. Networks trained on the known crystal structures as well as sequences of closely related proteins are shown to correctly predict folding classes of proteins not represented in the training set with an average accuracy of 87%. Other folding motifs can easily be added to the prediction scheme once larger databases become available. Analysis of the neural network weights reveals that amino acids favoring prediction of a folding class are usually over represented in that class and amino acids with unfavorable weights are underrepresented in composition. The neural networks utilize combinations of these multiple small variations in amino acid composition in order to make a prediction. The favorably weighted amino acids in a given class also form the most intramolecular interactions with other residues in proteins of that class. A detailed examination of the contacts of these amino acids reveals some general patterns that may help stabilize each folding class.Keywords
This publication has 36 references indexed in Scilit:
- Predicting protein secondary structure content: A tandem neural network approachJournal of Molecular Biology, 1992
- Generalized protein tertiary structure recognition using associative memory hamiltoniansJournal of Molecular Biology, 1991
- Mechanism of the reaction catalyzed by mandelate racemase. 2. Crystal structure of mandelate racemase at 2.5-.ANG. resolution: identification of the active site and possible catalytic residuesBiochemistry, 1991
- Improvements in protein secondary structure prediction by an enhanced neural networkJournal of Molecular Biology, 1990
- A novel approach to prediction of the 3‐dimensional structures of protein backbones by neural networksFEBS Letters, 1990
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988
- Structure of the ColE1 Rop protein at 1.7 Å resolutionJournal of Molecular Biology, 1987
- Helix packing and subunit conformation in horse spleen apoferritinNature, 1980
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977
- A correlation between amino acid composition and protein structureJournal of Molecular Biology, 1964