Predicting surface exposure of amino acids from protein sequence
- 1 August 1990
- journal article
- research article
- Published by Oxford University Press (OUP) in Protein Engineering, Design and Selection
- Vol. 3 (8), 659-665
- https://doi.org/10.1093/protein/3.8.659
Abstract
The amino acid residues on a protein surface play a key role in interaction with other molecules, determine many physical properties, and constrain the structure of the folded protein. A database of monomeric protein crystal structures was used to teach computer-simulated neural networks rules for predicting surface exposure from local sequence. These trained networks are able to correctly predict surface exposure for 72% of residues in a testing set using a binary model (buried/exposed) and for 54% of residues using a ternary model (buried/intermediate/exposed). In the ternary model, only 11% of the exposed residues are predicted as buried and only 5% of the buried residues are predicted as exposed. Also, since the networks are able to predict exposure with a quantitative confidence estimate, it is possible to assign exposure for over half of the residues in a binary model with >80% accuracy. Even more accurate predictions are obtained by making a consensus prediction of exposure for a homologous family. The effect of the local environment of an amino acid on its accessibility, though smaller than expected, is significant and accounts for the higher success rate of prediction than obtained with previously used criteria. In the absence of a three-dimensional structure, the ability to predict surface accessibility of amino acids directly from the sequence is a valuable tool in choosing sites of chemical modification or specific mutations and in studies of molecular interaction.This publication has 13 references indexed in Scilit:
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988
- Interior and surface of monomeric proteinsJournal of Molecular Biology, 1987
- Tertiary templates for proteinsJournal of Molecular Biology, 1987
- Local sequence patterns of hydrophobicity and solvent accessibility in soluble globular proteinsBiopolymers, 1987
- Solvent accessible surface area and excluded volume in proteinsJournal of Molecular Biology, 1984
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Correlation between chemical modification and surface accessibility in yeast phenylalanine transfer RNABiopolymers, 1983
- Prediction of protein antigenic determinants from amino acid sequences.Proceedings of the National Academy of Sciences, 1981
- Solvent-accessible surfaces of nucleic acidsJournal of Molecular Biology, 1979
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977