An algorithm for predicting protein–protein interaction sites: Abnormally exposed amino acid residues and secondary structure elements

Open Access

1 May 2006

journal article
Published by Wiley in Protein Science

Vol. 15 (5), 1017-1029
https://doi.org/10.1110/ps.051589106

Abstract

Multiprotein systems mediate most regulatory processes in living organisms. Although the structures of the individual proteins are often defined, less is known of the structures of multiprotein systems. Computational methods for predicting interfaces, using evolutionary conservation and/or physicochemical data, have been developed. Here we consider the use of solvent accessibility, residue propensity, and hydrophobicity, in conjunction with secondary structure data, as prediction parameters. We analyze the influence of residue type and secondary structure on solvent accessibility and define a measure of “relative exposedness.” Clustering abnormally high scoring residues provides a basis for predicting interaction sites. The analysis is extended to investigate abnormally exposed secondary structure elements, particularly β‐sheet strands. We show that surface‐exposed β‐strands lacking protective features are more likely to be found at protein–protein interfaces, allowing us to create an algorithm with ∼68% and ∼75% accuracy in differentiating between interacting and edge strands in isolated β‐strands and β‐sheet strands, respectively. These methods of identifying abnormally exposed surface regions are combined in an algorithm, which, on a data set of 77 unbound and disjoint (single chain extracted from complex) structures, predicts 79% of the protein–protein interfaces correctly. If enzyme–inhibitor complexes, where the inhibitor mimics a nonprotein substrate, are excluded, the accuracy increases to 85%.

Keywords

This publication has 49 references indexed in Scilit:

Distinguishing Structural and Functional Restraints in Evolution in Order to Identify Interaction Sites
Journal of Molecular Biology, 2004
Twist and shear in β-sheets and β-ribbons
Journal of Molecular Biology, 2002
Functional organization of the yeast proteome by systematic analysis of protein complexes
Nature, 2002
The Protein Data Bank
Nucleic Acids Research, 2000
Gapped BLAST and PSI-BLAST: a new generation of protein database search programs
Nucleic Acids Research, 1997
Analysis of protein-protein interaction sites using surface patches 1 1Edited by G.Von Heijne
Journal of Molecular Biology, 1997
Prediction of protein-protein interaction sites using patch analysis 1 1Edited by G. von Heijne
Journal of Molecular Biology, 1997
AQUA and PROCHECK-NMR: Programs for checking the quality of protein structures solved by NMR
Journal of Biomolecular NMR, 1996
Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical features
Biopolymers, 1983
Prediction of protein antigenic determinants from amino acid sequences.
Proceedings of the National Academy of Sciences, 1981

Cited by 53 articles