On the nature of cavities on protein surfaces: Application to the identification of drug‐binding sites
- 13 February 2006
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 63 (4), 892-906
- https://doi.org/10.1002/prot.20897
Abstract
In this article we introduce a new method for the identification and the accurate characterization of protein surface cavities. The method is encoded in the program SCREEN (Surface Cavity REcognition and EvaluatioN). As a first test of the utility of our approach we used SCREEN to locate and analyze the surface cavities of a nonredundant set of 99 proteins cocrystallized with drugs. We find that this set of proteins has on average about 14 distinct cavities per protein. In all cases, a drug is bound at one (and sometimes more than one) of these cavities. Using cavity size alone as a criterion for predicting drug-binding sites yields a high balanced error rate of 15.7%, with only 71.7% coverage. Here we characterize each surface cavity by computing a comprehensive set of 408 physicochemical, structural, and geometric attributes. By applying modern machine learning techniques (Random Forests) we were able to develop a classifier that can identify drug-binding cavities with a balanced error rate of 7.2% and coverage of 88.9%. Only 18 of the 408 cavity attributes had a statistically significant role in the prediction. Of these 18 important attributes, almost all involved size and shape rather than physicochemical properties of the surface cavity. The implications of these results are discussed. A SCREEN Web server is available at http://interface.bioc.columbia.edu/screen. Proteins 2006.Keywords
This publication has 84 references indexed in Scilit:
- Enzyme/Non-enzyme Discrimination and Prediction of Enzyme Active Site Location Using Charge-based MethodsJournal of Molecular Biology, 2004
- Heterogeneity and Inaccuracy in Protein Structures Solved by X-Ray CrystallographyStructure, 2004
- Analysing Six Types of Protein–Protein InterfacesJournal of Molecular Biology, 2003
- Prediction of functionally important residues based solely on the computed energetics of protein structure 1 1Edited by B. HonigJournal of Molecular Biology, 2001
- Anatomy of protein pockets and cavities: Measurement of binding site geometry and implications for ligand designProtein Science, 1998
- Analysis of protein-protein interaction sites using surface patches 1 1Edited by G.Von HeijneJournal of Molecular Biology, 1997
- Prediction of protein-protein interaction sites using patch analysis 1 1Edited by G. von HeijneJournal of Molecular Biology, 1997
- An Evolutionary Trace Method Defines Binding Surfaces Common to Protein FamiliesJournal of Molecular Biology, 1996
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- Computer analysis of protein-protein interactionJournal of Molecular Biology, 1978