A new test set for validating predictions of protein–ligand interaction
- 8 October 2002
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 49 (4), 457-471
- https://doi.org/10.1002/prot.10232
Abstract
We present a large test set of protein–ligand complexes for the purpose of validating algorithms that rely on the prediction of protein–ligand interactions. The set consists of 305 complexes with protonation states assigned by manual inspection. The following checks have been carried out to identify unsuitable entries in this set: (1) assessing the involvement of crystallographically related protein units in ligand binding; (2) identification of bad clashes between protein side chains and ligand; and (3) assessment of structural errors, and/or inconsistency of ligand placement with crystal structure electron density. In addition, the set has been pruned to assure diversity in terms of protein–ligand structures, and subsets are supplied for different protein-structure resolution ranges. A classification of the set by protein type is available. As an illustration, validation results are shown for GOLD and SuperStar. GOLD is a program that performs flexible protein–ligand docking, and SuperStar is used for the prediction of favorable interaction sites in proteins. The new CCDC/Astex test set is freely available to the scientific community (http://www.ccdc.cam.ac.uk). Proteins 2002;49:457–471.