Customizing scoring functions for docking
- 14 February 2008
- journal article
- research article
- Published by Springer Nature in Journal of Computer-Aided Molecular Design
- Vol. 22 (5), 269-286
- https://doi.org/10.1007/s10822-008-9174-y
Abstract
Empirical scoring functions used in protein-ligand docking calculations are typically trained on a dataset of complexes with known affinities with the aim of generalizing across different docking applications. We report a novel method of scoring-function optimization that supports the use of additional information to constrain scoring function parameters, which can be used to focus a scoring function’s training towards a particular application, such as screening enrichment. The approach combines multiple instance learning, positive data in the form of ligands of protein binding sites of known and unknown affinity and binding geometry, and negative (decoy) data of ligands thought not to bind particular protein binding sites or known not to bind in particular geometries. Performance of the method for the Surflex-Dock scoring function is shown in cross-validation studies and in eight blind test cases. Tuned functions optimized with a sufficient amount of data exhibited either improved or undiminished screening performance relative to the original function across all eight complexes. Analysis of the changes to the scoring function suggest that modifications can be learned that are related to protein-specific features such as active-site mobility.Keywords
This publication has 33 references indexed in Scilit:
- Benchmarking Sets for Molecular DockingJournal of Medicinal Chemistry, 2006
- The PDBbind Database: Methodologies and UpdatesJournal of Medicinal Chemistry, 2005
- Surflex: Fully Automatic Flexible Molecular Docking Using a Molecular Similarity-Based Search EngineJournal of Medicinal Chemistry, 2003
- Acetylcholinesterase Complexed with Bivalent Ligands Related to Huperzine A: Experimental Evidence for Species-Dependent Protein−Ligand ComplementarityJournal of the American Chemical Society, 2002
- Knowledge-based scoring function to predict protein-ligand interactionsJournal of Molecular Biology, 2000
- SCORE: A New Empirical Method for Estimating the Binding Affinity of a Protein-Ligand ComplexJournal of Molecular Modeling, 1998
- Development and validation of a genetic algorithm for flexible docking 1 1Edited by F. E. CohenJournal of Molecular Biology, 1997
- Solving the multiple instance problem with axis-parallel rectanglesArtificial Intelligence, 1997
- A Fast Flexible Docking Method using an Incremental Construction AlgorithmJournal of Molecular Biology, 1996
- A geometric approach to macromolecule-ligand interactionsJournal of Molecular Biology, 1982