Binding MOAD (Mother Of All Databases)

Abstract
Binding MOAD (Mother of All Databases) is the largest collection of high‐quality, protein–ligand complexes available from the Protein Data Bank. At this time, Binding MOAD contains 5331 protein–ligand complexes comprised of 1780 unique protein families and 2630 unique ligands. We have searched the crystallography papers for all 5000+ structures and compiled binding data for 1375 (26%) of the protein–ligand complexes. The binding‐affinity data ranges 13 orders of magnitude. This is the largest collection of binding data reported to date in the literature. We have also addressed the issue of redundancy in the data. To create a nonredundant dataset, one protein from each of the 1780 protein families was chosen as a representative. Representatives were chosen by tightest binding, best resolution, etc. For the 1780 “best” complexes that comprise the nonredundant version of Binding MOAD, 475 (27%) have binding data. This significant collection of protein–ligand complexes will be very useful in elucidating the biophysical patterns of molecular recognition and enzymatic regulation. The complexes with binding‐affinity data will help in the development of improved scoring functions and structure‐based drug discovery techniques. The dataset can be accessed at http://www.BindingMOAD.org. Proteins 2005.