CLOUDS, a protocol for deriving a molecular proton density via NMR

Abstract
We demonstrate the feasibility of computing realistic spatial proton distributions for proteins in solution from experimental NMR nuclear Overhauser effect data only and with minimal assignments. The method, CLOUDS, relies on precise and abundant interproton distance restraints calculated via a relaxation matrix analysis of sets of experimental nuclear Overhauser effect spectroscopy crosspeaks. The MIDGE protocol was adapted for this purpose. A gas of unassigned, unconnected H atoms is condensed into a structured proton distribution (cloud) via a molecular dynamics simulated-annealing scheme in which the internuclear distances and van der Waals repulsive terms are the only active restraints. Proton densities are generated by combining a large number of such clouds, each computed from a different trajectory. After filtering by reference to the cloud closest to the mean, a minimal dispersion proton density (foc) is identified. The latter affords a quasi-continuous hydrogen-only probability distribution that conveys immediate information on the protein surface topology (grooves, protrusions, potential binding site cavities, etc.), directly related to the molecular structure. Feasibility of the method was tested on NMR data measured on two globular protein domains of low regular secondary structure content, the col 2 domain of human matrix metalloproteinase-2 and the kringle 2 domain of human plasminogen, of 60 and 83 amino acid residues, respectively.