Resolution‐adapted recombination of structural features significantly improves sampling in restraint‐guided structure calculation
Open Access
- 9 November 2011
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 80 (3), 884-895
- https://doi.org/10.1002/prot.23245
Abstract
Recent work has shown that NMR structures can be determined by integrating sparse NMR data with structure prediction methods such as Rosetta. The experimental data serve to guide the search for the lowest energy state toward the deep minimum at the native state, which is frequently missed in Rosetta de novo structure calculations. However, as the protein size increases, sampling again becomes limiting; for example, the standard Rosetta protocol involving Monte Carlo fragment insertion starting from an extended chain fails to converge for proteins over 150 amino acids even with guidance from chemical shifts (CS‐Rosetta) and other NMR data. The primary limitation of this protocol, that every folding trajectory is completely independent of every other, was recently overcome with the development of a new approach involving resolution‐adapted structural recombination (RASREC). Here we describe the RASREC approach in detail and compare it to standard CS‐Rosetta. We show that the improved sampling of RASREC is essential in obtaining accurate structures over a benchmark set of 11 proteins in the 15‐25 kDa size range using chemical shifts, backbone RDCs and HN‐HN NOE data; in a number of cases the improved sampling methodology makes a larger contribution than incorporation of additional experimental data. Experimental data are invaluable for guiding sampling to the vicinity of the global energy minimum, but for larger proteins the standard Rosetta fold‐from‐extended‐chain protocol does not converge on the native minimum even with experimental data, and the more powerful RASREC approach is necessary to converge to accurate solutions. Proteins 2011.
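The contrast the abstract draws, between fully independent trajectories and trajectories that reuse structural features from earlier ones, can be illustrated with a toy sketch. This is not Rosetta or the RASREC implementation: the three-letter feature strings, the mismatch-count energy, and every function name below are assumptions made purely for illustration, and the abstract itself does not specify how features are selected or recombined.

```python
import random

def energy(conf, native):
    """Toy energy (assumption): number of positions that differ from the native string."""
    return sum(a != b for a, b in zip(conf, native))

def monte_carlo(conf, native, steps, rng):
    """Zero-temperature Monte Carlo: change one position, keep it if energy does not rise."""
    conf = list(conf)
    e = energy(conf, native)
    for _ in range(steps):
        i = rng.randrange(len(conf))
        old = conf[i]
        conf[i] = rng.choice("HEL")
        e_new = energy(conf, native)
        if e_new <= e:
            e = e_new
        else:
            conf[i] = old  # reject the move
    return conf

def independent_runs(native, n_traj, steps, rng):
    """Every trajectory starts from the same 'extended' state and shares nothing."""
    start = ["L"] * len(native)
    return [monte_carlo(start, native, steps, rng) for _ in range(n_traj)]

def recombine(pool, rng):
    """New starting point: each position's feature is copied from a random pool member."""
    return [rng.choice(pool)[i] for i in range(len(pool[0]))]

def pooled_runs(native, n_traj, steps, rounds, keep, rng):
    """Staged search: keep the best models, restart the rest from recombined features."""
    per_round = steps // rounds
    pool = independent_runs(native, n_traj, per_round, rng)
    for _ in range(rounds - 1):
        pool.sort(key=lambda c: energy(c, native))
        pool = pool[:keep]
        pool += [
            monte_carlo(recombine(pool, rng), native, per_round, rng)
            for _ in range(n_traj - keep)
        ]
    return pool

if __name__ == "__main__":
    rng = random.Random(0)
    native = list("HHHHHHLLEEEEELLEEEEELLHHHHHH")
    a = independent_runs(native, 10, 40, rng)
    b = pooled_runs(native, 10, 40, 4, 3, rng)
    print("best independent:", min(energy(c, native) for c in a))
    print("best pooled:     ", min(energy(c, native) for c in b))
```

Both strategies spend the same number of moves per trajectory; the pooled variant differs only in that later rounds start from features already found by the best earlier models rather than from the extended state. With such a simple, separable toy energy the numbers it prints say nothing about real proteins; the sketch only shows where information is shared between trajectories.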