Toward better refinement of comparative models: Predicting loops in inexact environments

25 February 2008

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 72 (3), 959-971
https://doi.org/10.1002/prot.21990

Abstract

Achieving atomic‐level accuracy in comparative protein models is limited by our ability to refine the initial, homolog‐derived model closer to the native state. Despite considerable effort, progress in developing a generalized refinement method has been limited. In contrast, methods have been described that can accurately reconstruct loop conformations in native protein structures. We hypothesize that loop refinement in homology models is much more difficult than loop reconstruction in crystal structures, in part because side‐chain, backbone, and other structural inaccuracies surrounding the loop create a challenging sampling problem: the loop cannot be refined without simultaneously refining adjacent portions. In this work, we single out one sampling issue in an artificial but useful test set and examine how loop refinement accuracy is affected by errors in surrounding side chains. In 80 high‐resolution crystal structures, we first perturbed 6–12 residue loops away from the crystal conformation and placed all protein side chains in non‐native but low‐energy conformations. Even these relatively small perturbations in the surroundings made the loop prediction problem much more challenging. Using a previously published loop prediction method, median backbone (N‐Cα‐C‐O) RMSDs for groups of 6‐, 8‐, 10‐, and 12‐residue loops are 0.3/0.6/0.4/0.6 Å, respectively, on native structures and increase to 1.1/2.2/1.5/2.3 Å on the perturbed cases. We then augmented our previous loop prediction method to simultaneously optimize the rotamer states of side chains surrounding the loop. Our results show that this augmented loop prediction method can recover the native state in many perturbed structures where the previous method failed; the median RMSDs for the 6‐, 8‐, 10‐, and 12‐residue perturbed loops improve to 0.4/0.8/1.1/1.2 Å.
Finally, we highlight three comparative models from blind tests in which our new method predicted loops closer to the native conformation than the loops first modeled from the homolog template, a task generally understood to be difficult. Although many challenges remain in refining full comparative models to high accuracy, this work offers a methodical step toward that goal. Proteins 2008.
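
The backbone RMSD figures quoted in the abstract can be illustrated with a minimal sketch. It assumes the predicted and native loops are already expressed in a common coordinate frame (no superposition step is performed) and that each loop is given as a list of (x, y, z) tuples for its N, Cα, C, and O atoms in matching order. The function names `backbone_rmsd` and `median_rmsd_by_length` are hypothetical and are not taken from the paper or its software.

```python
import math
from statistics import median


def backbone_rmsd(pred, ref):
    """Root-mean-square deviation between two equally ordered atom lists.

    pred, ref: sequences of (x, y, z) tuples in Angstroms, assumed to be
    in the same frame already (an assumption of this sketch).
    """
    if len(pred) != len(ref) or len(pred) == 0:
        raise ValueError("atom lists must be non-empty and of equal length")
    squared_sum = 0.0
    for (px, py, pz), (rx, ry, rz) in zip(pred, ref):
        squared_sum += (px - rx) ** 2 + (py - ry) ** 2 + (pz - rz) ** 2
    return math.sqrt(squared_sum / len(pred))


def median_rmsd_by_length(results):
    """Group (loop_length, rmsd) pairs by loop length and take the median."""
    groups = {}
    for length, rmsd in results:
        groups.setdefault(length, []).append(rmsd)
    return {length: median(values) for length, values in sorted(groups.items())}
```

Reporting the median per loop length, as the abstract does, keeps a few badly mispredicted loops from dominating the summary for each group.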

Funding Information

NIH (GM52018, GM81710, P41 RR-01081)
Sandler Program in the Basic Sciences
Sloan Foundation
Genentech Scholars Program
NSF (MCB-0346399)

Cited by 80 articles