Protein loop structure prediction with flexible stem geometries
- 17 November 2005
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 61 (4), 748-762
- https://doi.org/10.1002/prot.20669
Abstract
The structure prediction of loops with flexible stem residues is addressed in this article. While the secondary structure of the stem residues is assumed to be known, the geometry of the protein into which the loop must fit is considered to be unknown in our methodology. As a consequence, the compatibility of the loop with the remainder of the protein is not used as a criterion to reject loop decoys. The loop structure prediction with flexible stems is more difficult than fitting loops into a known protein structure in that a larger conformational space has to be covered. The main focus of the study is to assess the precision of loop structure prediction if no information on the protein geometry is available. The proposed approach is based on (1) dihedral angle sampling, (2) structure optimization by energy minimization with a physically based energy function, (3) clustering, and (4) a comparison of strategies for the selection of loops identified in (3). Steps (1) and (2) have similarities to previous approaches to loop structure prediction with fixed stems. Step (3) is based on a new iterative approach to clustering that is tailored for the loop structure prediction problem with flexible stems. In this new approach, clustering is not only used to identify conformers that are likely to be close to the native structure, but clustering is also employed to identify far-from-native decoys. By discarding these decoys iteratively, the overall quality of the ensemble and the loop structure prediction is improved. Step (4) provides a comparative study of criteria for loop selection based on energy, colony energy, cluster density, and a hybrid criterion introduced here. The proposed method is tested on a large set of 3215 loops from proteins in the PdbSelect25 set and to 179 loops from proteins from the Casp6 experiment. Proteins 2005.Keywords
This publication has 26 references indexed in Scilit:
- Modeling structurally variable regions in homologous proteins with rosettaProteins-Structure Function and Bioinformatics, 2004
- High‐resolution prediction of protein helix positions and orientationsProteins-Structure Function and Bioinformatics, 2004
- A hierarchical approach to all‐atom protein loop predictionProteins-Structure Function and Bioinformatics, 2004
- Ab initio construction of polypeptide fragments: Efficient generation of accurate, representative ensemblesProteins-Structure Function and Bioinformatics, 2003
- Ab initio construction of polypeptide fragments: Accuracy of loop decoy discrimination by an all‐atom statistical potential and the AMBER force field with the Generalized Born solvation modelProteins-Structure Function and Bioinformatics, 2003
- Evaluating conformational free energies: The colony energy and its application to the problem of loop predictionProceedings of the National Academy of Sciences, 2002
- Modeling of loops in protein structuresProtein Science, 2000
- Prediction of protein side-chain rotamers from a backbone-dependent rotamer library: a new homology modeling toolJournal of Molecular Biology, 1997
- Origins of structural diversity within sequentially identical hexapeptidesProtein Science, 1993
- Prediction of the folding of short polypeptide segments by uniform conformational samplingBiopolymers, 1987