SPICKER: A clustering approach to identify near‐native protein folds
- 1 March 2004
- journal article
- research article
- Published by Wiley in Journal of Computational Chemistry
- Vol. 25 (6), 865-871
- https://doi.org/10.1002/jcc.20011
Abstract
We have developed SPICKER, a simple and efficient strategy to identify near-native folds by clustering protein structures generated during computer simulations. In general, the most populated clusters tend to be closer to the native conformation than the lowest energy structures. To assess the generality of the approach, we applied SPICKER to 1489 representative benchmark proteins ≤200 residues that cover the PDB at the level of 35% sequence identity; each contains up to 280,000 structure decoys generated using the recently developed TASSER (Threading ASSembly Refinement) algorithm. The best of the top five identified folds has a root-mean-square deviation from native (RMSD) in the top 1.4% of all decoys. For 78% of the proteins, the difference in RMSD from native to the identified models and RMSD from native to the absolutely best individual decoy is below 1 Å; the majority belong to the targets with converged conformational distributions. Although native fold identification from divergent decoy structures remains a challenge, our overall results show significant improvement over our previous clustering algorithms. © 2004 Wiley Periodicals, Inc. J Comput Chem 25: 865–871, 2004Keywords
This publication has 11 references indexed in Scilit:
- TOUCHSTONE II: A New Approach to Ab Initio Protein Structure PredictionBiophysical Journal, 2003
- Local energy landscape flattening: Parallel hyperbolic Monte Carlo sampling of protein foldingProteins-Structure Function and Bioinformatics, 2002
- Prospects for ab initio protein structural genomicsJournal of Molecular Biology, 2001
- Recent improvements in prediction of protein structure by global optimization of a potential energy functionProceedings of the National Academy of Sciences, 2001
- Finding the needle in a haystack: educing native folds from ambiguousab initio protein structure predictionsJournal of Computational Chemistry, 2001
- Clustering of low-energy conformations near the native structures of small proteinsProceedings of the National Academy of Sciences, 1998
- Parallel tempering algorithm for conformational studies of biological moleculesChemical Physics Letters, 1997
- Comparative Protein Modelling by Satisfaction of Spatial RestraintsJournal of Molecular Biology, 1993
- Replica Monte Carlo Simulation of Spin-GlassesPhysical Review Letters, 1986
- Respective roles of short- and long-range interactions in protein folding.Proceedings of the National Academy of Sciences, 1978