Threading a database of protein cores
- 1 November 1995
- journal article
- research article
- Published by Wiley in Proteins-Structure Function and Bioinformatics
- Vol. 23 (3), 356-369
- https://doi.org/10.1002/prot.340230309
Abstract
We present an analysis of 10 blind predictions prepared for a recent conference, “Critical Assessment of Techniques for Protein Structure Prediction.”1 The sequences of these proteins are not detectably similar to those of any protein in the structure database then available, but we attempted, by a threading method, to recognize similarity to known domain folds. Four of the 10 proteins, as we subsequently learned, do indeed show significant similarity to then‐known structures. For 2 of these proteins the predictions were accurate, in the sense that a similar structure was at or near the top of the list of threading scores, and the threading alignment agreed well with the corresponding structural alignment. For the best predicted model mean alignment error relative to the optimal structural alignment was 2.7 residues, arising entirely from small “register shifts” of strands or helices. In the analysis we attempt to identify factors responsible for these successes and failures. Since our threading method does not use gap penalties, we may readily distinguish between errors arising from our prior definition of the “cores” of known structures and errors arising from inherent limitations in the threading potential. It would appear from the results that successful substructure recognition depends most critically on accurate definition of the “fold” of a database protein. This definition must correctly delineate substructures that are, and are not, likely to be conserved during protein evolution.Keywords
This publication has 39 references indexed in Scilit:
- Automated comparative modelling of protein structuresCurrent Opinion in Biotechnology, 1994
- De novo protein design using pairwise potentials and a genetic algorithmProtein Science, 1994
- The three‐dimensional profile method using residue preference as a continuous function of residue environmentProtein Science, 1994
- Prediction of Protein Structure by Evaluation of Sequence-structure Fitness: Aligning Sequences to Contact Profiles Derived from Three-dimensional StructuresJournal of Molecular Biology, 1993
- Alignment and Searching for Common Protein Folds Using a Data Bank of Structural TemplatesJournal of Molecular Biology, 1993
- Identification of Tertiary Structure Resemblance in Proteins Using a Maximal Common Subgraph Isomorphism AlgorithmJournal of Molecular Biology, 1993
- Topology fingerprint approach to the inverse protein folding problemJournal of Molecular Biology, 1992
- A new approach to protein fold recognitionNature, 1992
- Prediction of progress at lastNature, 1991
- Refined crystal structure of ferredoxin II from Desulfovibrio gigas at 1·7 ÅJournal of Molecular Biology, 1991