Threading a database of protein cores

1 November 1995

journal article
research article
Published by Wiley in Proteins-Structure Function and Bioinformatics

Vol. 23 (3), 356-369
https://doi.org/10.1002/prot.340230309

Abstract

We present an analysis of 10 blind predictions prepared for a recent conference, “Critical Assessment of Techniques for Protein Structure Prediction.”¹ The sequences of these proteins are not detectably similar to those of any protein in the structure database then available, but we attempted, by a threading method, to recognize similarity to known domain folds. Four of the 10 proteins, as we subsequently learned, do indeed show significant similarity to then‐known structures. For 2 of these proteins the predictions were accurate, in the sense that a similar structure was at or near the top of the list of threading scores, and the threading alignment agreed well with the corresponding structural alignment. For the best predicted model mean alignment error relative to the optimal structural alignment was 2.7 residues, arising entirely from small “register shifts” of strands or helices. In the analysis we attempt to identify factors responsible for these successes and failures. Since our threading method does not use gap penalties, we may readily distinguish between errors arising from our prior definition of the “cores” of known structures and errors arising from inherent limitations in the threading potential. It would appear from the results that successful substructure recognition depends most critically on accurate definition of the “fold” of a database protein. This definition must correctly delineate substructures that are, and are not, likely to be conserved during protein evolution.

Keywords

This publication has 39 references indexed in Scilit:

Automated comparative modelling of protein structures
Current Opinion in Biotechnology, 1994
De novo protein design using pairwise potentials and a genetic algorithm
Protein Science, 1994
The three‐dimensional profile method using residue preference as a continuous function of residue environment
Protein Science, 1994
Prediction of Protein Structure by Evaluation of Sequence-structure Fitness: Aligning Sequences to Contact Profiles Derived from Three-dimensional Structures
Journal of Molecular Biology, 1993
Alignment and Searching for Common Protein Folds Using a Data Bank of Structural Templates
Journal of Molecular Biology, 1993
Identification of Tertiary Structure Resemblance in Proteins Using a Maximal Common Subgraph Isomorphism Algorithm
Journal of Molecular Biology, 1993
Topology fingerprint approach to the inverse protein folding problem
Journal of Molecular Biology, 1992
A new approach to protein fold recognition
Nature, 1992
Prediction of progress at last
Nature, 1991
Refined crystal structure of ferredoxin II from Desulfovibrio gigas at 1·7 Å
Journal of Molecular Biology, 1991

Cited by 361 articles