The DEF data base of sequence based protein fold class predictions.

1 September 1994

journal article

Vol. 22 (17), 3616-9

Abstract

A new method for predicting protein fold-classes and protein domains from sequence data is constructed and used for generating a data base of protein fold-class assignments. Any given sequence of amino acids is assigned a specific prediction of one out of 45 typical protein fold-classes, a prediction of one out of 4 super fold-classes for the content of secondary structures and a profile of fold-class predictions along the sequence. The prediction accuracy for the super fold-classes is around 91% correct and 82% correct for the specific fold-classes. This accuracy is maintained down to a few percent of sequence identity.

This publication has 12 references indexed in Scilit:

Protein Structures from Distance Inequalities
Journal of Molecular Biology, 1993
Prediction of protein folding class from amino acid composition
Proteins-Structure Function and Bioinformatics, 1993
Protein tertiary structure recognition using optimized Hamiltonians with local interactions.
Proceedings of the National Academy of Sciences, 1992
A new approach to protein fold recognition
Nature, 1992
Cleaning up gene databases
Nature, 1990
Protein secondary structure prediction with a neural network.
Proceedings of the National Academy of Sciences, 1989
Protein secondary structure and homology by neural networks The α‐helices in rhodopsin
FEBS Letters, 1988
Predicting the secondary structure of globular proteins using neural network models
Journal of Molecular Biology, 1988
Comparison of the predicted and observed secondary structure of T4 phage lysozyme
Biochimica et Biophysica Acta (BBA) - Protein Structure, 1975

Cited by 19 articles