The DEF data base of sequence based protein fold class predictions.
- 1 September 1994
- journal article
- Vol. 22 (17), 3616-9
Abstract
A new method for predicting protein fold-classes and protein domains from sequence data is constructed and used for generating a data base of protein fold-class assignments. Any given sequence of amino acids is assigned a specific prediction of one out of 45 typical protein fold-classes, a prediction of one out of 4 super fold-classes for the content of secondary structures and a profile of fold-class predictions along the sequence. The prediction accuracy for the super fold-classes is around 91% correct and 82% correct for the specific fold-classes. This accuracy is maintained down to a few percent of sequence identity.This publication has 12 references indexed in Scilit:
- Protein Structures from Distance InequalitiesJournal of Molecular Biology, 1993
- Prediction of protein folding class from amino acid compositionProteins-Structure Function and Bioinformatics, 1993
- Protein tertiary structure recognition using optimized Hamiltonians with local interactions.Proceedings of the National Academy of Sciences, 1992
- A new approach to protein fold recognitionNature, 1992
- Cleaning up gene databasesNature, 1990
- Protein secondary structure prediction with a neural network.Proceedings of the National Academy of Sciences, 1989
- Protein secondary structure and homology by neural networks The α‐helices in rhodopsinFEBS Letters, 1988
- Predicting the secondary structure of globular proteins using neural network modelsJournal of Molecular Biology, 1988
- Comparison of the predicted and observed secondary structure of T4 phage lysozymeBiochimica et Biophysica Acta (BBA) - Protein Structure, 1975