Cascaded multiple classifiers for secondary structure prediction
Open Access
- 1 January 2000
- journal article
- Published by Wiley in Protein Science
- Vol. 9 (6), 1162-1176
- https://doi.org/10.1110/ps.9.6.1162
Abstract
We describe a new classifier for protein secondary structure prediction that is formed by cascading together different types of classifiers using neural networks and linear discrimination. The new classifier achieves an accuracy of 76.7% (assessed by a rigorous full jack‐knife procedure) on a new nonredundant dataset of 496 nonhomologous sequences (obtained from G.J. Barton and J.A. Cuff). This database was designed specifically to train and test protein secondary structure prediction methods, and it uses a more stringent definition of homologous sequences than previous studies. We show that it is possible to design classifiers that discriminate well among the three classes (H, E, C), with an accuracy of up to 78% for β‐strands, using only a local window and resampling techniques. This indicates that the importance of long‐range interactions for the prediction of β‐strands has probably been overestimated in the past.
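The abstract's three evaluation ingredients (a local sequence window, a jack‐knife test, and per‐residue three‐state accuracy) can be illustrated with a minimal Python sketch. This is not the authors' implementation. The function names, the one‐hot encoding, and the default half‐window of 6 residues are all hypothetical choices made for illustration. The sketch also assumes that the "full jack‐knife" holds out one sequence at a time. The cascaded neural‐network and linear‐discrimination stages themselves are not reproduced here.

```python
from typing import Iterator, List, Tuple

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def window_features(seq: str, half_window: int = 6) -> List[List[int]]:
    """One-hot encode a local window centred on each residue.

    Window positions that fall off either end of the chain, and
    non-standard residues, are encoded as all-zero blocks.
    (Hypothetical encoding; the half-window default is an assumption.)
    """
    index = {aa: i for i, aa in enumerate(AMINO_ACIDS)}
    features: List[List[int]] = []
    for centre in range(len(seq)):
        row: List[int] = []
        for pos in range(centre - half_window, centre + half_window + 1):
            block = [0] * len(AMINO_ACIDS)
            if 0 <= pos < len(seq) and seq[pos] in index:
                block[index[seq[pos]]] = 1
            row.extend(block)
        features.append(row)
    return features


def jackknife_splits(n_proteins: int) -> Iterator[Tuple[List[int], int]]:
    """Yield (training indices, held-out index), leaving one protein out per round."""
    for held_out in range(n_proteins):
        train = [i for i in range(n_proteins) if i != held_out]
        yield train, held_out


def q3_accuracy(predicted: str, observed: str) -> float:
    """Fraction of residues whose three-state label (H, E, C) is predicted correctly."""
    if len(predicted) != len(observed):
        raise ValueError("label strings must have equal length")
    if not observed:
        raise ValueError("label strings must be non-empty")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)
```

With a dataset of 496 sequences, `jackknife_splits(496)` would produce 496 rounds, each training on 495 sequences and testing on the remaining one. An overall accuracy can then be formed by pooling the per‐residue matches across rounds.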