Cascaded multiple classifiers for secondary structure prediction

Open Access

1 January 2000

journal article
Published by Wiley in Protein Science

Vol. 9 (6), 1162-1176
https://doi.org/10.1110/ps.9.6.1162

Abstract

We describe a new classifier for protein secondary structure prediction that is formed by cascading together different types of classifiers using neural networks and linear discrimination. The new classifier achieves an accuracy of 76.7% (assessed by a rigorous full jackknife procedure) on a new nonredundant dataset of 496 nonhomologous sequences (obtained from G.J. Barton and J.A. Cuff). This database was designed specifically to train and test protein secondary structure prediction methods, and it uses a more stringent definition of homologous sequence than previous studies. We show that it is possible to design classifiers that discriminate well among the three classes (H, E, C), with an accuracy of up to 78% for β‐strands, using only a local window and resampling techniques. This indicates that the importance of long‐range interactions for the prediction of β‐strands has probably been overestimated in previous work.
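The idea of cascading classifiers over a local window can be illustrated with a minimal sketch. This is not the authors' implementation: the window sizes, the least-squares linear discriminant used for both stages, the helper names (`encode_windows`, `window_scores`, `fit_linear`, `predict_scores`), and the toy sequence with its labels are all assumptions made for illustration. The abstract does not specify these details, and the paper combines neural networks with linear discrimination rather than using two linear stages.

```python
import numpy as np

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
CLASSES = "HEC"  # helix, strand, coil


def encode_windows(seq, half_width=6):
    """One-hot encode a local window around each residue.

    Window positions that fall off either end of the sequence stay all-zero.
    """
    n, k = len(seq), len(AMINO_ACIDS)
    width = 2 * half_width + 1
    X = np.zeros((n, width, k))
    for i in range(n):
        for j in range(width):
            pos = i + j - half_width
            if 0 <= pos < n:
                idx = AMINO_ACIDS.find(seq[pos])
                if idx >= 0:  # unknown residue letters are left as zeros
                    X[i, j, idx] = 1.0
    return X.reshape(n, width * k)


def window_scores(scores, half_width=3):
    """Stack neighbouring per-residue class scores as input to the next stage."""
    n, c = scores.shape
    pad = np.zeros((half_width, c))
    padded = np.vstack([pad, scores, pad])
    return np.hstack([padded[j:j + n] for j in range(2 * half_width + 1)])


def fit_linear(X, labels):
    """Least-squares linear discriminant (a stand-in for any stage classifier)."""
    Y = np.eye(len(CLASSES))[labels]               # one-hot targets
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])  # append a bias column
    W, *_ = np.linalg.lstsq(Xb, Y, rcond=None)
    return W


def predict_scores(W, X):
    """Return one score per class for every residue."""
    Xb = np.hstack([X, np.ones((X.shape[0], 1))])
    return Xb @ W


# Toy example: the sequence and its labels are invented, not real data.
seq = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
ss = "CCHHHHHHHHHHCCEEEEECCCHHHHHHHCEEC"
labels = np.array([CLASSES.index(c) for c in ss])

# Stage 1: sequence window -> per-residue class scores.
X1 = encode_windows(seq)
W1 = fit_linear(X1, labels)
s1 = predict_scores(W1, X1)

# Stage 2: window of stage-1 scores -> refined class scores.
X2 = window_scores(s1)
W2 = fit_linear(X2, labels)
s2 = predict_scores(W2, X2)

pred = "".join(CLASSES[i] for i in s2.argmax(axis=1))
```

Here both stages are fitted and applied to the same toy sequence, so the output says nothing about accuracy. A jackknife assessment of the kind mentioned in the abstract would instead hold out each test protein in turn and train on the remaining ones.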

