Topology prediction for helical transmembrane proteins at 86% accuracy–Topology prediction at 86% accuracy
Open Access
- 1 August 1996
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 5 (8), 1704-1718
- https://doi.org/10.1002/pro.5560050824
Abstract
Previously, we introduced a neural network system predicting locations of transmembrane helices (HTMs) based on evolutionary profiles (PHDhtm, Rost B, Casadio R, Fariselli P, Sander C, 1995, Protein Sci 4:521–533). Here, we describe an improvement and an extension of that system. The improvement is achieved by a dynamic programming‐like algorithm that optimizes helices compatible with the neural network output. The extension is the prediction of topology (orientation of first loop region with respect to membrane) by applying to the refined prediction the observation that positively charged residues are more abundant in extra‐cytoplasmic regions. Furthermore, we introduce a method to reduce the number of false positives, i.e., proteins falsely predicted with membrane helices. The evaluation of prediction accuracy is based on a cross‐validation and a double‐blind test set (in total 131 proteins). The final method appears to be more accurate than other methods published: (1) For almost 89% (π3%) of the test proteins, all HTMs are predicted correctly. (2) For more than 86% (π3%) of the proteins, topology is predicted correctly. (3) We define reliability indices that correlate with prediction accuracy: for one half of the proteins, segment accuracy raises to 98%; and for two‐thirds, accuracy of topology prediction is 95%. (4) The rate of proteins for which HTMs are predicted falsely is below 2% (π1%). Finally, the method is applied to 1,616 sequences of Haemophilus influenzae. We predict 19% of the genome sequences to contain one or more HTMs. This appears to be lower than what we predicted previously for the yeast VIII chromosome (about 25%).Keywords
This publication has 64 references indexed in Scilit:
- Challenging times for bioinformaticsNature, 1995
- Refined Structure of the Porin from Rhodopseudomonas blastica: Comparison with the Porin from Rhodobacter capsulatusJournal of Molecular Biology, 1994
- Prediction of Transmembrane Segments in Proteins Utilising Multiple Sequence AlignmentsJournal of Molecular Biology, 1994
- Prediction of Protein Secondary Structure at Better than 70% AccuracyJournal of Molecular Biology, 1993
- Quadratic Minimization of Predictors for Protein Secondary StructureJournal of Molecular Biology, 1993
- Membrane protein structure predictionJournal of Molecular Biology, 1992
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Control of topology and mode of assembly of a polytopic membrane protein by positively charged residuesNature, 1989
- Dictionary of protein secondary structure: Pattern recognition of hydrogen‐bonded and geometrical featuresBiopolymers, 1983
- The protein data bank: A computer-based archival file for macromolecular structuresJournal of Molecular Biology, 1977