Improving the sensitivity of the sequence profile method
- 1 January 1994
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 3 (1), 139-146
- https://doi.org/10.1002/pro.5560030118
Abstract
The sequence profile method (Gribskov M, McLachlan AD, Eisenberg D, 1987, Proc Natl Acad Sci USA 84:4355-4358) is a powerful tool to detect distant relationships between amino acid sequences. A profile is a table of position-specific scores and gap penalties, providing a generalized description of a protein motif, which can be used for sequence alignments and database searches instead of an individual sequence. A sequence profile is derived from a multiple sequence alignment. We have found 2 ways to improve the sensitivity of sequence profiles: (1) Sequence weights: Using an individual weight for each sequence avoids bias toward closely related sequences. These weights are assigned automatically, based on the distances between the sequences, using a published procedure (Sibbald PR, Argos P, 1990, J Mol Biol 216:813-818). (2) Amino acid substitution table: In addition to the alignment, the construction of a profile also needs an amino acid substitution table. We have found that in some cases a new table, the BLOSUM45 table (Henikoff S, Henikoff JG, 1992, Proc Natl Acad Sci USA 89:10915-10919), is more sensitive than the original Dayhoff table or the modified Dayhoff table used in the current implementation. Profiles derived by the improved method are more sensitive and selective in a number of cases where previous methods have failed to completely separate true members from false positives.
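To make the role of the two ingredients concrete, the sketch below builds a small profile from a weighted alignment and a substitution table. It is an illustration under stated assumptions, not the authors' implementation: the function names `weighted_column_frequencies` and `build_profile` are hypothetical, the three-letter alphabet and the table values are toy numbers rather than BLOSUM45 or Dayhoff scores, the sequence weights are supplied by hand rather than computed with the Sibbald and Argos procedure, and gap penalties are left out entirely. Each profile score is taken as the weighted average of substitution scores over the residues observed in a column, which is one common reading of the "average" profile.

```python
from collections import defaultdict


def weighted_column_frequencies(alignment, weights):
    """For each alignment column, return {residue: weighted frequency}.

    Gap characters ('-') are skipped, and frequencies are normalised by the
    total weight of the non-gap residues in the column (a simplification).
    """
    length = len(alignment[0])
    columns = []
    for pos in range(length):
        counts = defaultdict(float)
        for seq, weight in zip(alignment, weights):
            residue = seq[pos]
            if residue != "-":
                counts[residue] += weight
        total = sum(counts.values())
        columns.append({r: c / total for r, c in counts.items()} if total else {})
    return columns


def build_profile(alignment, weights, subst, alphabet):
    """Return one {residue: score} row per column.

    score(column, a) = sum over observed residues b of freq(column, b) * subst[(a, b)]
    """
    profile = []
    for col in weighted_column_frequencies(alignment, weights):
        row = {a: sum(f * subst[(a, b)] for b, f in col.items()) for a in alphabet}
        profile.append(row)
    return profile


# Toy substitution table (hypothetical values): +2 for identity, -1 otherwise.
ALPHABET = "AGV"
TOY_SUBST = {(a, b): (2 if a == b else -1) for a in ALPHABET for b in ALPHABET}

# Two identical sequences and one divergent sequence.
ALIGNMENT = ["AG", "AG", "VG"]

uniform = build_profile(ALIGNMENT, [1.0, 1.0, 1.0], TOY_SUBST, ALPHABET)
# Hand-picked weights that down-weight the duplicated sequence (an assumption).
reweighted = build_profile(ALIGNMENT, [0.5, 0.5, 1.0], TOY_SUBST, ALPHABET)

print("uniform   :", uniform[0])
print("reweighted:", reweighted[0])
```

With uniform weights the first column favours A over V simply because A appears twice; once the duplicated sequence is down-weighted, A and V score equally in that column. Swapping `TOY_SUBST` for a real table changes only the lookup, which is why the choice of substitution table can be evaluated independently of the weighting scheme.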
This publication has 18 references indexed in Scilit:
- SH2 and SH3 domains: From structure to function. Cell, 1992
- The SWISS-PROT protein sequence data bank. Nucleic Acids Research, 1992
- Prosite: a dictionary of sites and patterns in proteins. Nucleic Acids Research, 1992
- Weighting aligned protein or nucleic acid sequences to correct for unequal representation. Journal of Molecular Biology, 1990
- [9] Profile analysis. Methods in Enzymology, 1990
- Recognition of functional regions in primary structures using a set of property patterns. FEBS Letters, 1989
- Weights for data related by a tree. Journal of Molecular Biology, 1989
- Comparison of biosequences. Advances in Applied Mathematics, 1981
- How different amino acid sequences determine similar protein structures: The structure and evolutionary dynamics of the globins. Journal of Molecular Biology, 1980