Gibbs motif sampling: Detection of bacterial outer membrane protein repeats
Open Access
- 1 August 1995
- journal article
- research article
- Published by Wiley in Protein Science
- Vol. 4 (8), 1618-1632
- https://doi.org/10.1002/pro.5560040820
Abstract
The detection and alignment of locally conserved regions (motifs) in multiple sequences can provide insight into protein structure, function, and evolution. A new Gibbs sampling algorithm is described that detects motif‐encoding regions in sequences and optimally partitions them into distinct motif models; this is illustrated using a set of immunoglobulin fold proteins. When applied to sequences sharing a single motif, the sampler can be used to classify motif regions into related submodels, as is illustrated using helix‐turn‐helix DNA‐binding proteins. Other statistically based procedures are described for searching a database for sequences matching motifs found by the sampler. When applied to a set of 32 very distantly related bacterial integral outer membrane proteins, the sampler revealed that they share a subtle, repetitive motif. Although BLAST (Altschul SF et al., 1990, J Mol Biol 215:403–410) fails to detect significant pairwise similarity between any of the sequences, the repeats present in these outer membrane proteins, taken as a whole, are highly significant (based on a generally applicable statistical test for motifs described here). Analysis of bacterial porins with known trimeric β‐barrel structure and related proteins reveals a similar repetitive motif corresponding to alternating membrane‐spanning β‐strands. These β‐strands occur on the membrane interface (as opposed to the trimeric interface) of the β‐barrel. The broad conservation and structural location of these repeats suggests that they play important functional roles.Keywords
This publication has 67 references indexed in Scilit:
- Bayesian Models for Multiple Local Sequence Alignment and Gibbs Sampling StrategiesJournal of the American Statistical Association, 1995
- The Immunoglobulin Fold: Structural Classification, Sequence Patterns and Common CoreJournal of Molecular Biology, 1994
- Detecting Patterns in Protein SequencesJournal of Molecular Biology, 1994
- Hidden Markov Models in Computational BiologyJournal of Molecular Biology, 1994
- Statistics of local complexity in amino acid sequences and sequence databasesComputers & Chemistry, 1993
- Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragmentsJournal of Molecular Biology, 1992
- The immunoglobulin familyCurrent Opinion in Structural Biology, 1991
- Carboxy-terminal phenylalanine is essential for the correct assembly of a bacterial outer membrane proteinJournal of Molecular Biology, 1991
- Basic local alignment search toolJournal of Molecular Biology, 1990
- Models for the structure of outer-membrane proteins of Escherichia coli derived from raman spectroscopy and prediction methodsJournal of Molecular Biology, 1986