A phylogenetic Gibbs sampler that yields centroid solutions for cis-regulatory site prediction
Open Access
- 8 May 2007
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 23 (14), 1718-1727
- https://doi.org/10.1093/bioinformatics/btm241
Abstract
Motivation: Identification of functionally conserved regulatory elements in sequence data from closely related organisms is becoming feasible, due to the rapid growth of public sequence databases. Closely related organisms are most likely to have common regulatory motifs; however, the recent speciation of such organisms results in the high degree of correlation in their genome sequences, confounding the detection of functional elements. Additionally, alignment algorithms that use optimization techniques are limited to the detection of a single alignment that may not be representative. Comparative-genomics studies must be able to address the phylogenetic correlation in the data and efficiently explore the alignment space, in order to make specific and biologically relevant predictions.
Results: We describe here a Gibbs sampler that employs a full phylogenetic model and reports an ensemble centroid solution. We describe regulatory motif detection using both simulated and real data, and demonstrate that this approach achieves improved specificity, sensitivity, and positive predictive value over non-phylogenetic algorithms, and over phylogenetic algorithms that report a maximum likelihood solution.
Availability: The software is freely available at http://bayesweb.wadsworth.org/gibbs/gibbs.html
Contact: William_Thompson_1@brown.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
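The "ensemble centroid solution" named in the Results can be illustrated with a small sketch. It is not the authors' implementation and leaves out the phylogenetic model entirely. It assumes only that a sampler has already produced a collection of sampled site sets, and it uses the general property that, for unconstrained binary site indicators, the set minimizing the total Hamming distance to all samples consists of the positions present in more than half of them. The function names and the toy positions below are hypothetical.

```python
from collections import Counter


def centroid_sites(samples, threshold=0.5):
    """Return the positions that occur in more than `threshold` of the samples.

    With threshold=0.5 and no constraints between sites, this set minimizes
    the summed Hamming distance to the sampled site sets.
    """
    counts = Counter(pos for sample in samples for pos in set(sample))
    n = len(samples)
    return sorted(pos for pos, count in counts.items() if count / n > threshold)


def total_hamming(candidate, samples):
    """Sum of symmetric-difference sizes between a candidate and every sample."""
    cand = set(candidate)
    return sum(len(cand ^ set(sample)) for sample in samples)


# Hypothetical sampler output: each set holds predicted site start positions.
samples = [
    {10, 42},
    {10, 42, 77},
    {10, 43},
    {10, 42},
    {11, 42},
]

centroid = centroid_sites(samples)
print(centroid)                          # positions kept in the centroid
print(total_hamming(centroid, samples))  # its total distance to the ensemble
```

In this toy ensemble the centroid keeps only the positions that most samples agree on, and its total distance to the ensemble is no larger than that of any individual sample. A practical sampler would additionally have to respect constraints such as non-overlapping sites, which this sketch ignores.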