Cross-species comparison significantly improves genome-wide prediction of cis-regulatory modules in Drosophila
Open Access
- 9 September 2004
- journal article
- research article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 5 (1), 129
- https://doi.org/10.1186/1471-2105-5-129
Abstract
Background: The discovery of cis-regulatory modules in metazoan genomes is crucial for understanding the connection between genes and organism diversity. It is important to quantify how comparative genomics can improve computational detection of such modules.
Results: We run the Stubb software on the entire D. melanogaster genome, to obtain predictions of modules involved in segmentation of the embryo. Stubb uses a probabilistic model to score sequences for clustering of transcription factor binding sites, and can exploit multiple species data within the same probabilistic framework. The predictions are evaluated using publicly available gene expression data for thousands of genes, after careful manual annotation. We demonstrate that the use of a second genome (D. pseudoobscura) for cross-species comparison significantly improves the prediction accuracy of Stubb, and is a more sensitive approach than intersecting the results of separate runs over the two genomes. The entire list of predictions is made available online.
Conclusion: Evolutionary conservation of modules serves as a filter to improve their detection in silico. The future availability of additional fruitfly genomes therefore carries the prospect of highly specific genome-wide predictions using Stubb.
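As a rough illustration of what "scoring sequences for clustering of transcription factor binding sites" can mean, the sketch below slides a window along a sequence and sums the positive log-odds scores of candidate sites under a position weight matrix. This is not Stubb's probabilistic model and does not use a second genome; it is a much simpler stand-in. The matrix `TOY_PWM`, the uniform background frequency, and the function names are all hypothetical, chosen only for the example, and the input is assumed to contain only uppercase A, C, G and T.

```python
import math

# Hypothetical toy position weight matrix (PWM): per-position base probabilities.
TOY_PWM = [
    {"A": 0.7, "C": 0.1, "G": 0.1, "T": 0.1},
    {"A": 0.1, "C": 0.1, "G": 0.1, "T": 0.7},
    {"A": 0.1, "C": 0.1, "G": 0.7, "T": 0.1},
]
BACKGROUND = 0.25  # assumed uniform background base frequency


def site_log_odds(site, pwm=TOY_PWM):
    """Log-odds of one candidate site under the PWM versus the background."""
    return sum(math.log(col[base] / BACKGROUND) for col, base in zip(pwm, site))


def window_score(window, pwm=TOY_PWM):
    """Sum the positive site log-odds in a window, a crude proxy for site clustering."""
    k = len(pwm)
    scores = (site_log_odds(window[i:i + k], pwm) for i in range(len(window) - k + 1))
    return sum(s for s in scores if s > 0)


def scan(sequence, window_len, step=1, pwm=TOY_PWM):
    """Return (start, score) for each sliding window over the sequence."""
    return [
        (start, window_score(sequence[start:start + window_len], pwm))
        for start in range(0, len(sequence) - window_len + 1, step)
    ]
```

Calling `scan("ATGATGCCCCCC", 6, step=6)` scores two non-overlapping windows; the first, which contains two matches to the toy motif, scores higher than the second, which contains none. High-scoring windows would then be the candidate modules in this simplified picture.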