MAPPER: a search engine for the computational identification of putative transcription factor binding sites in multiple genomes
Open Access
- 30 March 2005
- journal article
- review article
- Published by Springer Nature in BMC Bioinformatics
- Vol. 6 (1), 79
- https://doi.org/10.1186/1471-2105-6-79
Abstract
Background: Cis-regulatory modules are combinations of regulatory elements occurring in close proximity to each other that control the spatial and temporal expression of genes. The ability to identify them in a genome-wide manner depends on the availability of accurate models and of search methods able to detect putative regulatory elements with enhanced sensitivity and specificity. Results: We describe the implementation of a search method for putative transcription factor binding sites (TFBSs) based on hidden Markov models built from alignments of known sites. We built 1,079 models of TFBSs using experimentally determined sequence alignments of sites provided by the TRANSFAC and JASPAR databases and used them to scan sequences of the human, mouse, fly, worm and yeast genomes. In several cases tested the method identified correctly experimentally characterized sites, with better specificity and sensitivity than other similar computational methods. Moreover, a large-scale comparison using synthetic data showed that in the majority of cases our method performed significantly better than a nucleotide weight matrix-based method. Conclusion: The search engine, available at http://mapper.chip.org, allows the identification, visualization and selection of putative TFBSs occurring in the promoter or other regions of a gene from the human, mouse, fly, worm and yeast genomes. In addition it allows the user to upload a sequence to query and to build a model by supplying a multiple sequence alignment of binding sites for a transcription factor of interest. Due to its extensive database of models, powerful search engine and flexible interface, MAPPER represents an effective resource for the large-scale computational analysis of transcriptional regulation.Keywords
This publication has 95 references indexed in Scilit:
- Eukaryotic Regulatory Element Conservation Analysis and Identification Using Comparative GenomicsGenome Research, 2004
- Transcription regulation and animal diversityNature, 2003
- The UCSC Genome Browser DatabaseNucleic Acids Research, 2003
- An algorithm for finding protein–DNA binding sites with applications to chromatin- immunoprecipitation microarray experimentsNature Biotechnology, 2002
- Finding Motifs Using Random ProjectionsJournal of Computational Biology, 2002
- Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genomeProceedings of the National Academy of Sciences, 2002
- Human-mouse genome comparisons to locate regulatory sitesNature Genetics, 2000
- Computational identification of Cis -regulatory elements associated with groups of functionally related genes in Saccharomyces cerevisiae 1 1Edited by F. E. CohenJournal of Molecular Biology, 2000
- Identification of regulatory regions which confer muscle-specific gene expressionJournal of Molecular Biology, 1998
- Analysis of the distribution of binding sites for a tissue-specific transcription factor in the vertebrate genome 1 1Edited by M. GottesmanJournal of Molecular Biology, 1997