A clustering property of highly-degenerate transcription factor binding sites in the mammalian genome
Open Access
- 1 January 2006
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (8), 2238-2246
- https://doi.org/10.1093/nar/gkl248
Abstract
Transcription factor binding sites (TFBSs) are short DNA sequences interacting with transcription factors (TFs), which regulate gene expression. Due to the relatively short length of such binding sites, it is largely unclear how the specificity of protein–DNA interaction is achieved. Here, we have performed a genome-wide analysis of TFBS-like sequences for the transcriptional repressor, RE1 Silencing Transcription Factor (REST), as well as for several other representative mammalian TFs (c-myc, p53, HNF-1 and CREB). We find a nonrandom distribution of inexact sites for these TFs, referred to as highly-degenerate TFBSs, that are enriched around the cognate binding sites. Comparisons among human, mouse and rat orthologous promoters reveal that these highly-degenerate sites are conserved significantly more than expected by random chance, suggesting their positive selection during evolution. We propose that this arrangement provides a favorable genomic landscape for functional target site selection.
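The idea of inexact sites clustering around a cognate site can be illustrated with a minimal sketch. It is not the method used in the article, which the abstract does not specify. It assumes a simple mismatch-count (Hamming distance) definition of a degenerate site, a single strand, and a hypothetical toy motif and sequence (`TOY_MOTIF`, `toy_seq`) that do not correspond to RE1 or any real binding site. The function names `scan` and `degenerate_near_exact` are likewise illustrative.

```python
def hamming(a, b):
    """Number of positions at which two equal-length strings differ."""
    return sum(x != y for x, y in zip(a, b))


def scan(seq, motif, max_mismatch):
    """Return (position, mismatches) for every window of seq that is
    within max_mismatch substitutions of motif (forward strand only)."""
    k = len(motif)
    hits = []
    for i in range(len(seq) - k + 1):
        d = hamming(seq[i:i + k], motif)
        if d <= max_mismatch:
            hits.append((i, d))
    return hits


def degenerate_near_exact(hits, flank):
    """For each exact match, count inexact matches whose start position
    lies within `flank` bases of the exact match's start position."""
    exact = [p for p, d in hits if d == 0]
    degenerate = [p for p, d in hits if d > 0]
    return {e: sum(1 for p in degenerate if abs(p - e) <= flank) for e in exact}


# Hypothetical toy data, chosen only to exercise the functions.
TOY_MOTIF = "ACGTAC"
toy_seq = "TTACGTTCGGACGTACTTACGAACGG"

hits = scan(toy_seq, TOY_MOTIF, max_mismatch=1)
print(hits)
print(degenerate_near_exact(hits, flank=10))
```

In a real analysis the observed counts would be compared against a background expectation (for example, shuffled sequences), and hits overlapping the cognate site itself would probably be handled separately; this sketch does neither.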