Discovering regulatory elements in non-coding sequences by analysis of spaced dyads
Open Access
- 15 April 2000
- journal article
- research article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 28 (8), 1808-1818
- https://doi.org/10.1093/nar/28.8.1808
Abstract
The application of microarray and related technologies is currently generating a systematic catalog of the transcriptional response of any single gene to a multiplicity of experimental conditions. Clustering genes according to the similarity of their transcriptional response provides a direct hint to the regulons of the different transcription factors, many of which have still not been characterized. We have developed a new method for deciphering the mechanism underlying the common transcriptional response of a set of genes, i.e. discovering cis-acting regulatory elements from a set of unaligned upstream sequences. This method, called dyad analysis, is based on the observation that many regulatory sites consist of a pair of highly conserved trinucleotides, spaced by a non-conserved region of fixed width. The approach is to count the number of occurrences of each possible spaced pair of trinucleotides, and to assess its statistical significance. The method is highly efficient in the detection of sites bound by C6 Zn2 binuclear cluster proteins, as well as other transcription factors. In addition, we show that the dyad and single-word analyses are efficient for the detection of regulatory patterns in gene clusters from DNA chip experiments. In combination, these programs should provide a fast and efficient way to discover new regulatory sites for as yet unknown transcription factors.Keywords
This publication has 35 references indexed in Scilit:
- Detecting Protein Function and Protein-Protein Interactions from Genome SequencesScience, 1999
- Exploring the new world of the genome with DNA microarraysNature Genetics, 1999
- The Transcriptional Program of Sporulation in Budding YeastScience, 1998
- Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitationNature Biotechnology, 1998
- Genomic-scale analysis goes upstream?Nature Biotechnology, 1998
- Extracting regulatory sites from the upstream region of yeast genes by computational analysis of oligonucleotide frequencies 1 1Edited by G. von HeijneJournal of Molecular Biology, 1998
- Identification of functional elements in unaligned nucleic acid sequences by a novel tuple search algorithmBioinformatics, 1996
- Gibbs motif sampling: Detection of bacterial outer membrane protein repeatsProtein Science, 1995
- Expectation maximization algorithm for identifying protein-binding sites with variable lengths from unaligned DNA fragmentsJournal of Molecular Biology, 1992
- Recognition of characteristic patterns in sets of functionally equivalent DNA sequencesBioinformatics, 1987