Computational identification of transcriptional regulatory elements in DNA sequence
Open Access
- 19 July 2006
- journal article
- review article
- Published by Oxford University Press (OUP) in Nucleic Acids Research
- Vol. 34 (12), 3585-3598
- https://doi.org/10.1093/nar/gkl372
Abstract
Identification and annotation of all the functional elements in the genome, including genes and the regulatory sequences, is a fundamental challenge in genomics and computational biology. Since regulatory elements are frequently short and variable, their identification and discovery using computational algorithms is difficult. However, significant advances have been made in the computational methods for modeling and detection of DNA regulatory elements. The availability of complete genome sequence from multiple organisms, as well as mRNA profiling and high-throughput experimental methods for mapping protein-binding sites in DNA, have contributed to the development of methods that utilize these auxiliary data to inform the detection of transcriptional regulatory elements. Progress is also being made in the identification of cis-regulatory modules and higher order structures of the regulatory sequences, which is essential to the understanding of transcription regulation in the metazoan genomes. This article reviews the computational approaches for modeling and identification of genomic regulatory elements, with an emphasis on the recent developments, and current challenges.Keywords
This publication has 101 references indexed in Scilit:
- Initial sequencing and comparative analysis of the mouse genomeNature, 2002
- Transcriptional Regulatory Networks in Saccharomyces cerevisiaeScience, 2002
- An algorithm for finding protein–DNA binding sites with applications to chromatin- immunoprecipitation microarray experimentsNature Biotechnology, 2002
- Finding Motifs Using Random ProjectionsJournal of Computational Biology, 2002
- Exploiting transcription factor binding site clustering to identify cis-regulatory modules involved in pattern formation in the Drosophila genomeProceedings of the National Academy of Sciences, 2002
- Initial sequencing and analysis of the human genomeNature, 2001
- The Genome Sequence of Drosophila melanogasterScience, 2000
- Finding DNA regulatory motifs within unaligned noncoding sequences clustered by whole-genome mRNA quantitationNature Biotechnology, 1998
- Sequence logos: a new way to display consensus sequencesNucleic Acids Research, 1990
- Selection of DNA binding sites by regulatory proteinsJournal of Molecular Biology, 1987