DECOD: fast and accurate discriminative DNA motif finding
Open Access
- 12 July 2011
- journal article
- research article
- Published by Oxford University Press (OUP) in Bioinformatics
- Vol. 27 (17), 2361-2367
- https://doi.org/10.1093/bioinformatics/btr412
Abstract
Motivation: Motif discovery is now routinely used in high-throughput studies including large-scale sequencing and proteomics. These datasets present new challenges. The first is speed. Many motif discovery methods do not scale well to large datasets. Another issue is identifying discriminative rather than generative motifs. Such discriminative motifs are important for identifying co-factors and for explaining changes in behavior between different conditions.
Results: To address these issues, we developed a method for DECOnvolved Discriminative motif discovery (DECOD). DECOD uses a k-mer count table and so its running time is independent of the size of the input set. By deconvolving the k-mers DECOD considers context information without using the sequences directly. DECOD outperforms previous methods both in speed and in accuracy when using simulated and real biological benchmark data. We performed new binding experiments for p53 mutants and used DECOD to identify p53 co-factors, suggesting new mechanisms for p53 activation.
Availability: The source code and binaries for DECOD are available at http://www.sb.cs.cmu.edu/DECOD
Contact: zivbj@cs.cmu.edu
Supplementary information: Supplementary data are available at Bioinformatics online.
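To make the k-mer count table idea concrete, the following is a minimal Python sketch, not DECOD's actual implementation. It builds count tables for a positive and a negative sequence set and ranks k-mers by their difference in relative frequency. The function names (`kmer_counts`, `count_difference`), the toy sequences, and the simple frequency-difference score are all assumptions made for illustration; the deconvolution step described in the abstract is not shown. The point is that once the tables are built in a single pass, later work touches at most 4^k entries rather than the original sequences.

```python
from collections import Counter


def kmer_counts(sequences, k):
    """Count every overlapping k-mer across a list of DNA sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def count_difference(positive, negative, k):
    """Relative-frequency difference (positive minus negative) per k-mer.

    Hypothetical scoring used only for illustration; it is not the
    objective function optimized by DECOD.
    """
    pos = kmer_counts(positive, k)
    neg = kmer_counts(negative, k)
    pos_total = sum(pos.values()) or 1  # avoid division by zero on empty input
    neg_total = sum(neg.values()) or 1
    return {
        kmer: pos[kmer] / pos_total - neg[kmer] / neg_total
        for kmer in set(pos) | set(neg)
    }


# Toy data (made up): the positive set is enriched for ACGT.
positive = ["ACGTACGT", "TTACGTAA"]
negative = ["TTTTAAAA", "GGGGCCCC"]

table = count_difference(positive, negative, k=4)
top = max(table, key=table.get)  # k-mer most enriched in the positive set
print(top, round(table[top], 3))
```

After the counting pass, the size of `table` depends only on k and the alphabet, which is the property the abstract relies on when it says the running time is independent of the input size.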