DECOD: fast and accurate discriminative DNA motif finding

Open Access

12 July 2011

journal article
research article
Published by Oxford University Press (OUP) in Bioinformatics

Vol. 27 (17), 2361-2367
https://doi.org/10.1093/bioinformatics/btr412

Abstract

Motivation: Motif discovery is now routinely used in high-throughput studies including large-scale sequencing and proteomics. These datasets present new challenges. The first is speed. Many motif discovery methods do not scale well to large datasets. Another issue is identifying discriminative rather than generative motifs. Such discriminative motifs are important for identifying co-factors and for explaining changes in behavior between different conditions.

Results: To address these issues we developed a method for DECOnvolved Discriminative motif discovery (DECOD). DECOD uses a k-mer count table and so its running time is independent of the size of the input set. By deconvolving the k-mers DECOD considers context information without using the sequences directly. DECOD outperforms previous methods both in speed and in accuracy when using simulated and real biological benchmark data. We performed new binding experiments for p53 mutants and used DECOD to identify p53 co-factors, suggesting new mechanisms for p53 activation.

Availability: The source code and binaries for DECOD are available at http://www.sb.cs.cmu.edu/DECOD

Contact: zivbj@cs.cmu.edu

Supplementary information: Supplementary data are available at Bioinformatics online.
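
Illustrative note: the abstract's k-mer count table idea can be sketched as follows. This is not the DECOD implementation and does not perform the deconvolution step; it only shows, under simple assumptions, how positive and negative sequence sets might be summarized as k-mer counts and how a naive discriminative score (difference in relative frequency) could be computed from the tables alone. The function names `kmer_counts` and `discriminative_kmers` and the scoring rule are hypothetical choices made for this example.

```python
from collections import Counter


def kmer_counts(sequences, k):
    """Count every overlapping k-mer across a list of DNA sequences."""
    counts = Counter()
    for seq in sequences:
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            counts[seq[i:i + k]] += 1
    return counts


def discriminative_kmers(positive, negative, k):
    """Rank k-mers by (relative frequency in positive) - (relative frequency in negative).

    Assumption: this simple frequency difference stands in for a real
    discriminative objective; it is not the scoring used by DECOD.
    """
    pos = kmer_counts(positive, k)
    neg = kmer_counts(negative, k)
    pos_total = sum(pos.values()) or 1  # avoid division by zero on empty input
    neg_total = sum(neg.values()) or 1
    scores = {
        kmer: pos[kmer] / pos_total - neg[kmer] / neg_total
        for kmer in set(pos) | set(neg)
    }
    return sorted(scores.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    positive = ["ACGTACGT", "TTACGTAA"]  # toy sequences, not real data
    negative = ["TTTTTTTT", "AAAATTTT"]
    for kmer, score in discriminative_kmers(positive, negative, k=4)[:3]:
        print(kmer, round(score, 3))
```

Once the two tables are built, the ranking step touches only the distinct k-mers (at most 4^k of them), which is one way to read the abstract's claim that running time does not depend on the size of the input set.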

This publication has 35 references indexed in Scilit.

Cited by 39 articles