Design and analysis of ChIP-seq experiments for DNA-binding proteins
Open Access
- 16 November 2008
- journal article
- research article
- Published by Springer Nature in Nature Biotechnology
- Vol. 26 (12), 1351-1359
- https://doi.org/10.1038/nbt.1508
Abstract
Critical considerations in the design and analysis of ChIP-seq experiments include how to align sequenced tags to the genome, how to detect binding sites and how to estimate the number of tags needed to confidently determine where a protein binds DNA. Using data sets for three transcription factors, Kharchenko et al. address these considerations by comparing three novel algorithms with published computational methods.

Recent progress in massively parallel sequencing platforms has enabled genome-wide characterization of DNA-associated proteins using the combination of chromatin immunoprecipitation and sequencing (ChIP-seq). Although a variety of methods exist for analysis of the established alternative ChIP microarray (ChIP-chip), few approaches have been described for processing ChIP-seq data. To fill this gap, we propose an analysis pipeline specifically designed to detect protein-binding positions with high accuracy. Using previously reported data sets for three transcription factors, we illustrate methods for improving tag alignment and correcting for background signals. We compare the sensitivity and spatial precision of three peak detection algorithms with published methods, demonstrating gains in spatial precision when an asymmetric distribution of tags on positive and negative strands is considered. We also analyze the relationship between the depth of sequencing and characteristics of the detected binding positions, and provide a method for estimating the sequencing depth necessary for a desired coverage of protein binding sites.
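The strand asymmetry mentioned in the abstract can be illustrated with a small sketch. Around a binding site, positive-strand tags tend to pile up upstream and negative-strand tags downstream, so the offset between the two pile-ups carries information about where the protein sits. The code below is a simplified illustration under stated assumptions, not the algorithm from the paper: the function name `strand_shift`, the `max_shift` parameter, and the toy tag positions are all hypothetical, and a plain overlap count stands in for a proper correlation coefficient.

```python
from collections import Counter

def strand_shift(plus_starts, minus_starts, max_shift=300):
    """Return the offset (in bp) at which plus-strand tag positions best
    overlap minus-strand tag positions.

    Simplifying assumption: the score is a raw count of coinciding tags,
    not a normalized correlation. Ties resolve to the smallest shift.
    """
    plus = Counter(plus_starts)
    minus = Counter(minus_starts)
    best_shift, best_score = 0, -1
    for shift in range(max_shift + 1):
        # Count plus-strand tags that have a minus-strand partner
        # exactly `shift` bp downstream.
        score = sum(n * minus.get(pos + shift, 0) for pos, n in plus.items())
        if score > best_score:
            best_shift, best_score = shift, score
    return best_shift

# Toy data, fabricated for illustration: two hypothetical binding sites.
plus_tags = [100, 101, 102, 500, 501]
minus_tags = [250, 251, 252, 650, 651]

shift = strand_shift(plus_tags, minus_tags)
print(shift)  # offset between the strand-specific pile-ups
```

In this toy setup a binding position could then be guessed as the midpoint between a positive-strand pile-up and its negative-strand partner, that is, the positive-strand position plus half the estimated shift. Real data would call for a normalized score and a background correction.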