Genomic Sequence Is Highly Predictive of Local Nucleosome Depletion
Open Access
- 25 January 2008
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 4 (1), e13
- https://doi.org/10.1371/journal.pcbi.0040013
Abstract
The regulation of DNA accessibility through nucleosome positioning is important for transcription control. Computational models have been developed to predict genome-wide nucleosome positions from DNA sequences, but these models consider only nucleosome sequences, which may have limited their power. We developed a statistical multi-resolution approach to identify a sequence signature, called the N-score, that distinguishes nucleosome binding DNA from non-nucleosome DNA. This new approach has significantly improved the prediction accuracy. The sequence information is highly predictive for local nucleosome enrichment or depletion, whereas predictions of the exact positions are only modestly more accurate than a null model, suggesting the importance of other regulatory factors in fine-tuning the nucleosome positions. The N-score in promoter regions is negatively correlated with gene expression levels. Regulatory elements are enriched in low N-score regions. While our model is derived from yeast data, the N-score pattern computed from this model agrees well with recent high-resolution protein-binding data in human. A eukaryotic genome is packaged into chromatin. The chromatin not only makes it possible to fit the relatively long genome into a tiny nucleus, but also plays an important regulatory role. The nucleosome is the fundamental repeating unit of chromatin. High-resolution tiling array experiments have shown that many nucleosomes are well-positioned in vivo, consistent with an important regulatory role. However, the mechanisms that determine nucleosome positioning are still poorly understood. We have developed a novel computational method for predicting nucleosome positions using only the genomic sequence information. The method detects periodic sequence signatures that discriminate nucleosome sequences from linker sequences. We show that this approach has significantly improved predictive power compared to previous studies. Interestingly, the most predictable regions tend to be located where stringent regulations are needed, i.e., the neighborhood of a transcription start site. This model predicts that nucleosome occupancy is not strongly controlled by short DNA sequence motifs but rather progressively controlled by regular organization of short elements into periodic patterns. We also provide evidence that sequence specificity for nucleosome binding is conserved from yeast to human.Keywords
This publication has 45 references indexed in Scilit:
- High-Resolution Profiling of Histone Methylations in the Human GenomeCell, 2007
- Translational and rotational settings of H2A.Z nucleosomes across the Saccharomyces cerevisiae genomeNature, 2007
- Analysis of the Vertebrate Insulator Protein CTCF-Binding Sites in the Human GenomeCell, 2007
- Distinct and predictive chromatin signatures of transcriptional promoters and enhancers in the human genomeNature Genetics, 2007
- A genomic code for nucleosome positioningNature, 2006
- Stubb: a program for discovery and analysis of cis-regulatory modulesNucleic Acids Research, 2006
- Genome-scale profiling of histone H3.3 replacement patternsNature Genetics, 2005
- Evidence for nucleosome depletion at active regulatory regions genome-wideNature Genetics, 2004
- An algorithm for finding protein–DNA binding sites with applications to chromatin- immunoprecipitation microarray experimentsNature Biotechnology, 2002
- Sequence periodicities in chicken nucleosome core DNAJournal of Molecular Biology, 1986