Comparative genomics modeling of the NRSF/REST repressor network: From single conserved sites to genome-wide repertoire
- 8 September 2006
- journal article
- research article
- Published by Cold Spring Harbor Laboratory in Genome Research
- Vol. 16 (10), 1208-1221
- https://doi.org/10.1101/gr.4997306
Abstract
We constructed and applied an open source informatic framework called Cistematic in an effort to predict the target gene repertoire for transcription factors with large binding sites. Cistematic uses two different evolutionary conservation-filtering algorithms in conjunction with several analysis modules. Beginning with a single conserved and biologically tested site for the neuronal repressor NRSF/REST, Cistematic generated a refined PSFM (position specific frequency matrix) based on conserved site occurrences in mouse, human, and dog genomes. Predictions from this model were validated by chromatin immunoprecipitation (ChIP) followed by quantitative PCR. The combination of transfection assays and ChIP enrichment data provided an objective basis for setting a threshold for membership and rank-ordering a final gene cohort model consisting of 842 high-confidence sites in the human genome associated with 733 genes. Statistically significant enrichment of NRSE-associated genes was found for neuron-specific Gene Ontology (GO) terms and neuronal mRNA expression profiles. A more extensive evolutionary survey showed that NRSE sites matching the PSFM model exist in roughly similar numbers in all fully sequenced vertebrate genomes but are notably absent from invertebrate and protochordate genomes, as is NRSF itself. Some NRSF/REST sites reside in repeats, which suggests a mechanism for both ancient and modern dispersal of NRSEs through vertebrate genomes. Multiple predicted sites are located near neuronal microRNA and splicing-factor genes, and these tested positive for NRSF/REST occupancy in vivo. The resulting network model integrates post-transcriptional and translational controllers, including candidate feedback loops on NRSF and its corepressor, CoREST.Keywords
This publication has 52 references indexed in Scilit:
- A clustering property of highly-degenerate transcription factor binding sites in the mammalian genomeNucleic Acids Research, 2006
- In situ detection of miRNAs in animal embryos using LNA-modified oligonucleotide probesNature Methods, 2005
- Nova regulates brain-specific splicing to shape the synapseNature Genetics, 2005
- Combinatorial microRNA target predictionsNature Genetics, 2005
- Conserved Seed Pairing, Often Flanked by Adenosines, Indicates that Thousands of Human Genes are MicroRNA TargetsCell, 2005
- A pancreatic islet-specific microRNA regulates insulin secretionNature, 2004
- A gene atlas of the mouse and human protein-encoding transcriptomesProceedings of the National Academy of Sciences, 2004
- Applied bioinformatics for the identification of regulatory elementsNature Reviews Genetics, 2004
- The UCSC Genome Browser DatabaseNucleic Acids Research, 2003
- The Human Genome Browser at UCSCGenome Research, 2002