Analysis of genomic context: prediction of functional associations from conserved bidirectionally transcribed gene pairs
- 30 June 2004
- journal article
- research article
- Published by Springer Nature in Nature Biotechnology
- Vol. 22 (7), 911-917
- https://doi.org/10.1038/nbt988
Abstract
Several widely used methods for predicting functional associations between proteins are based on the systematic analysis of genomic context. Efforts are ongoing to improve these methods and to search for novel aspects in genomes that could be exploited for function prediction. Here, we use gene expression data to demonstrate two functional implications of genome organization: first, chromosomal proximity indicates gene coregulation in prokaryotes independent of relative gene orientation; and second, adjacent bidirectionally transcribed genes (that is, 'divergently' organized coding regions) with conserved gene orientation are strongly coregulated. We further demonstrate that such bidirectionally transcribed gene pairs are functionally associated and derive from this a novel genomic context method that reliably predicts links between >2,500 pairs of genes in ∼100 species. Around 650 of these functional associations are supported by other genomic context methods. In most instances, one gene encodes a transcriptional regulator, and the other a nonregulatory protein. In-depth analysis in Escherichia coli shows that the vast majority of these regulators both control transcription of the divergently transcribed target gene/operon and auto-regulate their own biosynthesis. The method thus enables the prediction of target processes and regulatory features for several hundred transcriptional regulators.
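To make the term 'divergently organized' concrete, the following is a minimal Python sketch of how adjacent bidirectionally transcribed gene pairs could be picked out of a single linear list of gene coordinates. It is an illustration only, not the authors' pipeline: the Gene record, the divergent_pairs function, and the toy coordinates are all hypothetical, and the sketch does not implement the cross-genome conservation filter or the expression analysis that the abstract describes.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene record; field names are assumptions for this sketch."""
    name: str
    start: int   # leftmost coordinate on the replicon
    end: int     # rightmost coordinate on the replicon
    strand: str  # '+' (forward) or '-' (reverse)


def divergent_pairs(genes: List[Gene]) -> List[Tuple[str, str]]:
    """Return names of adjacent gene pairs transcribed away from each other.

    After sorting by start coordinate, a neighbouring pair is divergent
    when the left gene lies on the '-' strand and the right gene on the
    '+' strand (<- ->). Convergent (-> <-) and co-directional pairs are
    skipped. The replicon is treated as linear, so the pair that would
    span the origin of a circular chromosome is not considered.
    """
    ordered = sorted(genes, key=lambda g: g.start)
    pairs = []
    for left, right in zip(ordered, ordered[1:]):
        if left.strand == "-" and right.strand == "+":
            pairs.append((left.name, right.name))
    return pairs


# Toy coordinates, invented for illustration only.
toy_genes = [
    Gene("a", 100, 900, "-"),
    Gene("b", 1050, 2000, "+"),
    Gene("c", 2100, 3000, "+"),
    Gene("d", 3100, 4000, "-"),
    Gene("e", 4100, 5000, "-"),
    Gene("f", 5200, 6000, "+"),
]

result = divergent_pairs(toy_genes)
print(result)
```

In this toy input, a/b and e/f are divergent, c/d is convergent, and b/c and d/e are co-directional. A conservation filter in the spirit of the abstract would additionally require that orthologs of a pair keep the same divergent arrangement in other genomes; that step is left out here because the abstract gives no details on how it was done.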
This publication has 78 references indexed in Scilit:
- Global analysis of protein localization in budding yeast. Nature, 2003
- Multiple sequence alignment with the Clustal series of programs. Nucleic Acids Research, 2003
- Classification schemes for protein structure and function. Nature Reviews Genetics, 2003
- PRODORIC: prokaryotic database of gene regulation. Nucleic Acids Research, 2003
- The origin and evolution of model organisms. Nature Reviews Genetics, 2002
- Transcriptional Regulatory Networks in Saccharomyces cerevisiae. Science, 2002
- Transitive functional annotation by shortest-path analysis of gene expression data. Proceedings of the National Academy of Sciences, 2002
- Predictome: a database of putative functional links between proteins. Nucleic Acids Research, 2002
- Who's your neighbor? New computational approaches for functional genomics. Nature Biotechnology, 2000
- Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Research, 1997