Genome-Scale Analysis of the Uses of the Escherichia coli Genome: Model-Driven Analysis of Heterogeneous Data Sets

Open Access

1 November 2003

journal article
Published by American Society for Microbiology in Journal of Bacteriology

Vol. 185 (21), 6392-6399
https://doi.org/10.1128/jb.185.21.6392-6399.2003

Abstract

The recent availability of heterogeneous high-throughput data types has increased the need for scalable in silico methods with which to integrate data related to the processes of regulation, protein synthesis, and metabolism. A sequence-based framework for modeling transcription and translation in prokaryotes has been established and has been extended to study the expression state of the entire Escherichia coli genome. The resulting in silico analysis of the expression state highlighted three facets of gene expression in E. coli: (i) the metabolic resources required for genome expression and protein synthesis were found to be relatively invariant under the conditions tested; (ii) effective promoter strengths were estimated at the genome scale by using global mRNA abundance and half-life data, revealing genes subject to regulation under the experimental conditions tested; and (iii) large-scale genome location-dependent expression patterns with approximately 600-kb periodicity were detected in the E. coli genome based on the 49 expression data sets analyzed. These results support the notion that a structured model-driven analysis of expression data yields additional information that can be subjected to commonly used statistical analyses. The integration of heterogeneous genome-scale data (i.e., sequence, expression data, and mRNA half-life data) is readily achieved in the context of an in silico model.

Keywords

This publication has 43 references indexed in Scilit:

Global RNA Half-Life Analysis in Escherichia coli Reveals Positional Patterns of Transcript Degradation
Genome Research, 2003
Wavelet transforms for the characterization and detection of repeating motifs
Journal of Molecular Biology, 2002
Interrelating Different Types of Genomic Data, from Proteome to Secretome: 'Oming in on Function
Genome Research, 2001
DNA Microarray-Mediated Transcriptional Profiling of the Escherichia coli Response to Hydrogen Peroxide
Journal of Bacteriology, 2001
High-Density Microarray-Mediated Gene Expression Profiling of Escherichia coli
Journal of Bacteriology, 2001
Global Gene Expression Profiling in Escherichia coliK12
Journal of Biological Chemistry, 2000
A DNA structural atlas for Escherichia coli 1 1Edited by T. Richmond
Journal of Molecular Biology, 2000
MultiFun, a Multifunctional Classification Scheme forEscherichia coliK-12 Gene Products
Microbial & Comparative Genomics, 2000
MultiFun, a Multifunctional Classification Scheme forEscherichia coliK-12 Gene Products
Microbial & Comparative Genomics, 2000
The Complete Genome Sequence of Escherichia coli K-12
Science, 1997

Cited by 72 articles