Genome-Scale Reconstruction of Escherichia coli's Transcriptional and Translational Machinery: A Knowledge Base, Its Mathematical Formulation, and Its Functional Characterization
Open Access
- 13 March 2009
- journal article
- research article
- Published by Public Library of Science (PLoS) in PLoS Computational Biology
- Vol. 5 (3) , e1000312
- https://doi.org/10.1371/journal.pcbi.1000312
Abstract
Metabolic network reconstructions represent valuable scaffolds for ‘-omics’ data integration and are used to computationally interrogate network properties. However, they do not explicitly account for the synthesis of macromolecules (i.e., proteins and RNA). Here, we present the first genome-scale, fine-grained reconstruction of Escherichia coli's transcriptional and translational machinery, which produces 423 functional gene products in a sequence-specific manner and accounts for all necessary chemical transformations. Legacy data from over 500 publications and three databases were reviewed, and many pathways were considered, including stable RNA maturation and modification, protein complex formation, and iron–sulfur cluster biogenesis. This reconstruction represents the most comprehensive knowledge base for these important cellular functions in E. coli and is unique in its scope. Furthermore, it was converted into a mathematical model and used to: (1) quantitatively integrate gene expression data as reaction constraints and (2) compute functional network states, which were compared to reported experimental data. For example, the model predicted accurately the ribosome production, without any parameterization. Also, in silico rRNA operon deletion suggested that a high RNA polymerase density on the remaining rRNA operons is needed to reproduce the reported experimental ribosome numbers. Moreover, functional protein modules were determined, and many were found to contain gene products from multiple subsystems, highlighting the functional interaction of these proteins. This genome-scale reconstruction of E. coli's transcriptional and translational machinery presents a milestone in systems biology because it will enable quantitative integration of ‘-omics’ datasets and thus the study of the mechanistic principles underlying the genotype–phenotype relationship. Systems biology aims to understand the interactions of cellular components in a systemic manner. Mathematical modeling is critical to the integration and analysis of these components on a conceptual as well as mechanistic level. To date, detailed genome-scale reconstructions of metabolism have become available for a growing number of organisms. Although metabolism has an important role in cells, other cellular functions need to be considered as well, such as signaling, regulation, and macromolecular synthesis. For instance, the cellular machinery required for RNA and protein synthesis consists of a complex set of proteins. Here, we show that one can collect all of the necessary information for a prokaryotic organism to create a gene-specific, fine-grained representation of the macromolecular synthesis machinery. E. coli was chosen as a model organism because of the wealth of available information. The explicit representation of transcription and translation in terms of a mass-balanced network enables a detailed, quantitative accounting of the protein synthesis capabilities of E. coli in silico. Hence, this study demonstrates the feasibility of constructing very large networks and also represents a critical step toward building cellular models of growth that can account for gene-specific protein production in a stoichiometric fashion on the genome scale.Keywords
This publication has 75 references indexed in Scilit:
- Reconstruction of biochemical networks in microorganismsNature Reviews Microbiology, 2008
- The growing scope of applications of genome-scale metabolic reconstructions using Escherichia coliNature Biotechnology, 2008
- Systems analysis of metabolism in the pathogenic trypanosomatid Leishmania majorMolecular Systems Biology, 2008
- Formulating genome‐scale kinetic models in the post‐genome eraMolecular Systems Biology, 2008
- A genome‐scale metabolic reconstruction for Escherichia coli K‐12 MG1655 that accounts for 1260 ORFs and thermodynamic informationMolecular Systems Biology, 2007
- Systems approach to refining genome annotationProceedings of the National Academy of Sciences, 2006
- Towards multidimensional genome annotationNature Reviews Genetics, 2006
- Metabolic gene–deletion strains of Escherichia coli evolve to computationally predicted growth phenotypesNature Genetics, 2004
- Integrating high-throughput and computational data elucidates bacterial networksNature, 2004
- Global organization of metabolic fluxes in the bacterium Escherichia coliNature, 2004