A primary value of genome-scale metabolic models is their ability to provide a
holistic view of metabolism allowing, for instance, for quantitative investigation
of dependencies between species existing far apart in the metabolic network
[20]. Once
experimentally validated, these models can be used to characterize metabolic
resource allocation, to generate experimentally testable predictions of cell
phenotype, to elucidate metabolic network evolution scenarios, and to design
experiments that most effectively reveal genotype-phenotype relationships.
Furthermore, owing to their genome-wide scale, these models enable systematic
assessment of how perturbations in the metabolic network affect the organism as a
whole, such as in determining lethality of mutations or predicting the effects of
nutrient limitations. Since these multiple and intertwined relationships are not
immediately obvious without genome-scale analysis, they would not be found during
investigation of small, isolated circuits or genes as is typical in a traditional
reductionist approach
[65],
[66].
We present here a genome-scale reconstruction and constraint-based model of the
P. putida strain KT2440, accounting for 815 genes whose
products correspond to 877 reactions and connect 886 metabolites. The manually
curated reconstruction was based on the most up-to-date annotation of the bacterium,
the content of various biological databases, primary research publications and
specifically designed functional genomics experiments. New or refined annotations
for many genes were suggested during the reconstruction process. The model was
validated with a series of experimental sets, including continuous culture data,
BIOLOG substrate utilization assays, 13C flux measurements and a set of
specifically-generated mutant strains. FBA and FVA were used to ascertain the
distribution of resources in KT2440, to systematically assess gene and reaction
essentiality and to gauge the robustness of the metabolic network. Hence, this work
represents one of the most thorough sets of analyses thus far performed for an
organism by means of constraint-based modeling, providing thereby a solid
genome-scale framework for the exploration of the metabolism of this fascinating and
versatile bacterium. However, since this modeling endeavor relies upon a number of
approximations, the limits, potential and applicability of the analysis must be
clearly identified and defined. We address these points below.
Altogether, our results and analyses show that the model accurately captures a
substantial fraction of the metabolic functions of P. putida
KT2440. Therefore, the model was used to generate hypotheses on constraining and
redirecting fluxes towards the improvement of production of polyhydroxyalkanoates,
which are precursors for industrially and medically important bioplastics. This is,
to our knowledge, the first reported application of constraint-based modeling to
direct and improve the yield of a compound of which the production is not directly
coupled to the growth of the organism. This opens up novel areas of application for
the constraint-based approach. Our approach, based on the OptKnock algorithm, allows
for both prediction of mutants with desirable properties and identification of
conditions that support the expression of these properties.
Notwithstanding the generally good agreement between experimental results and
simulations of our model, several of the discrepancies encountered reflect pitfalls
inherent to constraint-based modeling that go beyond the scope of our study:
Firstly, the high number of blocked reactions and the mismatches with the BIOLOG data
show that there are still many areas of the metabolism that require thorough
exploration. The genes encoding transport-related are particularly relevant, as for
most of them, neither the translocated compound nor the mechanism of translocation
is known. Furthermore, it should be highlighted that the genome still has 1635 genes
annotated as “hypothetical” or “conserved
hypothetical”, more than 800 genes annotated as putative, and over 800 for
which the functional annotation gives no information beyond the protein family name.
It is thus likely that a fraction of the hypothetical and non-specifically annotated
genes in the current P. putida annotation are responsible for
unknown metabolic or transport processes, or that some might code for proteins that
add redundancy to known pathways. This observation is common to all genomes
sequenced so far and illustrates a major hurdle in the model building process (and
hence, its usefulness) that can be overcome only through extensive studies in
functional genomics.
Secondly, although we carefully constrained the
in silico flux space
through FBA and FVA and obtained distribution spaces roughly consistent with those
experimentally determined via
13C- flux analysis, these approaches are
inherently limited as they assume growth as a sole metabolic objective and ignore
any effects not explicitly represented in a constraint-based metabolic model. It has
been shown that FBA using objective functions other than growth can improve
predictive accuracy under certain conditions
[53]. Kinetic limitations
also may play a very important role in determining the extent to which a particular
reaction or pathway is used. Teusink et al.
[52] showed that in the
case of
L. plantarum these factors may lead to false predictions.
Thirdly, the reconstruction includes causal relationships between genes and reactions
via gene-protein-relationships (GPRs) but it lacks explicit information regarding
gene regulation. The regulation of gene expression causes that there are many genes
in the cell that are expressed only under certain growth conditions. Therefore, the
in silico flux space is generally larger than the true
in vivo flux space of the metabolic network. This, in turn, may
influence the robustness of the metabolic network and the essentiality of some
reactions and genes. The lack of regulatory information and of the genetic
interactions involved is likely to be one of the causes for faulty predictions of
the viability of mutant strains. Adding this information will be an important step
in the further development and improvement of the accuracy of the reconstruction.
Fourthly, although our analyses indicated that growth yield is relatively insensitive
to changes in biomass composition, these analyses also suggest that factors other
than the structure of the metabolic network play an important role in defining the
relationship between the growth yield and environmental conditions. The prediction
of the exact growth yield requires the precise measurement of maintenance values,
which may vary substantially from one condition to the other
[44]–
[46]. As
the maintenance accounts for 10–30% of the total carbon source
provided in unstressed conditions, this may set a limit to the accuracy of the
growth yield predictions.
To enhance the usefulness and predictiveness of the model, several avenues could be
followed in the future. Firstly, additional constraints can be overlaid on the
network to reduce the space of possibilities and increase the accuracy of
predictions. In addition to specific knowledge of particular enzymatic or transport
processes, such constraints are best based on high-throughput experimental evidence
such as transcriptomic and proteomic data, which are instrumental in expanding
genotype-phenotype relationships in the context of genome-scale metabolic models
[67].
Microarray experiments have guided the discovery of metabolic regulons, and usage of
microarray and proteomic data to constrain metabolic models has improved model
accuracy for other systems
[23]. Secondly,
P. putida provides a
good opportunity for incorporating kinetic information into a genome-scale model as
there are various kinetic models available and under development for small circuits
in
P. putida
[68]–
[71]. Incorporating data
from these models into the genome-scale reconstruction would provide insights into
the relationships of isolated metabolic subsystems within the global metabolism.
This synthesis would also improve the flux predictions of the global model,
particularly in areas where current FBA-based predictions methods fail due to their
inherent limitations.
Experimental validation of a genome-scale model is an iterative process that is
performed continuously as a model is refined and improved through novel information
and validation rounds. In this work, we have globally validated iJP815 as well as
specific parts thereof by using both up-to-date publicly available data and data
generated in our lab, but there will be always parts of the model that include
blocked reactions and pathways that will require further, specific validation. As
more knowledge becomes available from the joint efforts of the large
P.
putida community (e.g.,
http://www.psysmo.org), focus will
be put on these low-knowledge areas for future experimental endeavors. We anticipate
that this model will be of valuable assistance to those efforts.
The metabolic reconstruction, the subsequent mathematical computation and the
experimental validation reported here provide a sound framework to explore the
metabolic capabilities of this versatile bacterium, thereby yielding valuable
insights into the genotype-phenotype relationships governing its metabolism and
contributing to our ability to exploit the biotechnological potential of
pseudomonads. By providing the means to examine all aspects of metabolism, an
iterative modeling process can generate logical hypotheses and identify conditions
(such as regulatory events or conditional expression of cellular functions) that
would reconcile disagreements between experimental observations and simulation
results. Through a detailed in silico analysis of
polyhydroxyalkanoate production, we show how central metabolic precursors of a
compound of interest not directly coupled to the organism's growth function
might be increased via modification of global flux patterns. Furthermore, as the
species Pseudomonas putida encompasses strains with a wide range of
metabolic features and numerous isolates with unique phenotypes, the reconstruction
presented provides a basic scaffold upon which future models of other P.
putida strains can be built with the addition or subtraction of
strain-specific metabolic pathways. Due to its applicability across the numerous
P. putida strains iJP815 provides a sound basis for many future
studies towards the elucidation of habitat-specific features, bioremediation
applications and metabolic engineering strategies with members of this ubiquitous,
metabolically versatile and fascinating genus.