Pseudomonas putida is a non-pathogenic member of rRNA group I of the genus
Pseudomonas that colonizes many different environments and is well known for its broad metabolic versatility and genetic plasticity [
1,
2].
P. putida KT2440 is a TOL plasmid cured, spontaneous restriction deficient derivative of
P. putida mt-2 [
3,
4]. This strain represents the first host-vector biosafety system for cloning in gram-negative soil bacteria and hence, has been extensively used as a host for gene cloning and expression of heterologous genes [
5-
8]. Consequently, large efforts have been made in exploiting these capacities in a diverse range of biotechnological applications including i) bioremediation of contaminated areas [
9,
10]; ii) quality improvement of fossil fuels, e.g., by desulphurization [
11]; iii) biocatalytic production of fine chemicals [
9,
12-
14]; iv) production of bioplastic [
15-
17]; and v) as agents of plant growth promotion and plant pest control [
18,
19].
Since the publication of
P. putida KT2440's genome [
20], our knowledge about this strain has significantly increased [
21] and various "-omics" data sets have become available, such as transcriptomic [
22,
23], proteomic [
24], and fluxomic data [
25,
26]. Subsequently, some
in silico analyses of its metabolic and biotechnological capacities have been published [
27,
28]. However, systemic understanding of metabolic and biotechnology capabilities of
P. putida KT2440 requires the construction of a more comprehensive model enabling the integration of the canonical experimental data along with genomic and high-throughput data in a hierarchical and coherent fashion [
29].
The constraint-based reconstruction and analysis (COBRA) approach is one possible modeling approach that uses stoichiometric information about biochemical transformation taking place in a target organism to construct the model. While a metabolic reconstruction is unique to the target organism one can derive many different condition-specific models from a single reconstruction. This conversion of a metabolic reconstruction of an organism into models requires the imposition of physicochemical and environmental constraints to define systems boundaries [
30-
32]. The conversion also includes the transformation of the reaction list into a computable, mathematical matrix format. In this so-called S matrix, where S stands for stoichiometric, the rows correspond to the network metabolites and the columns to the network reactions. The coefficients of the substrates and products of each reaction are entered in the corresponding cell of the matrix. This conversion can be done automatically (e.g., using the Matlab-based COBRA toolbox [
33]). Once in this format, numerous mathematical tools can be used to interrogate the metabolic network properties
in silico. Many of the published mathematical tools have been reviewed [
34] and encoded in Matlab format [
33]. A large subset of these tools relies on linear programming (LP), a mathematical tool used to find a solution to an optimization problem (e.g., maximal possible growth rate of my metabolic network under a given set of environmental constraints). While LP-based tools are very helpful in studying reconstructed metabolic networks, some questions may better be addressed without having to choose an objective function. Those methods are called unbiased methods, in contrast to biased LP-based methods, because they identify all feasible flux distributions under the given set of environmental constraints rather than only the optimal distributions. The COBRA approach [
30,
32] has been successfully used to build and analyze genome-scale
in silico reconstructions for representatives of archaea (e.g.,
Methanosarcina barkeri [
35]), of bacteria (e.g.,
E. coli [
36];
B. subtilis [
37];
H. pylori [
38];
M. tuberculosis [
39,
40];
S. aureus [
41,
42];
L. lactis [
43]), and of eukarya (e.g., Human [
44]). The numerous mathematical tools have been used for i) identification and filling of knowledge gaps (e.g. missing gene annotations [
45]); ii) prediction of the outcome of adaptive evolution [
46-
48]; iii) design of engineered production strains [
49]; and iv) the understanding of topological features of metabolic networks [
50-
53]. A recent review illustrates the variety of questions that have been addressed to
E. coli's metabolic network using different biased and unbiased COBRA methods [
54].
Here, we describe a highly detailed, genome-scale, metabolic reconstruction of
Pseudomonas putida KT2440. Based on the naming convention for metabolic networks [
55], this genome scale reconstruction was deemed
iJN746, where
i stands for
in silico, JN are the initials of the constructor, and 746 corresponds to the number of included metabolic genes. The reconstruction was built using the COBRA approach [
30,
32] and validated using flux balance analysis (FBA, [
56]). The
in silico metabolic network was further evaluated by comparing i) predicted growth rate capacities in different carbon sources and ii) predicted essential genes with experimental data from
P. putida KT2440 and
P. aeruginosa. Finally, we show the utility of the
P. putida reconstruction to analyze its biodegradative (i.e. toluene degradation) and biotechnological (i.e. bioplastic production) capacities.