Introduction

One of the past and future driving forces of systems-oriented microbiology is the increasing need for superior cell factories, requesting efficient design and optimization strategies. Previous strain improvement has enabled random mutagenesis and selection or targeted genetic engineering without consideration of its impact on the entire metabolism. This caused detrimental side effects so that industrial biocatalysts today suffer from slow growth, weak stress tolerance, or the formation of undesired by-products, meaning that the potential of the available biochemical reaction set is often not fully recruited (Wittmann 2010). This is one of the reasons why industrial strain development is shifted towards systems metabolic engineering approaches which rationally introduce beneficial genetic modifications based on a systems-wide understanding of the underlying metabolic and regulatory networks (Lee et al. 2009). Such multi-omics studies have been performed more and more in the recent years in systems biology and systems metabolic engineering (Ishii et al. 2007; Park et al. 2008; Zhang et al. 2010). The present contribution highlights the application of flux analysis for quantitative analysis and engineering of metabolic pathways and its integration into a systems-oriented framework (Fig. 1). The underlying concept as well as the potential of flux-based analysis and engineering of microbial metabolism is hereby illustrated by selected examples, not aiming at a full coverage of studies on microbial fluxomics. For a more complete literature overview on metabolic flux studies of microorganisms, the reader is directed to recent review articles (Iwatani et al. 2008; Kim et al. 2008; Blank and Kuepfer 2010).

Fig. 1
figure 1

Architecture and interactions among and within the different functional layers in a cellular system. Solid line flow of information; dashed line interaction between molecular species

Tools and concepts for 13C-fluxomics

Metabolic flux analysis is about the quantification of intracellular fluxes, i.e., in vivo reaction rates through the different pathways within the intact living cell. Hereby, the level of substrates, cofactors, effectors, or regulators determine the in vivo flux through a particular enzyme as systems property. This differs in most cases dramatically from estimates on the in vitro activity for which enzymes are withdrawn from their physiological environment and analyzed under artificial assay conditions. In practice, flux analysis typically focuses on the 50–100 reactions of central metabolism which have particular relevance for biotechnology applications since they catalyze the major carbon flow and match with the reactions involved in biosynthesis of most industrial products.

Initially, flux studies recruited measured extracellular fluxes, i.e., substrate uptake or product secretion, and biosynthetic requirements entered into assumed stoichiometric reaction networks to derive certain intracellular fluxes (Shuichi and Masayoshi 1979). These pioneering studies provided first valuable insights into microbial metabolism on the flux level, but relied on uncertain assumptions, simplifications, and constraints such as balances of reduction equivalents or energy stoichiometry which unfortunately determine the actual flux result quite significantly (Wittmann 2002). Moreover, important fine structures as parallel or cyclic reactions which play an important role in microorganisms, e.g., in biosynthesis of biotechnology products such as lysine (Sonntag et al. 1993) or in central carbon metabolism (Hult et al. 1980; Sauer and Eikmanns 2005), were not accessible. In recent years, flux analysis has been extended by the integration of 13C labeling information from stable isotope experiments overcoming previous limitations. In these studies, 13C-labeled tracer substrates are fed to the examined cells until the 13C label has propagated through the metabolic network into metabolic products. Their 13C labeling pattern depends on the particular flux distribution and thus provides a sensitive “fingerprint” to calculate the fluxes. By means of mass spectrometry (MS) (Christensen and Nielsen 1999; Wittmann and Heinzle 1999; Dauner and Sauer 2000) and nuclear magnetic resonance spectroscopy (NMR) (Szyperski 1995), the labeling pattern can be quantified, whereby both methods ideally provide complementary data sets from the same sample. For flux calculation, the measured 13C labeling data, biosynthetic requirements and extracellular fluxes are integrated with a computer model which is an in silico representation of the metabolic network investigated. The basic workflow of the resulting comprehensive strategy, comprising experimental and computational steps, is given in Fig. 2. Metabolic flux analysis as outlined below has evolved into an advanced approach to assess steady-state fluxes of microbial cells (Sauer 2006). Inherently, fluxomics is laborious and requires a broad spectrum of quite diverse experimental and computational expertises, which is not easily brought together. Accordingly, and in contrast to other omics technologies, only a few leading laboratories have so far significantly contributed to its development and application.

Fig. 2
figure 2

Schematic work flow for 13C metabolic flux (fluxome) analysis comprising (I) experimental design to identify optimal substrates for a given flux problem, (II) isotope tracer experiments with measurement of 13C labeling patterns by MS together with extracellular fluxes and biosynthetic requirements, and (III) computational calculation of fluxes with models as in silico representation of the studied network. This basic concept is typically applied for routine quantification of steady-state fluxes. It can be varied in several aspects depending on the focus of the flux study

13C labeling analysis

Most common metabolic flux studies recruit mass spectrometric 13C labeling analysis of about 10–15 amino acids obtained from hydrolyzed cell protein (Christensen and Nielsen 1999; Dauner and Sauer 2000). They contain rich information for flux estimation, since they reflect the carbon backbone of eight key intermediates from different parts of central metabolism (Szyperski 1995). The most popular technique for amino acid labeling analysis today is GC/MS, since this approach requires only about 1 mg of cells, allows an excellent separation of the analytes within only about 20–30 min, and provides high precision data on mass isotopologue distribution with measurement errors of <0.5% (Wittmann 2007). Initially developed and applied mainly in the field of biomedicine (Lapidot and Nissim 1980; Nissim et al. 1981; Tsalikian et al. 1984; Nissim and Lapidot 1986), GC–MS-based metabolic flux approaches have been substantially extended and optimized and emerged as a key technology in metabolic physiology and biotechnology (Christensen and Nielsen 2000; Kelleher 2001; Wiechert 2001; Wittmann 2002; Des Rosiers et al. 2004; Des Rosiers and Chatham 2005).

Modeling and software packages

For flux calculation, computer-based models of the investigated metabolic network are usually utilized to globally fit the unknown flux parameters combining isotopomer and metabolite balancing (Wittmann 2007). Applying an optimization algorithm the deviation of the labeling data between computer model and experiment is minimized by iterative variation of the free fluxes until the flux distribution is identified. In combination with experimentally determined extracellular fluxes, absolute carbon fluxes throughout the network are derived. With certain constraints, GC–MS labeling data can also be used to calculate local flux ratios of converging reactions in the network (Fischer and Sauer 2003; Fischer et al. 2004). Although at first limited to only about 10–15 flux ratios directly accessible, this method has been recently combined with stoichiometric models and measured extracellular fluxes also providing a distribution of absolute fluxes (Zamboni et al. 2005). In recent years, general modeling frameworks have been developed for a general and systematic description of carbon transfer. These include based balancing of isotopomers (Schmidt et al. 1997), cumomers (Wiechert et al. 1999), and bondomers (van Winden et al. 2002) which display different representations of metabolic labeling states. The most efficient approach with regard to simulation speed is the recent introduction of elementary mode units (EMU) (Antoniewicz et al. 2007a). The different algorithms are implemented into convenient software tools which allow user-friendly flux calculation from measurement data as well as statistical treatment of flux data. Examples are OpenFlux using the EMU approach (Quek et al. 2009), 13C-Flux based on cumomers (Wiechert et al. 2001), Fiat-Flux involving flux ratio analysis (Zamboni et al. 2005), and a Matlab-based tool recruiting isotopomers (Wittmann and Heinzle 1999). Most of these packages support optimal experimental design of carbon labeling experiments by identifying optimal tracer substrates for a given flux problem as well as statistical treatment of flux data. First promising studies have recently shown dynamic labeling experiments for flux analysis under dynamic conditions, which open new possibilities to resolve fluxes in non-stationary systems (Nöh et al. 2006; Antoniewicz et al. 2007b; Nöh et al. 2007).

Application of 13C fluxomics

Pathway function

Extensive biochemical analysis carried out since the 1940s has basically provided detailed knowledge on the enzymes and pathways present in biological systems—including various microorganisms. With the extension of flux quantification from individual reactions to particular subsets of selected enzymes and finally to networks at the systems level, fascinating insights into metabolic properties of microorganisms could be obtained, especially through the recent years. Firstly, this has led to a new understanding of the role of certain pathways beyond their classically attributed function. An outstanding example is the pentose phosphate pathway, originally assumed to supply building blocks for biosynthesis. Metabolic flux analysis, however, demonstrated that this pathway carries flux far beyond the actual requirement for anabolism and fulfils a flexible role in catabolic breakdown and redox metabolism. Flexible partitioning of the flux between the glycolysis and the pentose phosphate pathway has been found in many prokaryotic and eukaryotic microorganisms (Blank et al. 2005b; Fuhrer et al. 2005). Illustrating examples include growth of Saccharomyces cerevisiae at high and low glucose concentration (Gombert et al. 2001) or at varied specific growth rate (Frick and Wittmann 2005), the utilization of alternative carbon sources by Corynebacterium glutamicum (Kiefer et al. 2004; Wittmann et al. 2004), or the production of antibiotics in Streptomyces coelicolor (Borodina et al. 2008). In cases of significantly changed metabolic burden, cells recruit their entire network and distribute the response on the flux level globally as observed for amino acid producing C. glutamicum (Marx et al. 1997). Even a single nucleotide exchange in the whole genome leading to deregulation of a biosynthetic enzyme may cause a systems-wide rearrangement of flux when cells switch from growth to production (Kim et al. 2006).

Also the flux through the tricarboxylic acid (TCA) cycle has been shown to be a function of the environmental conditions as exemplified for S. cerevisiae (Gombert et al. 2001; Blank and Sauer 2004). Interestingly, the slow-growing myxobacterium Sorangium cellulosum exhibits an enormously high relative TCA cycle flux far beyond typical values found in fast-growing microbial cells (Bolten et al. 2009). Calculation on energy stoichiometry based on the estimated fluxes indicated that such a high TCA cycle flux is needed to ensure growth in this microorganism, because about 90% of ATP produced is withdrawn for maintenance requirements.

Discovery of novel pathways

The rich information on metabolism by flux analysis, previously not accessible, led to the discovery of different novel pathways. Important findings comprise the simultaneous operation of glycolytic and gluconeogenetic reactions at the junction between glycolysis and TCA cycle crucial to anticipate changing environmental conditions at the expense of ATP in many microorganisms (Sauer and Eikmanns 2005), the PEP–glyoxylate pathway as novel catabolic route in Escherichia coli (Nanchen et al. 2006) or a novel biosynthetic route for isoleucine biosynthesis in C. glutamicum (Krömer et al. 2006).

More recently, 13C flux analysis was extended to investigate fluxes in more unexplored species. Among the various interesting findings obtained from such studies is the identification of the Entner–Doudoroff pathway as major catabolic route in the actinomycete Streptomyces tenebrarius (Borodina et al. 2005) or in the marine bacteria Dinoroseobacter shibae and Phaeobacter gallaeciensis (Fürch et al. 2009), the discovery of a novel route for biosynthesis of serine in lactate-grown Shewanella oneidensis (Tang et al. 2009) or of isoleucine in Geobacter metallireducens during growth on acetate (Tang et al. 2007).

Robustness and rigidity of metabolic networks

The successful miniaturization of flux analysis approaches, enabling the broad investigation of mutant libraries, has manifested flux analysis as a key technology to unravel control mechanisms in biological systems. In a large-scale study, the relative flux distribution in 137 null mutants of Bacillus subtilis was found rather invariant (Fischer and Sauer 2005). Flux profiling of different S. cerevisiae deletion mutants demonstrated that network redundancy through duplicate genes was the major (75%) and alternative pathways the minor (25%) molecular mechanism of genetic network robustness (Blank et al. 2005a). Hereby, local flux rerouting seems one key mechanism of cellular response to maintain major growth characteristics upon gene deletion. As prominent example, the lack of the genes encoding for pyruvate kinase, a central glycolytic enzyme, is compensated in E. coli (Emmerling et al. 2002) and in C. glutamicum (Becker et al. 2008) by activation of a by-pass recruiting PEP carboxylase, malate dehydrogenase, and malic enzyme to convert PEP into pyruvate, whereas other fluxes in the network remain almost unaffected.

Engineering of metabolic pathway fluxes for biotechnology

It lies at hand that metabolic flux analysis has become a core methodology in metabolic engineering which particularly aims at the optimization of flux (Iwatani et al. 2008; Park et al. 2008). With regard to metabolic flux analysis, C. glutamicum can probably be considered as the most important model organism (Wittmann and de Graaf 2005; Wittmann 2010) so that the following examples taken from investigation of this microorganism might give a flavor to which level of understanding and targeted metabolic engineering flux analysis can contribute. To date, the most valuable flux data towards rational strain optimization have certainly been obtained by routine stationary metabolic flux analysis (Wittmann 2010). The systematic quantification of flux in different lysine-producing strains (Marx et al. 1997, 1999; Wittmann and Heinzle 2001, 2002; Marx et al. 2003) or under different environmental conditions (Wendisch et al. 2000; Klapa et al. 2003; Kiefer et al. 2004; Wittmann et al. 2004; Shirai et al. 2006) has unraveled key pathways to be modified for improved production performance (Ikeda 2003; Wittmann and Becker 2007). Based on the predictions by 13C fluxomics, superior lysine production strains could be created which exhibited improved performance via single gene modifications. Prominent examples are an increased supply of NADPH, required as cofactor for lysine biosynthesis, by amplification of the pentose phosphate pathway (Ohnishi et al. 2005; Becker et al. 2007) or the gluconeogenetic enzyme fructose 1,6-bisphosphatase (Becker et al. 2005; Georgi et al. 2005). Similarly, the targeted attenuation of competing enzymes and pathways, including the TCA cycle (Becker et al. 2009), pyruvate dehydrogenase (Becker et al. 2010) as well as phosphoenolpyruvate carboxykinase (Riedel et al. 2001) successfully increased product formation. Also, glutamate production by C. glutamicum has been in the focus of flux analysis. As for lysine, the biosynthesis of glutamate demands for efficient anaplerotic replenishment (Peters-Wendisch et al. 2001; Shirai et al. 2007). Based on flux predictions, deletion of pyruvate kinase improved glutamate production under biotin-limited conditions (Sawada et al. 2010). Flux analysis further revealed that the flux through ODHC significantly decreased after induction of glutamate production (Shirai et al. 2005) which could be utilized for subsequent strain engineering (Asakura et al. 2007; Kim et al. 2009). In addition, flux analysis has been applied to a number of other microbial production strains and processes, providing extensive insights into the underlying metabolism, as recently summarized in excellent overviews (Iwatani et al. 2008; Blank and Kuepfer 2010).

Towards systems-level understanding of metabolism

Integration of multi omics data sets with fluxes

Single omics analyses, which are used quite frequently, provide information about only one layer of a cellular system, but fail to capture the interaction between the different layers essential to understand the system properties. To this end, we see more and more multi-omics studies in recent years, often involving the alliance of specialized research groups due to high costs and efforts (Sauer et al. 2007). Most of these studies combine the most frequently used tools for transcriptomics and proteomics in order to obtain complementary coverage of metabolism, perform cross-validation, or obtain novel biological insights into post-transcriptional regulation mechanisms (Zhang et al. 2010). As described above, flux analysis of microbial cells is a rather advanced and routine method in itself, but has not been as broadly used, as genomics, transcriptomics, and proteomics which have advanced into widely applied commercially available technologies. The analysis of the metabolome is not as mature and yet not fully comprehensive, which is inherently caused by the different chemical nature, high turnover rate and large concentration difference of the analytes (Zhang et al. 2010), and persisting difficulties with appropriate sampling and cell quenching hampering quantitative metabolomics (Bolten et al. 2007). This is one explanation for the tremendous difference of several orders of magnitude to which extent the different omics technologies are applied today (Fig. 3). It appears obvious that in many cases, the interaction between components of the network involved cannot be understood without knowledge on in vivo flux. To unravel and quantify the different layers of control superimposing the flux network, the flux state has to be linked to the set of cellular components, i.e., genes, transcripts, proteins, and metabolites. The integration of 13C fluxomics in multi-omics experiments, however, has special requirements. Care has to be taken concerning the label introduced in 13C fluxomics experiments which typically interferes with conventional MS-based technologies for identification and quantification of proteins as well as metabolites. In such cases, a parallelized setup is needed. The isotope labeling experiments for the fluxome have to be run under identical conditions in parallel to those for the other omics, including a thorough check for key physiological parameters such as substrate uptake, growth, product formation, respiration, or intracellular fingerprints which are not affected by the 13C label (Krömer et al. 2004). This might include additional controls via certain intracellular metabolites accessible by enzyme assays or HPLC as well as transcription profiling, which are not biased by the presence of label. In addition, financial constraints have to be considered with respect to the high price of 13C substrates, typically demanding for small cultivation volume.

Fig. 3
figure 3

Frequency of the use of the terms “genome”, “transcriptome”, “proteome”, “metabolome”, and “fluxome” in scientific publications. Since the terms “metabolome” and “fluxome” are not always used in the relevant studies, the search included “intracellular metabolite” or “metabolic flux” as alternative terms to represent the corresponding field. The number of appearance was extracted from the PubMed database (August, 2010)

Unraveling of regulation networks

The resulting picture immediately indicates that only a small set of multi-omics studies so far includes fluxes—and thus allows to really bridging cellular components with network function, which is a key goal of systems biology and systems metabolic engineering. Out of the few studies performed so far, most investigated biotechnologically relevant microorganisms, underlining systems metabolic engineering as one of the major drivers. One of the pioneering examples focused on C. glutamicum for production of the feed amino acid lysine (Krömer et al. 2004). Through integration of transcriptomics with fluxomics, correlation between gene expression and in vivo flux could be quantified for various enzymes in carbon core metabolism providing novel insights into regulation of this microorganism. It became obvious that the different reactions and pathways are controlled at different levels. As example, reactions of glucose uptake, pentose phosphate pathway, and TCA cycle were found closely regulated on the transcriptional level (Fig. 4a), whereas genes in lysine biosynthesis showed a constant expression level, despite a marked change of the metabolic flux, indicating that they are strongly regulated at the post-transcriptional level (Fig. 4b). Similar studies on E. coli (Shimizu 2004; Fong et al. 2006; Hua et al. 2007) and B. subtilis (Schilling et al. 2007) confirmed that transcriptional control is involved, but not sufficient to account for metabolic regulation, indicating other important mechanisms such as metabolite–protein interactions or protein phosphorylation (Heinemann and Sauer 2010). A comprehensive insight into the production of pigmented antibiotics in S. coelicolor was gained by coupling the analysis of fluxes with profiling of intracellular metabolites and gene expression (Borodina et al. 2008). It could be shown that the increased pentose phosphate pathway flux, linked to enhanced production, appeared largely because of accumulation of glucose 6-phosphate and fructose 6-phosphate. Integrated analysis of gene expression data further revealed transcriptional changes in genes encoding redox co-factor-dependent enzymes as well as those encoding pentose phosphate pathway enzymes.

Fig. 4
figure 4

Integration of 13C fluxome and transcriptome data to classify regulation for metabolic pathways into transcriptional (a) and post-transcriptional control (b). The data are taken from a multi-omics study on lysine production by Corynebacterium glutamicum (Krömer et al. 2004). Gene expression and flux for each pathway are given here as mean value for the corresponding genes and reaction steps involved which all showed the same trend. The normalization of flux and expression is described in the original manuscript

In E. coli, the combined analysis of transcriptome, metabolome, and fluxome confirmed the existence of hidden reactions in the central metabolism of its cells (Nakahigashi et al. 2009). One of the most massive data sets was recently generated for a systematic analysis of E. coli cells to genetic and environmental perturbation (Ishii et al. 2007). This included the quantification of 4,000 mRNA transcripts, 2,000 proteins, 600 metabolites as well as pathway fluxes. The study revealed only few and local responses to gene deletions, whereas the metabolism responded highly flexible and globally to change in the growth rate. E. coli thus seems to use complementary strategies that result in a metabolic network robust against perturbations. While many questions remain unanswered from the above studies, their data sets provide a rich source for future computational analysis to extract novel biological insights (Zamboni and Sauer 2009).

Understanding network-wide balancing of redox and energy status

The combined use of different omics tools, especially metabolomics and fluxomics, has been applied to understand how metabolic networks organize the overall adjustment to optimal redox and energy levels and distribute the give and take among the various reactions involved. In a remarkable recent study, using a combination of 13C fluxomics, metabolomics, and in vitro enzyme assays, the formation of NADPH was found to mismatch the anabolic demand in different microbial species, which could be attributed to the transhydrogenase activity in Paracoccus versutus, B. subtilis and E. coli, whereas other species recruit other mechanisms, possibly also a certain promiscuity in cofactor specificity of catabolic enzymes, surprisingly also in the pentose phosphate pathway (Fuhrer and Sauer 2009). Interesting insights into the balancing of redox metabolism could be also obtained for C. glutamicum combining proteomics, metabolomics, and 13C fluxomics under conditions of oxidative stress (Krömer et al. 2008). The results are highlighted by the identification of a global redistribution of NADPH supplying pathways as response to the expression of protective NADPH-consuming proteins and an imbalance in the redox status. Interestingly, the TCA cycle, but not the normally observed pentose phosphate pathway, was found as the major NADPH supplying pathway under conditions of oxidative stress corresponding to a low NADPH/NADP+ ratio (Fig. 5). This obviously leads to suboptimal supply of NADPH at substantially increased loss of CO2. The flux redistribution, however, was not sufficient to fully re-adjust the redox state to the normal value found in the wild type. Clearly, this opens promising possibilities to engineer the metabolism for superior performance under conditions of oxidative stress, a phenomenon often observed in biotechnological production processes. Similarly to the redox metabolism, also the maintenance of the energy status of the cell seems to include network-wide tasks. Combined metabolomics and 13C fluxomics showed that cells of E. coli respond to a decrease in the adenylate energy charge by an increase of the glycolytic flux upon a temperature shift from 37 to 42 °C for induction of recombinant protein production (Wittmann et al. 2007). It is interesting to note that the flux response also involved the activation of the previously discovered PEP–glyoxylate pathway (see above) as new catabolic route, probably induced by a bottleneck at the entry to the TCA cycle.

Fig. 5
figure 5

Integration of 13C fluxome and metabolome data to understand network-wide redox balancing. The data are taken from a multi-omics study on Corynebacterium glutamicum under oxidative stress (Krömer et al. 2008). The cellular redox state is expressed as ratio of NADPH/NADP+. The fluxes are given as relative flux normalized to the glucose uptake rate

Outlook

The increasing need for flux studies as key technology in future systems biology and systems metabolic engineering will clearly intensify efforts to apply and extend this technology. Concerning the integration with other omics approaches, promising future developments comprise the coupling of dynamic 13C labeling experiments with metabolomics. The increasing metabolite coverage in metabolomics provides direct access to labeling patterns of many more pathway intermediates than routinely available, greatly extending the resolution power of flux studies. This has been impressively shown by first examples, e.g., in assembling potentially involved metabolites into a sequence of reactions functioning in vivo, exemplified by the discovery of the ethylmalonyl–CoA pathway in Methylobacterium extorquens during the assimilation of C1 carbon (Peyraud et al. 2009). Beyond such studies, focusing on particular parts of the network, global multi-omics studies will require intelligent software tools that extract true novel biology out of the expected massive data sets. Current developments of statistical methods such as unsupervised learning, correlation network analysis, pattern recognition, or principal component analysis as well as dynamic Bayesian networks, or dynamic control theory appear quite useful, but will surely not suffice to fully unravel the complexity of the systems studied. Here, further exploration of novel concepts integration the heterogeneous data sets is still needed.

To study cells under conditions different from routine chemostat or balanced batch cultures will require the development of novel flux approaches including specific strategies for labeling input, cultivation, labeling analysis, and modeling. First interesting examples extend flux analysis to previously inaccessible conditions. This comprises the quantification of flux dynamics in non-stationary systems (Antoniewicz et al. 2007b), or of fluxes in non-growing cells (Yang et al. 2005, 2006). An interesting concept for flux analysis in large scale, involving precise labeling quantification at low enrichment by GC–IR–MS (Yuan et al. 2010) opens the possibility to perform flux studies in real case production processes. The coming years will hopefully see more conceptional extension of flux analysis so that it becomes applicable to the level of single cells towards unraveling of population heterogeneity, as well as to the flux cross-talk within consortia of different cell types, the dominating form of microbial life—both promising fascinating novel biology and useful information towards superior biocatalysts.