1 Introduction

Reducing the dimension of chemical and biochemical reaction networks or mechanisms is of great relevance both for theoretical considerations and for laboratory practice. For instance, the fundamental structure of a reaction mechanism is frequently known, or assumed from educated guesswork, but reaction rate constants are a priori unknown. Moreover, due to possibly wide discrepancies in timescales, as well as limitations on experimentally obtainable data, it is important to identify scenarios and parameter regions that guarantee accuracy of a suitably chosen reduction. Singular perturbations frequently appear here, and the fundamental theorems by Tikhonov (1952) and Fenichel (1979) provide a procedure to determine a reduced equation, as well as reliable convergence results. These theorems require an a priori identification of a perturbation parameter (also called "small parameter"). From a qualitative perspective, one actually considers a critical manifold together with an associated small parameter, and a corresponding slow invariant manifold. Given a well-defined limiting process for the small parameter, theory guarantees convergence of solutions of the full system to corresponding solutions of the reduced system. From a practical ("laboratory") perspective, however, convergence theorems are not sufficient, and quantitative results are needed to gauge the accuracy of fitting procedures. This implies the need for an appropriate small parameter, which we denote by \(\varepsilon _S\) for the moment, that also reflects quantitative features. In contrast to the critical manifold, from a qualitative perspective the perturbation parameter is far from unique. From a quantitative perspective, ideally \(\varepsilon _S\) should provide an upper estimate for the discrepancy between the exact and approximate solutions over the whole course of the slow dynamics. From a biochemical perspective it should elucidate the influence of reaction parameters.

In many application-oriented publications, the authors assume (explicitly or implicitly) that certain perturbation parameters provide a quantitative estimate for the approximation; see, e.g., Heineken et al. (1967), Segel (1988), Tzafriri (2003), Schnell (2014), Choi et al. (2017). However, while heuristic arguments may support such assumptions, no mathematical proof is given [see the discussion of the Michaelis–Menten system in Eilertsen et al. (2022)]. From the applied perspective, in the absence of rigorous results on quantitative error estimates for reductions of biochemical reaction networks or mechanisms, there is no alternative to employing heuristics. Thus, there exists a sizable gap between available theoretical results and applications, and closing this gap requires further theoretical results. The present paper is intended as a contribution toward narrowing the gap, invoking mathematical theory.

From an overall perspective (based on a derivation of singular perturbation theorems), one could say that finding ideal small parameters for a given singular perturbation scenario requires a three-step procedure:

  1.

    In a first step, estimate the approach of a particular solution to the slow manifold: A common method employs Lyapunov functions. Thus, one obtains a parameter that measures the discrepancy between the right-hand sides of the full system and the reduced equation, following a short initial transient.

  2.

    In a second step, estimate a suitable critical time at which the slow dynamics sets in, and estimate the solution at this critical time. This is needed to guarantee that the transient phase is indeed short, and to obtain a suitable initial value for the reduced equation.

  3.

    In a third step, estimate the approximation of the exact solution by the corresponding solution of the reduced equation.

At first glance, this procedure seems to pose no problems: the feasibility of the steps outlined above is guaranteed by standard results about ordinary differential equations. The hard part, however, lies in their practical implementation for a given parameter-dependent system. Generally, it is not easy to obtain meaningful and reasonably sharp estimates, and a case-by-case discussion seems unavoidable for each given system [see Schnell and Maini (2000), Eilertsen et al. (2018, 2021a), Eilertsen and Schnell (2018, 2020) for examples employing various alternative approaches].

With the three steps as a background, our goal is to make a significant contribution toward the first step, via linear timescale arguments. We will both expand and improve existing results, and moreover obtain perturbation parameters for higher-dimensional systems for which no rigorous results have previously been reported. In a biochemical context, it seems that timescale arguments were first introduced by Segel (1988), and Segel and Slemrod (1989). Conceptually, we build upon this approach, but we take a consistent local perspective. Thus, we consider (real parts of) eigenvalue ratios, based on the idea that underlies computational singular perturbation theory, going back to Lam and Goussis (1994). Our emphasis is on obtaining parameters that are workable for application-oriented readers in mathematical enzymology, and admit an interpretation in biochemical terms.

1.1 Background

A solid mathematical foundation for qualitative viability of most reduction procedures in chemistry and biochemistry is provided by singular perturbation theory (Tikhonov 1952; Fenichel 1979). This was first clearly stated and utilized in Heineken et al. (1967).

For illustrative purposes, and as further motivation, we consider a familiar system from biochemistry, viz. the (irreversible) Michaelis–Menten reaction mechanism or network (Michaelis and Menten 1913), which is modeled by the two-dimensional differential equation

$$\begin{aligned} \begin{array}{rclclcl} \dot{s}&{}=&{} -k_1e_0s&{}+&{}(k_1s+k_{-1})c, \\ \dot{c}&{}=&{} k_1e_0s&{}-&{}(k_1s+k_{-1}+k_2)c. \\ \end{array} \end{aligned}$$
(1)

For small initial enzyme concentration with respect to the initial substrate concentration, Briggs and Haldane (1925) assumed quasi-steady state (QSS) for complex concentration, thus obtaining the QSS manifold given by

$$\begin{aligned} c=\dfrac{k_1e_0s}{k_{-1}+k_2+k_1s}; \end{aligned}$$
(2)

and reduction to the Michaelis–Menten equation

$$\begin{aligned} \dot{s} =-\dfrac{k_1k_2e_0s}{k_{-1}+k_2+k_1s}. \end{aligned}$$
(3)
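
As a purely numerical illustration (our own sketch, not part of the original text), the full system (1) and the reduced Michaelis–Menten equation (3) can be integrated side by side; the rate constants and initial concentrations below are assumptions chosen only for illustration:

```python
# Minimal sketch (assumed, purely illustrative parameter values): integrate the full
# Michaelis-Menten system (1) and the reduced equation (3) and compare the substrate
# timecourses.
import numpy as np
from scipy.integrate import solve_ivp

k1, km1, k2 = 1.0, 1.0, 1.0    # assumed rate constants (arbitrary units)
e0, s0 = 0.01, 1.0             # low initial enzyme concentration relative to substrate

def full_system(t, x):
    s, c = x
    ds = -k1 * e0 * s + (k1 * s + km1) * c
    dc =  k1 * e0 * s - (k1 * s + km1 + k2) * c
    return [ds, dc]

def reduced_eq(t, y):
    # Michaelis-Menten equation (3)
    return [-k1 * k2 * e0 * y[0] / (km1 + k2 + k1 * y[0])]

t_end = 400.0
t_eval = np.linspace(0.0, t_end, 400)
full = solve_ivp(full_system, (0.0, t_end), [s0, 0.0], t_eval=t_eval, rtol=1e-9, atol=1e-12)
red = solve_ivp(reduced_eq, (0.0, t_end), [s0], t_eval=t_eval, rtol=1e-9, atol=1e-12)

print("max |s_full - s_reduced| =", np.max(np.abs(full.y[0] - red.y[0])))
```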

To quantify the notion of smallness for enzyme concentration, Briggs and Haldane introduced the dimensionless parameter

$$\begin{aligned} \varepsilon _{BH}:=\dfrac{e_0}{s_0} \end{aligned}$$
(4)

[later utilized by Heineken et al. (1967) in the first application of singular perturbation theory to this reaction], and required \(\varepsilon _{BH}\ll 1\) as a necessary condition for accuracy of the reduction. Further parameters to ensure accuracy of approximation by the Michaelis–Menten equation were introduced later on. Reich and Selkov (1974) introduced

$$\begin{aligned} \varepsilon _{RS}:= k_1e_0/(k_{-1}+k_2), \end{aligned}$$
(5)

for which Palsson and Lightfoot (1984) later gave a justification based on linearization at the stationary point 0.Footnote 5 Moreover, Segel and Slemrod (1989) derived

$$\begin{aligned} \varepsilon _{SSl}:=\dfrac{k_1e_0}{k_{-1}+k_2+k_1s_0}. \end{aligned}$$
(6)

The fundamental approach by Segel and Slemrod (1989), obtaining perturbation parameters by comparing suitable timescales, has been used widely in the literature ever since.
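
For concreteness, the parameters (4), (5) and (6) are elementary functions of the rate constants and initial concentrations. A minimal sketch (assumed, purely illustrative values):

```python
# Assumed, purely illustrative values; the three classical parameters (4)-(6).
def eps_BH(e0, s0):
    return e0 / s0

def eps_RS(e0, k1, km1, k2):
    return k1 * e0 / (km1 + k2)

def eps_SSl(e0, s0, k1, km1, k2):
    return k1 * e0 / (km1 + k2 + k1 * s0)

k1, km1, k2, e0, s0 = 1.0, 1.0, 1.0, 0.01, 1.0
print(eps_BH(e0, s0), eps_RS(e0, k1, km1, k2), eps_SSl(e0, s0, k1, km1, k2))
```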

For the Michaelis–Menten reaction mechanism, singular perturbation theory shows convergence of solutions of (1) to corresponding solutions of the reduced equation as \(e_0\rightarrow 0\), in which case all of the parameters \(\varepsilon _{BH},\,\varepsilon _{RS},\,\varepsilon _{SSl}\) approach zero. On the other hand, it is not generally true that \(\varepsilon _{BH}\rightarrow 0\), or \(\varepsilon _{RS}\rightarrow 0\), or \(\varepsilon _{SSl}\rightarrow 0\), implies convergence to the solution of the reduced system. This, as well as related matters, was discussed in detail in Eilertsen et al. (2022), with a presentation of counterexamples. We also invite the reader to see further examples in Sect. 4.

These facts illustrate that considering a single parameter—without context and without a clearly defined notion of the limiting process—will generally not be sufficient to ensure the validity of some particular reduction. In a singular perturbation setting the critical manifold is the basic object, and one generally needs to specify the way in which corresponding small parameters approach zero.

With regard to the procedure outlined in Steps 1 to 3 above, a wish list for small parameters includes the following physically motivated conditions:

  • \(\varepsilon _S\) is dimensionless;

  • \(\varepsilon _S\) is composed of reaction rates and initial values (admitting an interpretation in physical terms);

  • \(\varepsilon _S\) is controllable in experiments.

These requirements will be taken into account as well.

Our vantage point is work by Goeke et al. (2015, 2017), which provides an algorithmic approach to determine critical parameter values (Tikhonov–Fenichel parameter values, TFPV), and their critical manifolds: Choosing a curve in parameter space (with curve parameter \(\varepsilon \)) that starts at a TFPV gives rise to a singularly perturbed system, based on a clearly defined approach of the small parameter to zero.

Pursuing a less ambitious goal than the one outlined in Steps 1 to 3 above, we will utilize the separation of timescales on the slow manifold, adapting work by Lam and Goussis (1994) on computational singular perturbation theory. We focus attention on local considerations: timescales are identified as the reciprocals of the absolute real parts of eigenvalues of the linearization of a vector field near stationary points. Restriction to the vicinity of stationary points is an essential condition here. Given a singular perturbation setting, Zagaris et al. (2004) proved that the approach via "small eigenvalue ratios" is consistent. Unless some eigenvalues of large modulus are purely imaginary, the eigenvalue approach provides a small parameter that satisfies the requirement in Step 1 above, up to a multiplicative constant that remains to be determined. But dealing directly with eigenvalues (even in the rare case when they are explicitly known) is generally too cumbersome to allow productive work and concrete conclusions.

The emphasis of the present paper lies on local (linear) timescale estimates and comparisons, using a mix of algebraic and analytic tools. We will obtain parameters that are palatable to application-oriented readers and allow for interpretation in a biochemical context. Most of the parameters obtained have not appeared in the literature before, and some perhaps are unexpected.

1.2 Overview of Results

Given a chemical or biochemical reaction network or mechanism, we will present a method to obtain distinguished dimensionless parameters. These parameters are directly related to the local fast-slow dynamics of the singularly perturbed system. In contrast to many existing timescale estimates in the literature, the one employed here is conceptually consistent. Timescale considerations thus turn from an art into a relatively routine procedure, and we establish necessary conditions for timescale separation and singular perturbation reductions.

In the preparatory Sect. 2, we collect some notions and results related to singular perturbation theory. In particular, we recall Tikhonov–Fenichel parameter values (TFPV). We also note properties of the Jacobian and its characteristic polynomial on the critical manifold. It should be emphasized that our search always begins with identifying a TFPV and its associated critical manifold; all our small parameter estimates are rooted in this scenario. We establish a repository of dimensionless parameters from coefficients of the characteristic polynomial, and we recall the relation between these coefficients and the eigenvalues of the Jacobian. Finally, we fix some notation and establish some blanket nondegeneracy conditions that are assumed throughout the paper.

Section 3 is devoted to one-dimensional critical manifolds, which are of considerable relevance to experimentalists. Generally, the timecourse of a single product or substrate is measured in an experiment. Specific kinetic parameters (such as the Michaelis constant) are estimated via nonlinear regression, in which the recorded timecourse data is fitted to a one-dimensional and autonomous QSS model that approximates substrate depletion (or product formation) of the reaction on the slow timescale; see, for example, Stroberg and Schnell (2016) and Choi et al. (2017). In the one-dimensional setting, near the critical manifold there is one and only one eigenvalue of the Jacobian with small absolute real part. From the characteristic polynomial, we obtain distinguished small parameters, and we establish their correspondence to timescales. The parameters thus obtained admit an interpretation in terms of reaction parameters, so they satisfy a crucial practical requirement. They measure the ratio of the slow to the fastest timescale, and thus provide a necessary condition for timescale separation. But, in dimension greater than two, this condition is not strong enough when there are large discrepancies within the fast timescales. According to Appendix, Sect. 9.1, the ratio of the slow to the “slowest of the fast” timescales is the relevant quantity. To estimate this ratio, we introduce another type of parameter that yields sharp estimates whenever all eigenvalues are “essentially real” [borrowing terminology of Lam and Goussis (1994)]. We then specialize our results to systems of dimensions two and three.

In Sect. 4, we apply the results from Sect. 3 to the (reversible and irreversible) Michaelis–Menten system in various circumstances. We obtain a distinguished parameter for the reversible system with small enzyme concentration; this seems to be new. Specializing to the irreversible case, we obtain a parameter \(\varepsilon _{MM}\) and conclude, via an argument different from Palsson and Lightfoot (1984), that the Reich-Selkov parameter \(\varepsilon _{RS}\) is the most suitable among the standard parameters in the irreversible system. Moreover, we obtain a rather surprising distinguished parameter for the partial equilibrium approximation with slow product formation. To support the claim that this is indeed an appropriate parameter for Step 1, as stated above, we determine relevant Lyapunov estimates, and we add some observations with regard to Step 3. To illustrate the necessity of some technical restrictions in our results, we close this section by discussing a degenerate scenario with a singular critical variety.

In Sect. 5, we turn to critical manifolds of dimension greater than one. Imitating the approach for one-dimensional critical manifolds and invoking results from local analytic geometry, we obtain distinguished parameters that measure the ratio of the fastest timescale to the “fastest of the slow” timescales. We provide a detailed analysis for three-dimensional systems with two-dimensional critical manifold.

In Sect. 6, we apply our theory to some familiar three-dimensional systems from biochemistry, viz. cooperative systems with two complexes, and competitive as well as uncompetitive inhibition, for low enzyme concentration. For these systems the only available perturbation parameters in common use seem to be \(\varepsilon _{BH}=e_0/s_0\), \(\varepsilon _{SSl}\) and ad hoc variants of these. There seems to exist no derivation of small parameters via timescale arguments (in the spirit of Segel and Slemrod) in the literature. We thus break new ground, and we obtain meaningful and useful distinguished parameters. We illustrate our results with several numerical examples, to verify the efficacy of the parameters. But, we also include simulations to show their limited applicability in certain regions of parameter space. Such limitations were to be expected, since Steps 2 and 3 are needed for a complete analysis. These examples also illustrate the necessity of additional hypotheses imposed in the derivation of the distinguished parameters.

In Sect. 7, we consider some reductions of three-dimensional systems obtained via projection onto two-dimensional critical manifolds. Specifically, we compute some two-dimensional reductions of the competitive and uncompetitive inhibitory reaction mechanisms, and we derive distinguished parameters that are relevant for the accuracy of these reductions. Again, we illustrate our results by numerical simulations. To finish, we discuss a three timescale scenario that leads to a hierarchical structure in which the two-dimensional slow manifold contains an embedded one-dimensional “very slow” manifold.

Section 9, an Appendix, is a recapitulation of the Lyapunov function method for singularly perturbed systems, also outlining the relevance of the eigenvalue ratios for Step 1, and some observations on Steps 2 and 3. Moreover, Appendix contains a summary of some facts from the literature, and proofs for some technical results. Sections 2, 3 and 5 as well as Appendix (Sect. 9) are mostly technical. Readers primarily interested in applications may want to skim these only, and focus on the applications in Sects. 4, 6 and 7.

2 Preliminaries

We will discuss parameter-dependent ordinary differential equations

$$\begin{aligned} \dot{x} = h(x,\pi ),\quad x\in \mathbb R^n,\quad \pi \in \Pi ,\quad \Pi \subseteq \mathbb R^m\text { closed, } \end{aligned}$$
(7)

with the right-hand side a polynomial in x and \(\pi \). Our main motivation is the study of chemical mass action reaction mechanisms and their singular perturbation reductions.

2.1 Tikhonov–Fenichel Parameter Values (a Review)

We consider singular perturbation reductions that are based on the classical work by Tikhonov (1952) and Fenichel (1979). Frequently the pertinent theorems are stated for systems in slow-fast standard form

$$\begin{aligned} \begin{aligned} \dot{u}_1&= \varepsilon \,f_1(u_1,u_2,\varepsilon ),\\ \dot{u}_2&= f_2(u_1,u_2,\varepsilon ),\\ \end{aligned} \end{aligned}$$
(8)

with a small parameter \(\varepsilon \), subject to certain additional conditions. In slow time, \(\tau =\varepsilon t\), the reduced system takes the form

$$\begin{aligned} \begin{aligned} \dfrac{\textrm{d}u_1}{\textrm{d}\tau }&= f_1(u_1,u_2,\varepsilon ),\\ 0&= f_2(u_1,u_2,\varepsilon ),\\ \end{aligned} \end{aligned}$$

and the above mentioned conditions ensure that the second equation admits a local resolution for \(u_2\) as a function of \(u_1\) and \(\varepsilon \). For general parameter-dependent systems (7) one first needs to identify the parameter values from which such reductions emanate. We recall some notions and results (slightly modified from Goeke et al. 2015):

  1.

    A parameter \(\widehat{\pi }\in \Pi \) is called a Tikhonov–Fenichel parameter value (TFPV) for dimension \(s\) ( \(1\le s\le n-1\)) of system (7) whenever the following hold:

    (i)

      An irreducible component of the critical variety, i.e., of the zero set \( \mathcal {V}(h(\cdot , \widehat{\pi }))\) of \(x\mapsto h(x\,,\widehat{\pi })\), contains a (Zariski dense) local submanifold \(\widetilde{Y}\) of dimension \(s\), which is called the critical manifold.

    (ii)

      For all \(x\in \widetilde{Y}\) one has \(\textrm{rank}\,D_1h(x,\widehat{\pi })=n-s\) and

      $$\begin{aligned} \mathbb R^n = \textrm{Ker}\ D_1h(x,\widehat{\pi }) \oplus \textrm{Im}\ D_1h(x,\widehat{\pi }). \end{aligned}$$

      Here \(D_1\) denotes the partial derivative with respect to x.

    (iii)

      For all \(x\in \widetilde{Y}\) the nonzero eigenvalues of \(\ D_1h(x,\widehat{\pi }) \) have real parts \(<0\).

  2.

    Given a TFPV, for any smooth curve \(\varepsilon \mapsto \widehat{\pi }+\varepsilon \rho +\cdots \) in parameter space \(\Pi \), the system

    $$\begin{aligned} \dot{x}=&{} h(x,\widehat{\pi }{+}\varepsilon \rho {+}\cdots ){=}h(x,\widehat{\pi }){+}\varepsilon D_2h(x,\widehat{\pi })\,\rho {+}\cdots {=:} h^{(0)}(x) {+}\varepsilon h^{(1)}(x)+\cdots , \end{aligned}$$

    with \(D_2\) denoting the partial derivative with respect to \(\pi \), admits a singular perturbation (Tikhonov–Fenichel) reduction. A standard method is to fix a parameter direction and a “ray” \(\varepsilon \mapsto \widehat{\pi }+\varepsilon \rho \) in parameter space. In a chemical interpretation this may correspond to a gradual increase of some parameters, such as initial concentrations. Our work will always be based on this procedure; by this specification we avoid ambiguities about the range of parameters.

  3.

    The computation of a reduction in the coordinate-free setting is described in Goeke and Walcher (2014): Assuming the TFPV conditions in item 1, there exist rational functions P, with values in \(\mathbb R^{n\times (n-s)}\), and \(\mu \), with values in \(\mathbb R^{n-s}\), such that

    $$\begin{aligned} h^{(0)}(x)=P(x)\mu (x) \text { on } \widetilde{Y}, \end{aligned}$$

    and P(x) as well as \(D\mu (x)\) have full rank on \(\widetilde{Y}\). The reduced equation on \(\widetilde{Y}\) then has the representation

    $$\begin{aligned} \dot{x}=\varepsilon \left( I-P(x)\left( D\mu (x)P(x)\right) ^{-1}D\mu (x)\right) h^{(1)}(x), \end{aligned}$$
    (9)

    which is correct up to \(O(\varepsilon ^2)\); a computational sketch of this reduction for the Michaelis–Menten system is given after this list. By Tikhonov and Fenichel, solutions of (7) that start near \(\widetilde{Y}\) will converge to solutions of the reduced system as \(\varepsilon \rightarrow 0\). But some caveats are in order:

    • The reduction is guaranteed only locally, for neighborhoods of compact subsets of the critical manifold and for sufficiently small \(\varepsilon \). Determining a neighborhood explicitly for which the reduction is valid poses an individual problem for each system.

    • In particular, the distance of the initial value of (7) from the slow manifold (not only from the critical manifold) is relevant for the reduction. In general, an approximate initial value for the reduced equation on the slow manifold must be determined.

    • If the transversality condition in (ii) above breaks down, standard singular perturbation theory is no longer applicable. But, even when it is satisfied, the range of validity for the reduction may be quite small. This reflects the effect of a local transformation to Tikhonov standard form.

    • Finally, the reduced equation may be trivial, in which case higher-order terms in \(\varepsilon \) are dominant and no conclusion can be drawn from the first order reduction. By the same token, if the term following \(\varepsilon \) in (9) is small, then the quality of the reduction may be poor.

  4.

    Turning to computational matters, consider the characteristic polynomial

    $$\begin{aligned} \chi (\tau ,x,\pi )=\tau ^n+\sigma _{1}(x,\pi )\tau ^{n-1}+\cdots + \sigma _{n-1}(x,\pi )\tau +\sigma _n(x,\pi ) \end{aligned}$$
    (10)

    of the Jacobian \(D_1h(x,\pi )\). Then, given \(0<s<n\), a parameter value \(\widehat{\pi }\) is a TFPV with locally exponentially attracting critical manifold \(\widetilde{Y}\) of dimension s, and \(x_0 \in \widetilde{Y}\), only if the following hold:

    • \(h(x_0,\widehat{\pi })=0\).

    • The characteristic polynomial \(\chi (\tau ,x,\pi )\) satisfies

      (i)

        \(\sigma _n(x_0,\widehat{\pi })=\cdots =\sigma _{n-s+1}(x_0,\widehat{\pi })=0\);

      (ii)

        all roots of \(\chi (\tau ,x_0,\widehat{\pi })/\tau ^s\) have negative real parts.

    This characterization shows that \(x_0\) satisfies an overdetermined system of equations (more than n equations in n variables), which in turn allows one to determine conditions on \(\widehat{\pi }\) algorithmically, by way of elimination theory; see Goeke et al. (2015). Due to the Routh–Hurwitz theorem (see, e.g., Gantmacher 2005),

    $$\begin{aligned} \sigma _k(x_0,\widehat{\pi })>0 \text { for }x_0\in \widetilde{Y},\,1\le k\le n-s \end{aligned}$$

    is a necessary consequence of condition (ii). Necessary and sufficient conditions for TFPV are stated in Goeke et al. (2015), but we will not need them here.
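
To illustrate both the characteristic polynomial conditions above and the reduction formula (9), the following symbolic sketch (our own illustration, anticipating the Michaelis–Menten discussion of Sect. 4) carries out the computation for system (1), taking the TFPV with \(e_0=0\) and the parameter direction \(\rho =(e_0^*,0,\ldots ,0)^\textrm{tr}\); the symbol e0star in the code stands for \(e_0^*\):

```python
# Sketch (our own illustration): characteristic polynomial conditions and the
# reduction formula (9) for the irreversible Michaelis-Menten system (1), with
# TFPV e0 = 0 and parameter direction rho = (e0star, 0, ..., 0).
import sympy as sp

s, c, e0, e0star, k1, km1, k2, eps = sp.symbols(
    's c e0 e0star k1 km1 k2 epsilon', positive=True)

# Right-hand side h(x, pi) of system (1) and its Jacobian D_1 h
h = sp.Matrix([-k1*e0*s + (k1*s + km1)*c,
                k1*e0*s - (k1*s + km1 + k2)*c])
J = h.jacobian([s, c])

# Coefficients of chi(tau) = tau^2 + sigma_1*tau + sigma_2
tau = sp.symbols('tau')
chi = (tau*sp.eye(2) - J).det().expand()
sigma1, sigma2 = chi.coeff(tau, 1), chi.coeff(tau, 0)

# TFPV conditions on the critical manifold {c = 0} at e0 = 0:
print(sp.simplify(sigma2.subs({e0: 0, c: 0})))   # sigma_2 vanishes
print(sp.simplify(sigma1.subs({e0: 0, c: 0})))   # sigma_1 = k1*s + km1 + k2 > 0

# Reduction formula (9): decompose h^(0) = P*mu with mu = c, set h^(1) = (dh/de0)*e0star
h0 = h.subs(e0, 0)
P = sp.Matrix([k1*s + km1, -(k1*s + km1 + k2)])
mu = sp.Matrix([c])
assert all(sp.simplify(expr) == 0 for expr in (h0 - P*mu))
Dmu = mu.jacobian([s, c])
h1 = h.diff(e0) * e0star
reduced = eps * (sp.eye(2) - P * (Dmu*P).inv() * Dmu) * h1
# On {c = 0} the first component reproduces Eq. (3) with e0 = eps*e0star:
print(sp.simplify(reduced[0].subs(c, 0)))
```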

2.2 Dimensionless Parameters

From Goeke et al. (2015), one finds critical parameter values and corresponding critical manifolds, but it remains to specify the notion of "small perturbation," and to relate it to reaction parameters. Singular perturbation theory guarantees convergence in the limit \(\varepsilon \rightarrow 0\), but for a given system estimates for the rate of convergence are desirable.

To be physically meaningful, relevant small parameters should be dimensionless. The only dimensions appearing in reaction parameters are time and concentration; hence, by dimensional analysis (Buckingham Pi Theorem; see, e.g., Wan 2018), there exist at least \(m-2\) independent dimensionless Laurent monomials in the parameters, such that every dimensionless analytic function of the reaction parameters can locally be expressed as a function of these. This collection may be quite large; we impose the additional requirement that parameters should correspond to timescales. In a preliminary step, we therefore list an inventory of rational dimensionless quantities for the network or mechanism.

Lemma 1

Let (7) correspond to a CRN with mass action kinetics, and \(\chi \) as in (10). Then:

  (a)

    The coefficient \(\sigma _k\) of \(\chi \) has dimension \((\textrm{Time})^{-k}\).

  (b)

    Whenever \(i_1,\ldots ,i_p\ge 1\) and \(j_1,\ldots ,j_q\ge 1\) are integers such that \(i_1+\cdots +i_p=j_1+\cdots +j_q\), the expression

    $$\begin{aligned} \dfrac{\sigma _{i_1}\cdots \sigma _{i_p}}{\sigma _{j_1}\cdots \sigma _{j_q}} \end{aligned}$$

    (when defined) is dimensionless.

Proof

Every monomial on the right-hand side of (7) has dimension \(\textrm{Concentration}/\textrm{Time}\), since this holds for the left-hand side. The entries of the Jacobian \(D_1h\) are obtained via differentiation with respect to some \(x_i\), hence have dimension \((\textrm{Time})^{-1}\). Since \(\sigma _i\) is a homogeneous polynomial of degree i in the matrix entries, part (a) follows. Part (b) is an immediate consequence.

2.3 Timescales

There exist various notions of timescale in the literature, and in some cases this ambiguity influences the derivation of small parameters. For a case in point, we invite the reader to see Segel and Slemrod (1989), who use different notions of timescale for the fast and slow dynamics. But, for systems that decay or grow exponentially, and by extension for linear and approximately linear systems, there exists a well-defined notion:

Definition 1

Let \(A:\,\mathbb R^n\rightarrow \mathbb R^n\) be a linear map, and consider the linear differential equation \(\dot{x}= A\,x\). For \(\lambda \) an eigenvalue of A, with nonzero real part, we call \(|\textrm{Re}\,\lambda |^{-1}\) the timescale corresponding to \(\lambda \).

The timescale of an invariant subspace \(V\subseteq \mathbb R^n\) (which is a subspace of a sum of generalized eigenspaces) is defined as the slowest timescale of the eigenvalues involved.

For a single eigenvalue, the timescale characterizes the speed of growth or decay of solutions along the generalized eigenspace of \(\lambda \). For an invariant subspace, it characterizes the speed for generic initial values.

We will work with this consistent notion of linear timescale, and its extension to linearizations of nonlinear systems near stationary points, throughout the paper. Thus, we adopt the perspective taken in Lam and Goussis (1994), which is justified by Fenichel’s local characterization of the dynamics near the critical manifold \(\widetilde{Y}\) (Fenichel 1979, Section V), as proven by Zagaris et al. (2004). Indeed, the time evolution near \(\widetilde{Y}\) is governed by the linearization \(D_1h(x,\widehat{\pi }+\varepsilon \rho )\), with \(\pi =\widehat{\pi }+\varepsilon \rho \) close to a TFPV \(\widehat{\pi }\), and \(x\in \widetilde{Y}\). For \(\pi =\widehat{\pi }\) the Jacobian has vanishing eigenvalues, hence for \(\pi \) near \(\widehat{\pi }\) one will have eigenvalues of small modulus, while all nonzero eigenvalues of \(D_1h(x,\widehat{\pi })\) have negative real parts.

From a practical perspective, eigenvalues are at best inconvenient to work with. Moreover, in our context, resorting to numerical approximations is not a viable option. To obtain more palatable parameters, we recall the correspondence between the eigenvalues \(\lambda _1,\ldots ,\lambda _n\) of \(D_1h(x,\widehat{\pi }+\varepsilon \rho )\) and the coefficients \(\sigma _k\) of the characteristic polynomial. One has

$$\begin{aligned} \sigma _k=(-1)^k\sum \lambda _{i_1}\cdots \lambda _{i_k} \end{aligned}$$

with the summation extending over all tuples \(i_1,\ldots ,i_k\) such that \(1\le i_1<\cdots <i_k\le n\). In particular

$$\begin{aligned} \begin{array}{rcl} -\sigma _1&{}=&{}\lambda _1+\cdots +\lambda _n;\\ (-1)^{n-1}\sigma _{n-1}&{}=&{} \sum \limits _{i=1}^n\prod _{j\not =i}\lambda _j;\\ (-1)^n\sigma _n&{}=&{}\lambda _1\cdots \lambda _n;\\ \dfrac{\sigma _{n-1}}{\sigma _n}&{}=&{}-\sum \dfrac{1}{\lambda _j}. \end{array} \end{aligned}$$
(11)
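
These relations are straightforward to confirm numerically; a minimal sketch (illustration only, with a randomly generated matrix standing in for a Jacobian):

```python
# Numerical sanity check (illustration only) of the relations (11) for a randomly
# generated real matrix standing in for a Jacobian.
import numpy as np

rng = np.random.default_rng(0)
n = 4
A = -2.0 * np.eye(n) + 0.3 * rng.standard_normal((n, n))

lam = np.linalg.eigvals(A)
coeffs = np.poly(A)      # [1, sigma_1, ..., sigma_n] for chi(tau) = det(tau*I - A)
sigma = coeffs[1:]

print(np.isclose(sigma[0], -lam.sum()))                       # sigma_1
print(np.isclose(sigma[-1], (-1)**n * np.prod(lam)))          # sigma_n
print(np.isclose(sigma[-2] / sigma[-1], -np.sum(1.0 / lam)))  # sigma_{n-1}/sigma_n
```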

2.4 Blanket Assumptions

The principal goal of the present paper is to provide consistent and workable local timescale estimates in terms of the reaction parameters. Throughout the remainder of the paper, the following notions will be used and the following assumptions will be understood:

  1.

    We consider a polynomial parameter-dependent system (7), and a TFPV \(\widehat{\pi }\) for dimension \(s\ge 1\), with critical manifold \(\widetilde{Y}\). The entries of \(\widehat{\pi }\) are not uniquely determined by the critical manifold. We allow these entries to range in a suitable compact subset of parameter space (to be restricted by requirements in the following items).

  2.

    We fix \(\rho \) in the parameter space, and consider the singularly perturbed system for the ray in parameter space \(\widehat{\pi }+\varepsilon \rho \), with \(0\le \varepsilon \le \varepsilon _\textrm{max}\), and restrictions on \(\varepsilon _\textrm{max}>0\) to be specified.

  3.

    Moreover, we let \(K\subset \mathbb R^n\) be a compact set with nonempty interior, such that \(\widetilde{Y}\cap K\) is also compact. K should contain the initial values for all relevant solutions of (7).

  4.

    Since \(\widehat{\pi }\) is a TFPV, we have \(\sigma _k(x,\widehat{\pi })>0\) for all \(x\in \widetilde{Y}\cap K\), \(1\le k\le n-s\). We choose \(\varepsilon _\textrm{max}\) so that \(\sigma _k(x,\widehat{\pi }+\varepsilon \rho )\) is defined and bounded above and below by positive constants on

    $$\begin{aligned} K^*=K^*(\varepsilon _\textrm{max})=\left( \widetilde{Y}\cap K\right) \times [0,\varepsilon _\textrm{max}], \end{aligned}$$
    (12)

    for \(1\le k\le n-s\). Such a choice is possible by compactness and continuity, given a suitable compact set in parameter space.

  5.

    As a crucial basic condition, we require that Tikhonov–Fenichel reduction is accurate up to order \(\varepsilon ^2\) in a compact neighborhood \(\widetilde{K}\) of \(\widetilde{Y}\cap K\), with \(\varepsilon \le \varepsilon _\textrm{max}\). Consult Sect. 9.1 to verify that this requirement can be satisfied.

We emphasize that the present paper focuses on asymptotic timescale estimates near the critical manifold, which are based on Fenichel’s local theory. The determination of \(\varepsilon _\textrm{max}\) (and by extension, the range of applicability) will not be addressed in general. Moreover, in applications we may replace sharp estimates by weaker ones that permit an interpretation in biochemical terms.

3 Critical Manifolds of Dimension One

In this technical section, we consider system (7) in \(\mathbb R^n\), \(n\ge 2\), with a critical manifold of dimension \(s=1\). We will derive two types of distinguished parameters that characterize timescale discrepancies, and discuss systems of dimensions two and three in some detail.

We have \(\sigma _n(x,\widehat{\pi })=0\) on \(\widetilde{Y}\), and \(\sigma _k(x,\widehat{\pi }+\varepsilon \rho )>0\) for \(1\le k\le n-1\), \(x\in \widetilde{Y}\cap K\) and \(0\le \varepsilon \le \varepsilon _\textrm{max}\). Moreover

$$\begin{aligned} \sigma _n(x,\widehat{\pi }+\varepsilon \rho )=\varepsilon \widehat{\sigma }_n(x,\widehat{\pi },\rho ,\varepsilon ) \end{aligned}$$
(13)

with a polynomial \(\widehat{\sigma }_n\). We require the nondegeneracy condition

$$\begin{aligned} \widehat{\sigma }_n(x,\widehat{\pi },\rho ,0)\not =0 \text { for all } x\in \widetilde{Y}\cap K. \end{aligned}$$
(14)

Denote by \(\lambda _1,\ldots ,\lambda _n\) the eigenvalues of \(D_1h(x,\pi )\), choosing the labels so that \(\lambda _n(x,\widehat{\pi })=0\) for all \(x\in \widetilde{Y}\cap K\).

The following facts are known. We recall some proofs in Appendix, for the reader’s convenience.

Lemma 2

  (a)

    One has

    $$\begin{aligned} \lambda _n(x,\widehat{\pi }+\varepsilon \rho )=\varepsilon \widehat{\lambda }_n(x,\widehat{\pi },\rho ,\varepsilon ), \end{aligned}$$

    with \(\widehat{\lambda }_n\) analytic, and \(\widehat{\lambda }_n(x,\widehat{\pi },\rho ,0)\not =0\) on K.

  (b)

    Given \(\beta >1\), there exist \(\Theta >0\), \(\theta >0\) such that \(-\Theta /\beta \le \textrm{Re}\,\lambda _i(x,\widehat{\pi })\le -\beta \theta \) for all \(x\in \widetilde{Y}\cap K\), \(1\le i\le n-1\).

  (c)

    For suitably small \(\varepsilon _\textrm{max}\), one has

    $$\begin{aligned} -\Theta \le \textrm{Re}\,\lambda _i(x,\widehat{\pi }+\varepsilon \rho )\le -\theta \end{aligned}$$

    for all \((x,\varepsilon )\in K^*\), \(1\le i\le n-1\).

3.1 Distinguished Small Parameters

We turn to the construction of small parameters from the repository in Lemma 1. Consider the rational function

$$\begin{aligned} (x,\varepsilon )\mapsto \dfrac{\sigma _n(x,\widehat{\pi }+\varepsilon \rho )}{\sigma _{1}(x,\widehat{\pi }+\varepsilon \rho )\cdot \sigma _{n-1}(x,\widehat{\pi }+\varepsilon \rho )}, \quad x\in \widetilde{Y}\cap K,\,\varepsilon \in [0,\varepsilon _\textrm{max}]. \end{aligned}$$
(15)

Definition 2

  (i)

    Let

    $$\begin{aligned} \begin{array}{rcl} L(\widehat{\pi },\rho )&{}:=&{}\inf _{x\in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })\cdot \sigma _{n-1}(x,\widehat{\pi })}\right| ,\\ U(\widehat{\pi },\rho )&{}:=&{}\sup _{x\in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })\cdot \sigma _{n-1}(x,\widehat{\pi })}\right| .\\ \end{array} \end{aligned}$$
    (16)
  (ii)

    We call,

    $$\begin{aligned} \varepsilon ^*(\widehat{\pi },\rho ,\varepsilon ):=\varepsilon \cdot U(\widehat{\pi },\rho ), \end{aligned}$$
    (17)

    the distinguished upper bound for the TFPV \(\widehat{\pi }\) with parameter direction \(\rho \) of system (7), and we call,

    $$\begin{aligned} \varepsilon _*(\widehat{\pi },\rho ,\varepsilon ):=\varepsilon \cdot L(\widehat{\pi },\rho ), \end{aligned}$$
    (18)

    the distinguished lower bound for the TFPV \(\widehat{\pi }\) with parameter direction \(\rho \).

By the nondegeneracy condition, one has \(U(\widehat{\pi },\rho )\ge L(\widehat{\pi },\rho )>0\). We obtain the following asymptotic inequalities:

Proposition 1

Given \(\alpha >0\), for sufficiently small \(\varepsilon _\textrm{max}\), the inequalities

$$\begin{aligned} \frac{1}{(1+\alpha )}L(\widehat{\pi },\rho ) \le \left| \dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,\varepsilon )}{\sigma _{1}(x,\widehat{\pi }+\varepsilon \rho )\cdot \sigma _{n-1}(x,\widehat{\pi }+\varepsilon \rho )}\right| \le (1+\alpha ) U(\widehat{\pi },\rho ) \end{aligned}$$
(19)

hold on \(K^*\).

Proof

By analyticity in \(\varepsilon \) one has, for \(\varepsilon _\textrm{max}\) sufficiently small,

$$\begin{aligned} \left| \dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,\varepsilon )}{\sigma _{1}(x,\widehat{\pi }+\varepsilon \rho )\cdot \sigma _{n-1}(x,\widehat{\pi }+\varepsilon \rho )}-\dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })\cdot \sigma _{n-1}(x,\widehat{\pi })} \right| \le \mathrm{const.}\cdot \varepsilon \end{aligned}$$

for all \((x,\varepsilon )\in K^*\). The assertion follows.

Remark 1

There are two points to make:

  • By definition, determining the distinguished upper and lower bounds amounts to determining the maximum and minimum of a rational function on a compact set. It may not be possible (or not advisable) to determine \(\varepsilon ^*\) or \(\varepsilon _*\) exactly, and one may have to be content with sufficiently tight upper resp. lower estimates.

  • The derivation of the small parameters involves the critical manifold and the TFPV \(\widehat{\pi }\), hence they depend on these choices. Moreover, there is some freedom of choice for the parameter direction \(\rho \), which also influences the bounds. For these reasons one should not assume universal efficacy of any small parameter without further context.

3.2 The Correspondence to Timescales

We now discuss the correspondence between timescales and the parameters determined from (15). By direct verification, via (11) one finds for the eigenvalues \(\lambda _1,\ldots ,\lambda _n\) of \(D_1h(x,\pi )\):

Lemma 3

  (a)

    The identity

    $$\begin{aligned} \sum _{i\not =j}\frac{\lambda _i}{\lambda _j}=\frac{\sigma _1\sigma _{n-1}}{\sigma _n}-n \end{aligned}$$
    (20)

    holds whenever all \(\lambda _i\not =0\).

  (b)

    With \((x,\varepsilon )\in K^*\), for \(\varepsilon \not =0\) one has

    $$\begin{aligned} \frac{1}{\varepsilon }\sum _{i<n}\lambda _i/\widehat{\lambda }_n+\sum _{i\not =j;\,i,j<n}\lambda _i/\lambda _j+\varepsilon \sum _{i<n}\widehat{\lambda }_n/\lambda _i=\frac{1}{\varepsilon }\frac{\sigma _1\sigma _{n-1}}{\widehat{\sigma }_n}-n. \end{aligned}$$

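Identity (20) in part (a) is elementary to verify; a minimal symbolic check for \(n=3\) (illustration only, not part of the original argument):

```python
# Symbolic check (illustration only) of identity (20) for n = 3.
import sympy as sp

lam = sp.symbols('lambda1 lambda2 lambda3')
l1, l2, l3 = lam
sigma1 = -(l1 + l2 + l3)
sigma2 = l1*l2 + l1*l3 + l2*l3
sigma3 = -l1*l2*l3

lhs = sum(lam[i] / lam[j] for i in range(3) for j in range(3) if i != j)
rhs = sigma1 * sigma2 / sigma3 - 3
print(sp.simplify(lhs - rhs))   # -> 0
```
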
Lemma 3 gives rise to further asymptotic inequalities:

Proposition 2

Let \(\beta \), \(\theta \) and \(\Theta \) be as in Lemma 2, and \(\alpha >0\). Then, for sufficiently small \(\varepsilon _\textrm{max}>0\), the following hold:

  (a)

    For all \((x,\varepsilon )\in K^*\),

    $$\begin{aligned} \frac{1}{(1+\alpha )}\varepsilon _*(\widehat{\pi },\rho ,\varepsilon )\le \left| \dfrac{\lambda _n(x,\widehat{\pi }+\varepsilon \rho )}{\sum _{i<n}\lambda _i(x,\widehat{\pi }+\varepsilon \rho )}\right| \le (1+\alpha )\varepsilon ^*(\widehat{\pi },\rho ,\varepsilon ). \end{aligned}$$
    (21)

    In particular, there exist constants \(C_1,\,C_2\), such that

    $$\begin{aligned} C_1 \varepsilon \le \left| \dfrac{\lambda _n(x,\widehat{\pi }+\varepsilon \rho )}{\sum _{i<n}\lambda _i(x,\widehat{\pi }+\varepsilon \rho )}\right| \le C_2 \varepsilon . \end{aligned}$$
  (b)

    The global estimates

    $$\begin{aligned} \frac{1}{(1+\alpha )}\varepsilon _*\le \dfrac{\inf |\lambda _n|}{(n-1)\Theta }\le \dfrac{\sup |\lambda _n|}{(n-1)\theta } \le (1+\alpha )\varepsilon ^* \end{aligned}$$
    (22)

    hold, with infimum and supremum being taken over all \((x,\varepsilon )\in K^*\).

Proof

From Lemma 3 one obtains that

$$\begin{aligned} \dfrac{|\widehat{\lambda }_n(x,\widehat{\pi },\rho ,\varepsilon )|}{|\sum _{i=1}^{n-1}\lambda _i(x,\widehat{\pi }+\varepsilon \rho )|}=\left| \dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,\varepsilon )}{\sigma _1(x,\widehat{\pi }+\varepsilon \rho )\sigma _{n-1}(x,\widehat{\pi }+\varepsilon \rho )}\right| +\varepsilon \eta (x,\widehat{\pi },\rho ,\varepsilon ) \end{aligned}$$

for all \((x,\varepsilon )\in K^*\), with bounded \(\eta \). Combining this with Proposition 1 yields the assertions of part (a), and also

$$\begin{aligned} \frac{1}{(1+\alpha )}L(\widehat{\pi },\rho )\le \left| \dfrac{\widehat{\lambda }_n(x,\widehat{\pi },\rho ,\varepsilon )}{\sum _{i<n}\lambda _i(x,\widehat{\pi }+\varepsilon \rho )}\right| \le (1+\alpha )U(\widehat{\pi },\rho ) \end{aligned}$$

for all \((x,\varepsilon )\in K^*\), provided \(\varepsilon _\textrm{max}\) is sufficiently small. Noting

$$\begin{aligned} |\sum _{i=1}^{n-1}\lambda _i(x,\pi )|=|\sum _{i=1}^{n-1}\textrm{Re}\,\lambda _i(x,\pi )|=\sum _{i=1}^{n-1}|\textrm{Re}\,\lambda _i(x,\pi )|, \end{aligned}$$

the second statement follows by standard estimates.

Informally speaking, Proposition 2 provides estimates for the ratio of the slowest to the fastest timescale, with \(\sum _{i<n}\lambda _i\) being dominated by the real part with largest modulus. Thus, for dimension \(n>2\), the estimates may be unsatisfactory whenever \(\Theta \gg \theta \). For applications the second estimate in (22) is more relevant, since the fast dynamics will be governed by the smallest absolute real part of \(\lambda _1,\ldots ,\lambda _{n-1}\) (see, Sect. 9.1). The parameter \(\varepsilon ^*\) by itself does not completely characterize the timescale discrepancies, as should be expected. If there is more than one eigenvalue ratio to consider then a single quantity cannot measure all of them.

However, in the following—specialized but relevant—setting a general estimate can be obtained from the coefficients of the characteristic polynomial.

Proposition 3

Let \(\beta \), \(\theta \) and \(\Theta \) be as in Lemma 2, and \(\alpha >0\). Moreover assume that the eigenvalues \(\lambda _1,\ldots ,\lambda _{n-1}\) satisfy \(|\textrm{Re}\,\lambda _j|>|\textrm{Im}\,\lambda _j|\), and let \(|\textrm{Re}\,\lambda _1|\ge \cdots \ge |\textrm{Re}\,\lambda _{n-1}|\). Define

$$\begin{aligned} \mu ^*:=\varepsilon \cdot \sup _{x\in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,0)\cdot \sigma _{n-2}(x,\widehat{\pi })}{\sigma _{n-1}(x,\widehat{\pi })^2}\right| . \end{aligned}$$
(23)

Then, for sufficiently small \(\varepsilon _\textrm{max}>0\), one has

$$\begin{aligned} \sup _{(x,\varepsilon )\in K^*}\left| \dfrac{\lambda _n}{\textrm{Re}\,\lambda _{n-1}}\right| \le \sqrt{2} (1+\alpha )\ \mu ^*. \end{aligned}$$
(24)

Whenever \(\lambda _{n-1}\in \mathbb R\), the estimate can be sharpened to

$$\begin{aligned} \sup _{(x,\varepsilon )\in K^*}\left| \dfrac{\lambda _n}{\textrm{Re}\,\lambda _{n-1}}\right| \le (1+\alpha )\ \mu ^*. \end{aligned}$$
(25)

Proof

  (i)

    Preliminary observation: Let \(k\ge 2\) and \(\beta _1,\ldots ,\beta _k\in \mathbb C\) with negative real parts, and \(|\textrm{Re}\,\beta _1|\ge \cdots \ge |\textrm{Re}\,\beta _{k}|\). Moreover denote by \((-1)^\ell \tau _\ell \) the \(\ell ^\textrm{th}\) elementary symmetric polynomial in the \(\beta _j\). If \(|\textrm{Re}\,\beta _j|>|\textrm{Im}\,\beta _j|\) for \(j=1,\ldots ,k\), then

    $$\begin{aligned} |\textrm{Re}\,\beta _k|\ge \dfrac{\tau _{k}}{\sqrt{2}\tau _{k-1}}, \text { and } |\beta _k|\ge \dfrac{\tau _{k}}{\tau _{k-1}} \text { when } \beta _k\in \mathbb R. \end{aligned}$$

    To verify this, recall

    $$\begin{aligned} \sum _{i\not =j}\dfrac{\beta _i}{\beta _j}=\dfrac{\tau _1\tau _{k-1}}{\tau _k}-k\le \dfrac{\tau _1\tau _{k-1}}{\tau _k}-1. \end{aligned}$$

    Now, for complex numbers \(z,\,w\) with negative real parts and \(|\textrm{Re}\,z|>|\textrm{Im}\,z|\), \(|\textrm{Re}\,w|>|\textrm{Im}\,w|\), one has \(\textrm{Re}\,\frac{z}{w}>0\). Therefore, all \(\textrm{Re}\,\beta _i/\beta _j>0\), \(1\le i,\,j\le k-1\), and since their sum is real we obtain the estimate

    $$\begin{aligned} \dfrac{\tau _1}{|\beta _k|}=\sum _{i=1}^{k}\dfrac{\beta _i}{\beta _k}=1+\sum _{i=1}^{k-1}\dfrac{\beta _i}{\beta _k}\le \dfrac{\tau _1\tau _{k-1}}{\tau _k}. \end{aligned}$$

    With \(|\textrm{Re}\,\beta _k|\ge |\beta _k|/\sqrt{2}\) the assertion follows. For real \(\beta _k\) the factor \(\sqrt{2}\) may be discarded.

  (ii)

    We apply the above to the \(\lambda _i(x,\widehat{\pi })\) and \(\sigma _j(x,\widehat{\pi })\), \(1\le i \le n-1\), obtaining

    $$\begin{aligned} \sigma _1\le \sqrt{2}|\textrm{Re}\,\lambda _{n-1}|\ \dfrac{\sigma _1\sigma _{n-2}}{\sigma _{n-1}}. \end{aligned}$$

    By Lemma 3, we have (with arguments \(x\in \widetilde{Y}\cap K\), \(\widehat{\pi }\) and \(\rho \) suppressed)

    $$\begin{aligned} \left| \dfrac{\sigma _1\sigma _{n-1}}{\widehat{\sigma }_n}\right| =\left| \dfrac{\lambda _1+\cdots +\lambda _{n-1}}{\widehat{\lambda }_n}\right| =\left| \dfrac{\sigma _1}{\widehat{\lambda }_n}\right| \le \sqrt{2}\left| \dfrac{\textrm{Re}\,\lambda _{n-1}}{\widehat{\lambda }_n}\right| \cdot \left| \dfrac{\sigma _1\sigma _{n-2}}{\sigma _{n-1}}\right| , \end{aligned}$$

    and in turn

    $$\begin{aligned} \left| \dfrac{\widehat{\lambda }_n(x,\widehat{\pi },\rho ,0)}{\textrm{Re}\,\lambda _{n-1}(x,\widehat{\pi })}\right| \le \sqrt{2} \,\dfrac{\widehat{\sigma }_n(x,\widehat{\pi },\rho ,0)\sigma _{n-2}(x,\widehat{\pi })}{\sigma _{n-1}(x,\widehat{\pi })^2}. \end{aligned}$$

    By continuity and compactness the assertion readily follows when \(\varepsilon _\textrm{max}\) is sufficiently small. As in (i) the factor \(\sqrt{2}\) may be discarded for real \(\lambda _{n-1}\).

Remark 2

There are four observations to make:

  • As with the distinguished upper bound, determining \(\mu ^*\) amounts to finding the maximum of a rational function on a compact set.

  • The proofs of Propositions 2 and 3 implicitly impose further restrictions on \(\varepsilon _\textrm{max}\).

  • Proposition 3 holds in particular in settings when all eigenvalues are “essentially real,” meaning small \({|{\hbox {Im}} \lambda |}/{ |{\hbox {Re}} \lambda |}\). This is frequently the case for chemical networks and reaction mechanisms.

  • One can obviously derive analogous, but weaker estimates, whenever the ratios \({ |{\hbox {Im}} \lambda |}/{|{\hbox {Re}} \lambda |}\) are bounded above by some constant. Likewise, the estimates underlying part (i) of the proof could be sharpened.

3.3 Two-Dimensional Systems

We turn to systems of dimension two, where a TFPV necessarily refers to a critical manifold of dimension \(s=1\). We keep the notation and conventions from Sect. 2.4. Rather than specializing the asymptotic results from Propositions 2 and 3, we will retrace their derivation and obtain slightly sharper estimates.

First and foremost, the TFPV conditions imply that \(\sigma _1\) must be bounded above and below by positive constants. The accuracy of the reduction is reflected in the ratio of the eigenvalues \(\lambda _1,\,\lambda _2\) of \(D_1h(x,\widehat{\pi }+\varepsilon \rho )\) with x in the critical manifold, and \(\lambda _2=0\) at \(\widehat{\pi }\). Then

$$\begin{aligned} \sigma _1=-(\lambda _1+\lambda _2),\quad \sigma _2=\lambda _1\lambda _2 \end{aligned}$$

and moreover \( \lambda _2=\varepsilon \widehat{\lambda }_2\) and \( \sigma _2=\varepsilon \widehat{\sigma }_2\). For \(n=2\) the familiar identity

$$\begin{aligned} \frac{\lambda _2}{\lambda _1} + \frac{\lambda _1}{\lambda _2}=\frac{\lambda _1^2+\lambda _2^2}{\lambda _1\lambda _2}=\frac{\sigma _1^2-2\sigma _2}{\sigma _2}=\frac{\sigma _1^2}{\sigma _2}-2 \end{aligned}$$
(26)

for \(\lambda _1\not =0,\,\lambda _2\not =0\) yields sharper estimates than Proposition 2. Similar estimates were also used in Eilertsen et al. (2022).

Lemma 4

  (a)

    For all \(M>1,\,\widetilde{M}>2,\, M^*>3\) the implications

    $$\begin{aligned} \begin{array}{rcl} |\lambda _1/\lambda _2|&{}>&{}M \Rightarrow |\sigma _1^2/\sigma _2|>M+2;\\ |\sigma _1^2/\sigma _2|&{}\le &{}\widetilde{M} \Rightarrow |\lambda _1/\lambda _2|\le \widetilde{M}-2;\\ |\sigma _1^2/\sigma _2|&{}>&{}M^* \Rightarrow |\lambda _1/\lambda _2|>M^*-3, \end{array} \end{aligned}$$

    hold whenever \(|\lambda _2/\lambda _1|<1\).

  (b)

    In the TFPV case,

    $$\begin{aligned} \frac{1}{\varepsilon }\cdot \frac{\sigma _1^2}{\widehat{\sigma }_2}=2+\varepsilon \frac{\widehat{\lambda }_2}{\lambda _1} + \frac{1}{\varepsilon }\frac{\lambda _1}{\widehat{\lambda }_2} \end{aligned}$$

    and with \(\varepsilon \rightarrow 0\)

    $$\begin{aligned} \frac{\widehat{\sigma }_2(x,\widehat{\pi },\rho ,0)}{\sigma _1^2(x,\widehat{\pi })}=\frac{\widehat{\lambda }_2(x,\widehat{\pi },\rho ,0)}{\lambda _1(x,\widehat{\pi })}. \end{aligned}$$
  (c)

    For given \(\alpha >0\), suitable choice of \(\varepsilon _\textrm{max}\) yields

    $$\begin{aligned} \frac{1}{(1+\alpha )}\varepsilon _*\le \inf \dfrac{|\lambda _2|}{|\lambda _1|}\le \sup \dfrac{|\lambda _2|}{|\lambda _1|} \le (1+\alpha )\varepsilon ^*, \end{aligned}$$
    (27)

    with infimum and supremum taken over all \((x,\varepsilon )\in K^*\).

Lemma 4 shows that \(\varepsilon ^*\) provides a tight global upper estimate for the eigenvalue ratio (and thus for the timescale ratio) as \(\varepsilon \rightarrow 0\), with x running through \(\widetilde{Y}\cap K\). Moreover, in the analysis of particular systems, one may retrace the arguments leading to the lemma, and determine estimates for \(\varepsilon _\textrm{max}\), e.g., from higher-order Taylor expansions.
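
For a concrete impression of Lemma 4, the following numerical sketch (our own illustration, anticipating the irreversible Michaelis–Menten setting of Sect. 4.2, with assumed rate constants and a small assumed value of \(e_0\)) compares the eigenvalue ratio along the critical manifold \(c=0\) of system (1) with the coefficient ratio \(\sigma _2/\sigma _1^2\) and with the corresponding \(\varepsilon ^*\):

```python
# Sketch (our own illustration, anticipating Sect. 4.2; assumed parameter values):
# along the critical manifold c = 0 of system (1), compare the eigenvalue ratio of
# the Jacobian with sigma_2/sigma_1^2, and with eps^* from Sect. 4.2.
import numpy as np

k1, km1, k2 = 1.0, 1.0, 1.0     # assumed rate constants
e0, s0 = 1.0e-3, 1.0            # small enzyme concentration, e0 = eps*e0star

eig_ratios, coeff_ratios = [], []
for s in np.linspace(0.0, s0, 201):
    J = np.array([[-k1 * e0, k1 * s + km1],
                  [ k1 * e0, -(k1 * s + km1 + k2)]])
    lam = np.sort(np.abs(np.linalg.eigvals(J)))   # |lambda_2| <= |lambda_1|
    eig_ratios.append(lam[0] / lam[1])
    coeff_ratios.append(np.linalg.det(J) / np.trace(J)**2)   # sigma_2 / sigma_1^2

print("sup |lambda_2/lambda_1|     =", max(eig_ratios))
print("sup sigma_2/sigma_1^2       =", max(coeff_ratios))
print("eps^* = e0*k1*k2/(km1+k2)^2 =", e0 * k1 * k2 / (km1 + k2) ** 2)
```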

3.4 Three-Dimensional Systems

We specialize the general results to dimension three. Given the blanket assumptions from Sect. 2.4, we denote by \(\lambda _1,\lambda _2\), and \(\lambda _3=\varepsilon \widehat{\lambda }_3\) the eigenvalues of the linearization. We have

$$\begin{aligned} U(\widehat{\pi },\rho )=\sup _{x\in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_3(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })\cdot \sigma _{2}(x,\widehat{\pi })}\right| , \quad \varepsilon ^*(\widehat{\pi },\rho ,\varepsilon )=\varepsilon U(\widehat{\pi },\rho ), \end{aligned}$$
(28)

and similar expressions for L and \(\varepsilon _*\).

Proposition 4

As for applicability of the parameter \(\mu ^*\), one has:

  (a)
    • The eigenvalues \(\lambda _1\) and \(\lambda _2\) are real if and only if \(\sigma _1^2-4\sigma _2\ge 0\).

    • Given that \(\lambda _1\not \in \mathbb R\) and \(\lambda _2=\overline{\lambda }_1\), one has \(|\textrm{Re}\,\lambda _1|>|\textrm{Im}\,\lambda _1|\) if and only if \(\sigma _1^2-2\sigma _2> 0\).

  (b)

    Assume that one of the conditions in part (a) holds. Then, given \(\alpha >0\), for sufficiently small \(\varepsilon _\textrm{max}\) one has

    $$\begin{aligned} \sup _{(x,\varepsilon )\in K^*}\left| \dfrac{\lambda _3}{\textrm{Re}\,\lambda _{2}}\right| \le \sqrt{2} (1+\alpha )\ \mu ^*, \end{aligned}$$

    resp.

    $$\begin{aligned} \sup _{(x,\varepsilon )\in K^*}\left| \dfrac{\lambda _3}{\lambda _{2}}\right| \le (1+\alpha )\ \mu ^*\text { whenever } \lambda _{2}\in \mathbb R; \end{aligned}$$

    with

    $$\begin{aligned} \mu ^*=\varepsilon \cdot \sup _{x\in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_3(x,\widehat{\pi },\rho ,0)\cdot \sigma _{1}(x,\widehat{\pi })}{\sigma _{2}(x,\widehat{\pi })^2}\right| . \end{aligned}$$

Proof

To determine the nature of the eigenvalues on the critical manifold, we use the identity

$$\begin{aligned} \left( \frac{\lambda _1-\lambda _2}{\lambda _1+\lambda _2}\right) ^2=1-4\frac{\sigma _2}{\sigma _1^2}\text { on }\widetilde{Y}. \end{aligned}$$
(29)

This implies the (of course well known) first statement of part (a). The second statement follows from

$$\begin{aligned} -\left( \dfrac{\textrm{Im}\,\lambda _1}{\textrm{Re}\,\lambda _1}\right) ^2=1-4\frac{\sigma _2}{\sigma _1^2}. \end{aligned}$$

The rest is straightforward with Proposition 3.

Remark 3

We make the following two points:

  • For \(\lambda _1\) and \(\lambda _2\) real and negative, one obtains a lower estimate from

    $$\begin{aligned} \left| \dfrac{2\lambda _2}{\widehat{\lambda }_3}\right| \le \left| \dfrac{\lambda _1+\lambda _2}{\widehat{\lambda }_3}\right| =\left| \dfrac{\sigma _1\sigma _2}{\widehat{\sigma }_3}\right| \Longrightarrow 2\left| \dfrac{\widehat{\sigma }_3}{\sigma _1\sigma _2}\right| \le \left| \dfrac{\widehat{\lambda }_3}{\lambda _2}\right| \text { on } \widetilde{Y}\cap K. \end{aligned}$$
  • If \(\lambda _1\) is not real and \(\lambda _2=\overline{\lambda }_1\), with negative real parts, then the specialization of (20), viz.

    $$\begin{aligned} \frac{\lambda _1+\lambda _2}{\lambda _3} +\left( \frac{\lambda _1}{\lambda _2}+\frac{\lambda _2}{\lambda _1}\right) +\left( \frac{\lambda _3}{\lambda _1}+\frac{\lambda _3}{\lambda _2}\right) =\frac{\sigma _1\sigma _2}{\sigma _3}-3, \end{aligned}$$

    for real \(\lambda _3\), \(|\lambda _3|<|\textrm{Re}\,\lambda _1|\), shows that both the second term and the third term on the left-hand side are bounded below by \(-2\) and above by 2, and we obtain

    $$\begin{aligned} \dfrac{\sigma _1\sigma _2}{\sigma _3}-7\le \dfrac{2\textrm{Re}\,\lambda _1}{\lambda _3}\le \dfrac{\sigma _1\sigma _2}{\sigma _3}+1. \end{aligned}$$

    In particular this yields an asymptotic timescale estimate

    $$\begin{aligned} \left| \dfrac{\widehat{\lambda }_3}{\textrm{Re}\,\lambda _1}\right| \rightarrow 2\left| \dfrac{\widehat{\sigma }_3}{\sigma _1\sigma _2}\right| \text { as }\varepsilon \rightarrow 0. \end{aligned}$$

Remark 4

When all eigenvalues are real then one obtains the ratio of \(\lambda _1\) and \(\lambda _2\), with \(|\lambda _2|\le |\lambda _1|\), from

$$\begin{aligned} \dfrac{\sigma _2}{\sigma _1^2}=\dfrac{\lambda _1\lambda _2+\varepsilon (\cdots )}{(\lambda _1+\lambda _2+\varepsilon (\cdots ))^2}=\dfrac{\lambda _2/\lambda _1}{(1+\lambda _2/\lambda _1)^2}+\varepsilon (\cdots ) \end{aligned}$$

and the arguments leading up to Lemma 4. With

$$\begin{aligned} \kappa _*:=\inf _{x\in \widetilde{Y}\cap K}\left| \dfrac{\sigma _2(x,\widehat{\pi })}{\sigma _{1}(x,\widehat{\pi })^2}\right| ,\quad \kappa ^*:=\sup _{x\in \widetilde{Y}\cap K}\left| \dfrac{\sigma _2(x,\widehat{\pi })}{\sigma _{1}(x,\widehat{\pi })^2}\right| , \end{aligned}$$
(30)

the following hold for every \(\alpha >0\), with sufficiently small \(\varepsilon \):

  • On \(\widetilde{Y}\cap K\) one has

    $$\begin{aligned} \left| \dfrac{\lambda _2}{\lambda _1}\right| \ge \dfrac{\kappa _*}{1+\alpha }. \end{aligned}$$
  • If \(|\lambda _2/\lambda _1|\le \delta \) for all \(x\in \widetilde{Y}\cap K\) then \(\kappa ^*\le \dfrac{\delta }{2\delta +1}\).

Large discrepancy between \(\lambda _1\) and \(\lambda _2\) (in addition to \(\mu ^*\ll 1\)) may indicate a scenario with three timescales (informally speaking): slow, fast and very fast. Cardin and Teixeira (2017) provided a rigorous extension of Fenichel theory to such settings, which puts their analysis on solid ground. Note that large discrepancy between \(\varepsilon ^*\) and \(\mu ^*\) implies large discrepancy between \(\lambda _1\) and \(\lambda _2\), in view of the definitions.
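
The classification in part (a) of Proposition 4 is easy to check numerically; the following sketch (our own illustration, with assumed eigenvalue configurations) builds \(3\times 3\) matrices having one zero eigenvalue and evaluates the two discriminant-type quantities:

```python
# Illustration of the classification in Proposition 4(a): for a 3x3 Jacobian with one
# zero eigenvalue, the nature of the two nonzero eigenvalues is read off from
# sigma_1^2 - 4*sigma_2 and sigma_1^2 - 2*sigma_2. Eigenvalue configurations below
# are assumed, purely illustrative.
import numpy as np

def sigmas(block):
    # sigma_1, sigma_2 of a 3x3 matrix with spectrum {0} union spec(block);
    # they coincide with -trace and det of the 2x2 block.
    return -np.trace(block), np.linalg.det(block)

# two real negative fast eigenvalues: -1, -4
s1, s2 = sigmas(np.diag([-1.0, -4.0]))
print(s1**2 - 4*s2 >= 0)                      # True: lambda_1, lambda_2 real

# complex pair -1 +/- 0.5i with |Re| > |Im| ("essentially real")
s1, s2 = sigmas(np.array([[-1.0, 0.5], [-0.5, -1.0]]))
print(s1**2 - 4*s2 >= 0, s1**2 - 2*s2 > 0)    # False, True

# complex pair -0.5 +/- 2i with |Re| < |Im|
s1, s2 = sigmas(np.array([[-0.5, 2.0], [-2.0, -0.5]]))
print(s1**2 - 2*s2 > 0)                       # False
```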

4 Michaelis–Menten Reaction Mechanism Revisited

The reader may wonder why we include a rather long section on the most familiar reaction in biochemistry. The basic motivation is that some widely held beliefs on its QSS variants are problematic [see Eilertsen et al. (2022) for a recent study]. Beyond this, the timescale ratio approach actually yields new results for the reversible Michaelis–Menten (MM) system, as well as for MM with slow product formation.

4.1 The Reversible Reaction with Low Enzyme Concentration

The reversible MM reaction mechanism with low enzyme concentration corresponds to the system

$$\begin{aligned} \begin{array}{rclclcl} \dot{s}&{}=&{} -k_1e_0s&{}+&{}(k_1s+k_{-1})c &{} &{} \\ \dot{c}&{}=&{} k_1e_0s&{}-&{}(k_1s+k_{-1}+k_2)c &{} +&{}k_{-2}(e_0-c)(s_0-s-c) \\ \end{array} \end{aligned}$$
(31)

with standard initial conditions \(s(0)=s_0\), \(c(0)=0\). The earliest discussion of (31) dates back to Miller and Alberty (1958), but the reversible reaction has garnered relatively little attention compared to the irreversible one.

The parameter space \(\Pi =\mathbb R_{\ge 0}^6\) has elements \((e_0,s_0,k_1,k_{-1},k_2,k_{-2})^\textrm{tr}\), and we set \(x=(s,\,c)^\textrm{tr}\). As is well known, setting \(e_0=0\) and all other parameters \(>0\) defines a TFPV, with the critical manifold \(\widetilde{Y}\) given by \(c=0\). For the reduced equation, one finds (see, e.g., Noethen and Walcher 2011)

$$\begin{aligned} \dot{s}=-e_0\cdot \frac{s(k_1k_2+k_{-1}k_{-2})-k_{-1}k_{-2}s_0}{k_1s+k_{-1}+k_2+k_{-2}(s_0-s)}. \end{aligned}$$

By the first blanket assumption in Sect. 2.4, we restrict \((s_0,k_1,k_{-1},k_2,k_{-2})^\textrm{tr}\) to a compact subset of the open positive orthant. With fixed \(e_0^*>0\) (with dimension concentration), we let \(\rho =(e_0^*,0,\ldots ,0)^\textrm{tr}\). We will work with both \(e_0\) and \(\varepsilon e_0^*\). Rather than obtaining \(\varepsilon _*\) and \(\varepsilon ^*\) directly from Lemma 4, we retrace their derivation and get error estimates in the process. The coefficients of the characteristic polynomial with \(x\in \widetilde{Y}\) are

$$\begin{aligned} \begin{array}{rcl} \sigma _1(x,\widehat{\pi }+\varepsilon \rho )&{}=&{} k_1e_0+k_1s+k_{-1}+k_2+k_{-2}(e_0+s_0-s);\\ \sigma _2(x,\widehat{\pi }+\varepsilon \rho )&{}=&{} e_0\left( k_1k_{-2}(e_0+s_0)+k_1k_2+k_{-1}k_{-2}\right) . \end{array} \end{aligned}$$

The set K (compatible with the standard initial conditions), defined by \(0\le s\le s_0\) and \(0\le c\le e_0^*\), is compact and positively invariant.

We only discuss the case \(k_1\ge k_{-2}\). The other case amounts to reversing the roles of s and p. Note that \(\sigma _2\) is independent of s. The minimum of \(\sigma _1(x,\widehat{\pi }+\varepsilon \rho )\) on \(\widetilde{Y}\cap K\) equals

$$\begin{aligned} k_1e_0+k_{-1}+k_2+k_{-2}(e_0+s_0), \end{aligned}$$

and the maximum is

$$\begin{aligned} k_1(e_0+s_0)+k_{-1}+k_2+k_{-2}e_0. \end{aligned}$$

In particular, the minimum of \(\sigma _1(x,\widehat{\pi })\) on \(\widetilde{Y}\cap K\) equals

$$\begin{aligned} k_{-1}+k_2+k_{-2}s_0. \end{aligned}$$

Moreover, we have

$$\begin{aligned} \widehat{\sigma }_2(x,\widehat{\pi },0)= k_1k_{-2}s_0+k_1k_2+k_{-1}k_{-2}, \end{aligned}$$

a positive constant.

By Lemma 4 and its derivation, we find

$$\begin{aligned} \dfrac{ e_0\left( k_1k_{-2}(e_0+s_0)+k_1k_2+k_{-1}k_{-2}\right) }{\left( k_1(e_0+s_0)+k_{-1}+k_2+k_{-2}e_0\right) ^2}\le \dfrac{\sigma _2}{\sigma _1^2}\le \dfrac{ e_0\left( k_1k_{-2}(e_0+s_0)+k_1k_2+k_{-1}k_{-2}\right) }{\left( k_1e_0+k_{-1}+k_2+k_{-2}(e_0+s_0)\right) ^2}, \end{aligned}$$

valid for all \(\varepsilon >0\). Neglecting higher-order terms in \(\varepsilon \) yields

$$\begin{aligned} \varepsilon _{*}= \dfrac{ e_0\left( k_1k_{-2}s_0+k_1k_2+k_{-1}k_{-2}\right) }{\left( k_{-1}+k_2+k_1s_0\right) ^2}; \quad \varepsilon ^{*}= \dfrac{ e_0\left( k_1k_{-2}s_0+k_1k_2+k_{-1}k_{-2}\right) }{\left( k_{-1}+k_2+k_{-2}s_0\right) ^2}. \end{aligned}$$

Therefore, it seems appropriate to define the distinguished local parameter for the reversible MM system as

$$\begin{aligned} \varepsilon _{MMR}:=\varepsilon ^{*}= \dfrac{ e_0\left( k_1k_{-2}s_0+k_1k_2+k_{-1}k_{-2}\right) }{\left( k_{-1}+k_2+k_{-2}s_0\right) ^2}. \end{aligned}$$
(32)

It appears that this particular parameter has not been introduced so far, nor has any close relative. Indeed, there seem to exist no parameters in the literature that were specifically derived for the reversible reaction. In their discussion of the reversible system, Seshadri and Fritzsch (1980) worked with the parameter \(\varepsilon _{RS}\) that Reich and Selkov had designed for the irreversible system; see Eq. (5).
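To illustrate how the distinguished parameters relate to the actual eigenvalue ratio on the critical manifold, the following minimal Python sketch evaluates \(\sigma _2/\sigma _1^2\) and \(|\lambda _2/\lambda _1|\) along \(c=0\) and compares their extrema with \(\varepsilon _*\) and \(\varepsilon _{MMR}=\varepsilon ^*\) from (32). The rate constants are hypothetical values in arbitrary units (chosen with \(k_1\ge k_{-2}\) and small \(e_0\)), not taken from any particular enzyme.

```python
import numpy as np

# Hypothetical rate constants (arbitrary units), with k1 >= k_{-2} and small e0
e0, s0 = 0.05, 2.0
k1, km1, k2, km2 = 1.0, 1.0, 1.0, 0.5      # km1 = k_{-1}, km2 = k_{-2}

s = np.linspace(0.0, s0, 2001)

# Coefficients of the characteristic polynomial on the critical manifold c = 0
sigma1 = k1*e0 + k1*s + km1 + k2 + km2*(e0 + s0 - s)
sigma2 = e0*(k1*km2*(e0 + s0) + k1*k2 + km1*km2)    # independent of s

ratio = sigma2/sigma1**2

# Eigenvalue ratio |lambda_2/lambda_1| from lambda^2 + sigma1*lambda + sigma2 = 0
disc = np.sqrt(sigma1**2 - 4.0*sigma2)
lam_fast = 0.5*(-sigma1 - disc)
lam_slow = 0.5*(-sigma1 + disc)
eig_ratio = np.abs(lam_slow/lam_fast)

# Asymptotic distinguished parameters (lowest order in e0), cf. (32)
num = e0*(k1*km2*s0 + k1*k2 + km1*km2)
eps_lower = num/(km1 + k2 + k1*s0)**2               # eps_*
eps_upper = num/(km1 + k2 + km2*s0)**2              # eps^* = eps_MMR

print("sigma2/sigma1^2 on [0, s0]: min = %.3e, max = %.3e" % (ratio.min(), ratio.max()))
print("|lambda_2/lambda_1|       : min = %.3e, max = %.3e" % (eig_ratio.min(), eig_ratio.max()))
print("asymptotic bounds         : eps_* = %.3e, eps_MMR = %.3e" % (eps_lower, eps_upper))
```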

4.2 The Irreversible Reaction with Low Enzyme Concentration

We specialize to the irreversible case, and thus we have the differential equation (1) with \(e_0=\varepsilon e_0^*\). The QSS manifold of this system is defined by \(c=g(s):=e_0s/(K_M+s)\).

4.2.1 Distinguished Small Parameters

The parameters from the reversible scenario simplify to

$$\begin{aligned} \varepsilon _*= \dfrac{ e_0k_1k_2}{\left( k_1s_0+k_{-1}+k_2\right) ^2}; \quad \varepsilon ^{*}=\varepsilon _{MM}:=\dfrac{ e_0k_1k_2}{\left( k_{-1}+k_2\right) ^2}, \end{aligned}$$

with

$$\begin{aligned} \min \sigma _1=k_{-1}+k_2;\quad \widehat{\sigma }_2=k_1k_2. \end{aligned}$$

Note that the TFPV and nondegeneracy conditions, together with the compactness condition in parameter space, require that \(k_2\) is bounded below by some positive constant.

As in the previous section, we find that \(\varepsilon _{MM}\) is a sharp upper estimate for the eigenvalue ratio. In fact,

$$\begin{aligned} \dfrac{\sigma _2}{\sigma _1^2}\le \dfrac{ k_2k_1e_0}{\left( k_1e_0+k_{-1}+k_2\right) ^2}\le \varepsilon _{MM} \end{aligned}$$

throughout.

As noted in the Introduction, various small parameters have been proposed for the irreversible MM system. Comparing these, we note

$$\begin{aligned} \varepsilon _{MM}=\dfrac{ e_0k_1}{k_{-1}+k_2}\cdot \dfrac{k_2}{k_{-1}+k_2}\le \dfrac{ e_0k_1}{k_{-1}+k_2}=\varepsilon _{RS}, \end{aligned}$$

with the Reich-Selkov parameter. Whenever \(k_{-1}\) and \(k_2\) have the same order of magnitude (in any case \(k_2\) must be bounded away from 0 by nondegeneracy), the disparity between \(\varepsilon _{MM}\) and \(\varepsilon _{RS}\) may be seen as inessential.

The parameters \(\varepsilon _{MM}\) and \(\varepsilon _{RS}\) differ markedly from the most familiar small parameters, viz. \(\varepsilon _{BH}\) [see (4) as used by Heineken et al. (1967)], and \(\varepsilon _{SSl}\) [see (6) as introduced in Segel and Slemrod (1989)], which both involve the initial substrate concentration. As shown in Noethen and Walcher (2007), smallness of the Segel–Slemrod parameter is necessary and sufficient to ensure negligible loss of substrate in the initial phase. But, as noted in Patsatzis and Goussis (2019) and in Eilertsen et al. (2022), large initial substrate concentration—while ensuring a fast approach to the QSS manifold—is not sufficient to guarantee a good QSS approximation over the whole course of the reaction. A general argument in favor of \(\varepsilon _*\) and \(\varepsilon _{MM}\) is that they directly measure the local ratio of timescales.
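As a brief numerical illustration of this comparison, consider a hypothetical parameter set (arbitrary units) with a large initial substrate concentration: the parameter \(\varepsilon _{BH}=e_0/s_0\) is then very small, while \(\varepsilon _{MM}\) and \(\varepsilon _{RS}\) remain of order one. The values below are illustrative only.

```python
# Hypothetical rate constants (arbitrary units) with large initial substrate concentration
e0, s0, k1, km1, k2 = 1.0, 1.0e3, 1.0, 1.0, 1.0    # km1 = k_{-1}

eps_MM = e0*k1*k2/(km1 + k2)**2     # distinguished parameter of this subsection
eps_RS = e0*k1/(km1 + k2)           # Reich-Selkov parameter
eps_BH = e0/s0                      # e0/s0, as used by Heineken et al.

print("eps_MM = %.3g, eps_RS = %.3g, eps_BH = %.3g" % (eps_MM, eps_RS, eps_BH))
# yields eps_MM = 0.25, eps_RS = 0.5, eps_BH = 0.001
```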

4.2.2 Further Observations

We briefly discuss what can be inferred from

$$\begin{aligned} \varepsilon _{MM}=\dfrac{k_1k_2e_0}{(k_{-1}+k_2)^2}\rightarrow 0 \end{aligned}$$

alone, with no further restriction on the limiting process.

In the simplest imaginable scenario, letting a parameter tend to zero might automatically imply validity of some QSS approximation, but this is not the case here. The TFPV conditions on \(\sigma _1\) imply that \(k_{-1}\) is bounded above, and we obtain three cases: In addition to the case \(e_0\rightarrow 0\), we have the case \(k_1\rightarrow 0\), yielding a singular perturbation reduction with the same critical manifold but a linear reduced equation. Furthermore, we have the case \(k_2\rightarrow 0\), which leads to a singular perturbation scenario with a different critical manifold and a different reduction (see the next subsection).

This observation supports a statement from the Introduction. A given small parameter by itself will in general not determine a unique singular perturbation scenario, and transferring the reduction procedure from one scenario to a different one without reflection may yield incorrect results. It is necessary to consider the complete setting, including TFPV, critical manifold and small parameter. Moreover, one needs to stipulate carefully how limits are taken. For instance, letting \(s_0\rightarrow \infty \), while ensuring \(\varepsilon _{SSl}\rightarrow 0\), will fail to ensure convergence. Likewise, letting, e.g., \(k_{-1}\rightarrow \infty \) in the Reich-Selkov parameter does not imply convergence.

For the irreversible reaction with substrate inflow at rate \(k_0\), one obtains the same expressions for \(\sigma _2/\sigma _1^2\) at the TFPV with \(k_0=0\) and \(e_0=0\) (all other parameters \(>0\)), the critical manifold being given by \(c=0\). Before obtaining \(\varepsilon _*,\,\varepsilon ^*\) one needs to choose appropriate initial conditions; we take \(s(0)=c(0)=0\) here. Solutions are not necessarily confined to compact sets, so one may not be able to choose the set K from Sect. 2.4 to be positively invariant. In the case \(s(0)=c(0)=0\) the computation of the distinguished upper bound \(\varepsilon ^*\) works as in the case with no influx; the supremum exists and is equal to \(\varepsilon _{MM}\). However, one gets \(\varepsilon _*\rightarrow 0\) with increasing s when there exists no positive stationary point (all solutions are unbounded in positive time), hence the lower estimate provides no information. If there exists a finite positive stationary point \(\widetilde{s}\) of the reduced equation, then one obtains \(\varepsilon _*>0\) by replacing \(s_0\) by \(\widetilde{s}\) in the lower estimate in 4.2.1. In this case, a compact positively invariant set exists with \(s\le \widetilde{s}\), as was shown in Eilertsen et al. (2021b).

4.3 The Irreversible Reaction with Slow Product Formation

We turn to the scenario with slow product formation, the other reactions being fast.Footnote 11 Here \(k_2=0\), with all other parameters \(>0\), defines a TFPV with critical manifold \(\widetilde{Y}\) given by

$$\begin{aligned} c=\frac{k_1e_0s}{k_1s+k_{-1}}. \end{aligned}$$

Although setting \(k_2=0\) appears counterintuitive for an enzyme-catalyzed reaction, there is a family of enzymes, known as pseudoenzymes, that have either zero catalytic activity (\(k_2=0\)) or vestigial catalytic activity (\(k_2\approx 0\)) due to the lack of catalytic amino acids or motifs (Eyers and Murphy 2016). These enzymes exist in all kingdoms of life and are also referred to as “zombie” enzymes, dead enzymes, or prozymes. Pseudoenzymes play various roles in signaling networks, such as serving as dynamic scaffolds, modulators of enzymes, or competitors in canonical signaling pathways (Murphy et al. 2017). Since one frequently finds incorrect reductions in the literature, it seems appropriate to recall correct ones. Heineken et al. (1967) provided a correct reduction (see (34) below). In Goeke and Walcher (2013), a version for the substrate concentration is given:

$$\begin{aligned} \dot{s}=-\dfrac{k_2k_1e_0s(k_1s+k_{-1})}{k_1k_{-1}e_0+(k_1s+k_{-1})^2}= -\dfrac{k_2e_0s(s+K_S)}{K_Se_0+(s+K_S)^2}; \quad \quad K_S:=k_{-1}/k_1. \end{aligned}$$

With known \(e_0\), this equationFootnote 12 in principle allows one to identify the limiting rate \(k_2e_0\) and the equilibrium constant \(K_S\). It should be noted that one also needs an appropriate initial time and initial value for the reduction. Since one cannot assume negligible substrate loss in the transient phase, an appropriate fitting would require completion of Step 2 of the program outlined in the Introduction.

4.3.1 Distinguished Small Parameters

Intersecting \(\widetilde{Y}\) with the positively invariant compact set K, defined by \(0\le s\le s_0\) and \(0\le c\le e_0\), amounts to restricting s to \(0\le s\le s_0\). The elements of the parameter space \(\Pi =\mathbb R_{\ge 0}^5\) have the form \((e_0,s_0,k_1,k_{-1},k_2)^\textrm{tr}\), and a natural choice of ray direction is \(\rho =(0,0,0,0,k_2^*)^\textrm{tr}\), with \(k_2=\varepsilon k_2^*\).

The coefficients of the characteristic polynomial on \(\widetilde{Y}\) are

$$\begin{aligned} \begin{array}{rcl} \sigma _1&{}=&{} \dfrac{k_{-1}k_1e_0}{k_1s+k_{-1}}+k_1s+k_{-1}+k_2,\\ \sigma _2&{}=&{} e_0k_1k_2\cdot \dfrac{k_{-1}}{k_1s+k_{-1}}. \end{array} \end{aligned}$$

To determine the distinguished small parameters, we consider the following steps:

  • We first evaluate the nondegeneracy conditions for the coefficients of the characteristic polynomial, from TFPV requirements and compactness. The minimum of \(\sigma _1(x,\widehat{\pi })\) on \(\widetilde{Y}\cap K\) is equal to \(k_{-1}+k_1e_0\) when \(k_1e_0\le k_{-1}\), and equal to \(2\sqrt{k_{-1}k_1e_0}\) otherwise. This minimum must be bounded below by some positive constant. Combining this observation with the boundedness of the maximum of

    $$\begin{aligned} \widehat{\sigma }_2=\dfrac{k_2^*k_{-1}k_1e_0}{k_1s+k_{-1}} \quad \text {on} \quad [0,\,s_0], \end{aligned}$$

    which is equal to \(k_2^*k_1e_0\), one sees that \(k_1e_0\) and \(k_{-1}\) must be bounded above and below by positive constants.

  • Turning to small parameters, in the asymptotic limit one obtains

    $$\begin{aligned} \varepsilon ^*=k_2\sup \dfrac{av}{(a+v^2)^2} \quad \text {with} \quad a=k_{-1}k_1e_0,\,v=k_1s+k_{-1}, \end{aligned}$$

    where the supremum is taken over \(k_{-1}\le v\le k_{-1}+k_1s_0\). By elementary calculus, the global maximum of this function on the unbounded interval \(v\ge 0\) is attained at \(v=\sqrt{k_{-1}k_1e_0/3}\); in any case, this yields the estimate

    $$\begin{aligned} \varepsilon ^{*}\le \dfrac{3\sqrt{3}}{16}\dfrac{k_2}{\sqrt{k_{-1}\cdot k_1e_0}}=:\dfrac{3\sqrt{3}}{32}\cdot \varepsilon _{PE},\quad \text {with }\varepsilon _{PE}:=\dfrac{2k_2}{\sqrt{k_{-1}\cdot k_1e_0}}. \end{aligned}$$

    Note that \(\varepsilon _{PE}\) always yields an upper estimate for the eigenvalue ratio near the critical manifold. One could thus discard the factor \(1+\alpha \) in Lemma 4.

  • Depending on the given parameters, in some cases one may obtain sharper estimates for \(\varepsilon ^*\) from the endpoints of the interval \([0,s_0]\). In any case, to determine \(\varepsilon _*\) one needs to consider the boundary points of this interval. (A numerical check of these estimates, for hypothetical parameter values, is sketched after this list.)

  • The expression for \(\varepsilon _{PE}\) may look strange, but \(\tfrac{1}{2}\varepsilon _{PE}=\sqrt{k_2/k_{-1}}\cdot \sqrt{k_2/(k_1e_0)}\) is the geometric mean of the two reaction rate ratios \(k_2/k_{-1}\) and \(k_2/(k_1e_0)\), and thus admits a biochemical interpretation. There is little work in the literature on small parameters for the case of slow product formation. Heineken et al. (1967) suggested \( {k_2}/{(k_1s_0)}\), while Patsatzis and Goussis introduced a parameter depending on s and c along a trajectory; taking the maximum over all \(s,\,c\) yields \({k_2}/{k_{-1}}\). The latter represents a commonly accepted “small parameter” for this scenario; see Keener and Sneyd (2009, Section 1.4.1). In the limiting case \(k_2\rightarrow 0\), one also has \(\varepsilon _{MM}\rightarrow 0\), but one should not conclude that the standard QSS approximation is valid here. Recall that, in the low enzyme setting, \(k_2\) needs to be bounded away from zero due to nondegeneracy requirements.
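A minimal Python sketch of the maximization above, with hypothetical parameter values in arbitrary units, is the following; it evaluates the supremum of \(k_2\,av/(a+v^2)^2\) over the admissible range of v on a grid and compares it with the calculus bound and with \(\varepsilon _{PE}\).

```python
import numpy as np

# Hypothetical parameters (arbitrary units); k2 plays the role of the small parameter
k1, km1, k2, e0, s0 = 1.0, 0.2, 1e-3, 1.0, 10.0     # km1 = k_{-1}

a = km1*k1*e0
v = np.linspace(km1, km1 + k1*s0, 200001)            # v = k1*s + k_{-1} for 0 <= s <= s0
eps_star = k2*np.max(a*v/(a + v**2)**2)              # numerical value of eps^*

v_opt = np.sqrt(a/3.0)                               # location of the global maximum on v >= 0
calculus_bound = (3.0*np.sqrt(3.0)/16.0)*k2/np.sqrt(a)
eps_PE = 2.0*k2/np.sqrt(km1*k1*e0)

print("eps^* (grid maximum)      : %.4e" % eps_star)
print("calculus bound            : %.4e" % calculus_bound)
print("eps_PE                    : %.4e" % eps_PE)
print("interior maximum in range?", km1 <= v_opt <= km1 + k1*s0)
```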

4.3.2 Approach to the Slow Manifold

For the MM reaction mechanism with slow product formation, we specialize the arguments in Appendix 9.1.1 to determine \(\varepsilon _L\), and show that \(\varepsilon _{PE}\) appears naturally in this estimate.Footnote 13 We use the results (and refer to the notation) of Sect. 9.1.

We rewrite the system in Tikhonov standard form. Since \(\frac{d}{dt}(s+c)=-k_2c\), the quantity \(s+c\) is a first integral of the fast system in the limit \(k_2=0\). With \(x=s+c\), \(y=s\) (so that \(c=x-y\) and \(x\ge y\ge 0\)), and \(k_2=\varepsilon k_2^*\), we obtain

$$\begin{aligned} \begin{array}{rcl} \dot{x} &{} =&{}-k_2(x-y) \\ \dot{y} &{} =&{} -k_1e_0y+(k_1{y}+k_{-1})(x-y)\\ &{}=&{} -k_1(y-h_-(x))\cdot (y-h_+(x)) \end{array} \end{aligned}$$
(33)

with

$$\begin{aligned} h_\pm (x):=\frac{1}{2}\left( -(K_S+e_0-x)\pm q(x)\right) ;\quad q(x):=\sqrt{(K_S+e_0-x)^2+4K_Sx}. \end{aligned}$$

We focus on the particular initial conditions with zero complex, thus

$$\begin{aligned} x(0)=y(0)=s_0. \end{aligned}$$

The QSS variety \(\widetilde{Y}\) is defined by \(y=h_+(x)\), and the reduced equation reads

$$\begin{aligned} \dot{x}=-\dfrac{k_2}{2}\left( (K_S+e_0+x)-\sqrt{(K_S+e_0-x)^2+4K_Sx}\right) . \end{aligned}$$
(34)

We use the notation and apply the general procedure from Sect. 9.1, with

$$\begin{aligned} A=-k_1(y-h_-(x))=-k_1q(x) \quad \text {on}\quad \widetilde{Y}, \end{aligned}$$

and \(g(x)=h_+(x)\). We will use some properties of q in the following. The calculation of \(q'(x)\) leads to

$$\begin{aligned} q'(x)=\dfrac{K_S+x-e_0}{\sqrt{(K_S+e_0-x)^2+4K_Sx}}, \end{aligned}$$

hence \(|q'(x)|\le 1\) for all \(x\ge 0\). Moreover, the sign of \(q'\) changes from − to \(+\) at \(x=e_0-K_S\) when \(e_0-K_S\ge 0\), and is otherwise positive for all \(x\ge 0\). Thus, the minimum of q is attained at 0, with value \(K_S+e_0\), when \(e_0<K_S\), and is attained at \(e_0-K_S\), with value \(2\sqrt{K_Se_0}\), when \(e_0\ge K_S\). By the arithmetic–geometric mean inequality, we thus have

$$\begin{aligned} q(x)\ge 2\sqrt{K_Se_0}\quad \text { for all } x\ge 0. \end{aligned}$$

This shows

$$\begin{aligned} A\le -2k_1\sqrt{K_Se_0}=-2\sqrt{k_1e_0k_{-1}}, \end{aligned}$$

and we arrive at

$$\begin{aligned} \gamma =\sqrt{k_1e_0k_{-1}}. \end{aligned}$$

According to Sect. 9.1, \(\gamma ^{-1}\) is an appropriate timescale for the approach to the slow manifold.

To determine \(\kappa \), we have \(g(x)=h_+(x)=\frac{1}{2}(x-K_S-e_0+q(x))\), thus \(|g'(x)|\le 1\), and

$$\begin{aligned} |f_1(x,y)|=k_2(x-y)\le k_2e_0,\quad \text {since }x-y=c\le e_0, \end{aligned}$$

hence we may set \(\kappa =k_2e_0\).

Altogether, we obtain from the Lyapunov function the (dimensional) parameter

$$\begin{aligned} \varepsilon _L=\dfrac{2\kappa }{\gamma }=e_0\cdot \varepsilon _{PE}. \end{aligned}$$
(35)

To obtain a non-dimensional small parameter, normalization by \(e_0\) seems to be the natural choice here, which yields

$$\begin{aligned} \widehat{\varepsilon }_L=\varepsilon _{PE}. \end{aligned}$$
(36)

In this particular setting, the local timescale parameter completely characterizes the approach of the solution to the slow manifold.
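As a numerical check of these estimates, the following sketch (Python with SciPy; hypothetical parameter values in arbitrary units, with \(k_2\) small) integrates the system (33) and the reduced equation (34) from \(x(0)=y(0)=s_0\) and reports the distance of the solution to the QSS variety after the transient, to be compared with the bound \(e_0\cdot \varepsilon _{PE}\) from (35). The cut-off of the transient at a few multiples of \(\gamma ^{-1}\) is an ad hoc choice.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Hypothetical parameters (arbitrary units); k2 is taken small
k1, km1, k2 = 1.0, 1.0, 1e-3        # km1 = k_{-1}
e0, s0 = 1.0, 2.0
KS = km1/k1

def q(x):
    return np.sqrt((KS + e0 - x)**2 + 4.0*KS*x)

def h_plus(x):                      # QSS variety y = h_+(x)
    return 0.5*(-(KS + e0 - x) + q(x))

def full(t, z):                     # system (33) in the variables (x, y) = (s + c, s)
    x, y = z
    return [-k2*(x - y), -k1*e0*y + (k1*y + km1)*(x - y)]

def reduced(t, x):                  # reduced equation (34)
    return [-0.5*k2*((KS + e0 + x[0]) - q(x[0]))]

gamma = np.sqrt(k1*e0*km1)          # 1/gamma: timescale of the approach to the slow manifold
eps_PE = 2.0*k2/np.sqrt(km1*k1*e0)

T = 5.0/k2                          # long enough to cover the slow dynamics
t_eval = np.linspace(0.0, T, 4000)
sol_full = solve_ivp(full, (0.0, T), [s0, s0], t_eval=t_eval, rtol=1e-8, atol=1e-10)
sol_red = solve_ivp(reduced, (0.0, T), [s0], t_eval=t_eval, rtol=1e-8, atol=1e-10)

x, y = sol_full.y
dist = np.abs(y - h_plus(x))
late = sol_full.t > 10.0/gamma      # crude cut-off after several fast timescales

print("max |y - h_+(x)| after transient: %.3e" % dist[late].max())
print("Lyapunov-based bound e0*eps_PE  : %.3e" % (e0*eps_PE))
print("max |x_full - x_reduced|        : %.3e" % np.abs(x - sol_red.y[0]).max())
```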

4.3.3 Estimates for Long Times

We will not attempt to estimate a critical time for the onset of the slow dynamics, and without this we cannot determine approximation errors for solutions of the reduced equation (as outlined in Sect. 9.1.3). In this respect, the discussion of the MM reaction mechanism with slow product formation remains incomplete. But the following observation provides a relevant condition for the long-term behavior. Since \(|y-g(x)|\) approaches (at most) \(e_0\cdot \varepsilon _{PE}\), the solution will enter the domain with \(|y-g(x)|\le 2 e_0\cdot \varepsilon _{PE}\) after some short transitory phase.Footnote 14 In this domain, we obtain the reduced equation with error term:

$$\begin{aligned} \begin{array}{rcl} \dot{x}&{}=&{}-k_2(x-g(x))+k_2(y-g(x))\\ &{}\le &{} -\dfrac{k_2}{2}\left( (K_S+e_0+x)-\sqrt{(K_S+e_0-x)^2+4K_Sx}\right) \\ &{}&{}+k_2\cdot \dfrac{2k_2e_0}{\sqrt{k_1e_0\cdot k_{-1}}} =:U(x).\\ \end{array} \end{aligned}$$
(37)

By a differential inequality argument, the solution of \(\dot{x}=U(x)\), with positive initial value, is an upper bound for the first entry of the solution of (33), given appropriate initial values near the QSS variety. Moreover the solution of the reduced equation (34) with the same initial value remains positive. For \(t\rightarrow \infty \), the absolute value of the difference of these solutions converges to the stationary point of \(\dot{x}=U(x)\), which therefore indicates the discrepancy. We determine the stationary point, neglecting terms of order \(>1\) in \(k_2\):

$$\begin{aligned} \begin{array}{rcl} \left( (K_S+e_0+x)-4 \dfrac{k_2e_0}{\sqrt{k_1e_0\cdot k_{-1}}}\right) ^2 &{} =&{}(K_S+e_0-x)^2+4K_Sx \\ \Rightarrow e_0 x &{} =&{}\dfrac{2k_2e_0}{\sqrt{k_1e_0\cdot k_{-1}}}\cdot (K_S+e_0+x)+\cdots \\ \Rightarrow \dfrac{x}{e_0} &{} =&{}2\dfrac{k_2}{\sqrt{k_1e_0\cdot k_{-1}}}\cdot \dfrac{k_1e_0+k_{-1}}{k_1e_0}+\cdots .\\ \end{array} \end{aligned}$$

Thus, we obtain the parameter

$$\begin{aligned} \varepsilon _{\infty }=\dfrac{k_1e_0+k_{-1}}{k_1e_0}\cdot \dfrac{2k_2}{\sqrt{k_1e_0\cdot k_{-1}}}=\dfrac{k_1e_0+k_{-1}}{k_1e_0}\cdot \varepsilon _{PE}, \end{aligned}$$
(38)

which provides an upper bound for the long-term discrepancy between the true solution and its approximation.
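The following small sketch (Python with SciPy; hypothetical parameter values in arbitrary units) locates the positive stationary point of \(\dot{x}=U(x)\) numerically and compares it with the first-order prediction \(e_0\cdot \varepsilon _{\infty }\) obtained above.

```python
import numpy as np
from scipy.optimize import brentq

# Hypothetical parameters (arbitrary units); k2 is taken small
k1, km1, k2, e0 = 1.0, 1.0, 1e-3, 1.0     # km1 = k_{-1}
KS = km1/k1

def U(x):                                  # right-hand side of (37)
    q = np.sqrt((KS + e0 - x)**2 + 4.0*KS*x)
    return -0.5*k2*((KS + e0 + x) - q) + 2.0*k2**2*e0/np.sqrt(k1*e0*km1)

x_star = brentq(U, 1e-12, KS + e0)         # positive stationary point of x' = U(x)

eps_inf = (k1*e0 + km1)/(k1*e0)*2.0*k2/np.sqrt(k1*e0*km1)    # parameter (38)
print("stationary point of U             : %.4e" % x_star)
print("first-order prediction e0*eps_inf : %.4e" % (e0*eps_inf))
```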

4.4 A Degenerate Scenario

To illustrate the limitations of the approach via Proposition 2, consider the irreversible system with TFPV \(k_{-1}=k_2=0\), the other parameters positive, and \(\rho =(0,0,0,k_{-1}^*,k_2^*)^\textrm{tr}\). Here the critical variety is reducible, being the union of the lines \(Y_1\), \(Y_2\) defined by \(e_0-c=0\) resp. \(s=0\), and the TFPV conditions fail at their intersection. We consider the case \(e_0<s_0\), and define \(\widetilde{Y}_1\) by \(c=e_0,\,s>0\). The fast system admits the first integral \(s+c\), so the initial value of the slow system on \(\widetilde{Y}_1\) is close to \((s_0-e_0,e_0)^\textrm{tr}\). Proceeding, one may choose

$$\begin{aligned} K=\left\{ (s,c)^\textrm{tr};\, s+c\le s_0, \,s\ge \widetilde{s}\right\} , \quad 0<\widetilde{s}<s_0-e_0. \end{aligned}$$

Then, \(\widetilde{Y}_1\cap K\) is compact, but not positively invariant, and on this set one has

$$\begin{aligned} \sigma _1= k_1s,\quad \sigma _2=0,\text { and } \widehat{\sigma }_2=0. \end{aligned}$$

Here, the nondegeneracy condition in (13) fails, and we obtain no timescale ratio by way of Lemma 4. A direct computation in a neighborhood of \(\widetilde{Y}_1\) yields

$$\begin{aligned} \lambda _2/\lambda _1=\varepsilon k_1k_2^*(e_0-c), \end{aligned}$$

but this obscures the fact that both eigenvalues approach zero as \(s\rightarrow 0\). Standard singular perturbation methods are not sufficient to analyze the dynamics of this system for small \(\varepsilon \).

5 TFPV for Higher Dimensions

We keep the notation and conventions from Sects. 2.1 and 2.4, but now we will focus on a TFPV \(\widehat{\pi }\) for dimension \(s>1\). The goal of this technical section is to identify distinguished parameters and discuss their relation to timescales. There is a rather obvious direct extension of results from the \(s=1\) case, but the timescale correspondence will not be as pronounced. Moreover, we will need to impose a stronger nondegeneracy condition. We abbreviate

$$\begin{aligned} \widetilde{\sigma }_i(x,\varepsilon ):=\sigma _i(x,\widehat{\pi }+\varepsilon \rho ),\quad 1\le i\le n, \end{aligned}$$
(39)

keeping in mind that \(\widetilde{\sigma }_i(x,0)>0\) for all \(x\in \widetilde{Y}\cap K\) and \(1\le i\le n-s\), due to \(\widehat{\pi }\) being a TFPV. Additionally, we set \(\widetilde{\sigma }_0:=1\).

5.1 Distinguished Small Parameters

Some notions and results from Sect. 3 can easily be modified for the case \(s>1\). For suitable \(\varepsilon _\textrm{max}>0\), we have

$$\begin{aligned} \sigma _{i}(x,\widehat{\pi }+\varepsilon \rho )>0 \text { for all }(x,\varepsilon )\in K^*,\quad 1\le i\le n-s, \end{aligned}$$

and due to \(\sigma _{n-s+1}(x,\widehat{\pi })=0\) for \(x\in \widetilde{Y}\cap K\), we obtain

$$\begin{aligned} \sigma _{n-s+1}(x,\widehat{\pi }+\varepsilon \rho )=\varepsilon \widehat{\sigma }_{n-s+1}(x,\widehat{\pi },\rho ,\varepsilon ) \end{aligned}$$

with a polynomial \(\widehat{\sigma }_{n-s+1}\), for all \((x,\varepsilon )\in K^*\).

Definition 3

Let

$$\begin{aligned} \begin{array}{rcl} L(\widehat{\pi },\rho )&{}:=&{}\inf _{x\in \widetilde{Y}\cap K}\;\left| \dfrac{\widehat{\sigma }_{n-s+1}(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })\cdot \sigma _{n-s}(x,\widehat{\pi })}\right| ,\\ U(\widehat{\pi },\rho )&{}:=&{}\sup _{x \in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_{n-s+1}(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })\cdot \sigma _{n-s}(x,\widehat{\pi })}\right| .\\ \end{array} \end{aligned}$$
(40)

Now, we call

$$\begin{aligned} \varepsilon ^*(\widehat{\pi },\rho ,\varepsilon ):=\varepsilon U(\widehat{\pi },\rho ) \end{aligned}$$
(41)

the distinguished upper bound for the TFPV \(\widehat{\pi }\) for dimension s, with parameter direction \(\rho \), of system (7). Moreover we call

$$\begin{aligned} \varepsilon _*(\widehat{\pi },\rho ,\varepsilon ):=\varepsilon L(\widehat{\pi },\rho ) \end{aligned}$$
(42)

the distinguished lower bound for the TFPV \(\widehat{\pi }\) for dimension s with parameter direction \(\rho \).

As in the case of reduction to dimension one, determining the distinguished parameters amounts to determining the extrema of a rational function on a compact set, or (when this is not possible, or not sensible) determining reasonably sharp estimates for these extrema. We note the following straightforward variant of Proposition 1.

Proposition 5

Given \(\alpha >0\), for sufficiently small \(\varepsilon _\textrm{max}\), the estimates

$$\begin{aligned} \frac{\varepsilon }{(1+\alpha )}L(\widehat{\pi },\rho ) \le \left| \dfrac{\sigma _{n-s+1}(x,\widehat{\pi }+\varepsilon \rho )}{\sigma _{1}(x,\widehat{\pi }+\varepsilon \rho )\cdot \sigma _{n-s}(x,\widehat{\pi }+\varepsilon \rho )}\right| \le \varepsilon (1+\alpha ) U(\widehat{\pi },\rho ) \end{aligned}$$
(43)

hold on \(K^*\).

5.2 The Correspondence to Timescales

Proofs of the following statements are given in Appendix (Lemmas 6 and 7).

Let \(\widehat{\pi }\) be a TFPV for dimension s, with critical manifold \(\widetilde{Y}\). Then one has

$$\begin{aligned} \widetilde{\sigma }_i(x,\varepsilon )=\varepsilon ^{i-n+s}\widehat{\sigma }_i(x,\varepsilon )\text { for all }x\in \widetilde{Y}\cap K, \quad n-s\le i\le n, \end{aligned}$$
(44)

with polynomials \(\widehat{\sigma }_i\).

Assume that (44) is given, and furthermore assume the nondegeneracy condition

$$\begin{aligned} \widehat{\sigma }_{n-s}(x,0)\not =0 \text { and }\widehat{\sigma }_{n}(x,0)\not =0 \text { on }\widetilde{Y}\cap K. \end{aligned}$$
(45)

Then the zeros \(\lambda _i(x,\widehat{\pi }+\varepsilon \rho )\) of the characteristic polynomial can be labeled such that

$$\begin{aligned} \lambda _1(x,\widehat{\pi })\not =0,\ldots , \lambda _{n-s}(x,\widehat{\pi })\not =0\quad \text { on } \widetilde{Y}\cap K, \end{aligned}$$

and

$$\begin{aligned} \lambda _i(x,\widehat{\pi }+\varepsilon \rho )=\varepsilon \widehat{\lambda }_i(x,\widehat{\pi },\rho ,\varepsilon ),\quad n-s+1\le i\le n \end{aligned}$$

where the functions \(\widehat{\lambda }_i\) are continuous in \(\varepsilon \).

Given the nondegeneracy assumptions, we turn to discussing the correspondence of \( \varepsilon _{*}\) and \( \varepsilon ^*\) to timescales. By (11), and by the definition of \(\widetilde{\sigma }_i\) in (39), one has

$$\begin{aligned} \begin{array}{rcl} -\widetilde{\sigma }_1&{}=&{}\lambda _1+\cdots +\lambda _{n-s}+\varepsilon \,(\cdots );\\ (-1)^{n-s} \widetilde{\sigma }_{n-s}&{}=&{} \sum \lambda _{j_1}\cdots \lambda _{j_{n-s}}=\lambda _{1}\cdots \lambda _{{n-s}}+\varepsilon \,(\cdots );\\ (-1)^{n-s+1} \widetilde{\sigma }_{n-s+1}&{}=&{} \sum \lambda _{i_1}\cdots \lambda _{i_{n-s+1}}\\ &{}=&{}\lambda _{1}\cdots \lambda _{n-s}\left( \lambda _{{n-s+1}}+\cdots +\lambda _n\right) +\varepsilon ^2\,(\cdots ).\\ \end{array} \end{aligned}$$

This directly provides a result on separation of timescales.

Proposition 6

Assume that the nondegeneracy condition (45) holds.

  1. (a)

    The identity

    $$\begin{aligned} \dfrac{\widetilde{\sigma }_{n-s+1}}{\widetilde{\sigma }_1\widetilde{\sigma }_{n-s}}=\dfrac{ \lambda _{{n-s+1}}+\cdots +\lambda _n}{\lambda _1+\cdots +\lambda _{n-s}}+\varepsilon ^2\,(\cdots )=\varepsilon \dfrac{ \widehat{\lambda }_{{n-s+1}}+\cdots +\widehat{\lambda }_n}{\lambda _1+\cdots +\lambda _{n-s}}+\varepsilon ^2\,(\cdots ) \end{aligned}$$

    holds on \(K^*\), with \((\cdots )\) representing a continuous function.

  2. (b)

    Given \(\alpha >0\), and \(\varepsilon _\textrm{max}\) sufficiently small, the estimates

    $$\begin{aligned} \frac{1}{(1+\alpha )}\varepsilon _*(\widehat{\pi },\rho ,\varepsilon )\le \left| \dfrac{\sum _{i\le n-s}\lambda _i(x,\widehat{\pi }+\varepsilon \rho )}{\sum _{j>n-s}\lambda _j(x,\widehat{\pi }+\varepsilon \rho )}\right| \le (1+\alpha )\varepsilon ^*(\widehat{\pi },\rho ,\varepsilon ) \end{aligned}$$
    (46)

    hold for all \((x,\varepsilon )\in K^*\). In particular, there exist constants \(C_1,\,C_2\) such that

    $$\begin{aligned} \;\;C_1 \varepsilon \le \left| \dfrac{\sum _{i\le n-s}\lambda _i(x,\widehat{\pi }+\varepsilon \rho )}{\sum _{j>n-s}\lambda _j(x,\widehat{\pi }+\varepsilon \rho )}\right| \le C_2 \varepsilon . \end{aligned}$$

Thus, for higher dimensions of the critical manifold the coefficients of the characteristic polynomial still provide—albeit weaker—estimates for timescale ratios. Informally speaking, \({\widetilde{\sigma }_{n-s+1}}/({\widetilde{\sigma }_1\widetilde{\sigma }_{n-s}})\) measures the ratio of the “fastest slow timescale” and the “fastest fast timescale.” Similar to the situation for \(s=1\), a more relevant ratio is the one of the “fastest slow timescale” and the “slowest fast timescale.” We invite readers to compare Sect. 9.1 in Appendix. We remark that for real or “essentially real” \(\lambda _1,\ldots ,\lambda _{n-s}\) one may obtain results similar to Proposition 3, but we will not pursue this further.

5.3 Further Dimensionless Parameters

Given the setting of (44), it is natural to ask about different types of dimensionless small parameters, in addition to the distinguished ones obtained from Proposition 5. We consider terms of the form

$$\begin{aligned} \dfrac{\widetilde{\sigma }_{n-s+k}}{\widetilde{\sigma }_{j_1}\cdots \widetilde{\sigma }_{j_\ell }\cdot \widetilde{\sigma }_{n-s+v_1}\cdots \widetilde{\sigma }_{n-s+v_m}} \end{aligned}$$

with \(k\ge 1\), \(\ell \ge 0\), \(m>0\), and the indices \(1\le j_1\le \cdots \le j_\ell \), \(1\le v_1\le \cdots \le v_m\) subject to the following conditions:

  1. (1)

    “Dimensionless”: By Lemma 1, this means

    $$\begin{aligned} j_1+\cdots +j_\ell +(n-s)+v_1+\cdots +(n-s)+v_m=(n-s)+k. \end{aligned}$$
  2. (2)

    “Order one in \(\varepsilon \)”:

    $$\begin{aligned} v_1+\cdots +v_m=k-1. \end{aligned}$$

Proposition 7

The only classes of dimensionless small parameters that satisfy (1) and (2) are the following:

  1. (a)

    \(m=1\) with \(\ell =1\) and \(j_1=1\), with parameters

    $$\begin{aligned} \dfrac{\widetilde{\sigma }_{n-s+k}}{\widetilde{\sigma }_1\,\widetilde{\sigma }_{n-s+k-1}}, \quad 2\le k\le s. \end{aligned}$$
    (47)
  2. (b)

    \(m=2\), \(n\ge 4\), \(s=n-1\) and \(\ell =0\), with parameters

    $$\begin{aligned} \dfrac{\widetilde{\sigma }_{2+v_1+v_2}}{\widetilde{\sigma }_{1+v_1}\,\widetilde{\sigma }_{1+v_2}}, \quad 1\le v_1\le v_2,\quad v_1+v_2\le n-2. \end{aligned}$$
    (48)

Proof

Combining (1) and (2) one finds

$$\begin{aligned} j_1+\cdots +j_\ell +(m-1)\,(n-s)=1, \end{aligned}$$

thus, necessarily \(m\le 2\) due to \(s<n\). In case \(m=1\), one has \(\ell =1\) and \(j_1=1\). In case \(m=2\), one necessarily has \(s=n-1\) and \(\ell =0\).

To obtain explicit parameter bounds in the first case, use

$$\begin{aligned} \sigma _{n-s+j}(x,\widehat{\pi }+\varepsilon \rho )=\varepsilon ^j\widehat{\sigma }_{n-s+j}(x,\widehat{\pi },\rho ,\varepsilon ),\quad j\ge 1 \end{aligned}$$

to determine

$$\begin{aligned} \begin{array}{rcl} \widetilde{L}_j(\widehat{\pi },\rho )&{}:=&{} \inf _{x\in \widetilde{Y}\cap K}\;\left| \dfrac{\widehat{\sigma }_{n-s+j}(x,\widehat{\pi },\rho ,0)}{\widehat{\sigma }_{n-s+j-1}(x,\widehat{\pi },\rho ,0)\,\sigma _1(x,\widehat{\pi })}\right| ,\\ \widetilde{U}_j(\widehat{\pi },\rho )&{}:=&{}\sup _{x\in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_{n-s+j}(x,\widehat{\pi },\rho ,0)}{\widehat{\sigma }_{n-s+j-1}(x,\widehat{\pi },\rho ,0)\,\sigma _1(x,\widehat{\pi })}\right| ,\\ \end{array} \end{aligned}$$

and small parameters

$$\begin{aligned} \delta _{j*}:=\varepsilon \cdot \widetilde{L}_j(\widehat{\pi },\rho ),\quad \delta _j^*:=\varepsilon \cdot \widetilde{U}_j(\widehat{\pi },\rho ), \quad j\ge 2. \end{aligned}$$

Remark 5

In the first case, there is a notable correspondence to eigenvalues (thus to timescales). A variant of the argument in Proposition 6 shows that

$$\begin{aligned} \dfrac{\widetilde{\sigma }_{n-s+k}}{\widetilde{\sigma }_1\,\widetilde{\sigma }_{n-s+k-1}}=\varepsilon \dfrac{\tau _k(\widehat{\lambda }_{n-s+1},\ldots ,\widehat{\lambda }_n)}{(\lambda _1+\cdots +\lambda _{n-s})\cdot \tau _{k-1}(\widehat{\lambda }_{n-s+1},\ldots ,\widehat{\lambda }_n)}+\varepsilon ^2(\cdots ), \end{aligned}$$

where \(\tau _\ell \) denotes the \(\ell ^\textrm{th}\) elementary symmetric polynomial in s variables.

5.4 Dimension Three

We specialize the results to dimension three and \(s=2\), assuming nondegeneracy. By (44), \(\widetilde{\sigma }_2\) is of order \(\varepsilon \), and \(\widetilde{\sigma }_3\) is of order \(\varepsilon ^2\).

In view of Propositions 5 and 6, we consider

$$\begin{aligned} \dfrac{\widetilde{\sigma }_2}{\widetilde{\sigma }_1^2}=\varepsilon \dfrac{\widehat{\lambda }_2+\widehat{\lambda }_3}{\lambda _1}+\varepsilon ^2\cdots . \end{aligned}$$

Informally speaking, this expression governs the ratio of the fastest slow timescale to the fast timescale, which is the pertinent ratio according to Sect. 9.1.2. We obtain

$$\begin{aligned} U=\sup _{x \in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_{2}(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })^2}\right| ,\quad \varepsilon ^*=\varepsilon \cdot U \end{aligned}$$

as well as

$$\begin{aligned} L=\;\inf _{x \in \widetilde{Y}\cap K}\left| \dfrac{\widehat{\sigma }_{2}(x,\widehat{\pi },\rho ,0)}{\sigma _{1}(x,\widehat{\pi })^2}\right| ,\quad \varepsilon _*=\varepsilon \cdot L. \end{aligned}$$

Similar to the observations in Remark 4, disparate slow eigenvalues may indicate a scenario with three timescales (informally speaking, fast, slow and very slow). To measure the disparity, we use Proposition 7 and consider

$$\begin{aligned} \dfrac{\widetilde{\sigma }_3}{\widetilde{\sigma }_1 \widetilde{\sigma }_2}=\varepsilon \dfrac{\lambda _1\widehat{\lambda }_2\widehat{\lambda }_3}{(\lambda _1+\varepsilon \cdots )(\lambda _1(\widehat{\lambda }_2+\widehat{\lambda }_3)+\varepsilon \cdots )}=\varepsilon \dfrac{\widehat{\lambda }_2\widehat{\lambda }_3}{\lambda _1(\widehat{\lambda }_2+\widehat{\lambda }_3)}+\varepsilon ^2\cdots . \end{aligned}$$

Combining parameters shows

$$\begin{aligned} \dfrac{\widetilde{\sigma }_1\widetilde{\sigma }_3}{\widetilde{\sigma }_2^2}=\dfrac{\widehat{\lambda }_2\widehat{\lambda }_3}{(\widehat{\lambda }_2+\widehat{\lambda }_3)^2}+\varepsilon \cdots =\dfrac{\widehat{\lambda }_3/\widehat{\lambda }_2}{(1+\widehat{\lambda }_3/\widehat{\lambda }_2)^2}+\varepsilon \cdots . \end{aligned}$$

Thus, the constants

$$\begin{aligned} \kappa ^*:=\sup _{x\in \widetilde{Y}\cap K}\dfrac{\sigma _1(x,\widehat{\pi }) \widehat{\sigma }_3(x,\widehat{\pi },\rho ,0)}{\widehat{\sigma }_2(x,\widehat{\pi },\rho ,0)^2}\text { and }\kappa _*:=\inf _{x\in \widetilde{Y}\cap K}\dfrac{\sigma _1(x,\widehat{\pi }) \widehat{\sigma }_3(x,\widehat{\pi },\rho ,0)}{\widehat{\sigma }_2(x,\widehat{\pi },\rho ,0)^2} \end{aligned}$$

measure the disparity of \(\widehat{\lambda }_2\) and \(\widehat{\lambda }_3\). In particular, given that \(|\lambda _3|\le |\lambda _2|\) one has

$$\begin{aligned} \left| \dfrac{\lambda _3}{\lambda _2}\right| \ge \kappa _*\text { throughout } \widetilde{Y}\cap K. \end{aligned}$$

6 Case Studies: Reduction from Dimension Three to One

In this section, we discuss two biochemically relevant modifications of the MM reaction mechanism and a non-Michaelis–Menten reaction mechanism, with low enzyme concentration, and their familiar (quasi-steady state) reductions to dimension one. This seems to be the first instance that small parameters in the spirit of Segel and Slemrod—although consistently based on linear timescales—are derived for these reaction mechanisms in a systematic manner. Note that, in the application-oriented literature, the perturbation parameter of choice mostly seems to be \(\varepsilon _{BH}=e_0/s_0\), on loan from the MM reaction mechanism.

We will directly consider the asymptotic small parameters \(\varepsilon ^*, \,\varepsilon _*,\,\mu ^*\) by application of the results in Sect. 3, and obtain rather satisfactory estimates for these. Considering the steps outlined in the Introduction, we thus complete a substantial part of Step 1. Proceeding beyond this, along the lines of Sect. 9.1, would involve considerable and lengthy work for each system, so we will not go further. However, to test and illustrate the efficacy of the parameters, we include extensive numerical simulations. We also include examples that demonstrate the limitations of the local timescale approach, and in particular show that the nondegeneracy conditions imposed on the “non-small” parameters are necessary.

6.1 Cooperativity Reaction Mechanism

The (irreversible) cooperative reaction mechanism

$$\begin{aligned} \begin{array}{rcccl} E+S&{}\overset{k_1}{\underset{k_{-1}}{\rightleftharpoons }}&{} C_1&{}\overset{k_2}{\rightharpoonup }\ {} &{}E+P,\\ S+C_1&{}\overset{k_{3}}{\underset{k_{-3}}{\rightleftharpoons }}&{} C_2&{} \overset{k_4}{\rightharpoonup }\ &{} C_1+P \end{array} \end{aligned}$$
(49)

is a non-Michaelis–Menten reaction mechanism of enzyme action. It is modeled by the mass action equations

$$\begin{aligned} \begin{array}{rclclclclcl} \dot{s}=&{}-&{} k_1(e_0-c_1-c_2)s&{}+&{}k_{-1}c_1 &{} &{}-&{} k_3sc_1 &{}+&{} k_{-3}c_2, \\ \dot{c}_1=&{}&{} k_1(e_0-c_1-c_2)s&{}-&{}(k_{-1}+k_2)c_1&{} &{}-&{}k_3sc_1 &{}+&{} (k_4+k_{-3})c_2, \\ \dot{c}_2=&{} &{} k_3sc_1 &{}&{}&{}&{} &{} &{}- &{}(k_4+k_{-3}) c_2, \end{array} \end{aligned}$$
(50)

in which the enzyme and product concentrations have been eliminated via the stoichiometric conservation laws. Typical initial conditions are \(s(0)=s_0,\,e(0)=e_0,\,\) and \(c_1(0)=c_2(0)=p(0)=0\). The conservation laws yield the compact positively invariant set

$$\begin{aligned} K:=\{(s,c_1,c_2)\in \mathbb {R}_{\ge 0}^3: 0 \le s\le s_0, 0 \le c_1+c_2 \le e_0^*\}, \end{aligned}$$
(51)

with some reference value \(e_0^*>0\). The parameter space \(\Pi =\mathbb R_{\ge 0}^8\) has elements \( (e_0,s_0,k_1,k_{-1},k_2,k_3,k_{-3},k_4)^\textrm{tr},\) and setting \(e_0=0\) defines a TFPV,

$$\begin{aligned} \widehat{\pi }:=(0,s_0,k_1,k_{-1},k_2,k_3,k_{-3},k_4)^\textrm{tr} \end{aligned}$$

for dimension one, subject to certain nondegeneracy conditions on the \(k_i\). The associated critical manifold is

$$\begin{aligned} \widetilde{Y}:=\{(s,c_1,c_2)\in \mathbb {R}^3_{\ge 0}:c_1=c_2=0\}. \end{aligned}$$
(52)

We now set \(\rho =(e_0^*,0,\ldots ,0)^\textrm{tr}\), and consider the perturbed system with parameter \(\pi =\widehat{\pi }+\varepsilon \rho \). The singular perturbation reduction (according to formula (9) in Sect. 2.1) was carried out in Noethen and Walcher (2007, Section 4) and Goeke and Walcher (2013, Examples 8.2 and 8.7). This reduction agrees with the well known classical quasi-steady state reduction for complexes of the cooperativity reaction mechanism (see Keener and Sneyd 2009, Section 1.4.4). We have

$$\begin{aligned} \dot{s} = - \frac{k_1e_0s\left( k_3k_4s+k_2(k_{-3}+k_4)\right) }{(k_1s+k_{-1}+k_2)(k_{-3}+k_4) + k_1k_3s^2},\quad s(0)=s_0. \end{aligned}$$
(53)

The quasi-steady state variety (see, Keener and Sneyd 2009) is given parametrically by

$$\begin{aligned} \begin{pmatrix} c_1\\ c_2 \end{pmatrix}= \dfrac{k_1e_0s}{(k_{-1}+k_2)(k_{-3}+k_4)+k_1(k_{-3}+k_4)s+k_1k_3s^2}\cdot \begin{pmatrix} k_{-3}+k_4 \\ k_3s \end{pmatrix},\quad 0\le s\le s_0, \end{aligned}$$

and agrees with the first order approximation of the slow manifold. Fenichel theory guarantees that (53) holds for sufficiently small \(e_0=\varepsilon e_0^*\), up to errors of order \(\varepsilon ^2\). The initial value for the reduced equation is generally chosen as \(s_0\), and we adopt this choice here (refraining from a closer analysis of the approximation error).

6.1.1 Asymptotic Small Parameters

According to the first blanket assumption in Sect. 2.4, we will assume that \((s_0,k_1,k_{-1},k_2,k_3,k_{-3},k_4)^\textrm{tr}\) is contained in a compact subset of \(\mathbb R_{\ge 0}^7\). In particular \(s_0\) and all the \(k_i\) are bounded above by some positive constants. We now further specify this compact parameter set. On \(\widetilde{Y}\cap K\) with \(\pi =\widehat{\pi }\), we have

$$\begin{aligned} \begin{array}{rcl} \sigma _1&{}=&{} (k_1+k_3)s+k_{-1}+k_2+k_{-3}+k_4;\\ \sigma _2&{}=&{} k_1k_3s^2+k_1(k_{-3}+k_4)s+(k_{-1}+k_2)(k_{-3}+k_4);\\ \widehat{\sigma }_3&{}=&{}k_1e_0^*\cdot \left( k_3k_4s+k_2(k_{-3}+k_4)\right) . \end{array} \end{aligned}$$

Due to the TFPV requirement, \(\sigma _1\) and \(\sigma _2\) must be bounded below on \(K\cap \widetilde{Y}\) by positive constants,

$$\begin{aligned} k_{-1}+k_2 + k_{-3}+k_4&=\min \sigma _1>0,\\ (k_{-1}+k_2)(k_{-3}+k_4)&=\min \sigma _2>0, \end{aligned}$$

and from this one sees that the TFPV conditions hold if and only if both \(k_{-1}+k_2\) and \(k_{-3}+k_4\) are bounded below by positive constants. Nontriviality of the reduced equation (53) also imposes conditions on \(k_1\), \(k_2\), \(k_3\), and \(k_4\). Moreover, for instance, in the limit \(k_3\rightarrow 0\), with \(k_4\) bounded below by a positive constant, the reduced equation is nontrivial but approaches the Michaelis–Menten equation. We will take a closer look at this situation below.
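The coefficients listed above can be verified symbolically from the Jacobian of (50). The following sketch (using the sympy library; the names km1 and km3 stand for \(k_{-1}\) and \(k_{-3}\)) computes the characteristic polynomial on the critical manifold with \(e_0\) kept as a symbol; setting \(e_0=0\) in the resulting \(\sigma _1,\sigma _2\) recovers the expressions above, while \(\sigma _3\) carries the factor \(e_0\), in accordance with \(\widehat{\sigma }_3\).

```python
import sympy as sp

s, c1, c2 = sp.symbols('s c1 c2', nonnegative=True)
e0, k1, km1, k2, k3, km3, k4 = sp.symbols('e0 k1 km1 k2 k3 km3 k4', positive=True)

# Right-hand side of the mass action system (50); km1 = k_{-1}, km3 = k_{-3}
rhs = sp.Matrix([
    -k1*(e0 - c1 - c2)*s + km1*c1 - k3*s*c1 + km3*c2,
     k1*(e0 - c1 - c2)*s - (km1 + k2)*c1 - k3*s*c1 + (k4 + km3)*c2,
     k3*s*c1 - (k4 + km3)*c2,
])

J = rhs.jacobian([s, c1, c2]).subs({c1: 0, c2: 0})   # Jacobian on the critical manifold
lam = sp.symbols('lam')
# det(lam*I - J) = lam^3 + sigma1*lam^2 + sigma2*lam + sigma3
coeffs = sp.Poly((lam*sp.eye(3) - J).det(), lam).all_coeffs()
sigma1, sigma2, sigma3 = [sp.factor(c) for c in coeffs[1:]]

print("sigma1 =", sigma1)   # k1*e0 + (k1 + k3)*s + km1 + k2 + km3 + k4
print("sigma2 =", sigma2)   # reduces to the expression above for e0 = 0
print("sigma3 =", sigma3)   # e0*k1*(k3*k4*s + k2*(km3 + k4))
```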

Generally, the TFPV and nondegeneracy conditions will certainly hold whenever \((s_0,k_1,k_{-1},k_2,k_3,k_{-3},k_4)^\textrm{tr}\) is contained in a compact subset of the open positive orthant. Our aim is now to determine a suitable dimensionless parameter that corresponds to the legitimacy of (53). The typical requirement in the literature, that \(e_0/s_0\ll 1\), yields a sufficient asymptotic condition for bounded \(s_0\), since singular perturbation theory guarantees convergence as \(e_0\rightarrow 0\), but no quantitative information can be inferred. In contrast, we use the results of Sect. 3 to provide a correspondence to linear timescales.

The explicit calculation of \(\varepsilon ^*\) according to Proposition 2, i.e., determining the maximum of

$$\begin{aligned} s\mapsto r(s):=\dfrac{\widehat{\sigma }_3}{\sigma _1\sigma _2},\quad 0\le s\le s_0 \end{aligned}$$
(55)

involves the computation of the roots of the numerator of the derivative, thus of a parameter-dependent cubic polynomial q in s. The signs of all the coefficientsFootnote 15 are negative, except possibly the constant coefficient. By the Descartes rule of signs, the polynomial q has at most one positive zero. If there exists no positive zero, then r is strictly decreasing for \(0\le s< \infty \) and attains its maximum at \(s=0\),Footnote 16 and in any case one has

$$\begin{aligned} \varepsilon ^*\ge r(0)= \varepsilon _{MM}\cdot \frac{k_{-1}+k_2}{k_{-1}+k_2+k_{-3}+k_4}. \end{aligned}$$

If a positive zero \(s^*\) existsFootnote 17, then the maximum of r will be attained there. An exact calculation via Cardano does not provide any palatable information, but an upper bound for \(\varepsilon ^*\) is obtained rather easily from the monotonicity of the \(\sigma _j\):

$$\begin{aligned} \begin{array}{rcl} \varepsilon ^* &{}\le &{} \varepsilon \frac{\sup _{\widetilde{Y} \cap K} \widehat{\sigma }_3}{\inf _{\widetilde{Y} \cap K} \sigma _1 \inf _{\widetilde{Y} \cap K} \sigma _2}\\ &{}=&{}{ \frac{k_1e_0}{k_{-1}+k_2}\cdot \bigg (\frac{k_3k_4s_0+k_2(k_{-3}+k_4)}{(k_{-1}+k_2+k_{-3}+k_4)(k_{-3}+k_4)} \bigg )}\\ &{}=&{} \varepsilon _{MM}\cdot \bigg (\frac{k_3k_4s_0(k_{-1}+k_2)}{k_2(k_{-1}+k_2+k_{-3}+k_4)(k_{-3}+k_4)}+ \frac{k_{-1}+k_2}{k_{-1}+k_2+k_{-3}+k_4}\bigg )=:\varepsilon _C. \end{array} \end{aligned}$$
(56)

Comparing this to the lower estimate r(0), one finds that the upper estimate by \(\varepsilon _C\) is acceptable as long as \(s_0\) is not too large, but weakens with increasing \(s_0\). As noted in Sect. 3, \(\varepsilon ^*\)—and by extension \(\varepsilon _C\)—provides an estimate for the ratio of slowest to fastest timescale. Thus, smallness of \(\varepsilon ^*\) is a necessary condition, but it may not be sufficient when the fast timescales are far apart.

We therefore consider an estimate for the ratio of the slow timescale to the slower of the fast ones via \(\mu ^*\). It is straightforward to verify that \(\sigma _1^2-4\sigma _2 \ge 0\), thus all eigenvalues are real, and Proposition 4(b) is applicable. The explicit calculation of \(\mu ^*\) again involves a cubic polynomial in s, for \(0\le s\le s_0\). In this case, the Descartes sign rule allows for two or no positive zeros, and there exist at most two local maxima for \(0\le s<\infty \). One of these is located at \(s=0\), yielding in any case the lower estimate

$$\begin{aligned} \mu ^*\ge \varepsilon _{MM}\cdot \frac{k_{-1}+k_2+k_{-3}+k_4}{k_{-3}+k_4}, \end{aligned}$$
(57)

but an explicit computation of the maximum provides little information. Instead, we again resort to an upper bound

$$\begin{aligned} \begin{array}{rcl} \mu ^* &{}\le &{} \varepsilon \frac{\sup _{\widetilde{Y} \cap K} \widehat{\sigma }_3\sup _{\widetilde{Y} \cap K} \sigma _1}{\inf _{\widetilde{Y} \cap K} \sigma _2^2}\\ &{}=&{} \varepsilon _{MM}\cdot \bigg (\frac{k_3k_4s_0+k_2(k_{-3}+k_4)}{k_2(k_{-3}+k_4)}\cdot \frac{(k_1+k_3)s_0+k_{-1}+k_2+k_{-3}+k_4}{k_{-3}+k_4}\bigg ) =:\mu _C. \end{array} \end{aligned}$$
(58)

Comparison with (57) shows that the estimate by \(\mu _C\) is satisfactory as long as \(s_0\) is not too large, but it will become rather weak with increasing \(s_0\).

All estimates involve the distinguished Michaelis–Menten parameter \(\varepsilon _{MM}\), multiplied by some positive factor. For both estimates in (57), (58) this factor is \(>1\).
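To illustrate how the closed-form bounds (56) and (58) compare with the grid-maximized values of \(\varepsilon ^*\) and \(\mu ^*\), the following Python sketch evaluates both for increasing \(s_0\), with all rate constants set to one (hypothetical values, arbitrary units) and \(e_0=10^{-2}\). For \(s_0=1\) it reproduces the values \(\varepsilon ^*=1.25\times 10^{-3}\) and \(\mu ^*=5\times 10^{-3}\) reported for \(e_0=10^{-2}\) in Fig. 1, while the bounds visibly deteriorate for large \(s_0\).

```python
import numpy as np

# Hypothetical rate constants (arbitrary units); e0 fixed and small
e0, k1, km1, k2, k3, km3, k4 = 1e-2, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0
eps_MM = e0*k1*k2/(km1 + k2)**2

for s0 in (1.0, 10.0, 100.0):
    s = np.linspace(0.0, s0, 20001)
    # sigma-coefficients on the critical manifold c1 = c2 = 0 (cf. the expressions above)
    sig1 = (k1 + k3)*s + km1 + k2 + km3 + k4
    sig2 = k1*k3*s**2 + k1*(km3 + k4)*s + (km1 + k2)*(km3 + k4)
    sig3 = k1*e0*(k3*k4*s + k2*(km3 + k4))
    eps_star = np.max(sig3/(sig1*sig2))
    mu_star = np.max(sig3*sig1/sig2**2)
    # closed-form upper bounds (56) and (58)
    eps_C = eps_MM*(k3*k4*s0*(km1 + k2)/(k2*(km1 + k2 + km3 + k4)*(km3 + k4))
                    + (km1 + k2)/(km1 + k2 + km3 + k4))
    mu_C = eps_MM*((k3*k4*s0 + k2*(km3 + k4))/(k2*(km3 + k4))
                   * ((k1 + k3)*s0 + km1 + k2 + km3 + k4)/(km3 + k4))
    print("s0=%6.1f  eps*=%.2e  eps_C=%.2e  mu*=%.2e  mu_C=%.2e"
          % (s0, eps_star, eps_C, mu_star, mu_C))
```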

6.1.2 Numerical Simulations

While we have obtained asymptotic timescale estimates for given reaction parameters, these estimates are unsatisfactory for large substrate concentrations. Moreover, by its nature our approach alone does not provide an upper estimate for the distance of the solution to the slow manifold. So, to obtain an a priori gauge of the efficacy of (53), it is natural to resort to numerical simulations. These simulations serve two purposes, a positive one and a negative one. On the positive side, they illustrate that the small parameters \(\varepsilon ^*\) and \(\mu ^*\) are good indicators for the viability of the QSS reduction, over a wide parameter range. On the negative side, numerical examples highlight parameter combinations where consideration of \(\varepsilon ^*\) and \(\mu ^*\) is misleading. Such cases can be traced back to problems with the blanket assumptions from Sect. 2.4, or with assumptions implicit in the proofs of Propositions 2 and 3.

Fig. 1

Cooperativity reaction mechanism: Numerical simulations indicate that the accuracy of (53) improves along the parameter ray direction as both \(\varepsilon ^* \rightarrow 0\) and \(\mu ^*\rightarrow 0\). In all panels, the parameters (in arbitrary units) are: \(s_0=1.0\), \(k_1=1.0\), \(k_2=1.0\), \(k_{-1}=1.0\), \(k_3=1.0\), \(k_{-3}=1.0\) and \(k_4=1.0\). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). The solid black curve is the numerical solution to the mass action system (50). The broken red curve is the numerical solution to (53). Top left panel: Simulation performed with \(e_0=1.0\). The numerically computed dimensionless parameters are: \(\varepsilon ^*=1.25 \times 10^{-1}, \mu ^*=5 \times 10^{-1}\), and there is visible error. Top right panel: Simulation performed with \(e_0=10^{-1}\). The numerically computed dimensionless parameters are: \(\varepsilon ^*=1.25 \times 10^{-2}, \mu ^*=5 \times 10^{-2}\). There is visible error, but the approximation (53) appears to improve. Bottom left panel: Simulation performed with \(e_0=10^{-2}\). The numerically computed dimensionless parameters are: \(\varepsilon ^*=1.25 \times 10^{-3}, \mu ^*=5 \times 10^{-3}\). The QSS reduction (53) is virtually indistinguishable from (50). Bottom right panel: Simulation performed with \(e_0=10^{-3}\). The numerically computed dimensionless parameters are: \(\varepsilon ^*=1.25 \times 10^{-4}, \mu ^*=5 \times 10^{-4}\). The QSS reduction (53) is again virtually indistinguishable from (50) (Color figure online)

We will consider some specific examples, and instead of relying on \(\varepsilon _C\) and \(\mu _C\) we will compute both \(\varepsilon ^*\) and \(\mu ^*\) numerically in the simulations that follow. This is still far less computationally involved than working with eigenvalues of linearizations on \(\widetilde{Y}\cap K\). In the figures illustrating all the simulations, to show the behavior of trajectories over the interval \(0\le t<\infty \), time is mapped to

$$\begin{aligned} \tau = t/T, \;\tau \in [0,1], \end{aligned}$$

where the numerical solution has been computed on the interval [0, T], and T is chosen large enough to ensure the numerical simulations capture the long-time dynamics of the reaction. We start with some examples that document the efficacy of the parameters in “normal” parameter domains:

  1. 1.

    In a first numerical study, we compare the numerical solution to the mass action equations (50) with the numerical solution to (53) in the scenario when all parameters except \(e_0\) are of the same order of magnitude. In the simulations, all parameter values except \(e_0\) are set equal to 1, and \(e_0\) is varied from \(10^0\) to \(10^{-3}\). The simulation results are reported in Fig. 1, which reinforces the assertion that \(\varepsilon ^* \ll 1\) and \(\mu ^* \ll 1\) support the validity of (53). Moreover, we see that smallness of \(\mu ^*\) is the more relevant condition.

  2. 2.

    In a second numerical study, we examine a case with varied parameter values, but all (except \(e_0\)) within the same order of magnitude. The results are reported in Fig. 2, and once again support the claim that the accuracy of (53) improves as \(\varepsilon ^*\rightarrow 0\) and \(\mu ^* \rightarrow 0\), with higher relevance for \(\mu ^*\).

  3. 3.

    As a third numerical example, we consider a combination of parameter values that are somewhat disparate in terms of their magnitudes. Nevertheless, we once again confirm that the accuracy of (53) improves as \(\varepsilon ^*\rightarrow 0\) and \(\mu ^* \rightarrow 0\), again with higher relevance for \(\mu ^*\) (see Fig. 3). This simulation also debunks the commonly accepted notion that \(e_0/s_0 \ll 1\) is sufficient for the accuracy of (53).

Throughout these simulations we observe that the magnitude of \(\mu ^*\) is more relevant for the quality of the QSS approximation than the magnitude of \(\varepsilon ^*\). This is in accordance with the results of Sect. 3.2.
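For reference, the following Python/SciPy sketch outlines the type of computation behind these simulations: it evaluates \(\varepsilon ^*\) and \(\mu ^*\) by maximization over a grid on \(\widetilde{Y}\cap K\), and integrates the mass action system (50) together with the reduction (53), using the parameter values of the first study (all rate constants equal to one). The time horizon T below is an ad hoc choice and not the value used for the figures.

```python
import numpy as np
from scipy.integrate import solve_ivp

# Parameters of the first numerical study (arbitrary units); e0 can be varied
s0, k1, km1, k2, k3, km3, k4 = 1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0
e0 = 1.0e-2

def full(t, u):                     # mass action system (50)
    s, c1, c2 = u
    return [-k1*(e0 - c1 - c2)*s + km1*c1 - k3*s*c1 + km3*c2,
             k1*(e0 - c1 - c2)*s - (km1 + k2)*c1 - k3*s*c1 + (k4 + km3)*c2,
             k3*s*c1 - (k4 + km3)*c2]

def reduced(t, u):                  # QSS reduction (53)
    s = u[0]
    num = k1*e0*s*(k3*k4*s + k2*(km3 + k4))
    den = (k1*s + km1 + k2)*(km3 + k4) + k1*k3*s**2
    return [-num/den]

# eps^* and mu^* by grid maximization on the critical manifold (cf. Sect. 6.1.1)
s = np.linspace(0.0, s0, 2001)
sig1 = (k1 + k3)*s + km1 + k2 + km3 + k4
sig2 = k1*k3*s**2 + k1*(km3 + k4)*s + (km1 + k2)*(km3 + k4)
sig3 = k1*e0*(k3*k4*s + k2*(km3 + k4))
print("eps^* = %.3e, mu^* = %.3e" % (np.max(sig3/(sig1*sig2)), np.max(sig3*sig1/sig2**2)))

T = 1500.0                          # ad hoc horizon covering the slow dynamics for these values
t_eval = np.linspace(0.0, T, 3000)
sol_full = solve_ivp(full, (0.0, T), [s0, 0.0, 0.0], t_eval=t_eval, rtol=1e-8, atol=1e-10)
sol_red = solve_ivp(reduced, (0.0, T), [s0], t_eval=t_eval, rtol=1e-8, atol=1e-10)
print("max |s_full - s_QSS| = %.3e" % np.abs(sol_full.y[0] - sol_red.y[0]).max())
```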

Fig. 2

Cooperativity reaction mechanism: Numerically computed \({\mu ^*}\) and \({\varepsilon ^*}\) give an a priori indication of the accuracy of (53). In both panels, the parameters (in arbitrary units) are: \(s_0=10^2\), \(k_1=20\), \(k_2=50\), \(k_{-1}=50\), \(k_3=10\), \(k_{-3}=20\) and \(k_4=40\). The solid black curve is the numerical solution to the mass action system (50). The broken red curve is the numerical solution to (53). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Left panel: \(e_0=1.0\) and \(\varepsilon ^* = 6.25 \times 10^{-2}\) but \(\mu ^*\) is roughly \(2.67 \times 10^{-1}\) and the QSS approximation (53) is inaccurate. Right panel: \(e_0=10^{-2}\), \(\varepsilon ^*\) is numerically estimated to be \(6.25 \times 10^{-4}\) and \(\mu ^*\) is numerically estimated to be roughly \(2.67\times 10^{-3}\). In this simulation the validity of (53) clearly improves along the parameter ray \(\rho =(e_0^*,0,\ldots ,0)^\textrm{tr}\) as \(\mu ^* \rightarrow 0.\) Thus, \(e_0\) must be small enough so that \(0<\mu ^*\ll 1\) (recall that \(\mu ^* \ll 1\) implies \(\varepsilon ^* \ll 1).\) We see that \(\varepsilon ^* \ll 1\) provides too optimistic a prediction and that \(\mu ^*\ll 1\) is a better indicator for the accuracy of the reduction (53) (Color figure online)

Fig. 3

Cooperativity reaction mechanism: Numerically computed \({\mu ^*}\) and \({\varepsilon ^*}\) give an a priori indication of the long-time accuracy of (53). In both panels, the parameters (in arbitrary units) are: \(s_0=100\), \(k_1=1.0\), \(k_2=k_{-1}=10^{2}\), \(k_3=2\times 10^{3}\), \(k_{-3}=k_4=10^5\). The solid black curve is the numerical solution to the mass action system (50). The broken red curve is the numerical solution to (53). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Left panel: \(e_0=1.0\) and \(\varepsilon ^* \approx 6.5\cdot 10^{-4}\) but \(\mu ^*\) is roughly 1.0. Right panel: \(e_0=10^{-3}\), \(\varepsilon ^* \approx 6.5\times 10^{-7}\) and \(\mu ^* \approx 10^{-3}\); the reduction (53) is an excellent approximation to (50). Note that although \(e_0/s_0 \ll 1\) the reduction (53) is inaccurate: The failure in the left panel is immediate (and severe), despite the fact that \(e_0/s_0=10^{-2}\) (Color figure online)

6.1.3 Exceptional Cases: Near-Degeneracy and Near-Invariance

Here, we briefly discuss two special scenarios with \(\mu ^*\gg 1\), but precede this by a word of caution. Obviously, whenever \(\mu ^*>1\), then the implicit assumptions in the proofs of Propositions 3 and 4 are violated for the given values of \(e_0\), and the propositions are not applicable in this range. To enable applicability, \(\varepsilon _\textrm{max}\) would have to be adjusted to a smaller value. However, the consideration of such extreme cases provides insight into the significance of various parameters.

The first case involves a near-degeneracy scenario. The critical variety contains a degenerate point and \(1 \ll \mu ^*\), while at first sight the QSS reduction (53) appears to be highly accurate. In the second case, a two-dimensional nearly invariant subspace emerges within phase space. We now describe the two cases in detail:

  1. 1.

    Consider the parameter point

    $$\begin{aligned} \pi ^{\ddag } = (0,s_0,k_1,0,0,k_3,k_{-3},k_4)^{\text {tr}}, \end{aligned}$$

    thus in addition to \(e_0=0\) one has \(k_{-1}=k_2=0\) (which is problematic in view of nondegeneracy conditions). The associated critical variety, Y, consists of two intersecting lines of equilibria (and is therefore not a manifold)Footnote 18

    $$\begin{aligned} Y:= \{(s,c_1,c_2)\in \mathbb {R}^3_{\ge 0}: c_1=c_2=0\} \cup \{(s,c_1,c_2)\in \mathbb {R}^3_{\ge 0}: s=c_2=0\}. \end{aligned}$$

    The perturbation form of the mass action equations with \(e_0 = \varepsilon e_0^*, k_2 = \varepsilon k_2^*\) and \(k_{-1} =\varepsilon k_{-1}^*\) is

    $$\begin{aligned} \begin{pmatrix}\dot{s}\\ \dot{c}_1\\ \dot{c}_2\end{pmatrix} = \begin{pmatrix}\;\;(k_1-k_3) &{} k_1s+k_{-3}\\ -(k_1+k_3)&{} -k_1s+k_4+k_{-3}\\ k_3 &{} -(k_{-3}+k_4)\end{pmatrix}\begin{pmatrix}sc_1\\ c_2\end{pmatrix} + \varepsilon \begin{pmatrix}-k_1e_0^*s +k_{-1}^*c_1\\ \;\;k_1e_0^*s-(k_{-1}^*+k_2^*)c_1\\ 0\end{pmatrix}. \end{aligned}$$
    (59)

    In this case, the rank of the Jacobian is not constant:

    $$\begin{aligned} \textrm{rank}\;\; D_1h(s,c_1,c_2,\pi ^{\ddagger })&=1,\;\;\text {if}\;\;(s,c_1,c_2)=(0,0,0);\\ \textrm{rank}\;\; D_1h(s,c_1,c_2,\pi ^{\ddagger })&= 2, \;\;\text {otherwise}. \end{aligned}$$

    While the rank condition from Sect. 2.1 failsFootnote 19 on Y, it is straightforward to verify that the compact submanifolds defined by

    $$\begin{aligned} \widetilde{Y}_{1}&:= \{(s,c_1,c_2)\in K: c_1=c_2=0 \;\;\text {and} \;\;s \ge \theta _1\}, \quad 0< \theta _1<s_0,\\ \widetilde{Y}_{2}&:= \{(s,c_1,c_2)\in K: s=c_2=0 \;\;\text {and} \;\;c_1 \ge \theta _2\}, \quad 0<\theta _2<e_0^*, \end{aligned}$$

    are normally hyperbolic and attracting. Thus, for \(\pi \) sufficiently close to \(\pi ^{\ddag }\), and for \(s_0 >0\), trajectories will rapidly approach the attracting branch \( \widetilde{Y}_{1}\). Projection of the perturbation onto the tangent space of \( \widetilde{Y}_{1}\), according to (9), yields

    $$\begin{aligned} \begin{pmatrix} \dot{s} \\ \dot{c}_1\\ \dot{c}_2 \end{pmatrix}=&{} \varepsilon \begin{pmatrix} 1 &{}{} \frac{k_1(k_3s+k_4+k_{-3})-k_3k_4}{k_1(k_3s+k_4+k_{-3})} &{}{} \frac{(2k_3s+k_4+2k_{-3})k_1-k_3k_4}{k_1(k_3s+k_4+k_{-3})}\\ 0 &{}{} 0 &{}{}0 \\ 0&{}{} 0 &{}{}0\end{pmatrix} \begin{pmatrix}-k_1e_0^*s\\ \;\;k_1e_0^*s\\ 0\end{pmatrix} \end{aligned}$$

    and the corresponding reduction on \( \widetilde{Y}_{1}\) is

    $$\begin{aligned} \dot{s} = -\frac{e_0k_4k_3s}{k_{-3}+k_4+k_3s}. \end{aligned}$$
    (62)

    Remarkably, one can recover (62) by setting \(k_{-1}=k_2=0\) in (53). Thus, equation (62) can be viewed as a special case of (53) in the limit of small \(k_2\) and \(k_{-1}\). Moreover, numerical simulations seem to indicate that the reduction (53) is valid over the full time course, even when \(\mu ^*\) is quite large (see Fig. 4, Left panel). But this is illusory. Both (53) and (62) fail to approximate the depletion of s near the origin, as the Right panel shows. Thus, near-degeneracy scenarios can generate conditions in which (53) may appear to yield an excellent approximation. But recall that small \(\varepsilon ^*\) combined with large \(\mu ^*\) indicates that two eigenvalues are small, and this necessarily prohibits the reduction from being valid over the complete time course. There are other degenerate scenarios for this reaction (for instance, \(k_{-3}=k_4=0\) or \(k_{-3}=k_4=k_2=k_{-1}=0\) with all other parameters bounded below by a positive constant). We will not further investigate these.

  2. 2.

    In the final numerical example of this case study, we exhibit a scenario for which (53) provides a valid approximation even though \(1 \ll \mu ^*\). This can happen, for instance, in the limit of small \(e_0\) and small \(k_3\). For \(k_3 = 0\), the two-dimensional subspace \(V:= \{(s,c_1,c_2) \in \mathbb {R}^3: c_2 = 0\}\) is invariant. One approach to such a scenario is to consider a singular perturbation reduction with both \(e_0= \varepsilon e_0^{*}\) and \(k_3 = \varepsilon k_3^{*}\) of order \(\varepsilon \). The perturbation form of the mass action system is

    $$\begin{aligned} \begin{pmatrix} \dot{s}\\ \dot{c}_1\\ \dot{c}_2\end{pmatrix} = \begin{pmatrix} k_1s + k_{-1} &{} k_1s+k_{-3} \\ -k_1s-(k_{-1}+k_2) &{} -k_1s + (k_{-3}+k_4) \\ 0 &{} -(k_{-3}+k_4)\end{pmatrix}\begin{pmatrix} c_1\\ c_2\end{pmatrix} + \varepsilon \begin{pmatrix}-k_1e_0^{*}s-k_3^{*}c_1s\\ \;\;k_1e_0^{*}s-k_3^{*}c_1s\\ k_3^{*}c_1s\end{pmatrix}, \end{aligned}$$
    (63)

    with the critical manifold given by \(c_1=c_2=0\). Projection onto the critical manifold according to (9) yields

    $$\begin{aligned} \begin{pmatrix} \dot{s}\\ \dot{c}_1\\ \dot{c}_2\end{pmatrix} =\varepsilon \begin{pmatrix} 1 & \frac{k_1s+k_{-1}}{k_1s+k_{-1}+k_2} & \frac{k_1s(k_2+k_4)+k_4k_{-1}-k_2k_{-3}}{(k_1s+k_{-1}+k_2)(k_{-3}+k_4)}\\ 0 & 0 & 0 \\ 0 & 0 & 0\end{pmatrix}\begin{pmatrix}-k_1e_0^{*}s\\ \;\;k_1e_0^{*}s\\ 0\end{pmatrix}, \end{aligned}$$

    and thus the QSS reduction

    $$\begin{aligned} \dot{s} = -\frac{k_1k_2e_0s}{k_1s+k_{-1}+k_2}, \end{aligned}$$

    which corresponds to the sQSSA of the MM reaction mechanism. One may also regard this from a different perspective: For fixed \(k_3\), one obtains the reduction (53); letting \(k_3\rightarrow 0\) then yields the Michaelis–Menten equation. Notably, the lower estimate (57) for \(\mu ^*\) is independent of \(k_3\), so a large \(\mu ^*\) remains large as \(k_3\rightarrow 0\). On the other hand, the upper estimate for \(\varepsilon ^*\) decreases as \(k_3\rightarrow 0\). We recover the Michaelis–Menten equation because a slight perturbation of \(k_3=0\) results in V being nearly invariant (see, e.g., Goeke et al. 2017 for the notion). Biochemically, near invariance of V amounts to gradually “turning off” the cooperative mechanism, since the secondary complex \(C_2\) is produced at a very small rate. Mathematically, near invariance of V implies, for the given initial values (thus \(c_2(0)=0\)), that the relevant dynamics are essentially two-dimensional even prior to reduction, and that the further reduction to a one-dimensional manifold depends only on a single eigenvalue ratio. In the simulation example, the fast eigenvalue with smaller absolute value (which generally is responsible for the slow-fast separation) has negligible influence, since the dynamics evolve on an invariant manifold very close to \(c_2 = 0\). Consequently, \(\varepsilon ^*\) (or indeed \(\varepsilon _{MM}\)) is the relevant quantity rather than \(\mu ^*\); see Fig. 5.

Fig. 4

Cooperativity reaction mechanism: Numerically computed \({\mu ^*}\) and \({\varepsilon ^*}\) give an a priori indication of the long-time accuracy of (53). In both panels, the parameters (in arbitrary units) are: \(e_0=10^{-5}\), \(k_1=1.0\), \(k_2=k_{-1}=10^{-3}\), \(k_3=10^{4}\), \(k_{-3}=10^2\) and \(k_4=10^6\). The solid black curve is the numerical solution to the mass action system (50). The broken red curve is the numerical solution to (53). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Left panel: \(s_0=100\) and \(\varepsilon ^* \approx 10^{-8}\) but numerically computed \(\mu ^*\) is roughly \(10^1\). Nevertheless, the QSS reduction (53) appears to be very good. However, the reduction fails near the origin, which is not captured in the Left panel due to limited resolution. Right panel: \(s_0=10^{-1}\), and the reduction (53) clearly fails to approximate the timecourse of s. This example illustrates that \(\mu ^*\ll 1\) is necessary for the long-time validity of (53) (Color figure online)

Fig. 5

Cooperative reaction mechanism with nearly invariant subspace: When the three-dimensional dynamics is nearly two-dimensional, \({\varepsilon ^*}\) provides a good a priori measure for the accuracy of (53). In both panels the solid black curve is the numerical solution to the mass action equations (50). The broken red curve is the numerical solution to the QSS reduction (53). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\), and the parameters (in arbitrary units) are: \(e_0=1.0\), \(k_1=1.0\), \(k_2=k_{-1}=10^{2}\), \(k_3=10^{-5}\), \(k_{-3}=10^{-1}\) and \(k_4=10^{-1}\). Left panel: \(s_0=100\) and \(\varepsilon ^* \approx 2.5 \times 10^{-3}\) but the numerically computed \(\mu ^*\) is roughly 2.5. Nevertheless, the QSS reduction (53) is very accurate. By near-invariance of V, (53) effectively reduces to the Michaelis–Menten equation, and the pertinent dynamics unfold in the two-dimensional subspace V. Right panel: \(s_0=10^{-1}\), and we have confirmation that the long-time accuracy of the reduction (53) holds, even though \(\mu ^*\approx 2.5\) remains of order unity (Color figure online)

6.2 Uncompetitive Inhibition Reaction Mechanism

The irreversible MM reaction mechanism in the presence of an uncompetitive inhibitor

$$\begin{aligned} \begin{array}{rcccl} E+S&{}\overset{k_1}{\underset{k_{-1}}{\rightleftharpoons }}&{} C_1&{}\overset{k_2}{\rightharpoonup }\ {} &{}E+P,\\ C_1+I&{}\overset{k_{3}}{\underset{k_{-3}}{\rightleftharpoons }}&{} C_2,&{} &{} \end{array} \end{aligned}$$
(64)

is modeled deterministically by the system

$$\begin{aligned} \begin{array}{rclclclclcl} \dot{s}=&{}-&{} k_1(e_0-c_1-c_2)s&{}+&{}k_{-1}c_1, &{} \\ \dot{c}_1=&{}&{} k_1(e_0-c_1-c_2)s&{}-&{}(k_{-1}+k_2)c_1&{} -&{} k_3(i_0-c_2)c_1&{} + &{} k_{-3}c_2,\\ \dot{c}_2=&{} &{} k_3 (i_0-c_2)c_1 &{}- &{}k_{-3} c_2, \end{array} \end{aligned}$$
(65)

via stoichiometric conservation laws. The standard initial conditions are \((s,c_1,c_2)(0)=(s_0,0,0)\). We fix a reference value \(e_0^*\) and obtain from the conservation laws the compact positively invariant set

$$\begin{aligned} K:=\{(s,c_1,c_2)\in \mathbb {R}^3_{\ge 0}: 0 \le s\le s_0,\;\; c_1+c_2 \le e_0^*, \;\;c_2 \le \min \{e_0^*,i_0\}\}. \end{aligned}$$

The parameter space \(\Pi =\mathbb R_{\ge 0}^8\) has elements \(\pi =(e_0,s_0,k_1,k_{-1},k_2,k_3,k_{-3},i_0)^\textrm{tr}\). Given suitable nondegeneracy conditions on the parameters (to be specified below), setting \(e_0=0\) defines a TFPV for dimension one:

$$\begin{aligned} \widehat{\pi }:=(0,s_0,k_1,k_{-1},k_2,k_3,k_{-3},i_0)^\textrm{tr}, \end{aligned}$$

with associated critical manifold

$$\begin{aligned} \widetilde{Y}:=\{(s,c_1,c_2)\in \mathbb {R}^3_{\ge 0}: c_1=c_2=0\}. \end{aligned}$$

We set \(\rho =(e_0^*,0,\ldots ,0)^\textrm{tr}\) and consider the ray \(\varepsilon \mapsto \widehat{\pi }+\varepsilon \rho \) in parameter space. Then, the perturbed system has the form

$$\begin{aligned} \begin{pmatrix}\dot{s}\\ \dot{c}_1\\ \dot{c}_2\end{pmatrix}= \begin{pmatrix}k_1s+k_{-1}&k_1s\\ -k_1s-(k_{-1}+k_2)-k_3(i_0-c_2) & -k_1s+k_{-3}\\ k_3(i_0-c_2)&-k_{-3}\end{pmatrix}\begin{pmatrix}c_1\\ c_2\end{pmatrix} \nonumber \\ + \varepsilon \begin{pmatrix}-k_1e_0^{*}s\\ \;\;k_1e_0^{*}s\\ 0\end{pmatrix}. \end{aligned}$$
(66)

According to (9), the singular perturbation reduction of (66) is given by

$$\begin{aligned} \begin{pmatrix}\dot{s}\\ \dot{c}_1\\ \dot{c}_2\end{pmatrix}= \varepsilon \begin{pmatrix}1 & \frac{(k_1s+k_{-1})k_{-3}+i_0k_1k_3s}{(k_1s+k_2+k_{-1})k_{-3}+i_0k_1k_3s} & \frac{(k_1s+k_{-1})k_{-3}+(i_0k_3+k_2)k_1s}{(k_1s+k_2+k_{-1})k_{-3}+i_0k_1k_3s}\\ 0& 0&0\\ 0&0&0\end{pmatrix} \begin{pmatrix}-k_1e_0^{*}s\\ \;\;k_1e_0^{*}s\\ 0\end{pmatrix}, \end{aligned}$$
(67)

thus \(\dot{c}_1=\dot{c}_2=0\) and

$$\begin{aligned} \dot{s}=-\frac{k_1e_0k_2k_{-3}\,s}{(k_1s+k_2+k_{-1})k_{-3}+i_0k_1k_3s},\quad s(0)=s_0, \end{aligned}$$
(68)

in the limiting case of small \(e_0=\varepsilon e_0^*\), up to errors of order \(\varepsilon ^2\).

The reduced equation (68) has been previously reported in the literature (see, e.g., Schnell and Mendoza 2001). It differs from the classical QSS reduction, which is obtained by substituting exact equations for the \(c_1\)- and \(c_2\)-nullclines into (65). But in accordance with (Goeke et al. 2017, Proposition 5), the difference between the classical reduction and (68) is of order \(\varepsilon ^2\). Typically, in numerical simulations there will only be noticeable differences between the classical reduction and the Fenichel reduction at very large substrate concentrations.
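
For the numerical comparisons in Sect. 6.2.2 below, both the mass action system (65) and the reduction (68) are integrated numerically. A minimal Python sketch of such a comparison (our own illustration, not part of the original computations; it assumes NumPy and SciPy are available and uses the rate constants from the caption of Fig. 6) is:

import numpy as np
from scipy.integrate import solve_ivp

# Rate constants and initial data as in Fig. 6 (Top Left panel).
k1, km1, k2, k3, km3 = 1.0, 1.0, 1.0, 1.0, 1.0
i0, s0, e0 = 1.0, 1.0, 1.0

def mass_action(t, x):
    # Full uncompetitive inhibition system (65)
    s, c1, c2 = x
    v1 = k1 * (e0 - c1 - c2) * s
    return [-v1 + km1 * c1,
            v1 - (km1 + k2) * c1 - k3 * (i0 - c2) * c1 + km3 * c2,
            k3 * (i0 - c2) * c1 - km3 * c2]

def qss(t, y):
    # Fenichel/QSS reduction (68)
    s = y[0]
    return [-k1 * e0 * k2 * km3 * s / ((k1 * s + k2 + km1) * km3 + i0 * k1 * k3 * s)]

T = 50.0  # total integration time (arbitrary units)
t_eval = np.linspace(0.0, T, 2000)
full = solve_ivp(mass_action, (0.0, T), [s0, 0.0, 0.0], t_eval=t_eval, rtol=1e-8, atol=1e-10)
red = solve_ivp(qss, (0.0, T), [s0], t_eval=t_eval, rtol=1e-8, atol=1e-10)
print("max |s_full - s_QSS| =", np.max(np.abs(full.y[0] - red.y[0])))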

6.2.1 Asymptotic Small Parameters

On \(\widetilde{Y}\cap K\), we have at \(\pi =\widehat{\pi }\)

$$\begin{aligned} \begin{array}{rcl} \sigma _1&{}=&{}k_1s+k_{-1}+k_2+k_3i_0+k_{-3},\\ \sigma _2&{}=&{} k_1s(k_3i_0+k_{-3})+(k_{-1}+k_2)k_{-3},\\ \widehat{\sigma }_3&{}=&{} k_2k_1e_0^*k_{-3}. \end{array} \end{aligned}$$

As always, we assume that all the parameters are contained in a suitable compact subset of parameter space; in particular, they are bounded above by positive constants. The TFPV property requires, in addition, that \(\sigma _1\) and \(\sigma _2\) are bounded below on \(K\cap \widetilde{Y}\) by positive constants, thus

$$\begin{aligned} \begin{array}{rcl} k_3i_0+k_{-3}+k_{-1}+k_2&{}=&{}\min \sigma _1>0,\\ (k_{-1}+k_2)k_{-3}&{}=&{}\min \sigma _2>0, \end{array} \end{aligned}$$

and therefore the TFPV conditions hold if and only if both \(k_{-1}+k_2\) and \(k_{-3}\) are bounded below by positive constants. Moreover, the reduction (68) should be significantly different from a trivial equation, and hence one also requires \(k_2\) to be bounded below by some positive constant. No lower bound for \(k_3i_0\) is imposed by the TFPV conditions, but note that (68) approaches the Michaelis–Menten equation as \(i_0\rightarrow 0\) or \(k_3\rightarrow 0\). We will discuss this scenario below.

As before, we will obtain usable estimates for the timescale ratio from Propositions 2, 3 and 4. For uncompetitive inhibition, the maxima can be determined explicitly.

The distinguished small parameter \(\varepsilon ^*\), with \(\sigma _1,\sigma _2\) and \(\widehat{\sigma }_3\) evaluated at \(\pi =\widehat{\pi }\), may be determined from

$$\begin{aligned} \begin{array}{rcl} \varepsilon ^* &{}=&{}\varepsilon \max _{0\le s\le s_0} \frac{\widehat{\sigma }_3(s,\widehat{\pi },\rho ,0)}{\sigma _1(s,\widehat{\pi })\sigma _2(s,\widehat{\pi })}\\ &{}=&{}\varepsilon \frac{\widehat{\sigma }_3(0,\widehat{\pi },\rho ,0)}{\sigma _1(0,\widehat{\pi })\sigma _2(0,\widehat{\pi })}\\ &{}=&{}\frac{k_2k_1e_0}{(k_{-1}+k_2)^2}\cdot \frac{k_{-1}+k_2}{k_3i_0+k_{-3}+k_{-1}+k_2}\\ &{}=&{}\varepsilon _{MM}\cdot \frac{k_{-1}+k_2}{k_3i_0+k_{-3}+k_{-1}+k_2} =:\varepsilon _{U}, \end{array} \end{aligned}$$
(69)

with the distinguished parameter \(\varepsilon _{MM}\) from the MM reaction mechanism. To see why the first equality sign in (69) holds, one may determine the derivative of the function being maximized and verify that it is negative for \(s\ge 0\).

It is straightforward to verify that all eigenvalues are real, since \(\sigma _1^2-4\sigma _2 \ge 0\). Thus, from \(\sigma _1,\sigma _2\) and \(\widehat{\sigma }_3\) evaluated at \(\widehat{\pi }\), the parameter \(\mu ^*\) is obtained from

$$\begin{aligned} \begin{array}{rcl} \mu ^* &{}=&{} \varepsilon \max _{0\le s\le s_0} \frac{\widehat{\sigma }_3(s,\widehat{\pi },\rho ,0)\sigma _1(s,\widehat{\pi })}{\sigma _2(s,\widehat{\pi })^2}\\ &{}=&{}\frac{k_2k_1e_0}{(k_{-1}+k_2)^2}\cdot \bigg (\frac{k_3i_0+k_2+k_{-1}+k_{-3}}{k_{-3}}\bigg )\\ &{}=&{} \varepsilon _{MM}\cdot \bigg (\frac{k_3i_0+k_2+k_{-1}+k_{-3}}{k_{-3}}\bigg ) =:\mu _U. \end{array} \end{aligned}$$
(70)

Note that the first equality holds, because the derivative of

$$\begin{aligned} s\mapsto \displaystyle {\frac{\widehat{\sigma }_3(s,\widehat{\pi },\rho ,0)\sigma _1(s,\widehat{\pi })}{\sigma _2(s,\widehat{\pi })^2}} \end{aligned}$$

is negative for all \(s\ge 0\).
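
The parameters \(\varepsilon _{U}\) and \(\mu _U\) are elementary functions of the rate constants and can be evaluated directly. A short Python sketch (our own illustration of formulas (69) and (70); the function names are ours) is:

# Distinguished parameters for uncompetitive inhibition, Eqs. (69) and (70).
def eps_mm(e0, k1, km1, k2):
    return k1 * k2 * e0 / (km1 + k2) ** 2

def eps_u(e0, k1, km1, k2, k3, km3, i0):
    return eps_mm(e0, k1, km1, k2) * (km1 + k2) / (k3 * i0 + km3 + km1 + k2)

def mu_u(e0, k1, km1, k2, k3, km3, i0):
    return eps_mm(e0, k1, km1, k2) * (k3 * i0 + k2 + km1 + km3) / km3

# With the unit rate constants of Fig. 6 and e0 = 1 this yields
# eps_u = 0.125 and mu_u = 1.0, matching the values quoted there.
print(eps_u(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0), mu_u(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))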

Fig. 6

Uncompetitive inhibition reaction mechanism: Numerical simulations indicate that the accuracy of (68) improves as both \(\varepsilon _{U} \rightarrow 0\) and \(\mu _U\rightarrow 0\) along the parameter ray direction. In both panels, the parameters (in arbitrary units) are: \(s_0=1.0\), \(k_1=1.0\), \(k_2=1.0\), \(k_{-1}=1.0\), \(k_3=1.0\), \(k_{-3}=1.0\) and \(i_0=1.0\). The solid black curve is the numerical solution for s to the mass action system (65). The broken red curve is the numerical solution to (68). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top Left panel: Simulation performed with \(e_0=1.0\). There is visible error with \(\varepsilon _{U}=1.25\times 10^{-1}\) and \(\mu _U= 1.0\). Top Right panel: Simulation performed with \(e_0=10^{-1}\). Although there is visible error with \(\mu _U= 10^{-1}\) and \(\varepsilon _{U}=1.25 \times 10^{-2}\), the approximation (68) does appear to be improving along the parameter ray direction. Bottom Left panel: Simulation performed with \(e_0=10^{-2}\) and thus \(\mu _U= 10^{-2}\) and \(\varepsilon _{U}=1.25 \times 10^{-3}\). The QSS reduction (68) is nearly indistinguishable from (65). Bottom Right panel: Simulation performed with \(e_0=10^{-3}\) with \(\mu _U= 10^{-3}\) and \(\varepsilon _{U}=1.25 \times 10^{-4}\). The QSS reduction (68) is again practically indistinguishable from (65). Note that \(\mu _U\ll 1\) is still a better indicator of accuracy than \(\varepsilon _{U}\ll 1\) (Color figure online)

6.2.2 Numerical Simulations

We now turn to numerical simulations, with the same dual motivation as in Sect. 6.1. Parallel to our analysis of (56) and (58), we discuss the reliability of the qualifiers \(\varepsilon _{U}\ll 1\) and \(\mu _U \ll 1\) in gauging the validity of (68):

  1. 1.

    We begin with the special case \(\pi =(e_0,1,1,1,1,1,1,1)\), representing a scenario where all parameters except \(e_0\) are equal to 1, and vary \(e_0\) from 1 to \(10^{-3}\). The results are reported in Fig. 6, and collectively support the statement that (68) holds when \(\mu _U\) is sufficiently less than 1. With all “non-small” parameters of the same order, one also sees that sufficiently small \(\varepsilon _{U}\) suffices.

  2. 2.

    Parallel to our analysis of the cooperative reaction, we next consider parameters with widely disparate magnitude. In this simulation, the accuracy of (68) improves only as \(\mu _U \rightarrow 0\), and this illustrates the relevance of \(\mu _U\) as the dimensionless parameter that indicates the accuracy of (68); see Fig. 7.

Fig. 7

Uncompetitive inhibition reaction mechanism: The accuracy of (68) is reflected in the magnitude of the dimensionless parameter \(\mu _U\). The solid black curve is the numerical solution for s to (65). The broken red curve is the numerical solution to (68). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). The parameters (in arbitrary units) are: \(s_0=10^1\), \(k_{-3}=10^{-1}\), \(k_3=10^1\), \(i_0=10^1\), \(k_1=1.0\), \(k_2=k_{-1}=10^3\). Top Left panel: \(e_0=1.0\) and \(\varepsilon _{U}\approx 2.38 \times 10^{-4}, \;\;\mu _U\approx 5.25\). Top Right panel: \(e_0=10^{-1}\) and \(\varepsilon _{U}\approx 2.38 \times 10^{-5}, \;\;\mu _U\approx 5.25 \times 10^{-1}\). Bottom Left panel: \(e_0=10^{-2}\) and \(\varepsilon _{U}\approx 2.38 \times 10^{-6}, \;\; \mu _U\approx 5.25 \times 10^{-2}\). Bottom Right panel: \(e_0=10^{-3}\) and \(\varepsilon _{U}=2.38 \times 10^{-7}, \;\;\mu _U\approx 5.25 \times 10^{-3}\). Note that the solutions to (65) and (68) are virtually indistinguishable in the last panel (Color figure online)

6.2.3 Near-Invariance

As in the case of the cooperative reaction mechanism, near-invariance scenarios also exist for uncompetitive inhibition. Setting \(e_0=i_0=0\) (also) yields a TFPV for dimension one, viz.

$$\begin{aligned} \widehat{\bar{\pi }}:=(0,s_0,k_1,k_{-1},k_2,k_3,k_{-3},0)^\textrm{tr}, \end{aligned}$$

with the same associated critical manifold \(\widetilde{Y}\), defined by \(c_1=c_2=0\). We fix a further reference value \(i_0^*\) and consider the ray direction \(\rho ^\dagger =(e_0^*,0,\ldots ,0,i_0^*)^\textrm{tr}\). Then, the perturbed system with \(\pi =\widehat{\bar{\pi }}+\varepsilon \rho ^\dagger \) has the form

$$\begin{aligned} \begin{pmatrix}\dot{s}\\ \dot{c}_1\\ \dot{c}_2\end{pmatrix}= \begin{pmatrix}k_1s+k_{-1}&k_1s\\ -k_1s-(k_{-1}+k_2)+k_3c_2 & -k_1s+k_{-3}\\ -k_3c_2&k_{-3}\end{pmatrix}\begin{pmatrix}c_1\\ c_2\end{pmatrix} + \varepsilon \begin{pmatrix}-k_1e_0^{*}s\\ \;\;k_1e_0^{*}s-k_3i_0^{*}c_1\\ k_3i_0^{*}c_1\end{pmatrix}. \end{aligned}$$
(71)

Applying the reduction according to (9) yields

$$\begin{aligned} \begin{pmatrix}\dot{s}\\ \dot{c}_1\\ \dot{c}_2\end{pmatrix}=\varepsilon \begin{pmatrix}1 & \frac{k_1s+k_{-1}}{k_1s+k_{-1}+k_2} & \frac{(k_1s+k_{-1})k_{-3}+k_1k_2s}{k_{-3}(k_1s+k_{-1}+k_2)} \\ 0 & 0 & 0 \\ 0&0&0\end{pmatrix} \begin{pmatrix}-k_1e_0^{*}s\\ \;\;k_1e_0^{*}s\\ 0\end{pmatrix} \end{aligned}$$

and thus, with \(\dot{c}_1=\dot{c}_2=0\),

$$\begin{aligned} \dot{s}=-\frac{k_1e_0k_2s}{k_1s+k_2+k_{-1}},\quad s(0)=s_0, \end{aligned}$$
(72)

which is valid asymptotically as \(\varepsilon \rightarrow 0\). Here, we recover the familiar Michaelis–Menten equation in the limit when the concentrations of both enzyme and inhibitor approach zero of order \(\varepsilon \). (The same reduction is obtained for \(k_3=\varepsilon k_3^*\) and \(e_0=\varepsilon e_0^*\).)

From a different perspective, when the term \(k_3i_0\) vanishes, the subspace \(W:=\{(s,c_1,c_2)\in \mathbb {R}^3: c_2=0\}\) is invariant, and a slight perturbation (not necessarily of order \(\varepsilon \)) results in the near-invariance of W. Considering the expressions (69) and (70) for \(\varepsilon ^*\) and \(\mu ^*\), respectively, one sees that \(k_3i_0\rightarrow 0\) has no strong effect on these parameters and that \(\varepsilon _{MM}\) is a good upper estimate for \(\varepsilon _U\). One may rewrite (68) as

$$\begin{aligned} \dot{s} = -\frac{k_1e_0k_2s}{k_1s(1+ k_3i_0/k_{-3})+k_{-1}+k_2}, \end{aligned}$$
(73)

thus, when \(k_3i_0/k_{-3} \ll 1\), the standard Michaelis–Menten reduction is approximately valid. In this case, the dynamics are effectively two-dimensional. Hence (for the given initial values) the magnitude of \(\mu _U\) is irrelevant, and (68) will hold even if \(1< k_1e_0/k_{-3}\) and \(1< \mu _U\),Footnote 20 since \(\varepsilon _{MM} \ll 1\) automatically ensures the validity of (68) when W is nearly invariant (see Fig. 8).

Fig. 8

Uncompetitive inhibition reaction mechanism: Near-invariance may lead to scenarios in which the reduction (68) is accurate even when \(1 < \mu _U\). The solid black curve is the numerical solution to (65). The broken red curve is the numerical solution to (68). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). The parameters (in arbitrary units) are: \(k_1=1.0\), \(e_0=1.0\), \(k_{-3}=10^{-1}\), \(k_3=10^{-2}\), \(i_0=10^{-3}\), \(k_2=k_{-1}=10^2\) with \(\varepsilon _U\approx 2.5 \times 10^{-3}\) and \(\mu _U\approx 5.0.\) Left panel: \(s_0=10.0\). Right panel: The long-time validity is verified with \(s_0=10^{-1}\). Note that \(k_1e_0/k_{-3}\) is large, thus \(\mu _U\) is of order one. However, since \(k_3i_0/k_{-3} = 10^{-2}\) and \(c_1(0)=c_2(0)=0\), the dynamics prior to reduction are essentially two-dimensional. Consequently, (68) holds since \(\varepsilon _{MM} \ll 1\) (Color figure online)

6.3 Competitive Inhibition Reaction Mechanism

The irreversible competitive inhibition reaction mechanism

$$\begin{aligned} \begin{array}{rcccl} E+S&{}\overset{k_1}{\underset{k_{-1}}{\rightleftharpoons }}&{} C_1&{}\overset{k_2}{\rightharpoonup }\ {} &{}E+P,\\ E+I&{}\overset{k_{3}}{\underset{k_{-3}}{\rightleftharpoons }}&{} C_2&{} &{} \end{array} \end{aligned}$$
(74)

corresponds (with mass action kinetics and stoichiometric conservation laws) to the ODE system

$$\begin{aligned} \begin{array}{rclclclclcl} \dot{s}=&{}-&{} k_1(e_0-c_1-c_2)s&{}+&{}k_{-1}c_1,&{} \\ \dot{c}_1=&{}&{} k_1(e_0-c_1-c_2)s&{}-&{}(k_{-1}+k_2)c_1,&{} \\ \dot{c}_2=&{} &{} k_3 (e_0-c_1-c_2) (i_0-c_2) &{}- &{}k_{-3} c_2. \end{array} \end{aligned}$$
(75)

The usual initial conditions are \(s(0)=s_0,\,e(0)=e_0,\,i(0)=i_0\) and \(c_1(0)=c_2(0)=p(0)=0\). We fix a reference value \(e_0^*\), and then from the conservation laws we obtain the compact positively invariant set

$$\begin{aligned} K:=\{(s,c_1,c_2)\in \mathbb {R}_{\ge 0}^3: 0\le s\le s_0,\;\; 0\le c_1\le e_0^*,\;\; 0\le c_2\le \min \{e_0^*,\,i_0\}\}. \end{aligned}$$

The parameter space \(\Pi =\mathbb R_{\ge 0}^8\) has elements

$$\begin{aligned} (e_0,s_0,i_0,k_1,k_{-1},k_2,k_3,k_{-3})^\textrm{tr}, \end{aligned}$$

and it is known that \(e_0=0\), with all other parameters positive, defines a TFPV,

$$\begin{aligned} \widehat{\pi }:= (0,s_0,i_0,k_1,k_{-1},k_2,k_3,k_{-3})^\textrm{tr} \end{aligned}$$
(76)

with corresponding critical manifold

$$\begin{aligned} \widetilde{Y}:=\{(s,c_1,c_2)\in \mathbb {R}^3_{\ge 0}: c_1=c_2=0\}. \end{aligned}$$

(See below for nondegeneracy conditions on the remaining parameters.) We choose the parameter ray direction \(\rho =(e_0^*,0,\ldots ,0)^\textrm{tr}\), with \(e_0=\varepsilon e_0^*\). The singular perturbation reduction (see Goeke and Walcher 2014, Section 3.2) yields the equation

$$\begin{aligned} \dot{s}=-\dfrac{k_1k_{-3}k_2e_0s}{(k_1s+k_{-1}+k_2)k_{-3} + k_3i_0(k_{-1}+k_2)}. \end{aligned}$$
(77)

The reduced equation (77) has been previously reported in the literature (see, e.g., Schnell and Mendoza 2001). Note that the reduction (77) again differs from the classical QSS reduction (see, e.g., Keener and Sneyd 2009, Section 1.4.3). However, (77) and the classical reduction agree up to a term of order \(\varepsilon ^2\) and are therefore asymptotically equivalent.

6.3.1 Asymptotic Small Parameters

The coefficients of the characteristic polynomial on the critical manifold are

$$\begin{aligned} \begin{array}{rcl} \sigma _1&{}=&{} k_1s+k_{-1}+k_2+k_3i_0+k_{-3};\\ \sigma _2&{}=&{} k_{-3}k_1s+(k_3i_0+k_{-3})(k_{-1}+k_2);\\ \widehat{\sigma }_3&{}=&{}k_2k_1e_0^*\cdot \left( k_3i_0+k_{-3}\right) . \end{array} \end{aligned}$$

We generally assume that all parameters are contained in a compact subset of the positive orthant, hence are bounded above by certain positive constants. Moreover \(\sigma _1(\widehat{\pi },s)\) and \( \sigma _2(\widehat{\pi },s)\) satisfy the TFPV property

$$\begin{aligned} \begin{array}{rcl} k_3i_0+k_{-3}+k_{-1}+k_2&{}=&{}\min \sigma _1>0,\\ (k_{-1}+k_2)(k_3i_0+k_{-3})&{}=&{}\min \sigma _2>0 \end{array} \end{aligned}$$

if and only if \(k_{-1}+k_2\) and \(k_3i_0+k_{-3}\) are bounded below by certain positive constants. More restrictively, we will assume that \(i_0\) is bounded below by some positive constant. Finally, \(k_2\) and \(k_{-3}\) should be bounded below by positive constants, lest the reduced equation (77) be too close to trivial.

With \(\sigma _1, \sigma _2\) and \(\widehat{\sigma }_3\) evaluated at \(\widehat{\pi }\), we obtain the distinguished small parameter

$$\begin{aligned} \begin{array}{rcl} \varepsilon ^* &=& \varepsilon \displaystyle \sup _{\widetilde{Y} \cap K} \frac{\widehat{\sigma }_3(s,\widehat{\pi },\rho ,0)}{\sigma _1(s,\widehat{\pi })\sigma _2(s,\widehat{\pi })}\\ &=& \frac{k_2k_1e_0}{(k_{-1}+k_2)^2}\cdot \frac{k_{-1}+k_2}{k_{-1}+k_2+k_3i_0+k_{-3}}\\ &=& \varepsilon _{MM}\cdot \frac{k_{-1}+k_2}{k_{-1}+k_2+k_3i_0+k_{-3}}=:\varepsilon _I \end{array} \end{aligned}$$
(78)

To verify the equalities, note that \(\widehat{\sigma }_3\) is constant while \(\sigma _1,\,\sigma _2\) are increasing with s.

It is straightforward to check that \(\sigma _1^2-4\sigma _2 \ge 0\), thus all eigenvalues are real, and Proposition 3 is applicable. Determining the parameter \(\mu ^*\) requires a distinction of cases. The derivative of

$$\begin{aligned} s\mapsto q(s):=\frac{\widehat{\sigma }_3(s,\widehat{\pi },\rho ,0)\sigma _1(s,\widehat{\pi })}{\sigma _2(s,\widehat{\pi })^2} \end{aligned}$$

is a rational function in s with numerator of degree one. Both coefficients are negative if and only if

$$\begin{aligned} 2k_{-3}(k_3i_0+k_{-3}) +k_{-3} (k_{-1}+k_2) \ge (k_{-1}+k_2)i_0k_3, \end{aligned}$$
(79)

otherwise they have opposite signs. Note that (79) is satisfied whenever \((k_3i_0)/(k_{-3}) \le 1\). This inequality admits a direct interpretation in terms of the reaction mechanism. On the one hand, it places a lower bound on the allowable size of \(k_{-3}\); on the other hand, and more importantly, it holds whenever the inhibitor concentration is not too high, and is thus controllable by experimental design.

When (79) holds then \(s\mapsto q(s)\) is strictly decreasing for \(s\ge 0\), and

$$\begin{aligned} \mu ^*= \mu _I^{(1)}:=\varepsilon \frac{\widehat{\sigma }_3(0,\widehat{\pi },\rho ,0)\sigma _1(0,\widehat{\pi })}{\sigma _2(0,\widehat{\pi })^2}=\varepsilon _{MM}\cdot \frac{k_{-1}+k_2+k_3i_0+k_{-3}}{k_3i_0+k_{-3}}. \end{aligned}$$
(80)

Whenever (79) does not hold, a straightforward calculation shows that the maximum of \(s\mapsto q(s)\) for \(0\le s<\infty \) is given by

$$\begin{aligned} \begin{array}{rcl} \mu ^*=\mu _I^{(2)} &{} =&{} \dfrac{k_2k_1e_0\cdot (k_3i_0+k_{-3})}{4k_{-3}\cdot \bigg (k_3i_0(k_{-1}+k_2)-k_{-3}(k_3i_0+k_{-3})\bigg )}\\ &{}=&{}\varepsilon _{MM}\cdot \dfrac{(k_{-1}+k_2)^2\cdot (k_3i_0+k_{-3})}{4k_{-3}\cdot \bigg (k_3i_0(k_{-1}+k_2)-k_{-3}(k_3i_0+k_{-3})\bigg )}.\\ \end{array} \end{aligned}$$
(81)

This expression is somewhat unwieldy. But \(\mu _I^{(2)}\) admits an obvious lower bound, obtained by discarding the negative term in the denominator:

$$\begin{aligned} \mu _I^{(2)} \ge \frac{1}{4}\cdot \frac{k_1k_2e_0}{k_{-3}(k_{-1}+k_2)}\cdot \bigg (1+\frac{k_{-3}}{k_3i_0}\bigg ). \end{aligned}$$

Moreover, the negation of (79) provides an estimate for the denominator which yields an upper bound

$$\begin{aligned} \mu _I^{(2)}\le \frac{1}{4}\cdot \frac{k_2k_1e_0(k_3i_0+k_{-3})}{k_{-3}^2(k_3i_0+k_{-3}+k_{-1}+k_2)}. \end{aligned}$$

The lower bound shows that it is necessary to require \(k_1e_0 \ll \min \{k_{-3},k_3i_0\}\) whenever \(k_{-1}\) and \(k_2\) are of the same order.

Finally, whenever \(s_0\) is not too large one may also consider the estimate

$$\begin{aligned} \begin{array}{rcl} \mu ^* &{}\le &{} \varepsilon \frac{\widehat{\sigma }_3(s_0,\widehat{\pi },\rho ,0)\sigma _1(s_0,\widehat{\pi })}{\sigma _2(0,\widehat{\pi })^2}\\ &{}=&{}{\frac{k_2k_1e_0}{(k_{-1}+k_2)^2}} \cdot \frac{k_1s_0+k_{-1}+k_2+k_3i_0+k_{-3}}{k_3i_0+k_{-3}}\\ &{}=&{}\varepsilon _{MM} \cdot \frac{k_1s_0+k_{-1}+k_2+k_3i_0+k_{-3}}{k_3i_0+k_{-3}}=:\widetilde{\mu }_I, \end{array} \end{aligned}$$
(82)

which is a direct consequence of monotonicity properties of the \(\sigma _j\). This inequality is exact whenever (79) does not hold and \(s_0\) is smaller than the argument of \(\max q\).
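
Since the choice between \(\mu _I^{(1)}\) and \(\mu _I^{(2)}\) depends on (79), it is convenient to automate the case distinction. The following Python sketch (our own illustration of (78)–(81); the function names are ours) does this and should reproduce the values quoted in the captions of Figs. 9 and 11:

# Distinguished parameters for competitive inhibition, Eqs. (78)-(81).
def eps_i(e0, k1, km1, k2, k3, km3, i0):
    eps_mm = k1 * k2 * e0 / (km1 + k2) ** 2
    return eps_mm * (km1 + k2) / (km1 + k2 + k3 * i0 + km3)

def mu_i(e0, k1, km1, k2, k3, km3, i0):
    eps_mm = k1 * k2 * e0 / (km1 + k2) ** 2
    if 2 * km3 * (k3 * i0 + km3) + km3 * (km1 + k2) >= (km1 + k2) * i0 * k3:  # condition (79)
        return eps_mm * (km1 + k2 + k3 * i0 + km3) / (k3 * i0 + km3)          # mu_I^(1), Eq. (80)
    return (eps_mm * (km1 + k2) ** 2 * (k3 * i0 + km3)
            / (4 * km3 * (k3 * i0 * (km1 + k2) - km3 * (k3 * i0 + km3))))     # mu_I^(2), Eq. (81)

# Fig. 9 (unit parameters, e0 = 1): eps_I = 0.125, mu_I^(1) = 0.5
print(eps_i(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0), mu_i(1.0, 1.0, 1.0, 1.0, 1.0, 1.0, 1.0))
# Fig. 11 (e0 = 1): condition (79) fails, and mu_I^(2) is approximately 12.6
print(mu_i(1.0, 1.0, 1e2, 1e2, 1.0, 1e-2, 1.0))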

Fig. 9

Competitive inhibition reaction mechanism: With parameters of unit magnitude, and (79) valid, numerical simulations indicate that the accuracy of (77) improves along the parameter ray as both \(\varepsilon _I \rightarrow 0\) and \(\mu _I^{(1)}\rightarrow 0\). In all panels, the parameters (in arbitrary units) are: \(s_0=1.0\), \(k_1=1.0\), \(k_2=1.0\), \(k_{-1}=1.0\), \(k_3=1.0\), \(k_{-3}=1.0\) and \(i_0=1.0\). The solid black curve is the numerical solution to the mass action system (75). The broken red curve is the numerical solution to (77). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top Left panel: \(e_0=1.0\) with \(\varepsilon _I=1.25 \times 10^{-1}\) and \(\mu _I^{(1)}=5 \times 10^{-1}\). Top Right panel: \(e_0=10^{-1}\) with \(\varepsilon _I=1.25 \times 10^{-2}\) and \(\mu _I^{(1)}=5 \times 10^{-2}\). Bottom Left panel: \(e_0=10^{-2}\) with \(\varepsilon _I=1.25 \times 10^{-3}\) and \(\mu _I^{(1)}=5 \times 10^{-3}\). The reduction (77) is nearly indistinguishable from (75). Bottom Right panel: \(e_0=10^{-3}\) with \(\varepsilon _I=1.25 \times 10^{-4}\) and \(\mu _I^{(1)}=5 \times 10^{-4}\). The QSS reduction (77) is again practically indistinguishable from (75) (Color figure online)

6.3.2 Numerical Simulations

Generally, by Fenichel theory the accuracy of the reduction (77) improves along the parameter ray as \(\varepsilon _I\rightarrow 0\) and \(\mu _I^{(i)}\rightarrow 0\), for \(i=1,2\), respectively. Continuing the procedure employed in the previous case studies, we illustrate the efficacy of the qualifiers \(\varepsilon _I\ll 1, \mu _I^{(i)} \ll 1\) (with appropriate index i) with several numerical simulations:

  1. 1.

    For our first example, we once again consider the case in which all parameters except \(e_0\) are equal, which is representative of parameters of the same magnitude. Numerical simulations confirm that the accuracy of (77) improves as \(\varepsilon _I\rightarrow 0\) and \(\mu _I^{(1)} \rightarrow 0\) (see Fig. 9).

  2. 2.

    In our second example, we demonstrate the effectiveness of \(\varepsilon _{I}\) and \(\mu _I^{(i)}\) with parameters that have disparate magnitudes. We observe that \(\mu _I^{(1)}\) is the definitive indicator of the accuracy of (77) when (79) holds, while \(\mu _I^{(2)}\) is the indicator of the accuracy of (77) whenever (79) fails, reflecting the fact that one eigenvalue must have much smaller absolute value than the other two (see Figs. 10, 11).

Fig. 10

Competitive inhibition reaction mechanism: When parameter values are disparate in magnitude, numerical simulations indicate that the accuracy of (77) improves along the parameter ray as \(\mu _I^{(1)} \rightarrow 0\) when (79) holds. In all panels, the parameters (in arbitrary units) are: \(s_0=10.0\), \(k_1=1.0\), \(k_2=k_{-1}=10^2\), \(k_3=k_{-3}=10^{-1}\) and \(i_0=1.0\). The solid black curve is the numerical solution for s to the mass action system (75). The broken red curve is the numerical solution to (77). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top Left panel: \(e_0=1.0\), \(\varepsilon _I\approx 2.5\times 10^{-3}\), and \(\mu _I^{(1)}\approx 2.5\). Top Right panel: \(e_0=10^{-1}\), \(\varepsilon _I\approx 2.5\times 10^{-4}\), and \(\mu _I^{(1)}\approx 2.5\times 10^{-1}\). Bottom Left panel: \(e_0=10^{-2}\), \(\varepsilon _I\approx 2.5\times 10^{-5}\), and \(\mu _I^{(1)}\approx 2.5 \times 10^{-2}\). Bottom Right panel: \(e_0=10^{-3}\), \(\varepsilon _I\approx 2.5\times 10^{-6}\), \(\mu _I^{(1)} \approx 2.5\times 10^{-3}\) and the QSS reduction (77) is nearly indistinguishable from (75). Collectively, these simulations indicate that \(\mu _I^{(1)} \ll 1\) is the qualifier that ensures the validity of (77) when (79) holds (Color figure online)

Fig. 11

Competitive inhibition reaction mechanism: When parameter values are disparate in magnitude, numerical simulations indicate that the accuracy of (77) improves along the search direction as \(\mu _I^{(2)} \rightarrow 0\) when (79) fails. In all panels, the parameters (in arbitrary units) are: \(s_0=10.0\), \(k_1=1.0\), \(k_2=k_{-1}=10^2\), \(k_{-3}=10^{-2}\) and \(k_3=i_0=1.0\). The solid black curve is the numerical solution for s to the mass action system (75). The broken red curve is the numerical solution to (77). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top Left panel: \(e_0=1.0\), \(\varepsilon _I\approx 2.5\times 10^{-3}\), and \(\mu _I^{(2)}\approx 12.6\). Top Right panel: \(e_0=10^{-1}\), \(\varepsilon _I\approx 2.5\times 10^{-4}\), and \(\mu _I^{(2)}\approx 12.6 \times 10^{-1}\). Bottom Left panel: \(e_0=10^{-2}\), \(\varepsilon _I\approx 2.5\times 10^{-5}\), and \(\mu _I^{(2)}\approx 12.6 \times 10^{-2}\). Bottom Right panel: \(e_0=10^{-3}\), \(\varepsilon _I\approx 2.5\times 10^{-6}\), and \(\mu _I^{(2)}\approx 12.6 \times 10^{-3}\). Observe the QSS reduction (77) is nearly indistinguishable from (75), which indicates that \(\mu _I^{(2)} \ll 1\) is the qualifier that ensures the validity of (77) when (79) fails (Color figure online)

6.3.3 Near-Invariance

As in the previous sections, we now discuss special instances of near-invariance. The inhibitory mechanism can be turned off by requiring \(k_3i_0=0\), which implies that, for sufficiently small \(k_3\) or \(i_0\), the subspace \(U:=\{(s,c_1,c_2)\in \mathbb {R}^3: c_2=0\}\) will be nearly invariant. One perspective is to define the parameter ray by \(k_{3}= \varepsilon k_3^*\) and \(e_0 = \varepsilon e_0^*\). Then, the perturbation form of the mass action equations is

$$\begin{aligned} \begin{pmatrix} \dot{s}\\ \dot{c}_1\\ \dot{c}_2 \end{pmatrix}=\begin{pmatrix} k_1s+k_{-1} & \;\;k_1s \\ -k_1s -(k_{-1}+k_2) & -k_1s\\ 0 & -k_{-3}\end{pmatrix}\begin{pmatrix} c_1 \\ c_2\end{pmatrix} + \varepsilon \begin{pmatrix}-k_1e_0^*s\\ k_1e_0^*s\\ -k_3^*( c_1+c_2)(i_0-c_2)\end{pmatrix}+\mathcal {O}(\varepsilon ^2). \end{aligned}$$

The QSS reduction is obtained by projecting the leading order perturbation onto the critical manifold, thus

$$\begin{aligned} \begin{pmatrix}\dot{s}\\ \dot{c}_1\\ \dot{c}_2 \end{pmatrix}= \varepsilon \begin{pmatrix}1 & \frac{k_1s+k_{-1}}{k_1s+k_{-1}+k_2} & \frac{k_1k_2s}{(k_1s+k_{-1}+k_2)k_{-3}} \\ 0 & 0 & 0\\ 0&0&0\end{pmatrix}\begin{pmatrix}-k_1e_0^*s\\ \;\;k_1e_0^*s\\ 0\end{pmatrix}, \end{aligned}$$

from which we recover

$$\begin{aligned} \dot{s} = -\frac{k_1k_2e_0s}{k_1s+k_{-1}+k_2}, \end{aligned}$$

i.e., the sQSSA of the MM reaction mechanism. This is not surprising. With initial conditions \(s(0)=s_0,c_1(0)=c_2(0)=0\), the dynamics are approximately two-dimensional. From a different perspective (taking independent limits), we can write (77) as

$$\begin{aligned} \dot{s} = -\frac{k_1k_2e_0s}{k_1s + (k_{-1}+k_2)\bigg (1+\frac{k_3i_0}{k_{-3}}\bigg )}, \end{aligned}$$

from which it is clear by inspection that the sQSSA is recovered from (77) whenever \(k_3i_0/k_{-3} \ll 1\). Consequently, in this regime we need only consider the magnitude of \(\varepsilon _{I}\) to ascertain the accuracy of (77) (see Fig. 12).
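
As a quick numerical check of this observation (our own illustration, using the parameter values of Fig. 12), one may compare the rate law (77) with its Michaelis–Menten limit over the relevant substrate range:

import numpy as np

# Parameters as in Fig. 12; here k3*i0/km3 is very small.
e0, k1, km1, k2, k3, km3, i0 = 10.0, 1.0, 1e3, 1e3, 1e-7, 1e-1, 1.0

s = np.linspace(0.0, 10.0, 1001)[1:]  # exclude s = 0 to avoid division by zero
rate_ci = k1 * k2 * e0 * s / (k1 * s + (km1 + k2) * (1.0 + k3 * i0 / km3))  # Eq. (77), rewritten form
rate_mm = k1 * k2 * e0 * s / (k1 * s + km1 + k2)                            # sQSSA of the MM mechanism
print("max relative difference:", np.max(np.abs(rate_ci - rate_mm) / rate_mm))
# The difference is of order k3*i0/km3, so (77) and the sQSSA are practically identical here.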

Fig. 12

Competitive inhibition reaction mechanism: If \(k_3i_0/k_{-3}\ll 1\), the subspace U is nearly invariant, and (77) will be accurate provided \(\varepsilon _{MM}\ll 1\). In both panels, the parameters (in arbitrary units) are: \(e_0=10.0\), \(k_1=1.0\), \(k_2=k_{-1}=10^{3}\), \(k_3=10^{-7}\), \(k_{-3}=10^{-1}\) and \(i_0=10^0\). It is straightforward to verify that (79) is satisfied, with \(\varepsilon _{MM} = 2.5 \times 10^{-3}\) and \(\mu _I^{(1)} \approx 50\). The solid black curve is the numerical solution for s to the mass action system (75). The broken red curve is the numerical solution to (77). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Left panel: \(s_0=10.0\) and the reduction (77) is very accurate. Right panel: Here \(s_0=10^{-1}\) and the reduction (77) is still accurate; this confirms that the long-time validity of (77) is regulated by the magnitude of \(\varepsilon _{MM}\) whenever \(k_3i_0/k_{-3}\ll 1\) (Color figure online)

7 Case Studies: Reduction from Dimension Three to Two

In this section, we further discuss the uncompetitive and competitive inhibition reaction mechanisms, but now we consider exemplary cases of reduction to dimension two. These scenarios are of less practical relevance than those in the previous section, but we present them for illustrative purposes. We will provide less detailed discussions, and will be content to show the feasibility of the method. The results from Sect. 5, in particular Proposition 5 and Sect. 5.4, will be employed. The determination of distinguished parameters now amounts to finding (or estimating) the maximum and minimum of rational functions in two variables on some compact set.

7.1 Uncompetitive Inhibition Reaction Mechanism

For the uncompetitive inhibition reaction mechanism, (64) and (65), one sees that \(k_1=k_{-3}=0\), with all other parameters contained in some compact subset of the open positive orthant, defines a TFPV \(\widehat{\pi }\), with a two-dimensional critical manifold \(\widetilde{Y}\) given by \(c_1=0\). The TFPV conditions mean that both elementary reactions responsible for the formation of \(C_1\) are slow.

We consider system (65) with initial values \(s(0)=s_0, \,c_1(0)=c_2(0)=0\) on the compact positively invariant set

$$\begin{aligned} K:=\{(s,c_1,c_2)\in \mathbb {R}^3_{\ge 0}: 0 \le s\le s_0,\;\; c_1+c_2 \le e_0, \;\;c_2 \le \min \{e_0,i_0\}\}, \end{aligned}$$

and take the ray direction

$$\begin{aligned} \rho =(0,0,k_1^*,0,0,0,k_{-3}^*,0)^\textrm{tr} \end{aligned}$$

in parameter space, with \(k_i=\varepsilon k_i^*\), \(k_i^*>0\), for \(i\in \{1,\,-3\}\). Straightforward computations yield the reduced system

$$\begin{aligned} \begin{pmatrix}\dot{s}\\ \dot{c}_2\end{pmatrix}=\dfrac{1}{k_{-1}+k_2+{k_{3}}(i_0-c_2)}\begin{pmatrix}-k_1(e_0-c_2)(k_2+k_3(i_0-c_2))s+k_{-3}k_{-1}c_2\\ k_1k_3(e_0-c_2)(i_0-c_2)s-k_{-3}(k_{-1}+k_2)c_2\end{pmatrix},\nonumber \\ \end{aligned}$$
(83)

with initial conditions \(s(0)=s_0,\,c_2(0)=0.\) A straightforward phase plane analysis of system (83) (respectively, of the orbitally equivalent system with the common denominator discarded) shows that every solution in the positive quadrant converges to the stationary point 0.
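
For the simulations reported below, (83) is integrated alongside the full system (65). A minimal Python sketch (our own illustration, assuming NumPy and SciPy; parameters as in the top panels of Fig. 13) is:

import numpy as np
from scipy.integrate import solve_ivp

# Parameters as in Fig. 13 (top panels); k1 = k_{-3} = 1e-1 are the "small" rate constants.
e0, s0, k1, km1, k2, k3, km3, i0 = 2.0, 1.0, 1e-1, 1.0, 1.0, 1.0, 1e-1, 1.0

def full(t, x):
    # Uncompetitive inhibition mass action system (65)
    s, c1, c2 = x
    v1 = k1 * (e0 - c1 - c2) * s
    return [-v1 + km1 * c1,
            v1 - (km1 + k2) * c1 - k3 * (i0 - c2) * c1 + km3 * c2,
            k3 * (i0 - c2) * c1 - km3 * c2]

def reduced(t, x):
    # Two-dimensional reduction (83)
    s, c2 = x
    d = km1 + k2 + k3 * (i0 - c2)
    return [(-k1 * (e0 - c2) * (k2 + k3 * (i0 - c2)) * s + km3 * km1 * c2) / d,
            (k1 * k3 * (e0 - c2) * (i0 - c2) * s - km3 * (km1 + k2) * c2) / d]

T = 200.0
t_eval = np.linspace(0.0, T, 4000)
sol_full = solve_ivp(full, (0.0, T), [s0, 0.0, 0.0], t_eval=t_eval, rtol=1e-8, atol=1e-10)
sol_red = solve_ivp(reduced, (0.0, T), [s0, 0.0], t_eval=t_eval, rtol=1e-8, atol=1e-10)
print("max |s_full - s_red| =", np.max(np.abs(sol_full.y[0] - sol_red.y[0])))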

7.1.1 Asymptotic Small Parameters

We now determine dimensionless parameters that gauge the accuracy of (83). For the sake of brevity, we will restrict attention to the case \(e_0>i_0\). The coefficients of the characteristic polynomial on \(\widetilde{Y}\) are given by

$$\begin{aligned} \begin{array}{rcl} \widetilde{\sigma }_1&{}=&{}k_{-1}+k_2+k_3(i_0-c_2)+\varepsilon \,(\cdots )\\ \widetilde{\sigma }_2&{}=&{}k_1\left( k_3(i_0-c_2)(e_0-c_2+s)+k_2(e_0-c_2)\right) +k_{-3}(k_{-1}+k_2)+\varepsilon ^2\,(\cdots )\\ \widetilde{\sigma }_3&{}=&{} k_1k_{-3} k_2(e_0-c_2). \end{array} \end{aligned}$$

Thus,

$$\begin{aligned} \begin{array}{rcl} \sigma _1&{}=&{}k_{-1}+k_2+k_3(i_0-c_2)\\ \widehat{\sigma }_2&{}=&{}k_1^*\left( k_3(i_0-c_2)(e_0-c_2+s)+k_2(e_0-c_2)\right) +k_{-3}^*(k_{-1}+k_2)\\ \widehat{\sigma }_3&{}=&{} k_1^*k_{-3}^* k_2(e_0-c_2), \end{array} \end{aligned}$$

and the first nondegeneracy condition from Lemma 7 is satisfied since \(e_0>i_0\).

According to Propositions 5 and 6 and their proofs, for timescale comparisons we consider the rational function

$$\begin{aligned} q(s, c_2)= \dfrac{\widehat{\sigma }_2(s,c_2,\widehat{\pi },\rho ,0)}{\sigma _1(s,c_2,\widehat{\pi })^2},\quad 0\le s\le s_0, \quad 0\le c_2\le i_0. \end{aligned}$$

Since \(\widehat{\sigma }_2\) decreases with \(c_2\) and increases with s, while \(\sigma _1\) decreases with \(c_2\), we obtain an upper estimate from

$$\begin{aligned} \varepsilon ^*\le \varepsilon \dfrac{\max \widehat{\sigma }_2}{(\min \sigma _1)^2}= \dfrac{k_1(k_3i_0(e_0+s_0)+k_2e_0)+k_{-3}(k_{-1}+k_2)}{(k_{-1}+k_2)^2}=:\delta ^*. \end{aligned}$$

Moreover, from \(\varepsilon ^*\ge \varepsilon \, q(s_0,0)\), we find that

$$\begin{aligned} \dfrac{k_1(k_3i_0(e_0+s_0)+k_2e_0)+k_{-3}(k_{-1}+k_2)}{(k_{-1}+k_2+k_3i_0)^2}\le \varepsilon ^*. \end{aligned}$$

Likewise, we obtain lower timescale estimates from

$$\begin{aligned} \varepsilon _*\ge \varepsilon \dfrac{\min \widehat{\sigma }_2}{(\max \sigma _1)^2}= \dfrac{k_1k_2(e_0-i_0)+k_{-3}(k_{-1}+k_2)}{(k_{-1}+k_2+k_3i_0)^2}=:\delta _*. \end{aligned}$$

Thus, for \(i_0\) not too large, the estimates by \(\delta ^*\) and \(\delta _*\) are quite acceptable.

To estimate the disparity of the slow eigenvalues, according to Sect. 5.4, we consider

$$\begin{aligned} \kappa ^*=\max \dfrac{\sigma _1\widehat{\sigma }_3}{\widehat{\sigma }_2^2}\le \dfrac{(k_{-1}+k_2+k_3i_0)k_1^*k_{-3}^*k_2e_0}{(k_1^*k_2e_0+k_{-3}^*(k_{-1}+k_2))^2}=:\nu ^* \end{aligned}$$

as well as

$$\begin{aligned} \kappa _*=\min \dfrac{\sigma _1\widehat{\sigma }_3}{\widehat{\sigma }_2^2}\ge \dfrac{(k_{-1}+k_2)k_1^*k_{-3}^*k_2(e_0-i_0)}{(k_1^*(k_3i_0(e_0+s_0)+k_2e_0)+k_{-3}^*(k_{-1}+k_2))^2}=:\nu _*. \end{aligned}$$

Whenever \(i_0\) is not too large, these two parameters are close, and so are the slow eigenvalues.
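
The four bounds \(\delta ^*,\,\delta _*,\,\nu ^*\) and \(\nu _*\) are again explicit in the rate constants. The short Python sketch below (our own illustration; note that \(\nu ^*\) and \(\nu _*\) are invariant under a common scaling of \(k_1^*\) and \(k_{-3}^*\), so the actual rate constants may be inserted) should reproduce the values quoted in the caption of Fig. 14:

# Bounds for the dimension-two reduction of uncompetitive inhibition (Sect. 7.1.1).
def dim2_bounds(e0, s0, km1, k2, k3, i0, k1, km3):
    delta_hi = (k1 * (k3 * i0 * (e0 + s0) + k2 * e0) + km3 * (km1 + k2)) / (km1 + k2) ** 2
    delta_lo = (k1 * k2 * (e0 - i0) + km3 * (km1 + k2)) / (km1 + k2 + k3 * i0) ** 2
    nu_hi = ((km1 + k2 + k3 * i0) * k1 * km3 * k2 * e0
             / (k1 * k2 * e0 + km3 * (km1 + k2)) ** 2)
    nu_lo = ((km1 + k2) * k1 * km3 * k2 * (e0 - i0)
             / (k1 * (k3 * i0 * (e0 + s0) + k2 * e0) + km3 * (km1 + k2)) ** 2)
    return delta_hi, delta_lo, nu_hi, nu_lo

# Fig. 14, top panels (k1 = k_{-3} = 1e2):
# delta^* ~ 9.3e-1, delta_* ~ 8.2e-1, nu^* ~ 9.0e-2, nu_* ~ 7.8e-2
print(dim2_bounds(10.0, 5.0, 1e2, 1e3, 10.0, 1.0, 1e2, 1e2))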

Fig. 13

Uncompetitive inhibition reaction mechanism with reduction to dimension two. In all panels, the parameters (in arbitrary units) are: \(e_0=2.0\), \(s_0=1.0\), \(k_2=k_{-1}=1.0\), \(k_3=1.0\), and \(i_0=1.0\). The solid black curve is the numerical solution to the mass action system (65). The dashed/dotted red curve is the numerical solution to (83). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top panels: \(k_1=k_{-3}=10^{-1}\) with \(\delta ^* = 1.75 \times 10^{-1}\) and \(\delta _*=3.33 \times 10^{-2}\). Bottom panels: \(k_1=k_{-3}=10^{-3}\) with \(\delta ^*= 1.75\times 10^{-3}\) and \(\delta _*=3.33 \times 10^{-4}\). As expected, the accuracy of (83) improves as the perturbation decreases along the parameter ray (Color figure online)

7.1.2 Numerical Simulations

From Fenichel theory, it is known that the accuracy of the reduction (83) improves along the perturbation direction as \(\varepsilon \rightarrow 0\). We include some numerical simulations to gauge the efficacy of the parameter \(\delta ^*\):

  1. 1.

    Following the outline established in Sect. 6, we first consider a case in which all parameters are of unit order. Numerical simulations confirm that the accuracy of (83) improves as \(\delta ^*\rightarrow 0\) along the parameter ray direction (see Fig. 13). We include the values of \(\delta _*\) to indicate the variation of timescale ratios.

  2. 2.

    In a second set of simulations, we consider the case of parameter values that are disparate in magnitude. Numerical simulations confirm once again that the accuracy of (83) improves along the parameter ray direction as \(\delta ^*\rightarrow 0\) (see Fig. 14).

Fig. 14

Uncompetitive inhibition reaction mechanism with reduction to dimension two and disparate parameter values. In all panels, the parameters (in arbitrary units) are: \(e_0=10.0\), \(s_0=5.0\), \(k_2=10^{3}\), \(k_{-1}=10^{2}\), \(k_3=10.0\), and \(i_0=1.0\). The solid black curve is the numerical solution to the mass action system (65). The broken red curve is the numerical solution to (83). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top panels: \(k_1=k_{-3}=10^{2}\) with \(\delta ^*\approx 9.3 \times 10^{-1}\) and \(\delta _*\approx 8.2 \times 10^{-1}\). Bottom panels: \(k_1=k_{-3}=1.0\) with \(\delta ^*\approx 9.3 \times 10^{-3}\) and \(\delta _*\approx 8.3 \times 10^{-3}\). Again, it is clear that the accuracy of (83) improves as the perturbation decreases along the parameter ray. Notably, the approximation in the second case is very good although \(k_1=k_{-3}=1\). (As always, the expression “\(\ll 1\)” should not be taken too literally.) As for measuring the discrepancy of the “slow” eigenvalues, one finds \(\kappa _*\ge \nu _*\approx 7.8\cdot 10^{-2}\) and \(\kappa ^*\le \nu ^*\approx 9.0\cdot 10^{-2}\). This indicates a ratio of about \(10^{-1}\) (Color figure online)

7.2 Competitive Inhibition Reaction Mechanism

For the competitive inhibition reaction mechanism, (74) and (75), we consider the case that formation of complex \(C_1\), and both formation and degradation of complex \(C_2\), are slow. Setting \(k_1=k_3=k_{-3}=0\), with all the other parameters contained in a compact subset of the positive orthant, defines a TFPV \(\widehat{\pi }\) for dimension \(s=2\), the critical manifold \(\widetilde{Y}\) being given by \(c_1=0\) (see Kruff and Walcher 2019). We consider the system on the compact positively invariant set K defined by

$$\begin{aligned} K:=\{(s,c_1,c_2)\in \mathbb {R}_{\ge 0}^3: 0\le s\le s_0,\;\; 0\le c_1\le e_0,\;\; 0\le c_2\le \min \{e_0,\,i_0\}\}, \end{aligned}$$

choosing the ray direction

$$\begin{aligned} \rho =(0,0,0,k_1^*,0,0,k_3^*,k_{-3}^*)^\textrm{tr} \end{aligned}$$

in parameter space, and \(k_i=\varepsilon k_i^*\) with \(k_i^*>0\) for \(i\in \{1,\,3,\,-3\}\). Standard computations yield the reduced system

$$\begin{aligned} \begin{array}{rcl} \dot{s} &{} =&{} -\dfrac{k_1k_2}{k_{-1}+k_2}(e_0-c_2)s \\ \dot{c}_2 &{} =&{} k_3(e_0-c_2)(i_0-c_2)-k_{-3}c_2 \end{array} \end{aligned}$$
(84)

with initial conditions \(s(0)=s_0\), \(c_2(0)=0\). The qualitative behavior of this system is easily determined. All solutions in the positive quadrant converge to a stationary point \((0, c_2^*)\), with \(0<c_2^*<\min \{e_0,i_0\}\).Footnote 21

7.2.1 Asymptotic Small Parameters

For the sake of brevity, we will consider only the case \(e_0>i_0\). The coefficients of the characteristic polynomial on \(\widetilde{Y}\) are

$$\begin{aligned} \begin{array}{rcl} \widetilde{\sigma }_1&{}=&{}k_{-1}+k_2+\varepsilon \,(\cdots )\\ \widetilde{\sigma }_2&{}=&{}k_1k_2(e_0-c_2)+k_3(e_0+i_0-2c_2)(k_{-1}+k_2)+k_{-3}(k_{-1}+k_2)+\varepsilon ^2\,(\cdots )\\ \widetilde{\sigma }_3&{}=&{} k_1k_3\cdot \left( k_2e_0(e_0+i_0-2c_2)+k_2c_2(2c_2-e_0-i_0)\right) +k_1k_{-3}\cdot k_2(e_0-c_2), \end{array} \end{aligned}$$

and we obtain

$$\begin{aligned} \begin{array}{rcl} \sigma _1&{}=&{}k_{-1}+k_2\\ \widehat{\sigma }_2&{}=&{}k_1^*k_2(e_0-c_2)+k_3^*(e_0+i_0-2c_2)(k_{-1}+k_2)+k_{-3}^*(k_{-1}+k_2)\\ \widehat{\sigma }_3&{}=&{}k_1^*k_2(e_0-c_2)\left( k_3^* (e_0+i_0-2c_2) +k_{-3}^*\right) . \end{array} \end{aligned}$$

The nondegeneracy conditions are satisfied (also at \(c_2=i_0\)), due to \(e_0>i_0\). As for timescales, we need to analyze the rational function

$$\begin{aligned} q(s, c_2)= \dfrac{\widehat{\sigma }_2(s,c_2,\widehat{\pi },\rho ,0)}{\sigma _1(s,c_2,\widehat{\pi })^2},\quad 0\le s\le s_0,\quad 0\le c_1\le e_0,\quad 0\le c_2\le i_0. \end{aligned}$$

Since \(\sigma _1\) is constant and \(\widehat{\sigma }_2\) is decreasing with \(c_2\), attaining its maximum at \(c_2=0\), and its minimum at \(c_2=i_0\), we find the distinguished parameters

$$\begin{aligned} \varepsilon ^*=\dfrac{k_1k_2e_0+(k_3(e_0+i_0)+k_{-3})(k_{-1}+k_2)}{(k_{-1}+k_2)^2} \end{aligned}$$

and

$$\begin{aligned} \varepsilon _{*}=\dfrac{k_1k_2(e_0-i_0)+(k_3(e_0-i_0)+k_{-3})(k_{-1}+k_2)}{(k_{-1}+k_2)^2}. \end{aligned}$$

Furthermore, according to Sect. 5.4, we consider

$$\begin{aligned} \kappa ^*=\max \dfrac{\sigma _1\widehat{\sigma }_3}{\widehat{\sigma }_2^2}\le \dfrac{k_1^*k_2e_0(k_3^*(e_0+i_0)+k_{-3}^*)(k_{-1}+k_2)}{\left( k_1^*k_2(e_0-i_0)+(k_3^*(e_0-i_0)+k_{-3}^*)(k_{-1}+k_2)\right) ^2}=:\nu ^*\nonumber \\ \end{aligned}$$
(85)

and

$$\begin{aligned} \kappa _*=\min \dfrac{\sigma _1\widehat{\sigma }_3}{\widehat{\sigma }_2^2}\ge \dfrac{k_1^*k_2(e_0-i_0)(k_3^*(e_0-i_0)+k_{-3}^*)(k_{-1}+k_2)}{\left( k_1^*k_2e_0+(k_3^*(e_0+i_0)+k_{-3}^*)(k_{-1}+k_2)\right) ^2}=:\nu _* \end{aligned}$$
(86)

to measure the disparity between the eigenvalues \(\lambda _2\) and \(\lambda _3\).
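
As in the previous subsection, these quantities are explicit in the rate constants. A short Python sketch (our own illustration; \(\nu ^*\) and \(\nu _*\) are invariant under a common scaling of \(k_1^*,\,k_3^*,\,k_{-3}^*\), so the actual rate constants may be inserted) should reproduce the values quoted in the captions of Figs. 15 and 16:

# Distinguished parameters for the dimension-two reduction of competitive inhibition (Sect. 7.2.1).
def competitive_dim2(e0, i0, km1, k2, k1, k3, km3):
    A = km1 + k2
    eps_hi = (k1 * k2 * e0 + (k3 * (e0 + i0) + km3) * A) / A ** 2
    eps_lo = (k1 * k2 * (e0 - i0) + (k3 * (e0 - i0) + km3) * A) / A ** 2
    nu_hi = (k1 * k2 * e0 * (k3 * (e0 + i0) + km3) * A
             / (k1 * k2 * (e0 - i0) + (k3 * (e0 - i0) + km3) * A) ** 2)   # Eq. (85)
    nu_lo = (k1 * k2 * (e0 - i0) * (k3 * (e0 - i0) + km3) * A
             / (k1 * k2 * e0 + (k3 * (e0 + i0) + km3) * A) ** 2)          # Eq. (86)
    return eps_hi, eps_lo, nu_hi, nu_lo

# Fig. 15, top panels (k1 = k3 = k_{-3} = 1e-1): eps^* = 1.75e-1, eps_* = 5.0e-2
print(competitive_dim2(1.0, 1.0, 1.0, 1.0, 0.1, 0.1, 0.1))
# Fig. 16, top panels (k1 = k3 = k_{-3} = 1.0): eps^* ~ 2.08, eps_* ~ 1.78
print(competitive_dim2(1e2, 10.0, 1.0, 1e2, 1.0, 1.0, 1.0))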

Fig. 15

Competitive inhibition reaction mechanism with reduction to dimension two. The accuracy of (84) improves along the parameter ray. In all panels, the parameters (in arbitrary units) are: \(e_0=1.0\), \(s_0=1.0\), \(k_2=1.0\), \(k_{-1}=1.0\), and \(i_0=1.0\). The solid black curve is the numerical solution to the mass action system (75). The broken red curve is the numerical solution to (84). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top panels: Simulation performed with \(k_1=k_{-3}=k_3=10^{-1}\) and \(\varepsilon ^*=1.75\times 10^{-1}\), \(\varepsilon _*=5.0\times 10^{-2}\). Bottom panels: Simulation performed with \(k_1=k_{-3}=k_3=10^{-3}\) and \(\varepsilon ^*=1.75 \times 10^{-3}\), \(\varepsilon _*=5.0\times 10^{-4}\). The singular perturbation reduction (84) is practically indistinguishable from (75) (Color figure online)

7.2.2 Numerical Simulations

We present numerical examples to gauge the accuracy of the reduction (84) with decreasing \(\varepsilon ^*\):

  1. 1.

    For our first example, we consider the case \(\pi =(1.0,1.0,1.0,\varepsilon k_1^*,1.0,1.0,\varepsilon k_3^*,\varepsilon k_{-3}^*)^\textrm{tr}\), in which all parameters except \(k_1\), \(k_3\) and \(k_{-3}\) are equal to 1.0 (see Fig. 15). We include the values of \(\varepsilon ^*,\,\varepsilon _*\) to indicate the variation of timescale ratios.

  2. 2.

    For our second example, we again consider a case in which the parameters are of differing magnitudes. Once more, numerical simulations confirm that the accuracy of the QSS reduction (84) improves as \(\varepsilon ^*\rightarrow 0\) along the parameter ray (see Fig. 16).

Fig. 16

Competitive inhibition reaction mechanism with reduction to dimension two and disparate parameters. In all panels, the “non-small” parameters (in arbitrary units) are: \(e_0=10^2\), \(s_0=10^2\), \(k_2=10^2\), \(k_{-1}=1.0\), and \(i_0=10.0\). The solid black curve is the numerical solution to the mass action system (75). The broken red curve is the numerical solution to (84). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\), \(\;\tau \in [0,1]\). Top panels: Simulation performed with \(k_1=k_{-3}=k_3=1.0\) and \(\varepsilon ^*\approx 2.08\), \(\varepsilon _*\approx 1.78\). This scenario is outside the range of applicability for Proposition 6. Bottom panels: Simulation performed with \(k_1=k_{-3}=k_3=10^{-2}\) and \(\varepsilon ^*\approx 2.08 \times 10^{-2}\), \(\varepsilon _*\approx 1.78 \times 10^{-2}\). The singular perturbation reduction (84) here is very close to (75) (Color figure online)

7.2.3 The Case of Very Small \(k_1\): Three Timescales

Finally, we discuss a scenario mentioned in Sect. 5.4. From equations (85) and (86), one sees that both \(\kappa ^*\) and \(\kappa _*\) approach zero as \(k_1^*\rightarrow 0\). This may indicate three timescales. Moreover, from equation (75), one sees that the plane defined by \(c_1=0\) is invariant when \(k_1=0\), thus nearly invariant when \(k_1\) is small. A coordinate-independent approach to a three-timescale scenario was presented in Kruff and Walcher (2019), based on work of Cardin and Teixeira (2017). We introduce two small parameters \(\varepsilon _1,\,\varepsilon _2\) and

$$\begin{aligned} k_3=\varepsilon _1 k_3^\dagger ,\,k_{-3}=\varepsilon _1 k_{-3}^\dagger ,\, k_1=\varepsilon _1\varepsilon _2 k_1^\dagger , \end{aligned}$$

and rewrite system (75) with three timescales. As detailed in Kruff and Walcher (2019), the system admits a sequence of two reductions, with nested invariant manifolds:

A reduction to slow dynamics on a two-dimensional invariant manifold close to \(c_1=0\), with reduced system

$$\begin{aligned} \begin{pmatrix} \dfrac{\textrm{d}s}{\textrm{d}\tau _1}\\ \dfrac{\textrm{d}c_2}{\textrm{d}\tau _1} \end{pmatrix}=\begin{pmatrix} 0\\ k_3^\dagger (e_0-c_2)(i_0-c_2)-k_{-3}^{\dagger }c_2 \end{pmatrix} \end{aligned}$$
(87)

with \(\tau _1=\varepsilon _1 t\).

A subsequent reduction to “very slow” dynamics on a one-dimensional invariant manifold close to \(c_1=0, c_2=\widetilde{c}_2\), with

$$\begin{aligned} \widetilde{c}_2 = \frac{k_3(e_0+i_0) +k_{-3} - \sqrt{(k_3(e_0+i_0) +k_{-3})^2-4e_0i_0k_3^2}}{2k_3}. \end{aligned}$$

The fully reduced one-dimensional equation is then

$$\begin{aligned} \dfrac{\textrm{d}s}{\textrm{d}\tau _2}=-\frac{k_2\cdot k_1^\dagger (e_0-\widetilde{c}_2)}{k_{-1}+k_2}s \end{aligned}$$

with \(\tau _2=\varepsilon _1\varepsilon _2 t\), or, restated in fast time,

$$\begin{aligned} \dot{s}=-\frac{k_2\cdot k_1(e_0-\widetilde{c}_2)}{k_{-1}+k_2}s. \end{aligned}$$
(88)

Figure 17 illustrates the “slow–very slow” dynamics for a numerical example.
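
The quantities underlying Fig. 17 can be checked directly. The short Python sketch below (our own check; parameter values from the caption of Fig. 17) evaluates \(\widetilde{c}_2\), the parameters \(\varepsilon ^*,\,\varepsilon _*\) and \(\nu ^*\) from Sect. 7.2.1, and the effective rate constant in (88):

import math

# Parameter values from the caption of Fig. 17.
e0, s0, i0 = 1e2, 10.0, 50.0
k1, km1, k2, k3, km3 = 1e-4, 1.0, 2e3, 1e-1, 1e-3
A = km1 + k2

# Quasi-stationary value c2_tilde on the "very slow" one-dimensional manifold
b = k3 * (e0 + i0) + km3
c2_tilde = (b - math.sqrt(b ** 2 - 4.0 * e0 * i0 * k3 ** 2)) / (2.0 * k3)

eps_hi = (k1 * k2 * e0 + b * A) / A ** 2                                                  # ~ 7.5e-3
eps_lo = (k1 * k2 * (e0 - i0) + (k3 * (e0 - i0) + km3) * A) / A ** 2                      # ~ 2.5e-3
nu_hi = k1 * k2 * e0 * b * A / (k1 * k2 * (e0 - i0) + (k3 * (e0 - i0) + km3) * A) ** 2    # ~ 6e-3

print(c2_tilde, eps_hi, eps_lo, nu_hi)
print("effective rate constant in (88):", k2 * k1 * (e0 - c2_tilde) / A)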

Fig. 17

Competitive inhibition reaction mechanism with a three timescale scenario. In both panels, the parameters (in arbitrary units) are: \(s_0=10.0\), \(e_0=10^2\), \(k_1=1 \times 10^{-4}\), \(k_2=2\times 10^{3}\), \(k_{-1}=1.0\), \(k_3=10^{-1}\), \(k_{-3}=10^{-3}\) and \(i_0=50.0\). The solid black curve is the numerical solution to the mass action system (75). The dashed/dotted red curves are the numerical solutions to (87) and (88). Time has been mapped to the \(\tau \) scale: \(\tau = t/T\). For the chosen parameter values, \(\varepsilon ^* \approx 7.5\times 10^{-3}\) and \(\varepsilon _*\approx 2.5\times 10^{-3}\). Moreover, the two slow eigenvalues are disparate, since \(\nu ^*\approx 6\times 10^{-3}\); this is consistent with a three timescale scenario. Top Left panel: The initial accumulation of \(c_1\) occurs on the fast timescale; the concentrations of \(c_2\) and s are approximately constant on the fast timescale. Top Right panel: The reduction (87) is accurate on the slow timescale as \(c_2\) approaches its threshold value, \(\widetilde{c}_2\). Bottom Left panel: The reduction (88) is accurate on the very slow timescale, \(\tau _2\), on which the depletion of s is significant. Bottom Right panel: On the very slow timescale, \(c_2\) is effectively constant: \(c_2 \approx \widetilde{c}_2\) (Color figure online)

8 Discussion

While the underlying theory and the qualitative analysis concerning the reduction of biochemical and chemical reaction networks are well understood and rest on solid ground, there is a sizable gap between available theory and applications to parameter identification problems in laboratory settings, where heuristics and ad hoc approaches are (perforce) still prevalent. Closing the gap requires further, more precise theoretical results. The present paper contributes toward this goal by introducing a general, consistent method to obtain perturbation parameters, based on local linear timescales. Note that, by its nature, our approach is focused on, and limited to, local behavior.

We briefly recall the context and review the results of the present paper:

  1. 1.

    We start from a singular perturbation reduction with a well-defined critical manifold. This is crucial to ensure that linearization is appropriate. Considering the three steps (as outlined in the Introduction) that are necessary for a global quantitative estimate of the approximation error, our results amount to an essential part of Step 1. In the absence of results concerning Steps 2 and 3, direct applications are limited. But our results permit consistency checks, which show that certain common perturbation parameters are not feasible.

  2. 2.

    Using classical results from algebra to approximate eigenvalue ratios in the asymptotic limit, we obtained parameters that are computable and palatable, and that admit a biochemical interpretation.

  3. 3.

    We first applied our methods to the Michaelis–Menten reaction mechanism. As it turns out, even for such a familiar system our approach provides new and elucidating perturbation parameters. Moreover, we included a partial discussion of Step 3 for the irreversible system with small product formation rate.

  4. 4.

    For two relevant extensions of the Michaelis–Menten reaction mechanism (uncompetitive and competitive inhibition), and for a non-Michaelis–Menten reaction mechanism (the cooperative system with two complexes), we derived perturbation parameters in the spirit of Segel and Slemrod (1989), but without resorting to nonlinear timescales. This stands in contrast to the practice of using \(\varepsilon _{BH}\) or \(\varepsilon _{SSl}\), or ad hoc modifications of these. We augmented these results by an extensive discussion of numerical examples to illustrate the efficacy of these parameters, but also to highlight the importance of the compactness requirements we impose throughout. We also discussed one case that leads to a system with three timescales.

  5. 5.

    Finally, we discussed exemplary cases of reduction from dimension three to dimension two for both reaction inhibition scenarios, to verify the feasibility of our approach. Numerical simulations illustrate the quality and accuracy of the approximations.

The remaining items (Steps 2 and 3), as stated in the Introduction, need to be handled on a case-by-case basis. We will provide a complete analysis of the irreversible Michaelis–Menten reaction mechanism with low enzyme concentration in forthcoming work.