1 Introduction

Because natural CO2 sinks absorb 55% of all anthropogenic carbon emissions, they represent “a huge subsidy to the global economy worth half a trillion US$ annually” (Canadell et al. 2007). However, the efficiency of natural sinks at absorbing CO2 has decreased by 5% in the last 50 years, and this pattern is expected to continue (Canadell et al. 2007). One illustration of this trend is provided by the oceanic carbon sink, the decreased efficiency of which is believed to have caused a decline of up to 30% in the efficiency of the Southern Ocean sink over the last 20 years (Le Quéré et al. 2007). As consequences of this trend, greenhouse warming and the associated release of soil carbon favor a transition of the biosphere from a carbon sink to a source of pollution in the long run (Cox et al. 2000; Cramer et al. 2001; Joos et al. 2001; Lenton et al. 2006; Piao et al. 2008). According to Joos et al. (2001), “in the most extreme cases, the terrestrial biosphere becomes a source of carbon during the second half of the century.”

In the absence of a transnational institution that can impose an environmental policy, and due to the lack of enforceable global coordination, it is likely that decentralized decision-making processes at the national level related to polluting emissions will aggravate the decline in the efficiency of carbon sinks.

With rare exceptions, the problem of declining environmental absorption efficiency has been disregarded in dynamic game models of transboundary pollution, which are typically based on the assumption of a linear environmental absorption function (e.g., Li 2014; Benchekroun and Martín-Herrán 2016; Fünfgelt and Schulze 2016; Huang et al. 2016). More recently, a nonlinear absorption function has been proposed to account for declining environmental absorption efficiency (Mäler et al. 2003; Kossioris et al. 2008). This formulation, which combines the linear negative influence of the pollution stock with nonlinear positive feedback related to the release of past accumulated pollution, involves multiple steady state equilibria with different stability properties. Although this change in perspective allows more accurate handling of potentially irreversible environmental degradation, it relies on the assumption of an instantaneous change in the absorption efficiency that precludes investing in a restoration effort whose impact is, indeed, not instantaneous.

Economic study of these issues is of prime importance because, just like emissions reduction, restoration of carbon sinks’ efficiency is also a public good. With two superimposed free-rider problems, the failure of reduction policy might also impede restoration policy.

In this paper, we suggest a two-player differential game model of transboundary pollution that accounts for time-dependent environmental absorption efficiency, which allows switching of the biosphere from a carbon sink to a source, and the ability to invest in the restoration of environmental absorption efficiency. In this setup, we investigate the impact of negative externalities resulting from the transboundary pollution control non-cooperative game wherein countries are dynamically involved. To do so, we assess differences related to both transient and steady states between cooperative and non-cooperative pollution control and environmental absorption efficiency management.

To date, the theoretical debate on transboundary pollution control has led to the emergence of two main streams (cf. Jørgensen et al. 2010; Long 2010; Benchekroun and Long 2011). The first stream recommends that polluters commit to a predetermined plan of action over time whenever environmental absorption efficiency is linear (van der Ploeg and de Zeeuw 1991, 1992) or non-linear (Mäler et al. 2003; Kossioris et al. 2008). The second stream of literature prescribes that, in the context of linear environmental absorption efficiency, polluters use softly discounted, non-linear strategies to make decisions contingent on the current pollution level over time (Dockner and Long 1993).Footnote 1

In practice, only voluntary “contributions” to global emissions reduction, rather than commitments, were agreed to by the countries attending the successive International Conferences on the Climate since the implementation of the Kyoto Protocol in 2005. A recent conference, which took place in Warsaw in November 2013, illustrated how difficult it is to reach a global, enforceable agreement on emissions reduction. It is therefore important to analyze whether the absence of commitment at an international level on pollution control is likely to aggravate the decline in the efficiency of carbon sinks.

In this perspective, we compare the outcomes related to an open-loop Nash equilibrium (OLNE), which reflects the situation where polluters commit to a predetermined plan of action, and a Markov perfect Nash equilibrium (MPNE), which corresponds to the case where polluters make decisions contingent on the current state of the biosphere (Dockner et al. 2000; Long 2010). To characterize an MPNE solution near its steady state, we use both a carefully designed local quadratic approximation, and, to get a comparison, a numerical approximation based on the collocation method for PDEs (Judd 1998; Doraszelski 2003). Setting the cooperative solution as a benchmark, we identify which decision rule, OLNE or MPNE, better prevents the durable or transient switching of the biosphere from a pollution sink to a source, if any. The results show that the players’ transient behavior has a critical impact on the resulting steady state in terms of pollution stock and environmental absorption efficiency. More importantly, unexpected contrasts are found between the cooperative and non-cooperative steady states.

The paper proceeds as follows. Section 2 develops the differential game model and its properties. In Sects. 3 and 4, we derive the cooperative and open-loop Nash equilibria, respectively. In Sect. 5, we study the Markov perfect Nash equilibrium for a locally quadratic value function and evaluate its accuracy. In Sect. 6, we compare the results. Section 7 concludes the paper.

2 Model formulation

We extend the non-linear quadratic model in El Ouardighi et al. (2014) to a differential game of transboundary pollution. In the setup of a two-player game where players are, e.g., nations, we interpret the biosphere as the place of interaction between pollution and environmental absorption efficiency. The pollution stock is supposed to increase with anthropogenic emissions and to depreciate at an environmental absorption efficiency rate. The pollution stock has a destructive impact on absorption efficiency, which in turn increases the pollution stock. In contrast with the existing literature, we model environmental absorption efficiency as a state variable, which allows for negative absorption efficiency that would stimulate pollution even in the absence of anthropogenic emissions. This is in line with scientific evidence that the biosphere may switch from a carbon sink to a source (e.g., Cox et al. 2000; Canadell et al. 2007). To counteract such a switch, we consider two policy instruments: anthropogenic emissions and efforts to restore environmental absorption efficiency.

The rate of change of the pollution stock over time is written as:

$$ \dot{P}(t) = \sum\nolimits_{i} {e_{i} (t)} - A(t)P(t)\quad P(0) = P_{0} > 0 $$
(1)

where \( P(t) \ge 0 \) denotes the pollution stock at time \( t \), \( e_{i} (t) \ge 0 \), \( i = 1,2 \), player i’s current emissions, and \( A(t) \) environmental absorption efficiency.Footnote 2

Further, the rate of change of environmental absorption efficiency over time is:

$$ \dot{A}(t) = \sum\nolimits_{i} {w_{i} (t)} - \gamma P(t)\quad A(0) = A_{0} \ge 0 $$
(2)

\( w_{i} (t) \ge 0 \) being player i’s effort to restore environmental absorption efficiency, \( i = 1,2 \), and \( \gamma > 0 \) the marginal destructive impact of pollution on the evolution of absorption efficiency. The efforts to restore environmental absorption efficiency may include reforestation (UNFCC 2008), and geo-engineering technologies (such as iron fertilization, biochar), which aim at accelerating natural processes of carbon removal (Vaughan and Lenton 2011).

The formulation in (1)–(2) includes the pollution accumulation model with linear absorption efficiency as a special case if environmental absorption efficiency is totally insensitive to the destructive impact of the pollution stock, \( \gamma = 0 \), and there is no restoration effort of environmental absorption efficiency, \( w(t) = 0,\,\forall \,t \).Footnote 3 The formulation in (1)–(2) also extends the pollution accumulation model with zero absorption efficiency if environmental absorption efficiency is zero over the whole planning horizon, \( A(t) = 0,\,\forall \,t \) (El Ouardighi et al. 2015b).

Assuming that there is no anthropogenic intervention (which corresponds to the pre-industrial era), over an infinite horizon, i.e., \( e_{i} (t) = w_{i} (t) = 0 \), \( \forall \,t \ge 0 \), \( i = 1,2 \), the properties of the system (1)–(2) can be described in the phase diagram below. The system admits infinite steady states on the isocline \( \dot{A} = 0 \). The eigenvalues associated with such system are \( - A_{\infty } \) and \( 0 \), which means that the positive horizontal axis is a sink, and the negative horizontal axis is a source.

Based on the steady state \( \left( {A_{\infty } ,P_{\infty } } \right) = \left( {0,0} \right) \), the phase diagram can be divided into three regions:

  • A sustainability area, where the initial environmental absorption capacity acts as a sink for any pollution stock starting on the right side of the (dashed) saddle path.

  • An emergency area, where any path starting on the left side of the (dashed) saddle path and the right side of the \( \dot{P} = 0 \) isocline, though declining, tends to exhaust the pollution sink.

  • An irreversible area (the so-called Soylent green scenario), on the left side of the \( \dot{P} = 0 \) isocline, where the environment becomes a source of formerly stored pollution (Fig. 1).

    Fig. 1
    figure 1

    Pollution-absorption dynamics without human interference

The phase diagram shows that a large initial pollution stock combined with sufficiently low initial environmental absorption efficiency is a sufficient condition for switching from a pollution sink to a source.

Regarding the objective functions, we assume that each player seeks to maximize its intertemporal utility, \( U_{i} \). In this regard, we make the following assumptions. First, each player’s instantaneous revenue is a concave function of its own current rate of emissions, \( e_{i} (t)\left( {a_{i} - {{e_{i} (t)} \mathord{\left/ {\vphantom {{e_{i} (t)} 2}} \right. \kern-0pt} 2}} \right) \), where \( 2a_{i} > 0 \) is an upper bound for emissions, \( i = 1,2 \). This choice comes from the fact that the quadratic revenue function has been often used in dynamic game models with a linear absorption function (Rubio and Casino 2002; Wirl 2007).Footnote 4 The instantaneous social cost incurred by each player due to the pollution stock is a quadratic function, \( {{c_{i} P(t)^{2} } \mathord{\left/ {\vphantom {{c_{i} P(t)^{2} } 2}} \right. \kern-0pt} 2} \), where \( c_{i} > 0 \) is a pollution cost coefficient, \( i = 1,2 \) (Rubio and Casino 2002; Wirl 2007; Boucekkine et al. 2013). Finally, each player incurs an instantaneous increasing convex cost of effort to restore environmental absorption efficiency, \( {{f_{i} w_{i} (t)^{2} } \mathord{\left/ {\vphantom {{f_{i} w_{i} (t)^{2} } 2}} \right. \kern-0pt} 2} \), where \( f_{i} > 0 \) is a restoration cost coefficient, \( i = 1,2 \) (El Ouardighi et al. 2014; El Ouardighi et al. 2015a).

Letting \( r_{i} > 0 \) denote player i’s discounting rate, and assuming an infinite planning horizon, player i’s objective criterion is written as:

$$ \mathop {\text{Max}}\limits_{{e_{i} (.),w_{i} (.)}} U_{i} = \int\limits_{0}^{ + \infty } {e^{{ - r_{i} t}} } \left[ {e_{i} (t)\left( {a_{i} - {{e_{i} (t)} \mathord{\left/ {\vphantom {{e_{i} (t)} 2}} \right. \kern-0pt} 2}} \right) - {{c_{i} P(t)^{2} } \mathord{\left/ {\vphantom {{c_{i} P(t)^{2} } 2}} \right. \kern-0pt} 2} - {{f_{i} w_{i} (t)^{2} } \mathord{\left/ {\vphantom {{f_{i} w_{i} (t)^{2} } 2}} \right. \kern-0pt} 2}} \right]\text{d}t $$
(3)

subject to (1)–(2), \( e_{i} (t) \ge 0 \) and \( w_{i} (t) \ge 0 \), \( \forall \,t \), \( i = 1,2 \). In the remainder of the paper, we set \( f_{i} = 1 \), \( i = 1,2 \), without loss of generality. Further, invoking the symmetry assumption, which is standard in the literature on pollution control (e.g., Jørgensen et al. 2010; Long 2010), we use: \( a_{i} \equiv a \), \( c_{i} \equiv c \), and \( r_{i} \equiv r \), \( i = 1,2 \).

3 Cooperative solution

Cooperative solution strategies require that the players align their respective interests and maximize the overall utility over the planning horizon, that is:

$$ U = \int\limits_{0}^{ + \infty } {e^{ - rt} } \left[ {\sum\nolimits_{i} {e_{i} (t)\left( {a - {{e_{i} (t)} \mathord{\left/ {\vphantom {{e_{i} (t)} 2}} \right. \kern-0pt} 2}} \right)} - cP(t)^{2} - \sum\nolimits_{i} {{{w_{i} (t)^{2} } \mathord{\left/ {\vphantom {{w_{i} (t)^{2} } 2}} \right. \kern-0pt} 2}} } \right]\text{d}t $$
(4)

Accordingly, the current-value Hamiltonian is written as:

$$ H = \sum\nolimits_{i} {e_{i} \left( {a - {{e_{i} } \mathord{\left/ {\vphantom {{e_{i} } 2}} \right. \kern-0pt} 2}} \right)} - cP^{2} - \sum\nolimits_{i} {{{w_{i}^{2} } \mathord{\left/ {\vphantom {{w_{i}^{2} } 2}} \right. \kern-0pt} 2}} + \eta_{1} \left( {\sum\nolimits_{i} {e_{i} } - AP} \right) + \eta_{2} \left( {\sum\nolimits_{i} {w_{i} } - \gamma P} \right) $$
(5)

where \( \eta_{j} \equiv \eta_{j} (t) \) are (current value) costate variables, \( j = 1,2 \).

Assuming an interior solution and applying Pontryagin’s principle in the discounted case, the necessary conditions for cooperative solution are:

$$ H_{{e_{i} }} = a - e_{i} + \eta_{1} = 0 \Rightarrow e_{i} = a + \eta_{1} $$
(6)
$$ H_{{w_{i} }} = - w_{i} + \eta_{2} = 0 \Rightarrow w_{i} = \eta_{2} $$
(7)

\( i = 1,2 \), where the evolution of the costate variables is given by:

$$ \dot{\eta }_{1} = 2cP + \left( {r + A} \right)\eta_{1} + \gamma \eta_{2} $$
(8)
$$ \dot{\eta }_{2} = r\eta_{2} + \eta_{1} P $$
(9)

Equations (8) and (9) confirm that the players’ equilibrium strategies are symmetric.

Using (6)–(7) in (1)–(2), we obtain:

$$ \dot{P} = 2\left( {a + \eta_{1} } \right) - AP $$
(10)
$$ \dot{A} = 2\eta_{2} - \gamma P $$
(11)

Let us assume a steady state to study optimal control at this state. Assuming that \( \dot{P} = \dot{A} = 0 \) gives \( \sum\nolimits_{i} {e_{i} } = AP \) and \( \sum\nolimits_{i} {w_{i} } = \gamma P \). Differentiating these equations with respect to time and accounting for \( \dot{P} = \dot{A} = 0 \) yields steady efforts \( \dot{e}_{i} = \dot{w}_{i} = 0 \)\( i = 1,\;2 \). Differentiating (6)–(7) over any time interval and accounting for \( \dot{e}_{i} = \dot{w}_{i} = 0 \), \( i = 1,\;2 \), the costate variables are time invariant. This implies that if the pollution stock and the absorption efficiency are both steady, then the related costate variables are also steady, and each player performs time-invariant efforts.

Proposition 1

There exists a non-trivial cooperative steady state that is unique, given by:

$$ \left[ {P_{\infty } \quad A_{\infty } \quad e_{i\infty } \quad w_{i\infty } } \right]^{T} = \left[ {\frac{{r^{2} \gamma + h}}{{2\left( {4c + \gamma^{2} } \right)}}\quad \frac{{2\left( {2a - r\gamma } \right)\left( {4c + \gamma^{2} } \right)}}{{r^{2} \gamma + h}}\quad \frac{2a - r\gamma }{2}\quad \frac{{\gamma \left( {r^{2} \gamma + h} \right)}}{{4\left( {4c + \gamma^{2} } \right)}}} \right]^{T} $$
(12)

where\( h \equiv \sqrt {r^{4} \gamma^{2} + 4r\gamma \left( {2a - r\gamma } \right)\left( {4c + \gamma^{2} } \right)} > 0 \), \( i = 1,2 \).

Proof

See "Appendix A1".

The cooperative solution allows for a feasible steady state emissions rate compatible with the preservation of a strictly positive environmental absorption rate if \( a > {{r\gamma } \mathord{\left/ {\vphantom {{r\gamma } 2}} \right. \kern-0pt} 2} \). In (12), we observe that a lower discounting rate should result in a higher steady state emissions rate \( e_{i\infty } \) and environmental absorption efficiency \( A_{\infty } \) as well as lower pollution stock \( P_{\infty } \) and restoration efforts \( w_{i\infty } \). Note that there exists a trivial steady state solution characterized by zero emissions, pollution stock and restoration efforts and negative absorption efficiency, i.e., \( \left[ {P_{\infty }^{'} \quad A_{\infty }^{'} \quad e_{i\infty }^{'} \quad w_{i\infty }^{'} } \right]^{T} = \left[ {0\quad - r\quad 0\quad 0} \right]^{T} \). This solution is different from the irreversible solution found in Tahvonen and Withagen (1996), which implies ‘high’ pollution concentration that can be optimal. In our model, this second steady state is unstable and therefore cannot be optimal.

Corollary 1

At the steady state, the instantaneous cooperative utility is:

$$ u_{\infty } = \frac{{4a^{2} - r^{2} \gamma^{2} }}{8} - \frac{{\left( {r^{2} \gamma + h} \right)^{2} }}{{16\left( {4c + \gamma^{2} } \right)}} $$
(13)

Proof

See "Appendix A2".

The expression in (13) characterizes the utility of future generations in the cooperative case. It can be shown that \( {{\partial u_{\infty } } \mathord{\left/ {\vphantom {{\partial u_{\infty } } {\partial r}}} \right. \kern-0pt} {\partial r}} < 0 \), which implies that a larger discounting rate results in a lower utility of future generations under cooperation. Similarly, as \( {{\partial u_{\infty } } \mathord{\left/ {\vphantom {{\partial u_{\infty } } {\partial \gamma }}} \right. \kern-0pt} {\partial \gamma }} < 0 \), more vulnerable environmental absorption efficiency negatively affects the utility of future generations.

Proposition 2

The cooperative steady state is a saddle-point and the optimal path to the steady state is either monotonic or oscillatory.

Proof

See "Appendix A3".

That is, the saddle-path can either converge monotonically or oscillate toward the steady state. This result is consistent with previous findings where a clockwise spiraling path can go through the Soylent Green area, with negative environmental absorption efficiency during a finite time interval (El Ouardighi et al. 2014; El Ouardighi et al. 2015a).

Lemma 1

The saddle paths of the control and state variables near the cooperative steady state are:

$$ e_{i} (t) = a + \eta_{1\infty } + B_{1} e^{{\chi_{1} t}} + B_{2} e^{{\chi_{3} t}} $$
(14)
$$ w_{i} (t) = w_{i\infty } + B_{3} e^{{\chi_{1} t}} + B_{4} e^{{\chi_{3} t}} $$
(15)
$$ P(t) = P_{\infty } + B_{5} e^{{\chi_{1} t}} + B_{6} e^{{\chi_{3} t}} $$
(16)
$$ A(t) = A_{\infty } + B_{7} e^{{\chi_{1} t}} + B_{8} e^{{\chi_{3} t}} $$
(17)

where \( B_{1} , \ldots ,B_{8} \) are constants of integration and \( \chi_{1} ,\chi_{3} \) are eigenvalues with negative real part.

Proof

See "Appendix A4".

Equations (14)–(17) are useful to characterize the transient paths for the control and state variables in the cooperative solution.

4 Open-loop Nash equilibrium

An analysis of the OLNE strategy, which can be justified whenever the players are capable of precommitting themselves to announced time paths, offers valuable insights into the impact of negative externalities resulting from the transboundary non-cooperative game of pollution control and environmental absorption efficiency management. In addition, open-loop strategies provide a useful benchmark for subsequent assessment of the strategic effects related to Markovian strategies (Fudenberg and Levine 1988).

Using the objective function in (3), the current-value Hamiltonian of player i is written as:

$$ H^{i} = e_{i} \left( {a - {{e_{i} } \mathord{\left/ {\vphantom {{e_{i} } 2}} \right. \kern-0pt} 2}} \right) - {{cP^{2} } \mathord{\left/ {\vphantom {{cP^{2} } 2}} \right. \kern-0pt} 2} - {{w_{i}^{2} } \mathord{\left/ {\vphantom {{w_{i}^{2} } 2}} \right. \kern-0pt} 2} + \lambda_{i1} \left( {e_{1} + e_{2} - AP} \right) + \lambda_{i2} \left( {w_{1} + w_{2} - \gamma P} \right) $$
(18)

where \( \lambda_{ij} \equiv \lambda_{ij} (t) \) are player i’s (current value) costate variables, \( i = 1,2 \), \( j = 1,2 \).

Assuming an interior solution and applying Pontryagin’s principle in the discounted case, player i’s necessary conditions for OLNE are:

$$ H_{{e_{i} }}^{i} = a - e_{i} + \lambda_{i1} = 0 \Rightarrow e_{i}^{on} = a + \lambda_{i1} $$
(19)
$$ H_{{w_{i} }}^{i} = - w_{i} + \lambda_{i2} = 0 \Rightarrow w_{i}^{on} = \lambda_{i2} $$
(20)

\( i = 1,2 \), where the superscript “\( on \)” stands for open-loop Nash equilibrium.

The evolution of the costate variables is given by:

$$ \dot{\lambda }_{i1} = cP + \left( {r + A} \right)\lambda_{i1} + \gamma \lambda_{i2} $$
(21)
$$ \dot{\lambda }_{i2} = r\lambda_{i2} + \lambda_{i1} P $$
(22)

In the present case, the costate variables of both players coincide; that is, \( \lambda_{i1} \equiv \lambda_{1} \) and \( \lambda_{i2} \equiv \lambda_{2} \), \( i = 1,2 \), and Eqs. (21) and (22) confirm that the players’ equilibrium strategies are symmetric. Therefore, we are left with only two costate equations.

Using (21)–(22), (1)–(2) we obtain:

$$ \dot{P} = 2\left( {a + \lambda_{i1} } \right) - AP $$
(23)
$$ \dot{A} = 2\lambda_{2} - \gamma P $$
(24)

Note that here also, if the pollution stock and the absorption efficiency are both steady, then each player performs time-invariant control efforts.

Proposition 3

There exists a non-trivial OLNE steady state that is unique, given by:

$$ \left[ {P_{\infty }^{on} \quad A_{\infty }^{on} \quad e_{i\infty }^{on} \quad w_{i\infty }^{on} } \right]^{T} = \left[ {\frac{{r^{2} \gamma + k}}{{2\left( {2c + \gamma^{2} } \right)}}\quad \frac{{2\left( {2a - r\gamma } \right)\left( {2c + \gamma^{2} } \right)}}{{r^{2} \gamma + k}}\quad \frac{2a - r\gamma }{2}\quad \frac{{\gamma \left( {r^{2} \gamma + k} \right)}}{{4\left( {2c + \gamma^{2} } \right)}}} \right]^{T} $$
(25)

where\( k \equiv \sqrt {r^{4} \gamma^{2} + 4r\gamma \left( {2a - r\gamma } \right)\left( {2c + \gamma^{2} } \right)} > 0 \), \( i = 1,2 \).

Proof

See "Appendix A5".

Therefore, precommitment strategies also allow a feasible steady state emissions rate compatible with the preservation of a strictly positive environmental absorption rate under the feasibility condition \( a > {{r\gamma } \mathord{\left/ {\vphantom {{r\gamma } 2}} \right. \kern-0pt} 2} \). In (25), a lower discounting rate also increases the steady state emissions rate \( e_{i\infty }^{on} \) and environmental absorption efficiency \( A_{\infty }^{on} \), and lowers the pollution stock \( P_{\infty }^{on} \) and restoration efforts \( w_{i\infty }^{on} \). Here also, there exists a trivial steady state solution such that \( \left[ {P_{\infty }^{'on} \quad A_{\infty }^{'on} \quad e_{i\infty }^{'on} \quad w_{i\infty }^{'on} } \right]^{T} = \left[ {0\quad - r\quad 0\quad 0} \right]^{T} \), which is diverging and therefore not optimal.

Corollary 2

At the steady state, player i’s OLNE instantaneous utility is:

$$ u_{i\infty }^{on} = \frac{{4a^{2} - r^{2} \gamma^{2} }}{8} - \frac{{\left( {4c + \gamma^{2} } \right)\left( {r^{2} \gamma + k} \right)^{2} }}{{32\left( {2c + \gamma^{2} } \right)^{2} }} $$
(26)

Proof

See "Appendix A6".

The expression in (26) represents the utility of future generations from the perspective of player i in the OLNE case. It has the same properties as in the cooperative case regarding the influence of the discounting rate, \( r \), and the marginal impact of pollution on the evolution of absorption efficiency, \( \gamma \).

Proposition 4

The OLNE steady state has the saddle-point property and the optimal path to the steady state is either monotonic or oscillatory.

Proof

See "Appendix A7".

As in the cooperative setting, the saddle-path can either converge monotonically or follow an oscillatory trajectory toward the steady state.

Lemma 2

The saddle paths of the control and state variables near the OLNE steady state are:

$$ e_{i}^{on} (t) = a + \lambda_{1\infty } + D_{1} e^{{\xi_{1} t}} + D_{2} e^{{\xi_{3} t}} $$
(27)
$$ w_{i}^{on} (t) = w_{i\infty }^{on} + D_{3} e^{{\xi_{1} t}} + D_{4} e^{{\xi_{3} t}} $$
(28)
$$ P^{on} (t) = P_{\infty }^{on} + D_{5} e^{{\xi_{1} t}} + D_{6} e^{{\xi_{3} t}} $$
(29)
$$ A^{on} (t) = A_{\infty }^{on} + D_{7} e^{{\xi_{1} t}} + D_{8} e^{{\xi_{3} t}} $$
(30)

where \( D_{1} , \ldots ,D_{8} \) are constants of integration and \( \xi_{1} ,\xi_{3} \) are eigenvalues with negative real part.

Proof

See "Appendix A8".

Equations (27)–(30) enable the computation of transient paths for the control and state variables in an OLNE.

5 Markov perfect Nash equilibrium

The MPNE has the desirable property of subgame perfectness because current actions depend on the current state vector and time, and not only on time, as OLNE does. However, the non-linear quadratic structure of the game makes the search for an analytical solution of MPNE more difficult than for usual linear quadratic models. The main reason for this is that the two state variables interact multiplicatively in one state equation, which involves intricate third-order terms in the resolution process. Thus, the design of an approximation for the MPNE requires particular attention.

Using the objective function in (3), the Hamilton–Jacobi–Bellman (HJB) equations are:

$$ rV^{i} = \mathop {\text{Max}}\limits_{{e_{i} ,w_{i} }} \left[ {e_{i} \left( {a - {{e_{i} } \mathord{\left/ {\vphantom {{e_{i} } 2}} \right. \kern-0pt} 2}} \right) - {{cP^{2} } \mathord{\left/ {\vphantom {{cP^{2} } 2}} \right. \kern-0pt} 2} - {{w_{i}^{2} } \mathord{\left/ {\vphantom {{w_{i}^{2} } 2}} \right. \kern-0pt} 2} + V_{P}^{i} \left( {e_{1} + e_{2} - AP} \right) + V_{A}^{i} \left( {w_{1} + w_{2} - \gamma P} \right)} \right] $$
(31)

where \( V^{i} \left( {A,P} \right) \) is player i’s value function, \( i = 1,2 \).

Player i’s necessary conditions for Markov perfect Nash equilibrium in the case of an interior solution are written as:

$$ e_{i}^{mp} = a + V_{P}^{i} \left( {A,P} \right) $$
(32)
$$ w_{i}^{mp} = V_{A}^{i} \left( {A,P} \right) $$
(33)

After substituting the necessary conditions of optimality into the HJB equations, we obtain:

$$ rV^{i} = {{\left( {a + V_{P}^{i} } \right)^{2} } \mathord{\left/ {\vphantom {{\left( {a + V_{P}^{i} } \right)^{2} } 2}} \right. \kern-0pt} 2} - {{cP^{2} } \mathord{\left/ {\vphantom {{cP^{2} } 2}} \right. \kern-0pt} 2} + {{\left( {V_{A}^{i} } \right)^{2} } \mathord{\left/ {\vphantom {{\left( {V_{A}^{i} } \right)^{2} } 2}} \right. \kern-0pt} 2} + V_{P}^{i} \left( {V_{P}^{j} - AP} \right) + V_{A}^{i} \left( {V_{A}^{j} - \gamma P} \right) + aV_{P}^{i} $$
(34)

\( i,j = 1,2,\;i \ne j \). Invoking the symmetry assumption, we write: \( V^{i} = V \), \( V_{P}^{i} = V_{P} \) and \( V_{A}^{i} = V_{A} \), \( i = 1,2 \), which leads us to rewrite (34) as follows:

$$ rV^{i} = {{a^{2} } \mathord{\left/ {\vphantom {{a^{2} } 2}} \right. \kern-0pt} 2} + \left( {{{3V_{P} } \mathord{\left/ {\vphantom {{3V_{P} } 2}} \right. \kern-0pt} 2} + 2a} \right)V_{P} + {{3\left( {V_{A} } \right)^{2} } \mathord{\left/ {\vphantom {{3\left( {V_{A} } \right)^{2} } 2}} \right. \kern-0pt} 2} - {{cP^{2} } \mathord{\left/ {\vphantom {{cP^{2} } 2}} \right. \kern-0pt} 2} - \gamma V_{A} P - V_{P} AP $$
(35)

In linear-quadratic transboundary pollution models, a quadratic approximation of the value function for the evaluation of the MPNE is usually considered (see, e.g., Long 2010, Section 1.2.4). In the context of our non-linear quadratic model, assuming a local quadratic approximation of the form, with real parameters \( \alpha_{1} ,..,\alpha_{6} \), that is:

$$ V = \alpha_{1} + \alpha_{2} A + \alpha_{3} P + \alpha_{4} A^{2} + \alpha_{5} P^{2} + \alpha_{6} AP $$

may however not be relevant because it would lead to inclusion of two third-order terms, respectively proportional to \( A^{2} P \) and \( AP^{2} \), in the approximation of the HJB Eq. (35), that are likely non-negligible near the steady state. So, some care is needed to design the local quadratic approximation, to make such third-order terms locally negligible.

Assumption 1

We consider for the value function a local quadratic approximation with real parameters \( \alpha_{1} ,..,\alpha_{6} \), that is:

$$ V = \alpha_{1} + \alpha_{2} \left( {A - A_{\infty }^{mp} } \right) + \alpha_{3} \left( {P - P_{\infty }^{mp} } \right) + \alpha_{4} \left( {A - A_{\infty }^{mp} } \right)^{2} + \alpha_{5} \left( {P - P_{\infty }^{mp} } \right)^{2} + \alpha_{6} \left( {A - A_{\infty }^{mp} } \right)\left( {P - P_{\infty }^{mp} } \right) $$
(36)

where \( A_{\infty }^{mp} \) and \( P_{\infty }^{mp} \) are the steady state values (to be determined) of \( A \) and \( P \) for the MPNE. We assume that the parameters of the problem are such that the steady state values of all the state and control variables are positive. Note that the steady state utility, or social welfare of future generations, is approximated by \( r\alpha_{1} \).

By substituting the expression in (36) and its respective partial derivatives in (35), two third-order terms, respectively proportional to \( \left( {A - A_{\infty }^{mp} } \right)^{2} \left( {P - P_{\infty }^{mp} } \right) \) and \( \left( {A - A_{\infty }^{mp} } \right)\left( {P - P_{\infty }^{mp} } \right)^{2} \), appear in the approximation of the HJB equation, that are clearly negligible near the steady state and can therefore be neglected. Though this is sufficient to ensure the accuracy of our local quadratic approximation in (36), additional justifications based on a numerical procedure are provided hereafter; they compare this local approximation with the one obtained by a different procedure.

Proposition 5

Under Assumption 1, the MPNE strategies near the steady state are given by:

$$ e^{mp} = a + \alpha_{3} + 2\alpha_{5} \left( {P - P_{\infty }^{mp} } \right) + \alpha_{6} \left( {A - A_{\infty }^{mp} } \right) $$
(37)
$$ w^{mp} = \alpha_{2} + 2\alpha_{4} \left( {A - A_{\infty }^{mp} } \right) + \alpha_{6} \left( {P - P_{\infty }^{mp} } \right) $$
(38)

and the MPNE steady state is given by:

$$ \left[ {P_{\infty }^{mp} ,A_{\infty }^{mp} ,e_{\infty }^{mp} ,w_{\infty }^{mp} } \right]^{T} = \left[ {2\alpha_{2} /\gamma ,\gamma \left( {a + \alpha_{3} } \right)/\alpha_{2} ,a + \alpha_{3} ,\alpha_{2} } \right]^{T} $$
(39)

Proof

See "Appendix A9".

From (38) and (39), we note that both emissions and restoration effort MPNE strategies are locally linear. However, \( \alpha_{2} > 0 \) and \( - a < \alpha_{3} < 0 \) are required for a feasible steady state solution. The steady state solution involved by (39), if any, should be globally asymptotically stable for the linearized dynamical system presented in the proof of Proposition 5 (see "Appendix A9"). The Jacobian matrix associated with the MPNE controlled system is:

$$ J^{mp} = \left[ {\begin{array}{*{20}l} {4\alpha_{5} - A_{\infty }^{mp} } \hfill & {2\alpha_{6} - P_{\infty }^{mp} } \hfill \\ {2\alpha {}_{6} - \gamma } \hfill & {4\alpha_{4} } \hfill \\ \end{array} } \right] $$
(40)

Using (39), we get the trace and determinant of the Jacobian matrix, that is, respectively:

$$ Tr\left( {J^{mp} } \right) = 4\left( {\alpha_{4} + \alpha_{5} } \right) - \frac{{\gamma \left( {a + \alpha_{3} } \right)}}{{\alpha_{2} }},\;\;\;\left| {J^{mp} } \right| = 2\left\{ {2\alpha_{4} \left[ {4\alpha_{5} - \frac{{\gamma \left( {a + \alpha_{3} } \right)}}{{\alpha_{2} }}} \right] - \left( {2\alpha_{6} - \gamma } \right)\left( {\alpha_{6} - \frac{{\alpha_{2} }}{\gamma }} \right)} \right\} $$

The global asymptotic stability property imposes that \( Tr\left( {J^{mp} } \right) < 0 \) and \( \left| {J^{mp} } \right| > 0 \), and that both the eigenvalues \( \psi_{1,2} \) of the Jacobian matrix, that is:

$$ \psi_{1,2} = \frac{{\gamma \left[ {4\alpha_{2} \left( {\alpha_{4} + \alpha_{5} } \right) - \gamma \left( {a + \alpha_{3} } \right)} \right] \pm \sqrt {\gamma^{2} \left[ {4\alpha_{2} \left( {\alpha_{4} - \alpha_{5} } \right) + \gamma \left( {a + \alpha_{3} } \right)} \right]^{2} + 8\gamma \alpha_{2}^{2} \left( {2\alpha_{6} - \gamma } \right)\left( {\gamma \alpha_{6} - \alpha_{2} } \right)} }}{{2\gamma \alpha_{2} }} $$

have negative real parts.

Because our feedback solution is approximated around a small neighborhood of the steady state, convergence to the actual solution, if it exists, should be quite satisfactory. To confirm the accuracy of the quadratic model adopted in Assumption 1, we also use an alternative approach based on the collocation method for PDEs (e.g., Judd 1998; Doraszelski 2003; Dawid et al. 2015; Jaakkola 2015) to compute the MPNE strategies numerically (see "Appendix A10"). For the following constellation of parameters, that is:

$$ r = 0.075,\;\;\gamma = 0.05,\;\;a = 0.006,\;\;c = 0.035, $$

we used the approach described in "Appendix A9" and identified two MPNE strategies, each of them resulting into a globally asymptotically stable steady state for the associated linearized system. We selected the most profitable Nash equilibrium strategy according to the objective criterion in (3). The selected MPNE is a more profitable (higher) equilibrium because it is characterized by greater environmental absorption efficiency and lower pollution stock at the steady state than the less profitable (lower) MPNE. Note that, in practice, convergence to the higher equilibrium is not granted, as the lower equilibrium may well arise as a poverty-trap (e.g., Azariadis and Stachurski 2005) if the players are unable to coordinate. Therefore, a coordination device on the higher equilibrium is needed to escape from the poverty-trap and therefore allow for an increase in welfare to both players. The coefficient values of the local quadratic approximation of the value function for the higher equilibrium are given in Table 1.

Table 1 Coefficient values of the quadratic approximation of the value function for the higher equilibrium

The corresponding steady state is given by:

$$ \left( {P_{\infty }^{mp} ,A_{\infty }^{mp} ,e_{\infty }^{mp} ,w_{\infty }^{mp} } \right) = \left( {2.880045 \cdot 10^{ - 2} ,1.978042 \cdot 10^{ - 1} ,2.848426 \cdot 10^{ - 3} ,7.200114 \cdot 10^{ - 4} } \right) $$

along with the corresponding negative eigenvalues:

$$ \left( {\psi_{1} ,\psi_{2} } \right) = \left( { - 3.103781 \cdot 10^{ - 1} , - 5.767552 \cdot 10^{ - 3} } \right) $$

By numerically comparing the local quadratic approximation of the value function on a neighborhood of the steady state solution with its local approximation obtained using the collocation method on the same neighborhood (see "Appendix A10") with Matlab, a very small difference between the two approximations is obtained, as shown in Fig. 2.

Fig. 2
figure 2

Compared approximations of the value function with local quadratic conjecture and collocation method. a Local quadratic conjecture b collocation method

Figure 3 compares the two approximations in terms of the relative error of the quadratic approximation with respect to the one obtained by the collocation method. Such a relative error is defined as the absolute value of their difference, divided by the average on the neighborhood of the absolute value of the approximation obtained by the collocation method (the average of the absolute value is introduced to avoid an infinite relative error in case of zero crossings of the approximation). As one can see from the figure, a negligible relative error is obtained, which suggests that the initial conjectured quadratic approximation was so good that only a negligible update was performed by the collocation method to reach the default tolerance.Footnote 5

Fig. 3
figure 3

Relative error of the local quadratic approximation of the value function with respect to the local approximation obtained with the collocation method

For the local quadratic approximation, Fig. 4 shows the Bellman residual error, defined as:

$$ rV - \left[ {a^{2} /2 + \left( {3V_{P} /2 + 2a} \right)V_{P} + 3\left( {V_{A} } \right)^{2} /2 - cP^{2} /2 - \gamma V_{A} P - V_{P} AP} \right] $$

and provides a measure of the quality of the approximation, as it expresses the pointwise violation of the HJB Eq. (35). The figure shows a very small Bellman residual error. Moreover, for the local quadratic approximation, when it is computed in \( (A_{\infty }^{mp} ,P_{\infty }^{mp} ) \), the absolute value of the ratio between the Bellman residual error and the left-hand side of the HJB Eq. (35) is \( 3.4766 \cdot 10^{ - 7} \). Similar results hold for the approximation by the collocation method.

Fig. 4
figure 4

Bellman residual error for the local quadratic approximation of the value function

Finally, for the same choice of the problem parameters, we compare the value \( \alpha_{1} \) assumed in \( (A_{\infty }^{mp} ,P_{\infty }^{mp} ) \) by the local quadratic approximation of the value function, with the expression:

$$ \left.\left[ {e_{\infty }^{mp} \left( {a - e_{\infty }^{mp} /2} \right) - c\left( {P_{\infty }^{mp} } \right)^{2} /2 - \left( {w_{\infty }^{mp} } \right)^{2} /2} \right]\right/r $$

that is obtained by replacing in (3) the resulting approximation of the steady state solution for the MPNE: the two values so obtained (\( - 2.321435 \cdot 10^{ - 5} \) and \( - 2.321434 \cdot 10^{ - 5} \), respectively) are practically coincident. The results obtained clearly support the validity of the local quadratic approximation in Assumption 1.Footnote 6

6 Comparative analysis

In this section, we determine the main differences between the cooperative, OLNE and approximated MPNE strategies in terms of steady state and transient path. To do so, we use both analytical and numerical results.

Based on our analytical results for the cooperative and OLNE cases, we first set some important comparisons.

Proposition 6

It holds that:

$$ e_{i\infty }^{on} = e_{i\infty } $$
(41)
$$ P_{\infty }^{on} > P_{\infty } $$
(42)
$$ w_{i\infty }^{on} > w_{i\infty } $$
(43)
$$ A_{\infty }^{on} < A_{\infty } $$
(44)
$$ \sum\nolimits_{i} {u_{i\infty }^{on} } < u_{\infty } $$
(45)

Proof

See "Appendix A10".

That is, the steady state global emissions in the cooperative and the OLNE settings are equal. The same conclusion can be obtained with a logarithmic revenue function of current rate of emissions, that is, \( a_{i} \text{ln}e_{i} (t) \), \( a_{i} > 0 \), \( i = 1,2 \), and with a linear pollution cost, that is, \( cP(t) \), which makes it a robust result in that it is driven neither by the functional form of revenue function nor by that of pollution costs. Actually, this result is due to the particular transition equations assumed, which allow switching of the biosphere from a carbon sink to a source, and the ability to invest in the restoration of environmental absorption efficiency. The main implication of (41) is that the OLNE steady state imposes no sacrifice on future generations in terms of emissions compared with the cooperative steady state. This result departs from other transboundary pollution game models in which environmental absorption efficiency is represented as an instantaneous function of the pollution stock in that these models suggest that the steady state emissions in the cooperative setting should be lower than in an OLNE.

On the other hand, the result in (42) states that the steady state pollution is lower in the cooperative solution than in the OLNE. Intuitively, the difference between the two outcomes results from the fact that the players in the OLNE game do not take into account the social cost of a marginal increment in the pollution stock as in the cooperative game, but only their private marginal user cost. The result in (43) indicates that, compared with the cooperative solution, an OLNE leads each player to over-invest in environmental absorption efficiency. Note that the dominance of the OLNE investment in absorption efficiency over the cooperative investment effort refers to the steady state, which by definition is an asymptotic outcome where player i’s investment is needed only to make up for the destructive impact of the pollution stock on environmental absorption efficiency. Relatedly, in (44), steady state absorption efficiency is greater in the cooperative case than in the OLNE case. The intuition behind this result is that over-investment in absorption efficiency involved by the OLNE solution at the steady state compared to the cooperative solution does not suffice to negate the impact of the excess emissions during the transient path. This point will be checked on a numerical basis below. At the steady state, cooperative and OLNE emissions are equivalent, which implies equivalent revenues. However, the non-cooperative steady state involves not only a greater cost of pollution than the cooperative setting but also a greater cost of investment effort in environmental absorption efficiency. Therefore, future generations are better off if the players decide to cooperate rather than to precommit, as shown in (45).

Similar analytical comparisons involving the MPNE case are unavailable. To compare the cooperative, OLNE and MPNE solutions on a numerical basis, we select the base case parameter values in Table 2.

Table 2 Base case and range values

The base case in Table 2 reflects a favorable configuration which is characterized in relative terms by players’ patience (i.e., low discounting rate, \( r \)), strong environmental resilience (i.e., low destructive impact of pollution on absorption efficiency, \( \gamma \)), high marginal incentive for emissions (i.e., high upper bound on emissions, \( 2a \)) and low marginal pollution cost (i.e., low pollution cost function coefficient, \( c \)). The numerical solutions of the cooperative, OLNE and MPNE cases were computed with Maple 18.0.

To assess the sensitivity of the steady states, a broad range of values were used for all parameters (Table 2). In the MPNE case, the solutions were calculated for 10,000 combinations of the parameters with 10 values for each parameter \( r \), \( \gamma \), \( a \), \( c \). Among the 10,000 trials, 141 combinations of parameters gave rise to a single locally asymptotically stable solution and 162 combinations resulted in two locally asymptotically stable solutions. In the remaining 9697 combinations of parameters, no MPNE stable solution could be found.

We proceed in four steps. We start by identifying the stable steady states respectively associated with the cooperative, OLNE and MPNE solutions in the base case. Then we assess the sensitivity of these steady states to the various parameters in terms of comparative statics, whenever possible. We also represent the phase diagram in the state space, and the transient paths of the emissions rate and restoration effort for each case. Finally, we characterize the nature of convergence to the steady state as a function of the magnitude of the discounting rate for each case.

To compare the transient behaviors associated with the cooperative, OLNE and MPNE solutions, we assumed in all the subsequent cases a large initial pollution stock, i.e., significantly larger than the maximum corresponding stable steady state value resulting from the three solutions (\( P_{0} = 0.1 \)). In contrast, to assess the sensitivity of the transient paths to the magnitude of the initial absorption efficiency, we used two initial values, one being greater than the maximum stable steady state value among the three solutions, that is, \( A_{0} = 0.45 \), and the other being lower than the minimum stable steady state value among the three solutions, that is, \( A_{0} = 0.15 \).

We are primarily interested in stable interior solutions. As for the cooperative and OLNE solutions, the system of 8 non-linear equations derived for the MPNE gave rise to a single Nash equilibrium with globally asymptotically stable steady state. In the cooperative, OLNE and MPNE cases, the respective stability conditions were all satisfied. That is, the saddle-point property was fulfilled with two positive and two negative eigenvalues with no imaginary part in the cooperative case, \( \left( {\chi_{1} ,\chi_{3} } \right) = \left( { - 0. 4 2 7 ,- 0. 0 0 4} \right) \), and in the OLNE case, \( \left( {\xi_{1} ,\xi_{3} } \right) = \left( { - 0. 3 0 5 ,- 0. 0 0 8} \right) \). In the MPNE case, the global asymptotic stability criterion was granted with two negative eigenvalues with no imaginary part, \( \left( {\psi_{1} ,\psi_{2} } \right) = \left( { - 0. 0 1, - 0. 2 3 5} \right) \).

The main results are illustrated in Fig. 5. A unique (non-trivial) steady state is observed in each case, that is, \( B \) (i.e., intersection of the isoclines \( \dot{P} = 0 \) and \( \dot{A} = 0 \)) for the cooperative solution with \( \left( {A_{\infty } ,P_{\infty } } \right) = \left( { 0. 3 7 7 6, 0. 0 2 5} \right) \), \( B^{on} \) (i.e., \( \dot{P}^{on} = 0 \) and \( \dot{A}^{on} = 0 \)) for the OLNE with \( \left( {A_{\infty }^{on} ,P_{\infty }^{on} } \right) = \left( {0.2684,0.035} \right) \), and \( B^{mp} \) (i.e., \( \dot{P}^{mp} = 0 \) and \( \dot{A}^{mp} = 0 \)) for the MPNE with \( \left( {A_{\infty }^{mp} ,P_{\infty }^{mp} } \right) = \left( { 0. 1 8 8 3,0. 0 4 3 9} \right) \).

Fig. 5
figure 5

Comparison between cooperative, OLNE and MPNE steady states

For the chosen constellation of parameters, the phase diagram for the MPNE (Fig. 6) confirms that all paths starting from a reasonable neighborhood around the steady state, \( B^{mp} \), converge to the steady state. From (slightly) negative initial absorption efficiency, the convergence to the steady state is also granted with initially increasing then decreasing pollution stock. This pattern is driven by the fact that emissions reduction, though useful, is insufficient under negative absorption efficiency and only heavy restoration efforts can turn the biosphere to pollution sink. Therefore, the pollution stock can never decrease before a strengthening of the absorption efficiency. A similar requirement is observed in a cooperative solution with clockwise oscillatory convergence to the steady state (e.g., El Ouardighi et al. 2015a).

Fig. 6
figure 6

Phase diagram in the state space \( \left( {A,P} \right) \) for Markov perfect Nash equilibrium

In Fig. 5, we obtain \( P_{\infty }^{mp} > P_{\infty }^{on} > P_{\infty } \). This result is consistent with other differential game models that represent environmental absorption efficiency as an instantaneous function of the pollution stock because our model shows that the steady state pollution stock is lower in an OLNE than in a MPNE. Moreover, we also get \( A_{\infty }^{mp} < A_{\infty }^{on} < A_{\infty } \), so that the absorption efficiency in an MPNE is substantially lower than in an OLNE in spite of lower emissions and greater restoration efforts at the steady state. Therefore, the MPNE strategy leads to a stable steady state, which is more likely to be located in the emergency area, whereas the cooperative and OLNE strategies result in a steady state that has a better chance of falling into the sustainability area.

The figures below describe the sensitivity of the steady state to the parameters’ values for the control variables (Fig. 7a) and the state variables (Fig. 7b) in the three equilibria.

Fig. 7
figure 7

Sensitivity analysis for cooperative, OLNE and MPNE stable steady states. a Control variables b state variables

In the cooperative, OLNE and MPNE cases, a lower discounting rate, \( r \), moves the steady state further toward the sustainability area (7b). This suggests that players’ patience enhances the long run efficiency of the biosphere as a pollution sink because it leads to the substitution of lower emissions during the transient path for increased emissions at the steady state. Correlatively, the restoration effort should be reduced at the steady state (7a). A greater pollution cost coefficient, \( c \), also moves the steady state further toward the sustainability area in the cooperative, OLNE and MPNE cases (7b). Though it reduces the restoration effort at the steady state, it does not affect the emissions in the cooperative and OLNE cases, which suggests that emissions are actually reduced during the transient path only. Regarding the influence of an increase in the destructive impact of pollution on absorption efficiency, \( \gamma \), it results in lower emissions and greater restoration efforts at the steady state in all equilibria (7a). A move of the steady state toward the emergency area is observed in all equilibria, with reduced absorption efficiency and an increased pollution stock (7b). Finally, an increase in the revenue function coefficient, \( a \), leads to a complementary increase in the cooperative, OLNE and MPNE steady state pollution stock and absorption efficiency (7a), which results in both greater emissions and restoration efforts (7b).

In Fig. 8, the phase plane in the state space is depicted for the cooperative and non-cooperative solutions. In the case of (relatively) high initial absorption efficiency, i.e., \( A_{0} = 0.45 \), the cooperative, OLNE and MPNE strategies consist in decreasing the absorption efficiency slightly and the pollution stock dramatically in a finite time, and then decreasing the absorption efficiency significantly and increasing the pollution stock slightly until the corresponding steady state is reached, that is, \( B \) for the cooperative solution, \( B^{on} \) for the OLNE, and \( B^{mp} \) for the MPNE. In the case of (relatively) low initial absorption efficiency, i.e., \( A_{0} = 0.15 \), the cooperative, OLNE and MPNE strategies also consist in substantially decreasing the pollution stock first, and then in increasing the absorption efficiency while converging to the steady state. Overall, the cooperative, OLNE and MPNE transient paths are quite similar and differ only in their steady state respective values. Whatever the initially (positive) efficiency of pollution sinks, it is optimal in all cases to reduce the pollution stock substantially before strengthening the biosphere as a pollution sink. However, although a contingent strategy preserves the long run absorption efficiency somewhat, OLNE is more effective than an MPNE in managing pollution sinks’ efficiency.

Fig. 8
figure 8

Phase plane in the state space for the cooperative, open-loop and Markov perfect Nash cases from low and high absorption efficiency

The patterns in Fig. 8 are explained by the time paths of emissions rate and restoration effort shown in Fig. 9, where time is expressed as a logarithmic function along the X-axis. We first observe that in all equilibria, the greater the initial absorption efficiency, the greater the emissions rate and the lower the restoration effort.

Fig. 9
figure 9

Transient paths of emissions rate and restoration effort for the cooperative, OLNE and MPNE cases from low and high absorption efficiency. a Emissions rate b restoration effort

In the case of (relatively) high initial absorption efficiency, i.e., \( A_{0} = 0.45 \), the OLNE strategy exhibits greater emissions and initially lower and then greater restoration efforts than the cooperative strategy. In contrast, the MPNE strategy shows initially greater then lower emissions, and initially lower then greater restoration efforts than the cooperative strategy. The non-cooperative equilibria under high initial absorption efficiency are consistent with the tragedy of the commons in that they reflect both excessive emissions and underinvestment in absorption efficiency over the most economically influential part of the planning horizon, i.e., the time interval during which the discounted instantaneous profits are still strictly positive.

In the case of (relatively) low initial absorption efficiency, i.e., \( A_{0} = 0.15 \), the optimal policy in all equilibria consists of initially small and increasing emissions in gradual stages and initially positive and decreasing restoration efforts. The first jump in emissions occurs quickly, immediately after the dramatic decrease in the pollution stock. Along the time path, cooperative emissions are lower than OLNE emissions over a wide time interval, until the steady state where the emissions are the same in the cooperative solution and OLNE (\( e_{\infty }^{on} = e_{\infty } = 0.0047 \)). Lower transient emissions in the cooperative case require less restoration efforts than in an OLNE during the transient path and in the steady state (\( w_{\infty } = 0. 0 0 0 6 { < }\,w_{\infty }^{on} = 0. 0 0 0 8 \)). In an MPNE, emissions alternate between greater and smaller values than in the cooperative solution until the steady state (\( e_{\infty }^{mp} = 0. 0 0 4 1 \)) is reached. Restoration efforts are always greater than in the cooperative solution until the steady state (\( w_{\infty }^{mp} = 0. 0 0 1 \)) is attained. The emissions in an MPNE are lower and restoration efforts alternate between greater and smaller values than in the OLNE until the steady state.

Because transient emissions in an OLNE are greater than in a cooperative solution as a result of greater transient restoration efforts, the OLNE strategy under low initial absorption efficiency is consistent with the assumption of a voracity effect in that it transiently promotes both excessive emissions and overinvestment in absorption efficiency over the planning horizon (Benchekroun and Chaudhuri 2014). By comparison, an MPNE promotes lumpier substitutability between the two policy instruments than an OLNE does. At the steady state, the magnitude of emissions in an MPNE is limited by low absorption efficiency.

Figure 10 reports the overall utility associated with the cooperative, OLNE and MPNE cases under (relatively) high and low initial absorption efficiency, respectively.

Fig. 10
figure 10

Overall utility for the cooperative, OLNE and MPNE cases from low and high initial absorption efficiency

In the context of high initial absorption efficiency, the players’ joint overall utility is positive for all cases. Note that the overall utility under cooperation and the joint overall utility under non-cooperative equilibrium with commitment are almost similar. That is, the welfare cost for players with OLNE strategies that are not able to cooperate is limited. In comparison, the welfare cost for MPNE strategies regarding cooperation or commitment is greater. Under low initial absorption efficiency, the players’ overall utility is positive only for cooperation, while it is zero for OLNE and negative for MPNE. This decline in joint overall utility of non-cooperative strategies might be seen as a confirmation of the presence of a voracity effect. Here, the welfare cost for non-cooperative players that are unable to cooperate, and the welfare cost for players with MPNE strategies that are unable to commit are both significantly greater than previously. In terms of elasticity of the overall joint utility with respect to the initial absorption efficiency, denoted by \( \psi \) and defined as the percentage of decrease of the overall utility divided by the percentage of decrease of the initial absorption efficiency, we get \( \psi_{{{U \mathord{\left/ {\vphantom {U {A_{0} }}} \right. \kern-0pt} {A_{0} }}}} = 0.876 \) in the cooperative case, \( \psi_{{{{\sum\nolimits_{i} {U_{i} } } \mathord{\left/ {\vphantom {{\sum\nolimits_{i} {U_{i} } } {A_{0} }}} \right. \kern-0pt} {A_{0} }}}}^{on} = 1.49 \) in the OLNE case and \( \psi_{{{{\sum\nolimits_{i} {V^{i} } } \mathord{\left/ {\vphantom {{\sum\nolimits_{i} {V^{i} } } {A_{0} }}} \right. \kern-0pt} {A_{0} }}}}^{mp} = 2.399 \) in the MPNE case. These results imply that contingent strategies are more vulnerable in terms of social welfare to a decrease in initial absorption efficiency than commitment strategies are, which in turn are more affected than cooperative strategies are. That is, the absence of cooperation or commitment becomes more costly as the initial absorption efficiency of pollution sinks tends to exhaustion. This result explains why nations have been urged in recent years to commit to pollution control at an international level (Stern 2006; IPCC 2007; Stern 2015).

Finally, the steady state utility, or social welfare of future generations, is positive in the cooperative, OLNE and MPNE cases and its ranking is consistent with that reported in Fig. 10 from high initial absorption efficiency. The welfare cost of future generations is limited for OLNE strategies regarding cooperative strategy and significant for MPNE strategies regarding cooperation or commitment. In all cases, the decrease in initial absorption efficiency is more costly to current generations than to future ones because the future generations neither benefit from the revenues nor incur the costs related to the transient path toward the steady state.

Table 3 provides insights into the nature of convergence to the steady state depending on the discounting rate value for the set of lower bound values. The value of the discounting rate above, with which the path to the steady state becomes diverging in the case of the MPNE, is relatively low (\( r = 0.15 \)). In comparison with this threshold, the value of the discounting rate above, with which the saddle-path to the steady state becomes diverging, is greater both in cooperative solution and OLNE (i.e., \( r = 0.24 \)). As a result, more heavily discounted MPNE strategies are less robust to a definitive switching of pollution sinks to a source, while such switching would not occur in an OLNE. This implies that relatively impatient players should not opt for contingent strategies because they are more likely to make the evolution of the biosphere irreversibly uncontrollable. It is noteworthy that the threshold value of the discounting rate from which cooperative and OLNE paths are diverging is still monotonic, which departs from Tahvonen and Withagen (1996), who argue that an optimal path leading to irreversible pollution is typically non-monotonic.

Table 3 Threshold of impact of the discounting rate on the nature of convergence to the steady state

Figure 11 confirms the irreversible impact of a higher discounting rate (\( r = 0.15 \)) on the MPNE trajectory. Along the MPNE diverging path, which is also monotonic, restoration efforts are initially low and convexly increasing, but less rapidly than emissions. This pattern suggests that MPNE strategies where transient emissions and restoration efforts act as complements rather than substitutes are likely to result in a definitive switching of pollution sinks to a source. In comparison, although both cooperative and OLNE paths remain monotonic, they are more likely to end up in the emergency area. This result confirms that although commitment and cooperative strategies are more robust than MPNE strategies to a definitive switching of pollution sinks to a source, they are nevertheless vulnerable to players’ impatience, which directly affects future generations’ social welfare.

Fig. 11
figure 11

Phase plane in the state space for the cooperative, OLNE and MPNE cases from low absorption efficiency and higher discounting rate

7 Conclusion

In this paper, we suggest a non-linear quadratic differential game model of pollution control with a time-dependent environmental absorption efficiency that allows the biosphere to switch from a carbon sink to a source.

Using a cooperative solution as a benchmark, we showed that the fact that the players do not properly internalize the social cost of the pollution stock in the non-cooperative cases incites them to excessively substitute their transient path for steady state utility. This leads them to a disproportionate use of strategic substitutability between polluting emissions and restoration of environmental absorption efficiency during the transient path, which reduces the efficiency of carbon sinks and increases the pollution stock at the steady state. In both cooperative and non-cooperative steady states, the magnitude of emissions increases with absorption efficiency, while restoration efforts increase with the pollution stock, and vice versa.

We were then able to identify which decision rule, open-loop or Markov perfect Nash strategy, best prevents the biosphere from switching from a pollution sink to a source. Although it fails to reach the same environmental absorption efficiency as a cooperative solution, an OLNE imposes limited economic sacrifice at the steady state notably because it allows equivalent polluting emissions to the social optimum. This result justifies the optimistic tone of the call by some authors for rapid action against climate change (Stern 2015). However, an OLNE requires a greater restoration effort at the steady state than a cooperative solution does to partly compensate for the excessive emissions released during the transient path.

If the biosphere is still an efficient pollution sink, it is optimal in all cases to reach the steady state both by an overshooting emissions trajectory and an undershooting restoration efforts trajectory. If the efficiency of the biosphere as a pollution sink is already low, it is optimal to set initially low emissions and high restoration efforts, and then to increase emissions gradually as absorption efficiency increases, and decrease restoration efforts as the pollution stock declines. Our results suggest that the absence of cooperation and/or precommitment becomes more costly as the initial efficiency of pollution sinks tends to exhaustion. Also, a decrease in initial efficiency of pollution sinks is more costly to current generations than to future ones in both cooperative and non-cooperative strategies.

When both players are sufficiently patient, an MPNE and an OLNE do not differ much in terms of transient path and steady state. In this context, both strategies can prevent pollution sinks from turning into a source during the transient path and the steady state, including in the case where the absorption efficiency is low. However, although an MPNE may generate lower emissions and greater restoration efforts than an OLNE in the case of low absorption efficiency, it has a much greater chance of falling into the emergency area. In contrast, when both players are relatively impatient, MPNE strategies are less robust than OLNE strategies at preventing irreversible switching of the biosphere from a carbon sink to a source. Although more heavily discounted OLNE strategies can prevent pollution sinks from transiently turning into a source, these strategies are more likely to end up in the emergency area.

Regarding the methodological contribution of the paper, we suggest a particular structure of the conjectured value function to solve MPNE problems with multiplicative interaction between state variables in one state equation, so that third-order terms that arise in the Hamilton–Jacobi-Bellman equation are made negligible. Using a collocation procedure, we confirm the validity of the particular structure of the conjectured value function.

Overall, our results provide an update in the debate on pollution control in that they suggest that the absence of commitment at an international level on pollution control can preserve both current and future generations from experiencing switching of carbon sinks to a source only if the players remain sufficiently patient. More importantly, regardless of whether polluters commit to a plan of action or make contingent decisions, our study emphasizes the need for environmental policies where efforts to restore environmental absorption efficiency are an essential policy instrument.

The scope of our model could be extended to the case where the marginal destructive impact of the pollution stock on the evolution of absorption efficiency can be reduced by an effort to improve environmental resilience. Another possible extension of the model presented here could assume that the impact of restoration efforts on the evolution of absorption efficiency depends concavely on the level of absorption efficiency. These assumptions would require multiplicative separability between control and state variables, which would allow the emergence of a Skiba threshold (Grass et al. 2008) with multiple steady states and lead to the determination of history-dependent policies. Other future directions of research include an analysis of the Pareto optimality of the obtained equilibria and an extension of the model to the case of taxation rules.