1 Introduction

Technological progress is by far the most important determinant of economic growth in industrialized economies. During the last two decades, after the seminal works of Romer (1986), Aghion and Howitt (1992), and Grossman and Helpman (1994), many efforts have been put forward to try understanding and explaining the sources of technological advances. All the resulting works take the nature of the research sector as given, and the interaction among firms in the research industry has never been analyzed in depth thus far.Footnote 1 This is, however, one of the main fields of interest of computational and evolutionary economics; heterogeneous agent models, for instance, may help in explaining how innovation occurs, which are the dynamics of innovation and how innovation determines technological progress (see Dawid (2006) for an extensive survey). The goal of this paper is to bridge these two different branches of literature by developing a stylized but analytically tractable and micro-founded agent-based model of innovation to shed some light on the role that interactions among research firms might play in the process of economic growth. Once an almost traditional economic growth model is extended to allow some form of interaction among research firms along the lines outlined in Brock and Durlauf (2001) and Blume and Durlauf (2003), some traditional results, like the uniqueness of equilibrium, found in growth theory vanish. Indeed, such an interaction among research firms, by determining the rate of technological progress, plays a critical role in shaping the whole macroeconomic dynamics. We show that, under certain parameter conditions, the economy may be characterized by a multiplicity of balanced growth path (BGP) equilibria and a situation of low growth trap. We also show that the economy may eventually (endogenously) fluctuate between the low and high BGP, thus generating a growth cycle in which periods of low and high economic growth rates follow one another. Such a cycling behavior is due to the probabilistic nature characterizing the research industry; indeed, under a certain model parameterization the BGP equilibria turn out to be metastable: on a short timescale, they appear to be stable attractors, while on a longer timescale unpredictable random jumps lead the economy to sudden shifts toward the other BPG equilibrium. In this context, economic policy, aiming to modify the incentives associated with research activities, may be very effective in order to completely solve the low growth trap problem, avoiding also fluctuations in economic activity.

Our paper is thus related to different branches of the economic literature, namely computational and evolutionary economics, economic growth and business cycles theory. From the computational and evolutionary economics literature, we simply borrow the interest in analyzing the interaction between research firms and its eventual implications for technological progress and the long-run economic growth (Nelson and Winter 1982; Dawid 2006; Dosi et al. 2010). However, from a methodological point of view our approach is substantially different since we develop a very simple and tractable model, in which most of the results are analytically derived; simulations in our paper play only a marginal role and are instrumental to exemplify some interesting and potential outcomes. Economic growth theory is the main benchmark for our analysis since the model is an almost standard continuous-time model of optimal growth with endogenous technological progress (Acemoglu 2009). With respect to what traditionally assumed in this literature (Romer 1986; Grossman and Helpman 1994), we allow for a certain degree of diffusion in the pattern of innovation, meaning that in our framework technical progress is driven by the interaction among research firms.Footnote 2 To the best of our knowledge, no other study has thus far focused on the firms interaction in the research industry in a way comparable to ours; moreover, all the works identify a unique BGP equilibrium; thus, cyclical behavior cannot occur.Footnote 3 The understanding and characterization of cyclical patterns is the main interest of the business cycle theoryFootnote 4 (Kydland and Prescott 1982; King et al. 1988a, b), which besides adopting a discrete-time frameworkFootnote 5 (Evans et al. 1998; Canton 2002; Furukawa 2007), it also relies upon stochastic growth models in which the source of the shock is completely exogenous (Walde 2005). Probably, the work most close to ours is Bambi et al. (2014), which analyzes an endogenous growth model with expanding product variety showing that cyclical fluctuations may arise as a result of implementation delays in the innovation process. Although their setting is quite similar to ours (an almost traditional endogenous growth model), the mechanism underlying output fluctuations is substantially different since we do not allow for time delays but simply for some sort of interaction among firms operating in the research industry. Moreover, different from theirs, our model shows the existence of a growth trap threshold, allowing to clearly distinguish economies which will experience low and high growth rates, which is again simply due to the interaction among research firms.

Our paper is also closely related to the literature on poverty traps. The eventual existence and characteristics of poverty traps have been extensively analyzed in the literature since the seminal work by Skiba (1978). Different explanations of why multiplicity of equilibria and thus poverty traps may exist have been put forward, and they include increasing returns and imperfect competition, coordination failure, matching problems and increasing returns (see Azariadis and Stachurski 2005 for an exhaustive survey). However, all these theories proposed thus far outline sources of multiplicity in levels, thus suggesting that under certain conditions an economy may eventually end up in poverty that is a situation of stagnation with no long-run growth. Our model instead suggests the potential existence of equilibrium multiplicity in growth rates, meaning that an economy may eventually end up in a situation of long-run growth characterized by low growth rates. In order to distinguish this result from what traditionally discussed in the poverty traps literature, we refer to such an outcome as a “low growth trap.” To the best of our knowledge, apart from the very recent paper by Agénor and Canuto (2015) in an overlapping generation setting, there is no other study characterizing the eventual existence of low growth traps. The implications of the existence of a low growth trap threshold are, however, very intuitive and in line with empirical evidence: some countries will experience fast economic growth while others slow economic growth, meaning that income gaps will tend to widen over time, thus characterizing a situation of long-run divergence, as traditionally found in the empirics on economic growth, especially between developed and developing countriesFootnote 6 (Dowrick 1992; Pritchett 1997). Finally, our model predicts a very important role for economic policy, since in the case of a low growth trap the government, by simply rising the level of taxation on households in order to increase the revenues granted to research firms, may be able to completely solve the trap problem. This does not simply mean that the low growth trap threshold may be exceeded, as the traditional policy implication of poverty trap models (see for example Sachs et al. 2004, or more recently La Torre et al. 2015), but that the threshold itself will cease to exist, thus ensuring that the economy is able to experience fast economic growth.

The paper proceeds as follows. Section 2 focuses on the research industry and describes its peculiarities without considering its implications for the whole economy. Specifically, the research industry is populated by a large number of profit-seeking firms facing a dichotomous choice. On the one hand, these firms are heterogeneous in their propensity to innovate, and on the other hand, their decision whether to innovate or not is partly affected by the behavior of other firms in the industry through an externality component. We characterize the research industry dynamics deriving in the infinite dimensional case an explicit expression which allows us to describe the (aggregate) behavior of research firm in terms of the share of firms actively engaged in research activities. Section 3 integrates the research industry in a traditional macroeconomic model of endogenous growth, where the government finances research by taxing households, and the overall level of technology in the economy depends on the share of firms engaged in innovation. Section 4 shows that the BGP equilibrium, which strictly depends upon the behavior of research firms, may or may not be unique according to the magnitude of the externality-induced profit component; we also characterize the dynamic properties of different BGP equilibria, identifying the eventual existence of a low growth trap along with its policy implications. In Sect. 5, we focus on one important implication of the eventual multiplicity in BGP equilibria for the finite dimensional case; we show that when the number of research firms is finite, the probabilistic nature of the model implies that the locally stable equilibria turn out to be metastable: sudden and unpredictable regime switchings among the low and high regimes happen along trajectories, thus resulting in a cycling economic behavior. In Sect. 6, we propose a generalization of our baseline model in which the incentive to innovation is no longer constant, but it depends on the overall level of technological advancement in the economy; we show that despite the higher degree of sophistication in the model’s structure, the results are qualitatively similar to those in its baseline version. In Sect. 7, we discuss how our model relates to the middle-income trap hypothesis, suggesting that after a first stage of takeoff characterized by rapid growth developing countries may face a significant growth slowdown; differently from previous research which identify mainly inter-sectoral dynamics as a potential source of growth slowdowns, we argue that this may also be the result of intra-sectoral dynamics (driven by social interactions and technology diffusion) within the research industry. Section 8 presents concluding remarks and proposes directions for future research. Technical details about the rationale behind the random utility approach characterizing research firms’ payoff, and the metastability and probabilistic features of the transition times associated with the finite dimensional model are discussed in “Appendices 1 and 2,” respectively.

2 Research activities and intra-industry interactions

We consider a research industry populated by a large number of research firms which try to maximize the profits associated with their research activities; specifically, there exist N firms indexed by \(i=1,\dots , N\). For the sake of simplicity, we assume that the research choice is just binary; thus, we do not try to properly quantify research efforts. Thus, any research firm needs to decide whether to engage in research activities or not; thus, it needs to compare the profit it will obtain by performing research with the zero-profit associated with no research activities.

If a firm actively engages in research activities, it will give rise with no uncertainty to an innovation, which generates a given (fixed) amount of revenues \(h\ge 0\) associated with the sale of the (unitary) innovation.Footnote 7 In order to produce one unit of innovation, the firm faces a (stochastic) production cost \(z+\zeta _i\), where \(z\ge 0\) denotes the cost common to all the firms and \(\zeta _i\) is a random firm-specific shock. Apart from these private components of the profit structure, research profits are also affected by a social component associated with the number of firms actively engaged in research activities. Specifically, the size of the research industry through an externalityFootnote 8 channel determines whether profits, ceteris paribus, tend to rise or fall. There are two different cases that need to be considered: an increase in the number of firms actively engaged in research may increase the profit for the whole research industry and thus rise the profit of the individual research firm; alternatively, an increase in the number of firms actively engaged in research may decrease the profit for the whole research industry and thus lower the profit of the individual research firm. The former case represents the so-called standing-on-the-shoulder effect that is innovation by some firms increases the possibility of further innovation by others, while the latter case the “fishing-out effect,” that is innovation by some firms decreases the possibility of further innovation by others (Jones 2005). Formally, we model the individual firm research profits as in random utility models (see Brock and Durlauf 2001; Barucci and Tolotti 2012). Each firm is thus characterized by its specific innovative attitude \(\omega _{i,t}\in \{0;1\}\), where \(\omega _{i,t}=1\) (\(\omega _{i,t}=0\)) denotes that firm i is (is not) innovating at time t. The decision to engage in research activities to produce innovation is based on the following profit structure:

$$\begin{aligned} \pi _i(\omega _i)=\omega _i\left[ h-(z+\zeta _i) +J\left( \tilde{x}_i^e-\frac{1}{2}\right) \right] . \end{aligned}$$
(1)

If the firm does not innovate (\(\omega _{i,t}=0\)), the profit above is simply null, \(\pi _i(0)=0\). If the firm does innovate (\(\omega _{i,t}=1\)), the profit is equal to \(\pi _i(1)=\left[ h-(z+\zeta _i)+J(\tilde{x}_i^e-1/2)\right] \), where the first two terms represent the private component of profit while the third term is the social component related to the effect of externalities. The impact of the research externality is equal to \(J(\tilde{x}_i^e-1/2)\), where \(J\in {\mathbb {R}}\) determines the sign and the magnitude of the externality effect and \(\tilde{x}_i^e\) is the expectation of firm i about the average of the choices of other firms: \(\tilde{x}_i^e=\frac{1}{N-1}{\mathbb {E}}[\sum _{j\ne i}\omega _j]\). Note that the sign of J determines the type of externality affecting research firms: whenever \(J>0\), individual profits tend to increase as a result of the research performed by others (standing-on-the-shoulder effect), while whenever \(J<0\) individual profits tend to fall (fishing-out effect). The term \((\tilde{x}_i^e-1/2)\) states that in quantifying the impact of the (positive or negative) externality-induced profit component firms look at what the majority of other firms does. Indeed, the term 1/2 refers exactly to one half of the total population of research firms; thus, if \(\tilde{x}_i^e>1/2\), then firm i will expect more than half of the firms to do research. Finally, the random components of cost, \(\zeta _i\), \(i=1,\dots , N\) are i.i.d. random shocks drawn from a common distribution \(\eta \), which affect with different intensity the perceived profit of individual firms. Two remarks on the profit structure are needed. First of all, note that the profit \(\pi \) depends on the subjective expectation of the firm about others’ actions. With this respect, it can be seen as the realized profit once conditioned on agent’s expectation about others’ actions. Secondly, the random component of the profit is entirely related to the cost structure. We could in principle build a profit structure where randomness may impact jointly or separately both revenues and costs. Besides amounting in a more complicated probabilistic structure, this would not have any significant qualitative implications. For a more comprehensive discussion about the rationale behind the profit structure as in (1), we refer the reader to “Appendix 1.”

It can be easily verified that profits as in (1) turn into a probabilistic choice model where

$$\begin{aligned} {\mathbb {P}}(\omega _i=1|\, \tilde{x}_i^e)=\eta \left[ h-z+J\left( \tilde{x}_i^e-\frac{1}{2}\right) \right] . \end{aligned}$$
(2)

As shown in the literature on social interactions (see Blume and Durlauf 2003), a dynamic counterpart of such a model can be derived. Define

$$\begin{aligned} x^N_t=\frac{1}{N} \sum _{i=1}^N \omega _{i,t} \end{aligned}$$
(3)

as the fraction of innovative firms at time t and assume this quantity is observable; we refer to \(x^N_t\) as the “innovation share.” Similarly as in the static model, we assume that firms can decide whether to invest or not at any time t by considering its potential revenue h, cost z and the current value of the innovation share \(\widetilde{x}_i^{e}\). Indeed,

$$\begin{aligned} {\mathbb {P}}(\omega _{i,t+\varDelta t}=1|\, \omega _{i,t}, x^N_t)=\eta \left[ h-z+J\left( x_t^N-\frac{1}{2}\right) \right] . \end{aligned}$$
(4)

It turns out that the Markovian dynamics induced by (4) are difficult to study in the finite dimensional population model; nevertheless, it is possible to describe in closed-form the (deterministic) dynamics emerging from the asymptotic system when letting the number of research firms go to infinity. In particular, the following result describes the time evolution of \(x_t\) which is the fraction of innovative firms at time t when we let \(N\rightarrow \infty \). To this aim, we assume that the shocks \(\zeta _i\) follow a centered logistic distributionFootnote 9 with parameter \(\beta > 0\):

$$\begin{aligned} \eta (x)={\mathbb {P}}(\zeta _i\le x)=\frac{1}{1+e^{-\beta \, x}}. \end{aligned}$$

In this context, \(\beta \) is a measure of the dispersion of opinion in the population of firms: \(\beta =0\) would represent a situation in which the firms decide to innovate or not by tossing a coin; on the contrary, \(\beta \rightarrow \infty \) would mean that the firms do not receive any stochastic signal (i.e., the random cost component) and decide just by looking at the sign of \(h-z+J\left( x_t^N-\frac{1}{2}\right) \). In the next proposition, we provide a law of large numbers Footnote 10 for the stochastic process \(x_t^N\), showing that it converges to a limiting (deterministic) process \(x_t\), whose law of motion is described by a suitable differential equation characterizing the (deterministic) evolution of the share of innovative firms.

Proposition 1

Let \(x^N_t=\frac{1}{N} \sum _{i=1}^N \omega _{i,t}\) be the share of innovative firms at time t. Suppose \(\lim _{N \rightarrow \infty } x^N_0=x_0\). Then, when \(N\rightarrow \infty \), the family of stochastic processes \((x^N)_{N\ge 0}\), where \(x^N{:=}(x^N_t)_{t\ge 0}\), converges almost surely to \(x{:=}(x_t)_{t\ge 0}\), where \(x_t\) solves

$$\begin{aligned} \dot{x}_t=\frac{1}{2}\,\tanh \left\{ \beta \left[ h-z+J\left( x_t-\frac{1}{2}\right) \right] \right\} -x_t+\frac{1}{2}, \end{aligned}$$
(5)

for a given initial condition \(x_0\).

Proof

We can recover the standard Blume and Durlauf (2003) framework by rearranging the state variables to take values on \(\{-1;+1\}\). Define \(\zeta _i=1\) when \(\omega _i=1\) and \(\zeta _i=-1\) when \(\omega _i=0\). In this case, we have that

$$\begin{aligned} {\mathbb {P}}(\zeta _{i,t}=1|\, \zeta _{i,t}, m^N_t)=\eta (h-z+J/2\cdot m^N(t)),\\ {\mathbb {P}}(\zeta _ {i,t}=-1|\, \zeta _{i,t}, m^N_t)=1-\eta (h-z+J/2\cdot m^N(t)), \end{aligned}$$

where now \(m^N(t)=\frac{1}{N} \sum _i \zeta _{i,t}\) takes values on \([-1,1]\). Arguing similarly as in Barucci and Tolotti (2012), it can be shown that, under the assumptions of Proposition 1,

$$\begin{aligned} \lim _{N\rightarrow \infty } m^N_t=m_t, \end{aligned}$$

where \(m_t\) is the unique solution to

$$\begin{aligned} \dot{m}_t=\tanh \left\{ \beta \left( h-z+J \cdot \frac{m_t}{2}\right) \right\} -m_t;\quad m_0=2x_0-1. \end{aligned}$$
(6)

Since \(x_t=\frac{m_t+1}{2}\), Eq. (5) immediately follows. \(\square \)

Proposition 1 allows to approximate the random dynamic behavior of research firms through a deterministic equation which provides us with a simple but useful benchmark to characterize the outcome in the research industry. This allows us to analytically derive the outcome in the approximated deterministic version of the model that we shall introduce in a while and compare this with the “true outcome” in its stochastic version. Note, moreover, that in order to derive the approximated dynamic equation (5), random shocks play an essential role in generating heterogeneity in research firms’ behavior and thus in giving rise to potentially non-trivial outcomes (see “Appendix 1” for further details). In the following, we shall restrict our analysis to the non-trivial situation in which random shocks do affect firms’ decision and thus firms are effectively heterogeneous in their propensity to innovate. In such a framework, the quantity \(x_t\) characterizes the (approximated) fraction of innovative firms in a large economy of research firms subject to externalities and private signals. Since (5) provides us with an explicit expression for describing the behavior of research firms, as we shall see in the next section, it is now straightforward to incorporate the research industry in a canonical endogenous growth model. This allows us to understand to what extent the presence of firm interactions in the research industry is going to affect the macroeconomic outcome, further distinguishing between the standing-on-the-shoulders and the fishing-out cases. Since the role of the fixed cost z is negligible in our setting, for the sake of simplicity in the remainder we will set it equal to zero.

3 The macroeconomic model

Apart from the characterization of the research market which to some extent resembles what discussed in Marchese et al. (2014), the model is an almost standard endogenous growth model characterized by households, productive and research firms, and a government. Households try to maximize their lifetime welfare, by determining how much to consume given the dynamic evolution of capital. Productive firms produce competitively the unique final consumption good, by determining how many workers and how much capital to employ given the available technology. Research firms determine whether to invest or not in innovation, and overall technological progress depends on the share of research firms which actively engage in research activities. The government aiming at maintaining a balanced budget at any point in time levies taxes on households to finance such research activities. Households and productive firms are homogeneous; thus, we analyze their behavior as traditional representative agents. Research firms are instead heterogeneous in their propensity to innovate, and their behavior is consistent with what discussed in the previous section.

The representative household’s problem consists of maximizing its welfare given its initial capital endowment \(k_0\) and the law of motion of capital, \(k_t\), by choosing how much to consume, \(c_t\), and supplying inelastically labor. The household size, L, is constant, and it is assumed to be infinitely large. Welfare is defined according to the average utilitarian criterion;Footnote 11 thus, it is equal to the infinite discounted sum (\(\rho \) is the pure rate of time preference) of instantaneous utilities, which depend solely upon consumption. The instantaneous utility function is assumed to take the following isoelastic form: \(u(c_t)=\frac{c_{t}^{1-\sigma }-1}{1-\sigma }\), where \(\sigma >1\) is the inverse of the intertemporal elasticity of substitution. As usual lowercase letters denote per capita variables, while uppercase letters aggregate variables. The household’s problem in per capita terms can be written as:

$$\begin{aligned} \max _{c_t}&W=\int _{0}^{\infty } \frac{c_{t}^{1-\sigma }-1}{1-\sigma } e^{-\rho t}\mathrm{d}t \end{aligned}$$
(7)
$$\begin{aligned} s.t.&\dot{k}_{t}=(1 - \tau _t)( r_t k_t + w_t) - c_t , \end{aligned}$$
(8)

where \(r_t\) is the capital rental rate, \(w_t\) the wage rate and \(\tau _t\) a (time-varying) income tax rate. The first terms in the RHS of (7) represent the disposable income which needs to be allocated between consumption (\(c_t\)) and capital investments (\(\dot{k}_{t}\)).

Output is produced by competitive productive firms according to a Cobb–Douglas production function, combining labor, L (inelastically supplied by households), and capital, \(K_t\). The production function in per capita terms takes the following form:

$$\begin{aligned} y_t=A_t k_t^\alpha \end{aligned}$$
(9)

where \(\alpha \in (0,1)\) is the capital share while \(A_t\) a technological factor, representing total factor productivity. Productive firms take the level of technology as given and maximize their instantaneous profits, thus determining the rental rate of capital, \(r_t\) (and the wage rate, \(w_t\)).

Research firms indexed by \(i=1,\dots , N\) are heterogeneous in their propensity to innovate \(\omega _{i,t}\) and try to maximize the profits associated with their research activities. Their behavior is identical to what discussed in the previous section and thus is determined by the comparison between their profit when innovating (\(\omega _{i,t}=1\)) and when not (\(\omega _{i,t}=0\)). Whenever innovating they will sell their innovation at a price \(\tilde{h}_t=h y_t\) to the government,Footnote 12 which does somehow finance the research activities in the overall economy. We assume the number of research firms is infinitely large such that Proposition 1 holds.

The government, by taxing households, collects a tax revenue \(\tau _t y_t\) from each household, which is used to buy innovations at a price \(\tilde{h}_t\) (in units of output) from each research firm actively engaged in research activities, \(\sum _i \omega _i\). In order to maintain a balanced budget at any point in time, the government budget constraint reads as \(\tau _t y_t L=h y_t \sum _i \omega _{i,t}\), implying that \(\tau _t \frac{L}{N}=h \frac{\sum _i \omega _{i,t}}{N}\). Since the number or research firms N is infinitely large, the previous equation can be non-trivially verified only if the number of households L is infinitely large as well, such that the household to research firm ratio \(\ell =\lim _{(L,N)\rightarrow (\infty ,\infty )}\frac{L}{N}>0\) is constant and finite. Provided that both the number of households and research firms are infinitely large, the budget constraint can be rewritten as follows:

$$\begin{aligned} \tau _t \ell =h x_t, \end{aligned}$$
(10)

where \(x_t\) is the share of innovative firms whose dynamics is given in (5). Once an innovation is bought by the government, it is immediately released in the public domain to allow productive firms to use such an innovation for free to produce the final consumption good (Marchese et al. 2014). This means that the government plays an essential role by spreading innovations in the economy by buying them from firms, thus solving an important coordination problem. This is consistent with recent evidence suggesting that most technological advances have effectively been made possible by entrepreneurial activities pursued by governments (Mazzucato 2013).

By financing research activities, the government determines the time evolution of the total factors productivity. Indeed, the overall level of technology is determined by the interaction among research firms. Specifically, we assume that it evolves according to the following law of motion:

$$\begin{aligned} \dot{A}_{t}= \phi x_t A_t , \end{aligned}$$
(11)

where \(\phi >0\) is a scale parameter and \(x_t\) represents the share of research firms which actively engage in innovative activities. According to (11) for technological progress to occur, it does not matter the size of the research industry (i.e., how many research firms exist) but the relative size of innovative firms with respect to the industry. If none does research (\(x_t=0\)), then technological progress does not occur, while if all firms do research (\(x_t=1\)) then technological progress occurs at a strictly positive rate \(\phi \). For any situation different from these two extreme cases, the rate of technological progress will lie between 0 and \(\phi \); which specific rate will arise depends on the behavior of research firms and their interaction within the research industry.

In general equilibrium, all agents maximize their objective function and all markets clear. The economy is completely characterized by the following system of differential equations and the given initial conditions \(k_0\), \(x_0\) and \(A_0\):

$$\begin{aligned} \frac{\dot{c}_{t}}{c_t}= & {} \frac{1}{\sigma }\left[ \left( 1-\frac{h}{\ell } x_t\right) \alpha A_t k_t^{\alpha -1} - \rho \right] \end{aligned}$$
(12)
$$\begin{aligned} \dot{k}_{t}= & {} \left( 1-\frac{h}{\ell } x_t\right) A_t k_t^\alpha - c_t \end{aligned}$$
(13)
$$\begin{aligned} \dot{x}_{t}= & {} \frac{1}{2}\,\tanh \left\{ \beta \left[ h+J \left( x_t-\frac{1}{2}\right) \right] \right\} -x_t+\frac{1}{2} \end{aligned}$$
(14)
$$\begin{aligned} \dot{A}_{t}= & {} \phi x_t A_t \end{aligned}$$
(15)

Note from the above equations that the research industry dynamics turns out to be independent from other macroeconomic variables. Although this might seem a strong limitation of our model, as we shall see in Sect. 6, results will not be qualitative different even in a more sophisticated formulation in which research industry and macroeconomic outcomes affect each other. It seems convenient thus to present the model first in its simplest possible form. Apart from the case in which \(x_t\) converges to zero (which however will never be an equilibrium), the above system (12), (13), (14) and (15) is not stationary (i.e., it does not show any equilibrium at all); thus, in order to study its dynamic behavior it may be convenient to recast the system in a stationary system as traditionally done in the endogenous growth literature. From the equilibrium properties of this latter system, we will then be able to infer the properties of the BGP equilibrium associated with (12), (13), (14) and (15). A BGP equilibrium denotes a situation in which all variables grow at a constant (possibly nonnegative) rate, and deriving and discussing the characteristics of the BGP equilibrium is our main goal in next section.

4 BGP equilibrium

By introducing the variables \(\chi _t=\frac{c_t}{k_t}\) and \(\varphi _t=A_t k_t^{\alpha -1}\), denoting the consumption to capital ratio and the average product of capital, respectively, it is possible to recast the above system in the following stationary system:

$$\begin{aligned} \frac{\dot{\chi }_{t}}{\chi _t}= & {} \chi _t-\frac{\rho }{\sigma }- \frac{\sigma -\alpha }{\sigma }\left( 1-\frac{h}{\ell } x_t\right) \varphi _t \end{aligned}$$
(16)
$$\begin{aligned} \frac{\dot{\varphi }_{t}}{\varphi _t}= & {} \phi x_t -(1-\alpha )\left( 1-\frac{h}{\ell } x_t\right) \varphi _t + (1-\alpha )\chi _t \end{aligned}$$
(17)
$$\begin{aligned} \dot{x}_{t}= & {} \frac{1}{2}\tanh \left\{ \beta \left[ h+J \left( x_t-\frac{1}{2}\right) \right] \right\} -x_t+\frac{1}{2} \end{aligned}$$
(18)

At equilibrium, the above system is characterized by the following steady-state values:

$$\begin{aligned} \overline{\chi }= & {} \frac{(1-\alpha )\rho +(\sigma -\alpha )\phi \overline{x}}{\alpha (1-\alpha )} \end{aligned}$$
(19)
$$\begin{aligned} \overline{\varphi }= & {} \frac{(1-\alpha )\rho + \sigma \phi \overline{x}}{\alpha (1-\alpha )(1-\frac{h}{\ell } \overline{x})}, \end{aligned}$$
(20)
$$\begin{aligned} \overline{x}= & {} \frac{1}{2}\,\tanh \left\{ \beta \left[ h+J \left( \overline{x}-\frac{1}{2}\right) \right] \right\} +\frac{1}{2} \end{aligned}$$
(21)

where \(\overline{x}\) cannot be determined explicitly. However, since \(x_t\in [0,1]\), it follows that \(\overline{x}\) will always be nonnegative. This means that provided that \(\ell >h\), the steady-state values \(\overline{\chi }\) and \(\overline{\varphi }\) will be strictly positive. We summarize the results about the BGP equilibria and their stability in the following proposition.

Proposition 2

Assume \(\ell >h\); then along a BGP equilibrium, the economic growth rate, \(\gamma \), is strictly positive and given by the following expression:

$$\begin{aligned} \gamma\equiv & {} \gamma _c=\gamma _k=\frac{\gamma _A}{1-\alpha }= \gamma _y=\frac{\phi \overline{x}}{1-\alpha }>0, \end{aligned}$$
(22)

where \(\overline{x}\) denotes the steady-state value of \(x_t\). Moreover, there exist two positive threshold levels, \(J^t(\beta )\) and \(h^t(J,\beta )\), given by the following expressions

$$\begin{aligned} J^t(\beta )= & {} \frac{2}{\beta }\\ h^t(J,\beta )= & {} \frac{J}{2}\sqrt{\frac{\beta J-2}{\beta J}}+\frac{1}{\beta }\ln \left( \sqrt{\frac{\beta J}{2}}-\sqrt{\frac{\beta J-2}{2}}\right) , \end{aligned}$$

such that:

  1. (i)

    if \(J<J^t(\beta )\), there exists a unique \(\gamma ^*\) and the unique BGP equilibrium is saddle-point stable with a two-dimensional stable manifold;

  2. (ii)

    if \(J>J^t(\beta )\), then two alternative outcomes are possible:

    1. (a)

      if \(h> h^t(J,\beta )\), there exists a unique \(\gamma ^*\) and the unique BGP equilibrium is saddle-point stable with a two-dimensional stable manifold;

    2. (b)

      if \(h<h^t(J,\beta )\), there exist three BGP equilibria corresponding to three values \(\gamma _L<\gamma _M<\gamma _H\). The intermediate one is saddle-point stable with a one-dimensional stable manifold, whereas the two extreme ones are (locally) saddle-point stable, each with a two-dimensional stable manifold.

Proof

By plugging the steady-state values of \(\chi _t\) and \(\varphi _t\) back in the original equations (1215), it is straightforward to derive the BGP growth rate \(\gamma \), as in (22). The characteristics of \(\gamma \) strictly mimic those of \(\overline{x}\). Indeed, multiplicity is due to the possible multiplicity of the steady states of Eq. (5). As already shown in the literature (see Brock and Durlauf 2001), it turns out that, depending on the values of the parameters, we can have a unique stable equilibrium (\(\bar{x}\)) for (5) or three equilibria (\(\overline{x}_L<\overline{x}_M<\overline{x}_H\)), two of which are locally stable (\(\overline{x}_L\) and \(\overline{x}_H\)). A similar threshold value for J, equal to \(1/\beta \), is also derived by Brock and Durlauf (2001); note that the factor 2, appearing in our statement, depends on the transformation from the variable \(m_t\) to the rescaled variable \(x_t\) as shown in the proof of Proposition 1.

Concerning the value of \(h^t\), in Olivieri and Vares (2005) (see Sects. 4.1.1. and 4.3) it is shown that the fixed point problem \(m=\tanh (\tilde{\beta }( m+\tilde{h}))\) admits multiple solutions as soon as \(\tilde{\beta }>1\) and \(\tilde{h}< \sqrt{\frac{\tilde{\beta }-1}{\tilde{\beta }}}+ \frac{1}{\tilde{\beta }}\, \ln \left( \sqrt{\tilde{\beta }}-\sqrt{\tilde{\beta }-1}\right) \). According to (6) and assuming \(z=0\) without loss of generality, we can rewrite our equation in x in the form \(m=\tanh (\beta h + \frac{\beta J}{2}m)\). Therefore, \(h^t\) is derived from the above expressions by setting \(\tilde{\beta }=\beta J/2\) and \(\tilde{h}=2h/J\). From (22), if there are multiple equilibria for x, then the system admits multiple equilibria as well.

Concerning stability, by linearization around a steady state it is possible to analyze the (local) stability properties of the above system by deriving the following Jacobian matrix:

$$\begin{aligned} J(\overline{\chi }, \overline{\varphi },\overline{x}) = \left[ \begin{array}{ccc} \overline{\chi } &{} -\frac{\sigma -\alpha }{\sigma }(1-\frac{h}{\ell } \overline{x})\overline{\chi } &{} \frac{\sigma -\alpha }{\sigma }\frac{h}{\ell }\overline{\varphi }\,\overline{\chi }\\ (1-\alpha )\overline{\varphi } &{} -(1-\alpha )(1-\frac{h}{\ell } \overline{x})\overline{\varphi } &{} \phi \overline{\varphi }+(1-\alpha )\frac{h}{\ell }\overline{\varphi }^2 \\ 0 &{} 0 &{} \varLambda \\ \end{array} \right] , \end{aligned}$$
(23)

where \(\varLambda =\frac{\partial \dot{x}_t}{\partial x_t}|_{x_t=\overline{x}}\). It is straightforward to show that the eigenvalues are given by the following expressions \(\lambda _1=\varLambda \), and \(\lambda _{2,3}=\frac{\varDelta \pm \sqrt{\varDelta ^2+\varTheta }}{2}\), where \(\varDelta =\overline{\chi }-(1-\alpha )(1-\frac{h}{\ell } \overline{x})\overline{\varphi }>0\) and \(\varTheta =4\frac{\alpha }{\sigma }(1-\alpha )(1-\frac{h}{\ell } \overline{x})\overline{\chi }\ \overline{\varphi }>0\), from which it directly follows that \(\lambda _{2}>0\) and \(\lambda _{3}<0\). Independently of what the sign of \(\varLambda \) is, there exists at least one positive and one negative eigenvalue; thus, any possible equilibrium is saddle-point stable. Moreover, it is possible to show that \(\varLambda <0\) for \(\bar{x}_L\) and \(\bar{x}_H\) and \(\varLambda >0\) for \(\bar{x}_M\). Therefore, the stable manifold associated with the three equilibria has dimension 2 for \(\gamma _L\) and \(\gamma _H\) and dimension one for \(\gamma _M\). \(\square \)

The parameter condition required by Proposition 2 is needed in order to ensure that the BGP equilibrium is well defined. Intuitively, it requires that the household to research firm ratio (\(\ell \)) is large enough to provide the government with the resources needed to promote research activities (h). Along a BGP the economic growth rate \(\gamma \) depends negatively on \(\alpha \) and positively on \(\phi \) and more importantly on the equilibrium share of innovative firms \(\overline{x}\). This means that our model economy does not show any scale effect, since the growth rate is independent of any aggregate variable.Footnote 13 However, since the equilibrium share of research firms may not be unique,Footnote 14 also the BGP equilibrium turns out to be not unique, and this is strictly related to the size of the externality parameter, J. Indeed, Proposition 2 suggests that in the fishing-out case (\(J<0\)) there always exists a unique saddle-point stable BGP equilibrium; however, in the standing-on-the-shoulder case (\(J>0\)) there is a richer variety of possible outcomes. Whenever the standing-on-the-shoulder effect is weak (i.e., the magnitude of the positive externality is small), a unique stable equilibrium will emerge. In the case of a sufficiently large externality, then the number of equilibria depends on the value of the incentive mechanism provided by the amount of revenues obtained, h. A large h makes the equilibrium unique, whereas a small h gives rise to the presence of two locally stable equilibria.Footnote 15 As a matter of expositional simplicity, in the following we will refer to the case (i) in Proposition 2 as the “small externality case” and to the case (ii) as the “large externality case.” Note that the macroeconomic behavior closely resembles the behavior on the innovation share, and when the equilibrium innovation share is unique (multiple) then the BGP equilibrium is unique (multiple) as well. Specifically, in the case of multiple equilibria, if \(x_0<\overline{x}_M\) then \(x_t\) will converge to \(\overline{x}_L\) (and the BGP growth rate will be low, \(\gamma _L\)), while if \(x_0>\overline{x}_M\) then \(\overline{x}_H\) will be reached instead (and the BGP growth rate will be high, \(\gamma _H\)). Thus, the initial fraction of innovative firms plays a crucial role in determining which BGP equilibrium will be effectively achieved.Footnote 16

In order to understand more in depth what are the characteristics of the BGP equilibrium, we now analyze the behavior of the economy under a realistic model’s parameterization. Specifically, we set the inverse of the intertemporal elasticity of substitution, \(\sigma \), equal to 2, the rate of time preference, \(\rho \), to 0.04, the capital share, \(\alpha \) to 0.33 (Mullingan and Sala-i-Martin 1993); the scale parameter determining the rate of growth of technology \(\phi \) is calibrated to 0.04, in order to obtain an economic growth rate equal to 0.03 (in the case in which the equilibrium share of innovative firms is exactly equal to one half); the other parameter values are set arbitrarily in order to make sure that the assumption required in Proposition 2 is met and that our qualitative results are as clear as possible. We thus set the households to research firm ratio, \(\ell \), equal to 1000, the measure of the dispersion of opinion in the population of research firms, \(\beta \) equal to 1, while we let the revenue provided to research firms, h, and the size of the externality parameter, J, vary in order to see how they affect the BGP economic growth rate \(\gamma \). Table 1 summarizes the parameter values employed in our analysis.

Table 1 Parameter values employed in our simulation

In Fig. 1, we show how the BGP growth rate \(\gamma \) varies for different values of the externality parameter, J, whenever the revenue parameter h is set equal to 0. As expected from Proposition 2, for negative and positive but small enough values of the externality parameter a unique BGP and thus a unique economic growth rate, \(\gamma ^*=\frac{\phi \overline{x}^*}{1-\alpha }\) (equal to 0.03), exists. For larger values, three equilibria, namely \(\gamma _L=\frac{\phi \overline{x}_L}{1-\alpha }\), \(\gamma _M=\frac{\phi \overline{x}_M}{1-\alpha }\) and \(\gamma _H=\frac{\phi \overline{x}_H}{1-\alpha }\) with \(\gamma _L<\gamma _M<\gamma _H\), exist and the gap between the high and low economic growth rate, \(\gamma _H-\gamma _L\), rises with J.

Fig. 1
figure 1

Changes in the economic growth rate, \(\gamma \), for different values of the externality parameter, J (revenue parameter h set equal to 0)

Since the existence of either a unique or multiple BGP equilibrium is related to the size of the externality parameter, it may be convenient to separately analyze the cases in which the externality parameter is either small or large. In Fig. 2, we thus consider two alternative values \(J=1.9\) and \(J=2.5\), lying below and above the threshold value \(J^t=2\) (see Fig. 1), respectively, and show how the BGP growth rate \(\gamma \) varies with the revenue parameter, h. As discussed above, the small externality case represents a situation in which the research sector is characterized by either fishing-out (\(J<0\)) or weak standing-on-the-shoulder (\(0<J\le J^t\)) effects. In both the cases, equation (18) shows a unique stable equilibrium and consequently the BGP equilibrium is unique as well: \(\gamma ^*=\frac{\phi \overline{x}^*}{1-\alpha }\). The convergence to the steady state of the system (1618) will occur along a two-dimensional stable manifold. We can see that the unique economic growth rate increases with h; thus, the higher the incentive for research firms to engage in research activities the faster the economic growth (Fig. 2, top panel). The large externality case represents instead a situation in which the research sector is characterized by a strong standing-on-the-shoulder (\(J>J^t\)) effect. In this case, Eq. (18) shows three equilibria (\(\overline{x}_L<\overline{x}_M<\overline{x}_H\)), two of which are locally stable (\(\overline{x}_L\) and \(\overline{x}_H\)). As a consequence, the BGP equilibrium is not unique as well: we need to distinguish three BGP equilibria, characterized by an economic growth rate equal to \(\gamma _L=\frac{\phi \overline{x}_L}{1-\alpha }\), \(\gamma _M=\frac{\phi \overline{x}_M}{1-\alpha }\) and \(\gamma _H=\frac{\phi \overline{x}_H}{1-\alpha }\) with \(\gamma _L<\gamma _M<\gamma _H\), respectively. As seen from Proposition 2, the convergence to such three steady states of the system (1618) will occur either along a two-dimensional stable manifold (for \(\gamma _L\) and \(\gamma _H\)) or along a one-dimensional stable manifold (for \(\gamma _M\)). We can see that the high and low economic growth rates, \(\gamma _H\) and \(\gamma _L\), increase with h, while the medium one \(\gamma _M\) falls with h; thus, the higher the incentive for research firms to engage in research activities the faster the economic growth in each of the two stable equilibria (Fig. 2, bottom panel). The threshold value for h provided by Proposition 2 (and confirmed by our numerical simulation) is \(h^t\approx 0.078\); only whenever \(h<h^t\), three equilibria exist.

Fig. 2
figure 2

Changes in the growth rate, \(\gamma \), for different values of the revenue parameter h (\(0\le h\le 0.2\)); the externality parameter J set equal to either 1.9 (top panel) or 2.5 (bottom panel)

Figure 2 suggests some interesting policy implications, since it clearly shows how the revenue parameter impacts on the equilibrium economic growth rate. Indeed, in the large externality case whenever the revenue provided to research firms is small (\(h<h^t\)), three different BGP equilibria exist, and this is strictly related to the existence of three different equilibrium values for the innovation share. Therefore, the same economy may experience different growth rates according to how many research firms actively engage in research activities: if this share is small, the economic growth rate will be low, while if it is large the economic growth rate will be high. This means that the economy is potentially faced with a low growth trap, which may condemn it to grow slower than what it could potentially do. In such a framework, it is natural to wonder what policymakers can do in order to deal with this problem. As traditionally discussed mainly in the context of poverty traps (Sachs et al. 2004), an economy may escape its low growth trap by increasing the innovation share, thus allowing the initial share of research firms (\(x_0\)) to exceed its unstable middle equilibrium (\(\overline{x}_M\)). Such an outcome might be implemented by simply opening the economy to international trade and providing some incentive for foreign firms actively engaged in research activities to operate also on the domestic market; research activities at international level may thus provide the economy with the push it needs to achieve fast economic growth. However, policymakers may do much more than this, since they can effectively allow the economy not only to escape its low growth trap, but to even solve completely the trap problem. Indeed, by rising enough the revenue provided to each research firm such that \(h>h^t\), the innovation share will naturally converge toward its unique (higher) equilibrium value; the economic growth rate at equilibrium will be high, and thus, the economy will not be trapped into a low growth equilibrium. Such an outcome can be easily implemented by increasing the tax rate applied to households’ income in order to finance the increase in the revenue parameter. Indeed, in our model’s parameterization the tax parameter \(\hat{\tau }\) needed to escape the low growth trap is \(\hat{\tau }= \frac{h}{\ell }\overline{x}\approx 0.0413\%\). The result should be clear from Fig. 2; it can also be seen from Fig. 3 where we plot the equilibrium values of \(\overline{x}\) for two different values of the revenue parameter h. This clearly show that with a higher h a unique equilibrium \(\overline{x}\) (and thus also a unique BGP) may exist.

Fig. 3
figure 3

Equilibrium values \(\overline{x}\in [0,1]\) (marked with a star) found as the intersection between the bisector line and \(f(x)=\frac{1}{2}\tanh \left\{ \beta \left[ h+J \left( x-\frac{1}{2}\right) \right] \right\} +\frac{1}{2}\) (see Eq. (21)). Parameters values: \(h=0\) (left panel) and \(h=0.15\) (right panel), with \(J=2.5\)

5 Metastability and endogenous cycles

As already stressed in Sect. 2, the infinite dimensional research market obtained by letting N go to infinity is just a deterministic approximation of the more complex finite N-dimensional heterogeneous research industry described by Eqs. (14). We now focus on some important implications of the fact that the “true” model is actually not deterministic but stochastic since characterized by some intrinsic randomness. Indeed, when multiple BPG equilibria exists,Footnote 17 the finite dimensional system described in Sect. 2 has the remarkable property to exhibit a “metastable behavior.” As discussed in Mathieu and Picco (1998), a probabilistic system exhibits a metastable behavior when it remains for long times close to an apparent equilibrium, called metastable, and then it suddenly shifts to another attractor. We shall prove that, for some values of the parameters, these transition times happen with probability one and are unpredictable, in the sense that their distribution shows lack of memory. This phenomenon gives rise to a cycling behavior of the finite dimensional system: due to the sudden switching of the system toward the other attractor, the trajectories oscillate for long times close to one of the two (metastable) equilibria. Put differently, it is as if there were two different timescales: on a short timescale, the equilibria described in Proposition 2 are, apparently, stable; on a longer timescale, we instead observe a cycling behavior generated by the endogenous fluctuations between the two regimes.Footnote 18 Let us focus on the large externality case and specifically on a situation in which multiple BGP equilibria exist (case ii.b in Proposition 2). In such a case two stable BGP equilibria exist; as discussed above which equilibrium our economy will achieve depends upon the initial conditions \((\chi _0,\varphi _0,x_0)\), and in particular a critical role is played by the initial condition on \(x_t\). Indeed, the initial share of innovative firms determines whether the equilibrium share will be high or low, thus determining whether the economic growth rate, \(\gamma \), will be high or low. While this outcome is clear in the infinitely large number of research firms version of the problem, whether this holds true also for the finite version is not so obvious. In fact, in the finite version of the model, research firms are subject to random shocks which determine whether they will decide to innovate or not; in the infinite version, the effects of such shocks cannot be analyzed since the approximation provided by Eq. (18) turns out to be completely deterministic.

Let us denote by \(x_t^N\) the proportion of innovative firms at time t among a total population of N firms. As said, under the assumptions of case ii.b in Proposition 2, the trajectory \(x^N_t\) has a metastable behavior: it fluctuates close to one of the two equilibria, say \(\bar{x}_{L}\) and, after a random time, it suddenly jumps to values close to \(\bar{x}_H\) (and vice versa). We now state a proposition collecting the main properties of the so-called tunneling time, i.e., the time needed for a trajectory to leave the basin of attraction of one equilibrium. The cycling behavior is due to a sequence of such (random) tunneling times. Unfortunately, the results describing the probabilistic properties of the tunneling times are rather technical; in order to make the discussion more clear, we prefer to maintain this section as simple as possible avoiding technicalities and postpone to “Appendix 2” a more involved mathematical discussion. In order to define \(T_N\), the tunneling time of the process \(x_t^N\), we introduce two significant values in the state space of \(x_t^N\):

$$\begin{aligned} x_{L}^N=\frac{\lceil N\,\bar{x}_L\rceil }{N};\quad x_{M}^N=\frac{\lceil N\,\bar{x}_M\rceil }{N}; \end{aligned}$$

these are the values that the process \(x_t^N\) can reach and that better approximate (from above) the real values \(\bar{x}_L\) and \(\bar{x}_M\), respectively. The tunneling time of the process \(x_t^N\) is defined as the time needed to cross the basin of attraction of \(\bar{x}_H\), when starting close to \(\bar{x}_L\). More precisely,

$$\begin{aligned} T_N=\min _{t>0}\left\{ x^N_t=x_M^N \,,\, x^N_0=x_L^N \right\} . \end{aligned}$$
(24)

Note that this is exactly the first time at which the trajectory starting close to the equilibrium \(\bar{x}_L\) will cross \(\bar{x}_M\), hence entering the basin of attraction of \(\bar{x}_H\). Once the process has crossed \(\bar{x}_M\), it rapidly converges toward \(\bar{x}_H\). The main properties of the tunneling time are summarized in the next proposition.

Fig. 4
figure 4

Evolution of \(x_t^N\) (in blue) for a finite dimensional system of \(N=1000\) firms. In the top panel, we see the deviation from the expected trajectory \(\overline{x}^{(L)}_t\) suggested by the model in favor of \(\overline{x}^{(H)}_t\). In the bottom panel, we have a trajectory fluctuating around the two attractors. Parameters are as in Table 1 with \(h=0.01\) and \(J=2.14\) (top panel) and \(h=0\) and \(J=2.05\) (bottom panel)

Proposition 3

Consider the process \(x_t^N\) as described by Eqs. 34. Assume that \(J>J^t(\beta )\) and \(h<h^t(J,\beta )\), where the thresholds are as defined in Proposition 2. Then there exists a suitable constant \(\varDelta \) depending on \(\beta , J,h\) such that the tunneling time \(T_N\) has the following properties:

  1. (a)

    for all \(\delta >0\), \(\lim _{N\rightarrow \infty } {\mathbb {P}}\left( e^{N(\varDelta -\delta )}<T_N<e^{N(\varDelta +\delta )} \right) =1;\)

  2. (b)

    \(T_N/{\mathbb {E}}[T_N]\) converges in law to a unit-mean exponential random variable, as \(N\rightarrow \infty \).

A formal proof of the proposition is presented in “Appendix 2” where the explicit functional form of \(\varDelta \) is also provided. As suggested by Proposition 3, the random jumps happen with probability one for each trajectory, although the jump times \(T_N\) could be possibly large. Specifically, the transition times tend, for \(N\rightarrow \infty \), to an exponentially distributed random variable with expectation proportional to \(e^{\varDelta N}\), where \(\varDelta \) is a suitable constant depending on the parameters of the model. It turns out that, for values of J close to \(J^t\), the random time needed to exit the basin of attraction of the two locally stable equilibria is relatively small and trajectories showing growth cycles arise. In Fig. 4 (bottom panel), we provide an example showing that, for \(J=2.05\) (recall that \(J^t=2\)), this random time is reached early enough to be seen in the trajectory. More precisely, we show that the (stochastic) time series of \(x_t^N\) may deviate from its expected behavior predicted by Eq. (18). Recall that \(x_t\) describes exactly the deterministic evolution of the system under the modeling assumption that N is infinite. In the top panel, we show that \(x_t^N\) may converge to the equilibrium it is not supposed to achieve. Indeed, since the initial condition \(x_0=0.1\) is greatly lower than \(\overline{x}_M=0.4178\), we would expect the time series of \(x_t^N\) to fluctuate around the red-dashed trajectory \(x^{(L)}_t\) leading to the low equilibrium. However, in this particular simulation, this is not the case: the trajectory deviates and start fluctuating around the high equilibrium \(\overline{x}_H\). In the bottom panel, as said, we show that the finite dimensional trajectory \(x_t^N\) may spend quite a long time close to one of the two equilibria and then depart from it to reach the other one. What discussed for \(x_t^N\) has clear implications also in terms of the macroeconomic outcome: differently from what suggested by the (deterministic) theory, the system, even when the initial conditions are very close to the high BGP equilibrium, may converge toward the low BGP equilibrium or oscillate between the two BGP equilibria without converging to a steady state. Note that in the small externality case, in which the equilibrium is unique, such an effect naturally disappears. This suggests that government intervention may be essential not only to allow the economy to solve its eventual low growth trap problem but also to reduce the fluctuations (occurring with probability one) in economic activity.

We provide now some estimates of the expected tunneling time for the process \(x_t^N\). When considering the values of the parameters as in Table 1, \(h=0\) and \(J=2.05\) (see Fig. 4, bottom panel), \(\varDelta \approx 1.92\times 10^{-4}\) so that \(e^{\varDelta N}\approx 1.212\). This means that, on average, we expect \(e^{\varDelta N} \cdot N \approx 1212\) single transitions (i.e., single firms deciding to change their innovation policy) to actually observe a tunneling time. As a comparison, in Fig. 5, we show a trajectory of \(x_t^N\) where parameters are the same except for J which is now higher: \(J=2.14\). In this case, \(\varDelta =3.217 \times 10^{-3}\) and \(e^{\varDelta \,N}\approx 24.965\). Now, we expect 24,965 single transitions to observe a tunneling time. The comparison with the bottom panel of Fig. 4 shows that, as expected, the tunneling times are now less frequent: a longer timescale is needed in order to capture the transitions from one BGP equilibrium to the other.

Fig. 5
figure 5

Evolution of \(x_t^N\) (in blue) for a finite dimensional system of \(N=1000\) firms. Parameters are as in Table 1 with \(h=0\) and \(J=2.14\). The BGP innovation share equilibria are \(\bar{x}_H=0.7156\) and \(\bar{x}_L=0.2844\)

We conclude this section by providing an intuition about the effects of metastability on macroeconomic variables, and in particular on how metastability results in output fluctuations, thus generating a growth cycle. To this aim, in order to simplify computational problems we consider the special case in which the inverse of the intertemporal elasticity of substitution and the capital share perfectly coincide, that is \(\sigma =\alpha \). Such a case has been frequently analyzed in order to fully characterize transitional dynamics in similar growth models (Smith 2006; Xie 1994) since allowing to decouple some variables in system Eqs. 1618 and thus to obtain an explicit analytical expression for the evolution of main variables. Whenever \(\sigma =\alpha \), it is straightforward to show that the evolution of capital is given by the following expression: \(\dot{k}_t=\left( 1-\frac{h}{\ell }x_t\right) A_t\, k_t^\alpha -\frac{\rho }{\alpha }\, k_t,\) from which it is then possible to characterize the evolution of output from \(y_t=A_t\, k_t^\alpha .\) We wish to provide an intuition about the trajectories of \(y_t^N\), the level of per capita output, under the assumption that \(x_t\) is now substituted by \(x_t^N\). This can be done by discretizing the system of differential equations to obtain a (approximated) difference equations system where the variables are now \((k_t^N,\, A_t^N,\, y_t^N)\) where \(k_0^N=k_0\), \(A_0^N=A_0\),

$$\begin{aligned} \left\{ \begin{array}{l} k^N_{t+1}=k^N_t+\left( 1-\frac{h}{\ell }\,x_t^N\right) \,A_t^N\,(k_t^N)^\alpha -\frac{\varphi }{\alpha }\,k_t^N \\ A_{t+1}^N=\phi \, A_t^N\, x_t^N \end{array}\right. \end{aligned}$$

and \(y_t^N=A_t^N\, (k_t^N)^\alpha \), with \(y_0^N=A_0^N\, (k_0^N)^\alpha \). In Fig. 6, we plot (on a log-scale) the trajectory of \(y_t^N\) associated with the trajectory of \(x_t^N\) presented in Fig. 5. It is straightforward to observe that the slope of \(\log (y_t^N)\), representing the growth rate of per capita output, changes whenever a regime shift occurs: the slope is steeper when \(x_t^N\) fluctuates around \(\bar{x}_H\) while it is flatter when \(x_t^N\) fluctuates around \(\bar{x}_L\), meaning that the growth rate fluctuates between high and low values, thus giving effectively rise to a growth cycle. Even if the difference in the growth rates might apparently seem small, this is not the case: from \(\gamma =\frac{\phi \, \bar{x}}{1-\alpha }\), we can compute that the two BGP rates are substantially different; indeed, \(\bar{\gamma }_H=0.0427\) and \(\bar{\gamma }_L= 0.0170\) suggesting a 2.5% difference in the growth rate between the high and low regimes.

Fig. 6
figure 6

Evolution of \(y_t^N\) on a logarithmic scale (top panel) associated with the evolution of \(x_t^N\) (bottom panel)

6 A generalized model

The model that we have focused on thus far is based on the assumption that the incentive for innovative activities (i.e., the price paid by the government in order to purchase the innovation from research firms), h, is constant and exogenously given. This might seem a merely convenient ad hoc simplification allowing to decouple the research industry from other macroeconomic variables; however, as we shall see in a while even extending the analysis to a more general setting would lead to results qualitative similar to those just discussed in our baseline model. In order to look at this, we allow the incentive to be time-dependent and endogenous since depending on other macroeconomic variables. In this case, the government budget constraint reads as follows:

$$\begin{aligned} \tau _t\ell =h_tx_t, \end{aligned}$$
(25)

where \(h_t=H(x_t,A_t,t)\) for a suitable function \(H:{\mathbb {R}}^3\rightarrow [0,\ell ]\). Note that the upper bound is needed to ensure that \(\tau _t\le 1\) for all t. This specification of \(h_t\) suggests that the government may wish to provide stronger or weaker incentives to innovative activities according to the overall level of technological advancement \(A_t\) achieved in the economy. Therefore, the tax rate should also change with the level of technological advancement: the tax rate is now defined as \(\tau _t={\mathcal {T}}(A_t)\) where we do not impose any restriction a priori on the shape of the function \({\mathcal {T}}(\cdot )\). The above budget constraint in this case read as follows:

$$\begin{aligned} h_t=H(x_t,A_t,t)=\frac{\ell }{x_t} {\mathcal {T}}(A_t). \end{aligned}$$

We will discuss one specific example at the end of this section, but we firstly provide a generalization of Propositions 1 and 2. Note that, since now \(h_t=H(x_t,A_t,t)\), the Markov process \(x^N_t\) and the relative limit \(x_t\) are no longer disentangled from other macroeconomic variables: the dependence in \(A_t\) makes the derivation of the law of large numbers more complicated.Footnote 19 Indeed, we need to define a suitable two-dimensional stochastic process \((x_t^N,A_t^N)\) such that, when taking the limit for \(N\rightarrow \infty \), \((x_t^N,A_t^N)\) converges to the pair \((x_t,A_t)\) defined by Eqs. 1415. This is formally stated in the following proposition.

Proposition 4

For all \(N\ge 1\), consider the stochastic process \((x_t^N,A_t^N)\) defined as:

$$\begin{aligned} x_t^N=\frac{1}{N}\sum _{i=1}^N\omega _{i,t}\, , \quad A_t^N{:=}A_0^N\, e^{\int _0^t \phi \, x_s^N \, ds} \end{aligned}$$

where \(\phi >0\) and

$$\begin{aligned} {\mathbb {P}}(\omega _{i,t+\varDelta t}=1|\, \omega _{i,t}, x^N_t,A_t^N)=\eta \left[ H(x_t^N,A_t^N,t)-z+J\left( x_t^N-\frac{1}{2}\right) \right] . \end{aligned}$$
(26)

Moreover, assume that \(\lim _{N\rightarrow \infty }x_0^N=x_0\) and \(\lim _{N\rightarrow \infty }A_0^N= A_0\). Then, when \(N\rightarrow \infty \), the process \((x_t^N,A_t^N)\) weakly converges to \((x_t,A_t)\) solving

$$\begin{aligned} \dot{x}_{t}= & {} \frac{1}{2}\,\tanh \left\{ \beta \left[ H(x_t,A_t,t)+J \left( x_t-\frac{1}{2}\right) \right] \right\} -x_t+\frac{1}{2} \end{aligned}$$
(27)
$$\begin{aligned} \dot{A}_{t}= & {} \phi \, x_t A_t \end{aligned}$$
(28)

with initial conditions \((x_0,A_0)\).

Proof

Note that, in principle, the definition of \(A_t^N\) introduces a dependence on the past of the process \(x_t^N\), thus a loss of Markovianity. Nevertheless, we will see that the pair \((x_t^N,A_t^N)\) is still Markovian. To this aim, we introduce the infinitesimal generator of the process \((x_t^N,A_t^N)\) applied to functions \(f:{\mathbb {R}}^3\rightarrow {\mathbb {R}}\) with compact support:

$$\begin{aligned} \mathcal {L}_N f(x,A,t)= & {} N\, x_t^N\, \eta ^-(x^N_t,A^N_t)\, \left[ f\left( x-\frac{1}{N},A,t\right) -f(x,A,t)\right] \\&+ N\, (1-x_t^N)\, \eta ^+(x^N_t,A^N_t)\, \left[ f\left( x+\frac{1}{N},A,t\right) -f(x,A,t)\right] \\&+ \phi \,x_t^N A_t^N \, f'_A(x,A,t) + f'_t(x,A,t) , \end{aligned}$$

where

$$\begin{aligned} \eta ^+(x,A){:=}\eta \left( H(x,A,t)-z+J\left( x-\frac{1}{2}\right) \right) \end{aligned}$$

represents the probability for an agent with \(\omega _{i,t}=0\) to become active (i.e., to decide to do research) at time t and \(\eta ^-=1-\eta ^+\). Since \({\mathcal {L}}_N\) only depends on the variables \((x^N_t,A^N_t)\) computed at time t, we conclude that the stochastic process \((x^N,A^N)\) is Markovian.

Moreover, since all the derivatives are uniformly bounded, it turns out that

$$\begin{aligned} \lim _{N\rightarrow \infty } \sup _{(x,A,t)\in {\mathbb {R}}^3} \left| {\mathcal {L}}_N f(x,A,t)-{\mathcal {L}} f(x,A,t)\right| =0\, \end{aligned}$$
(29)

where

$$\begin{aligned} {\mathcal {L}} f(x,A,t)=[-x+\eta ^+ ]\, f'_x(x,A,t) + \dot{A} \, f'_A(x,A,t) + f'_t(x,A,t) . \end{aligned}$$
(30)

The form of \({\mathcal {L}}\) provided in (30) follows from the fact that, taking the first order approximation of \({\mathcal {L}}_N\),

$$\begin{aligned} {\mathcal {L}}_N f(x,A,t)= & {} N\, x_t^N\, \eta ^-(x_t^N,A_t^N)\, \left( -\frac{1}{N}\right) \, f'_x(x,A,t) \\&+\;N\, (1-x_t^N)\, \eta ^+(x_t^N,A_t^N)\, \left( \frac{1}{N}\right) \,f'_x(x,A,t) \\&+ \;\phi \,x_t^N A_t^N \, f'_A(x,A,t) + f'_t(x,A,t) + o\left( \frac{1}{N}\right) ; \end{aligned}$$

which can be rewritten as

$$\begin{aligned} {\mathcal {L}}_N f(x,A,t)= & {} [-x_t^N+\eta ^+(x_t^N,A_t^N)]\, f'_x(x,A,t)\nonumber \\&+\;\phi \,x_t^N A_t^N \, f'_A(x,A,t) + f'_t(x,A,t) + o\left( \frac{1}{N}\right) . \end{aligned}$$
(31)

Now it is easy to show that the limit of \({\mathcal {L}}_N\) is \({\mathcal {L}}\) as expressed in (30). Note that \({\mathcal {L}}\) is the infinitesimal generator of the process \((x_t,A_t)\), where

$$\begin{aligned} \dot{x}_t= -x_t+\eta ^+(x_t,A_t); \quad \dot{A}_t=A_t\phi \,x_t. \end{aligned}$$

This can be seen computing \({\mathcal {L}} x\) and \({\mathcal {L}} A\) (i.e., considering \(f\equiv x\) and \(f\equiv A\), respectively). Finally, by the functional form of \(\eta ^+\), we easily see that

$$\begin{aligned} \dot{x}_t=-x_t+\frac{1}{2}+\frac{1}{2}\tanh \left( H(x_t,A_t,t)-z+J\left( x_t-\frac{1}{2}\right) \right) . \end{aligned}$$
(32)

By virtue of Theorem 1.6.1 in Ethier and Kurtz (1986), Eq. (29) and the assumption on the convergence of the initial conditions ensure that the stochastic process \((x_t^N, A_t^N)\) weakly converges to \((x_t, A_t)\). \(\square \)

Apart from the complication introduced by the need to deal with a two-dimensional stochastic process, the results are qualitatively identical to those discussed earlier. When the number of research firms is infinitely large, we can approximate the model’s outcome through some deterministic differential equations describing the evolution of the innovation share and the evolution of technology. Differently from what seen earlier in our baseline setup, now the research industry and the macroeconomic outcome are more realistically mutually interconnected. By having generalized Proposition 1, it is now straightforward to derive a generalization of Proposition 2.

Proposition 5

Consider the economy described by Eqs. 1618 where \(h_t=H(x_t,A_t,t)\) for a differentiable function \(H:{\mathbb {R}}^3\rightarrow [0,\ell ]\) such that \(\lim _{t\rightarrow \infty } h_t=\bar{h}\in [0,\ell ]\) is well defined. Then, along the (asymptotic) BGP equilibria the economic growth rate is given by:

$$\begin{aligned} \gamma =\frac{\phi \bar{x}}{1-\alpha }, \end{aligned}$$
(33)

where \(\bar{x}\) is the solution of:

$$\begin{aligned} \bar{x}=\frac{1}{2}\tanh \left\{ \beta \left[ \bar{h} + J \left( \bar{x}-\frac{1}{2}\right) \right] \right\} +\frac{1}{2}. \end{aligned}$$
(34)

Proof

Just take the limit for \(t\rightarrow \infty \) and impose \(\dot{x}=0\) in (32). The rest of the proof follows arguing similarly as in Proposition 2. \(\square \)

Proposition 5 characterizes the BGP in our generalized model in which the innovation incentive is endogenous and time-varying. Note that the economic growth rate is the same as in our baseline model and the only eventual difference between the two frameworks is due to the eventual different equilibrium share of innovative firms. Concerning multiplicity/uniqueness of equilibria, the whole discussion of Proposition 2 still holds true as long as h is replaced by \(\bar{h}\in [0,\ell ]\). Indeed, two thresholds \(J^t(\beta )\) and \(h^t(J,\beta )\) still exist although, differently from the baseline case, we are not able to characterize them explicitly. Propositions 4 and 5 jointly suggest that the results previously discussed in our baseline setup still hold true even in a more sophisticated model in which innovation decisions are endogenous and related to macroeconomic outcomes. Therefore, also the related discussion of policy implications still apply, confirming the importance of taking into account social interaction within the research industry in order to understand the determinants of macroeconomic performance. Finally, also in this more general case, when multiple BGP equilibria exist, they are metastable in the sense of what discussed in Sect. 5; therefore, the trajectories of \(x_t^N\) will exhibit cycling patterns and endogenous fluctuations will occur over the long run.

We exemplify the above discussion by considering some specific functional form for \(h_t\) in order to further clarify the results.

Example 1

Let us consider: \(h_t=H(x_t,A_t,t)=\frac{\ell }{x_t} {\mathcal {T}}(A_t)\), where \({\mathcal {T}}(A_t)= \tau \cdot (1+ e^{-A_t})\), for \(0<\tau \le 1/2\). In this case, \(\bar{h}=\frac{\tau \, \ell }{\bar{x}}.\) In Fig. 7, we plot the value of the BGP growth rate \(\gamma \) as a function of the main parameter. In the top panel, we set \(\tau =4\cdot 10^{-5}\) and we let J vary, while in the bottom panel we set \(J=3\) and we let \(\tau \) vary; all other parameter values are set accordingly to Table 1. We can see that the bifurcation diagram is similar to what described in our baseline model. Finally, note that in case of multiplicity of equilibria, the equilibrium value for the tax rate and the innovation incentive \(\bar{h}\) change with the equilibrium prevailing in the economy. For instance, for \(J=3\) and \(\tau =4\cdot 10^{-5}\), \(\bar{h}\) takes value 0.2060 or 0.0427 if \(x_L\) or \(x_H\), respectively.

Fig. 7
figure 7

Changes in the growth rate, \(\gamma \), for different values of J (top panel) with \(-1\le J\le 4\) and h (bottom panel) with \(0\le h\le 0.0001\); \(\tau =4\cdot 10^{-5}\) in the top panel and \(J=3\) in the bottom panel

Example 2

Consider the case of \(\tilde{h}_t=h_t\cdot y_t\), where \(h_t=H(x_t,A_t,t)\in [0,\ell ]\) and \(y_t\) is the time-varying per capita output. In this case, (26) reads as follows:

$$\begin{aligned} {\mathbb {P}}(\omega _{i,t+\varDelta t}=1|\, \omega _{i,t}, x^N_t,A_t^N)=\eta \left[ H(x_t^N,A_t^N,t)\cdot y_t-z+J\left( x_t^N-\frac{1}{2}\right) \right] . \end{aligned}$$

It is not difficult to show that Propositions 4 and 5 still apply. The result is, however, different: being H bounded and \(y_t\) diverging to infinity when \(t\rightarrow \infty \), in (27) we have that \(\bar{h}\) is replaced by \(\lim _{t\rightarrow +\infty }\tilde{h}_t = +\infty \). Therefore, in this case, the unique solution to (27) is now \(\bar{x}=1\). This is rather obvious: if the incentive to innovation explodes, it is worth to enter the market of innovation and the result is trivial.

7 Middle-income trap

An interesting line of interpretation of our stylized model is related to the middle-income trap hypothesis.Footnote 20 This refers to the experience common to many developing countries (especially in Latin America and in the Middle East) in the second half of the nineteenth century, in which growth has significantly slowed down after a first stage of takeoff characterized by rapid growth (see Gill and Kharas 2007; Commission on Growth and Development 2008). This development process has allowed these economies to quickly move from a low-income to a middle-income status, but not to make the further leap needed to become high-income economies. This has advanced the hypothesis that there may exist a middle-income trap, thus preventing some economies to fill the gap with more advanced countries. What might be the specific hindrances affecting this second stage of economic development is still an open question, but these are likely to be substantially different from those involving the first stage in which traditional poverty traps are in place.

Understanding what may be the reason why some fast growing economies have failed to achieve a high-income status is an active and recent research question with clear policy implications. While empirical evidence supporting the existence of a middle-income trap seems robust and convincing, much less clear is from a theoretical point of view why fast growth might come to an end. On the empirical side, Eichengreen et al. (2012) showed that growth tends to slowdown at levels of per capita income of about $15,000 (at 2005 constant international PPP prices), suggesting that a critical role is played by a reduction in the growth rate of the total factor productivity (TFP); specifically, a drop in TFP growth represents about 85% of the fall in per capita income growth. Eichengreen et al. (2013) provided some additional evidence, showing that the distribution of growth slowdowns is not necessarily unimodal, and in particular two modes, one around $15,000 and another around $11,000, exist. On the theoretical side, very few works have tried to provide some explanation of growth slowdowns in middle-income countries, and they focus on reallocation or misallocation of workers between different economic sectors. A traditional argument suggests that while in earlier stages of development it may be possible to raise productivity by shifting workers from agriculture to industry, this process may come to an end whenever the share of workers employed in agriculture falls enough (Lewis 1954). A more recent explanation emphasizes that a low allocation of high skilled individuals in the research sector may give rise to low productivity growth; however, this situation of potential slow growth can be fixed by policy interventions (Agénor and Canuto 2015). Differently from these works in which inter-sector dynamics is the driver of eventual growth slowdowns, our paper provides an alternative explanation based entirely on social interactions and technology diffusion.

Along the lines of Agénor and Canuto (2015), whenever the economy experiences multiple BGP equilibria (Proposition 2, case ii.b), the intermediate BGP equilibrium \(\gamma _M\) (i.e., the low growth trap threshold) can be clearly interpreted as a middle-income trap, separating fast and slow growing economies. Note that the eventual existence of such a trap is determined by the outcome in the research industry, which is completely driven by social interactions among research firms. Thus, the research intra-sector dynamics only might explain why technological progress and thus economic growth tend to be high or low in specific economies. In order to relate this to the pattern advanced by the middle-income trap hypothesis, we need to understand why an economy initially (during a first stage of economic development) in a BGP with high economic growth rate, \(\gamma _H\), may end up (in the second stage of development) in a BGP equilibrium with low growth rate, \(\gamma _L\), later. In our setting, this is equivalent to a either a fall in the number of innovative firms \(x_t\) or a rise in the intermediate equilibrium of the innovation share, \(\overline{x}_M\); both the cases imply that an economy with an original high innovation share may end up with a low (compared with the critical threshold) innovation share, thus experiencing a slowdown in its economic growth. The former case may be triggered by a change in international policy, and specifically, it may occur as a result of the introduction of tariffs or other restrictive policies, which by providing negative incentives for foreign firms to operate on the domestic market leads some foreign firms engaged in research activity to exit the domestic research market. The latter case may instead be triggered by a change in domestic economic policy, and it may occur as a result of a reduction in the support provided to research firms which, by determining the amount of revenue received by innovative firms, tends to increase the intermediate equilibrium value of the share of firms engaged in research activities. While empirical evidence seems to supports our conclusions related to the negative relation between growth slowdowns and openness (Eichengreen et al. 2012), the available evidence does not allow to either support or refute those related to the positive (up to a certain point) nexus between growth slowdowns and research-enhancing policies.

Apart from the eventual existence of such a middle-income trap, our model differently from Agénor and Canuto (2015) suggests that also growth cycles may occur. This implies that also fast growing economies cannot claim to have definitely escaped their middle-income trap, since they may be cyclically pulled into situations of growth slowdowns. This reinforces our previous conclusions that policymakers can play a critical role in the development process. By actively intervening with specific policies they can completely solve the trap problem dampening the size of the growth fluctuations, promoting a smooth process of fast growth allowing the economy to eventually catch up with more advanced economies and become a high-income country.

8 Conclusion

Technological progress is by far the most important determinant of economic growth over the long run. However, whether and how the interaction among research firms in the research industry might determine technological progress has never been analyzed thus far in the growth literature. Thus, in this paper we have tried to fill this gap by allowing a certain degree of firms interaction. Specifically, we assume that firms decide whether to innovate or not by taking into account also what other research firms do. Such an interaction among research firms, by determining the rate of technological progress, plays a critical role in shaping the whole macroeconomic outcome. Indeed, we have shown that under certain parameter conditions, by mimicking the behavior of the share of innovative firms, the economy may be characterized by a multiplicity of BGP equilibria and eventually may face a situation of low growth trap. We have also shown that the economy may eventually (endogenously) fluctuate between the low and high BGP, thus generating a growth cycle in which periods of low and high economic growth rates follow one another. The potential existence of low growth traps and endogenous growth cycles suggest that the government might play an essential role in order to contrast such negative effects. In particular, by rising enough the tax rate applied to households’ income it could completely solve the low growth trap problem, thus avoiding further fluctuations in economic activity. All these results are robust in the sense that they hold true both in our baseline model in which the innovation incentive is constant and in its generalized version in which this is potentially time-varying and dependent upon other macroeconomic variables.

This paper represents a first attempt to enrich the macroeconomic dynamics in traditional models of endogenous growth by allowing a certain extent of externality in research decisions. The approach followed is thus quite simplistic on purpose in order to show in the simplest possible way (which is already all but simple from a mathematical point of view) which might be the potential implications of allowing for social interactions in traditional macroeconomic models. Of course, our framework has several limitations which need to be accounted for in future research. Specifically, the dichotomous choice of research firms to do or not to do research does not allow to quantify research efforts; this assumption needs to be relaxed in order to fully characterize research decisions and outcomes. Also the specification of the research market is overly simple, and adopting a more traditional setup with either horizontal or vertical product differentiation may shed some further light on the impacts of social interactions on macroeconomic outcomes. Extending the analysis along these lines is left for future research.