1 Introduction

Many recent empirical studies have focused on international technology diffusion and income convergence. The degree of technology diffusion depends on the absorptive capacity of developing countries. Human capital, total factor productivity (TFP), and domestic R&D are related to the ability of developing countries to benefit from international technology transfer and contribute to their own technological progress. International trade and foreign direct investment (FDI) are considered to be two major channels for embodied knowledge spillover. Gaps in technology levels become smaller because technological advances in one economy are likely to be transmitted to other economies when technological spillovers are allowed. Furthermore, some empirical studies have used a spatial econometric framework to test country convergence. Assuming that a spatial lag or autoregressive term accounts for spatial dependence, the result is a faster convergence rate. If spatial dependence is present but omitted, this omission leads to unreliable statistical inference. Technological spillover and interdependence have direct (own country) and indirect (spillover) effects on the rate of convergence and are also embodied in unobserved individual effects.

A series of papers have considered spatial dependence on technology diffusion models. In particular, they document the significant role of geography in determining technology diffusion across countries. Easterly and Levine (1997) show that the growth effects of neighboring countries relate to location and focus on influences from physically adjacent locations to explain cross-country differences in public policies and other economic indications. Ades and Chua (1997) show that regional instability, defined as political instability in neighboring countries, has a strong negative effect on a country’s economic performance. They argue that these adverse regional influences must be taken into account when projecting the future economic performance of countries. Comin et al. (2012) determine that understanding technology diffusion over space is crucial to understanding the speed of technology diffusion.Footnote 1 Spatial econometric methods allow for the estimation of interactions between countries and spatial dependence. Ertur and Koch (2007, 2011) estimate the rate of convergence in a spatial Solow model and a spatial Schumpeterian growth model. They use maximum likelihood methods for their estimations. Elhorst et al. (2010) estimate an extended Solow–Swan growth model using European regions. They estimate an unconstrained spatial Durbin model using general methods of moment (GMM), maximum likelihood, and a mixture of both, which allows the inclusion of fixed effects and spatial interaction effects. Fischer (2011) designs a reduced-form empirical spatial Mankiw, Romer, and Weil (MRW) model to assess the importance of cross-region technological interdependence for 198 regions across 22 European countries from 1995 to 2004. Ho et al. (2013) show that there are positive spillover effects of growth from one country to its trade partners and that the implied rate of convergence is higher after including a spatial autoregressive term and a spatial time lag term. Lu and Wang (2015) adopt a spatial dynamic panel data approach and spatial quasi-maximum likelihood to reestimate the speed of growth convergence in 91 countries based on technological interdependence and spatial externalities. Lyncker and Thoennessen (2017) indicate that geographic clustering is quite pronounced in support of the club convergence in income of 194 European NUTS-2 regions.

Applications of the dynamic spatial panel data model from the very beginning are of Elhorst et al. (2010) on economic convergence (see Debarsy et al. 2012; Segura 2017; Ciccarellia and Elhorst 2017; Silva et al. 2017). Bouayad-Agha and Védrine (2010) found empirical evidence of conditional convergence of European 191 regions over the period 1980–2005, using a dynamic spatial panel data model. Technological diffusion due to spatial externalities plays a major role in the convergence pattern of European regions. Yu and Lee (2012) adopt a dynamic spatial panel approach to study regional growth convergence spanning the period 1930–2006 for the U.S. 48 contiguous states. They introduce technological spillovers into the neoclassical framework, showing that the convergence rate is higher and there is spatial interaction. Gallo and Fingleton (2014) present alternative approaches (distribution dynamics) to regional growth and convergence empirics for the EU, implying constant probabilities of different income states but allowing movement of regions across income states. Antai et al. (2015) focus on estimating a dynamic spatial panel data model with a specified source of endogeneity for the time-varying spatial weight matrices when the time period T is short. They propose the two-stage instrumental variable (2SIV) estimation method and prove its consistency and asymptotic normality by employing the theory of asymptotic inference under near-epoch dependence. Lee and Yu (2016) apply the spatial Durbin dynamic panel models of under the two-stage least squares (2SLS) and maximum likelihood (ML) estimations to investigate the international spillover of economic growth through bilateral trade, showing that inclusion of Durbin terms can be important. Arbia and Paelinck (2003b) estimate regional convergence for 119 European regions using a Lotka–Volterra approach.Footnote 2 They estimate a difference equation system using ordinary least squares (OLS), where each region is represented by one equation, and use the stability conditions of this system to determine convergence. This approach implies that convergence depends on the parameters of the region itself as well as that of other regions. Following Arbia and Paelinck (2003a, b), Ditzen (2018) uses a general Lotka–Volterrra model to estimate convergence for 93 countries. This model employs an equation with a spatial time lag and common factors. The combination of spatial dependence and common factors has recently received attention in the literature and is summarized by Elhorst et al. (2016). The estimated model includes common factors and potentially heterogeneous slopes and dynamics in a way that is well motivated by the theoretical model and the current spatial econometrics literature (see Ertur and Musolesi 2017). Ditzen (2018) also considers multiple potential channels for cross-country spillovers captured by the spatial time lag. Here, the possible channels considered are trade, foreign direct investment, and high-skilled migration. After controlling for global spillovers using common factors, foreign direct investment and migration are found to produce the most substantial effects on spatial interactions. However, neither of these studies considered direct and indirect effects.

In this paper, we analyze convergence in relative income using a Schumpeterian technology diffusion model, allowing for spatial interdependence among 11 Asian countries from 1970 to 2014. Following the technological progress (horizontal innovation) described in the product variety model (Barro and Sala-i-Martin 1997, 2004; Novales et al. 2009) in Sect. 2, we demonstrate the technology transfer mechanism from a leading country in innovation to a follower country in imitation. Through sensitivity analysis, we demonstrate that our results support both the absolute and conditional convergence hypotheses even in the Schumpeterian technology diffusion model. We allow knowledge spillover and spatial interdependence among Asian countries in Sect. 3. Technology stock is measured as the total factor productivity (TFP) of the Asian country relative to that of the leader country, which in our models is the USA. This is regarded as a technology transfer mechanism. The product of the two variable ratios can represent the absorption capacity for accepting the technology of a leader country. Asian countries have a considerable geographical distance from the USA. The USA, the technological leader, is not included in the spatial weights and as a cross-sectional unit in this paper.Footnote 3 The diffusion of technological innovation from the USA spreads through transactions such as FDI (field investment type), productivity, or trade between Asian countries and the USA, and the analysis of existing technology diffusion models that do not consider spatial interdependency. However, we consider general interaction patterns with the complete structure of interaction between Asian countries. According to the distance weight, technology diffusion is only considered among Asian countries. We derive the empirical equations, CAP model, and GAP model from the Schumpeterian technology diffusion model (vertical innovation) in line with Ertur and Koch (2011), and the NEO model from the neoclassical technology diffusion model in line with Ertur and Koch (2007) and Fischer (2011, 2018).

If we follow the line with Ertur and Koch (2011), we have to model Japan as the technological leader in Asian region.Footnote 4 We assume that not only does the technology leader spread knowledge to Asian countries, but also that Asian countries contribute to technology diffusion to the leading country. We consider general interaction patterns with the complete structure of interaction between Asian countries. We then test the extended Ertur and Koch growth model (EK12_NEO model and EK12_CAP model) allowing spatial interdependence among 12 Asian countries to test the convergence hypothesis in Sect. 4.4. We also analyze whether the rate of income convergence between Asian countries including Japan is slowing or accelerating. This is because Japan (advanced country) and Asian developing countries may have different steady states of income. The convergence rate may be slower than when Japan is not included.

Following Lesage and Pace (2009) and Elhorst (2012, 2014), we estimate an unconstrained dynamic spatial Durbin panel model (DSDM) using a maximum likelihood, which allows for the inclusion of fixed effects and spatial interaction effects. Following LeSage and Pace (2009), we also estimate the direct, indirect, and total effects to yield an interpretation of the spatial spillover effects. This rest of the paper is organized as follows. Section 2 presents a model of Schumpeterian technological diffusion between two countries and shows the absolute and conditional convergence through the impulse responses to stochastic productivity shock. Section 3 establishes the dynamic spatial Durbin panel model that considers spatial interdependence. Section 4 is devoted to our empirical findings and results of estimation for the dynamic spatial panel model. Section 5 concludes the paper.

2 The Schumpeterian technological diffusion and convergence

2.1 The technological diffusion model

We consider the Schumpeterian growth model elaborated by Aghion and Howitt (1992), Howitt (2000), and Ertur and Koch (2011). We also follow the technological diffusion model described by Barro and Sala-i-Martin (1997, 2004) and Novales et al. (2009). Technological innovation can lead to an increase in the variety of producer products (horizontal innovation). Copying or adapting an intermediate good from the leader country to be used in a follower country has a fixed cost. However, the cost of adapting commodities from the leader country is usually lower than the cost of innovation. The follower country benefits from the low costs of imitation and advantages with respect to the discovery or implementation of frontier technologies. There is room for exploration of mechanisms of technology diffusion, including the interaction between innovation and technology diffusion. The final good is produced in a country according to the production function, and the law of motion of stochastic productivity is:

$$ \begin{aligned} Y_{t} & = A_{t} L_{t}^{1 - \alpha } \mathop \smallint \limits_{0}^{{Q_{t} }} x\left( v \right)_{t}^{\alpha } {\text{d}}v = \theta_{t} AL_{t}^{1 - \alpha } \mathop \smallint \limits_{0}^{{Q_{t} }} x\left( v \right)_{t}^{\alpha } {\text{d}}v,\quad 0 < \alpha < 1, \\ \ln \theta_{t} & = \phi \ln \theta_{t - 1} + \varepsilon_{t} ,\varepsilon_{t} \sim\,{\text{iid }}\,N\left( {0,\sigma_{\varepsilon }^{2} } \right),\quad 0 < \phi < 1, \\ \end{aligned} $$
(1)

where \( Y_{t} \) and \( L_{t} \) denote the output of the final good and the labor input at time t, \( Q_{t} \) measures the number of product varieties, and \( x\left( v \right)_{t} \) is the flow output of the \( v \)-th intermediate good \( v \in \left[ {0,Q_{t} } \right] \) used at time \( t \). We assume for simplicity a constant labor supply, \( L_{t} = L, \forall t \). The level of technology \( A_{t} = \theta_{t} A \) is random, and \( { \ln }\theta_{t} \) evolves according to an autoregressive process with random innovation, \( \varepsilon_{t} \). Based on the profit maximization condition for the producer of the final good, we find that monopoly prices for intermediate goods are all the same, \( P_{vt} = P = 1/\alpha \). With these monopoly prices, the demand for each intermediate good is \( x_{vt} = x_{t} = L\left( {\theta_{t} A\alpha^{2} } \right)^{{1/\left( {1 - \alpha } \right)}} \). A representative household maximizes the expected value of discounted time aggregate utility over an infinite horizon, with the single period budget constraint represented as

$$ \max_{{\left\{ {c_{t} ,a_{t + 1} } \right\}}} U = E_{0} \mathop \sum \limits_{t = 0}^{\infty } \beta^{t} \frac{{c_{t}^{1 - \sigma } - 1}}{1 - \sigma },{\text{st}}.\,\,c_{t} + a_{t + 1} = w_{t} + \left( {1 + r_{t} } \right)a_{t} . $$
(2)

where \( \beta \) is the discount rate, \( c_{t} \) is consumption per capita, \( \sigma \) is a constant relative risk aversion, \( a_{t} \) is per capita assets, \( w_{t} \) is the wage rate, and \( r_{t} \) denotes the interest rate. The representative consumer solves the optimization problem subject to budget constraints. The leader country is labeled as country 1 and the follower country as country 2.

The follower country can either copy goods that were previously invented in the leader country or innovate and develop its own intermediate goods. We also assume that shocks in the leader country influence the follower country, while shocks in the follower country do not affect the leader country. Adapting an intermediate good to be used in country 2 requires a fixed cost, \( v_{2t} \). \( v_{2t} \) increases with the proportion of commodities that has been copied by country 2 such as, \( v_{2t} = \eta_{2} \left( {\frac{{Q_{2t - 1} }}{{Q_{1t - 1} }}} \right)^{b} \) where \( \eta_{2} \) is the fixed cost of innovation, \( Q_{2t} < Q_{1t} \), and \( b > 0 \) is the elasticity of imitation cost function. When \( Q_{2t - 1} < Q_{1t - 1} \), the cost of imitation will be lower than the cost of innovation, \( v_{2t} < \eta_{2} \). The country 2 starts with fewer intermediate goods than the country 1: \( Q_{20} < Q_{10} \) and \( v_{20} < \eta_{2} \). We assume that an agent who pays \( v_{2t} \) would obtain a perpetual production monopoly for that intermediate good of country 2. The present value from imitating the \( v \)-th intermediate good in country 2 is the same as the imitation cost if there is free entry. Expressions in country 2 for the quantities of the intermediate good produced and of total output, as well as the flow of monopoly profits, are similar to those equations in country 1. By the same argument, the representative consumer solves the optimization problem subject to budget constraints. We introduce auxiliary variables, \( \mu_{1t} \equiv \theta_{1t}^{{\frac{1}{1 - \alpha }}} \), \( A_{1}^{'} \equiv \frac{{A_{1}^{{\frac{1}{1 - \alpha }}} \alpha^{{\frac{\alpha }{1 - \alpha }}} \left( {1 - \alpha } \right)}}{{\eta_{1} /L_{1} }}, \)\( q_{1t} \equiv \frac{{\eta_{1} }}{{L_{1} }}Q_{1t} \), and \( z_{1t} \equiv \frac{{c_{1t} }}{{q_{1t} }} \) for country 1 and then characterize the transitional dynamics of the economy and introduce auxiliary variables, \( \mu_{2t} \equiv \theta_{2t}^{{\frac{1}{1 - \alpha }}} , A_{2}^{'} = \left( {1 - \alpha^{2} } \right)A_{2}^{{\frac{1}{1 - \alpha }}} \alpha^{{\frac{2\alpha }{1 - \alpha }}} L_{2} , q_{2t} \equiv \frac{{\eta_{2} }}{{L_{2} }}Q_{2t} \), \( z_{2t} \equiv \frac{{c_{2t} }}{{q_{2t} }} \) for country 2. Therefore, we can express the relationship \( q_{1t + 1} /q_{1t} = \mu_{1t} A_{1}^{'} \left( {1 + \alpha } \right)\alpha^{{\frac{\alpha }{1 - \alpha }}} + 1 - z_{1t} \) for country 1. By combining the global constraint of resources and the Euler condition and describing the imitation cost function written in terms of these new variables, we have a system of two dynamic equations: a control variable, \( z_{2} \); and a state variable, \( v_{2} \). It is important to note that \( \left\{ {\mu_{1t} } \right\} \) and \( \left\{ {z_{1t} } \right\} \) are obtained for the country 1, but affect the country 2. Hence, shocks in country 1 influence country 2, while shocks in country 2 do not affect country 1. We introduce variables in log differences with respect to the steady state: \( \hat{z}_{2t} = \ln \left( {\frac{{z_{2t} }}{z}} \right) \) and \( \hat{\mu }_{2t} = \ln \left( {\mu_{2t} } \right) \). Hence, the approximation of two dynamic equations can be written as,

$$ \varGamma_{0} \left( {\begin{array}{*{20}c} {E_{t} \hat{z}_{2t + 1} } \\ {\hat{v}_{2t + 2} } \\ \end{array} } \right) + \varGamma_{1} \left( {\begin{array}{*{20}c} {\hat{z}_{2t} } \\ {\hat{v}_{2t + 1} } \\ \end{array} } \right) + \varGamma_{2} \left( {\begin{array}{*{20}c} {\hat{\mu }_{2t} } \\ {\hat{z}_{1t} } \\ {\hat{\mu }_{1t} } \\ \end{array} } \right) = 0, \Rightarrow \left( {\begin{array}{*{20}c} {E_{t} \hat{z}_{2t + 1} } \\ {\hat{v}_{2t + 2} } \\ \end{array} } \right) = \varGamma_{3} \left( {\begin{array}{*{20}c} {\hat{z}_{2t} } \\ {\hat{v}_{2t + 1} } \\ \end{array} } \right) + \varGamma_{4} \left( {\begin{array}{*{20}c} {\hat{\mu }_{2t} } \\ {\hat{z}_{1t} } \\ {\hat{\mu }_{1t} } \\ \end{array} } \right) , $$
(3)

where \( \varGamma_{3} = - \varGamma_{0}^{ - 1} \varGamma_{1} \), \( \varGamma_{4} = - \varGamma_{0}^{ - 1} \varGamma_{2} \). A well-determined equilibrium arises when \( \varGamma_{3} \) has one stable and one unstable eigenvalue.Footnote 5\( \left\{ {\mu_{1t} } \right\} \) and \( \left\{ {z_{1t} } \right\} \) are obtained for the country 1 that affects the country 2. The shocks \( \left\{ {\theta_{1t} } \right\} \) and \( \left\{ {Q_{1t} } \right\} \) in the country 1 influence the country 2. By computing the numerical solution to the model, we obtain the relationships \( \left\{ {y_{1t} } \right\}_{t = 0}^{T} \) and \( \left\{ {y_{2t} } \right\}_{t = 0}^{T} \). The steady state level of output for the leading and following country is obtained by

$$ Y_{1t} = L_{1} A_{1}^{{\frac{1}{1 - \alpha }}} \alpha^{{\frac{2\alpha }{1 - \alpha }}} \theta_{1t}^{{\frac{1}{1 - \alpha }}} Q_{1t} \,{\text{and}}\,Y_{2t} = L_{2} A_{2}^{{\frac{1}{1 - \alpha }}} \alpha^{{\frac{2\alpha }{1 - \alpha }}} \theta_{2t}^{{\frac{1}{1 - \alpha }}} Q_{2t} $$
(4)

2.2 Absolute and conditional convergence

Through sensitivity analysis, we determine whether this result supports the absolute or conditional convergence hypothesis, which follow the neoclassical growth model. We start from realizations for stochastic productivity shock in country 1 rather than for productivity shocks in both countries. We assume that the time series for the productivity shock in country 2 is set to zero at all points in time. While computing the numerical solution to the model (3), we used the benchmark parameterizations physical capital share of final good sector \( \alpha \) = 0.36, discount factor \( \beta \) = 0.96, impulse shock of size in the leading country \( \varepsilon_{1} \) = 0.1, and inverse of the intertemporal elasticity of substitution \( \sigma \) = 1.5 as common parameters in the literature, Novales et al. (2009) and Benhabib et al. (2017). The calibrated parameters are technology level of final good sector \( A_{1} \) = 1, population size \( L_{1} \) = 1, innovation cost \( \eta_{1} \) = 1, persistence parameter of the stochastic process of shock in sector of final good \( \phi_{1} \) = 0.9, standard deviation of the innovation in the stochastic process for theta \( \sigma_{\varepsilon 1} \) = 0.0 for the leader country. Impulse responses to a productivity shock in country 1 at time 10 of \( \varepsilon_{1} \) = 0.1 are computed. Figure 1 shows that all detrended per capita income variables experience permanent effects due to shocks in productivity. We obtain income (y1) in country 1, where it increases and shows the fastest adjustment to new steady state levels.

Fig. 1
figure 1

Absolute and conditional convergence: Impulse responses of income per capita to technology shock. (1) All variables are detrended. (2) y1: income per capita for leader country, y2–y2D: income per capita for following country. (3) The vertical axis represents the per capita income, and the horizontal axis represents the flow of time

Figure 1 also shows the time evolution of incomes in country 2 for sensitivity analysis at different values of parameters in Table 1. The calibrated parameters for follower country are \( A_{2} \) = 0.96 (< \( A_{1} \) = 1), \( L_{2} \) = 1, \( \eta_{2} \) = 1, \( \phi_{2} \) = 0.9, \( \sigma_{\varepsilon 2} \) = 0.001, and elasticity of the imitation cost function \( b \) = 0.5. However, income (y2) in country 2 increases more gradually and approaches new higher steady state levels. We generate the income realization y2A for the technology level of the final good sector with \( A_{2} \) = 0.98 and elasticity of the imitation cost function with \( b \) = 0.5. y2A approaches a new higher steady state level than y2 with \( A_{2} \) = 0.96 and \( b \) = 0.5. We assume that elasticity of the imitation cost function increases with \( b \) = 0.98, maintaining \( A_{2} \) = 0.98. The time series for consumption y2B shows higher transitional dynamics than that of y2A but eventually approaches the same steady state level as y2A. These results in the Schumpeterian technology diffusion model support the conditional convergence hypothesis. In a steady state, this benchmark model tends to generate a pattern of convergence for growth rate. Simulation y2C uses \( A_{2} \) = 1 and \( b \) = 0.5, where the technology level of the final good sector is greater than \( A_{2} \) = 0.98. The income (y2C) realization converges to the level of y1 in country 1. This result in the Schumpeterian technology diffusion model supports the absolute convergence hypothesis. We then consider stochastic productivity shocks in both countries. After an impulse shock in country 1 at time 10 of \( \varepsilon_{1} \) = 0.1, shocks in country 2 at time 30 of \( \varepsilon_{21} \) = 0.08(< \( \varepsilon_{1} \) = 0.1), follow sequentially.Footnote 6 We compute two income variables, y2D from a shock of \( \varepsilon_{21} \)(30). However, these values eventually approach new lower steady state levels than in country 1. This result supports the conditional convergence hypothesis, which follows the neoclassical growth model.

Table 1 Absolute and conditional convergence: sensitivity analysis

Conclusively, these results support both the absolute and conditional convergence hypotheses, which follow the neoclassical growth model. Improvements in government policy and technology level represented by increases in \( A_{2} \) cause country 2 to become intrinsically superior. This indicates that the government of country 2 should adopt policies that are more favorable to incentives encouraging production and investment or more effective enforcement of property rights. Numerical analysis suggests that, when the government is more efficient, imitation cost becomes more elastic, technological shock is greater, and technological absorptive power is greater, convergence to the leading country becomes faster.

3 The dynamic spatial Durbin panel model

Ertur and Koch (2011) consider a multicountry Schumpeterian growth model introducing technological diffusion. They assume that this productivity parameter is defined as follows: \( \lambda_{i} = \lambda \prod_{j = 1}^{n} \left( {\frac{{A_{jt} }}{{A_{it} }}} \right)^{{\gamma_{i} v_{ij} }} \) where the \( v_{ij} \) parameters present the specific access of country \( i \) to the accumulated knowledge of all countries, and productivity is a function of technology gap (\( \frac{{A_{jt} }}{{A_{it} }} \)) of country \( i \) with respect to its own technology frontier. The \( v_{ij} \) parameters present the specific access of country \( i \) to the accumulated knowledge of all other countries, assuming \( \sum_{j = 1}^{n} v_{ij} = 1 \) for \( i = 1, \ldots ,n \). This hypothesis can be as international spillover effect or as spatial externalities. The parameter \( \gamma_{i} > 1 \) measures the absorption capacity of country \( i \) which is assumed to be a function of its human capital stock (\( H_{i} \)) as \( \gamma_{i} = \gamma H_{i} \). Through the fundamental dynamic equation of aggregate physical capital accumulation and steady state analysis, they obtain the following econometric reduced form:

$$ \ln y_{it} = \beta_{0} + \beta_{1} \ln \frac{{s_{K,i} }}{{n_{i} + 0.05}} + \theta H_{i} \mathop \sum \limits_{j \ne i}^{n} v_{ij } \ln \frac{{s_{K,j} }}{{n_{j} + 0.05}} + \gamma H_{i} \mathop \sum \limits_{j \ne i}^{n} v_{ij} \ln y_{j} + \varepsilon_{i} , $$
(5)

where \( s_{K} \) is the investment rate, \( n_{i} \) is the population growth rate, the coefficient \( \theta \) is associated with the investment rate in physical capital divided by the effective depreciation rate of foreign country \( j \), for \( j = 1, \ldots ,n \), and \( \gamma > 0 \) is the spatial autocorrelation coefficient. They should therefore be estimated using the appropriate spatial econometric methods.

$$ y_{t} = \delta Wy_{t} + X_{t} \beta + WX_{t} \theta + \mu + \xi_{t} I_{N} + \varepsilon_{t} $$
(6)

where \( y_{t} \) denotes an \( N \times 1 \) column vector of the dependent variable for every unit (\( i = 1, \ldots ,N \)) in the sample at time t (\( t = 1, \ldots ,T \)), which represents the relative income between follower and leader countries. \( X_{t} \) presents an \( N \times K \) matrix of explanatory variables. All variables are measured in logarithms. The \( N \times N \) matrix W is a nonnegative weight matrix constructed from geographical distance. Since the spatial econometric model in Eq. (6) contains both X and WX variables, it is known as a spatial Durbin model. The vectors \( \mu = \left( {\mu_{1} , \mu_{2} , \ldots ,\mu_{N} } \right)' \) represent spatial fixed effects, and \( \xi_{t} I_{N} \) represent time-period fixed effects where \( I_{N} \) is an \( N \times 1 \) column vector of ones. The \( N \times 1 \) vector \( \varepsilon_{t} = \left({\varepsilon_{1t}, \ldots,\epsilon_{Nt}} \right)^{\prime} \) consists of i.i.d. disturbance terms, which have zero mean and finite variance \( \sigma^{2} \).

3.1 DSDM for Schumpeterian technology diffusion model (CAP model)

We develop a specification that explicitly includes this technological interdependence. The model will be estimated with a spatial econometric specification. The result obtained from the previous Sect. 2 supports the convergence hypothesis in the Schumpeterian technology diffusion model. Using the diffusion model results, the growth rate of production per worker in country 2 can be expressed in the following form:

$$ \gamma_{2} = \frac{{\frac{{{\text{d}}y_{2} }}{{{\text{d}}t}}}}{{y_{2} }} = \frac{{\dot{y}_{2} }}{{y_{2} }} = \gamma_{1} + G\left[ {\frac{{y_{2} }}{{y_{1} }}, \left( {\frac{{y_{2} }}{{y_{1} }}} \right)^{*} } \right] , $$
(7)

where \( \gamma_{2} = \frac{{\dot{y}_{2} }}{{y_{2} }} \) is the growth rate of per worker income in follower country, \( \left( {\frac{{y_{2} }}{{y_{1} }}} \right)^{*} \) is the steady state income level, and \( \gamma_{1} \) is the growth rate of leading economy. The partial derivatives of the function G satisfy \( G_{1} < 0 \), \( G_{2} > 0 \), and \( G\left( { \cdot , \cdot } \right) = 0 \) if \( \frac{{y_{2} }}{{y_{1} }} = \left( {\frac{{y_{2} }}{{y_{1} }}} \right)^{*} \). Country 2’s growth rate \( \frac{{\dot{y}_{2} }}{{y_{2} }} \) exceeds \( \gamma_{1} \) if \( \frac{{y_{2} }}{{y_{1} }} < \left( {\frac{{y_{2} }}{{y_{1} }}} \right)^{*} \). If the government of country 2 adopts policies that are more favorable to production and investment, the change in policy amounts to an increase in \( A_{2} \). In this case, \( \left( {\frac{{y_{2} }}{{y_{1} }}} \right)^{*} \) increases, and the growth rate \( \frac{{\dot{y}_{2} }}{{y_{2} }} \) increases on impact. This means that the result exhibits conditional convergence.Footnote 7 We now generalize the two-country model to a multicountry model where all countries follow the leader country. As in Eq. (7), the growth rate of country i can be written as:

$$ \log \left( {\frac{{y_{it} }}{{y_{it - 1} }}} \right) = \log \left( {\frac{{y_{1t} }}{{y_{1t - 1} }}} \right) - \omega \left[ {\log \left( {\frac{{y_{it} }}{{y_{1t} }}} \right) - \log \left( {\frac{{y_{i} }}{{y_{1} }}} \right)^{*} } \right] = \varphi + \log \left( {\frac{{y_{1t} }}{{y_{1t - 1} }}} \right) - \omega \left[ {\log \left( {\frac{{y_{it} }}{{y_{1t} }}} \right)} \right] , $$
(8)

where \( y_{it} \) and \( y_{1t} \) are the per capita income of the following country i and leader country 1 at time t, respectively, \( \varphi = \omega \left[ {\log \left( {\frac{{y_{i} }}{{y_{1} }}} \right)^{*} } \right] \), and \( \omega \) is the speed of convergence necessary to reach the income level of the leader country in the diffusion model. Similar to the procedure of beta convergence based on the neoclassical growth model, Eq. (8) can be written as

$$ \log \left({\frac{{y_{it}}}{{y_{1t}}}} \right) = \epsilon + b\log \left({\frac{{y_{it - 1}}}{{y_{1t - 1}}}} \right) = \epsilon + b\left({\frac{1}{1 - \alpha}} \right){ \log }\left[{\left({\frac{{A_{i}}}{{A_{1}}}} \right)\left({\frac{{\theta_{it - 1}}}{{\theta_{1t - 1}}}} \right)} \right] + b{ \log }\left({\frac{{Q_{it - 1}}}{{Q_{1t - 1}}}} \right), $$
(9)

where \( \log \left( {\frac{{y_{it} }}{{y_{1t} }}} \right) \) represents the relative income between follower and leader countries, \( \epsilon = \frac{\varphi}{1 + \omega} \), and \( b = \frac{1}{1 + \omega } \). Let \( R_{it} = q_{it} - q_{it - 1} \), where \( q_{it} = \log \left( {\frac{{y_{it} }}{{y_{1t} }}} \right) \). The ratio of output per worker in both countries depends on the ratios \( \frac{{\theta_{it} }}{{\theta_{1t} }} \) and \( \frac{{Q_{it} }}{{Q_{1t} }} \), where direct measures of \( Q_{1} \) and \( Q_{i} \) or \( \theta_{1} \) and \( \theta_{i} \) would not generally be available. The ratio of intermediate good number \( \frac{{Q_{it} }}{{Q_{1t} }} \) or the ratio of technology shock \( \frac{{\theta_{it} }}{{\theta_{1t} }} \) is replaced by \( {\text{gap}}_{it} = \frac{{{\text{tfp}}_{it} }}{{{\text{tfp}}_{1t} }} \), where \( {\text{tfp}}_{it} \) and \( {\text{tfp}}_{1t} \) are TFP of the follower country and leader country, respectively. The ratio of TFP (\( {\text{gap}} \)) is considered to be a technical shock coming from leader country. This is regarded as a technology transfer mechanism. Furthermore, the product of the two variable ratios can represent absorption capacity to accept the technology of a leader country.

We assume that the ratio of TFP (\( {\text{gap}} \)) is influenced by physical capital externalities because of learning by doing. In line with Ertur and Koch (2011), we assume the following form for the level of technology in region \( i \) at time \( t \), \( g_{it} = \frac{{\dot{A}_{it} }}{{A_{it} }} = \lambda \sigma k_{it}^{\varphi } \mathop \prod \nolimits_{j = 1}^{n} \left( {\frac{{A_{jt} }}{{A_{it} }}} \right)^{{\gamma_{i} v_{ij} }} \) where \( \lambda \) represents productivity parameter, \( A_{jt} \) aggregate level of technology for region \( j \), and \( k_{it} \) per worker physical capital.Footnote 8 The technical parameter \( \varphi \) with \( 0 \le \varphi < 1 \) represents the magnitude of externalities generated by \( k_{it} \) within region \( i \), respectively. Each region has differentiated access to foreign technology because of the connectivity terms, denoted by \( W_{ij} \). We assume that \( W \) is a weight matrix based on coordinates. The absorption capacity of country \( i \) (\( {\text{abs}}_{i} \)) is also assumed to be a function of human capital or foreign direct investment as \( \gamma_{i} = \gamma {\text{abs}}_{i} \). The empirical equation for the growth rate of relative income of the technology diffusion model can be written as,Footnote 9

$$ R_{it} = \beta_{0} + \beta_{1} q_{it - 1} + X_{t} {\rm B} + {\text{abs}}_{i} \mathop \sum \limits_{j \ne i}^{n} w_{ij} X_{t} \theta + \gamma_{1} \mathop \sum \limits_{j \ne i}^{n} w_{ij} q_{jt - 1} + \gamma_{2} {\text{abs}}_{i} \mathop \sum \limits_{j \ne i}^{n} w_{ij} R_{jt} + \varepsilon_{it} $$
(10)

where \( X_{t} \) presents a matrix of explanatory variables including technology transfer. In this paper, we consider general interaction patterns with the complete structure of interaction between countries. The leading country diffuses a technology to Asian countries through the gap of TFP. According to the distance weight, technology diffusion is considered only among Asian countries.Footnote 10 After considering the spatial independence among countries, we rewrite Eq. (10) in the DSDM matrix form for the Schumpeterian technological diffusion model:

$$ R_{t} = \delta WR_{t} + Z_{t} \beta + WZ_{t} \theta + \mu + \xi_{t} I_{N} + \varepsilon_{t} $$
(11)

where Z is a matrix of explanatory variables (\( q_{t - 1} \) the initial relative income, \( s_{kt} \) logarithm of rate of physical capital, \( {\text{gap}}_{it} \) the technology gap, and \( {\text{abs}}_{it} \): capacity of absorption). The rate of growth in the neighboring countries also reflects the spatial autocorrelation process implied by technological interdependence. To investigate the null hypothesis that the individual fixed effect and time-period fixed effects are jointly nonsignificant, we perform a likelihood ratio (LR) test. To test whether the spatial lag model or the spatial error model is a more appropriate description of the data than a model without any spatial interaction effects, one may use Lagrange multiplier (LM) tests for a spatially lagged dependent variable and for spatial error autocorrelation, as well as the robust LM tests which test for the existence of one type of spatial dependence conditional on the other (Elhorst 2014). The standard LM tests for spatial dependence in linear and panel regressions are derived under the normality and homoskedasticity assumptions of the regression disturbances. Baltagi and Yang (2013) introduce general methods to modify the standard LM tests such that they become robust against heteroskedasticity and nonnormality. The idea behind the robustification is to decompose the concentrated score function into a sum of uncorrelated terms so that the outer product of gradient can be used to estimate its variance.

The first hypothesis (\( {\text{H}}_{0} : \theta = 0 \)) examines whether the SDM can be simplified to the SAM, and the second hypothesis (\( {\text{H}}_{0} :\theta + \delta \beta = 0 \)) whether it can be simplified to the SEM.Footnote 11 These hypothesis can be assessed with either an LR test or a Wald test. If both hypotheses (\( {\text{H}}_{0} : \theta = 0 \) and \( {\text{H}}_{0} :\theta + \delta \beta = 0 \)) are rejected, the SDM best describes the data. Conversely, if the first hypothesis cannot be rejected, then the SAM best describes the data, provided that the (robust) LM tests also support the SAM. Similarly, if the second hypothesis cannot be rejected, then the spatial error model best describes the data, provided that the (robust) LM tests also pointed to the spatial error model. If one of these conditions is not satisfied, i.e., if the (robust) LM tests point to another model than the Wald/LR tests, then the spatial Durbin model should be adopted. This is because this model generalizes both the spatial lag and the spatial error model. Elhorst (2014) shows that the matrix of marginal effects of the expected value of the dependent variables with respect to the kth explanatory variable is given by the \( N \times N \) matrix,

$$ \left[ {\frac{{\partial E\left( {R_{t} } \right)}}{{\partial z_{1k} }} \cdots \frac{{\partial E\left( {R_{t} } \right)}}{{\partial z_{Nk} }}} \right] = \left( {1 - \delta W} \right)^{ - 1} \left[ {\beta_{k} + \theta_{k} W} \right] $$
(12)

Lesage and Pace (2009) note that interpretation of the estimates can be misleading and advocate decomposing the marginal effects into the direct and indirect effects. The differences between the direct effects and the coefficient estimates are due to feedback effects that arise as a result of impacts passing through neighboring countries and back to the countries themselves. LeSage and Pace define the direct effect as the average of the diagonal elements of the matrix on the right-hand side of (12), and the indirect effect as the average of either the row sums or the column sums of the nondiagonal elements of that matrix and as a summary indicator of the spillover effect.

3.2 DSDM for the extended neoclassical growth model (NEO model)

In the previous section, we set up the Schumpeterian technology diffusion model assuming that the level of technology \( A_{t} = \theta_{t} A \) is random, and \( { \ln }\theta_{t} \) evolves according to an autoregressive process with random innovation, \( \varepsilon_{t} \). Now, we examine the extended MRW exogenous growth model, which supports the convergence hypothesis. In the neoclassical growth model with the labor augmented technological progress (\( A_{t} = A_{0} e^{gt} \)), the equation for the growth rate of output per worker seems similar to Eq. (7). However, the differences are that \( \gamma_{1} \) is replaced by the rate of exogenous technical change, denoted by \( g \); \( \frac{{y_{2} }}{{y_{1} }} \) is replaced by \( \hat{y} \), the country’s output per effective worker; and \( \left( {\frac{{y_{2} }}{{y_{1} }}} \right)^{*} \) is replaced by \( \hat{y}^{*} \), the steady state level of output per effective worker. The neoclassical growth formula in the standard empirical model can be written as

$$ \gamma_{2} = \frac{{\dot{y}_{2} }}{{y_{2} }} = g + \varPsi \left[ {\hat{y}, \hat{y}^{*} } \right] , $$
(13)

where the partial derivatives of the function \( \varPsi \) satisfy \( \varPsi_{1} < 0 \) and \( \varPsi_{2} > 0 \), and \( \varPsi \left( { \cdot , \cdot } \right) = 0 \) if \( \hat{y} = \hat{y}^{*} . \) The value \( \hat{y}^{*} \) depends on elements included in the parameter \( A \), such as government policies, and on willingness to save. Higher values of \( A \) lead to increases in \( \hat{y}^{*} \). The parameter \( g \) is not directly observable and varies over time and across countries. The key aspect of Eq. (13) involves the absolute levels of term \( \varPsi \left( \cdot \right) \), whereas Eq. (7) shows that growth rate depends on a country’s characteristics expressed relative to those in the leader country.

We assume that aggregate level technology in region \( i \), \( A_{it} \) may not only depend on externalities generated by physical and human capital accumulated in that region, but also on the aggregate level of technology of its neighboring economies, \( A_{jt} \) with \( i \ne j \). In line with Ertur and Koch (2007) and Fischer (2011, 2018), we assume the following form for the level of technology in region \( i \) at time \( t \), \( A_{it} = \varOmega_{t} k_{it}^{{\phi_{K} }} h_{it}^{{\phi_{H} }} \mathop \prod \nolimits_{j \ne i}^{n} \left( {A_{jt} } \right)^{{\rho W_{ij} }} \) where \( \varOmega_{t} \) represents some amount of technological knowledge, \( A_{jt} \) is the aggregate level of technology for region \( j \), \( k_{it} \) is the per worker physical capital, and \( h_{it} \) is the per worker human capital. The technical parameters \( \phi_{K} \) with \( 0 \le \phi_{K} < 1 \) and \( \phi_{H} \) with \( 0 \le \phi_{H} < 1 \) reflect the spatial connectivity of \( k_{it} \) and \( h_{it} \) within country \( i \), respectively. Each region has differentiated access to foreign technology because of the connectivity terms, denoted by \( W_{ij} \). Under this assumption, the spatial extension model can be derived from the basic MRW model:

$$ \begin{aligned} \ln \left( { \frac{{y_{it} }}{{y_{it - 1} }}} \right) & = \beta_{0} + \beta_{1} \ln y_{it - 1} + \beta_{2} \ln s_{kit} + \beta_{3} \ln s_{hit} + \beta_{4} \ln \left( {n_{it} + g + \delta } \right) \\ & \quad + \,\theta_{1} \mathop \sum \limits_{j \ne i}^{N} w_{ij} \ln y_{jt - 1} + \theta_{2} \mathop \sum \limits_{j \ne i}^{N} w_{ij} \ln s_{kjt} + \theta_{3} \mathop \sum \limits_{j \ne i}^{N} w_{ij} \ln s_{hjt} \\ & \quad + \,\theta_{4} \mathop \sum \limits_{j \ne i}^{N} w_{ij} \ln \left( {n_{it} + g + \delta } \right) + \rho \mathop \sum \limits_{j \ne i}^{N} w_{ij} \ln \left( { \frac{{y_{jt} }}{{y_{jt - 1} }}} \right) + \varepsilon_{it} \\ \end{aligned} , $$
(14)

The growth rate of income per worker is negative function of the initial level of income per worker reflecting the convergence process with \( \beta_{1} < 0 \) and depends positively on savings in physical and human capital with \( \beta_{2} > 0 \) and \( \beta_{3} > 0 \). The economic growth depends negatively on the effective rate of the capital depreciation rate \( \left( {n_{it} + g + \delta } \right) \) with \( \beta_{4} < 0 \). In the same direction, it also depends on the saving in physical and human capital in the neighboring economies and negatively on their effective rate of capital depreciation rate. The growth rate depends positively on the initial level of income per capita in the neighboring economies. The rate of growth in the neighboring countries also reflects the spatial autocorrelation process implied by technological interdependence. After considering the spatial interdependence among countries, we rewrite Eq. (14) in DSDM matrix form for the extended MRW model:

$$ Y_{t} = \delta WY_{t} + X_{t} \beta + WX_{t} \theta + e $$
(15)

where \( Y_{t} \) is the growth rate of income per worker along the balanced-growth path, W is a weight matrix based on coordinates, and \( e = [\mu_{i} ,\xi_{t} , \varepsilon_{it} ] \).Footnote 12 We consider the extended MRW model including the other possibly time-varying variables affecting growth, which are FDI, the technology gap, and trade. Therefore, the explanatory variable vector, \( X = \left[ {y_{t - 1} , s_{kt} , n_{t} , s_{ht} ,fdi_{t} ,gap_{t} ,trd_{t} } \right] \), is presented.

The approach Lee and Yu (2016) proposed to obtain consistent results is a bias correction procedure of the parameters estimates obtained by the direct approach based on maximizing the likelihood function. This paper adopts the bias correction procedure when spatial fixed effects are controlled for, and time-period fixed effects are included.

4 Empirical results of the dynamic spatial Durbin panel model

4.1 Data

We consider data for the following 11 Asian countries over the period 1970–2014: China, Hong Kong, Indonesia, India, Korea, Malaysia, Philippines, Singapore, Sri Lanka, Thailand, and Taiwan.Footnote 13 We consider the USA as a leader country to observe the effects of the influence of technological innovation on the 11 Asian countries. Data for real income per capita, the physical capital, human capital investment share of GDP, and population growth rate are drawn from the Penn World Table 9.0 (Feenstra et al. 2015). Real income per capita is obtained by dividing real GDP at constant 2011 national prices in millions of US dollars by population in millions. Growth rate is based on average annual income growth rate during the time period. The relative income represents \( q_{it} = \ln (GDP_{it} /GDP_{USA t} ) \), where \( GDP_{it} \) is the real per worker GDP of Asian country i and \( GDP_{USA t} \) is the real per worker GDP of the leader country, the USA. A human capital index was estimated using data regarding average years of schooling from Barro and Lee (2013) and rates of return on education from Psacharopoulos and Patrinos (2004). Physical capital is obtained as the share of gross capital formation at current PPPs and export ratio as the share of merchandise exports at current PPPs. Exports by a follower country to leader countries facilitate the absorption of superior technologies from leaders. The increase in trade occurs due to product quality improvement and technological innovation in response to the demands of the importing country. FDI data are obtained by U.S. direct investment in Asian countries using balance of payments and direct investment position data obtained from the Bureau of Economic Analysis. As several empirical studies mention the importance of distinguishing the technology diffusion effects and productivity effects, we use both the ratios of FDI/physical capital and FDI/GDP in Asian countries.Footnote 14 The TFP level in an Asian country at current purchasing power parity (PPP) at USA = 1 is obtained. We use a weight matrix W based on coordinates (distance from latitude and longitude), where W is row-standardized.Footnote 15

4.2 Results for extended neoclassical growth model

Now, we test Eq. (15) for extended neoclassical growth model (NEO model) allowing spatial interdependence among 11 Asian countries to test the convergence hypothesis. To investigate the null hypothesis that the individual fixed effect and time-period fixed effects are jointly insignificant, we perform a likelihood ratio (LR) test. The LR test for joint insignificance of the individual fixed effects at income per capita is not rejected at 10% level (1.882, with 11 degrees, of freedom, p = 0.998). Conversely, the LR test for the joint insignificance of the time-period fixed effects is rejected at the 1% level (144.8, with 45 degrees, of freedom, p = 0.00). Based on these results, we include the time-period effects in the tested model. Table 2 reports our estimation results to determine whether the SAM or the SEM is more appropriate when we adopt a nonspatial panel data model. Provided that time-period fixed effects or both individual and time-period fixed effects are included, the classical LM tests of the hypothesis of no spatially lagged dependent variable term can be rejected only at 1 percent significance and 5 percent significance, respectively. Provided that individual fixed effects are included, both the hypothesis of no spatially lagged dependent variable and the hypothesis of no spatial autocorrelated error term can be rejected when using robust LM tests.

Table 2 LM test of for neoclassical technology diffusion model (NEO model)

Since the nonspatial model is rejected based on the results of Table 2, we estimate the spatial Durbin specification for Asian countries to test whether it can be simplified to the SAM or the SEM (general-to-specific approach), as shown in Table 3. We perform the Wald and LR tests to determine whether the SDM can be simplified to the SAM. The results of the Wald test (36.72, p = 0.00) and LR test (43.33, p = 0.00) indicate that the hypothesis can be rejected. Similarly, the hypothesis that the SDM can be simplified to the SEM cannot be rejected using a Wald test (37.12, p = 0.00) or an LR test (42.12, p = 0.00). This result indicates that SDM is the most appropriate specification for the relationship.

Table 3 LR and Wald tests for neoclassical technology diffusion model (NEO model)

The results for the estimated coefficients containing spatially lagged dependent variables of the spatial Durbin model are reported in Table 4. The Hausman test can be used to test whether the random effects model is more appropriate than the fixed effects model. Our results (58.17, p < 0.01) for per capita income indicate that the random effects model must be rejected. The estimated coefficient for the spatially lagged dependent variables (W \( \times \)dep.var) is 0.419 (p = 0.00), which is positive and significant. The results show that the income growth rate is influenced by per capita incomes in neighboring location countries as well.

Table 4 Estimation results for neoclassical technology diffusion model (NEO model)

Provided that individual fixed effects, time-period fixed effects or both individual and time-period fixed effects are included, the coefficient for the initial per capita income is − 0.089 (p = 0.00), − 0.085 (p = 0.00), and − 0.125 (p = 0.00), respectively, which are negative and significant. Following LeSage and Pace (2009), we estimate direct and indirect effects to yield an interpretation of the spatial spillover effects. We first consider the SDM with individual fixed effects. The direct effect of initial income is estimated to be − 0.094 at a significance level below 1%. The statistically significant indirect effect is − 0.059. The total effect is thus − 0.154, which is significant below the 1% level. The total effect of these feedback sources is greater than when we consider only the direct effect.

We focus on a model with both individual and time-period fixed effects. The estimates of direct, indirect, and total effect of initial income are − 0.119 (p = 0.00), − 0.108 (p = 0.00), and − 0.227 (p = 0.00), respectively, which are significant and negative at a level below 1%. These results support the conditional convergence hypothesis for per capita incomes in the 11 Asian developing countries. All of them are much larger than results presented in the previous literature. Elhorst et al. (2010) also pointed out that the use of the fixed effect panel model generates a higher convergence rate.

The coefficients 0.034 (p = 0.00) and 0.292 (p = 0.00) of the physical investment rate and human capital investment rate, respectively, on the growth of per capita income are positive, as predicted, and are significant. However, the coefficient of population growth on the growth of per capita income is also positive, which is not predicted by theory and is not significant. The coefficient of TFP is 0.246 at a significant level below 1%. The coefficients of FDI and export over GDP are 0.018 and 0.056 below 5% and 1%, respectively. When we compare the total effects of the estimated variables, the coefficient (0.343, p = 0.00) of TFP is greater at 1% than the coefficient (0.356, p = 0.12) of human capital, which is not significant at a level of 10% or greater. We find that the total effects of human capital investment rate, total factor productivity, and trade on relative income growth are positive and significant .

4.3 Results for Schumpeterian technology diffusion model

We first follow the previous research determining that the technology shock of the leader country is transmitted through FDI or trade.Footnote 16 Therefore, we incorporate variables to include the international transfer of technology into the diffusion model discussed above. We then test whether technology is transmitted either by FDI (\( fdi \)) or by international trade (\( trd \)) and whether the transmission is sped up by the human capital (\( s_{h} \)) or TFP (gap). After allowing spatial interdependence among Asian countries, we can write DSDM matrix form for the growth rate equation of relative income (GAP model)Footnote 17:

$$ R_{t} = \delta WR_{t} + Z_{t} \beta + W{\rm Z}_{t} \theta + e $$
(16)

where the explanatory variable matrix, \( Z = [q_{t - 1} ,s_{k} , s_{h} , gap, trd, fdi \), \( (fdi \times s_{h} ) \), \( \left( {fdi \times gap} \right) \), \( \left( {trd \times s_{h} } \right) \), \( \left( {trd \times gap} \right) ]\). Since the nonspatial model is rejected by classical (robust) LM tests, we estimate the spatial Durbin specification GAP models in order to test whether it can be simplified to the SAM or the SEM, as presented in Table 5. The hypothesis that the SDM can be simplified to the SAM or the SEM is not rejected by a Wald test or LR test. This indicates that SDM is the most appropriate specification for the relationship.

Table 5 Likelihood Ratio and Wald tests for spatial dependence (GAP model)

The results of the Hausman test indicate that the random effects model must be rejected. We do not report the results of models with individual fixed or time fixed effects to save space. We focus on a model with both individual and time-period fixed effects. For the same reason, we present only the total effects of the estimation results including the direct and indirect effect of explanatory variables. Each model includes a variable of technology transfer channel such as \( (fdi \times s_{h} ) \) at the GAP(1), \( (fdi \times gap) \) at the GAP(2), \( (trd \times s_{h} ) \) at the GAP(3), and (\( trd \times gap \)) at the GAP(4). Estimated coefficients for the lagged relative income \( q_{t - 1} \) are significant and negative at a level below 1%, respectively, and indicate convergence among Asian countries. The negative and significant coefficients for spatially lagged relative income imply that regional economies are disadvantaged by income growth in neighboring countries. In other words, relative income in a given country is negatively associated with income in the surrounding countries. The total effect of the human capital investment rate, TFP, and trade on the growth of relative income is positive and significant in the GAP models. However, the coefficients of other explanatory variables (W \( \ln s_{k} \), W \( \ln fdi \), \( W \)(\( \ln fdi \times \ln s_{h} \)), \( W(\ln fdi \times \ln gap \)), \( W \)(\( \ln trd \times \ln s_{h} \))) on the growth of relative income are not strong enough to reject the null hypothesis. The total effect (0.310, p = 0.00) of the technology transfer channel (\( trd \times gap \)) at the GAP(4) is significant at a level below 1%, and (\( fdi \times gap \)) at the GAP(2) is only significant at a level below 5%, respectively (Table 6).

Table 6 Estimation results for Schumpeterian technology diffusion model (GAP model)

Now, we test the convergence hypothesis for the Schumpeterian technology diffusion model (CAP model) among 11 Asian countries. New technologies which are considered to be a technical shock (the ratio of TFP, \( gap \)) coming from leading country are diffused through a variety of channels. Growth rate \( \left( {R_{t} } \right) \) depends positively on the initial relative income (\( q_{t - 1} \)), the ratio of technology shock (\( gap \)), and the absorption capacity in the neighboring economies. We test how this shock of technology is transferred in combination with the absorption capacity of the Asian country, physical capital (\( s_{k} \)), human capital (\( s_{h} \)), FDI (\( fdi \)), and trade (\( trd \)), among other factors. After allowing spatial interdependence among Asian countries, we can rewrite Eq. (11) in DSDM matrix form:

$$ R_{t} = \delta WR_{t} + Z_{t} \beta + WZ_{t} \theta + e $$
(17)

where the explanatory variable matrix, \( Z = [q_{t - 1} , gap \), \( (gap \times s_{k} ) \), \( (gap \times s_{h} ) \), \( \left( {gap \times fdi} \right) \), \( \left( {gap \times fdi\left( 1 \right)} \right) \), \( \left( {gap \times trd} \right)] \), \( fdi \) represents FDI/physical capital, and \( fdi \)(1) represents FDI/GDP. W is a weight matrix based on coordinate and \( e = [\mu_{i} ,\xi_{t} , \varepsilon_{it} ]. \)

To investigate the null hypothesis that the individual fixed effect and time-period fixed effects are jointly insignificant, we perform a likelihood ratio (LR) test. The LR test for joint insignificance of the individual fixed effects at income per capita is rejected at the 1% level. Based on these results, we include the individual and time-period fixed effects in the tested model. Provided that both individual and time-period fixed effects are included, the classical (robust) LM tests of the hypothesis of no spatially lagged dependent variable term can be rejected only at 1 percent significance. We did not present these results of LR and LM test for space limit.

Since the nonspatial model is rejected, we estimate the spatial Durbin to test whether it can be simplified to the SAM or the SEM as shown in Table 7. We perform the Wald and LR tests to determine whether the SDM can be simplified to the SAM or the SDM can be simplified to the SEM. The results of the Wald and LR tests for all CAP models indicate that the both hypothesis can be rejected. These findings indicate that SDM is the most appropriate specification for the relationship.Footnote 18

Table 7 LR and Wald tests for Schumpeterian technology diffusion model (CAP model)

We focus on a model with both individual and time-period fixed effects to save space. For the same reason, only total effects, excluding the direct effect and indirect effect of the estimated coefficients, are reported. The results for the estimated coefficients containing a spatially lagged dependent variables of spatial Durbin model are reported in Table 8. The Hausman test can be used to test whether the random effects model is more appropriate than the fixed effects model. Our results (54.25, p < 0.01) for the CAP(1) model, (52.34, p < 0.01) for the CAP(2) model, and (142.98, p < 0.01) for the CAP(3) model indicate that the random effects model must be rejected, respectively. The estimated coefficient for the spatially lagged dependent variables (W \( \times \)dep.var) is negative and insignificant. The results do not show that the relative income growth rate is influenced by per capita incomes in neighboring location countries as well.

Table 8 Estimation results for Schumpeterian technology diffusion model (CAP model)

Provided that both individual and time-period fixed effects are included, the coefficient for the initial relative income is − 0.121 (p = 0.00), − 0.122 (p = 0.00), and − 0.1127 (p = 0.00) for each CAP model, respectively, which are negative and significant. Following LeSage and Pace (2009), we estimate the direct and indirect effects to yield an interpretation of the spatial spillover effects. The estimates of total effect of initial relative income are − 0.205 (p < 0.01) for the CAP(1) model, − 0.194 (p < 0.01) for the CAP(2) model, and − 0.150 (p < 0.01) for the CAP(3) model which are significant and negative at a level below 1%, respectively. These results support the conditional convergence hypothesis among Asian countries.

The coefficients and total effects of absorption capacity, \( gap \times s_{k} \) and \( gap \times s_{h} \), are different from theoretical positive expected signs. One possible reason is that ratio of total factor productivity of the following country and leader country, technology gap (\( gap) \), is decreasing. Otherwise, the investment ratios of Asian countries are too high which may lead to the economies over-accumulated, then have a small negative effect on economic growth. The total effects of \( \ln gap \) and \( gap \times trd \) are positive and significant in all CAP models. However, the estimated coefficients of \( gap \times s_{k} \), \( gap \times s_{h} \), \( gap \times fdi \), and \( gap \times fdi\left( 1 \right) \) are either not significant or are negative. The shock of technology is transferred by absorption capacity in combination with trade in Asian countries.Footnote 19 Therefore, Asian countries should further promote trade liberalization, a channel for technology transfer, and open more markets.

4.4 Results for extended EK12_NEO and EK12_CAP model

We consider Japan as a leader country to observe the effects of the influence of technological innovation and the convergence on the 11 Asian countries. The Japanese economy had greatly influenced the economic growth of Asian developing countries. Following the technology diffusion model, described in Ertur and Koch (2011), we include Japan among 11 Asian countries to see its influence on neighboring developing countries. We assume that not only does the technology leader spread knowledge to Asian countries, but also Asian countries contribute to technology diffusion to the leading country. There is a feedback effect from technological followers to the technological leader. We consider general interaction patterns with the complete structure of interaction between Asian countries. Therefore, we allow knowledge spillover and spatial interdependence among 12 Asian countries. According to the distance weight, technology diffusion is considered only among Asian countries.

We now consider data for the following 12 Asian countries over the period 1970–2014: China, Hong Kong, Indonesia, India, Japan, Korea, Malaysia, Philippines, Singapore, Sri Lanka, Thailand, and Taiwan. Data for empirical tests are also drawn from the Penn World Table 9.0 and the Bureau of Economic Analysis. Real income per capita is obtained at the constant 2011 national prices and growth rate is based on the average annual income growth rate during the time.

Now, we test the extended Ertur and Koch growth model (EK12_NEO model and EK12_CAP model) allowing spatial interdependence among 12 Asian countries to test the convergence hypothesis. This result indicates that SDM is the most appropriate specification for the relationship except the EK(2) including the explanatory variable, tfp. The results for the estimated coefficients containing spatially lagged dependent variables of the spatial Durbin model are reported in Table 9. The estimated coefficients for the spatially lagged dependent variables (W \( \times \)dep.var) are 0.432 (p = 0.00), 0.424 (p = 0.00) and 0.438 (p = 0.00) which are positive and significant, respectively. The results show that the income growth rate is also influenced by per capita incomes in neighboring location countries. Provided that both individual and time-period fixed effects be included, the coefficient for the initial per capita income is − 0.055 (p = 0.00), − 0.081 (p = 0.00), and − 0.092 (p = 0.00), respectively, which are negative and significant at a level below 1%. These results support the conditional convergence hypothesis for per capita incomes in the 12 Asian countries. The coefficients of TFP, productivity, and trade which present technology diffusion channel are 0.013 (p = 0.00), 0.174 (p = 0.00) and 0.552 (p = 0.00) at a significant level below 1%, respectively. As predicted from EK(4) model, the coefficients 0.102 (p = 0.00) and 0.297 (p = 0.00) of the physical investment rate and human capital investment rate, respectively, on the growth of per capita income are positive and significant.

Table 9 Estimation results for neoclassical technology diffusion model (EK12_NEO model)

We follow previous research which determined that the technology shock of the leader country is transmitted through FDI, productivity, or trade. We test whether technology is transmitted either by FDI (\( fdi \)), productivity (\( tfp \)), or by international trade (\( trd \)) and whether the transmission is sped up by human capital (\( s_{h} \)). Each EK model includes a variable of a technology transfer channel such as \( (fdi \times s_{h} ) \) at the EK(5), \( (tfp \times s_{h} ) \) at the EK(6), and (\( trd \times s_{h} \)) at the EK(7). The estimated coefficients for the spatially lagged dependent variables (W \( \times \)dep.var) are 0.397 (p = 0.00), 0.394 (p = 0.00) and 0.402 (p = 0.00) which are positive and significant, respectively. The results show that the income growth rate is also influenced by per capita incomes in neighboring location countries. The coefficients of absorption capacity, \( (tfp \times s_{h} ) \) at the EK(6), and (\( trd \times s_{h} \)) at the EK(7) are positive and significant. However, the estimated coefficient of \( (fdi \times s_{h} ) \) at the EK(5) is insignificant and negative. The technology shock is transferred by absorption capacity in combination with trade or productivity in Asian countries. The coefficient for the initial per capita income is − 0.030 (p = 0.00), − 0.025 (p = 0.00), and − 0.054 (p = 0.00), respectively, which are negative and significant at a level below 1%. These results support the conditional convergence hypothesis for per capita incomes in the 12 Asian countries. We find that Japanese technological progressive spillover to neighboring Asian countries and Japanese economy play a part in accelerating the convergence of Asian economies (Table 10).

Table 10 Estimation results for Schumpeterian technology diffusion model (EK12_CAP model)

5 Conclusion

In this paper, we analyze convergence in relative income through technology transfer and absorption capacity among 11 Asian countries over the period 1970–2014. Following Barro and Sala-i-Martin (1997, 2004), and Novales et al. (2009), we present a model of Schumpeterian technology diffusion between two countries, one being a leader country in innovation, and the second being a follower that adopts the technology developed in the leader country. Through sensitivity analysis, we find that the results support both the absolute and conditional convergence hypothesis in Schumpeterian technology diffusion model.

We then extend the model to a multinational diffusion model to test the convergence hypothesis. Following Ertur and Koch (2007, 2011), Lesage and Pace (2009), and Elhorst (2010), we construct a dynamic spatial Durbin model (DSDM) for technology diffusion allowing spatial for interdependence. For an empirical test, we measure the technology shock (ratio of technology stocks of the follower country and the leader country) using the variable gap, which is measured by total factor productivity of the follower country relative to that of the leader country, the USA. We also assess the channels through which this shock of technology is transferred in combination with the absorption capacity of the following country, physical capital, human capital, and FDI.

DSDM is the most appropriate specification for the models based on the Schumpeterian technology diffusion results. The estimated coefficients on the lagged relative income are significant and negative at a level below 1%. These results support the convergence hypothesis among Asian countries. The total effect of human capital investment rate, TFP, and trade on the growth of relative income is positive and significant. However, the coefficients and total effects of absorption capacity through physical and human capital vary from theoretical positive expected signs. One possible reason is that ratio of total factor productivity of the following country and leader country is decreasing. Otherwise, the investment ratios of Asian countries are too high which may be lead to the economies over-accumulated and then have a small negative effect on economic growth. The shock of technology is transferred by absorption capacity in combination with trade in Asian countries. Therefore, Asian countries should further promote trade liberalization, a channel for technology transfer, and markets.