1 Introduction

A key assumption in the theory of spatial general equilibrium (SGE) is that capital is perfectly mobile within countries, which ensures that rates of return to capital are equated across the economy (Roback 1982; Krugman 1991; Glaeser and Gottlieb 2009). Although internal labor mobility may be inhibited by personal attachments to places (Nocco 2009), such frictions do not extend to capital, which is expected to be perfectly mobile. Empirical investigation of PICM (hypothesis of perfect internal capital mobility) has been impeded by lack of data. There are no data on regional or spatial returns to capital, which may be used to test the hypothesis that spatial returns to capital are equated. Nor are there data on regional capital stocks that may be used to investigate PICM indirectly, by testing restrictions on the relationship between regional capital stocks. It is no doubt for these reasons that we are unable to cite previous empirical investigations of internal capital mobility.Footnote 1

By contrast, there are numerous studies of international capital mobility, which are not reviewed here. Capital is expected to be more mobile internally than internationally since the latter involves exchange rate risk as well as country risk. Therefore, absence of international capital mobility does not necessarily presage the same for internal capital mobility.

We use capital stock data for Israel, constructed using the methodological proposal in Beenstock et al. (2011), to carry out indirect tests of PICM. Unfortunately, direct tests of PICM are not feasible since, in Israel as elsewhere, data on spatial returns to capital are not available. Specifically, annual capital stock data are generated for nine regions of Israel during 1987–2010. These data are used to test PICM according to the theory described in the next section, which predicts that the elasticities between pairs of regional capital–labor ratios should be unity. Since there are nine regions there are 36 pairs of regional capital ratios with which to test PICM. We test the joint hypothesis that these 36 pairwise elasticities are not significantly different from unity. Therefore, we do not require that each individual pairwise elasticity is unity.

Since the data for capital–labor ratios are nonstationary, OLS estimates of pairwise elasticities may be spuriously equal to unity. If, however, pairwise log differences between capital–labor ratios are stationary, these log capital–labor ratios are cointegrated with a unit elasticity, suggesting that capital is perfectly mobile between these pairs. To test the joint hypothesis of PICM we use critical values developed for panel cointegration tests, which do not require that all pairwise log capital–labor ratios be cointegrated. If the group augmented Dickey–Fuller statistic (GADF) is less than its critical value, the joint hypothesis that the 36 estimates of pairwise elasticities equal unity cannot be rejected, in which event the null hypothesis of PICM cannot be rejected. A methodological complication is that the 36 pairwise comparisons are unlikely to be independent because each region is involved in eight separate comparisons. The implications of this complication are discussed.

In summary, this is mainly an applied study that uses conventional econometric methods to test canonical theory regarding internal capital mobility.

2 Theory

Each region is assumed to produce homogeneous output using a common Cobb–Douglas production technology:

$$\begin{aligned} Q_j =A_j K_j^\alpha L_j^{1-\alpha } \end{aligned}$$
(1)

where j labels regions, Q, K and L denote output, capital and labor, and A denotes total factor productivity (TFP). The cost of capital in region j is \(\hbox {r} + \updelta -\,\hbox {s}_{\mathrm{j}}\) where r is the national rate of interest, \(\updelta \) is the rate of depreciation assumed to be the same across regions, and s denotes a subsidy to capital investment set by regional policy with respect to region j. It is assumed that all firms can raise finance at the rate r. Firms are assumed to equate the marginal product of capital (MPK) to the cost of capital, hence:

$$\begin{aligned} { MPK}_j =r+\delta -s_j =\alpha A_j k_j^{\alpha -1} \end{aligned}$$
(2)

where \(\hbox {k} = \hbox {K}/\hbox {L}\). PICM implies that Eq. (2) applies across all regions, in which case the capital–labor ratios (k) in regions i and j should be related:

$$\begin{aligned} A_j k_j^{\alpha -1}= & {} A_i k_i^{\alpha -1} +d_{ji} \nonumber \\ d_{ji}= & {} (s_i -s_j )/\alpha \end{aligned}$$
(3)

where \(\hbox {d}_{\mathrm{ji}}\) denotes the degree to which capital investment in region j is subsidized relative to region i, scaled by the inverse of the elasticity of output with respect to capital (\(\upalpha \)). According to Eq. (3) the gap between \(\hbox {MPK}_{\mathrm{j}}\) and \(\hbox {MPK}_{\mathrm{i}}\) increases with the gap between subsidies to capital in regions i and j. In the absence of regional investment subsidies (\(\hbox {d}_{\mathrm{ji}}= 0\)) Eq. (3) implies that MPK is equated between regions i and j so that:

$$\begin{aligned} \ln k_j =\mu _{ji} \ln k_i +\frac{1}{1-\alpha }\ln \left( {\frac{A_j }{A_i }} \right) \end{aligned}$$
(4)

where \(\upmu _{\mathrm{ji}} = 1\) if capital is perfectly mobile between j and i. PICM predicts that the elasticities between pairs of \(\hbox {k}_{\mathrm{j}}\) and \(\hbox {k}_{\mathrm{i}}\) are unity.

3 Bringing theory to data

In our empirical study for Israel there are 9 regions and 23 annual observations (1987–2010). If \(\hbox {d} = 0\) Eq. (4) may be written as:

$$\begin{aligned} \ln k_{jt} =\mu _{ji} \ln k_{it} +\frac{1}{1-\alpha _{ji} }\left( {\ln A_{jt} -\ln A_{it} } \right) +u_{jit} \end{aligned}$$
(5)

where u denotes a residual. There are \(\hbox {N} - 1\) pairwise comparisons for region j, so the total number of pairwise estimates of Eq. (5) is 1/2N(\(\hbox {N} - 1\)), which in our case is 36. Suppose that the panel data for lnk happen to be nonstationary (as we show below) but they are difference stationary, i.e. \(\hbox {lnk}_{\mathrm{j}}\) and \(\hbox {lnk}_{\mathrm{i}}\) are integrated time series of order 1, denoted by I(1). Estimation of Eq. (5) requires data on regional TFP, which in turn requires data on gross regional product. In the absence of such data, we assume that regional TFPs share common stochastic trends generated by \(\hbox {lnA}_{\mathrm{jt}} = \hbox {lnA}_{\mathrm{t}} + \hbox {e}_{\mathrm{jt,}}\) where \(\hbox {A}_{\mathrm{t}}\) refers to national TFP assumed to be difference stationary, and e denotes idiosyncratic shocks to regional TFP. The log differences in TFP in Eq. (5) are stationary because \(\hbox {lnA}_{\mathrm{jt}}- \hbox {lnA}_{\mathrm{it}} = \hbox {e}_{\mathrm{jt}} - \hbox {e}_{\mathrm{it}}\). Since I(1) time series such as \(\hbox {lnk}_{\mathrm{i}}\) are asymptotically independent of I(0) time series such as \(\hbox {e}_{\mathrm{j}} - \hbox {e}_{\mathrm{i}}\), OLS estimates of \(\upmu \) and \(\upalpha \) are independent. Therefore, even if TFP is unobserved, it is possible to test hypotheses regarding \(\upmu \) which should be unity if capital is perfectly mobile. In this case the test simplifies to:

$$\begin{aligned} \ln k_{jt} =\psi _{ji} +\mu _{ji} \ln k_{it} +u_{jit} \end{aligned}$$
(6)

which is estimated with T observations, and where \(\uppsi _{\mathrm{ji}}\) equals the log difference in average TFP between regions j and i, and \(\hbox {u}_{{\mathrm{jit}}}\) is the residual at time t from the ji’th pairwise time series regression. These residuals are unlikely to be independent for two reasons. First, \(\hbox {u}_{\mathrm{ji}}\) and \(\hbox {u}_{\mathrm{jk}}\) are directly related because they both involve region j. Second, \(\hbox {u}_{\mathrm{ji}}\) and \(\hbox {u}_{\mathrm{mn}}\) may be indirectly related through \(\hbox {u}_{\mathrm{jm}}\) and \(\hbox {u}_{\mathrm{in}}\). The econometric implications of this cross-section dependence are discussed below.

Unfortunately, the assumption that regional TFPs share common stochastic trends cannot be investigated empirically because regional data for capital stocks are required to measure regional TFP.Footnote 2 Whereas this assumption may be less plausible internationally since national TFP gaps may increase over time, it might be more justified intra-nationally, and especially in small countries such as Israel, within which productivity shocks are more likely to be common. As a robustness check, we assume that \(\hbox {lnA}_{\mathrm{jt}} - \hbox {lnA}_{\mathrm{it}}\) is trend stationaryFootnote 3 by specifying a deterministic time trend in Eq. (6) with coefficient \(\uptau _{\mathrm{ji}}\).

We distinguish between strong PICM where Eq. (4) holds in every time period, and weak PICM where it only holds in the long run. In either case the residuals (u) must be stationary so that \(\hbox {lnk}_{\mathrm{j}}\) and \(\hbox {lnk}_{\mathrm{i}}\) are cointegrated. In the former case, u must be serially independent for otherwise error correction would imply that PICM does not apply in the short-run. In the latter case error correction occurs between \(\hbox {lnk}_{\mathrm{i}}\) and \(\hbox {lnk}_{\mathrm{j}}\).

At first Eq. (6) is estimated imposing the restriction that \(\upmu _{\mathrm{ji}} = 1\) for all regional pairs. If the residuals (\(\hbox {u}_{\mathrm{ji}}\)) are stationary \(\hbox {lnk}_{\mathrm{j}}\) and \(\hbox {lnk}_{\mathrm{i}}\) are cointegrated, in which case PICM applies to regions j and i. If all pairwise residuals are stationary PICM applies in all regions. If the residuals are not stationary PICM is rejected. In the latter event Eq. (6) is freely estimated, i.e. without imposing \(\upmu _{\mathrm{ji}} = 1\). If the estimated residuals are stationary, \(\hbox {lnk}_{\mathrm{i}}\) and \(\hbox {lnk}_{\mathrm{i}}\) are cointegrated with \(\upmu _{\mathrm{ji}} < 1\) as estimated, in which case capital is imperfectly mobile between j and i. In this event the average degree of imperfect capital mobility is summarized by \(\upmu \)-bar, which is the average of the estimates of \(\upmu _{\mathrm{ji}}\). If instead these residuals are nonstationary, the hypothesis of imperfect capital mobility is rejected too.

Since capital may be perfectly or imperfectly mobile, and mobility may be strong or weak there are four possible types of corroboration:

  1. i)

    If in Eq. (6) \(\hbox {lnk}_{\mathrm{j}}\) and \(\hbox {lnk}_{\mathrm{i}}\) are cointegrated with \(\hat{{\mu }}_{ji} =1\) and \(\hat{{\lambda }}_{ji} =-1\) the hypothesis of strong PICM is corroborated between j and i.

  2. ii)

    If \(\hat{{\mu }}_{ji} =1\) and \(\hat{{\lambda }}_{ji} >-1\) the hypothesis of weak PICM is corroborated.

  3. iii)

    If \(\hat{{\mu }}_{ji} <1\) PICM is rejected, but capital is imperfectly mobile. If \(\hat{{\lambda }}_{ji} =-1\) imperfect capital mobility is strong.

  4. iv)

    If \(\hat{{\lambda }}_{ji} >-1\) imperfect capital mobility is weak.

  5. v)

    If in Eq. (6) \(\hbox {lnk}_{\mathrm{j}}\) and \(\hbox {lnk}_{\mathrm{i}}\) are not cointegrated, capital is not even imperfectly mobile.

3.1 Cross-section dependence and critical values for panel cointegration tests

Each pair constitutes a separate test of PICM. A joint test of PICM involves the 1/2N(\(\hbox {N}-1\)) pairwise comparisons. Let GADF denote the group average ADF statistic formed by the 1/2N(\(\hbox {N}-1\)) ADF statistics using the residuals of Eq. (6). In panel data the critical value of GADF is normally distributed due to the central limit theorem:

$$\begin{aligned} z=\frac{\sqrt{\frac{1}{2}N(N-1)}\left( {{ GADF}-E({ GADF})} \right) }{{ sd}({ GADF})} \sim N(0,1) \end{aligned}$$
(7)

This formula is similar to the one proposed by Pedroni (1999) for panel cointegration tests. E(GADF) denotes the expected value of GADF under the null hypothesis of no cointegration, and its standard deviation is denoted by sd(GADF). We use the critical values of E(GADF) and sd(GADF) calculated by Pedroni (1999, 2004) for heterogeneous panel data, because each region has T observations for \(\hbox {N} - 1\) comparisons. These critical values would be appropriate in the absence of cross-section dependence, i.e. where \(\hbox {u}_{\mathrm{ji}}\) and \(\hbox {u}_{\mathrm{jk}}\) are independent for all j, i and k. However, as noted, these residuals are unlikely to be independent within and between pairs. Since Pedroni assumed cross-section independence, his critical values are also likely to be too “liberal”. We therefore calculate the residual cross-section correlation matrix (\(\uprho _{\mathrm{ji}}\)), which should be diagonal under the null hypothesis. We use the Breusch–Pagan LM test statistic for cross-section dependence (BP), which is expected to be zero under the null hypothesis of no cross-section dependence. BP is unbiased provided \(\hbox {N} < \hbox {T}\) (Sarafides and Wansbeek 2012), which applies here.

If this null hypothesis is rejected, there are two main possibilities; cross-section dependence may be weak or strong (Chudik et al. 2011). In the former case the cross-section dependence is spatial and localized (Anselin 1988) whereas in the latter case the dependence is generic and is induced by common factors. In the former case shocks to \(\hbox {u}_{\mathrm{ji}}\) dissipate across space, whereas in the latter case they do not. Pesaran (2015) suggested the following test for weak cross-section dependence:

$$\begin{aligned} { CD}=\sqrt{\frac{{ TN}(N-1)}{2}}\bar{{\hat{{\rho }}}} \approx N(0,1) \end{aligned}$$
(8)

where \(\bar{{\hat{{\rho }}}}\) is the average estimate \(\uprho _{\mathrm{ji}}\). If this average is not significantly different from zero, the null hypothesis of weak cross-section dependence cannot be rejected. Unlike BP, CD depends on the sign of \(\uprho \). For example, if \(\hbox {N} = 3\), \(\hbox {T} = 100\) and the three values of \(\uprho _{\mathrm{ji}}\) are 0.4, 0.4 and −0.8, \(\hbox {CD} = 0\) but \(\hbox {BP} = 96\), in which the cross-section dependence is weak (spatial). If instead the values of \(\uprho _{\mathrm{ji}}\) are 0.4, 0.4 and 0.8, BP still equal 96 but \(\hbox {CD} = 32\), in which case the cross-section dependence is strong. In general, both types of cross-section dependence may be present.

Cross-section dependence should lower the critical value for GADF below Pedroni’s critical values. This means that if these pairs of lnk are not panel cointegrated using Pedroni’s critical values, they cannot be panel cointegrated according to panel cointegration tests with cross-section dependence.

The spatial variant of Eq. (6), which allows for weak cross-section dependence is:

$$\begin{aligned} \ln k_{jt}= & {} \psi _{ji} +\mu _{ji} \ln k_{it} +\delta _{ji} \ln \widetilde{k}_{jt} +\pi _{ji} \ln \widetilde{k}_{it} +u_{jit} \\ \widetilde{k}_{jt}= & {} {\mathop {\sum }\limits _{n\ne j}^N} {w_{jn} } k_{nt} \nonumber \\ \widetilde{k}_{it}= & {} {\mathop {\sum }\limits _{n\ne i}^N} {w_{in} k_{nt}}\nonumber \end{aligned}$$
(9)

where w denote spatial weights row-summed to one, \(\widetilde{k}_j\) denotes the spatial lagged dependent variable, and \(\widetilde{k}_i\) denotes its spatial Durbin counterpart. If the data are stationary the spatial lag coefficients (\(\updelta \) and \(\uppi \)) need to be estimated by maximum likelihood (Anselin 1988). However, if the data happen to be nonstationary, OLS estimates of these parameters are super-consistent (Beenstock and Felsenstein 2015) because \(ln\tilde{k}\) is I(1) whereas u is I(0). The elasticity of k in region j with respect to k in region i is:

$$\begin{aligned} \mu _{ji} +\delta _{ji} \frac{\partial \ln \widetilde{k}_j }{\partial \ln k_i}+\pi _{ji} \frac{\partial \ln \widetilde{k}_i }{\partial \ln k_i} \end{aligned}$$
(10)

which depends on the direct elasticity \(\upmu \) as well as the two spatial elasticities. Critical values for spatial panel cointegration have been calculated by Beenstock and Felsenstein (2017, chapter 7).

For strong cross-section dependence Pesaran (2006) has suggested the common correlated effects (CCE) estimator in which Eq. (6) is specified as:

$$\begin{aligned} \ln k_{jt} =\psi _{ji} +\mu _{ji} \ln k_{it} +\kappa _{ji} \ln \bar{{k}}_t +u_{jit} \end{aligned}$$
(11)

where \(\bar{k}\) denotes the average capital–labor ratio in period t and \(\kappa _{ji}\) denotes the loading of this common factor on the relation between \(\hbox {k}_{\mathrm{j}}\) and \(\hbox {k}_{\mathrm{i}}\). Critical values for panel cointegration tests using CCE have been calculated by Banerjee and Carrion-I-Silvestre (2011, 2014).

A final variant of Eq. (6) takes account of regional rates of capital subsidy, which represent s in Eq. (3). If the subsidy to capital investment increases in region j relative to region i, \(\hbox {k}_{\mathrm{j}}\) is expected to increase relative to \(\hbox {k}_{\mathrm{i}}\). We use data on cumulative regional investment grants to represent s:

$$\begin{aligned} \ln k_{jt} =\psi _{ji} +\mu _{ji} \ln k_{it} +\eta _{ji} (s_{jt} -s_{it} )+u_{jit} \end{aligned}$$
(12)

where \(\upeta \) is expected to be positive.

4 Data

Annual capital stock data for 9 regions of Israel (see Fig. 1) were constructed during 1987–2010 using the method proposed by Beenstock et al. (2011). The capital stock comprises plant and machinery. This method calculates capital investment in plant directly from regional data on building completions (square meters) in the business sector published by the Central Bureau of Statistics (CBS). The perpetual inventory method is used to construct regional capital stocks for plant in square meters, which are converted into constant price shekels using the implicit deflator for plant in the base year. Finally, CBS data for machinery capital at the national level (measured in constant price shekels) is allocated to the 9 regions according to the ratio of machinery to plant across the economy, i.e. plant and machinery are strict complements.Footnote 4 For example, if the value of plant in a region is \(\$100\), and the ratio of machinery to plant across the economy is 1.3, the value of machinery in the region is imputed to be \(\$130\) so that \(\hbox {K} = \$230\). Whereas Garofalo and Yamarik (2002) allocate the national capital stock using regional shares in value added; we use direct measures of plant. In the absence of direct measures of machinery, we allocate national machinery according to regional shares in plant. This allocation rule is more direct than the one used by Garofalo and Yamarik. Data on employment (L) for these 9 regions were constructed from Labor Force Surveys (CBS). Since geographic disaggregation in these surveys is not continuously available prior to 1987, this determines the starting point for our investigation.

Fig. 1
figure 1

Map

Capital–labor ratios (k) are plotted in Fig. 2 in 1000s of shekels at 2005 prices. Haifa region stands out as the most capital-intensive region of Israel because heavy industry has been concentrated in Haifa since Ottoman times and during the British Mandate (1921–1948). There are persistent and substantial differences between capital–labor ratios in the rest of the country. In 1987 the Dan region was the least capital intensive, but by 1996 it exchanged positions at the bottom of the distribution with Krayot. The Tel Aviv region, which in 1987 was in \(4{\mathrm{th}}\) position, temporarily moved up to \(2{\mathrm{nd}}\) position in 2003. On the whole,positions in the distribution appear to be quite stable. Following the wave of mass migration from the former USSR (1989–1995) capital–labor ratios naturally decreased especially in Haifa, which absorbed many immigrants. Subsequently, capital–labor ratios recovered, eventually surpassing what they were in the late 1980s. Because capital–labor ratios tend to increase over time, they cannot be stationary.

Fig. 2
figure 2

Capital–labor ratios. Thousands of shekels at 2005 prices

Fig. 3
figure 3

Investment grants (shekels at 2005 prices)

Since 1967 the Ministry of Trade and Industry (now the Ministry of Economics) has operated an Investment Center, which provides investment grants as part of its regional development policy. Businesses in designated regional development zones (A, B and C) are eligible to apply to the Investment Center for investment grants, which are awarded as percentages of the total investment, and which are highest in zone A and lowest in zone C. Priority is given to export businesses, and to industry rather than to services. The criteria have varied over time as have the zones eligible for regional development support. Figure 3 plots the allocation of investment grants at constant 2005 prices by the Investment Center in each of the 9 regions. The main beneficiaries have been the North and South, and since 2007 the budget of the Investment Center has been cut-back considerably. By contrast, the central regions (excluding Jerusalem) have received almost nothing. Figure 4 plots cumulative (since 1967) development grants received by the 9 regions.

Fig. 4
figure 4

Cumulative capital investment by the investment center (shekels at 2005 prices)

Table 1 Panel unit root tests

Panel unit roots tests are reported in Table 1 for the data in Figs. 2 and 3. The IPS statistics assumes that there is no cross-section dependence between the panel units, whereas the CIPS statistics assume that there is strong-cross section dependence. The IPS statistic confirms that lnk is difference stationary, as does the CIPS statistic. Matters are more complicated in the case of the data in Fig. 4 where both IPS and CIPS suggest that lns may be stationary. The problem is that for most regions lns maybe stationary, but in North, South and Jerusalem it is clearly nonstationary. These stationary components of lns cannot be cointegrated with lnk. Matters may obviously be different for North, South and Jerusalem.

5 Results

Results based on Eq. (6) are reported in Table 2. We begin by assuming that capital is perfectly mobile by imposing \(\upmu _{\mathrm{ji}} = 1\), hence the elasticity between pairs of capital–labor ratios is unity in columns 2 and 3. Results are reported for two auxiliary hypotheses concerning unobserved TFP. If TFP is hypothesized to be difference stationary GADF is −0.69, which easily exceeds its critical value of −2.00. Hence, the null hypothesis of PICM is clearly rejected. The BP statistic indicates the presence of cross-section dependence, implying that GADF* (which assumes cross-section independence) is too “liberal”. Therefore, the rejection of PICM is even stronger than its nominal pvalue.

If, however, TFP is hypothesized to be trend stationary, matters are different. GADF decreases sharply from −0.69 to −2.25 but is still larger than its critical value. The BP statistic decreases sharply too, hence although cross-section dependence remains, it has weakened. The pvalue of GADF is approximately 0.12, hence the rejection of PICM is not overwhelming. On the other hand, cross-section dependence implies the effective p value is larger than 0.12.

Table 2 Estimates of Equation (6)

In columns 4 and 5 of Table 2 \(\upmu _{\mathrm{ji}}\) is estimated by OLS and the average elasticity is reported at 0.701 when TFP is hypothesized to be difference stationary, and 0.986 when TFP is hypothesized to be trend stationary. GADF is statistically significant in the former case and is almost statistically significant in the latter case. In both cases cross-section dependence is smaller than in columns 2 and 3. These results suggest that if capital is imperfectly mobile, the degree of mobility is quite high. However, the continued evidence of cross-section dependence undermines the reliability of these results.

Table 3 Estimates of Eq. (11): common correlated effects

5.1 Robustness checks

Since the CD statistics in Table 2 reject the null hypothesis of weak cross-section (spatial) dependence, there is little purpose in testing PICM with respect to the spatial model in Eq. (9). On the other hand, Table 3 reports estimates of Eq. (11) since CCE is motivated by the presence of strong cross-section dependence. The CCE estimate of GADF in column 2 is much smaller than its OLS counterpart in Table 2. Nevertheless, it is not statistically significant because the critical value for GADF for CCE is considerably smaller than its counterpart in Table 2. Therefore, the CCE test of PICM rejects the null hypothesis. Moreover, when \(\upmu \) is freely estimated as in columns 4 and 5, the estimate of \(\upmu \) turns out to be zero. Notice that although CD tends to be smaller, as expected, in Table 3 than in Table 2, it continues to be statistically significant. Therefore, CCE does an imperfect job in capturing strong cross-section dependence.

Table 4 Estimates of Eq. (12)

The results in Table 2 ignore the potential effect of regional subsidies to investment, which are taken into consideration in Eq. (12), presented in Table 4. The estimate of \(\upeta \) (not reported) in Eq. (12) is positive as predicted by theory. However, the GADF statistics, which tend to be smaller than their counterparts in Table 2, fall short of their critical values. However, the pvalue of GADF in column 4 is approximately 0.1.

6 Conclusions

In the absence of data on regional returns to capital, data for regional capital–labor ratios in Israel are used to carry out indirect tests of the hypotheses of perfect and imperfect internal capital mobility. This issue is important because spatial general equilibrium theory assumes that capital is perfectly mobile within countries. Indeed, this is the first time that tests of internal capital mobility have been undertaken. If capital is perfectly mobile the elasticity between pairs of capital–labor ratios is expected to be 1 under Cobb–Douglas technologies.

Since the panel data in the study are nonstationary, these tests are carried out using panel cointegration methods where each panel consists of pairs of regions. Annual data for almost a quarter of a century are used for 9 regions in Israel, so there are 36 pairwise panels. Matters are complicated by the presence of strong cross-section dependence between these pairwise panels, which weakens standard panel cointegration tests. We therefore carry out robustness checks for strong cross-section dependence induced by common factors. Pairwise differences in unobserved regional total factor productivity are captured by pairwise fixed effects. Robustness checks are also carried out regarding unobserved regional differences in total factor productivity over time, which are hypothesized to be stationary or trend stationary. Finally, robustness checks are carried out with respect to the effect of regional investment grants on the elasticity between regional capital stocks.

The hypothesis of perfect instantaneous internal capital mobility is categorically rejected. Matters are more nuanced in the case of perfect long-term internal capital mobility depending on auxiliary hypotheses regarding total factor productivity. For example, if TFP is hypothesized to be trend-stationary, the null hypothesis cannot be rejected at pvalues of about 0.12, although it is rejected at conventional levels of significance. The same applies to the hypothesis of long-run imperfect capital mobility, regardless of how cross-section dependence is specified. However, if regional TFP differentials are assumed to be difference stationary, the hypothesis of imperfect capital mobility cannot be rejected at conventional levels of statistical significance when cross-section independence is ignored. In this case the average elasticity between pairwise capital–labor ratios is approximately 0.7 instead of 1. In summary, although the hypothesis of instantaneous perfect capital mobility is overwhelmingly rejected, the same does not apply to the hypothesis of long-term perfect capital mobility.

The usual disclaimers apply. First, if capital is miss-measured the results could be induced by measurement error. Since plant is measured directly the main source of measurement error would be in the regional allocation of machinery. Beenstock et al. (2011) have shown that similar measures of capital provide a good empirical account of regional wages over time, which increases confidence in the quality of the data. Moreover, the results should be robust with respect to stationary measurement error since the methodology is based on panel cointegration.

Second, the results are for Cobb–Douglas technologies, which are conveniently loglinear. In the CES case, for example, where:

$$\begin{aligned} Q_j =A_j [aK_j^\rho +(1-a)L_j^\rho ]^{1/\rho } \end{aligned}$$
(13)

it may be shown that PICM implies that the pairwise elasticities are:

$$\begin{aligned} \frac{\partial \ln k_j }{\partial \ln k_i }=\frac{1-\alpha \frac{k_i^\rho }{\alpha k_i^\rho +1-\alpha }}{1-\alpha \frac{k_j^\rho }{\alpha k_j^\rho +1-\alpha }} \end{aligned}$$
(14)

The elasticity is greater than 1 when \(\hbox {k}_{\mathrm{j}}\) exceeds \(\hbox {k}_{\mathrm{i}}\), and it tends to 1 as \(\uprho \) tends to zero, as in the Cobb–Douglas case. The elasticity varies inversely with \(\hbox {k}_{\mathrm{i}}\) and varies directly with \(\hbox {k}_{\mathrm{j}}\). Consequently, testing PICM for CES technologies involves nonlinear spatial panel cointegration theory, which has not yet attracted attention in the literature on spatial panel data, and therefore lies beyond the present terms of reference. Nevertheless, Eq. (14) serves to remind us that indirect empirical tests of PICM depend on auxiliary hypotheses regarding the technology of production.

This unsatisfactory situation would not have arisen had there been data on regional returns to capital, in which event PICM could have been tested directly. Even incomplete data on regional returns to capital would have provided an opportunity for external validation of the results obtained from indirect tests. In the complete absence of data on regional returns to capital external validation remains elusive.

If in a small country such as Israel it is difficult to find evidence in favor of perfect or even imperfect capital mobility, in large countries it might be even more difficult. At the very least, the results challenge the consensus that instantaneous perfect internal capital mobility is an innocuous empirical assumption in spatial general equilibrium theory.