1 Introduction

This paper contributes to the debate on growth and inequality in Italy. Italy represents one of the most interesting cases of a regional divide (Maffezzoli 2006). Although the Northern and Central parts of the country exhibit labor productivity levels comparable to the most industrialized high-income economies, the Southern regions are characterized by labor productivity levels that are similar to those of middle-income countries. This aspect has been investigated in several empirical papers that differ in the periods analyzed and in the techniques employed. The empirical literature is equally divided between parametric and nonparametric models. Despite the relevant number of papers, there is no general consensus on the major research questions. Some of these papers conclude that Italian regions do not display a common process of convergence; at best, only club convergence can be observed (Byrne et al. 2009; Piacentino and Vassallo 2011). In contrast, Maffezzoli (2006) found evidence of absolute β- and σ- convergence. Furthermore, the roles played by total factor productivity (TFP), technological catch-up, technological change, and physical and human capital accumulation as determinants of economic growth remain unclear. For instance, the results of Maffezzoli (2006), Di Liberto et al. (2008), and Conti (2009) suggest a strong impact of TFP on labor productivity, whereas Scoppa (2007) and Piacentino and Vassallo (2011) show that the role of TFP is reduced when human capital is taken into account.

From a methodological point of view, most of these papers are in accordance with growth accounting literature, which assumes a Cobb-Douglas production function. However, the use of a Cobb-Douglas specification can create misleading assumptions because the neutrality of the technology changes and yields biased results in the presence of inefficiencies (Färe et al. 1994).Footnote 1 In contrast, Data Envelopment Analysis (DEA) is a nonparametric technique that does not require an a priori specification of the functional form and allows inferences to be drawn when it is used in a bootstrap setting (Simar and Wilson 2008).

In this paper, DEA is employed to analyze the evolution of the labor productivity of 20 Italian regions from 1980 to 2006. Based on papers by Kumar and Russell (2002) and Henderson and Russell (2005), labor productivity is reassessed as efficiency changes (the change in the distance from the best-practice frontier), technology changes (shifts of the frontier), and physical and human capital change (movements along the frontier due to changes in the level of physical and human capital).

A further contribution of this study is that the distribution of regional output per worker and its dynamics between 1980 and 2006 are examined in detail. In fact, as discussed by Quah (1997) and Kumar and Russell (2002), approaches based only on the first moment of labor productivity distribution provide only a partial view of the process of growth. Instead, we analyze the effects of the four components of decomposition on the evolution of the distribution of regional labor productivity. Nonparametric kernel methods are also applied to test for the significance of the relative contribution of each component for changes in the shape of the distribution.

The primary results are as follows: (1) the differences in physical capital accumulation between Southern regions and the remaining areas of the country are the most important determinants of the Italian economic regional divide; (2) the human capital accumulation is the most important source of growth in the South of Italy; (3) two high-income regions (Lazio and Liguria) exhibit significant technological regress; and (4) the analysis of the density distribution of labor productivity reveals the presence of two convergent clubs. The results of (1) and (2), on the one hand, are the direct consequence of policies in the 1990s, which were devoted to reducing public capital funds and increasing the role of human capital in Southern Italy; on the other hand, they leave open the question of future government intervention to reduce the productive gap in the Southern regions. Result (3) indicates that the wealthiest Italian regions are not able to stimulate growth through innovation.

The rest of the paper proceeds as follows. Section 2 describes the applied methodologies used to categorize the production frontier. Section 3 presents the data. The empirical results are presented in Sect. 4. Finally, Sect. 5 concludes.

2 Methodology

Following Färe et al. (1994), Kumar and Russell (2002), Henderson and Russell (2005), DEA is used to decompose labor productivity growth into indices that describe changes in technology, efficiency, and physical and human capital accumulation. Once the productivity indices are calculated, their confidence intervals are estimated using bootstrap techniques (Simar and Wilson 1999).

Let K t and L t be the inputs used in the production process, capital, and labor at time t, and let Y t be the output. Then, following the literature on regional growth, human capital is included in the production function as a multiplicative augmentation of the physical labor input. Thus, the amount of the augmented labor input for region i at time t is given by \(\hat{L}_{it}=L_{it}h_{it}\), where h it is the average level of human capital.

Let e i,12 be the distance function for region i at time t 1 relative to the technology existing at time t 2. In the same manner, e i,21 denotes the distance function for region i at time t 2 relative to the technology t 1. Then, following Färe et al. (1994), the Malmquist productivity index between period t 1 and period t 2 is given by

$$ PROD={{\mathcal{M}}}(t_{1},t_{2})\equiv \frac{e_{22}}{e_{11}}\times \left[\frac{e_{21}}{e_{22}}\times \frac{e_{11}}{e_{12}}\right]^{\frac{1}{2}}, $$
(1)

where the ratio outside the square brackets measures the change in the output-oriented measure, i.e., the efficiency change (technological catch-up), and the remaining part is a measure of the shift in technology, i.e., technological change.

Under the assumption of constant returns to scale,Footnote 2 the production set can be reduced to a two-dimensional space, with \(\hat{y}=Y/\hat{L}\) and \(\hat{k}=K/\hat{L}\). Thus, the potential outputs per efficiency units of labor in period t 1 and t 2 are computed by \(\bar{y}_{1}(\hat{k}_{1})=\hat{y}_{1}/e_{11}\) and \(\bar{y}_{2}(\hat{k}_{2})=\hat{y}_{2}/e_{22}\). Similarly, \(\bar{y}_{1}(\hat{k}_{2})\) denotes the potential output per efficiency unit of labor at period t 2 of capital intensity using the technology existing at time t 1 and \(\bar{y}_{2}(\hat{k}_{1})\) denotes the potential output per efficiency unit of labor at period t 1 relative to the technology at time t 2. Finally, let \(\tilde{k}_{1}=K_{1}/(L_{1}h_{2})\) be the ratio of capital to labor at period t 1 if human capital does not change in period t 2. Then, \(\bar{y}_{2}(\tilde{k}_{1})\) is the potential output per efficiency unit of labor at \(\tilde{k}_{1}\) using the technology at time t 2. Similarly, \(\bar{y}_{1}(\tilde{k}_{2})\) denotes the potential output per efficiency unit of labor at \(\tilde{k}_{2}\) using the technology at time t 1, where \(\tilde{k}_{2}=K_{2}/(L_{2}h_{1})\) is the ratio of capital to labor at period t 2 if human capital is equal to the level at period t 1. Under the above conditions, the labor productivity index can be decomposed as follows (Kumar and Russell 2002; Henderson and Russell 2005):

$$ \begin{aligned} LABPROD= \frac{y_2}{y_1} &= \frac{e_{22}}{e_{11}} \times\left[\frac{\bar{y}_2(\hat{k}_2)}{\bar{y}_1(\hat{k}_2)} \frac{\bar{y}_2(\hat{k}_1)}{\bar{y}_1(\hat{k}_1)} \right]^{\frac{1}{2}} \times\left[ \frac{\bar{y}_1(\tilde{k}_2)}{\bar{y}_1(\hat{k}_1)} \frac{\bar{y}_2(\hat{k}_2)}{\bar{y}_2(\tilde{k}_1)} \right]^{\frac{1}{2}} \\ & \times\left[\frac{\bar{y}_1(\hat{k}_2)}{\bar{y}_1(\tilde{k}_2)} \frac{(h_2)^2}{(h_1)^2} \frac{\bar{y}_2(\tilde{k}_1)}{\bar{y}_2(\hat{k}_1)} \right]^{\frac{1}{2}} \\ &:= EFF \times TECH \times KACC \times HACC \end{aligned}. $$
(2)

where EFF represents the efficiency change (technological catch-up), TECH identifies the technological change, and KACC and HACC are the physical and human capital accumulation, respectively.

However, Eqs. (1) and (2) do not determine whether the changes in productivity and efficiency are real or merely artifacts. The true production frontiers are unknown and must be estimated from finite samples. Thus, following papers by Simar and Wilson (1998, 1999), a consistent bootstrap estimation procedure is employed to obtain bias-corrected confidence intervals. The idea underlying the bootstrap technique is to approximate the sampling distributions of the indices by simulating the data-generating process (DGP). In the case of individual distance function estimates, the corresponding bias may be substantial. However, as noted by Simar and Wilson (1999), because the components of the Malmquist index, as well the terms in relation (2), are ratios of distance functions, their overall bias may be less than for individual distance function estimates; both the numerator and the denominator are biased in the same direction. Moreover, in relation (2), there is a time dependence that must be taken into account when the efficiency scores are bootstrapped. To accomplish this, the smoothing kernel method is employed to estimate the joint density of efficiency scores in periods t 1 and t 2 [see Simar and Wilson (1999) and “Appendix A” for details) Footnote 3.

3 Data

The panel data set used in this work covers 20 Italian regions from 1980 to 2006. The main data sources are statistics provided by the Italian National Institute of Statistics (ISTAT). In particular, the value added (Y) at the constant 1995 price and the number of workers (L) in a full-time standard measure are obtained at the regional level from regional economic accounts (ISTAT 2006a). The average level of human capital employed in the determination of the augmented labor input (see Sect. 2) is given by \(h_{it}=e^{w_{i}s_{it}}\), where w i represents the time-invariant regional-specific return to schooling estimated by Ciccone (2004) and s it are the regional average years of schooling at time t, which are calculated based on the labor force drawn from the ‘ISTAT Labor Force Survey’ (ISTAT 2006b) by applying the methodology used by Bronzini and Piselli (2009). In particular, we attribute 0 to a person with no qualifications, 5 for completing primary school, 8 for lower secondary school, 10.5 for a professional diploma, 12.5 for people completing secondary education, 15.5 for a “short” degree (laurea breve), and 18 for a standard degree.

Because ISTAT only publishes data on the national stock of net physical capital, the regional stock at the constant 1995 price is obtained from the methodology described by Byrne et al. (2009), who use a matrix of regional shares to disaggregate national stock. In particular, the regional subdivision is based on the regional average share of investments (weight 0.75) and labor units (weight 0.25) in the preceding 15 years (Paci and Pusceddu 2000). Let NK t denote the net capital stock at the national level at time t, let I i,t denote the investment in physical capital for region i at time t, and let L i,t denote the number of workers for region i at time t. The stock of physical capital for region i at time t is equal to

$$ K_{i,t}=\left(0.75\times \overline{IS}_{i,t}+0.25\times\overline{L}_{i,t}\right)\times NK_{t,} $$
(3)

where the regional average share of investments (\(\overline{IS}_{i,t}\)) and labor (\(\overline{L}_{i,t}\)) at time t are computed as follows:

$$ \overline{IS}_{i,t}=\left(\sum\limits_{t-14}^{t}\frac{I_{i,t}} {\sum\nolimits_{i=1}^{20}{I_{i,t}}}\right)/{15},\quad \overline{L}_{i,t}=\left(\sum\limits_{t-14}^{t}\frac{{L_{i,t}}} {\sum\nolimits_{i=1}^{20}{L_{i,t}}}\right)/{15}. $$

Table 1 shows the summary statistics of the variables considered in the analysis. The variables are divided by a scale factor by region, setting the national average equal to 100.

Table 1 Summary statistics

4 Empirical results

4.1 Decomposition of the factors affecting labor productivity

Various approaches exist to estimate regional growth. In the traditional growth accounting literature, Cobb-Douglas or translog specifications are often used. In this paper, we test both the Cobb-Douglas and translog production functions using the procedure proposed by Hsiao et al. (2007). In particular, the test by Hsiao et al. reveals that both the Cobb-Douglas and translog specifications are not appropriate functional forms for describing the productive process at the regional level in Italy.Footnote 4

Thus, the nonparametric technique described in Sect. 2 is employed to determine the labor productivity evolution in Italian regions. Table 2 shows the bias-corrected efficiency scores and the estimated 95 % confidence intervals in 1980 and 2006. The table highlights that Southern regions are characterized by a lower level of efficiency, although their efficiency gap was reduced from 1980 to 2006. We show that Liguria, Lazio, and Valle D’Aosta are the most efficient regions, in line with the results obtained by Maffezzoli (2006) and Piacentino and Vassallo (2011). However, with the exception of Sicily, these regions do not appear to be significantly more efficient than regions located in the Center and North of the country (see Table 2). Furthermore, it is interesting to note that if the DEA efficiencies are considered as estimates, as in the present paper, and the confidence intervals are obtained accordingly, discussion of the regional efficiency rankings may be meaningless.

Table 2 Efficiency scores with estimated 95 % confidence intervals

The components of the labor decomposition from 1980 to 2006 for the 20 regions are reported in the first part of Table 3. Moreover, in the second part of Table 3, the evolution of labor productivity for the three macro-areas (North, Center, and South) are reported by considering two averaging methods: the geometric and the weighted average (Zelenyuk 2006). Because the two aggregation methods produce similar outcomes, we will refer to the results obtained by applying the geometric average.

Table 3 Percentage change of decomposition of labor productivity growth

From the last two columns of Table 3, it can be observed that both the physical and human capital accumulations are powerful determinants of economic growth in the three Italian macro-areas. However, the growth resulting from physical capital accumulation in the South (+11.56 %) is less than that registered in the Center (+22.50 %) and the North (+19.88 %) of the country. Figure 1 shows the physical capital growth of the three macro-areas. The Northern and Central regions show an annual percentage growth of 3.20 and 3.67 %, respectively, whereas the Southern regions are characterized by a slower annual percentage growth (2.03 %), especially after the 1990s, when public capital investments were drastically reduced (Picci 1999; Mastromarco and Woitek 2006; Montanaro 2003; Bronzini and Piselli 2009). In particular, it is interesting to compare the physical capital growth and the changes in capital accumulation over 2-year periods.Footnote 5 Even without testing the effectiveness of public capital on economic growth, the comparison of Fig. 1 and the results in “Appendix C” clearly confirm that the reduction of public intervention has had a negative impact on growth (Mastromarco and Woitek 2006).

Fig. 1
figure 1

Physical capital growth, 1980–2006

The human capital accumulation shows slight differences among the three macro-areas.Footnote 6 However, in the South, human capital accumulation has a greater impact on labor productivity growth than in the remainder of the country. This finding supports the policy devoted to reducing the gap in education levels between the South and the rest of the country.Footnote 7 Moreover, as noted by Di Liberto et al. (2008), the human capital accumulation increases the capacity of a region to absorb new technologies from its neighbors, fostering the process of convergence.

The average contribution of the total factor productivity component to labor growth differs broadly between the three macro-areas. In the South, the value of the component, which is far above the national average, is mostly attributable to the efficiency change. This finding seems to confirm the existence of a moderate process of technological catch-up, on average, between the South and the remaining areas of the country from 1980 to 2006. The results of technological change differ from those obtained in previous papers, which considered a different period of analysis (Maffezzoli 2006; Piacentino and Vassallo 2011). In particular, we discovered that the South has experienced significant technological progress; in other words, the poorer regions seem to have benefited from technology transfer (Bronzini and Piselli 2009). For the two remaining areas, there are no significant changes, and we cannot draw any definitive conclusions. However, it is interesting to note that two wealthy regions (Lazio and Liguria) exhibit a significant technological regress. The technological decline may be the consequence of several factors (that are not mutually exclusive): an input-biased technical change caused by non-neutral technical change (Färe et al. 1997), a different sectoral composition of the regional economies, a different relationship between investments in R&D and productivity, which is stronger in the high-tech sectors (Ortega-Argilés et al. 2010), or a reduction of exports, which is an important factor in stimulating innovation in Italy (Basile 2001). In the cross-country growth literature, technological regression was also found by Kumar and Russell (2002), Färe et al. (2006), Badunenko et al. (2008) and Ceccobelli et al. (2011, 2012). In these studies, the explanations provided for this result differed: for Färe et al. (2006) and Badunenko et al. (2008), the implosion of the frontier could be interpreted as an efficiency decline rather than a real technological regress; in the works of Ceccobelli et al. (2011, 2012), technological regress is the consequence of a declining level of capacity utilization of ICT technologies.

Finally, because the findings obtained from the analysis of Table 3 may depend on the choice of the initial and final time periods, the analysis is repeated over 2-year periods, and the results largely confirm the above conclusions.Footnote 8

4.2 Analysis of productivity distributions

Here, the paper turns to an analysis of the distribution dynamics of labor productivity, which is likely to be more informative than summary measures. We demonstrate the evolution of the distribution of labor productivity in the sample period, and we evaluate the degree to which each of the components of productivity change account for the change in the distribution of labor productivity between 1980 and 2006. We estimate the density of output per worker conditional on the year and present the resulting conditional density f(y|year) in Fig. 2. This figure indicates that the period of study was characterized by a labor productivity improvement, marked by rightward shifts of the respective distributions over time. Moreover, in line with the results of Piacentino and Vassallo (2011), the distribution of labor productivity tends toward a bimodal shape (polarization of the distribution),Footnote 9 highlighting the importance of conducting a distributional analysis.

Fig. 2
figure 2

Nonparametric conditional density f(y|year). Notes Likelihood cross-validation method is used to obtain the bandwidths, with h y  = 0.661 and h year  = 2.255

By using the decomposition of labor productivity growth, it is possible to explore how each of the components affects the productivity distribution over the period. Following Henderson and Russell (2005), labor productivity in 1980 is multiplied by each of the components, introduced in sequence. Figures 3, 4, 5, and 6 show the shifts in the distributions by sequential introduction of the four components of labor productivity. For instance, y E = EFF × y 1980 is the counterfactual distribution of labor productivity, which isolates the effect on the distribution of changes in efficiency, assuming no physical and human capital accumulation and no technological change. This counterfactual distribution is shown as a dashed curve in Panel B of Fig. 3, along with the distribution in 2006. In Panel A, the labor productivity distributions in 1980 and 2006 are shown.Footnote 10 The counterfactual distribution of the variable y ET = EFF × TECH × y 1980 isolates the effect on the distribution of changes in efficiency and technology, assuming no physical and human capital accumulation. This counterfactual distribution is shown as a dashed curve in Panel C of Fig. 3, along with the distribution in 2006. Similarly, Panel D of Fig. 3 highlights the effect on the distribution of capital accumulation, and Figs. 4, 5 and 6 show other sequencing combinations.

Fig. 3
figure 3

Counterfactual distributions of output per worker. Notes in each panel, the solid curve is the actual 2006 distribution of output per worker. In panel a, the dashed curve is the actual 1980 distribution of output per worker. The dashed curves in panels b, c and d are counterfactual distributions isolating, sequentially, the effects of efficiency change, technological change and physical capital accumulation for the 1980 distribution of output per worker

Fig. 4
figure 4

Counterfactual distributions of output per worker. Notes in each panel, the solid curve is the actual 2006 distribution of output per worker. In panel a, the dashed curve is the actual 1980 distribution of output per worker. The dashed curves in panels b, c and d are counterfactual distributions isolating, sequentially, the effects of technological change, physical and human capital accumulation for the 1980 distribution of output per worker

Fig. 5
figure 5

Counterfactual distributions of output per worker. Notes in each panel, the solid curve is the actual 2006 distribution of output per worker. In panel a the dashed curve is the actual 1980 distribution of output per worker. The dashed curves in panels b, c and d are counterfactual distributions isolating, sequentially, the effects of physical capital accumulation, efficiency change and human capital accumulation for the 1980 distribution of output per worker

Fig. 6
figure 6

Counterfactual distributions of output per worker. Notes in each panel, the solid curve is the actual 2006 distribution of output per worker. In panel a the dashed curve is the actual 1980 distribution of output per worker. The dashed curves in panels b, c and d are counterfactual distributions isolating, sequentially, the effects of human capital accumulation, efficiency change and technological change for the 1980 distribution of output per worker

The efficiency change shifted the tails of the distribution, reducing the dispersion of productivity but creating a bimodal distribution.Footnote 11 Using different sequencing combinations, it is evident that no other component leads to bimodality of the distribution Footnote 12. In fact, when the effect of efficiency change is not introduced (see Fig. 4), the bimodal distribution does not emerge.

Technological change seems to play a minor role in the increase of the output per worker. The only visible change is the reduction of the lower tail, confirming the technological progress of the low-income regions. However, physical capital accumulation is the driving force that contributes to increasing the dispersion of the productivity level between high- and low-income regions.

Human capital accumulation also shifts the productivity distribution toward the upper levels. Unlike physical capital, it seems that human capital does not change the shape of the labor productivity distribution. In other words, human capital accumulation has allowed the labor productivity level to increase without a difference across Italian regions.

To complete the previous graphical analysis, we test for the statistical significance of differences between actual and counterfactual distributions. We use the test proposed by Li et al. (2009). The null hypothesis states that the two distributions can be considered equal. The results of the test in Table 4 allow us to reject the null hypothesis that each of the components is solely responsible for moving the 1980 distribution to that of 2006. This is confirmed by the fact that the test fails to reject the null hypothesis only when the combined effect of technical change and physical and human capital accumulation are included.

Table 4 Distribution hypothesis tests

5 Conclusions

Using non-parametric methodologies, this paper investigated the role played by efficiency, technology, and physical and human capital as determinants of economic growth in 20 Italian regions. Cobb-Douglas and translog specifications of the production function were statistically rejected by the data. The methodology described by Henderson and Russell (2005) was applied to decompose the labor productivity of Italian regions from 1980 to 2006, and confidence intervals for the indices were obtained through a bootstrap approach. In some instances, the changes were not significant; thus, it is not possible to draw definitive conclusions. Consequently, any findings based on the original point estimates should be employed with caution.

In particular the empirical analysis has shown that, on average, physical and human capital accumulation are the main determinants of labor productivity growth in Italy. However, in the Northern and Central regions, labor productivity growth is driven primarily by physical capital accumulation, whereas in most of the Southern regions, it is driven by human capital accumulation. The analysis of the counterfactual distributions reveals that physical capital accumulation accounts for the increased dispersion of labor productivity. On average, a slow process of technological catch-up between the South and the remaining areas of the country was found, but the analysis of distribution dynamics shows that efficiency change seems to be the driving force behind the change in distribution from unimodal to bimodal (convergence clubs). From a policy perspective, we can argue that future policy interventions devoted to reducing regional gaps should be oriented toward supporting innovation and promoting investment in physical capital in the Southern regions.

Finally, the main limitations of the research are related to the use of a small sample and to the assumption of constant returns to scale, which is required to define Malmquist productivity index and to decompose labor productivity growth.