1 Introduction

In the EU, during the past decades, the issue of regional economic growth and convergence has taken on particular significance. Not only have European policy makers continually highlighted the importance of regional economic growth for closing the ‘competitiveness gap’ with the USA, more recently, with the Europe 2020-strategy, regional economic growth and convergence is increasingly indicated as essential for improving economic and social cohesion in the European Union and, hence, a successful European integration. The reason for this increase in policy focus on stimulating regional economic growth and convergence is the growing awareness among EU policy makers that a lack of convergence in economic productivity between regions may not only jeopardize the economic competitiveness of the EU as a global economic power but also could endanger the internal political, economic and social stability. In particular, as more dispersion in economic productivity and growth causes less developed regions to depend on support of more developed regions, the fear is that the presence of too large dispersions might disturb the balance of political power in the EU.

Together with ESI-Funding, Cohesion and Regional policy investments are key instruments to realize smart, sustainable and inclusive growth. With the European Commission spending roughly one-third of the EU Budget on Cohesion and Regional policy, with investments in the main drivers of economic growth: research and development, innovation, transportation, digital economy, energy security and efficiency, competitiveness of SMEs, and quality and accessibility of education, the question raises whether all these investments are helping EU-regions to grow more quickly. In addition, as the bulk of the financial support of the Cohesion and Regional policy and the ESI-Funds is particularly going towards the less developed Member States and regions, another important query is whether or not this consistent focus on less developed regions has actually benefited convergence and cohesion among EU Member States and regions. In the literature there is still ambiguity on this question with some academics finding positive effects (Kutan and Yigit 2007), others finding mixed effects (e.g. Becker et al. 2010) and other studies finding no considerable effects on cohesion in regional productivity (Delgado-Rodríguez and Álvarez-Ayuso 2008).

The present paper contributes to the literature by providing empirical evidence on regional economic growth and convergence in what has been an important period for the EU, i.e. the period 2000–2011 in which the Lisbon Strategy was rolled out. The paper estimates and analyses the productivity growth of the NUTS 2-regions in the EU over this period thereby using the latest update of regional productivity data as provided by Eurostat. In doing so, the paper attempts to provide answers to several policy-relevant research questions such as: (1) What was the evolution in labour productivity in the period 2000–2011? (2) Was the productivity growth realized by NUTS 2-regions within the same Member State largely similar or was there considerable interregional disparity?; (3) Were the relative contributions of efficiency change, technological change and changes in physical and human capital accumulation to productivity growth more homogeneous or rather heterogeneous across NUTS 2-regions?; and (4) Was there a trend of convergence or rather one of divergence in regional productivity in the EU?Footnote 1

The paper advocates the use of a nonparametric production frontier technique to study regional productivity growth.Footnote 2 The reason is that, as compared to the traditional (parametric) techniques used predominantly in the economic growth and convergence literature (for studies on the EU, examples include Bartkowska and Riedl 2012; Battisti and De Vaio 2008; Juessen 2009; Azomahou et al. 2011), nonparametric frontier methods make fewer and less strict assumptions in the modelling of the production frontier technology. In particular, whereas traditional techniques typically impose a specific “functional form” of the production function [e.g. Cobb-Douglas production function, constant elasticity of substitution (CES) production structure, etc.] that characterizes the existent production technology, nonparametric techniques specify the production function a posteriori from the observed data themselves. Moreover, given that inefficiencies can exist under this nonparametric framework (contrary to most traditional techniques), there is also the possibility to examine how efficiency change contributes to productivity growth. Next to providing an estimate of productivity growth, the nonparametric frontier technique derives estimates on the different sources of productivity growth.

This paper also contributes to the ongoing policy discussions about the economic growth and convergence issue that is at the heart of the EU regional policy agenda. One particular contribution, as discussed in the previous paragraph, is that the paper provides empirical evidence on regional economic growth and convergence in the EU-area as derived from a nonparametric analysis framework. According to the advocates of nonparametric techniques (e.g. Färe et al. 1994; Kumar and Russell 2002; Krüger 2003; Henderson and Russell 2005), this data-driven approach gives a more realistic framework to study productivity growth and convergence. A second contribution is that the paper explores regional differences in productivity and economic growth both within and between countries. This enables to identify whether changes in regional productivity have been relatively “even” or rather “uneven”. Such information is useful for policy makers at the European level as it provides them with an idea on the impact of given regional financial support of the Cohesion and Regional policy and the ESI-Funds. On the other hand, such information is also valuable for national policy makers as it shows which regions require further policy intervention and/or support. Another contribution is that the paper identifies the different components of productivity growth. In particular, the nonparametric frontier technique estimates for each individual region the contributions of efficiency change, technological change, physical capital accumulation or human capital accumulation to economic growth. It is argued that differentiating between the components of productivity growth is important particularly from the perspective of designing effective policies. It helps policy makers to reach informed decisions about which type of policies are most suitable for each region (given that they are typically subject to different economic, social and cultural conditions).

The paper is organized as follows. Section 2 discusses the main outcomes of previous empirical studies on regional productivity growth in the EU. Section 3 describes the nonparametric methodology that is used in the present paper to estimate and analyse regional productivity growth. Section 3 presents the regional (macro)economic data for the NUTS 2-regions in the EU for the period 2000–2011. Section 4 presents the results. Finally, Sect. 5 concludes and formulates several interesting topics for further research.

2 Earlier research on regional growth in EU

Earlier studies on the regional growth in the EU demonstrated the existence of slow growth and a slow process of convergence. Sala-i-Martin and Barro (1991) employed a (parametric) autoregressive model to investigate regional convergence within 8 Northern EU Member States (the six original Member States and Denmark and the UK) using economic data on 73 regions for the period 1950–1985. The estimations revealed a slow process of conditional convergence (speed of convergence of approximately 1.8%). Neven and Gouymte (1995) employed a mix of autoregressive and Markov chain models to examine regional convergence for the period 1975–1990, thereby using data on aggregate output per head for 108 regions for the period 1975–1980 and 142 regions for the period 1980–1990. One consistent finding was that the pattern of convergence differed considerably across subsets of regions and across different periods. Whereas in the early 1980s the pattern among the Northern regions was more one of stagnation or (slight) divergence, the second part of the eighties was more one of convergence. Among the more Southern EU-regions, on the other hand, a trend of convergence was found in the first part of the 1980s followed by a period of stagnation in the second part. De Siano and D’Uva (2006) applied tests on stochastic convergence on regional economic data for 123 regions belonging to 9 EU Member States for the period 1981–2000. The test for stochastic convergence revealed empirical evidence of club convergence with a strong tendency of convergence among the wealthiest regions and a trend of weak convergence among the remaining groups of EU-regions. Using a predictive density approach with roots in Bayesian econometrics, Canova (2004) examined regional economic growth in the EU thereby testing for convergence among 144 European NUTS 2-regions for the period 1980–1992. Results showed evidence for convergence poles characterized by different economic conditions. Crespo Cuaresma et al. (2014) employed a Bayesian Model Averaging approach to explore the determinants of regional economic growth of 255 NUTS 2-regions during the period 1995–2005, thereby accounting for the potential presence of spatial interaction among the regions. Results pointed out the presence of convergence, with an estimate of the speed of convergence of around 2%. However, as indicated by the authors, this estimate is largely influenced by the changes in growth experienced by the regions in the Central and Eastern European Member States. In particular, the growth rates experienced by the regions in the Central and Eastern European Member States were generally higher than the ones realized by the other regions. Beugelsdijk et al. (2015) investigated the within-country differences in economic development across Europe using a development accounting approach. Results indicated, among other things, very large productivity differences between the NUTS 2-regions and the presence of large within-country interregional productivity differences. Azomahou et al. (2011) explored productivity dynamics across EU-regions using a semi-parametric partially linear model. Estimations were made for two sets of NUTS 2-regions, one set including data on 157 regions from 9 EU countries for the period 1990–2005 and another set with data from 255 regions for 20 EU countries for the period 1998–2007. A consistent finding was heterogeneity in the convergence process with a trend of divergence among regions with low initial GDP per capita levels (low-income regions from new EU Member States), a trend of convergence among regions with medium initial levels of GDP per capita levels and no outspoken trend among the richest EU-regions. Estimation results also pointed out country heterogeneity and nonlinearity in the convergence process. Becker et al. (2016) used a regression discontinuity design framework to analyse regional growth in the EU during four consecutive programming periods (1989–1993, 1994–1999, 2000–2006 and 2007–2013) with a special focus on the effects of Objective 1 transfers. Results showed that EU regional transfers did generate additional growth across regions in the EU, the effects being somewhat stronger during the two first periods as compared to the two most recent programming periods.

In sum, a recurrent finding in past studies on regional productivity growth dynamics in the EU was the existence of a slow process of convergence. The estimates of speed of convergence found in these studies were all situated around 1–2% per annum. A second important finding was that the pattern of convergence differed considerably across subsets of regions and across different time periods. Thirdly, as can be noted in the short review of the empirical literature, a large majority of the techniques that have been applied by previous studies are parametric in nature.Footnote 3 As denoted above, particularly problematic to the need of parametric models to presume rather stringent assumptions is that there is typically a lot of uncertainty about the validity of the assumptions and, hence, a potential issue of model misspecification. The remainder of this paper advocates the use of a nonparametric production frontier estimation techniques after Färe et al. (1994), Kumar and Russell (2002), Henderson and Russell (2005), and Badunenko et al. (2013) to estimate and analyse the dynamics in regional productivity growth in the EU. The next section describes this technique in detail.

3 Methodology

3.1 Sequential production frontier technology

The regional productivity is measured using three input variables and one output variable. The output variable \(Y_{j,t}\) is the aggregate output of the region \(j\,(j=1,\ldots ,N)\) in the year \(t\,(t=1,\ldots ,T)\). The three input measures are labour \(L_{j,t}\), physical capital \(K_{j,t}\) and human capital \(H_{j,t}\). Following the recent production frontier studies (e.g. Badunenko et al. 2013) and the (macro)economics literature (see, e.g. Mankiw et al. 1992), the standard assumption is made that human capital augments the physical labour input by a multiplication. This yields human capital augmented labour \(\hat{L}_{j,t} =L_{j,t}H_{j,t}\). It represents the labour input measured in efficiency units for the region j in year t. By this assumption, the three input—one output setting is reduced to a two input—one output setting.

To determine the production frontier technology, an estimation methodology after Charnes et al. (1978) is used. This method is linked to the earlier work of Koopmans (1951), Farrell (1957) and Afriat (1972) on activity analysis and linear programming models for economic analysis. The key feature of this technique is that it estimates the production frontier in a nonparametric way, i.e. without assuming any functional format that characterizes the existent production frontier technology. Instead, this method defines the production frontier in a data-driven manner thereby building upon very general axioms of economic production theory (additivity, no free lunch, free disposability and homogeneity of degree one. For more on the axioms see Fried et al. 2008).

In the present analysis, a sequential version of this nonparametric frontier method as originally introduced by Diewert (1980) is used to specify the production frontier of the NUTS 2-regions. This version was also applied in previous studies examining country-level productivity growth (e.g. Henderson and Russell 2005; Badunenko et al. 2013). This approach constructs the production technology \(\varPsi _t\) in year t using the regional production data for the year t and all the previous years (i.e. \(\tau =1,\ldots ,t)\). This presumes an interdependence between production technology over time with the production technology in year t not only comprising technological innovations that were realized in year t but also the technological knowledge that was already obtained in the past. By this assumption, technological degradation is impossible. More formally, under the assumption of constant returns to scale, the sequential production set, i.e. the set of all feasible input-output combinations, is specified as follows:

$$\begin{aligned} \varPsi _{t} = \left\{ \langle Y,\hat{L},K\rangle \in {\mathfrak {R}}_{+}^{3}\left| \begin{array}{c} Y \le \mathop \sum \limits _{\tau \le t} \mathop \sum \limits _{j = 1}^N {\lambda _{j,\tau }}{Y_{j,\tau }}\\ {\hbox {s.t.}}\\ \hat{L}\ge \mathop \sum \limits _{\tau \le t} \mathop \sum \limits _{j = 1}^N {\lambda _{j,\tau }} {{\hat{L}}_{j,\tau }}\\ K \ge \mathop \sum \limits _{\tau \le t} \mathop \sum \limits _{j = 1}^N {\lambda _{j,\tau }}{K_{j,\tau }}\\ {{\lambda _{j,\tau }} \ge 0}\\ \left( {j = 1, \ldots ,N;\tau = 1, \ldots ,t} \right) \end{array}\right. \right\} \end{aligned}$$
(1)

With \(\lambda _{j,\tau }\) the activity levels indicating how relevant other (observed) regions are for constituting the benchmark against which the efficiency of the region j in year \(\tau \,(\tau =1,\ldots ,t)\) is to be assessed (i.e. the intensity variables from activity analysis used to transform data points in the sample into a production technology set).

3.2 Productivity growth and quadripartite decomposition

The Farrell output-oriented efficiency measure (Farrell 1957) is used to estimate the efficiency of region j in realizing aggregate output using the disposable inputs and the sequential production technology \(\varPsi _t\). For region j in year t, this output efficiency measure is obtained as follows:

$$\begin{aligned} e_{j,t} = \min \left\{ {\lambda \hbox {|}\langle Y_{j,t} \hbox {|}\lambda ,\hat{L}_{j,t} ,K_{j,t}\rangle \in {\varPsi }_t} \right\} \end{aligned}$$
(2)

The inverse of \(e_{j,t}\) specifies the maximal proportional expansion possible in the aggregate output of region j in year t given the input levels in year t and the, at that moment, available production technology \(\varPsi _t\). The output efficiency measure can take values lower than or equal to one. A value lower than one indicates that region j can expand its aggregate output given the used inputs and available production technology. The value also specifies the proportion with which region j should expand the aggregate output in order to become positioned at the production frontier. A value for the output efficiency measure equal to one specifies that region j is positioned on the production frontier and, hence, realizes the benchmark level of aggregate output that is technologically feasible given the employed inputs and the available production technology.

In the setting of estimating regional productivity change, regional output efficiency measures can be determined relative to the sequential production frontiers of the base period and the current period (i.e. \(e_{j,b}\) and \(e_{j,c})\). Both efficiency measures enable defining the benchmark levels for the aggregate output of region j for the respective periods: \(\bar{Y}_{j,b} =Y_{j,b} /e_{j,b}\) and \(\bar{Y}_{j,c} =Y_{j,c} /e_{j,c}\). Taking the ratio of the output efficiency measures and the associated benchmark levels for the aggregate output of region j for the current and base periods yields the productivity growth measure as in (3):

$$\begin{aligned} \frac{Y_{j,c} }{Y_{j,b} }=\frac{e_{j,c} }{e_{j,b}}\frac{\bar{Y}_{j,c}}{\bar{Y}_{j,b} } \end{aligned}$$
(3)

In order to get a better idea on the different components of productivity growth, several transformations of the input and output data are implemented. Firstly, the aggregate output per unit of physical labour (i.e. average labour productivity) using base period and current period data is defined as \(y_{j,c} =Y_{j,c} /L_{j,c}\) and \(y_{j,b} =Y_{j,b} /L_{j,b}\). Secondly, the aggregate output and physical capital levels per unit of human capital augmented labour for the region j in the current period and base period are specified as \(\hat{y}_{j,c} =Y_{j,c} /\hat{L}_{j,c}, \hat{y}_{j,b} =Y_{j,b} /\hat{L}_{j,b}, \hat{k}_{j,c} =K_{j,c} /\hat{L}_{j,c}\) and \(\hat{k}_{j,b} =K_{j,b} /\hat{L}_{j,b}\). Using \(\hat{y}_{j,c}, \hat{y}_{j,b}\), \(\hat{k}_{j,c}\), and \(\hat{k}_{j,b}\), the benchmark aggregate output level for the region j per human capital augmented unit of labour can be determined for the base period and current period data, i.e. \(\bar{y}_{j,b} \left( {\hat{k}_{j,b}} \right) =\hat{y}_{j,b} /e_{j,b}\) and \(\bar{y}_{j,c} \left( {\hat{k}_{j,c}}\right) =\hat{y}_{j,c} /e_{j,c}\). In addition, ratios of the physical capital in the current period (base period) to units of human capital augmented labour in the current period (base period) determined under the counterfactual presumption that human capital is of the level such as in the base period (current period), i.e. \(\tilde{k}_{j,c} =K_{j,c} /\left( {L_{j,c} H_{j,b} } \right) \) and \(\tilde{k}_{j,b} =K_{j,b} /\left( {L_{j,b} H_{j,c}}\right) \). Using these adjusted versions of physical capital, alternative definitions of the benchmark aggregate output levels for country j in the base period and current period can be specified: \(\bar{y}_{j,b} \left( {\tilde{k}_{j,b}}\right) \) and \(\bar{y}_{j,c} \left( {\tilde{k}_{j,c}} \right) \). Implementing the aforementioned transformations in the productivity growth ratio of region j as in (3) gives two possible decompositions, i.e. (4a) and (4b):

$$\begin{aligned} \begin{aligned}&\displaystyle \frac{y_{j,c} }{y_{j,b} }=\left( {\frac{e_{j,c} }{e_{j,b}}} \right) \times \left( {\frac{\bar{y}_{j,c} \left( {\hat{k}_{j,c}}\right) }{\bar{y}_{j,b} \left( {\hat{k}_{j,c}}\right) }} \right) \times \left( {\frac{\bar{y}_{j,b} \left( {\tilde{k}_{j,c} } \right) }{\bar{y}_{j,b} \left( {\hat{k}_{j,b} } \right) }} \right) \times \left[ \frac{\bar{y}_{j,b}\left( {\hat{k}_{j,c}} \right) }{\bar{y}_{j,b} \left( {\tilde{k}_{j,c}}\right) }. \frac{H_c }{H_b } \right] \\&\qquad \qquad \qquad \displaystyle \frac{y_{j,c} }{y_{j,b}}=\hbox {EFF}_j \times \hbox {TECH}_j^c \times \hbox {KACC}_j^b \times \hbox {HACC}_j^b \end{aligned} \end{aligned}$$
(4a)

and

$$\begin{aligned} \begin{aligned}&\displaystyle \frac{y_{j,c} }{y_{j,b} }=\left( {\frac{e_{j,c} }{e_{j,b} }} \right) \times \left( {\frac{\bar{y}_{j,c} \left( {\hat{k}_{j,b} } \right) }{\bar{y}_{j,b} \left( {\hat{k}_{j,b} } \right) }} \right) \times \left( \frac{\bar{y}_{j,c} \left( {\hat{k}_{j,c}} \right) }{\bar{y}_{j,c}\left( \tilde{k}_{j,b}\right) } \right) \times \left[ {\frac{\hat{y}_{j,c} \left( {\tilde{k}_{j,b} } \right) }{\bar{y}_{j,c} \left( {\hat{k}_{j,b} } \right) }.\frac{H_c }{H_b }} \right] \\&\qquad \qquad \qquad \displaystyle \frac{y_{j,c} }{y_{j,b}}=\hbox {EFF}_j \times \hbox {TECH}_j^b \times \hbox {KACC}_j^c \times \hbox {HACC}_j^c \end{aligned} \end{aligned}$$
(4b)

The first term in the decomposition is the efficiency change \((\hbox {EFF}_j)\)-component for region j. It measures the progress (regress) in the productivity growth realized by the region’s own idiosyncratic performance. More precisely, this component measures whether the region j was able to catch-up (or whether it fell back) to the productivity frontier. An \(\hbox {EFF}_j\)-value higher than one indicates that region j was able to catch-up with the productivity frontier. An \(\hbox {EFF}_j\)-value lower than one denotes that region j fell back to the productivity frontier. A value of unity indicates the status quo, i.e. no change in the productivity of region j vis-à-vis the benchmark level of productivity as determined by the productivity frontier. The second component in the decomposition is the technological change \((\hbox {TECH}_j)\)-component for region j. This component captures the diffusion of production technology and innovation among EU-regions. In the present analysis, \(\hbox {TECH}_j\)-values can be higher than or equal to one. A \(\hbox {TECH}_j\)-value higher than one reflects a shift outward of the productivity frontier suggesting an improvement in the available production technology during the period of study enabling higher productivity levels, all else equal. A \(\hbox {TECH}_j\)-value equal to one reflects that the productivity frontier remained unchanged. The third component is the physical capital accumulation component for region \(j\,(\hbox {KACC}_j)\). It measures the effect of physical capital accumulation, respectively, along the productivity frontier of the base period in (4a) and the current period in (4b). Somewhat similarly the fourth component in (4a) and (4b) measures human capital accumulation for region \(j\,(\hbox {HACC}_j)\) along the productivity frontier of the base period and the current period.

Given that the two productivity change measures as in (4a) and (4b) will usually have different values, to avoid any arbitrariness in the computations, Färe et al. (1994) suggested to calculate the productivity change as the geometric mean of the measures to obtain (5) (also known as the “Fisher ideal” index). The interpretation of the components and the component values remains unaltered by this adjustment.

$$\begin{aligned} \frac{y_{j,c} }{y_{j,b} }= & {} \hbox {EFF}_j \times \left[ {\hbox {TECH}_j^b \cdot \hbox {TECH}_j^c} \right] ^{1/2}\nonumber \\&\times \left[ {\hbox {KACC}_j^b \cdot \hbox {KACC}_j^c } \right] ^{1/2}\times \left[ {\hbox {HACC}_j^b \cdot \hbox {HACC}_j^c } \right] ^{1/2} \end{aligned}$$
(5)

Before concluding, note that, in the estimations, a robust (order-m) version of the nonparametric frontier estimation technique is used (Cazals et al. 2002). The reason is that nonparametric frontier estimation techniques such as the one presented above are deterministic in nature, which means that the position and the shape of the estimated production frontier and, hence, the estimations and decompositions of the regional productivity growth rates are sensitive to regions with macroeconomic data that are atypical and/or outlying. The essential idea of the robust approach is to develop a robust specification of the production frontier under a simple bootstrapping framework in which a high number of runs are performed (in casu 1000), in each of which a sub-sample of regions (randomly and i.i.d. drawn from the full sample) are used in the specification of the production frontier. As regions with atypical and/or outlying macroeconomic data do not form part of the sub-sample in every draw, the impact of such regions on the estimations of the quadripartite decomposition of productivity growth is effectively mitigated.

3.3 Multimodality and component analysis

Multimodality or the existence of several peaks in the distribution of the regional productivity can be seen as an indication of the existence of regional heterogeneity (Badunenko et al. 2013). To determine the modality of the distributions of the aggregate output per worker as in 2000 and 2011 as well as the counterfactual distributions of the aggregate output per worker as constructed by a stepwise introduction of the four different productivity growth components or combinations of these components, nonparametric multimodality tests are performed by the bootstrapped Silverman test for multimodality (Silverman 1981; Hall and York 2001). The null hypothesis in the Silverman test is that the distribution of the output per worker has at most \(\gamma \) modes (\({H}_{0}\) : number of modes \(\le \gamma )\). The alternative hypothesis is that this distribution has more than \(\gamma \) modes (\({H}_{1}\) : number of modes \(>\gamma )\). The test uses 10,000 bootstrap replications to compute the p value as the frequency that the critical bandwidth of the bootstrap sample is larger than the critical bandwidth of the given data (which suggests that sample data have more than \(\gamma \) modes). If the returned p value is smaller than the given level of significance (i.e. 0.05) the null hypothesis is rejected and the alternative hypothesis is adopted.

We also test for the statistical significance of the contributions of the different individual components of productivity growth or combinations of these components to the convergence in regional productivity between 2000 and 2011. In particular, we use the nonparametric test of Fan and Ullah (1999) (after a test initially proposed by Li 1996) to compare two unknown distributions, i.e. the estimated and counterfactual distributions. The null hypothesis is that the estimated distribution f(.) and the counterfactual distribution g(.) are equal, i.e. \(H_0 :f(.)=g(.)\). The alternative hypothesis is that the estimated and counterfactual distributions are statistically significantly different, \(H_1:f(.)=g(.)\). The idea is that a confirmation of the \(H_0\) denotes that the relative contribution of the included productivity change component is not statistically significant. The opposite holds for when the test rejects the \(H_0\). In that case, the estimated and counterfactual distributions do differ statistically significantly which implies that the relative contribution of the productivity change component to the overall productivity change is statistically significant (for more on formalities of the test, the interested reader is referred to Fan and Ullah 1999; Badunenko et al. 2008).

4 Regional (macro)economic data

The data for the three macroeconomic input variables and the aggregate output for the period 2000–2011 are collected from the Regional Database published by Eurostat. The definition of the NUTS 2-regions is according to the Nomenclature of Territorial Units for Statistics classification system of the European Parliament. In the NUTS 2-classification system there are six countries which have no underlying NUTS 2-regions due to their limited size: Lithuania, Malta, Cyprus, Latvia, Estonia and Luxembourg. For these countries, the national data are used. The final sample set includes 336 NUTS 2-regions (i.e. \(N=336\)).

The aggregate output (\(Y_{j,t}\)) is measured by the regional gross domestic product (GDP) at constant market prices, in million euro PPS. The input labour\((L_{j,t})\) is measured by the variable “economically active population by highest level of education attained”. Data on the physical capital stock\((K_{j,t})\) are not available in the Regional Database of Eurostat. Estimations for the physical capital stock are therefore obtained from an application of the perpetual inventory method (PIM) (Feldstein and Foot 1971; Epstein and Denny 1980) using regional investment data (available in the Regional Database). The PIM-formulation is as follows: \(K_{j,t} =I_{j,t} +\left( {1-\delta } \right) K_{j,t-1}\), where \(K_{j,t}\) and \(K_{j,t-1}\) are the gross capital stock of region j in year t and \(t-1; I_{j,t}\) is the gross fixed capital formation in region j between year \(t-1\) and year t, and \(\delta \) represents the depreciation rate of the physical capital stock (here \(\delta \) is set equal to 6% similar to in most previous studies, e.g. Badunenko et al. 2013). In the computations, the regional gross fixed capital formation was converted into constant prices PPS using a (country-level) GDP deflator and a relative price factor (PPS) as obtained through a comparison of regional GDP in current and constant prices. The PIM-method requires an initial value of the physical capital stock \((K_{j,0})\) in order to compute the time series of regional physical capital stock. To estimate this starting value, following the approach of Beugelsdijk et al. (2015), regional real investment data are backwardly extrapolated up to the starting year of the time series (i.e. the year 1970) so as to derive an estimate of the regional physical capital stock in that year. Using this estimate of \(K_{j,0}\), time series data of the physical capital stock for the regions are developed from which the data for the period 2000–2011 are used in the present study (for more details, see Beugelsdijk et al. 2015, pp. 8–9).

To estimate the human capital of the labour force, data on the distribution of the region population by education attainment are used (ISCED 0–2: pre-primary, primary, lower secondary education; ISCED 3–4: upper secondary, post-secondary non-tertiary education; ISCED 5–6: 1st and 2nd stage of tertiary education). The actual adjustment of the labour force for human capital is performed according to Hall and Jones (1999). In particular, human capital augmented labour is estimated by \(\hat{L} _{j,t} =L_{j,t} \cdot H_{j,t} =L_{j,t} \cdot \hbox {exp}^{\phi \left( {\varepsilon _{j,t} } \right) }\), with \(\phi (.)\) being a piecewise linear function with zero intercept and slopes equal to 0.134, 0.101 and 0.068 for human capital obtained, respectively, during the first 4 years of schooling, the next 4 years and education years beyond the 8th year (for more details, see Beugelsdijk et al. 2015).

Figure 1 displays the production technology frontiers as computed by the above discussed sequential production frontier approach for the years 2000 and 2011 in a two-dimensional \(\langle \hat{y},\hat{k}\rangle \)-space, which is readily derived from the initial three-dimensional \(\langle Y,\hat{L},K\rangle \)-space by the following transformations: \(\hat{y}=Y/\hat{L}\) and \(\hat{k}=K/\hat{L}\). The figure shows only minor differences in the shape and position of the frontiers in 2000 and 2011, with differences being situated at the higher levels of capital per efficiency unit of labour. This finding of technical change occurring mostly at higher levels of capitalization (non-neutrality of technological change) confirms what was found by previous studies for country-level data (Henderson and Russell 2005; Badunenko et al. 2013). A detailed study of Fig. 1 also shows that both the position and the shape of the production frontier is predominantly determined by three atypical/outlying regions, i.e. Région de Bruxelles-Capitale, Luxembourg, and Inner London. With very high levels of aggregate output per efficiency unit of labour these three regions determine the upper part of the production frontiers in both 2000 and 2011. The presence of these three regions is a prima facie reason for using a robust version of the production frontier technique.

Fig. 1
figure 1

Best practice EU production technology frontier 2000 versus 2011 (sequential version)

5 Empirical findings

Figure 2 displays the kernel distributions of the regional productivity and efficiency levels in the years 2000 (black solid curve) and 2011 (grey dashed curve). The vertical black solid line and grey dashed line in the plots represent the median regional productivity and efficiency values in the respective years. Some interesting preliminary findings emerge from these plots. Firstly, the plot in the left part of Fig. 2 clearly shows that the distributions of regional productivity as in 2000 and 2011 are multimodal (peaks at levels of output per worker around 30,000 and 55,000) with some differences in the shape of the distributions and in both cases a long right-hand tail. Secondly, the median values of the regional productivity in 2000 and 2011 indicate a very small decline to almost no change during the period of study. Thirdly, the plot in the right part of Fig. 2 shows that the kernel distribution of the regional efficiency as in 2000 is more situated to the right and hence at higher levels of efficiency as compared to the distribution in 2011. In addition, the median value of the regional efficiency in 2000 is considerably higher than the regional efficiency in 2011. This suggests that there has been a general decline in regional efficiency during the first decade of the twenty-first century.

Fig. 2
figure 2

Kernel distributions of regional productivity and efficiency in 2000 versus 2011

Table 1 Productivity growth and components (percentage changes)

The mean values of the regional productivity growth rates and the four components per EU Member State are listed in Table 1.Footnote 4 Several interesting findings emerge from this table. Firstly, the mean values for all EU-regions listed at the bottom of Table 1 show that, on average, regional productivity slightly increased (\(+\) 2.79%) in the EU over the period 2000–2011. Results suggest that this increase was mainly because of positive changes in physical capital and human capital accumulation (\(+\) 14.89 and \(+\) 6.85%) in the regions outbalancing the negative regional efficiency change (− 14.66%). Overall, technological change was negligible (− 0.05%). A second interesting finding is that several Member States saw all of their regions realizing positive productivity growth rates over the period 2000–2011. Examples include the Czech Republic, Denmark, Germany, Croatia, Lithuania, Estonia, Slovakia and Finland. The regions in Germany, for instance, realized growth rates ranging from a minimum of − 7.57% to a maximum of \(+\) 27.39%. Thirdly, the mean values for the productivity change index, (\(\hbox {TFP}-1\)) \(\times \) 100, indicate regress in productivity for the majority of the regions in Greece (− 10.16%), Italy (− 12.86%), Spain (− 8.74%), UK (− 12.05%), Hungary (− 12.89%) and Romania (− 44.47%) in the period 2000–2011. A possible explanation for this finding is that these Member States were hit comparatively hard by the consequences of the economic downturn in the second part of the period (as pointed out by the Sixth Cohesion Report, European Commission 2014). A fourth interesting finding is that several old EU Member States (e.g. Belgium, Denmark, Greece, Spain) and new EU Member States (Bulgaria, Hungary, Romania) saw all regions experiencing negative efficiency growth (i.e. \(\left( {\hbox {EFF}-1} \right) \times 100<0)\) in the period 2000–2011. Only Germany (\(+\) 2.46%), Croatia (\(+\) 4.02%) and the Czech Republic (\(+\) 4.68%) realized on average positive regional efficiency change. This finding of decline in efficiency in the large majority of the regions during the first decade of the twenty first century seems somewhat at odds with the typical mantra of European economic integration being advantageous in terms of higher efficiency and increased competition. It is not straightforward to give a coherent explanation for this finding. However, a plausible explanation is the negative impact of the financial and economic crisis with a decline in aggregate demand and supply (i.e. lower regional aggregate output levels) and, by result, a regress in the production efficiency. A fifth interesting result is that technological change contributed almost not (i.e. \(\left( {\hbox {EFF}-1} \right) \times 100\approx 0)\) to regional productivity growth over the period 2000–2011. Part of the explanation may well be related to the negative impact of the crisis in Europe with adopted (austerity) policies in most EU Member States placing more pressure on national and regional budgets, thereby limiting funding availability across all investment areas. The Sixth Report on Economic, Social and Territorial Cohesion (European Commission 2014) speaks of a decline of roughly 20% in public investment between 2008 and 2013 for the EU as a whole and of around 60% for the Southern European Member States (e.g. Greece and Spain). Finally, overall, physical capital and human capital accumulation contributed positively to regional productivity growth (mean values of \(\left( {\hbox {KACC}-1} \right) \times 100\) and \(\left( {\hbox {HACC}-1} \right) \times 100\) equal to \(+\) 14.89 and \(+\) 6.85%). As to the magnitude of these positive contributions, results suggest important differences between the regions in the old and new EU Member States. More precisely, as for the contribution of physical capital accumulation to regional productivity growth, the mean change values of, respectively, \(+\) 8.04 and \(+\) 21.74% for the regions in the old and the new EU Member States clearly indicate a more significant contribution in the new EU Member States. Particularly remarkable are the strong contributions of the physical capital accumulation component in Bulgaria (\(+\) 58.15%) and Lithuania (\(+\) 67.37%). As for the human capital accumulation component (i.e. column “\(\left( {\hbox {HACC}-1} \right) \times 100\)” in Table 1), results indicate a generally positive contribution to regional productivity growth in all old EU Member States and all but two new EU Member States (exceptions are Croatia and Lithuania). Note, however, that there are considerable differences across regions and/or Member States, with Member States such as Ireland (\(+\) 16.68%), Luxembourg (\(+\) 23.61%), Spain (\(+\) 16.97%) and Malta (\(+\) 14.55%) showing high positive contributions and regions in Member States such as Denmark (\(+\) 0.19%), Latvia (\(+\) 2.30%) and Romania (\(+\) 1.72%) experiencing on average only low contributions of human capital accumulation. As a possible explanation, Crespo Cuaresma et al. (2014) referred to the large variation in education systems across Member States and regions.

Fig. 3
figure 3

Productivity in 2000 versus TFP-component indices (robust versions)

Figure 3 visualizes the relationship between the initial regional productivity levels (i.e. output per worker in 2000) and the four productivity growth components. The plots reveal that all but one components relate statistically significantly to the initial regional productivity level. For the period 2000–2011, overall, technological change and physical capital accumulation are found to relate statistically significantly negatively (i.e. p values for the slope coefficients of the fitted regression lines of, respectively, 0.044 and 0.000) and human capital accumulation statistically significantly positively (\(p\,\hbox {value} = 0.000\)) to the regional productivity as in 2000. Only the efficiency change component is found to relate statistically insignificantly (\(p\,\hbox {value}\) of 0.614) to the regional productivity as in 2000. The positive changes in efficiency experienced by some regions more or less outbalanced the negative changes in efficiency seen in other regions. As to the finding of significant negative relationship for the technology change component, a brief look at the \(\hbox {TECH}\)-index values on the vertical axis reveals that although statistically significant, the relationship of technological change to initial regional productivity is trivial. The finding of a negative association between the contribution of physical capital accumulation to productivity growth and the initial level of regional productivity indicates that, in general, physical capital accumulation contributed more strongly to productivity growth for regions with lower initial levels of aggregate output per worker. An opposite pattern is found for the relation between human capital accumulation and initial regional productivity, i.e. regions that were more productive in 2000 having benefited more from human capital accumulation as compared to the regions that were initially less productivity. Overall, this finding suggests that physical capital accumulation advanced and human capital accumulation hampered convergence among regions over the period 2000–2011, with initially less productive regions having benefited more from physical capital accumulation, yet, less from human capital accumulation as compared to the regions that were more productive in 2000.

Table 2 Modality tests of the counterfactual distributions of output per worker (by stepwise introduction of TFP-components) (bootstrapping with 10,000 replications)
Table 3 Distribution hypothesis test (bootstrapping with 10,000 replications)

As to the multimodality of the distributions of regional productivity and the relation to the productivity components, the outcomes of the bootstrapped Silverman’s tests are reported in Table 2. For the actual and counterfactual distributions three Silverman tests were computed by setting \(\gamma \)-values equal to, respectively, 1, 2 and 3. The test results demonstrate that the distributions of regional productivity as in 2000 and 2011 are multimodal with the distribution in 2000 displaying 3 modes and the distribution in 2011 showing 2 modes. This result confirms to some extent what was found by previous studies (Gitto and Mancuso 2015; De Siano and D’Uva 2006), i.e. the shape of the kernel distribution of the regional productivity having multiple modes. As indicated in the discussion of Fig. 2, the two consistent peaks in the distribution of regional productivity are situated at levels of output per worker around 30,000 and 55,000. A screening of the outcomes of the modality tests for the different counterfactual distributions in Table 2 indicates that the introduction of the technological change and the physical capital accumulation component, individually or in combination, does not significantly impact the modality of the distribution of regional productivity. The shift in modality in the distribution of regional productivity from 2000 to 2011 is predominantly because of the human capital accumulation component.

Table 3 displays the outcomes of the nonparametric method of Fan and Ullah (1999) to test for the statistical significance of the contributions of the four components of productivity growth and combinations of these components to mean preserving changes in the distribution of regional productivity. The first test in Table 3 indicates that the actual distributions of aggregate output per worker as in 2000 and 2011 differ statistically significantly (p value < 0.05). This finding raises the question of what productivity components contributed significantly to the change in the distribution of aggregate output per worker over the period 2000–2011. With p values near to zero the test outcomes reject the notion of efficiency change, technological change and physical capital accumulation being individually responsible for moving from the distribution in regional productivity as in 2000 to 2011. The exception is the human capital accumulation component. With a p value of 0.079 for the counterfactual productivity distribution accounting solely for the human capital accumulation and of 0.225, 0.078, 0.226 and 0.120 for the counterfactual productivity distributions with combinations of productivity components including the human capital accumulation component, the test outcomes suggest that the accumulation in human capital contributed considerably to the change in the distribution of regional productivity during the period 2000–2011.Footnote 5

6 Concluding remarks and suggestions for further research

The paper examined multiple policy-relevant research questions: (1) What was the evolution in labour productivity in the period 2000–2011?; (2) Was the productivity growth realized by NUTS 2-regions within the same Member State largely similar or was there considerable interregional disparity?; (3) Were the relative contributions of efficiency change, technological change and changes in physical and human capital accumulation to productivity growth more homogeneous or rather heterogeneous across NUTS 2-regions?; and (4) Was there a trend of convergence or rather one of divergence in regional productivity in the EU during the period 2000–2011. The results of the analyses suggest the following answers to the questions raised.

  1. 1.

    As to the first research question, results suggest that in spite of recent European programs such as the Lisbon Agenda and ESI-Funding, regional productivity growth has been relatively limited for the whole EU. This finding of limited growth confirms what was found by other studies for previous periods (Sala-i-Martin and Barro 1991). Results also indicated that changes in regional productivity have been rather “uneven”. As to the old EU Member States, most regions in Denmark, Finland, Sweden, Germany, the Netherlands and Belgium experienced positive productivity growth whereas regions in the Southern EU Member States (i.e. Italy, Greece, Spain) mostly experienced regress in productivity. In the new EU Member States, a lot of regions in Croatia, the Czech Republic, Lithuania, Poland and Slovenia realized high to very high progress in productivity which enabled these regions to converge towards the regions in the North of Europe. On the other hand, most regions in Malta, Hungary, Bulgaria and Romania saw productivity declining in the period 2000–2011.

  2. 2.

    Detailed analyses of the regional productivity growth rates show considerable differences in the economic development and productivity of regions both between as well as within EU Member States. Based on the results, a possible explanation for the high within-country and between-country regional differences in productivity and economic development is the existence of interregional differences in the available mix of production factors (labour, physical capital and human capital).

  3. 3.

    The quadripartite decomposition of the regional productivity growth rates indicated that, in general, changes in efficiency predominantly deteriorated regional productivity, technological change had little or no effect on regional productivity, and changes in physical and human capital accumulation contributed positively to regional productivity growth. One possible explanation for the insignificant contribution of technological change could be the impact of the financial and economic crisis with investments in innovation slowing down in almost all EU Member States since 2008. Archibugi and Filippetti (2011), for instance, demonstrated that whereas there was a trend of progress and convergence in innovative potential in the EU during the years leading up to the economic crisis, innovative investments have slowed down in almost all EU Member States since 2008, with the negative impact on innovative investment particularly affecting catch-up Member States and regions. The net result may well be stagnation of technological contribution to productivity growth for the whole EU over the period 2000–2011.

  4. 4.

    The negative and nonsignificant contribution of, respectively, efficiency change and technological change to regional productivity growth was more or less homogeneous across regions. The positive contributions of changes in physical and human capital accumulation to regional productivity growth were also observed among most regions, though, with important interregional differences in magnitude of the contributions. In particular, changes in physical capital accumulation were found to benefit principally the productivity of the regions that were least productive in 2000. On the other hand, changes in human capital accumulation appeared to be an important driving force behind the evolution in regional productivity in the regions that were initially among the most productive (particularly regions located in the centre of Germany, Benelux countries and Scandinavia as well as Southern regions in the UK).

  5. 5.

    As to the question of whether there was convergence, the finding of multimodality in the distribution of regional productivity provides supports to the notion of club convergence (Galor 1996) in which groups of regions are evolving towards multiple equilibria, i.e. rich and developed regions (mostly regions in Denmark, Finland, Sweden, Germany, the Netherlands and Belgium) evolving towards a high productivity equilibrium and poor and less developed (mostly regions in Southern EU Member States and some of the Central and Eastern EU Member States) regions evolving towards an equilibrium of low productivity. Overall, the findings confirm what was found by previous (parametric) studies (Quah 1996; Crespo Cuaresma et al. 2014; De Siano and D’Uva 2006), i.e. a process of slow yet robust convergence across regions in Europe and the existence of convergence poles at the regional level in Europe.

Nevertheless, some caveats to the present study need to be emphasized. First, an important question is whether the assumption upheld in the present study of the technology frontier being uniform for all EU NUTS 2-regions is valid and realistic. Several theoretical papers such as Acemoglu and Zilibotti (2001) and Basu and Weil (1998) have raised doubts with the assumption of production technology being more or less homogeneous across economies. Their reasoning is that the technology frontier is not uniform in the sense that not all existing production technologies are equally well-matched to every economy. Basu and Weil (1998), for instance, argued that the implementation of new innovative technology is typically capital-intensive which makes it often difficult for economies with low capital endowments to adopt new technology. In addition, several empirical studies demonstrated that background conditions may hamper or benefit regional productivity. Examples of background conditions include the presence of a capital city (Crespo Cuaresma et al. 2014), local institutions (Acemoglu and Dell 2010), physical location (Quah 1996), etc. All of this suggests that the assumption of the production technology frontier being uniform across (groups of) regions in the EU should at least be further examined and tested.

Another caveat concerns the possible spatial correlation in the regional economic data (due to for instance knowledge spillovers or national government policies). As noted by Crespo Cuaresma et al. (2014), the issue of spatial correlation is much more pervasive in models of regional economic growth than in studies of country-level economic growth. An interesting topic for further research would be to examine productivity growth among European NUTS 2-regions using nonparametric production frontier techniques similar to the one in the present paper, thereby accounting for the spatial correlation between the regions.