1 Introduction

We exploit recent results on panel cointegration analysis to empirically study the determinants of (per-capita) health care expenditure (HCE) in the Italian regions in both the long and short term. We show that the long-term behaviour of HCE is mainly explained by GDP, whereas political and institutional variables shape the regions’ HCE dynamics in the short term.

The relationship between HCE and GDP has been the subject of a large portion of literature in health economics. Many early contributions employed cross sectional data to obtain estimates of this relationship, amongst others Gerdtham–Jönsson (2000), Gerdtham–Lothgren (2000), for OECD countries; Giannoni–Hitiris (2002), for the Italian case. Without exception, it has been found that most of the observed variations in HCE can be (also) explained by variation in GDP. Culyer (1988) noticed that these models are probably mis-specified, because they do not consider the public budget mechanism used to finance health care.Footnote 1 In explaining the evolution of health care, issues relating to budgetary processes and institutions, including the current decentralisation process in Europe (in countries such as France, Spain, Belgium) should, however, be accompanied by a careful analysis of what the data are actually saying.

With this purpose, we first consider the behaviour of the HCE and GDP (per capita), both at constant 2005 pricesFootnote 2 and transformed into logarithms, in the 20 Italian regions, from 1982 to 2009. The visual inspection (Fig. 1) shows an impressively similar behaviour of HCE and GDP which, like many other economic variables, exhibit strong comovements across Italian regions. Moreover, the Granger causality test on the considered variables rejects the hypothesis that GDP does not Granger cause HCE (see “Appendix 1”). This seems to support the view that GDP is actually driving HCE. Apparently, after the health care reform of 1978, Italy established a health care system based on the economic development of the country with a continuous upward trend observed in public HCE.

Fig. 1
figure 1

Behaviour of per capita HCE and GDP in the Italian regions (1982–2009)

In contexts like these, resorting to panel data offers a number of advantages for the analysis. In this respect, a number of articles studying the behaviour of HCE in OECD countries support the view of a positive association between HCE and GDP.Footnote 3 Here on the basis of a newly built panel data set consisting of all the 20 Italian regions and covering the period 1982–2009, we verify the hypothesis of cointegration between HCE and GDP.Footnote 4 We show the existence of a long-run equilibrium relationship between HCE and GDP.

In addition, we model the short-term behaviour of HCE. In this respect, we postulate the presence of short-term factors that might affect the HCE annual dynamics. In particular, we investigate whether those variables that have been traditionally credited to influence the HCE can find room in a short-term relationship. In modelling the short term, we refer to those theories explaining the peculiarity of the growth of the HCE as depending on demand and supply factors as well as on economic, political and institutional features of industrialised countries. On the one side, we refer to the body of empirical literature examining the traditional determinants of HCE, namely: per-capita GDP, proportion of population over 65 and under 5, migration rate, public finance share of health care spending, urbanization, unemployment rate, activity rate and number of practising physicians per capita (see, for example, Newhouse 1977; Leu 1986; Gerdtham et al. 1992; Hitiris–Posnett 1992; Gerdtham et al. 1998; Barros 1998; Di Matteo–Di Matteo 1998; Crivelli et al. 2006). On the other side, we refer to the recent debate in a political economy approach relating public expenditure growth to the role played by political and institutional elements as determinants (principal or secondary) of public expenditure (see Persson–Tabellini 1999, 2000, 2001, 2005; Milesi-Ferretti et al. 2002). These elements are the ones more frequently studied in the most recent (empirical and theoretical) literature, with different approaches to the context of the HCE. For example, Crivelli et al.(2006) apply the determinants approach to the Swiss cantons considering among the independent variables the cantonal index for direct democracy discussed by Trechsel–Serdult (1999) and Frey–Stutzer (2000); whereas Vatter–Ruefli (2003) present a comparative study of Swiss cantons investigating the role of specific political factors on HCE. In this paper we want to explore whether any role has been played by the factors mentioned above on choices relating to the health care system in Italy, at least in the short term.

Our finding seems to support those theories that consider public HCE growth inevitable, as it is determined by changes in the economic and social structure. On the one side, the so called “Wagner’s law”Footnote 5 seems to be confirmed in that economic and social progress implies an increase of government expenditures (here for health) proportional to the national product.Footnote 6 We refer to the literature originated by Wagner hypothesis—for example, the works by Peacock–Wiseman (1961), Timm (1961), Gupta (1967), Pryor (1969), Musgrave (1969), Alt (1980), Mann (1980) and Henrekson (1993), see the discussion below—and also to the health care literature, for example Newhouse (1992), explaining the raise in health spending with the relevance of technological progress, and the adoption of new, more expensive medical treatments. On the other side, our empirical results also confirm the view (Hall–Jones 2004) that health care is a superior good because as individuals get richer they choose to spend a larger proportion of their income on health care, with policy-makers merely adjusting public health spending to individual preferences, likely for electoral reasons.

The main implication of our finding is in term of fiscal sustainability. Indeed, the public debt in Italy stands at about 133 % of GDP (est. 2013), and any further increases of current expenditures would imply that future government spending may be mainly directed toward debt servicing.

In Sect. 2 we briefly describe the Italian health care system. In Sect. 3 we assess the model and depict the data. Section 4 reports the empirical results. Conclusions follow in Sect. 5.

2 The Italian National Health Care System

The Italian National Health Care System (NHS), founded in 1978, is a universal health care system that provides comprehensive health insurance coverage and uniform health benefits to the whole population, subject to user charges for certain services. The system, now organized according to three-level government structure (national, regional, and municipal), was originally highly centralized. An increased degree of decentralization for hospital care had already begun in 1972 when the administrative functions for hospital care assistance were transferred to regions with an ordinary bylaw. The state was left with residual competences, mainly related to fund rising. In 1978 an extended regionalization (and a strengthened role of municipalities) of the expenditures responsibilities became a key feature of the whole Italian health care system characterised as follows. Primary care in Italy is provided mostly by general practitioners who refer patients to specialist care and hospitals on a regional basis. They are independent self-employed practitioners paid on a capitation basis.Footnote 7 Ambulatory treatments, diagnostic and laboratory tests, specialist care, drugs, medical appliances and glasses are co-paid by patients at a price called ticket, determined according to income, age, health conditions and other individual characteristics. It is also possible for the regional government to outsource the delivery of medical health services. As a result, part of health care services is currently provided by accredited private units, with patients free to choose between public or private providers.Footnote 8

As mentioned, since 1978, the amount of funds financing the NHS has been fixed by the central government which determines the resources available for HCEs on a regional basis. The separation of revenue raising responsibilities of the central government and expenditure responsibilities of the regional governments, resulted in continuous increases of the Italian public HCEs, with Regions regularly running budget deficits determining large regional debts, subsequently repaid by the central government. This has led to periodic attempts of reforming the NHS financing mechanisms, with the main source of financing remaining public. In this respect three periods seem relevant. The first period 1978–1992 is characterised by the full financing of HCE by means of the National Health Fund, NHF, originating from general tax revenues with constrained destination to health care financing. In the second period, from 1993 to 1997, a mixed mechanism of financing was introduced, with the NHF having an apparent complementary role to the revenues deriving from health care social contributions levying on dependent labour income.Footnote 9 In the third period, from 1998 on, health care social contributions were substituted by a regional tax on production activities, by a regional surtax on personal income, by revenue sharing of the VAT revenue and excise duty on petrol. Apparently, regions have to rely on regional taxes for funding both health care and other regionally funded activities.Footnote 10 Nevertheless, the devised measures have not been sufficient to contrast the massive debt overhang originated by the health care system, with the consequence of increasingly serious problems of fiscal sustainability for the state (see Fedeli 2008). In this respect, it should be noted that regional differences in public finances also suggested the use of specific policies to solve specific situations of crisis: since 2007 for those regions under risk of financial default due to indebtedness originated by the health care system a policy of Repayment Plans (Piani di Rientro) begun with the following features.Footnote 11 The Plan is setup and agreed upon amongst the Region presenting structural health deficit, the Ministry of Health, and the Ministry of Economic and Finance. The plan outlines objectives and strategies to recover the financial equilibrium (recovery plan) and to remove the structural determinants of disequilibrium by means of targeted actions against the structural determinants of health deficits (turnaround plan) still keeping a given quality of assistance levels.Footnote 12 As we shall see below none of these policy action has had any effect in shaping the HCE.

3 The Model and the Data

The empirical analysis of long-run relationships among integrated variables with both a cross sectional dimension, N, and a time-series dimension, T, by means of panel cointegration techniques has been developed in two broad directions. On the one side, the approach mainly explored by Pedroni (1999, 2004) is based on the methodology of Engle and Granger (1987) whereby the residuals of a static least squares regression are subject to a unit root test. On this basis, however, some studies have failed to reject the no-cointegration null hypothesis even in cases where cointegration is strongly suggested by theory.Footnote 13 On the other side, Westerlund (2005, 2007) has recently developed new panel cointegration tests based on structural rather than residual dynamics (see below). The asymptotic results reveal that such tests have limiting normal distributions, and that they are consistent. Westerlund (2007) reports simulation evidence suggesting that the new tests maintain good size accuracy, and that they are more powerful than the residual based tests by Pedroni (2004). On this basis, we shall analyze the influence of per capita Italian regional GDP on per capita Italian regional HCE.

Notice here that the existence of a positive association between income and health care has been often established for OECD countries,Footnote 14 whereas some controversy remains on the direction of causality. Does the relation stem from health affecting individual working opportunities—and hence income—or from income providing a protective effect on health? A broad overview of the existing literature suggests that both directions of causality are indeed at work (see Smith 1999) and this is also the results suggested by the Granger causality test reported in “Appendix 1”. For this reason, we have also tested for cointegration the inverse relation of HCE on GDP again using the Westerlund tests above mentioned. In this case, however, cointegration is not supported by the Italian data (see “Appendix 2”). Thus, we concentrate on the impact of income on health expenditures only.

The basic model we postulate between HCE and GDP is the following:

$$\ln (HCE\_PC)_{it} = \theta_{0i} + \theta_{1i} (\ln (GDP\_PC))_{it} + \tau_{i} t + \mu_{i} + \varepsilon_{it}$$
(1)

where i = 1, …, 20 identifies the regions; t = 1982, …, 2009 identifies the years. Both HCE_PCit and GDP_PCit are per capita measures of, respectively, the regional HCEs and the regional GDP, both at constant 2005 prices and taken in logarithms. The time trend, t, to be tested, has been included. The interpretation of such a trend, if significant, would be that it captures the impact of technological change. \(\mu_{i}\) is the region specific effect, ɛ it represents the regression error.

The setting of Eq. (1), can be modified if accounting for cross-section dependence in the data. This can be generated by unobserved factors affecting all regions, but to a different degree. The model can be described as follows:

$$\ln (HCE\_PC)_{it} = \theta_{0i} + \theta_{1i} \ln (GDP\_PC)_{it} + \, \gamma_{i} ({\text{f}})_{\text{it}} \, + \tau_{i} t + \mu_{i} + \varepsilon_{it}$$
(2)
$$\ln (GDP\_PC)_{it} = \, \alpha_{i} + \varphi_{i} ({\text{f}})_{\text{it}} \, + \psi_{i} (g)_{it} + u_{it}$$
(3)

where f and g are unobserved factors (most likely, common shocks) affecting directly or indirectly [i.e. impacting on ln (GDP_PC) it ] the ln (HCE_PC) it . \(\, \varphi_{{i^{{^{{}} }} }}\) and \(\, ^{{^{{}} }} \psi_{i}^{{}}\) are the region specific factor loads which cause a heterogeneous response to common shocks.

If the variables in Eq. (1) [and (2)] are I(1) and cointegrated, the error term is I(0) process for all i. The principal feature of cointegrated variables is their responsiveness to any deviation from the long-run equilibrium. This implies an error correction model, in which the short-term dynamics of the variables in the system are influenced by the deviation from the equilibrium. As for the short-term, we postulate the dynamic panel specification as follows:

$$\Delta \ln (HCE\_PC)_{it} = \partial_{0i} +\Phi _{i} [\ln (HCE\_PC)_{i,t - 1} - \theta_{0i} - \theta_{1i} \ln (GDP\_PC)_{it} - \tau_{i} t{ ] } + \delta_{11i} {\text{ (X)}}_{\text{it}} \, + \varepsilon_{it}$$
(4)

Φ i is the error-correction speed of adjustment parameter, which is expected to be significantly negative if the variables exhibit a return to the long-run equilibrium, whereas if it is zero there would be no evidence for a long-run relationship.

\(\theta {\text{s }}\) are the long-run coefficients, which give the long-run relationship between the variables as from Eq. (1) [and (2)]. With the inclusion of \(\theta_{0i} \,\) a nonzero mean of the cointegrating relationship is allowed.

\({\text{X}}_{\text{it }} \,\) is a vector of explanatory stationary variables entering the dynamic specification and \(\delta {\text{s }}\) are the corresponding coefficients. In this respect, we test the variables that are traditionally credited to influence the HCE. In particular, we take into consideration the traditional structural, socio-economic and demographic characteristics by evaluating, in turn, the significance of: aging population (i.e. the proportion of people over 65 in each region) that has been often identified as the major culprit of the raise in public health care spending (see Bethencourt–Galasso 2008); the regional net migration rate; labour market indicators such as unemployment rate and activity rate; the number of public hospital beds; the density of hospital doctors and general practitioners; the number of public and private hospitals. Footnote 15 Following Culyer (1988) three variables—CGP(8292), CGP(9397), CGP(9809)—capturing homogeneous periods of central government budget policies for health-care public finances are considered; they refer to the central government fiscal reforms, respectively, of the periods 1982–1992, 1993–1997, 1998–2009.Footnote 16 We also consider the specific policies known as “repayments plans” addressed to those regions under the risk of financial default in the period considered—i.e. Abruzzo, Calabria, Campania, Latium, Liguria, Molise. Finally, following the political economy approach—in particular, Weingast et al. (1981), Tornell–Velasco (1992), Chari–Cole (1995), Persson–Tabellini (1999, 2000, 2001, 2005) and Milesi-Ferretti et al. (2002), but also Roubini–Sachs (1988, 1989), Lijphart (1984, 1994), Santagata (1995), Grofman (2001), Grilli et al. (1991)—we test a set of political and institutional variables, defined as follows: (1) the type of electoral system under which the regional government was elected is considered by means of the variable electoral_law, which takes on a value of 0 under the proportional system and a value of 1 under the plurality system. The regional electoral system changed from proportional to mixed plurality in 1994. (2) In 1999, the direct election of the president of the region, who becomes formally and substantially responsible for regional administration, was introduced for all Italian regions with ordinary bylaw (in 2001 for regions with extraordinary bylaw). The rationale of this institutional reform was to give direct administrative responsibility to the regional presidents as a measure aimed at curbing both deficits and expenditures. We capture the change in the electoral law related to the election of the president of the region by means of the variable president_election which takes on value 1 since the year when direct election was introduced. (3) The test has also been carried out considering the “political colours” of regional ruling coalitions over the period. In this respect, a variable depicting the centre-left ruling party coalitions over time has been considered. (4) Political and local elections are captured by the variable election, which takes on value 1 for the years of political and local elections and 0 otherwise. As for the impact of this variable on HCE, the theory says that under the proportional system the “universal” type of expenditure such as HCE should be higher than under the plurality system, which, in principle, favours those kind of expenditures addressed toward specific objectives (likely in favours of the electoral district or of specific interest groups supporting the political campaign of the elected representatives), see Persson and Tabellini (1999, 2000, 2001). In the same vein, Milesi-Ferretti et al. (2002) support the view that the more proportional the electoral system the wider variety of interests shall be accomplished, the more the electoral system stick to the single-member constituency the more the public expenditures shall accomplish local interests. By analogous reasoning, we should expect that after the introduction of the direct election of the president of the region, who becomes formally and substantially responsible for regional administration whose main duty is related to health care, the president himself is in the position to curb HCE, which is the main cause of regional deficits. As for the political colour of the regional ruling coalitions, at least for the Italian experience, we might expect left-wing ruling party coalition pay more attention to the health care system. Finally, in line with those theories of the electoral cycle, we would expect an increase of HCE in the proximity of both political and local election years (for the Italian case see Santagata 1995).

The dataset is a yearly panel data for all the 20 Italian regions for the period 1982–2009. The statistics of the considered variables are summarized in Table 1. Our source of data for structural health-care, economic and demographic characteristics is the Italian National Statistical Office (ISTAT), while the source of political and institutional variables is the Italian Ministry of Internal Affairs.

Table 1 Summary statistics of the considered variables

4 Empirical Results

We test for cointegration the model depicted in Sect. 3. We, first, assume cross-section independence in the data. As a robustness check, we, then, test for the presence of cross-section dependence. In both cases, we analyse the long- and short-term relations. The results obtained in either case provide evidence in favour of our specification. We report only the results which are significant for our analysis; full results are available from authors upon request.

4.1 Absence of Cross-Section Dependency

The first step is to test whether the variables are nonstationary or not. Dealing with longitudinal data, we refer to the test by Im et al. (2003), which is based on the assumption of no cross-sectional dependence. The results, reported in Table 2, indicate that, as for ln_HCE_PC, we reject the null hypothesis at the 1 % level of significance for a number of lags going from 1 to 6, whereas for ln_GDP_PC, once a linear time trend has been accommodated, we end up with a rejection of the null hypothesis at the 1 % level of significance for a number of lags going from 1 to 6 (in Table 2 we have marked in italicised values the statistics that accept the null hypothesis).Footnote 17

Table 2 Im–Pesaran–Shin test (IPS) for ln_H_PC and for ln_GDP_PC

The second step is to test whether HCE_PC and GDP_PC are cointegrated. As already mentioned, we refer to Westerlund (2005, 2007) and Persyn–Westerlund (2008). Their idea is to test the null hypothesis of no cointegration by inferring whether the error-correction term in a conditional panel error-correction model is equal to zero. The four tests are all normally distributed and are general enough to accommodate unit-specific short-run dynamics, unit-specific trend and slope parameters, and cross-sectional dependence. Two tests are designed to test the alternative hypothesis that the panel is cointegrated as a whole, while the other two test the alternative that at least one unit is cointegrated. The calculated values of the error correction statistics are presented along with asymptotic p values in Table 3. When using the asymptotic p values, for both the cases in which the deterministic chosen is constant only and constant and trend, all four tests lead to a clear rejection of the null hypothesis at 1 % level, which we take as strong evidence in favour of cointegration between ln_HCE_PC and ln_GDP_PC.

Table 3 Westerlund ECM panel cointegration tests between ln_HCE_PC and lnGDP_PC

As for the estimation of the cointegrating vector and the short-term dynamics, the recent literature has suggested several approaches. Table 4 reports the estimation results in three cases, i.e. (1) In Dynamic Fixed Effects Regression model (DFE), we restrict the coefficients of the cointegrating vector to be equal across all panels; moreover, DFE further restrict the speed of adjustment coefficient and the short-term coefficient to be equal.Footnote 18 (2) The mean group estimates (MG) are the unweighted mean of the N individual regression coefficients. They are calculated, as proposed by Pesaran–Smith (1995), with intercepts, slopes, coefficients and error variances allowed to be different across groups. Here they are reported as a two equation models. The full model estimates are available from authors on request. (3) The pooled mean group estimation (PMG) proposed by Pesaran et al. (1997, 1999) allows for heterogeneous short-term dynamics and common long-run effects of GDP. The long-run effects and the averaged short-run parameter estimates are reported in Table 4, the full estimates of a N + 1 multiple equation model are available from authors on request.

Table 4 Estimated error correction forms

In either case the output is divided into two equations: the first, labelled EC, shows the long-run equation; the second, labelled SR, shows the short-term dynamics of the ln_HCE_PC. The SR equation, in either case, has been obtained by testing separately groups of variables mentioned before (i.e. the traditional determinants, budget policy variables, political and institutional variables). In either case the variables which turn out to be significant are always the same, as reported in the specifications of the model in Table 4.

First, notice that, as for the long-run equation, in all cases, the estimated coefficient for ln_GDP_PC is significantly positive. Moreover, the time trend is never significant in the long term equation. This is quite interesting because the time trend has been commonly interpreted as the impact of technological progress on health spending. In this respect, the health care literature has emphasized the relevance of technological progress, and the adoption of new, more expensive medical treatments to explain the raise in health spending (see Newhouse 1992). Apparently, these new technologies are not so massively used in Italy.

The different estimation methods provide estimated coefficients not always so close to each other. As explained above, their main difference concerns assumptions on coefficients, with the DFE—at one extreme—imposing more restrictions, and the MG estimation representing the most flexible specification and allowing even slope coefficients to vary across countries. Comparing DFE, MG and PMG estimations, it turns out that the long-run GDP elasticity from DFE is about 0.94, the one from MG is about 0.92, and that from PMG is 1.596. The PMG constrains the long-run elasticities to be equal across regions, and this pooling across region gives efficient and consistent estimates when the restrictions are true. If the hypothesis of slope homogeneity is not true, i.e. the true model is heterogeneous, the PMG estimates are inconsistent, whereas the MG are consistent in either case.Footnote 19 Second, notice that in the short-run equation of all the estimated models, D1.ln_GDP_PC (GDP rate of growth) is significant and positive whereas D2.ln_GDP_PC (the variation of the rate of growth) is significant and negative. These results support the view that an increase of GDP increases HCE in both the long and the short-term (in the SR at a decreasing rate). Moreover, this seems to confirm the “Wagner law” also in the short term.

In the short-term, the effect of the estimated cointegrating vector (i.e. the ec term) is significant and negative, as expected, in all the three cases. The coefficients of the dynamics and the speed of adjustment terms are very similar in size, implying a slightly different short-term dynamics. Moreover, ln_HCE_PC is shown to be affected by the variable election, which captures both political and local elections. The variable president_election, which considers the introduction in the system of the direct election of the president of the region, is significant with positive value. The political colour of the regional ruling coalition is not significant and the same is true for those variables capturing the electoral system and the central government policies addressed both to all region and to specific regions by means of “repayment plans”.

Interesting enough is the result that the variable aging population is not significant in the short term. Analyzing the political support to social security and medicare, Bohn (1999) forecasts a further increase in spending driven by the change in the age profile of the electors. This, however, does not seem to be that case in Italy.

In Table 5, we test the difference in these models with the Hausman test (see Baum et al. 2003).

Table 5 Hausman restriction test on coefficients

The calculated Hausman statistic for MG versus PMG is 9.48. Being distributed as χ2(2), it leads to conclude that, under the null hypothesis, MG is preferred. The DFE model is subject to a simultaneous equation bias from the endogeneity between the error term and the lagged dependent variable (see Baltagi et al. 2000). In order to measure the extent of this endogeneity we again perform the Hausman test of DFE versus MG. The results indicate that the simultaneous equation bias is minimal for these data and therefore the DFE is preferred over MG. We further perform the Hausman test of PMG versus DFE. The calculated Hausman statistic is 0.06, indicating that the DFE is also preferred over PMG.

The first insight we gather from the above tables is that the Hausman test has provided some evidence in favor of the DFE estimation strategy. In addition, the usage of three different estimators provides indications on the specification of the cointegrating vector. The results are robust in that the sign of the long-term variables coefficients are always confirmed, only their size is, in one case, affected by the estimation method. Before adding further comments on the estimated coefficients, in the next section we perform a robusteness check of our approach considering the issue of cross section dependence.

4.2 Presence of Cross-Section Dependency

The presence of cross-section dependency within the framework of our dataset is highly likely. In this context, this correlation can be the result of local spillover effects between regions.Footnote 20 We investigate this issue by implementing the most commonly used CD-test for cross section dependency (Pesaran 2003, 2004). In Table 6, the CD-tests reject the null hypothesis of cross-section independency. We therefore proceed by repeating the sequence of tests described in previous section—i.e. testing for unit root and for the presence of cointegration, and finally estimating cointegrating relationships—but allowing for cross section dependence.

Table 6 CD-test on ln_HCE_PC ln_GDP_PC

We first run the t test for unit roots in heterogenous panels with cross-section dependence, proposed by Pesaran (2003).Footnote 21 We investigated results for a number of lags spanning from 1 to 6. The vast majority of the statistics, reported in Table 7, confirmed the nonstationarity. Only those statistics which have been highlighted in italicised values provide a different outcome.

Table 7 Panel unit root test, Pesaran (2007)

The results obtained require a further test to confirm that the variables are still cointegrated. Following Westerlund (2005, 2007) and Persyn and Westerlund (2008), we assume their same data generating process for their error correction test, and test for cross sectional independence in its residuals by means of the Breusch–Pagan statistic. In this respect, notice that this test requires T > N, which is our case. However, as our time series are rather short, given that some periods are lost in the calculation of differenced variables and lags, we test for independence of the first 12 cross-sectional units and assume the same short-run dynamics for all series. As for the relation between ln_H_PC, and ln_GDP_PC, the Breusch–Pagan LM test of independence gives χ2(55) = 155.662 (Pr = 0.0000). As this result strongly indicates the presence of common factors affecting the cross sectional units, we bootstrapped robust critical values for the test statistics related to the Westerlund ECM panel cointegration tests, keeping the short-term dynamics fixed. Table 8 shows that, when taking into account cross-sectional dependency, the tests still reject the null hypothesis of no cointegration.

Table 8 Westerlund ECM panel co-integration tests between ln_HCE_PC and ln_GDP_PC, bootstrapped (1,500 rep.) critical values

On these bases, we evaluate if the presence of cross section correlation changes at all the results when estimating the cointegration vector. The long-run coefficients estimated by means of the augmented mean group estimator (AMG), developed in Eberhardt–Teal (2010) (see also Bond–Eberhardt 2009), are reported in Table 9. The results of the AMG estimator provide evidence in favour of our specification in the absence of cross section correlation. Again, the time trend, which is often interpreted as capturing technological changes, does not turn out to be significant, whereas the ln_GDP_PC is still significant and correctly signed. Interestingly, its size is smaller than that resulting from all three models in Table 4, although we again find that the elasticity is greater than zero. The magnitudes of income elasticity equal to 0.18 imply that a 1 % increase in per capita income leads to an increase in the share of government spending for health of about 0.18 %. This result still supports Wagner’s law.Footnote 22 In this respect, also the view that health care is a superior good as suggested by Hall–Jones (2004) seems to be confirmed: as individuals get richer they choose to spend a larger proportion of their income on health care. Likely for electoral reasons (see below), policy-makers adjust public health spending to individual preferences.

Table 9 Augmented mean group estimator for long-run coefficients of the cointegrating vector

We can now move on to achieve an error correction representation as we did in the case of absence of cross section correlation. We now impose the long-term specification, K, estimated in Table 9 with the AMG estimator.Footnote 23 Table 10 reports the MG results after averaging the short-term parameter estimates.Footnote 24 This is of some help for a comparison with the results of Table 4. (The full estimates are available from authors on request).

Table 10 Mean group estimation: error correction form: constrained long-term

The results are markedly aligned with those presented in previous section (Table 4). Again, the effect of the estimated cointegrating vector (the ec term) is significant and negative, as expected. The coefficients of the dynamics and the speed of adjustment terms are very similar in size to the DFE specification. Still, this result supports the view that an increase of GDP increases HCE in both the long and the short term and seems to confirm those theories according to which public HCE growth is determined by changes of the economic and social structure. As before, in the short term the variable election is significant, confirming that during electoral years there is an increase of HCE. The variable president_election, which considers the introduction in the system of the direct election of the president of the region, takes on positive value, whereas the political colour of the regional ruling coalition is again not significant. This confirms that the role of the president of the region has really changed, acquiring greater power with respect to that of political parties. The variables capturing central government budget policies addressed both to all regions and to specific regions by means of Repayment Plans are never significant, confirming that the impact of policies aiming at controlling health care costs were not even effective in the short-term just like the changes in the electoral system.

5 Summary of results and conclusions

Hall and Jones (2004) argue that health care is a superior good, because, as individuals get richer, they choose to spend a larger proportion of their income on health care. In the Italian public health care system, this might mean that policy-makers are merely adjusting public health spending to the electorate’s wishes. Our empirical results confirm these findings.

In particular, with a panel of all the 20 Italian regions (1982–2009), we investigated the long-run relationship between HCE and GDP. Moreover, we tested for the short-term behaviour of HCE, controlling for additional structural variables which have been credited to affect it, namely, the traditional determinants of health expenditure, and the political and institutional features likely affecting the public choices related to the Italian health-care system.

We found confirmatory evidence of a long-run equilibrium equation between ln_HCE_PC and ln_GDP_PC, with the estimated long-run real income elasticity greater than zero. The method of estimation provided consistent results concerning the impact of income on shares of government spending in health care with income elasticity of about 0.18. This implies that a 1 % increase in per capita income leads to a 0.18 % increase in the share of government expenditure for health care in the long term. Thus, per capita income increases more than the increase in the share of the government spending in health.

The presence of cointegration also implies that there exists short-term dynamics which will lead to equilibrium in the long run. In this respect, notice that the short term adjustment coefficient, as expected, is negative and statistically different from zero, thus suggesting that any deviation of per-capita public spending for health care from the value implied by the long-run equilibrium relationship with per-capita GDP brings about a correction in the opposite direction. Here, the error correction coefficient signals a relatively slow adjustment to the long-run equilibrium. Moreover, we found that the share of government spending in health is affected by per capita income also in the short term. Indeed, the significant—and >1—coefficient of the change of rate of growth of GDP in the short-run equation of all estimated models confirms the presence of some instantaneous impact of increasing GDP on the size of public expenditure for health care, although at a decreasing rate given that the variation of the change of rate of growth of GDP takes on negative sign. This result, together with the result found for the long term equation, further supports those theories according to which the growth in public expenditure for health is a natural consequence of economic growth both in the long and the short term.

In the short term, the variable related to the presence of elections is always significant with positive sign, confirming that during electoral years there is an increase of about 0.015 % of HCE. This result supports the existence of a type of electoral cycle involving health care with an increase of the related expenditure in the electoral year. Finally, the introduction in the system of the direct election of the president of the region positively affects the HCE. This result signals that the role of the president of the region has indeed changed, having gained greater power with respect to that of political parties. Unexpectedly enough, the increased responsibility of the president of the region does not curb the increase of HCE as the regional institutional reform of the end of the ‘90s hoped for. The impact of the reform is, in fact, about 0.07 % increase of public expenditure for health. On the other hand, those theories belonging to the so called political economy approach which points out that expenditure of “universal” type, such as HCE, are likely to be raised in those countries under the proportional electoral systems are not supported by the Italian data on health. As well as the presence of a political cycle is not supported by the Italian data because the variables related to the “political-colour” of the regional ruling coalitions are never significant.

The overall conclusion that emerges from the empirical analysis is that there exists a long- and a short-term relationship between GDP growth and growth of public expenditure for health in Italy. The estimates also support the view, already noted by Wagner, that there is a proportion between public expenditure and national income which may not be permanently overstepped (Alt 1980). This suggests an equilibrium relative size of government rather than an ever-growing government sector. Our result, other than providing an empirical support for the existence of Wagner’s law, whose meaning should be further explored in the Italian context also for different types of public expenditure, has important policy implications. For example, after the current global financial recession, the Italian government should be cautious about its present and future spending. Indeed, as income shall rise again, people will choose relatively more government expenditure. The growth of HCE—as well as the growth of social security transfers and unemployment compensation, major programs redistributing cash,…—is consistent with this finding. Nevertheless, extra public spending poses the issue of fiscal sustainability. Indeed, the public debt in Italy stands at about 133 % (est. 2013) of GDP, and any further increases would imply that future government spending may be mainly directed toward debt servicing, likely at the expense of public expenditure on basic infrastructure (particularly education and health facilities).

Finally, our results generally support the view that a substantial part of the growth of per-capita health care expenditure is a response to voter demand irrespective of the “political colour” of the regional governments’ ruling coalitions. Although we have not tested, and cannot exclude, other plausible explanations—including the much discussed role of pressure groups and bureaucrats—our findings assign a non-negligible role to voters’ choice. In this respect, notice that Rowley and Tollison (1994) compare Wagner’s law with the principle of comparative advantage: the law explains the complementarity between the growth of the industrial economy and the associated growth in demand for public services. In other words, social progress brings about an increase in state activity which in turn means more government expenditure (Henrekson 1993). Wagner himself (1883) stated that there is a persistent tendency both towards an ‘extensive’ and an ‘intensive’ increase in the functions of the state: new functions are continually undertaken and old ones are performed more efficiently and on an extended scale, which increases the spending of the government. Public expenditure is resorted to perform these activities. Wagner asserts that, among the reasons for the fact that increases in real income lead to more demand for basic infrastructure, there may well be the fact that the government provides these same facilities more efficiently than the private sector.Footnote 25 This assertion, however, is still to be proved, and evidence should be provided to support the claim by Peacock and Scott (2000) that when the comparative advantage of government declines, the share of public expenditure in total GDP also declines.