1 Introduction

Governments tend to absorb a sizeable share of society’s resources and, therefore, they affect economic development and growth in many countries.Footnote 1 Throughout history high levels of economic development have been attained with government intervention. Where government did not exist, little wealth was accumulated. However, despite necessary, government intervention is not a sufficient condition for prosperity, if it leads to the monopolization of the allocation of resources and other important economic decisions, and societies did not succeed in attaining higher levels of income.Footnote 2

Theoretically there can be a point for a positive effect of government spending on growth, essentially via a Keynesian view upon which the government impulse to aggregate demand plays an upward role. Usually, one would then call upon fiscal policy to expand in order to foster growth, ceteris paribus. On the other hand, the negative effect of government spending on growth can stem from the ideas of the fiscal theory of the price level, where a after a contraction in government spending, the creation of expectations on consumers point to future lower taxes and an upward push in growth, also ceteris paribus.

In addition, economic progress is limited when government is zero percent of the economy (absence of rule of law, property rights, etc.), but also when it is closer to 100 percent (the law of diminishing returns operates in addition to, e.g., increased taxation required to finance the government’s growing burden—which has adverse effects on human economic behaviour, namely on consumption decisions). This idea is related to the so-called “Armey Curve” (Armey and Armey 1995), who borrowed a graphical technique popularized by Arthur Laffer. Friedman (1997) suggested that the threshold at which government affects economic growth is between 15 and 50 % of the corresponding national income.

The existing literature assesses identification issues related with establishing the growth effects of different types of public spending and taxation (notably the empirical work spanning from Barro 1990 to Gemmell et al. 2011), and also presents mixed results as to the relationship between government size and economic development (Wahab 2011). On the debate between the positive versus negative effects of government growth, Grossman (1988) suggested that a non-linear model was preferred in explaining its impact on total economic output (see Wu et al. 2010 for a recent study conditioning on the level of economic development).

On the one hand, government activities may have positive effects due to beneficial externalities, the development of a legal, administrative and economic infrastructure and interventions to offset market failures (Ghali 1998; Colombier 2009 Footnote 3).

On the other hand, the former may impact economic growth negatively due to government inefficiencies, crowding-out effects, excess burden of taxation, distortion of the incentives systems and interventions to free markets (Barro 1991; Bajo-Rubio 2000). Indeed, several studies report that the efficiency of government spending can increase, either by delivering the same amount of services with fewer resources or by using more efficiently existing spending levels (see Afonso et al. 2005; Angelopoulos et al. 2008). While the results vary, depending on the econometric model, the measurement of government size, the type of countries studied (rich vs. poor) and the time span considered, and a consensus is yet to be reached, the most recent studies typically find a negative correlation between government size and economic growth (Bergh and Henrekson 2010, 2011). Ceteris paribus, this is the underlying hypothesis in our study.Footnote 4 Moreover, Slemrod (1995) and Tanzi and Zee (1997) find a negative impact if the size of government exceeds a certain threshold. Controlling for other differences, European countries with government spending greater than 40 percent of GDP have had much lower growth rates during the last 15 years (Gill and Raiser 2012). The rationale behind this argument is that in countries with big governments the share of public expenditures designed to promote private sector productivity is typically smaller than in countries with small governments (Fölster and Henrekson 2001).

On the other hand,

Additional assessments are provided notably by Dar and Amirkhalkhali (2002) who report for OECD countries that Total Factor Productivity growth and productivity of capital are weaker in countries with larger governments. In the same vein, Afonso and Jalles (2013b), who report a negative effect of the debt ratio on growth, and that fiscal consolidation promotes growth, notably for OECD countries.

Our motivation also comes from Guseh (1997) who presents a model that differentiates the effects of government size on economic growth across political systems in developing countries. Growth in government size has negative effects on economic growth, but the negative effects are three times as great in non-democratic systems as in democratic systems. Understanding the link between government size, institutions and growth rests is important because, on the one hand, larger governments increase the potential for rent-seeking that adversely affects innovation. On the other hand, better institutions can mitigate rent-seeking activities and thereby induce the reverse effects on growth (see, e.g., Chaudhry and Garner 2007). Kirchgaessner (2001) provides a survey on the interactions between institutions and public finance.

Our paper includes several contributions: (1) we analyse a wide set of 140 countries composed of both advanced and emerging and low income countries, using a long time span running from 1970 to 2008, and employing different proxies for government size and institutional quality to increase robustness; (2) we build new measures of extreme-type political regimes which are then interacted with appropriate government size proxies in non-linear econometric specifications; (3) we make use of recent panel data techniques that allow for the possibility of heterogeneous dynamic adjustment around the long-run equilibrium relationship as well as heterogeneous unobserved parameters and cross-sectional dependence (e.g., Pooled Mean Group, Mean Group, Common Correlated Effects estimators, inter alia); and (4) we also deal with potentially relevant endogeneity issues.

Our results suggest the existence of a negative effect going from large governments to growth. Interestingly, government consumption is consistently detrimental to output growth irrespective of the country sample considered (OECD, Emerging and Low Income Countries). On the other hand, institutional quality has a significant positive impact on the level of real GDP per capita. Moreover, (1) the negative effect of government size on GDP per capita is stronger at lower levels of institutional quality, and (2) the positive effect of institutional quality on GDP per capita is stronger at smaller levels of government size. Furthermore, the stability of the coefficient estimates seems to have been relatively constant over time. Finally, our results prove themselves robust to several robustness analyzes.

The remainder of the paper is organised as follows. Sect. 2 presents the analytical framework. Sect. 3 presents the data. Sect. 4 elaborates on the econometric methodology and discusses our main results. Sect. 5 concludes.

2 Analytical framework

In this section we present a growth model that relates output and government size and it will provide the theoretical motivation for our empirical (panel) analysis in Sect. 3. Our model fits within a broader literature that expands a Barro (1991)-type model where government plays an active role.Footnote 5 We consider a typical economy with a constant elasticity of substitution utility function of the representative agent given by:

$$ U = \int\limits_{0}^{\infty } {e^{ - \gamma t} \frac{{c_{t}^{1 - \theta } - 1}}{1 - \theta }} dt $$
(1)

where c is per capita consumption, θ is the intertemporal substitution and γ is the (subjective) time discount rate or rate of time preference (a higher γ implies a smaller desirability of future consumption in terms of utility compared to utility obtained by current consumption. Population (which we assume identical to labour force, L) grows at the constant rate n, that is, \( L_{it} = L_{i0} e^{{n_{i} t}} \). Output in each country i at time t is determined by the following Cobb-Douglas production function:

$$ Y_{it} = K_{it}^{\alpha } G_{it}^{\beta } (A_{it} L_{it} )^{1 - \alpha - \beta } ,0 < \alpha < 1, \, 0 < \beta < 1, \, 0 < \alpha + \beta < 1. $$
(2)

Y is the final good, used for private consumption, G is public consumption expenditure, which proxies for government size, and K is investment in physical capital. We consider the case of no depreciation of capital. The output used to produce G equals qG (which one can think of as being equivalent to a crowding-out effect in private sector’s resources). A is the level of technology and grows at the exogenous constant rate μ, that is, we have

$$ A_{it} = A_{i0} e^{{\mu_{i} t + I_{it} \rho_{i} }} $$
(3)

with I it being a vector of institutional quality, political regime and other related factors that may affect the level of technology and efficiency in country i at time t, and ρ i is a vector of (unknown) coefficients related to these variables. In this framework, the state of labour-augmenting technology (A) depends not only on exogenous technological improvements determined by μ, but also on the level of institutional quality (such as the rule of law), the degree of democratic political foundations, etc. Institutions may be critical in facilitating technological breakthroughs, which may not occur without appropriate sound institutional environments. The presence of efficient and effective institutions ensures that labour can be used for productive purposes, instead of being wasted with red tape or rent seeking activities (North 1990; Nelson and Sampat 2001).

We begin by writing down the resource constraint for this economy in per worker terms, given by:

$$ \dot{K}_{t} = Y_{t} - C_{t} - qG_{t} \Leftrightarrow \dot{k}_{t} = y_{t} - c_{t} - qg_{t} - nk_{i} $$
(4)

where \( \dot{K}_{t} \) is the time derivative of physical capital and small letters represent per worker terms (after scaling down by L).

We now write the conditions that characterize the optimal path for the economy and determine the steady-state solution for private and public consumption and income per worker. The optimal path is the solution of:

$$ \begin{gathered} \mathop {\hbox{max} }\limits_{{c_{t} ,g_{t} }} \int\limits_{0}^{\infty } {e^{ - \gamma t} \frac{{c_{t}^{1 - \theta } - 1}}{1 - \theta }} dt \hfill \\ s.t.:\dot{k}_{t} = k_{t}^{\alpha } g_{t}^{\beta } A_{t}^{1 - \alpha - \beta } - c_{t} - qg_{t} - nk_{i} . \hfill \\ \end{gathered} $$
(5)

Solving the Hamiltonian’s corresponding first order conditions and after some manipulations yields:

$$ \begin{aligned} k^{*} &= A\left( {\frac{\alpha }{\theta \mu + \gamma + n}} \right)^{{\frac{1 - \beta }{1 - \alpha - \beta }}} \left( {\frac{\beta }{q}} \right)^{{\frac{1 - \beta }{1 - \alpha - \beta }}} \\ g^{*} &= A{}^{{\frac{1 - \alpha - \beta }{1 - \beta }}}\left( {\frac{\beta }{q}} \right)^{{\frac{1 - \beta }{1 - \alpha - \beta }}} k^{{*\frac{\alpha }{1 - \beta }}} \\ y^{*} &= k^{*\alpha } g^{*\beta } A^{{^{1 - \alpha - \beta } }} \\ c^{*} &= y^{*} - \left( {n + \mu } \right)k^{*} - qg^{*}. \\ \end{aligned} $$
(6)

A special case occurs when α + β = 1 and n = μ = 0 in which there is no transition dynamics and the economy is always in the balanced growth path.

We refrain from making full considerations on the model’s solution, but one, in particular, is worth makingFootnote 6: an increase in q (which implicitly proxies the overall size of the public sector translating the fact that more resources are needed/required to finance G) reduces both the optimal level of private consumption per worker (and physical capital per worker) and, more importantly, the optimal level of output per worker in this model economy.

Turning to econometric specification, in the steady state, output per effective worker (\( \hat{y}_{it} = Y_{it} /A_{it} L_{it} \)) is constant while output per worker \( \left( {y_{it} = Y_{it} /L_{it} } \right) \) grows at the exogenous rate μ. In general, output in effective worker terms evolves as \( \hat{y}_{it} = (k_{it} )^{\alpha } (g_{it} )^{\beta } \) and in (raw) worker terms, output evolves according to \( y_{it} = A_{it} \left( {k_{it} } \right)^{\alpha } \left( {g_{it} } \right)^{\beta } \). Taking logs on both sides we get ln \( y_{it} = \ln A_{it} + \alpha \ln k_{it} + \beta \ln g_{it} \), and using (3) and the fact that in (2) we have \( \left( {A_{it} L_{it} } \right)^{1 - \alpha - \beta } \) entering the utility function, we obtain,

$$ \ln y_{it} = A_{0} + (1 - \alpha - \beta )\mu_{i} t + (1 - \alpha - \beta )\rho_{i} I_{it} + \alpha \ln k_{it} + \beta \ln g_{it} . $$
(7)

Equation (7) describes the evolution of output per worker, as a function of a vector of institutional and political related variables, which may change over time, the size of the public sector or government, the level of physical capital and the exogenous growth rate of output. Given the production function relationship, (7) is valid both within and outside the steady-state and this is important, particularly, if one makes use of static panel data techniques for estimation purposes. Moreover, it is not dependent on assumptions on the behaviour of savings, hence offering a reasonable basis for estimation. Taking (7) as the baseline we will then augment it by splitting k it into physical (k) and human capital (h),Footnote 7 and use both a linear and non-linear specification (in which interaction terms are included), as follows:

$$ \ln y_{it} = b_{0} + b_{1} t_{t} + b_{3} I_{it} + b_{4} \ln k_{it} + b_{5} \ln h_{it} + b_{6} \ln g_{it} + \varepsilon_{it} $$
(8)
$$ \ln y_{it} = b_{0i} + b_{1} t_{t} + b_{3} I_{it} + b_{4} \ln k_{it} + b_{5} \ln h_{it} + b_{6} \ln g_{it} + b_{7} (I_{it} g_{it} ) + \eta_{it} $$
(9)

where the b’s are (unknown) parameters to be estimated, I it and g it denote the proxies for institutional quality and government size, respectively, and ɛ it and η it are model specific error terms satisfying the usual assumptions of zero mean and constant variance. Equations (8) and (9) provide the basis for the empirical models to be estimated in Sect. 4.

Finally, the variation of causality between government size and growth detected in cross-section and time-series papers suggests that there are important differences in the way in which governments influence economic performance across countries. We argue that it may reflect institutional differences across countries and, while this is a plausible conjecture, there is as yet little direct evidence to confirm that institutions and political regimes make a difference to the way in which governments affect economic outcomes.

3 Data

The dataset consists of an unbalanced heterogeneous panel of 140 countries over the period 1970–2010 in 5-year non-overlapping averages (to overcome short-run business cycle fluctuations as is common practice in the empirical growth literature).Footnote 8 Countries are grouped into advanced (OECD), emerging market economies (EME) and low income countries (LIC) based on the World Bank classification. Annual data on real GDP per capita (y) and gross fixed capital formation (inv) are retrieved from the World Bank’ World Development Indicators. We estimate the capital stock (Ky) using the perpetual inventory method, that is, Ky t  = Inv t  + (1 − δ)Ky t−1, where Inv t is the investment and δ is the depreciation rate. Data on Inv t comes from Summers and Heston’s PWT 7.1 as real aggregate investment in PPP. We estimate the initial value of the capital stock (Ky 0), in year 1950 as Inv 1950/(g + δ) where g is the average compound growth rate between 1950 and 1960, and δis the depreciation rate (set to 7 % for all countries and years). In addition we use average years of schooling from Barro-Lee dataset as our proxy for human capital.

The proxies for government size (our G) are government consumption (govcons_gdp), total government expenditures (totgovexp_gdp), total government revenues (totgovrev_gdp), and total government debt (govdebt_gdp) all expressed in percentage of GDP. They come from the IMF’s International Financial Statistics (IFS).

For institutional-related variables (our I) we rely on: (1) Freedom House’s Political Rights, Civil Liberties, (2) the Polity 2 index, a democracy index and the regime durability (in years) from Marshall and Jaegger’s Polity’s 4 database, (3) an index of democratization due to Vanhanen (2005), (4) a governance index and its 6 sub-componentsFootnote 9 from Kaufmann et al. (2009) (World Bank Governance Indicators)Footnote 10; (5) government fractionalization from the Quality of Government Dataset Footnote 11; (6) the KOF indices of economic, political and social dimensions of globalizationFootnote 12; years in office, index of legislative competitiveness, index of executive competitiveness, checks and balances and dummy for legislative election held all from the World Bank’s Database on Political Institutions.Footnote 13

4 Methodology and results

4.1 Baseline results

Equations (8) and (9) can be estimated directly using panel data techniques, which allow for both cross-section and time-series variation in all variables.

Table 1 presents our first set of results for the fixed-effects specifications covering different proxies for institutional quality (eight in total). We use government consumption for our baseline, as the more obvious spending policy measure available to the fiscal authorities, and discuss its individual inclusion in our regression of interest as well as its interaction with a variable I it .

Table 1 Baseline results, full sample: Fixed Effects estimation with interaction terms

We find a positive effect of the capital stock on the level of real GDP per capita throughout the different specifications regardless of the institutional variable employed. The positive role of human capital is also evident from these regressions in accordance to other studies. One also finds a statistically significant negative coefficient on the government size proxy in 10 out of 16 specifications. Its coefficient varies between −0.08 and −0.11, meaning that an increase in government size by 10 % points, is associated with a 1.1–0.8 % lower annual growth per capita. This order of magnitude is consistent with previous literature. Institutional quality, generally speaking, has equally a consistent and statistically significant positive impact on economic activity, though its order of magnitude differs between the selected proxy under scrutiny.

When statistically significant the interaction term is negative, meaning that (1) the negative effect of government size on GDP per capita is stronger at lower levels of institutional quality, and (2) the positive effect of institutional quality on GDP per capita is stronger at smaller levels of government size. The interaction term means that the marginal effect of government size will differ at different levels of institutional quality. However, this result depends on the proxy used for I it . Nevertheless, we obtain in most regressions considerably high R-squares. Moreover, when regional dummies are included, coefficients keep their statistical significance and sign (not shown).

We go on in further exploring the robustness of our results to other institutional proxies and the results are generally in line with ones already reported. In Table 2, we observe a statistically significant negative coefficient for the government size proxy in 12 out of 16 specifications, and the coefficient’s range compares broadly in magnitude with one reported before.

Table 2 Baseline results, full sample: Fixed Effects estimation with interaction terms

Moreover, if one uses also the World Bank’s Governance (WBG) indicators, as a robustness check and despite the fact that these are only available from 1996 onwards, the conclusions still hold (see Table 3). Likewise, and complementing the aggregate full sample estimation we present a bar chart (Fig. 1) by country group (Advanced Economies, Emerging Market Economies and Low Income Countries) where we display only the statistically significant coefficients, for the respective WBG indices. Voice and accountability as well as rule of law are the two most important governance aspects to impact GDP per capita in OECD countries; in emerging market economies more importance is attributed to government effectiveness; in low income countries regulatory quality and rule of law stand out as the most important indicators.

Table 3 World Bank Governance Indicators, full sample: Fixed Effects estimation with interaction terms
Fig. 1
figure 1

Growth regressions’ estimated coefficients using World Bank’s Governance Indicators by country group (only statistically significant). Note author’s computations

In relation to the evolution over time of the relevance of both government size and institutional quality, Figs. 2 and 3 summarise respectively the relevant statistical estimate coefficients coming from the estimation of Eq. (8). Checking the time dimension constitutes an additional robustness check and we do not observe any switching signs over time; more importantly for most proxies estimates have been relatively stable throughout the period under consideration.

Fig. 2
figure 2

Estimated coefficients of government consumption in regressions using different institutional proxies, over time (only statistically significant). Note author’s computations

Fig. 3
figure 3

Estimated coefficients of different institutional proxies, over time (only statistically significant). Note author’s computations

4.2 Dynamic panel estimation: accounting for endogeneity

In the analysis of empirical production functions, the issue of variable endogeneity is generally of concern. That said, instead of estimating static equations, we now allow for dynamics to play an active role. A negative correlation between government size and economic activity does not imply causality. In fact, the most obvious reason (among many) to suspect reverse causality a problem is that welfare states social insurance schemes act as automatic stabilizers. We reformulate our regression equation(s) to include additionally the lagged real GDP per capita (the remainder of the variables is unchanged). We estimate by means of the Arellano-Bover system-GMM estimator which jointly estimates the equations in first differences, using as instruments lagged levels of the dependent and independent variables, and in levels, using as instruments the first differences of the regressors.Footnote 14 Intuitively, the system-GMM estimator does not rely exclusively on the first-differenced equations, but exploits also information contained in the original equations in levels. The results in Table 4 for a selected number of institutional quality proxies support the previous findings. Moreover, diagnostic statistics, in particular the validity of the instrument set used (assessed via the Hansen test), are well behaved.Footnote 15

Table 4 Accounting for Endogeneity: System GMM estimation, full sample

4.3 Using new political system’s measures

Following Rodrik and Wacziarg’s (2005) approach we construct new (and more meaningful) democracy measures based on the variable Polity. The role of political systems and democracy in particular, on the government size-growth relationship is assessed by regressing three structural aspects of democracy (to be defined below) on 5-year averages of real GDP per capita growth rates.Footnote 16 Indeed, polity does not capture two important dimensions of political regimes—either their newness (following, for example, democratization or a return to authoritarian rule) or their more established (consolidated) nature.

Rodrik and Wacziarg (2005) define a major political regime change to have occurred when there is a shift of at least three points in a country’s score on polity over three years or less. Using this criterion we define new democracies (ND = 1) in the initial year (and subsequent four years) in which a country’s Polity score is positive and increases by at least three points and is sustained, ND = 0 otherwise. Established democracies (ED = 1) are those new democratic regimes that have been sustained following the 5 years of a new democracy (ND). In any subsequent year, if established democracies (ED) fail to sustain the status of ND, ED = 0. Using these criteria, they define sustained democratic transitions (SDT) as the sum of ND and ED. They use the same procedure, mutatis mutandis, to define new autocracies (NA), established autocracies (ES) and sustained autocratic transition (SAT).

This yields six distinct binary-type measures of the character of political regimes—ND, ED, NA, EA, SDT, and SAT—for most years during 1970–2010. Finally, Rodrik and Wacziarg (2005) define small regime changes (SM) as changes in polity from one year to the next that are less than three points.Footnote 17 There are several advantages from creating these new measures, which allow us to distinguish the impact of new and established electoral democracies and autocracies on economic development.Footnote 18

EndogeneityFootnote 19 between right-hand side proxies of democracy and autocracy and a standard set of control variables is corrected for by taking a system-GMM (SYS-GMM) approach—as detailed above. As suggested in Mauro (1995), Hall and Jones (1999) and Acemoglu et al. (2001), the democracy measures are instrumented by:

  1. 1.

    the durability (age in years) of the political regime type (durable) retrieved from Marshall and Jaeggers’ databaseFootnote 20

  2. 2.

    latitude: Hall and Jones (1999) launched the general idea that societies are more likely to pursue growth-promoting policies, the more strongly they have been exposed to Western European influence, for historical or geographical reasons. Two other possible instruments could be common and civil law, translating the type of legal origin of each country

  3. 3.

    ethnic fragmentation (ethnic): on a broad level, the role of ethnic fragmentation in explaining the (possible) growth effect of democracy can be derived from the literature on the economic consequences of ethnic conflict. It has been shown that the level of trust is low in an ethnically divided society (Alesina and La Ferrara 2000). Moreover, the lack of co-operative behaviour between diverse ethnic groups, leads to the tragedy of the commons as each group fights to divert common resources to non-productive activities (Mauro 1995)Footnote 21

Table 5 reports the results with the four proxies for government size. We observe that government size appear mostly with statistically significant negative coefficients. When interacted with SDT it has a positive and statistically significant coefficient, meaning that in democratic countries the negative impact of government size is mitigated but remains mostly negative. The remaining proxies keep the statistically negative coefficient, but interaction terms lose economic and statistical relevance. For the OECD sample the individual effects of the different proxies of government size are similar but interaction terms are never statistically significant. Emerging market and low income countries report a statistically negative coefficient on public consumption, total government expenditure and debt-to-GDP ratio, with the latter having a lesser detrimental effect in democratic countries (see Fig. 4 for a graphical summary). All in all, government consumption is the proxy that is more consistently and more detrimental to output growth and the one that should be used for the remainder of the paper.

Table 5 New political system’s measures: system GMM estimation with interaction terms, full sample
Fig. 4
figure 4

Estimated coefficients of the interaction terms of different government size proxies with new political system’s measures, by country group. Note author’s computations

As suggested by Ram (1986) another possible specification is the use of the growth rate of the government size proxy. We also tested this specification to determine its impact on growth across different levels of institutional quality. All variables were retained except G it that is now replaced by dG it /G it together with the corresponding interaction terms (results available upon request). Comparing with our previous results the coefficients of the linear term of government size proxies were positive and statistically significant in two out of five specifications. However, according to Conte and Darrat (1988) Ram’s specification is suitable for testing short-term effects, while the specification used in this paper assesses the effects of government size on the underlying longer-run economic activity.

4.4 Robustness checks

One concern when working with time-series data is the possibility of spurious correlation between the variables of interest (Granger and Newbold 1974). This situation arises when series are not stationary, that is, they contain stochastic trends as it is largely the case with GDP and investment series. Results of first (Im et al. 2003) and second generation (Pesaran 2007) panel integration tests (not shown) suggest that we can accept most conservatively that non-stationarity cannot be ruled out in our dataset.

It seems that the time-series properties of the data play an important role: we suggest that the bias in our models is the result of nonstationary errors, which are introduced into the fixed-effects and GMM equations by the imposition of parameter homogeneity. Hence, careful modelling of short-run dynamics requires a slightly different econometric approach. We assume that (8), or (9), represents the equilibrium which holds in the long-run, but that the dependent variable may deviate from its path in the short-run (due, e.g., to shocks that may be persistent). There are often good reasons to expect the long-run equilibrium relationships between variables to be similar across groups of countries, due e.g., to budget constraints or common technologies (unobserved TFP) influencing them in a similar way. In line with discussions in the empirical growth literature we shall assume that the long-run relationship is composed of a country-specific level and a set of common factors with country-specific factor loadings.

The parameters of (8) and (9) can be obtained alternatively via recent panel data methods. We resort to the Mean Group (MG) estimator (Pesaran and Smith 1995) and the Pooled Mean Group (PMG) estimator, which involves both pooling and averaging (Pesaran et al. 1999). These estimators are appropriate for the analysis of dynamic panels with both large time and cross-section dimensions, and they have the advantage of accommodating both the long-run equilibrium and the possibly heterogeneous dynamic adjustment process.

A second step is to make use of the Common Correlated Effects Mean Group (CCEMG) estimator that accounts for the presence of unobserved common factors whose presence factors is achieved by construction and the estimates are obtained as averages of the individual country-specific estimates (Pesaran 2006). A related and recently developed approach due to Eberhardt and Teal (2010) was termed Augmented Mean Group (AMG) estimator and it accounts for cross-sectional dependence by inclusion of a “common dynamic process”.

The panel analysis that follows is based on this unrestricted error correction ARDL(p,q) representation:

$$ \Delta y_{it} = \phi_{i} y_{it - 1} + \beta '_{i} x_{it - 1} + \sum\limits_{j = 1}^{p - 1} {\lambda_{ij} \Delta y_{it - j} + } \sum\limits_{q = 1}^{q - 1} {\gamma '_{ij} \Delta x_{it - j} } + \mu_{i} + u_{it} ,i = 1,2, \ldots ,N;t = 1,2, \ldots ,T $$
(10)

where y it is a scalar dependent variable, x it is the k × 1 vector of regressors for group i, μ i represents the fixed effects, ϕ i is a scalar coefficient on the lagged dependent variable. β i 's is the k × 1vector of coefficients on explanatory variables, λ ij ’s are scalar coefficients on lagged first-differences of dependent variables, and γ ij ’s are k × 1 coefficient vectors on first-differences of explanatory variables and their lagged values. We assume disturbances u it to be independently distributed across i and t, with zero mean and constant variance. Assuming that ϕ i  < 0 for all i, there exists a long-run relationship between y it and x it defined as:

$$ y_{it} = \theta '_{i} x_{it - 1} + \eta_{it} ,i = 1,2, \ldots ,N;t = 1,2, \ldots ,T $$
(11)

where \( \theta^ \prime_{i} = {{ - \beta^\prime_{i} } \mathord{\left/ {\vphantom {{ - \beta_{i} \prime } {\phi_{i} }}} \right. \kern-0pt} {\phi_{i} }} \) is the k × 1 vector of the long-run coefficients, and η it ’s are stationary with possible non-zero means (including fixed effects). Equation (10) can be rewritten as:

$$ \Delta y_{it} = \phi_{i} \eta_{it - 1} + \sum\limits_{j = 1}^{p - 1} {\lambda_{ij} \Delta y_{it - j} + } \sum\limits_{q = 1}^{q - 1} {\gamma '_{ij} \Delta x_{it - j} } + \mu_{i} + u_{it} ,i = 1,2, \ldots ,N;t = 1,2, \ldots ,T $$
(12)

where η it − 1 is the error correction term given by (5), hence ϕ i is the error correction coefficient measuring the speed of adjustment towards the long-run equilibrium.

Table 6 presents our first set of robustness results, and it includes for each sub-sample both the PMG and MG estimates using Polity as our chosen proxy for institutional quality which enters in linear form together with public consumption (Panel A) and well as in multiplicative form (Panel B).Footnote 22 For the PMG estimates, the OECD sub-group obtains a positive and statistically significant coefficient on institutions and statistically negative coefficients on government size when using the both the PMG and MG estimators. One should expect rich countries to get a negative correlation between government size and output if thought in terms of the Olson’s (1982) mechanism: organized interest groups tend to evolve, and struggle to get advantages for themselves in the form of transfers or legislation, which have a side effect, delaying the regular functioning and growth of economy. The scope for interest group action is likely to be greater in countries with larger governments, where there is increased potential for profits from rent-seeking activities, leading to a greater diversion of resources to unproductive ends (Buchanan 1980). Recently, Bergh and Karlsson (2010) also uncovered a detrimental output effect of larger governments in a panel of rich countries using the Bayesian Average over Classical Estimates approach. For both emerging and low income countries statistical significance of government size is weaker but the institutional proxy is still statistically significant for emerging countries and for low income countries.

Table 6 Estimations allowing for heterogeneous technology parameters but homogeneous factor loadings (with and without interaction terms)

The MG estimator provides consistent estimates of the mean of the long-run coefficients, though these will be inefficient if slope homogeneity holds. Under long-run slope homogeneity, the pooled estimators are consistent and efficient. The hypothesis of homogeneity is tested empirically in all specifications using a Hausman-type test applied to the difference between MG and PMG.Footnote 23 The p value of such a test is also present in Table 6, and only for the OECD the null is rejected, being the MG estimator more efficient, and the long-run slope homogeneity rejected.

An equivalent set of results with the interaction term between public consumption and our institutional proxy is presented in Panel B. It shows that in the case of the OECD the interaction term is negative and statistically significant. However, in the case of emerging and low income countries the interaction term is no longer statistically significant.

Finally, in Table 7 when ones allows for heterogeneous technology parameters and factor loadings, we still get negative and statistically significant coefficients on our government size proxy which is in line with Romero-Avila and Struch (2008) who found a negative a significant effect from government consumption (and transfers) on output. However, in the case of emerging and low income countries such effect is no longer as strong and sometimes even positive (even if not statistically significant).Footnote 24 We compare different econometric specifications for robustness and completeness. Still, for the OECD sample there are negative interaction terms: (1) the negative effect of public consumption on real GDP per capita is stronger at lower levels of institutional quality, and (2) the positive effect of institutional quality on real GDP per capita increases at smaller levels of public consumption.

Table 7 Estimations allowing for heterogeneous technology parameters and factor loadings (with and without interaction terms)

5 Conclusion

We provide new evidence on whether “too much” government is good or bad for economic progress and macroeconomic performance, particularly when associated with differentiated levels of (underlying) institutional quality and alternative political regimes. We conduct an empirical panel exercise taking 140 countries from 1970 to 2010 and employing different proxies for government size and institutional quality. Moreover, we make use of recent panel data techniques that allow for the possibility of heterogeneous dynamic adjustment around the long-run equilibrium relationship as well as heterogeneous unobserved parameters and cross-sectional dependence; we also deal with potentially relevant endogeneity issues.

Our findings can be summarized as follows: (1) generally speaking, our results seem to suggest that bigger governments tend to hamper economic activity and this finding is invariant to the selected government size proxy under scrutiny, country grouping analysed or estimation method employed; (2) institutional quality has a positive impact on the level of real GDP per capita, though results are dependent on the proxy considered; (3) the negative effect of government size on real GDP per capita is stronger at lower levels of institutional quality, and the positive effect of institutional quality on GDP per capita is stronger at smaller levels of government size.

A possible interpretation for our results rests on acknowledging that most countries may already be at the “slippery” side of the Armey curve, where spending is not productive enough and/or government financing is too distortionary. This study aims to contribute to the ongoing debate on the effect of government size on economic activity, and it managed to provide robust additional evidence on the existence of a negative relationship between the two concepts. At least one caveat should be mentioned and this relates to the fact that we do not differentiate between different public spending items. This is difficult at the aggregation level considered and would go beyond the scope of this paper. Nevertheless, Afonso and Jalles (2013a) study the relationship of education, social security and health public spending and the economic business cycle in more detail. Moreover, looking closely at what governments actually do and, equally important, how they finance their production and provision of public goods and services can reveal that different activities have differentiated impact on growth. Finally, the determination of the “optimal level of government” and other issues related to public sector efficiency, though interesting questions per se, were beyond the scope of this paper and remain good candidates for future research.