Introduction

Potential output, i.e. the maximum value of output obtainable by efficiently using the productive factors of an economic system with stable inflation, is a key variable for economic policy. The output gap, the difference between actual output and potential output, identifies both the cyclical fluctuations and the imbalances in the use of the production factors, which have an impact on prices.

Potential output and the output gap are very useful for policy institutions but extremely difficult to tackle empirically. They are unobserved variables, for which statistical offices do not provide figures and which can only be estimated with a very high degree of uncertainty due to a variety of factors. First, the econometric approaches presented in the literature to measure them are many and varied. Moreover, even with the same specification, the choice made by the analyst about the initial conditions and the variances of the stochastic processes can lead to very different measures of potential output (Fioramanti 2015; Fioramanti and Waldmann 2017). Given the approach and the econometric specifications, potential output estimates are also affected by the uncertainty generated by the data. Revisions of official time series and updates of the input macroeconomic forecasts engender further variability in the real-time estimates. As a result, the use of the output gap for economic policy decisions could lead to inappropriate ex post choices (Orphanides and Van Norden 2002). The fiscal policy of the European Union (EU) Member States is affected by the uncertainty surrounding the real-time estimates of potential output and the output gap, as these variables have a significant role in the main EU fiscal rules.

This paper proposes to estimate potential output and the output gap for the Italian economy using multiple models, all estimated by means of the unobserved components approach, as in Zizza (2006), but based on different economic theories. Specifically, we start from simple models that connect the gap with inflation (a bivariate model based on the Phillips curve) and with unemployment (a trivariate model based on the Phillips curve and on the Okun law), possibly corrected to consider structural changes that occurred during Italy’s double-dip recession; a production function is also proposed, characterised by a systemic approach in which the different components are estimated simultaneously, with a multivariate model. Finally, we consider a univariate statistical filter, based on the estimates of the Phillips curve from the bivariate model.

The proposed multi-model approach enables us to take account of a variety of methods proposed in the literature and therefore to provide an economic interpretation of the estimates obtained in the light of different theories. Differently from the multi-model approaches adopted in other studies for the Italian economy (as in Bassanetti et al. 2010) all the models proposed here are estimated with the same econometric approach (the unobserved components of time series). This consistency in the econometrics behind different models is extremely relevant in our context, as it allows for a comparison of the different measures of OG obtained; in this regard, we benefit of the plurality of output gap series obtained in order to build a measure of uncertainty. The econometric specifications of the models are selected taking into consideration the statistical properties and the real-time stability as well as the plausibility with respect to economic theory. This allows us to built a coherent and synthetic measure based on the performance of the five different specifications.

Moreover, the estimates of our models do not depend on a priori constraints or parameters, as in the institutional approach adopted by the European Commission (described in “The European Commission Model” section).

The analysis shows that the bivariate model for GDP and inflation is reliable from a statistical point of view and is consistent with the interpretation of the potential output as the level of GDP that does not generate inflationary pressures. The trivariate model also enables the identification of NAIRU, but is less stable when the information set is updated. The multivariate model, based on the production function, also takes account of the degree of capacity utilisation. This latter is the most suitable for the economic interpretation, but appears less stable than the bivariate model. Finally, the univariate statistical filter of GDP is an alternative to the simple statistical filters, such as the HP, in which the parameters are normally defined independently of the properties of the specific time series under examination.

The paper is organised as follows: after a synthetic review of the most relevant literature, section three outlines the use of the estimates of potential output for the PBO’s institutional purposes. Sections four, five, and six describe the models in detail, while section seven presents an assessment of the models along with some diagnostics. Finally, section eight compares our range of output gap estimates with the measures recently published by other organisations.

Literature Review

The literature on potential output and the output gap is huge, and the models used to extract these unobservable variables refer to different econometric approaches. One can use purely statistical models, which do not incorporate strong hypotheses about economic relations, but are rather based on the properties of time series, as in the case of the Hodrick and Prescott (1997) (HP) filter. An extension of this statistical approach relies on unobservable component models (Zizza 2006). Alternatively, potential output can be estimated on the basis of economic relationships, as in the case of methods based on the production function or on structural models that consider short-term frictions (Parigi et al. 2001; Vetlov et al. 2011). The approaches can be combined, giving rise to hybrid models, in which for example the production function is estimated by jointly extracting the unobserved components of the time series (Proietti et al. 2007; ECB 2018). Another line of analysis, although so far less explored in the literature, relies on large databases. For example, Szörfi and Töth (2018) exploit information drawn from economic surveys on capacity utilisation, Fantino (2018) produces estimates using firm-level data, and Murray (2014) uses principal component analysis to summarise a large number of indicators. Even when the same econometric approach is adopted, several possible combinations are available in the specification of the equations, which are equally acceptable from the statistical properties, however, leading to different estimates of potential output (Jarociński and Lenza 2018; Frale and De Nardis 2018).

The European Commission’s (EC) estimates of the output gap, which are based on the production function according to the Common Agreed Methodology (CAM), often differ from those of other international organisations such as, for example, the International Monetary Fund (IMF) or the Organization for Economic Cooperation and Development (OECD), which nevertheless use similar approaches. Compared with these, the EC estimates tend to be too procyclical and unstable in real time. In the case of Italy, the Ministry of the Economy and Finance (MEF) has several times observed that the estimates of the EC are not always interpretable within the framework of economic theory and are also highly influenced by the choice of parameters for the initialisation of estimates (as documented in the 2018 Economic and Financial Document (EFD)).

Fiscal councils invest in the analysis of potential output and the output gap for various reasons. The quantification of potential growth is useful for analysing the long-term sustainability of the public debt. In the medium term, moreover, the estimation of the output gap is necessary to assess the fiscal stance of fiscal policy, as well as inflation forecasts. Finally, in the short term, the estimation of the output gap identifies the cyclical position of an economy.

In the United States the Congressional Budget Office (CBO) recently documented the evolution of its model (Shackleton 2018), which is based on the production function. In Europe, research in this area has being conducted by various organisations (Casey 2018; Cuerpo et al. 2018; EUIFIs 2018).

In Italy, the Parliamentary Budget Office (PBO) has developed research into potential output already in the past (Fioramanti et al. 2015; Frale and De Nardis 2018).

The Methodology of the PBO

The estimate of potential output plays a central role in European fiscal governance, as it is used to calculate the output gap and therefore the structural balance of the Member States. Starting from January 2015, the European Commission has also used the output gap estimates to characterise the cyclical position of the Member States and therefore to determine the size of the required structural adjustment for individual countries.Footnote 1 Potential output is also used to calculate the expenditure benchmark and to verify the compliance with the debt rule.

The MEF adopts the methodology agreed among the European Commission and the Member States (CAM), which is based on the production function (the model is briefly described in “The European Commission Model” section). The CAM has been criticised by many, for both the lack of transparency of the initialisation of the estimates and the excessive procyclicality of the trend components, in particular the NAWRU, which in the case of Italy is determined on the basis of a model with low statistical fit.Footnote 2 From an econometric point of view, the CAM approach is not systematic because it is a patchwork of different approaches and methods.Footnote 3 Over the years, all of these criticisms have led to a lively discussion between national and European authorities, as well as among analysts.

The institutional duties of the PBO include the endorsement of macroeconomic forecasts and the assessment of the government’s public finance forecasts. The process is governed by a memorandum of understanding with the MEF which, in order to ensure the completeness of the information, shall also provide the PBO with the variables necessary to calculate potential output and the output gap. The PBO should be able to reproduce the European Commission’s estimates in order to assess the MEF’s fiscal policy; in view of the considerable uncertainty surrounding the quantification of the output gap, which in real time differs between the MEF and the EC despite the use of the same model, the PBO has also an interest in using alternative tools and measures of the output gap.Footnote 4

The PBO has conducted studies of potential output and the output gap for the Italian economy since its establishment and has recently focussed on the development of alternative models to the CAM. No dominant model has emerged from these analyses, as different models capture different statistical or economic relationships. It was therefore decided to adopt a multi-model approach, as in the case of the Bank of Italy (Bassanetti et al. 2010). Following this approach, in addition to obtaining a point estimate of potential output and the output gap, a plausibility interval can be generated to provide a measure of the uncertainty of the estimate.

Following the analysis, we selected the five models listed below:

  1. 1.

    a bivariate model for output and inflation;

  2. 2.

    a bivariate model for output and inflation with a cyclical shock in 2009;

  3. 3.

    a trivariate model for output, inflation and the unemployment rate;

  4. 4.

    a multivariate model using the production function approach but with an integrated system to estimate the various components;

  5. 5.

    a univariate statistical filter, the parameters of which are calibrated on the basis of the estimates from the bivariate model in point 1.

Model 1 considers a bivariate relationship between output and inflation, where GDP is decomposed into a trend and a cyclical component, and this latter enters a Phillips curve defined on the GDP deflator. Inflation also depends on the expectations of agents and exogenous foreign variables, such as oil price and the exchange rate. This model represents the minimum structure compatible with the standard definition of the output gap, i.e. the maximum value of GDP that does not cause prices acceleration.

Model 2 is an extension of the previous approach, taking into account the anomalous cyclical conditions of the last decade, inserting an intervention variable in 2009 to foster the fit of the equation on the economic cycle with the historical data.

The two models mentioned above do not allow to consider a critical element in estimating potential output, that is labour market. Therefore, model 1 has been extended to include an equation linking the employment gap to the output gap. In this specification (model 3), unemployment is decomposed into the trend component, namely NAIRU, and in the cyclical component, which is linked to the output gap with a lag of one period. This approach considers the Phillips curve as in the case of the bivariate model and the Okun (1962) law, although the relationship between the output and unemployment gaps is not strictly proportional, but is expressed by means of a coefficient estimated endogenously by the model.

An integrated view of economic phenomena is provided by the multivariate model (model 4), which frames the problem of estimating potential output within the context of the production function. The approach is characterised by multivariate econometric estimates for the factors of production, while in the EC model the NAWRU and trend TFP are estimated separately; GDP is reconstructed on the basis of the factors of production, namely labour, capital and the Solow residual, and the associated trend and cyclical components are then identified. The output gap is obtained as a combination of the gaps of the different series; a capacity utilisation indicator (CUBS) is also considered, calculated using Istat business confidence surveys. This model enables a broader economic interpretation of the phenomena underlying the potential growth of an economy, at the cost of the greater complexity of the structure.

Finally, potential output was estimated with model 5, a statistical filter, similar to the HP filter, characterised by two interesting features. On the one hand, it is calibrated to match the properties of the cycle estimated with the bivariate model 1; on the other hand, it may also be applied to each component of the production function equation in order to derive specific trends that are consistent with the filter GDP series.

In the empirical application for Italy, we have used yearly data from 1970 to 2019 provided by Eurostat and reconstructed for the historical part (before 1997) in the AMECO dataset by the European Commission. For the forward-looking interval, we use the forecast made by PBO for the validation of the Economic and Financial Document of 2019. The choice of annual data versus more disaggregated information (e.g. quarterly data) is bound by the availability of sufficiently long series to disentangle temporary versus structural components. As for the exogenous variables, oil price is collected from the public dataset of the Federal Reserve Bank of St. Louis, while inflation expectations are drawn from the Bank of Italy’s survey of inflation and growth expectations. In the multivariate model, the CUBS indicator computed by the European Commission on the basis of the Business and Tendency survey is also included (see for more details Havik et al. 2014).

Estimating Potential Output Using Unobserved Components Models

Unobserved components models for the time series are a very general but parsimonious framework for decomposing a variable into trend and cycle, on the basis of stochastic components that reproduce its features (for a complete discussion, see Harvey 1989). Following this approach, given a time series \(y_{t}\), for example GDP, we consider the decomposition into two underlying processes \(\mu _t\) and \(\psi _t\), such that:

$$\begin{aligned} y_t=\mu _t+\psi _t, \end{aligned}$$

where \(\mu _t\) represents the long-term trend component, while \(\psi _t\) is the transitory, or cyclical, term.

A very general formalisation of the trend \(\mu _t\) is given by the local linear trend model:

$$\begin{aligned} \begin{array}{ccccccccc} \mu _{t} &{} = &{} \mu _{t-1}&{}+&{} \beta _{t-1} &{}+ &{}\eta _t,\, &{} \eta _t\sim \text{ IID } N(0,\sigma _{\eta }^2), &{} \\ \beta _{t} &{} = &{} &{} &{} \beta _{t-1} &{}+ &{}\zeta _t, \,&{} \zeta _t\sim \text{ IID } N(0,\sigma _{\zeta }^2), &{} \end{array} \end{aligned}$$
(1)

where it is usually assumed that \(\eta _t\) and \(\zeta _t\) are independent causal errors.

The model encompasses a number of cases of special interest for the time series:

  • with \(\sigma ^2_\zeta =0\) the trend is a random walk with constant drift \(\beta _t=\beta _0\). If we also set \(\beta _0 = 0\) the process is a random walk without drift: \(\mu _t = \mu _{t-1}+\eta _t\).

  • if \(\sigma ^2_\eta =\sigma ^2_\zeta =0\) the trend is deterministically linear: \(\mu _t = \mu _0 + \beta _0 t\).

  • if \(\sigma ^2_\eta =0\) the trend is an integrated random walk, \(\Delta ^2 \mu _t = \zeta _{t-1}\). The restriction \(\sigma ^2_\eta =0\) is sometimes called a smoothness prior.

Within the analysis of economic series, the cyclical component \(\psi _t\) is generally represented with a second-order stationary autoregressive process:

$$\begin{aligned} \psi _t = \phi _1 \psi _{t-1}+\phi _2\psi _{t-2}+\kappa _t, \;\; \kappa _t \sim \text{ IID } N(0,\sigma _\kappa ^2). \end{aligned}$$
(2)

The coefficients \(\phi _1\) and \(\phi _2\) can be reparametrised as follows:

$$\begin{aligned} \phi _1 = 2 \rho \cos \lambda _c, \;\;\;\phi _2 = - \rho ^2, \end{aligned}$$
(3)

where the parameter \(\lambda _c\) is usually interpreted as cyclical frequency with a period equal to \(\tau _c = \frac{2\pi }{\lambda _c}\) and \(\rho\) is the damping factor for the cycle.

This specification has a long tradition in the analysis of time series (beginning with Yule 1927), thanks to its flexibility in modelling cyclical phenomena. This formulation has taken on a leading role in the output gap literature, beginning with Clark (1989).

Unobserved components models can be estimated either by maximum likelihood or by Bayesian techniques. In this paper, we adopt the former approach, but without imposing constraints on the variances of the stochastic processes as done in the CAM.

Bivariate Model for Output and Inflation

In the early 1990s, Kuttner (1994) proposed a model for the US economy based on the bivariate relationship between economic activity and prices. The output gap entered the inflation equation through a Phillips curve and potential output was defined as the level of output consistent with stable prices. The empirical literature has recently highlighted the importance of inflation expectations. Mavroeidis et al. (2014) documented its role in a neo-Keynesian context, which varies depending on the specification of the model and the estimation method; other contributions have been made by Ball and Mazumder (2011), Blanchard (2016) and Coibion and Gorodnichenko (2015). The critical aspects of estimating the Phillips curve for the purpose of analysing Italy’s potential output were examined by Fioramanti and Waldmann (2017), in particular for the accelerationist version of the curve. A central element in the estimation of the Phillips curve also concerns the choice of the price variable.

Fig. 1
figure 1

The confidence bands were constructed by adding and subtracting twice the conditional standard deviation of the component. The lower left-hand panel shows the difference between inflation and expectations and the exogenous variables (\(\pi _t-{\tilde{\gamma }}_e \pi _t^e-\sum _k {\tilde{\beta }}_k x_{kt}\)) along with the contribution of the output gap to inflation

Bivariate model for GDP (in log) and inflation (GDP deflator).

Our investigations for Italy seem to favour a price measure based on the GDP deflator, rather than wage inflation, in terms of stability of the results and coherence with the economic theory. Starting from these considerations, we define a bivariate model for GDP (\(y_t\)) and its deflator, as in the “triangle model” presented by Gordon (1997), extended by the inclusion of inflation expectations.Footnote 5 The real GDP series is decomposed into a trend and a cycle component, which corresponds to the output gap. The transitory component follows a second-order autoregressive process, while the trend is represented by a local linear trend model.Footnote 6 The Phillips curve includes an inertial component, inflation expectations and the output gap. The gap enters in both contemporaneous and lagged term, to capture its dynamics in addition to the level effect. The specification is completed by a number of exogenous variables, such as the terms of trade and the oil price. The accelerationist version is nested in this representation (for more details, see the extended formalisation presented in “The Bivariate Model with Output and Inflation” section).

Figure 1 presents the estimates of potential output, its growth rate and the output gap for the bivariate model presented in this section. It also shows the Phillips curve, which seems to fit the historical data well.

The bivariate model has a number of important properties in terms of the stability of the results (which is discussed in “Assessment of the Models” section), statistical significance, parsimony and economic interpretability (as it incorporates the theoretical relationship between output and prices). However, it is a simple model that does not consider the labour market and which is affected by the sharp recessions that have characterised the Italian economy over the last decade. The following sections analyse a number of possible extensions, both to improve the statistical fit of the model and to take into account labour market variables.

Bivariate Model with Cyclical Shocks

The residuals of the bivariate model, presented in the previous section, show an outlier in 2009, in coincidence with the global financial crisis. This induced us to assess the possibility of inserting shocks in the specification, in order to take into account changes in the properties of the series induced by the recession.

Compared with the basic model, a number of special cases have been considered to incorporate the effects of the double dip recession experienced by the Italian economy in the last decade: (i) a change in the level of potential output; (ii) a structural change in the growth rate of potential output; (iii) a shock in the equation for the cycle. We adopted an intervention variable, which eliminates the potential output outlier in 2009 (for more details, see “The Bivariate Model with Output and Inflation” section). The estimated model produces clear evidence of a break in 2009 in all three cases; however, there is no clear difference in terms of statistical significance between the three specifications of the shock. The Great Recession could therefore have produced structural changes in the series, although it is difficult to discriminate between changes in levels from those in the cycle.

In any case, it is plausible that a severe and prolonged economic crisis, such as that of the last decade, can be considered anomalous compared with the characteristics of a standard economic cycle. Ultimately, the bivariate model with the cyclical shock was chosen. From the comparison of Figs. 1 and 2, we see that the model modified with the cyclical shock produces a widening of the gap during the crisis and a more gradual reduction during the recovery.

Fig. 2
figure 2

The confidence bands were constructed by adding and subtracting twice the conditional standard deviation of the component. The lower left-hand panel shows the difference between inflation and expectations and the exogenous variables (\(\pi _t-{\tilde{\gamma }}_e \pi _t^e-\sum _k {\tilde{\beta }}_k x_{kt}\)) along with the contribution of the output gap to inflation

Bivariate model with cyclical shocks for GDP (in log) and inflation (GDP deflator).

Trivariate Model for Output, Inflation and Unemployment

Alternatively, potential output can be obtained starting from the empirical relationship of direct proportionality between the output gap and the unemployment gap, the so called Okun’s low. The existence and stability of this relationship has been repeatedly examined, with mixed results. Ball et al. (2017) consider the robustness of the law fifty years after its formulation and conclude that it appears strong and stable in most developed economies, albeit with differences between countries.

The bivariate model of the previous section can be extended to include a relationship that links the unemployment gap to the output gap. Okun’s law can be specified in two ways: the relationship between the gaps can be strictly proportional, with the output gap generating the common cycle in unemployment, or the unemployment gap can be decomposed into a component proportional to the output gap and a specific one.

Fig. 3
figure 3

Trivariate model for GDP (in log), inflation and unemployment

The empirical evidence suggests that the relationship between the employment gap and output gap is significant for Italy, although with a lag. The estimate of the trivariate model, presented in detail in “The Trivariate Model with Output, Inflation and Unemployment” section, produces the series shown in Fig. 3. The NAIRU obtained is not a deterministic trend, although it appears indistinguishable from it in statistical terms.

Multivariate Model Based on the Production Function

The production function approach is the most widely used among international institutions to estimate potential output.Footnote 7

The method assumes a neoclassical Cobb-Douglas production function, characterised by constant returns to scale, where aggregate output, Y, is a combination of total hours worked, L, and the capital stock, K:

$$\begin{aligned} Y = A L^\alpha K^{1-\alpha }, \end{aligned}$$
(4)

where A is total factor productivity (TFP) and \(\alpha\) is the elasticity of output to labour.Footnote 8 We decompose output (Y) into potential output (\(Y_{P}\)) and the output gap (\(Y_{G}\)),

(\(Y_{G}\)),

$$\begin{aligned} Y = Y_{P}\times (1+ Y_{G}), \end{aligned}$$

where designating the potential level of the variables with the subscript P, we have

$$\begin{aligned} \begin{array}{lll } Y_{P}= & {} A_{P} \times L_{P}^\alpha \times K^{1-\alpha }. \end{array} \end{aligned}$$
(5)

The European Commission estimates the different components of Eq. (5) separately, while in this paper a multivariate model is used, which obtains all of them simultaneously. There have been few attempts to do so: Proietti et al. (2007) considers different specifications of a multivariate system for (5) and recently the European Central Bank has developed a similar model (ECB 2018). The model proposed here assumes that the trend components of the various variables are orthogonal to each other, while the cycles are all correlated. If \(h_t\) denotes total hours worked, \(e_t\) the employment rateFootnote 9 , \(a_t\) the participation rate and \(p_t\) the working age population, we have the following relationship (in logarithms):

$$\begin{aligned} l_t = h_t+a_t+e_t+p_t. \end{aligned}$$

In addition, if \(k_t\) denotes the capital stock,

$$\begin{aligned} \begin{array}{lll} y_t &{}= &{} f_t+\alpha l_t + (1-\alpha ) k_t \\ &{}= &{} \mu _t+\psi _t \\ \mu _t &{}=&{} \mu _{ft} + \alpha (\mu _{ht} + \mu _{at}+\mu _{et} +p_t) +(1-\alpha ) k_t\\ \psi _t &{}=&{} \psi _{ft} + \alpha (\psi _{ht} + \psi _{at}+\psi _{et}). \\ \end{array} \end{aligned}$$
(6)

The first equation decomposes GDP using the production function approach. Potential output \(\mu _t\) is the linear combination of the underlying components TFP, labour and capital, while the output gap is the combination of TFP gap and hours worked (decomposed into the contribution of the participation rate, the employment rate and hours worked per capita). Population contributes exclusively in defining the potential output of the economy. To extract the cyclical component of TFP, we consider the composite indicator of capacity utilisation (CUBS) calculated by the European Commission (see “The European Commission Model” section).

Fig. 4
figure 4

Multivariate model—Trend components (in log) and the output gap

Figure 4 shows the trend extracted using the multivariate model, while Fig. 5 presents the gaps of all the element of the production function. The cyclical coherence of the extracted components is rather low, but increases in conjunction with the Great Recession.

Fig. 5
figure 5

Multivariate model. Gaps of the components and output (percentage rate of change)

The model generates estimates of the output gap for recent years that are larger than those produced with the other models. All the gaps (Fig. 5) tend to close, albeit at different speeds. However, the population and the capital stock contribute to the estimates of potential output, which have declined in the last few years and are not incorporated in the other models.

Statistical Model Based on a Univariate Filter

An alternative approach to extracting the long-term components of (4) consists in the application of a univariate filter to the individual time series.Footnote 10 A popular approach has been the use of the HP filter, which is also applied by the CAM for some variables of the production function (hours worked and participation rate). Although simple and easy to implement, the filter suffers from distortion at the end of the sample (end-point bias), as a result of which real-time estimates are unstable.

Fig. 6
figure 6

Statistical filter based on the bivariate model—Trend components (in log) and the output gap

In this paper, we use an ad hoc filterFootnote 11 derived from the optimal filter for estimating potential output and the gap in the bivariate model for output and inflation considered in subsection. The filter reflects the statistical properties of the GDP series, in terms of the decomposition of the series into cyclical and trend components and implicitly incorporates the concept of the potential output of the economy as the level at which prices remain stable. The potential output series is obtained by applying the smoother associated to the model of Eq. (1) to the GDP series, imposing the following restrictions derived from the parameters of the bivariate model (described in "Bivariate Model for Output and Inflation” section):

$$\begin{aligned} \sigma ^2_\eta = 0, \sigma ^2_\zeta = 0.0168\sigma ^2_\kappa , \rho = 0.56, \lambda _c = 0.04, \end{aligned}$$

where \(\rho\) is the damping factor of the cyclical wave and \(\lambda _c\) is the angular frequency of the cycle expressed in radians.

The choice of the bivariate model as a reference to constrain the filter parameters was prompted by the fact that it produces more stable estimates than those obtained with the trivariate model (see “Assessment of the Models” section). This stability is reflected by the statistical filter. Overall, the filter is justified by the need to achieve a reduction in volatility, optimised with reference to a Phillips curve model.Footnote 12

The same filter can be applied to the single components of Eq. (4) in order to obtain their trends, as shown in Fig. 6, coherently with the direct estimate of trend GDP. It emerges that the behaviour of hours worked is procyclical at the end of the sample, while the participation rate is less so.

Assessment of the Models

In order to better compare the results of the five models, we present a visual inspection of the different estimates.

Fig. 7
figure 7

GDP and potential output in the five models (percentage rate of change)

Fig. 8
figure 8

Potential output in the various models of the PBO and the European Commission (percentage rate of change)

In Fig. 7, the estimates of potential output growth since 1970 display a clear downward trend, consistent with the lower growth rates recorded for GDP. This underlying trend is common to all models, but there are differences that depend both on the information set and on the different econometric specification, as shown in Fig. 8 for a shorter time spam.

Comparing the two bivariate models, we see that the cyclical shock, marked by the start of the global financial crisis, produces faster growth of the potential output in expansionary phases; the intervention variable explains part of the decline in output in the first crisis, but subsequently reflects a decrease in capacity. The simple bivariate model produces less volatile estimates of potential output, thus with slower growth prior to the crisis, a negligible loss of potential output during the recession, but slower growth subsequently.

The potential growth estimated using the trivariate model appears even less procyclical than that obtained with model 1. The breakdown of the trivariate model is such that part of the contraction in GDP, attributed to the cyclical component, explains the gap in unemployment through Okun’s law. The negative relationship that links the two cycles implies a smaller NAWRU and therefore a larger potential labour factor in recessions. In the recent years, a large component of unemployment is considered cyclical, so values of the output gap tend to be lower than those in the bivariate model.

In the multivariate model, economic information is mainly connected with the Phillips curve, while for TFP, employment, hours worked and CUBS the decomposition is performed with a statistical cycle-trend model, albeit using a simultaneous estimate. The developments in potential output are more volatile than those obtained with the other specifications, since this model tends to estimate flat trend components. After 2015 the growth in potential output is low, mainly due to increases in the measure of capacity utilisation (CUBS).

Finally, potential output estimated with our filter lies between that estimated with the bivariate model and that under the trivariate model. From the comparison with the HP filter, however, it is evident that the potential output estimated with the models presented here is much less procyclical. This property also improves the stability of the models compared with the preliminary data, as documented in “Real Time Stability” section.

Real Time Stability

As mentioned earlier, policy-makers need to evaluate and take account of the magnitude of revisions, whenever a new observation enters the sample. For this purpose, it is possible to evaluate the characteristic of the proposed models in real time, i.e. considering the information sets that have become available over time. Macroeconomic indicators are regularly revised by the national statistical institutes in order to incorporate new information, so that time series are often modified between one publication and the next (generating the so-called data vintages), sometimes considerably. This phenomenon is very significant in quarterly series, but it also has an impact on annual data. To account for this, the five models selected here were estimated for ten vintages of GDP and deflator released from October 2014 to September 2018. Data revisions influence the estimates through two channels: (i) the statistical filter is applied to a different time series (with more data but also with the revision of past observations), and (ii) the model parameters are re-estimated and since the underlying data partially differ, the estimated parameters may also vary. The first panel of Fig. 9 allows us to evaluate the first effect, as it shows the estimates of the output gap obtained with the statistical filter proposed in section 6, which by construction in the exercise is not subject to variability of the parameters and therefore is only affected by the revision of the input data. As expected, the revisions have a larger impact on the final part of the sample. The other panels of the figure show the joint effect of the variability of data and parameters on the estimates.

Figure 9 shows that the two bivariate models are very stable, even the one with the shock. The series for the unemployment rate, on the other hand, appears difficult to be decomposed into a cyclical and trend component and therefore induces greater instability in the output gap in the models including it, i.e. the trivariate and the multivariate models. This latter appears to be the less stable, in particular in the last decade, a period in which the volatility of the economic cycle has grown. The complexity of the specification generally makes the estimations more difficult, presumably due to a basically flat likelihood.

Fig. 9
figure 9

Estimate of the real-time output gap

Inflation Forecasting Ability

All models examined here imply an economic relationship, such as the Phillips curve or Okun’s law. A criterion for evaluating them can therefore be based on the models’ ability to reproduce a stylised representation of the underlying theory and to project it forward. Since all the models consider, more or less explicitly, the Phillips curve, a simple exercise was performed to evaluate their capacity to forecast inflation. The sample was divided into two parts: the estimation interval, in which the model parameters were estimated, and the evaluation interval, for which the forecasts with the various models are compared. The exercise was repeated by recursively moving forward the estimation interval, by one observation at a time, in the range 2012–2018. Using annual data, the number of observations is necessarily small, so the analysis provides a qualitative assessment.

Table 1 shows different measures of prediction accuracy of the proposed models, together with those of a simple benchmark represented by a random walk process. The latter, despite its simplicity, is often a good predictor of inflation measures.

To assess the accuracy of the forecasts, a number of diagnostics are reported. Denoting the one-step ahead inflation forecast with \({\hat{y}}_{t+1/t}\) and the actual value with \(y_{t+1}\), Table 1 gives: the mean error (ME) (based on \({\hat{y}}_{t+1/t}-y_{t+1}\)), which indicates the distortion of the estimate, i.e. if there is a tendency to overestimate or underestimate the variable; the mean absolute error (MAE) which deals with cases of overestimation and underestimation in an analogous way \(\mid {\hat{y}}_{t+1/t}-y_{t+1} \mid\); the root mean squared forecast error (RMSFE) based on the mean of \(({\hat{y}}_{t+1/t}-y_{t+1} )^2\), which tends to emphasise large errors.

With the exception of the trivariate, all models generate better forecasts than the benchmark. The best model is the multivariate, especially after 2014. All the forecasts appear distorted (ME), in most cases producing underestimates except for the multivariate model. The errors tend to be of a similar size (as deduced by the proximity of MAE and RMSFE) without dominant observations. A more extensive assessment of the predictive power of the models would require the application of a statistical test, as the differences between the errors could derive from random factors. However, the small number of observations available makes these exercises unfeasible.

Table 1 Inflation forecasting error for the 2012–2018 sample (1).

Comparison with the Output Gap Estimates of Other Organisations

The PBO has developed five models, which together with the method agreed at the EU level and the HP filter constitute a set of useful tools for the evaluation of the output gap made by the MEF.

From the operational point of view, the models of potential output incorporate the results of other quantitative instruments of the PBO. As part of the endorsement process, the PBO first develops medium-term macroeconomic forecasts using the econometric model MeMo-It, taking into account the information coming from the nowcasting models. Subsequently, the forecasts for growth, inflation and the labour market are incorporated in the procedures for estimating potential output and the output gap.

The five proposed models are also combined to quantify the uncertainty that characterises the individual measurements obtained. Using the method adopted to endorse the macroeconomic scenario, it is possible to obtain a range of variation defined by the maximum and minimum values, as well as the median.Footnote 13

In addition, the results of the five models could be combined by using suitable pooling weights. These are not immediately available, as the models use a different information set and have different complexity. In principle, the weights should be proportional to the precision in the estimation of the output gap, but the latter is unobserved and thus some alternative validation strategy is needed. One possibility is to use the accuracy by which GDP is predicted one year ahead or, perhaps more consistently, to use weights proportional to the reciprocal of the mean square prediction errors of inflation (as reported in Table 1).

Figure 10 compares the output gap values obtained in the spring of 2019 by the major international institutions and the MEF with the PBO’s median, maximum and minimum, along with the proposed pooled estimates based on the predictive performance concerning inflation. The underlying macroeconomic scenario considered by different institutions is similar; therefore, the divergences in the OG are mainly due to the different models of the potential output. Compared with the time series of the European Commission, the median of the PBO model estimates does not differ considerably, the turning points are basically the same, but our output gap tends to be more volatile, as the potential output is less procyclical. However, the values estimated by the European Commission are almost always within the interval between the maximum and minimum values produced by the PBO models. The pooled measure appears quite similar to OECD estimate for the historical part, while it approaches the values proposed by the EC in the most recent years. As for the MEF’s estimates in the last EFD, which are publicly available for a shorter period, the values lie within or close to the confidence interval up to 2019, while starting from the following year they appear to move apart.

Finally, compared with the output gaps estimated by the OECD and the IMF, the past estimates are often within the range produced by the PBO models, at least from the early 2000s (previously the fluctuation bands are very narrow). From the forward-looking perspective, the OECD’s profile is more similar to that of the MEF, while IMF’s estimates are close to those of the median of the PBO models and the pooled series almost reaches the maximum.

Conclusions

Potential output and the output gap are central to economic analysis and policy. However, their measurement suffers from various sources of uncertainty, ranging from the specification of the model to data availability, as it is well documented in the literature. The difficulties are compounded in the Italian case, for which the long-run decline in the underlying growth rate of the economy and two consecutive recessions of unprecedented depth concurred to make rather difficult to disentangle the permanent dynamics from the transitory ones in both inputs and output.

This paper has proposed an eclectic approach to the estimation of potential output and the output gap. We considered alternative and complementary estimates that incorporate key economic relationships, such as the expectations-augmented Phillips curve, relating the level and change in inflation to the output gap, and multivariate specifications implementing the production function approach. Finally, we proposed an ad hoc filter, derived from a the optimal filter for GDP that distills the trend level of output that does not induce inflationary pressures, as an alternative to traditional filtering methods based on the Hodrick-Prescott filter.

While we cannot deny the intrinsic uncertainty of the measurement, the merits of our approach are the following. The use of multiple models enables an economic interpretation of the results in the light of different theories, and is also useful for assessing the variability of the estimates. The econometric specifications adopted are parsimonious and the estimation techniques do not require the imposition of particular restrictions on the parameters by the analyst, nor arbitrary prior assumptions on their distribution.

Interestingly, the estimates produced by the models are characterised by low procyclicality, which also explains their stability with respect to the preliminary data; the estimates are also consistent with the economic theory, as they improve the accuracy of the inflation forecasts. Estimates and forecasts of the output gap recently produced by other organisations tend to fall within the confidence interval calculated on the basis of the models presented here. Finally, the estimates can be combined so as to produce a consensus estimate using pooling weights that reflect the predictive ability with respect to inflation.

Fig. 10
figure 10

Estimates of the output gap—Spring 2019