1 Introduction

Business and Consumer Surveys (BCS) are a unique way of extracting empirical data on managers’ and consumers’ views on relevant variables from their economic environment. In 2011, BCS celebrated their 50th jubilee in the EU (European Commission 2014). Accordingly, over the last decades they have become an integral part of macroeconomic modelling. They are widely employed in empirical studies of two main sorts. Their first role is to serve as a data source for quantifying the prevailing business climate in particular branches of the national economy (Gayer 2005) or to get estimates of otherwise “intangible” factors such as expectations or perceptions (see, e.g., Antonides 2008). On the other hand, BCS are also utilized to construct composite leading indicators.

The role of leading indicators is especially accentuated in times of economic downfall (such as the recent global crisis). Namely, it is well established in the literature that standard macroeconomic forecasting models are of limited value due to the fact that they do not account for psychological factors such as the economic sentiment. The researchers are united in the conclusion that the intensity and longevity of this crisis was stirred up by the drastic downfall of economic sentiment (Kindleberger and Aliber 2011).Footnote 1

An efficient leading indicator would be a variable capable of predicting the targeted macroeconomic series several months/quarters in advance. Even a slightly lagging indicator might be very useful for the economic policy holders because of the publication lags in the data release calendar. This can be easily verified on the example of the European economic sentiment indicator (ESI). Namely, ESI is regularly published in the last week of each month, whereas quarterly GDP figures are published with a considerable time lag. For example, the September 2014 ESI was published on September 29, while the corresponding 2014 Q3 GDP figures were published by Eurostat as late as December 5, 2014.

Since the sole beginning of conducting the Joint Harmonised EU Programme of Business and Consumer Surveys in 1961, the methodology of constructing official European composite indicators has not altered much. The European Commission (EC) indeed does have firm arguments in favour of methodological harmonization: it enables long time spans of data at the national level (which would be impossible in case of frequent methodological changes and structural breaks), as well as it ensures data comparability among the Member States.

In calculating ESI, the EC employs data gathered from five distinct BCS sectors: the industrial sector, retail trade, the services, consumer sector, and construction. In order to obtain ESI, the EC weights individual sector data according to their relative share in the national economy. However, the chosen weights are not continuously altered to reflect the underlying changes in the economic system (e.g., due to the recent crisis, some other extreme event, or simply due to long-term structural economic shifts). Consequentially, the predictive accuracy of ESI has been brought into question recently (Gelper and Croux 2010). This obviously calls upon a reappraisal of ESI’s methodological foundations, which is precisely the issue that this paper aims to tackle.

Therefore, this paper analyzes standard ESI components for 26 individual EU Member States (Luxembourg and Ireland are not considered because of data unavailability), as well as for the aggregate EU and Euro area (EA). Using nonlinear optimization with constraints, a new weighting scheme is proposed for each of the observed countries. The novel weights are proposed using two separate methods. Firstly, GDP forecasting equations are estimated by OLS method using ESI as the predictor variable for various lead lengths (up to 12 months).Footnote 2 The weights are then chosen by minimizing the root mean squared error from the estimated equations. For the purpose of a robustness check, the same empirical exercise is then repeated by maximizing the correlation coefficient between ESI and GDP growth rates for up to 12-months lead lengths. Both employed estimation methods significantly enhance ESI’s forecasting accuracy, in some cases by as much as \(35\,\%\).

The paper is organized as follows. Section 2 offers a brief review of the most prominent ESI empirical studies. Section 3 explains the employed methodological framework, while Sect. 4 presents the obtained results. The concluding section offers clear policy implications and recommendations for future work on the topic.

2 Literature review

Existing empirical studies on economic sentiment mostly focus on ESI’s predictive characteristics with regard to targeted macroeconomic variables. As an example, consider one of the most influential studies of that sort. Gayer (2005) estimates several bivariate VAR models on aggregate euro area data. Each of the models comprises GDP growth and one of the BCS sectoral leading indicators (in retail trade, industry, the consumer sector, construction, and services) or the EC’s composite indicators (ESI and the Business Climate Indicator). Standard Granger causality tests point to accentuated predictive characteristics of BCS indicators. However, VAR-based out-of-sample GDP forecasts reveal a much more informative view of the issue. The obtained results firmly suggest that BCS indicators can be used as merely short-term predictors of GDP (one or two quarters in advance). Out of the observed indicators, ESI provides the largest added value in comparison to a benchmark AR(1) GDP model.

A similar study is done by Van Aarle and Kappler (2012). They also focus on the interrelationship between ESI and overall macroeconomic performance, but they expand the Euro area analysis by also modeling US data. Conventional tools within the VAR methodology (impulse response functions and variance decompositions) suggest that ESI shocks indeed positively feed into Euro area retail trade and industrial production, while its relationship with unemployment is negative. A comparable case is also shown for the US data. The only exception is that the European ESI is much more short-term than the US indicator (3 monthly lags vs. six lags in the US case).

It is worthwhile mentioning two papers that specifically compare BCS leading indicators’ quality in Old (OMS) and New EU member states (NMS). The first is Silgoner (2007), who examines the predictive content of ESI, its industrial subcomponent (industrial confidence indicator), and the BCS question focusing on industrial production expectations with regard to EU industrial production. It is found that all three measures Granger-cause industrial production. However, other obtained results seem quite contradictory: ESI is found to be a lagging (not a leading) indicator, while its forecasting performance is easily beaten by a simple autoregressive model. Out of the three competing measures, the production expectations balance of responses seems to be the best industrial production predictor. Silgoner (2007) then moves to the estimation of two separate panel regressions for OMS and NMS. It is found that all three measures of economic sentiment have considerably lower forecasting qualities in NMS then in OMS.

One of the most comprehensive existing studies of European BCS is written by Sorić et al. (2013). The authors utilize five bivariate panel VAR models for OMS and NMS separately, each of them comprising the BCS confidence indicator and its sector-related macroeconomic variable. The examined variables are retail trade volume, construction volume, personal consumption, industrial production (paired with their respective BCS confidence indicators), and GDP (paired with ESI). On the basis of standard Granger causality tests and innovation analysis, it is confirmed that the predictive characteristics of NMS’ BCS indicators (including ESI) are of comparable quality to OMS’. To be more specific, all BCS variables Granger-cause their respective macroeconomic tendencies with a lagging time of 4 quarters. The same conclusion is corroborated for both OMS and NMS. Although the authors utilize these results to state that the European BCS can be called a success story at their 50th jubilee, this does not mean that the predictive accuracy of BCS indicators cannot be improved.

This paper builds upon the study of Gelper and Croux (2010), who (to the best of the authors’ knowledge), are the only ones to provide an alternative weighting scheme for the European ESI. Namely, Gelper and Croux (2010) apply the partial least squares method and dynamic factor modelling to construct a novel ESI indicator. They conduct an analysis of BCS data from 15 EU OMS. Using correlation analysis with respect to the industrial production series, the authors prove that the partial least squares estimator outperforms both the official European ESI and the dynamic factor estimator. However, in terms of out-of-sample forecasting accuracy, the results are not that robust. It is found that (in the vast majority of the observed countries), the two proposed estimators do not offer any significant added value in comparison to the official ESI. Still, it is worth noting that the forecasting accuracy of the two novel estimators improves as the forecast horizon increases.

Summarising the conclusions drawn from the cited references, several points need to be emphasized. First, it is obvious that the issue of alternating the ESI weighting scheme deserves more attention since the existing literature is mostly silent on the topic. This paper aims to provide new insights by applying nonlinear mathematical programming with constraints, a methodology insofar neglected in related studies.

Second, the existing European ESI studies either aggregate the data in a panel framework (Silgoner 2007; or Sorić et al. 2013), or restrict the analysis to OMS (Gelper and Croux 2010). This study improves ESI’s predictive characteristics for as much as 26 individual EU Member States, and for the EU and EA aggregates. Thus, a more in-depth and wide-ranging study is offered.

Third, this paper offers a detailed sensitivity analysis of the “optimal” ESI weights with respect to changing forecast horizons (up to 12 months). Accordingly, it is clearly shown which of the BCS sectors contribute significantly to efficient GDP predictions for shorter, and which for longer forecast horizons. Conclusions can also be drawn about the quality of BCS in each of the 5 sectors examined in constructing the European ESI. Further, potential differences will be examined between the OMS and NMS.Footnote 3

Last, all previous ESI studies analyze quarterly data (which does not provide adequate data frequency to timely and accurately assess tipping points in the national economy) or employ industrial production as a proxy variable for total economic activity. Silgoner (2007, p. 203) even admits that the industrial production accounts for only 25 percent of the EU GDP, but still uses it as a GDP proxy because of its monthly frequency. This paper circumvents the proxy/frequency issue by estimating monthly GDP values for each EU Member State using the widely known Chow and Lin (1971) temporal decomposition technique.

3 Methodological issues

The empirical approach followed in this paper consists of several steps. To propose a new ESI weighting scheme for 26 individual EU Members, 15 ESI subcomponents are analyzed. The goal of this study is to find weights that will maximize the forecasting quality of ESI with respect to year-on-year GDP growth rates.

Fig. 1
figure 1

ESI and GDP growth at the EU level

The officially published ESI at the EU level is depicted (with respect to quarterly GDP year-on-year growth rates) on Fig. 1.

Since the GDP figures are published only at the quarterly level, the Chow and Lin (1971) procedure is utilized to estimate monthly GDP series for each of the EU economies. The technical details of the Chow and Lin (1971) temporal decomposition procedure are given in Sect. 3.1.

3.1 Estimating monthly GDP

The issue of estimating high-frequency GDP data is quite present in the literature for some time now. For example, Proietti (2006, p. 357) states that a vast number of developed western countries continuously employ temporal disaggregation for obtaining flash estimates of their monthly national economic accounts. In that context, the Chow and Lin (1971) procedure is found to be the most efficient and most widely used. Some of its recent empirical applications include Abeysinghe and Lee (1998), Abeysinghe and Rajaguru (2004), and Doran and Fingleton (2013). Some basic properties of the Chow and Lin (1971) procedure are given as follows. The method is used to decompose a low-frequency time series \((y_l)\) to a high-frequency one \((y_h)\). It is assumed that the variable of interest \((y_h)\) is modelled using a linear regression with \(\ p\) independent variables.

$$\begin{aligned} {\mathbf{y_{h}}}={\mathbf{X}} \beta +{\mathbf{u}}, \end{aligned}$$
(1)

where \({\mathbf {{y}_{h}}}\) is a \(\ 3n \times 1\) vector, \({\mathbf{X}}\) is a \(\ 3n \times p\) matrix of regressors, and \({\mathbf{u}}\) is a random vector with mean 0 and covariance matrix \({\mathbf{\Sigma}}\). Equation (1) is valid for \(\ 3n\) months (n quarters). Applying the generalized least squares regression (GLS), an estimate of \({\mathbf{\beta}}\) is found:

$$\begin{aligned} {\hat{\beta}}=[\mathbf{X}^\mathbf{T} \mathbf{C}^\mathbf{T}(\mathbf{C}\mathbf{\Sigma} \mathbf{C}^\mathbf{T}]^{\mathbf{-1}}\mathbf{X}^\mathbf{T}\mathbf{C}^\mathbf{T}(\mathbf{C} \mathbf{\Sigma}\mathbf{C}^\mathbf{T})^{\mathbf{-1}}\mathbf{y}_{\mathbf{l}}, \end{aligned}$$
(2)

where \({\mathbf{C}}\) is a \(\ n \times 3n\) matrix used to convert \(\ n\) quarterly observations of \(\ y_l\) into \(\ 3n\) monthly observations of \(\ y_h\):

$${\mathbf{C}} = \left[ {\begin{array}{lllllll} {1}& {0} & {0} & {0} & {0} & \cdots & {0} \\ {0} & {0} & {0} & {1} & {0} & \cdots & {0} \\ {} & {} & \cdots & {} & {} & {} & {} \\ {} & \cdots & {} & {1} & {0} & {0} & {} \\ \end{array} } \right]. $$
(3)

A crucial puzzle in the Chow and Lin (1971) procedure is the estimation of the covariance matrix \({\mathbf{\Sigma}}\). Namely, it is assumed that the monthly residuals from Eq. (1) follow an AR(1) process \(\ u_t=\rho u_{t-1}+\epsilon _t\), where \(\epsilon _t\) is \(\ WN(0,\sigma _\epsilon )\) and \(\ |\rho |<1.\)

It follows that \({\mathbf{\Sigma}}\) has the form:

$$ {\boldsymbol{\Sigma}} = \frac{{\sigma_\epsilon ^{\mathbf{2}}}}{{{\mathbf{1}} - {{\rho}}^{{\mathbf{2}}} }}\left[ {\begin{array}{lllll} {{1}} & {{\rho}} & \cdots & {{{\rho}}^{{{n} - {1}}} } & {} \\ {{\rho}} & {1} & \cdots & {{{\rho}}^{{{n} - {2}}} } & {} \\ \vdots & \vdots & \ddots & \vdots & {} \\ {{{\rho}}^{{{n} - {1}}} } & {{{\rho}}^{{{n} - {2}}} } & \cdots & {1} & {} \\ \end{array} } \right]. $$
(4)

The algorithm for obtaining \(\mathbf{\widehat{{y}_{h}}}\) is stepwise (Sax and Steiner 2013). First, a preliminary quarterly series is calculated as \(\mathbf{{y}_{p}}={\hat{\beta}}X\). The final estimate of \(\mathbf {\widehat{{y}_{h}}}\) is obtained as the sum of the preliminary quarterly series and the distributed quarterly residuals:

$$\begin{aligned} \mathbf{\widehat{{y}_{h}}=y_p+Du_{l}}, \end{aligned}$$
(5)

where \(\mathbf{{u}_{l}}\) is a \(\ n \times 1\) vector of differences between the estimated quarterly values of \(\ y_p\) and the actual values of \(\ y_l\). Likewise, \(\mathbf{D}\) is a distribution matrix:

$$\begin{aligned} \mathbf{D}= \mathbf{\Sigma} {\mathbf{C}^{\mathbf{T}}} (\mathbf{C} \mathbf{\Sigma} {\mathbf{C}^\mathbf{T}})^{-\mathbf{1}}. \end{aligned}$$
(6)

Here, the regressors used for estimating monthly GDP are retail trade volume and industrial production.

Fig. 2
figure 2

Quarterly and monthly GDP (2010 = 100) indices at the EU level

The obtained monthly GDP at the EU level (together with its quarterly counterpart) is depicted on Fig. 2.Footnote 4

3.2 ESI aggregation and data issues

ESI is the most comprehensive BCS composite indicator. It includes 15 individual response balances \((x_j)\) from five BCS sectors: industry, retail trade, construction, services, and the consumer sector (see European Commission (2014) for a detailed presentation of the 15 chosen questions).

The first step of calculating ESI at the national level is data standardization:

$$\begin{aligned} \ y_{j,t}=\frac{x_{j,t}-\bar{x}_j}{s_j}, \quad \forall j=1,2,\ldots ,15; \; t=1,2,\ldots ,n, \end{aligned}$$
(7)
$$\begin{aligned} \ \bar{x}_j=\frac{1}{T^\prime }{\sum \limits _{t=1}^{T^\prime } x_{j,t}}; \quad s_j=\sqrt{\frac{1}{T^\prime -1}\sum \limits _{t=1}^{T^\prime } (x_{j,t}-\bar{x}_j)^2}. \end{aligned}$$
(8)

The standardization procedure is applied over a frozen period \((T^\prime) \) to avoid continuous monthly revisions of ESI. The frozen period is set by the EC for each country separately, spanning from the starting date of conducting BCS to the first month of the current year. To ensure the inclusion of the most recent data, the frozen period is updated each January. A preliminary version of ESI \((z_t)\) is calculated as a weighted average of the standardized 15 subcomponents \((y_j, j=1,\ldots ,15)\):

$$\begin{aligned} \ z_t=\frac{\sum \nolimits _{j=1}^{15} w_j\cdot y_{j,t}}{\left( \sum \nolimits _{j=1}^{15} w_j\right) }, \quad t=1,2,\ldots ,n. \end{aligned}$$
(9)

In doing so, the questions related to stock volume (Q4 in the industrial survey and Q2 in the retail trade survey), as well as the unemployment level question (Q7 in the consumer survey) are included in ESI calculation with an inverted sign.

The weights \(\ w_j\) applied in ESI calculation are set arbitrarily by the European Commission. They are conceptualized to represent the shares of each sector in the national economy. The weights are fixed (not time-varying) and exactly equal for each EU Member State, as follows: industry 0.4; services 0.3; consumers 0.2; construction 0.05, and retail trade 0.05 (European Commission 2014).

The final estimate of ESI index is obtained by scaling \(\ z_{t}\) to have a long-term mean of 100 and a standard deviation of 10:

$$\begin{aligned} \ ESI_{t}=\frac{z_t-\bar{z}}{s_{z}}\cdot 10+100, \quad t=1,2,\dots ,n, \end{aligned}$$
(10)

where \( \ s_{z}\) is the standard deviation of \( \ z_{t}\). The aim of this step is to facilitate the interpretation of the published ESI figures.

The final aim of BCS is to provide short-term indicators at the EU and EA level. Therefore the EU and EA aggregate ESI indices are obtained as a weighted average of the national ESI indicators. The weights are alternated on a yearly basis, depending on the share of each national economy in the total EU/EA industrial sector, services, consumer, retail and construction sector (respectively).Footnote 5 This study focuses on improving the forecasting quality of ESI for individual EU countries, as well as for the aggregate EU and EA level. The only two Member States excluded from the analysis are Ireland and Luxembourg (because of data unavailability).

Fig. 3
figure 3

The share of industrial production in GDP of EU Member States in 2011. Note Country abbreviations used hereinafter are as follows: AT = Austria, BE = Belgium, BG = Bulgaria, CY = Cyprus, CZ = Czech Republic, DE = Germany, DK = Denmark, EE = Estonia, EL = Greece, ES = Spain, FI = Finland, FR = France, HU = Hungary, IE=Ireland, IT = Italy, LT = Lithuania, LU=Luxembourg, LV = Latvia, MT = Malta,NL = Netherlands, PL = Poland, PT = Portugal, RH = Croatia, RO = Romania, SE = Sweden, SI = Slovenia, SK = Slovakia, UK = United Kingdom

The authors argue that (despite the advantages of BCS harmonization), there are negative side effects of the official ESI weights being fixed and exactly equal across Member States. This approach can potentially lead to sub-optimal forecasting quality of ESI because the shares of the respective BCS sectors in GDP are diverse and volatile. This can be best proven by Figs. 3 and 4. The authors use the share of industrial production in GDP of EU countries, and the shares of services in EU GDP through time, as illustrative examples of how much the sector shares in GDP are both significantly different among the EU countries, as well as they are extremely time-variable.

The respective shares of each particular sector in each country’s GDP is shown in Tables 1, 2 and 3 (for 2005 and 2011). That way a wider perspective can be obtained on how much each of the analyzed sectors is volatile through time and different among the analyzed countries. The shares of services in GDP is obtained from the World Development Indicators database, while all the remaining sector shares are obtained from Eurostat. The estimation period used for each individual country depends on data availability. Explicit time spans for all analyzed countries are given in Table 6 in Appendix 1.

The 15 ESI subcomponents are obtained from the European Commission, while the GDP data (as well as the retail trade and industrial production series) are gathered from Eurostat. The source of the entire dataset is seasonally adjusted using Dainties, as suggested by the European Commission.

Fig. 4
figure 4

The share of services in EU GDP

3.3 Quadratic optimization with constraints

The ESI indicator is, in its essence, a simple weighted mean of standardized survey answers. The weights are arbitrarily chosen by the European Commission and have not experienced any major revision since its introduction. However, the European Commission (2014) states that the weights are chosen according to the “representativeness” of the sector in question and its tracking performance vis-à-vis the reference variable. Since ESI reflects attitudes and expectations about the economy as a whole, the usual reference variable is GDP growth rate. This paper aims to explore possible areas of improvement in ESI’s tracking performance.

3.3.1 Optimization problem

Tracking performance can be viewed from various aspects depending on its definition. The usual starting model is a simple regression equation with ESI as an independent variable and reference variable as the dependent variable (estimated for various ESI lead lengths).Footnote 6 Since ESI is a monthly indicator, its main purpose is to predict the behavior of the national economy prior to the publication of official data. ESI’s leading indicator qualities can be best quantified through the number of months/quarters it precedes to GDP movements. Due to lags between ESI publications and GDP data releases, ESI offers added value to GDP nowcasting/forecasting even as a coincident or slightly lagging indicator. Therefore, various prognostic horizons h are considered here (\(h \in \{-2,-1,0,1,\dots ,12\}\) months).

The optimization problem comes down to finding the optimal weights \(\mathbf{w'} = \left( w_{1},w_{2},\ldots ,w_{5}\right) \), which minimize the root mean square error (RMSE) for the simple regression model \(GDP_{t+h} = \alpha +\beta ESI_{t}+\varepsilon _t\). The problem can mathematically be formulated as follows:

$$\begin{aligned} \begin{array}{ll} \underset{{\mathbf{w}} ,\alpha ,\beta}{\min } \sqrt{\frac{1}{T-h-2}\displaystyle \sum _{t=1}^{T-h}\left( GDP_{t+h}-\alpha -\beta ESI_{t}\left(\mathbf{w}\right) \right) ^2 }\\ \\ {\rm{subject}}\quad {\rm{to}} \quad 0 \le {w_1},{w_2},\ldots ,{w_{5}} \le 1 \\ \\ \displaystyle \sum _{i=1}^{5} w_i = 1, \\ \end{array} \end{aligned}$$
(11)

where \(\alpha \) and \(\beta \) are regression parameters and T is sample size.

As defined by the European Commission (2014), the weights are bounded by 0 and 1, and in sum give unity. The problem in Eq. (11) can be simplified by omitting the square root function and omitting multiplication with a scalar \(\frac{1}{T-h-2}\). Further, transformations from \(w_1y_{1,t}+\dots +w_{5}y_{5,t}\) to ESI do not influence the optimization procedure and ultimately yield the same solution. With that in mind, the problem in Eq. (11) is equivalent to the following problem:

$$\begin{aligned} \begin{array}{ll} \underset{{\mathbf{w}} ,\alpha ,\beta}{\min} \displaystyle \sum _{t=1}^{T-h}\left( GDP_{t+h}-\alpha -\beta \left( w_1y_{1,t}-\dots -w_{5}y_{5,t}\right) \right) ^2 \\ \\ {\rm{subject}} \quad {\rm{to}} \quad 0 \le {w_1}, {w_2}, \ldots ,w_{5} \le 1 \\ \\ \displaystyle \sum _{i=1}^{5} {w_i} = 1, \\ \end{array} \end{aligned}$$
(12)

where \(y_{1,t},y_{2,t},\dots ,y_{5,t}\) are average of standardized survey answers as defined in Sect. 3.2. The optimization problem in Eq. (12) is simpler (has less functions) and is therefore expected to converge to the globally optimal solution. The problem consists of one equality constraint (\(\sum _{i=1}^{5} w_i = 1\)) and 5 bound constrains (\( 0 \le w_1,w_2,\ldots ,w_{5} \le 1\)). Parameters \(\alpha \) and \(\beta \) are not bounded and the problem is nonlinear in parameters. Therefore, a nonlinear optimization method should be used. Nevertheless, relation (12) can be viewed as a quadratic programming problem by substituting \(b_i=\beta w_i\) and changing constraints on weights to \(b_i\ge 0 \ \forall i=1,\dots ,5\) or \(b_i\le 0 \ \forall i=1,\dots ,5\) (parameters \(b_1,\dots ,b_{5}\) have the same sign). Finally, the optimization problem becomes:

$$\begin{aligned} \begin{array}{ll} \underset{{\mathbf{w}} ,\alpha ,\beta}{\min } \displaystyle \sum _{t=1}^{T-h}\left( GDP_{t+h}-\alpha -b_1y_{1,t}-\cdots -b_{5}y_{5,t} \right) ^2 \\ \text {subject} \;\; {\rm to} \;\; \text {sgn}\left( b_1\right) =\text {sgn}\left( b_2\right) =\dots =\text {sgn}\left( b_{5}\right) .\\ \end{array} \end{aligned}$$
(13)

The problem in (13) is a special case of a quadratic programming problem of form \(\min \mathbf{d'}{\mathbf b} + {\frac{1}{2}}{\mathbf{b}} {'\mathbf{D}} {\mathbf{b}} \) with constraint \( {\mathbf {A'}}{\mathbf{b}} \ge {\mathbf{b}}_{\mathbf{0}} \). When matrix \(\mathbf{D} \) is positive definite, the dual method of Goldfarb and Idnani (1982, 1983) can be employed to find the optimal parameters. The R package quadprog implements the algorithm and is used in estimating the unknown parameters.

ESI’s tracking performance can also be assessed by Pearson’s correlation coefficient between ESI and GDP growth rate for various lead lengths. For a purpose of robustness check of the results obtained from minimizing RMSEs, the problem of maximizing the correlation coefficient is also considered. The same constraints apply here as in problem (11). The problem can mathematically be formulated as follows:

$$\begin{aligned} \begin{array}{ll} \underset{\mathbf{w}}{\max }\displaystyle \frac{\sum _{t=1}^{T-h}\left( GDP_{t+h}-\overline{GDP}\right) \left( ESI_{t}-\overline{ESI}\right) }{\sqrt{\sum _{t=1}^{T-h}\left( GDP_{t+h}-\overline{GDP}\right) ^2}\sqrt{\sum _{t=1}^{T-h}\left( ESI_{t}-\overline{ESI}\right) ^2}} \\ \\ \text{subject}\;\;{\rm to}\;\;0 \le w_1,w_2,\ldots ,w_{5} \le 1 \; \text{and} \; \displaystyle \sum_{i=1}^{5} {w_{i}} = 1. \\ \end{array} \end{aligned}$$
(14)

After obtaining novel ESI weights through quadratic optimization with constraints, the new aggregate EU and EA ESI indicators are also proposed, using the officially utilized EC weights for each of the analyzed countries. Namely, treating these national weights as unknowns in the quadratic optimization problem would lead to obvious over-parametrization of the problem. In some cases the number of parameters to estimate would be even larger than the number of data observations.

3.3.2 Assessing the quality of the ESI indicator

One of the tasks of this study is to quantify the extent to which the official European ESI can be improved (in terms of forecasting accuracy). To provide evidence on the topic, the distance between the optimal weights obtained here and the EC weights is calculated by two norms: the Euclidean and the maximum norm. The precise formulae are:

$$\begin{aligned} \left||{\mathbf{w}}-{\mathbf{w}}^{EC}\right||_{2}=\sqrt{\sum _{i=1}^{5}\left( w_{i}-w^{EC}_{i}\right) ^2} \end{aligned}$$
(15)
$$\begin{aligned} \left||{\mathbf{w}}-{\mathbf{w}}^{{EC}}\right||_\infty =\underset{i=1,\ldots ,5}{\max}\left| w_i-w^{EC}_{i}\right| , \end{aligned}$$
(16)

where \(w^{EC'}=\left( 0.4, 0.3, 0.2, 0.05, 0.05\right) \).

If the optimal weights differ only slightly in comparison to the official EC weights, the norm will be close to zero. On the other hand, the maximum value of both norms is close to unity (\(\frac{2.95}{3}\)).

4 Estimation results

The empirical strategy followed in this paper allows different weights for each of the 5 analyzed BCS sectors. The “optimal” weights obtained by minimizing RMSE in GDP forecasting equations (for chosen forecasting horizons \(h \in \{-2,-1,0,1,6,12\}\)) are summarized in Tables 1, 2 and 3.Footnote 7

Table 1 Optimization results—optimal sector weights (part 1)
Table 2 Optimization results—optimal sector weights (part 2)
Table 3 Optimization results—optimal sector weights (part 3)

After carefully examining Tables 1, 2 and 3, several conclusions can be drawn. First, the highest weights are (on average) indeed attached to the industrial sector. Namely, the average calculated weights (for all examined countries and forecast horizons) are as follows: 0.282, 0.229, 0.240, 0.124 and 0.125 for the industrial, services, consumers, retail trade, and construction sectors (respectively). It is immediately evident that these results significantly deviate from the official ESI weighting scheme.

As far as the industrial sector is concerned, its hereby proposed weights should be put in reference to the Silgoner (2007, p. 203) argument that the share of industrial production in the EU GDP is only \(25\,\%\). To obtain fresh insight, Tables 1, 2 and 3 summarize the shares of each sector in GDP for individual countries, the EU, and EA. They reveal that the average share of industrial production in GDP for the analyzed EU countries is 23.6 % in 2011. Therefore, one can conclude that the optimization procedure applied here produces less biased industrial sector weight and moves it closer to its “true” value. It is no surprise that the net results is an enhancement of ESI’s forecasting accuracy with respect to GDP growth.

Also, the proposed industry weight is lower than the official EC one in the vast majority of countries (and forecast horizons). This is completely in line with the global long-term trend of deindustrialization (Brady 2006). In the dilemma between BCS harmonization (fixed weights) and optimal forecasting quality, the EC has opted for the former option.

The only countries that (on average) have higher industrial weight than the EC one are Greece, Spain, Italy, Poland, Romania and Slovenia. It instantly becomes evident these countries have either a long history of socialist industrialization, or a form of state capitalism dominated by large industrial corporations.

The official European ESI is calculated with a weight of as much as 0.30 attached to the services sector. Nevertheless, this analysis has shown that the importance and predictive characteristics of the services sector is not that pronounced. To be more specific, its average proposed weight is considerably smaller than the official one. This is one of the most striking study results. Namely, there is a consensus in the literature that the global economy has exhibited a long-term structural shift from a manufacturing economy to a service economy. For example, Dudzeviciute et al. (2014, p. 359) provide evidence that, on the European level, the services sector share in the total value added has increased from \(46.7\,\%\) to as much as \(70.8\,\%\) since the 1970s. To corroborate that, the average share of services in GDP is calculated for the analyzed countries (on the basis of Tables 1, 2 and 3), and it is equal to exactly 70 % in the 2011.

Although the stated process of tertiarization is irrefutable, the results presented here clearly show that the services sector’s forecasting power is rather weak. A few exceptions can also be found in that context. Table 1 reveals that the German and Danish ESI can be seriously improved by attaching exceptionally high weights to the services-related variables (the highest weight of as much as 1 is found for the German economy at \(h=1, 6\). Such pronounced dominance of the services sector in Germany is also corroborated by the official statistics. Namely, Franke and Kalmbach (2005) acknowledge the business services as the fastest growing sector of the German economy.

The consumer and retail trade sectors are intrinsically interdependent, so it is obvious that they exhibit similar properties. Both are attached considerably larger weights then in the official ESI calculation scheme. This is not surprising since, on average, the final consumption of households accounts for as much as \(60.9\,\%\) of GDP in the analyzed EU countries in 2011 (see Tables 1, 2 and 3). Moreover, the strong relationship between personal consumption and GDP is well-established in the literature (Crossley et al. 2013; Tapsin and Hepsag 2014).

One particularly interesting feature of the consumer and retail trade sectors is observed here. Namely, both sectors on average exhibit a growth in significance for larger forecast horizons. This is particularly emphasized for the consumer sector, making it clear that the information needed for long-term GDP forecasting lies in the hands of consumers. This explains why macroeconomic forecasts can often be improved if consumer sentiment is taken into account (see Batchelor and Dua (1998) and all the paper cited there). Short-term GDP predictions are, on the other hand, to the greatest extent influenced by the industrial sector.

The construction sector weights are mostly rather small throughout the analyzed model specifications. Some of the exceptions include France, Germany, Finland, and Spain (among EA members),Footnote 8 as well as e.g. Croatia and Sweden (among non-EA countries) for some of the analyzed lead lengths. All the mentioned countries (except Croatia) are highly developed. Namely, the literature offers clear evidence of strong correlation between income and dwelling investments in such countries. As an example, Arestis and Gonzalez-Martinez (2015) characterize the construction sector as strategically important due to its strong forward and backward linkages in OECD economies. Leamer (2007) also identifies housing investments as a remarkably accurate recession predictor for the US economy. Croatia, on the other hand, is a quite atypical European economy, highly dependent on its construction sector. It is well-documented that the entire Croatian economic expansion from 2000 to 2008 was founded on a real estate bubble (Tkalec and Vizek 2014).

Apart from interpreting the obtained results from the aspect of actual sector shares in GDP, one might also raise a question of the BCS respondents’ capacity to form accurate macroeconomic forecasts. This notion relates to the famous rational expectations hypothesis (see e.g. Sabrowski 2008), which states that economic agents are fully rational (their expectations are correct on average) and take into account the whole available information set. In that context, the “optimal” weights obtained here prove that the industrial sector managers (on average) poses the deepest knowledge of the underlying economic mechanisms, which enable them to form the most accurate predictions.

In order to extract homogeneous groups of countries with respect to the obtained sectoral weights, cluster analysis is applied on the five sector weights (cases for the 5 variables are obtained as average weights over the 15 analyzed lead lengths; \(h \in \{-2,-1,0,1,\dots ,12\}\)). The optimal number of five distinctive clusters is obtained on the basis of Charrad (2014) procedure in R software. Using the K-means clustering method, the analyzed countries are grouped in the following way. The first cluster comprises Germany, Denmark, and Estonia. The second cluster consists of the Czech Republic, France, Hungary, Lithuania, Netherlands, and Slovakia. They are followed by Cyprus, Latvia, Croatia and Sweden in cluster three. The fourth groups comprises Greece, Spain, Italy, Poland, Romania, and Slovenia. The final group is made of Austria, Belgium, Bulgaria, Finland, Malta, Portugal, and the UK.

Fig. 5
figure 5

Means plot for the obtained 5 country clusters

The plot of cluster means is depicted on Fig. 5.

It is evident that the clusters are extracted based on the dominant sector in their member countries. For example, the first cluster is dominated by the services sector since all three countries exhibit extremely large services weights.

Likewise, the third cluster is consisted of construction-driven economies. Members of the fourth cluster are dominated by large industrial corporations, while the economies in the fifth cluster can best be described by consumer sentiment. Finally, the second cluster represents a “moderate” group of countries, mostly characterized by typical (average observed) weights for all five BCS sectors.

Fig. 6
figure 6

The official and new ESI with respect to GDP growth at the EU level

Fig. 7
figure 7

The official and new ESI with respect to GDP growth at the EA level

The newly obtained ESI indicator at the EU and EA level (at forecast horizon \(h=0\)) is depicted on Figs. 6 and 7, along with the corresponding monthly GDP series. It is evident that the “new” ESI follows GDP more closely, without any time lagging. Therefore the hereby proposed calculation scheme has brought about an improvement in ESI’s nowcasting quality. Similar evidence is also obtained for other forecast horizons. However, it is important to inspect whether the results obtained up to this point are solely an in-sample characteristic, or are the drawn conclusions valid even for the out-of sample forecasting.

For each of the analyzed countries, the cut-off point has been set separately to the last 24 and 36 monthly observations. Using the dataset up to that point in time, a pseudo out-of-sample forecasting exercise is performed. The results are presented in Table 4.

Table 4 Comparison of ESI indicator quality—out of sample

The results presented in Table 4 range from very bad (in case of Spain and Germany) to considerable improvements (Czech Republic, Belgium, Finland, Hungary, etc.). Just as in Gelper and Croux (2010), an improvement is also noticed on the aggregate EU level for the 36 months cut-off point. However, the same cannot be said about the 24 months cut-off point. In general, at least an incremental rise of ESI’s forecasting quality is observed for the vast part of analyzed countries.

The optimization problem in (14) is also considered here for the purpose of a robustness check. The analysis yields very similar results as the RMSE minimization in Tables 1, 2 and 3, and is available in Tables 7, 8, 9, 10 and 11 in Appendix 2. Again, ESI’s in-sample forecasting quality is considerably enhanced, while the out-of-sample results are not so convincing.

To quantify the extent to which the weights obtained through RMSE minimization differ from the official ESI figures, the obtained Euclidean and maximum norms are presented in Table 5. Further, an index of the obtained RMSEs is presented in the final column. An index value larger than 100 corresponds to an improvement in the newly proposed weighting scheme in comparison to the official EC weights.

Table 5 Comparison of ESI indicator quality

All three applied distance measures reveal similar tendencies. The last two rows of Table 5 are particularly interesting, showing that (on average) the proposed weighting scheme offers somewhat more added value in OMS than in the NMS. What strikes as the most peculiar result is that Germany, one of the founding EU members, exhibits one of the largest ESI improvement potentials (regardless of the applied distance measure). Namely, it is well-founded in the literature that the German ESI performs rather badly in GDP forecasting. For example, Schröder and Hüfner (2002) compare the forecasting accuracy of German ESI to other composite indicators (IFO business expectations measure, the Purchasing Managers Index, and the ZEW indicator of economic sentiment). Their results reveal that, out of the analyzed indicators, ESI has the worst leading characteristics. Moreover, ESI is found to be not a leading, but a lagging indicator of total economic activity.Footnote 9

Additionally, a parallel can be drawn between the results from Tables 1, 2 and 3 and the ones in Table 5. Namely, the majority of countries with exceptionally high construction sector weights (Germany, Finland, Spain, Croatia, etc.) are among the best-placed Member States according to the ESI improvement obtained by this study. This proves that attaching higher weight to the construction sector adds to the accuracy of ESI-based predictions, and corroborates the relevance of the construction sector in determining the business cycles of developed European countries.

The results presented in Table 5 can by no means be interpreted by stating that the sole methodological basis of administering the surveys (sample selection, non-response treatment, etc.) is better in NMS than in OMS. Table 5 merely reveals how much room for improvement does each particular Member State have in terms of ESI’s predictive accuracy. In fact, one can say that the BCS data from OMS have the potential to generate more accurate GDP predictions, while the same point is less valid for the NMS.

5 Conclusion

The process of Euro integration and harmonization in the area of official statistics has offered several valuable advantages to both economic practitioners and researchers, as well as to economic decision-makers of any kind. BCS data are now fully harmonized in all EU Member States. This ensures the application of the best international practice of conducting the BCS and enables a multi-country comparative analysis of BCS results.

However, by insisting on data comparability, the European Commission has also triggered some negative side effects of the integration process. Most importantly, the European ESI is calculated equally in all EU Member States, applying the exact same (arbitrarily chosen) sector weights. This inevitably leads to bad ESI forecasting performance in at least some EU countries. The necessity of conceptualizing more accurate macroeconomic forecasting models has been emphasized through the recent global crisis in a rather painful manner.

Therefore, this paper applies nonlinear optimization techniques to propose a novel ESI weighting scheme for each of the 26 analyzed individual Member States, the EU and EA. The weights are found by minimizing the RMSEs obtained from simple GDP forecasting equations including ESI as the predictor variable.

The obtained in-sample results have showed that ESI’s forecasting accuracy can be significantly improved by attaching larger weights to the retail trade and consumer sectors. On the other hand, the importance of the industrial sector is heavily overestimated by the official EC weights. The empirical analysis of this study has shown that lowering the industrial weights significantly improves ESI’s leading properties.

Moreover, it is proven that the OMS are characterized by somewhat larger potential for improving ESI. Namely, several alternative distance measures have shown that the prediction improvement of the hereby proposed weighting scheme is larger for those countries then for NMS. This can to some extent be explicated by the fact that the official ESI weights are obviously more in accordance with the structure of NMS economies.

The obtained results are proven to be robust by also finding weights that maximize the correlation coefficient between GDP and ESI for various lead lengths.

This study provides valuable information about the functioning mechanisms of EU Member States economies. The countries are clearly grouped into five clusters, each of them dominated by a particular sector.

The out-of-sample forecasting exercise offers mixed evidence on the quality of novel ESI calculation. The results are not uniform, but the proposed weighting scheme results in at least a marginal improvement for the majority of the observed countries, as well as for the EU in case of 3 years data cut-off point. Whether the obtained improvements are good enough to officially change the ESI weighting scheme, it is for the European Commission to decide.

If one wished to pinpoint clear policy implications from this study, they might be based on the following. Currently, the ESI data are revised at the beginning of each calendar year by changing the frozen period employed in its calculation. This means that past ESI figures are by no means comparable to the ones calculated on the basis of any of the formerly applied frozen periods. In other words, altering the weighting scheme at the same time would bring no additional cost to the European Commission. These yearly revisions of the applied weights could for example be based on the procedures applied here. This would to some extent raise the forecasting accuracy (both in-sample and out-of-sample) of ESI in individual Member States and at the aggregate EU level.

This paper suggests merely two of the possible methodological paths to improving ESI’s forecasting accuracy. Future research should certainly test whether the potentially low quality in an individual country’s ESI can be put in relation to the practice of conducting the surveys themselves. Namely, the European Commission does not publish data on, for example, exact response rates (but merely targeted response rates) or sampling errors for all countries and sectors.

Regardless, the calculation of European ESI and improving its predictive accuracy deserves more attention in future research.