1 Introduction

It is well known that macroeconomic data are released with a substantial delay. Additionally, in emerging market economies, low-frequency data (i.e., annual national accounts) rely on a smaller array of surveys and indicators than in advanced economies, and provide a partial picture of the economy. However, complete and up-to-date information on the current state of the economy is crucial for policy makers, market participants and public institutions. Indeed, agents periodically update their forecasts, and monitoring economic conditions in real time helps them assess whether the forecasts are on track or need to be revised. Similarly, the process of policymaking often requires long term projections of the economy that heavily rely on accurate initial conditions and forecasts. Therefore, constructing timely “predictions” of current economic conditions, namely nowcasts, is of fundamental importance for decision making.

A lot of information is contained in economic indicators that are available on a quarterly, monthly, weekly, and even daily basis, and, in principle, it is possible to use this information to build “predictions” of the current state of the economy. However, high-frequency data in emerging market economies are often scant, noisy, released with a lag, and can have missing observations. This complicates the difficult task of real-time monitoring and decision making as constructing timely indicators on current economic conditions for emerging market economies presents some extra challenges.

In this paper, we focus on Indonesia, the largest economy in Southeast Asia, which is rapidly gaining influence in the world economy. With a number of high-frequency data indicators available and yet facing problems that commonly plague emerging economy datasets, Indonesia provides an interesting training case for developing a nowcasting framework that can be applied to monitor other similar economies in the region.

Two main issues emerge with regard to monitoring in real time: how many and which indicators to select and what econometric model to use to extract information from the data. In this paper, we produce “predictions” of the current state of the Indonesian economy by estimating a Dynamic Factor model on a dataset of 11 indicators (followed closely by market operators) over the 2002–2014 period. Our choice of the model is based on the fact that it is parsimonious, it is able to cope with missing data and mixed frequency indicators, and it can potentially be estimated on a large number of variables. Further, since the seminal paper of Giannone et al. (2008), this model has become a standard tool for monitoring economic activity, as it has proved to be successful in nowcasting several economies, including emerging ones such as China Giannone et al. (2014) and Brazil Bragoli et al. (2015).

The rest of the paper proceeds as follows. Section 2 presents Indonesia’s GDP data, and discusses the problems of having several GDP series with different base years, and no official seasonally adjusted data. Section 3 discusses our nowcasting procedures and is divided in two parts: in the first part we describe the process of choosing a set of indicators that contains useful information on economic activity, and in the second we present the application of a dynamic factor model to Indonesia’s data.

Section 4 presents the evaluation of our model. Several results emerge. First, incorporating high-frequency data in a rigorous framework leads to an improvement in the forecast accuracy of Indonesia’s economy compared with simple univariate benchmarks. Second, too many variables are not always optimal for the purpose of monitoring as they can be noisy or uninformative (Bańbura et al. 2013; Luciani 2014b), particularly so when the target variable, namely Indonesia’s GDP growth, has limited number of observations. A careful selection of meaningful variables improves the forecast performance. Third, our model does well in predicting quarterly GDP growth when compared with private forecasters such as Bloomberg, and also does well in predicting annual GDP growth when compared with institutional forecasts of the International Monetary Fund and the Asian Development Bank.

Finally, Sect. 5 concludes.

2 Indonesia’s GDP data: atterns and issues

In this section we present Indonesia’s GDP data, and discuss a number of issues in the data that need to be carefully tackled before starting any monitoring process.

Fig. 1
figure 1

Source: CEIC Data Company (accessed 30 June 2015); Authors’ computation

Gross domestic product series for Indonesia. Notes The left plot shows quarterly GDP at constant prices in levels, where the unit of measure is Trillions of Rupiah. The right plot shows the year-on-year growth rate of quarterly GDP at constant prices. In both graphs the three lines represent the different base years. There was also an accounting change from SNA 1993 to SNA 2008.

The first issue is that there is no single long series for GDP available from official statistical sources. The left plot in Fig. 1 shows the level of quarterly GDP at constant prices in trillions of rupiah from 2000 to 2014, from the national source. The right-hand side plot shows the computed year-on-year (y-o-y) growth rate of quarterly GDP from 2000. The three lines refer to different base years, 1993, 2000 and 2010 respectively. Each line, however, is only available in a limited time span in that 1993 prices is available for 1993–2003, 2000 prices is available for 2000–2014 and the latest 2010 prices is available since 2010 onwards only.Footnote 1

The aggregation methodology was common between the 1993 and 2000 base years, but changed from SNA 1993 to SNA 2008 for the 2010 base year.Footnote 2 Base changes are common for GDP data, as they can incorporate changes in the economy’s structural composition. However, as seen in Fig. 1, growth rates in the few years of overlap between series were also affected.Footnote 3 Hence the composition of the aggregate series and methodologies used in their construction is brought into question, exacerbated by a lack of publicly available information on procedures used.

Fig. 2
figure 2

Source: CEIC Data Company (accessed 30 June 2015); Authors’ computation

Quarter-on-quarter growth of real GDP. Notes The left plot shows the seasonal fluctuations in q-o-q GDP growth, where real GDP data are reconstructed using y-o-y growth rates. The right plot shows q-o-q growth rates of real GDP series available, based on two different accounting methodologies, SNA 1993 and SNA 2008, and two different base years, 2000 and 2010.

The second issue is related to which time span to use in conducting the econometric analysis. Indeed, GDP growth in Indonesia is characterized by large fluctuations due to different crises hitting the economy. In particular, the Asian financial crisis in the late 1990s stands out as a unique episode for Indonesia’s growth path with GDP falling more than 15% in 1998 (not shown in Fig. 1). Growth recovered to about 5% in 2000 and stabilized somewhat at this yearly rate with a significantly lower volatility since 2002. Considering this pattern, we exclude the Asian financial crisis years from our sample and use the series only from 2002. This is unavoidable because the evolution of GDP growth over the financial crisis would dominate our estimates if not excluded. It would capture features of the data (i.e., co-movements) that are potentially spurious and misleading for the nowcasting exercise, as the relationships that held during the financial crisis could be different from those that hold during expansions and more moderate recessions.

The third issue with Indonesia’s GDP data is that the series exhibits a marked seasonal pattern when we compute quarter-on-quarter (q-o-q) growth rates (Fig. 2). Yet there is no seasonally adjusted GDP series available from official sources, which leaves us with the problem of having to deal with seasonality in the data, particularly when trying to combine series with different base years to obtain coherent and long time series. Indeed, as shown in the right-hand-side panel in Fig. 2, the seasonality in GDP growth data with 2010 base year exhibits a clear departure from the seasonal pattern in the series with 2000 base year. This adds to the complication of splicing the different series together. Suppose we start with the latest available GDP series that is of 2010 base, and extend it backwards using the q-o-q growth rates of the previous base series. As seen in the left-hand side plot in Fig. 3, this produces inconsistent seasonal patterns within the spliced series—that is, between the (growth of) actual data and the extended data. On the contrary, the reconstructed series based on y-o-y growth rates instead does not seem to have the same shortcoming.

The right-hand side panel of Fig. 3 plots the constructed long time series for GDP based on the two different splicing methods. It shows that while the series constructed using the y-o-y growth rates has consistent dynamics with that of previous base year, the one generated using the q-o-q growth rates has not. The latter exhibits a jump in GDP during the four quarters when the two shorter series were spliced. For this reason, splicing the series using a y-o-y growth seems to warrant a more consistent long series. Furthermore, we will also continue to use y-o-y growth rates in our analysis going forward.

Fig. 3
figure 3

Source: CEIC Data Company (accessed 30 June 2015); Authors’ computation

Reconstructing GDP series using y-o-y growth. Notes Since the latest available data starts in 2010, we have to reconstruct a long time series and the left plot displays the growth patterns using q-o-q and y-o-y growth rates. The right plot displays the reconstructed series using different growth rates.

Finally, we are aware that using y-o-y growth tackles the issue of seasonality and makes series smoother, but it also lags q-o-q growth. However, the alternative of using some standard procedure to seasonally adjust the series, and then evaluating the forecasting performance on the resulting unofficial series would be of little practical use. Not only would it entail defining of a new, somewhat arbitrary target variable, but it would also be of little relevance to policy makers, who focus on the official y-o-y GDP growth. Of course, had the Indonesia Statistics provided official q-o-q seasonally adjusted data, we would have used that as our target for the nowcast.

3 Nowcasting

Typically, GDP data provide the most comprehensive picture of the economy, by aggregating activity of different sectors. Unfortunately, the data come with long delays and are not available on a high-frequency basis. In most cases, data are published quarterly and, in some developing countries, even annually.

For Indonesia, GDP data are reported quarterly with a delay of about 5 weeks. This means that the growth rate of the economy in the first quarter that ends in March is not known until the second week of May. While this is a reasonable delay compared to most emerging and some advanced economies, it is still insufficient for the purpose of real-time monitoring, as it is not possible to make an assessment of the strength of economic activity for almost 5 months into the year.

To gauge the current state of the Indonesian economy in real time, we need to construct a prediction of GDP growth before the official data are released. This means that at each point in time we want to predict not only the current and next quarter estimates for GDP growth (henceforth nowcast and forecasts), but also, wherever the official data have not been published yet, the past quarter GDP growth (backcast).

Ideally, if the true data sources and compilation methods for GDP were known, we could simply attempt to reverse engineer the process performed quarterly by the Indonesian statistical office but at a higher frequency. Unfortunately, as discussed in Sect. 2, very little information on these methods is available from the statistical agency. This forces us (1) to build an information set that is informative for describing the GDP growth process, possibly by including variables that may be outside the scope of the statistical office but may still contain useful leading information, and (2) to choose an econometric model to build our prediction.

In the next two subsections we address these issues. In particular, in Sect. 3.1, we explain how we construct the database, while in Sect. 3.2 we introduce Dynamic Factor models (DFMs). We choose to use a DFM since it is a parsimonious model, it copes with missing data and mixed frequency, and it can potentially exploit a large information set. Furthermore, being framed in a Kalman filter framework, it allows a formal reading of the news flow coming from successive data releases. Finally, DFMs proved to be very successful in nowcasting several economies, including emerging ones.Footnote 4

3.1 What variables to select?

There is a wide set of potentially useful monthly and quarterly series that could help extract information on the state of the Indonesian economy. However, the limited number of time series observations available for our target quarterly y-o-y GDP growth (see Sect. 2), constrains our choice of both the variables and the model.

In principle, DFMs are consistently estimated when the number of variables is diverging to infinity, and in practice they are usually estimated on relatively large datasets. However, when the time series observation is severely limited, including too many variables is likely to introduce estimation uncertainty, which ultimately worsens the prediction performance. This is a particular source of concern when dealing with Indonesian data, as only few series display a marked comovement with the GDP quarterly dynamics and hence the risk is to introduce excessive noise in the model estimation. Furthermore, the literature on nowcasting with large-dimensional DFMs has reached the conclusion that, unless one needs to monitor the data flow with many data releases, there is no gain from too large a database as long as the variables on which the model is estimated are appropriately selected (Bańbura et al. 2013; Luciani 2014a, b). This then leads to questions on how many variables should be selected and on what basis.

A possible strategy to draw from a large pool of variables could be to rely on a purely mechanical statistical selection procedure. For example, Bai and Ng (2008) suggest selecting with the Least Angle Regressions (LARS algorithm) only those variables that are really informative for forecasting the target variable, while Camacho and Perez-Quiros (2010) suggest first selecting a core group of variables and then evaluating if other possible predictors are useful.

An alternative strategy, pioneered by Bańbura et al. (2013) and followed by Luciani and Ricci (2014), Giannone et al. (2014), and Bragoli et al. (2015), is to exploit the “revealed preferences” of professional forecasters who follow the Indonesian economy on the Bloomberg platform. These analysts subscribe to the Bloomberg news alert for specific data releases of the variables that they monitor and use them to form their expectations on current and future fundamentals of Indonesia. As Bloomberg constantly ranks the analysts’ demand for these alerts by constructing a relevance index for each macroeconomic indicator, we can select variables based on this relevance index.Footnote 5

We adopt the latter approach, as the automatic selection approach risks leading to an unstable choice of variables in a real-time scenario.Footnote 6 This instability would not only be difficult to justify from an economic standpoint, but it would also complicate the interpretation of the forecasts’ revisions. Moreover, we also tried the automatic selection approach, and the performance of our model was worse in this case than when we used the revealed preference approach (see Appendix A.1).

It turns out that for Indonesia, only a relatively small number of macroeconomic series are tracked in real time by the markets (see Table 1). On the one hand, there are indicators describing macroeconomic developments (e.g., GDP itself, car sales, exports, imports and manufacturing PMI). Moreover, given their direct influence on the foreign exchange and fixed income markets, analysts also monitor indicators that directly describe the monetary policy stance. These are the central bank reference interest rate as well as key monetary aggregates.

Table 1 Bloomberg calendar: follow the market revealed preference.
Table 2 Data description and data treatment.

Starting from the set of indicators in Table 1 followed by business analysts we constructed our database as follows:

  1. (1)

    We excluded all indicators that either had too few observations, or we could not retrieve. This is the case for PMI, for which data are available only starting in June 2012, and for Danareksa Consumer Confidence and Motorcycle Sales for which we were not able to retrieve data.Footnote 7

  2. (2)

    We screened each of the remaining indicators to understand whether they are followed by analysts because they convey information on the state of the real economy, or they are directly related to the stance of the Central Bank and its balance sheet. Therefore, because the Bank of Indonesia has an inflation target, we discarded CPI, and we also removed Foreign Reserves, and Net Foreign Assets as they are mainly related to the foreign exchange policy. However, we included monetary “instruments” like the bank reference rate and the M2, as proxies for the monetary stance, which itself should be forward-looking.

  3. (3)

    We then excluded those variables that are the sum of other variables in the database or are too similar to other series. So we kept Imports and Exports, but we excluded Current Account; and we kept M2, but we discarded M1.

  4. (4)

    Finally, we used our “expert judgment” and added a few indicators that we think provide some extra information about the Indonesian economy. To capture information regarding the increasing role of construction activity for the Indonesian economy, we included domestic cement consumption. To account for spillovers from the foreign sector into the domestic economy, we included a variable from the very timely Market PMI manufacturing survey. In particular, we included the aggregate for emerging economies, which is dominated by developments in China as well as countries in Asia. Finally, we also included a sectoral breakdown of the imports series to better capture the possibly different lead/lag characteristics of each of these series with GDP growth.

By following this strategy, we end up with a dataset of 10 macroeconomic indicators plus GDP (see Table 2). While GDP and Business Tendency Index are quarterly series, the remaining are at monthly frequency. The column “Delay” reports the publication delay expressed in number of days in our stylized calendar and shows substantial differences between series in terms of their publication delay. For example, the PMI for developing economies is published just 4 days after the reference month, while data on imports are released a month after the reference month.Footnote 8

A notable exclusion from our database is Industrial Production, particularly so because this variable is often used by analysts to assess high-frequency movements in economic activity in emerging market economies.Footnote 9 The underlying assumption is that movements in the industrial sector are a good approximation of aggregate economic activity. While the assumption may hold well for some economies, in others, the cyclical component of the index is not found to be sufficiently synchronized with GDP (Fulop and Gyomai 2012). The correlation between Indonesia’s GDP growth and the y-o-y growth rate of industrial production is in fact relatively weak (standing at 0.35 in the full sample) and displays hardly any comovement since 2010 (see Fig. 4). This is likely related to the decreasing contribution to GDP growth from the manufacturing sector, off-set by other sectors like construction and services, as the share of industrial production has been declining since 2002 and now accounts for about one-third of GDP.

Fig. 4
figure 4

Source: CEIC data company (accessed 30 June 2015)

Gross domestic product and industrial production. Notes The thick black line is y-o-y GDP growth rate, while the thick gray line is y-o-y growth rate of industrial production index. The thin horizontal black line indicates 0. The data for industrial production were converted to quarterly frequency by averaging monthly observations. Note that both variables have been standardized to be mean zero and variance one for comparison purposes, which is why there is no scale in the y axes.

3.2 Dynamic factor models

Factor models are based on the idea that macroeconomic fluctuations are the result of a few macroeconomic shocks that affect the whole economy and a number of sectoral/regional shocks that affect a part of the economy. Therefore, each variable in the dataset can be decomposed into a common part and an idiosyncratic part, where the common part is assumed to be characterized by a small number of common factors (\({\mathbf {f}}_t\)) that are time series processes meant to capture the comovement in the data (i.e., the business cycle).

Formally, let \(x_{it}\) be the i th stationary variable observed at month t, then

$$\begin{aligned} x_{it}&=\varvec{\lambda }_i {{\mathbf {f}}}_t + \xi _{it} \quad i=1,\ldots ,n \end{aligned}$$
(1)

where \({\mathbf {f}}_t\) is an \(r\times 1\) vector (with \(r\ll n\)) containing the common factors, and \( \xi _{it}\) is the i th idiosyncratic component. The vector of common factors evolve over time as a VAR(p) process driven by the common shocks \({\mathbf {u}}_{t}\sim \mathcal {N}(0,{\mathbf {I}}_r)\), while each idiosyncratic component follows an independent AR(1) model driven by the idiosyncratic shocks \(e_{it}\):Footnote 10

$$\begin{aligned}&\displaystyle {\mathbf {f}}_t=\sum _{s=1}^p {\mathbf {A}}_s {\mathbf {f}}_{t-s}+{\mathbf {u}}_t \end{aligned}$$
(2)
$$\begin{aligned}&\displaystyle \xi _{it} ={\mathbf {\rho }}_i \xi _{it-1} +e_{it} \quad i=1,\ldots ,n. \end{aligned}$$
(3)

Equations (1)–(3) define the DFM used in this paper. This is the model studied in Doz et al. (2011, 2012), which is a special case of the model studied in Forni et al. (2009). In this model, the common shocks and the idiosyncratic shocks are assumed to be uncorrelated at all leads and lags, while the idiosyncratic shocks are allowed to be cross-sectionally correlated, albeit by a limited amount (approximate factor structure). For a formalization of the assumptions and for a more comprehensive treatment of the DFM, we refer the reader to the aforementioned references and to the survey by Luciani (2014b).

Model (1)–(3) can be estimated by Principal Components (Stock and Watson 2002a; Bai 2003), by using a two-step estimator based on the Kalman Filter and Principal Components (Doz et al. 2011), or by maximum likelihood techniques through the EM algorithm (Doz et al. 2012). In this paper, we will use maximum likelihood, and in particular we will use the EM algorithm proposed by Bańbura and Modugno (2014), which can handle both mixed frequencies and missing data (see Appendix A.2).Footnote 11

4 Empirics

4.1 The forecasting exercise

To evaluate the performance of our model, we perform a pseudo real-time out-of-sample exercise. Predictions of Indonesian GDP growth are produced according to a recursive scheme, where the first sample starts in July 2002 and ends in December 2007, while the last sample starts in July 2002 and ends in December 2014. The model is estimated at the beginning of each quarter using only information available as of the first day of the quarter, and then the parameters are held fixed until the next quarter. For the estimation, we include two factors (\(r = 2\)) and two lags (\(p = 2\)) in the VAR model governing the evolution over time of the factors. An extensive discussion of why we included two factors can be found in Appendix A.3.

To perform our, exercise we construct real-time vintages by replicating the pattern of data availability implied by the stylized calendar (Table 2), and every time new data are released, we update the prediction based only on information actually available at that time. We label this exercise pseudo real time since we do not have access to the actual set of data vintages as they were released over time. As to GDP growth, revisions are very infrequent and relate to changes in national accounts methods rather than to the arrival of more complete information within a year. That is, quarterly figures are treated as the final number until a base change occurs. In those occasions the statistical office can revise substantially the growth profile of GDP, as documented in Sect. 2. Similar changes may occur also for the other variables in our database, but no information regarding the existence of well defined revision schedule is available.

4.2 Comparison against statistical benchmark

To judge the performance of our model and to evaluate the information contained in our dataset, we start by comparing our model with four benchmark univariate models.

Our first benchmark is the naive forecast, obtained from the random walk model on GDP growth. Let \(y^Q_t\) be the y-o-y GDP growth observed quarterly, that is, at month \(t=3,6,9,\ldots \), then the random walk model is: \(y^Q_t=y^Q_{t-1} +\varepsilon _t\). The second benchmark is a forecast from an autoregressive model of order two on GDP growth: \(y^Q_t=\rho _1 y^Q_{t-1} +\rho _2 y^Q_{t-2} +\varepsilon _t\). Given the high persistence in our target series introduced by the y-o-y transformation, these univariate benchmarks are inherently tough competitors to match in our real-time exercise.

Our third benchmark is a bridge model (Parigi and Schlitzer 1995). A Bridge model predicts GDP growth by using past GDP and one or more monthly indicators.Footnote 12 Formally, let \(x_{it}\) be a monthly variable observed at month t, then the Bridge model is then defined as follows:

$$\begin{aligned} y^Q_{t}= \mu + \alpha y^Q_{t-3}+\beta x^Q_{t} + \varepsilon _{t} \end{aligned}$$
(4)

where \(x^Q_t=\sum ^{3}_{j=1} \frac{1}{3} x_{t-j}\) is the monthly indicator aggregated at the quarterly frequency by a simple average. Equation (4) is estimated by OLS.

Our last benchmark is a MIDAS model. MIDAS models (Ghysels et al. 2007; Andreou et al. 2013) are similar to Bridge models, but in MIDAS models, first, the aggregated variables \(x_{t}^Q\) is constructed using a lag polynomial, and, second, for each forecast horizon it is estimated a different model:

$$\begin{aligned} y^Q_{t+h}= \mu + \beta x_{t}^Q + \varepsilon _{t} \end{aligned}$$
(5)

where \(h=\{-3,0,3\}\), \(x_{t}^Q=\sum _{m=0}^{11} \gamma (m,\theta )L^{m}x_t\), and \(\gamma (m,\theta )=\frac{ \exp (\theta _{1}m)}{\sum _m \exp (\theta _{1}m)}\). Equation (5) together with the polynomial \(\gamma (L,\theta )\) is estimated by nonlinear least squares.

In this paper, the predictions from the Bridge and from the MIDAS models are obtained by first estimating a model for each monthly indicator in the database—with the exception of PMI developing countries that was excluded because there are too few observations for this indicator—and then by averaging the prediction. Furthermore, whenever we have a missing observations in \(x_t\), we filled it by using an AR model.

Table 3 shows the root-mean-squared error (RMSE) at the end of each month of the DFM, an AR(2) model, a Random Walk, and the Bridge and the MIDAS models. The table is divided into three parts: the first part, labeled “Forecast”, reports the RMSE of the prediction of the next quarter; the second, labeled “Nowcast,” reports the RMSE of the prediction of the current quarter; and, the last, labeled “Backcast,” reports the MSE of the prediction of the previous quarter.Footnote 13

As we can see from the fact that the RMSEs in Table 3 are decreasing with each month, the DFM is able to correctly revise its GDP prediction as more data becomes available. Furthermore, compared with the univariate benchmarks, the RMSE of the DFM is consistently lower, up to a maximum reduction at the end of the first month after the reference quarter (i.e., row “backcast” in Table 3) of 55% compared with the Autoregressive model, of 15% compared with the Bridge model, and of 29% compared to the MIDAS model. This is an important finding, as it tells us that there is valuable additional information in the Indonesian high-frequency data that can be used to predict GDP growth.

Table 3 Root-mean-squared error: end of month.

In Table 4, we investigate which data release carries more information for the prediction with the DFM. Identifying such variables is particularly important since it can help policymakers understand which series to track while monitoring the Indonesian economy. More precisely, Table 4 shows the RMSE associated with each data release. We can see that some variables are particularly relevant for correctly updating the prediction of y-o-y GDP growth. Among these, the most important one are exports and imports, which account for the largest reduction in RMSE when the data are released.Footnote 14 Other relevant ones are GDP of the previous quarter, and Cement. Notice also that upon the release of some variables the “average” forecasting performance reported in Table 4 appears to deteriorate. Thus, it would be tempting to drop these variables in order to “improve” the overall forecasting performance. However, each of the variables may also improve the estimation of the model by exploiting the commonality in the data and hence make our forecasts more robust to one-off changes in a particular variable.

Table 4 Root-mean-squared error: data flow.

We conclude this section with a caveat that will apply to most of the empirical exercise. As we argued in Sect. 2, we had to use data only from 2002 onwards, and this has limited us in two ways. First, as discussed in Sect. 3, we had to restrict the number of variables to include in the model. Second, having only forecasts for 28 quarters, the robustness of our RMSE may be an issue. This is even more relevant when we attempt to assess the statistical significance of different forecast performances. To this end, in Table 3 we report Diebold and Mariano (1995) test of equal predictive accuracy. The DM statistics is asymptotically normally distributed, and although we have just 28 predictions, the p values in Table 3 was computed using the normal approximation.

4.3 Comparison against market benchmark

Another way to evaluate the forecasting performance of our model is to compare the prediction obtained with the DFM with the prediction of market operators. In Fig. 5, we compare our predictions with those of the Bloomberg Survey (BS). The BS consists of the median GDP prediction provided independently by a variable number of specialists a few days before GDP is released. Therefore, because the Bloomberg Survey is released a few days before previous quarter GDP, according to our terminology the BS prediction is a backcast, and we will compare it with our last prediction before GDP is released.

When comparing our prediction with Bloomberg, we have to be careful with respect to data revisions. The black line of the left plot in Fig. 5 represents the last vintage available of the GDP series, while the dashed black line in the right plot of Fig. 5 represents the first release of GDP growth.Footnote 15 As discussed in Sect. 2, these two series are quite different due to the fact that the Statistical office has changed both base year and the aggregation method, and, in particular from 2011 onwards the two series differ substantially. This fact is crucial when we attempt to make comparisons with truly historical forecasts. In particular, the analyst interviewed by Bloomberg were targeting the first release GDP growth (right plot), while the DFM has been estimated using—and therefore has been designed to target—the latest release of y-o-y quarterly GDP growth (left plot). Therefore, if we want to understand how good/bad the prediction produced by the DFM are compared to those of Bloomberg, we have to take into account that there is a difference in the target variable.

As we can see from Fig. 5, the Bloomberg Survey is tracking the first release well, with a RMSE of 0.249. Similarly, we can also see that our prediction is good; indeed, our RMSE is 0.287 which is just 15% worse than that of the Bloomberg Survey.

Fig. 5
figure 5

Sources: Authors’ estimates; Bloomberg and CEIC data company (both accessed 30 June 2015)

Prediction of previous quarter GDP. Notes in the left plot the black line is the last vintage available of the GDP series, and the gray line is the prediction obtained with the DFM the last day before GDP is released. In the right plot, the dashed black line is the first release of GDP, while the gray line is the median prediction from the Bloomberg Survey. Note that for comparison purposes the variables in the left and the right panel have been standardized, which is why there is no scale in the y axes. Precisely, the variables in the left panel have been standardized using the mean and the variance of GDP final release, while the variables in the right plot have been standardized using the mean and the variance of GDP first release. In other words, the black line in the left plot, and the dashed line in the right plot, have both mean zero and variance one.

Fig. 6
figure 6

Source: Authors’ estimates; Asian Development Bank, Asian Development Outlook database; CEIC Data Company; Concensus Economics, Monthly Concensus Forecasts; and International Monetary Fund, World Economic Outlook database (all accessed 30 June 2015)

Prediction of annual GDP growth rate. Notes the black line in the northwest plot is annual GDP growth, while the gray diamonds are the prediction obtained at the end of each month with the DFM. In all the other panels, the black line is annual GDP growth computed by using the GDP series released in real time, while the gray diamonds are the prediction of ADB, IMF, and CF. Data for CF are from Consensus Economics Inc.

4.4 Comparison against institutional benchmarks

Finally, by following Luciani and Ricci (2014) we can construct predictions for the current annual growth rate and compare them with those published by policy institutions.Footnote 16 In detail, we compare our predictions with those published by the ADB in the Asian Development Outlook, those published by the IMF in the World Economic Outlook, and the prediction by Consensus Forecast (CF).Footnote 17 The ADB publishes its predictions of current annual GDP growth twice a year, in April and in late September, approximately; the IMF also publishes its prediction twice a year but these are released on April and October. Predictions by CF are available each month.

The northwest (NW) panel in Fig. 6 shows annual GDP growth together with the prediction of current annual GDP growth obtained at the end of each month with the DFM. The other panels of Fig. 6 show predictions from ADB, IMF, and CF together with annual GDP growth. Note, however, that the annual GDP growth reported in the NW panel is different from that reported in the other panels. In the NW panel, we are reporting annual growth computed on the basis of the last vintage of available data (and before 2010 reconstructed as described in Sect. 2), while the other panels report annual growth computed on the basis of the real-time release of the GDP series. We do the latter because the ADB, the IMF, and CF predicted the series with this information at hand.

From Fig. 6 and the RMSE values in Table 5, we can see that the DFM is predicting annual GDP growth quite well, and, in particular, it correctly revises its prediction as more data become available during the calendar year. Furthermore, the prediction of the DFM is comparable to that of CF, and slightly superior to that of the ADB and the IMF.

We conclude this section by stressing two caveats on the results just presented: First, by using annual data, we have only 7 predictions, that means that the RMSE reported in Table 5 hinges on very few observations. Second, our exercise is pseudo real time in the sense explained in Sect. 4.1, as we avail of a better information set compared to the one exploited in real time by institutional forecasters.

Table 5 Root mean squared error: annual GDP growth.

5 Conclusion

In this paper, we have applied state-of-the-art techniques for nowcasting Indonesia’s GDP growth. Our approach is based on a Dynamic Factor model, to efficiently exploit monthly and quarterly variables and to properly account for the sequence of macroeconomic data releases.

We find that relying on market “revealed preferences” for certain indicators in the Indonesian economy is an effective guide for choosing what variables to include in our information set. To this end, we have relied on the Bloomberg platform, which tracks the relevance of each series for its subscribers, and also on our “expert judgment” by including a few indicators that we think provide extra information on the Indonesian economy. Based on this, despite using a relatively narrow set of variables, when focusing on the year-on-year growth rate, the Dynamic Factor model nowcast error falls by 55% compared with the benchmark AR model. Lacking a full time series of GDP revisions as well as for the information set used in the Dynamic Factor model we cannot assess how well our model predictions perform compared with those of experts’ forecast surveyed by Bloomberg. Still, our “pseudo-real-time” forecasting performance is comparable with the one achieved in a truly “real-time” setting by the median Bloomberg survey.

Furthermore, since our model can be used to forecast further ahead, we also compute calendar-year annual growth rates. This exercise allows us to compare the tracking of our Dynamic Factor model forecasts on a smoother and medium-term indicator of Indonesia’s growth, arguably more important for policy decisions than the (potentially erratic) quarterly growth rate. In this case our model compares well with the forecasts produced by the average of private sector expectations (Consensus Forecasts) as well as by the IMF World Economic Outlook.

Finally, our exploration into Indonesia’s data sheds light on a severe lack of valuable high-frequency statistics on economic growth, as well as on possible information gaps in the statistical framework underlying national accounts. Despite such limitations, the Dynamic Factor proposed in this paper constitutes a useful tool for policy makers and private sector experts aiming to nowcast Indonesia’s growth trajectory.