Abstract
We produce predictions of the current state of the Indonesian economy by estimating a dynamic factor model on a dataset of 11 indicators (followed closely by market operators) over the 2002–2014 period. Besides the standard difficulties associated with constructing timely indicators of current economic conditions, Indonesia presents additional challenges typical to emerging market economies where data are often scant and unreliable. By means of a pseudo-real-time forecasting exercise, we show that our model outperforms univariate benchmarks, and it does comparably well with predictions of market operators. Finally, we show that when quality of data is low, a careful selection of indicators is crucial for better forecast performance.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
It is well known that macroeconomic data are released with a substantial delay. Additionally, in emerging market economies, low-frequency data (i.e., annual national accounts) rely on a smaller array of surveys and indicators than in advanced economies, and provide a partial picture of the economy. However, complete and up-to-date information on the current state of the economy is crucial for policy makers, market participants and public institutions. Indeed, agents periodically update their forecasts, and monitoring economic conditions in real time helps them assess whether the forecasts are on track or need to be revised. Similarly, the process of policymaking often requires long term projections of the economy that heavily rely on accurate initial conditions and forecasts. Therefore, constructing timely “predictions” of current economic conditions, namely nowcasts, is of fundamental importance for decision making.
A lot of information is contained in economic indicators that are available on a quarterly, monthly, weekly, and even daily basis, and, in principle, it is possible to use this information to build “predictions” of the current state of the economy. However, high-frequency data in emerging market economies are often scant, noisy, released with a lag, and can have missing observations. This complicates the difficult task of real-time monitoring and decision making as constructing timely indicators on current economic conditions for emerging market economies presents some extra challenges.
In this paper, we focus on Indonesia, the largest economy in Southeast Asia, which is rapidly gaining influence in the world economy. With a number of high-frequency data indicators available and yet facing problems that commonly plague emerging economy datasets, Indonesia provides an interesting training case for developing a nowcasting framework that can be applied to monitor other similar economies in the region.
Two main issues emerge with regard to monitoring in real time: how many and which indicators to select and what econometric model to use to extract information from the data. In this paper, we produce “predictions” of the current state of the Indonesian economy by estimating a Dynamic Factor model on a dataset of 11 indicators (followed closely by market operators) over the 2002–2014 period. Our choice of the model is based on the fact that it is parsimonious, it is able to cope with missing data and mixed frequency indicators, and it can potentially be estimated on a large number of variables. Further, since the seminal paper of Giannone et al. (2008), this model has become a standard tool for monitoring economic activity, as it has proved to be successful in nowcasting several economies, including emerging ones such as China Giannone et al. (2014) and Brazil Bragoli et al. (2015).
The rest of the paper proceeds as follows. Section 2 presents Indonesia’s GDP data, and discusses the problems of having several GDP series with different base years, and no official seasonally adjusted data. Section 3 discusses our nowcasting procedures and is divided in two parts: in the first part we describe the process of choosing a set of indicators that contains useful information on economic activity, and in the second we present the application of a dynamic factor model to Indonesia’s data.
Section 4 presents the evaluation of our model. Several results emerge. First, incorporating high-frequency data in a rigorous framework leads to an improvement in the forecast accuracy of Indonesia’s economy compared with simple univariate benchmarks. Second, too many variables are not always optimal for the purpose of monitoring as they can be noisy or uninformative (Bańbura et al. 2013; Luciani 2014b), particularly so when the target variable, namely Indonesia’s GDP growth, has limited number of observations. A careful selection of meaningful variables improves the forecast performance. Third, our model does well in predicting quarterly GDP growth when compared with private forecasters such as Bloomberg, and also does well in predicting annual GDP growth when compared with institutional forecasts of the International Monetary Fund and the Asian Development Bank.
Finally, Sect. 5 concludes.
2 Indonesia’s GDP data: atterns and issues
In this section we present Indonesia’s GDP data, and discuss a number of issues in the data that need to be carefully tackled before starting any monitoring process.
The first issue is that there is no single long series for GDP available from official statistical sources. The left plot in Fig. 1 shows the level of quarterly GDP at constant prices in trillions of rupiah from 2000 to 2014, from the national source. The right-hand side plot shows the computed year-on-year (y-o-y) growth rate of quarterly GDP from 2000. The three lines refer to different base years, 1993, 2000 and 2010 respectively. Each line, however, is only available in a limited time span in that 1993 prices is available for 1993–2003, 2000 prices is available for 2000–2014 and the latest 2010 prices is available since 2010 onwards only.Footnote 1
The aggregation methodology was common between the 1993 and 2000 base years, but changed from SNA 1993 to SNA 2008 for the 2010 base year.Footnote 2 Base changes are common for GDP data, as they can incorporate changes in the economy’s structural composition. However, as seen in Fig. 1, growth rates in the few years of overlap between series were also affected.Footnote 3 Hence the composition of the aggregate series and methodologies used in their construction is brought into question, exacerbated by a lack of publicly available information on procedures used.
The second issue is related to which time span to use in conducting the econometric analysis. Indeed, GDP growth in Indonesia is characterized by large fluctuations due to different crises hitting the economy. In particular, the Asian financial crisis in the late 1990s stands out as a unique episode for Indonesia’s growth path with GDP falling more than 15% in 1998 (not shown in Fig. 1). Growth recovered to about 5% in 2000 and stabilized somewhat at this yearly rate with a significantly lower volatility since 2002. Considering this pattern, we exclude the Asian financial crisis years from our sample and use the series only from 2002. This is unavoidable because the evolution of GDP growth over the financial crisis would dominate our estimates if not excluded. It would capture features of the data (i.e., co-movements) that are potentially spurious and misleading for the nowcasting exercise, as the relationships that held during the financial crisis could be different from those that hold during expansions and more moderate recessions.
The third issue with Indonesia’s GDP data is that the series exhibits a marked seasonal pattern when we compute quarter-on-quarter (q-o-q) growth rates (Fig. 2). Yet there is no seasonally adjusted GDP series available from official sources, which leaves us with the problem of having to deal with seasonality in the data, particularly when trying to combine series with different base years to obtain coherent and long time series. Indeed, as shown in the right-hand-side panel in Fig. 2, the seasonality in GDP growth data with 2010 base year exhibits a clear departure from the seasonal pattern in the series with 2000 base year. This adds to the complication of splicing the different series together. Suppose we start with the latest available GDP series that is of 2010 base, and extend it backwards using the q-o-q growth rates of the previous base series. As seen in the left-hand side plot in Fig. 3, this produces inconsistent seasonal patterns within the spliced series—that is, between the (growth of) actual data and the extended data. On the contrary, the reconstructed series based on y-o-y growth rates instead does not seem to have the same shortcoming.
The right-hand side panel of Fig. 3 plots the constructed long time series for GDP based on the two different splicing methods. It shows that while the series constructed using the y-o-y growth rates has consistent dynamics with that of previous base year, the one generated using the q-o-q growth rates has not. The latter exhibits a jump in GDP during the four quarters when the two shorter series were spliced. For this reason, splicing the series using a y-o-y growth seems to warrant a more consistent long series. Furthermore, we will also continue to use y-o-y growth rates in our analysis going forward.
Finally, we are aware that using y-o-y growth tackles the issue of seasonality and makes series smoother, but it also lags q-o-q growth. However, the alternative of using some standard procedure to seasonally adjust the series, and then evaluating the forecasting performance on the resulting unofficial series would be of little practical use. Not only would it entail defining of a new, somewhat arbitrary target variable, but it would also be of little relevance to policy makers, who focus on the official y-o-y GDP growth. Of course, had the Indonesia Statistics provided official q-o-q seasonally adjusted data, we would have used that as our target for the nowcast.
3 Nowcasting
Typically, GDP data provide the most comprehensive picture of the economy, by aggregating activity of different sectors. Unfortunately, the data come with long delays and are not available on a high-frequency basis. In most cases, data are published quarterly and, in some developing countries, even annually.
For Indonesia, GDP data are reported quarterly with a delay of about 5 weeks. This means that the growth rate of the economy in the first quarter that ends in March is not known until the second week of May. While this is a reasonable delay compared to most emerging and some advanced economies, it is still insufficient for the purpose of real-time monitoring, as it is not possible to make an assessment of the strength of economic activity for almost 5 months into the year.
To gauge the current state of the Indonesian economy in real time, we need to construct a prediction of GDP growth before the official data are released. This means that at each point in time we want to predict not only the current and next quarter estimates for GDP growth (henceforth nowcast and forecasts), but also, wherever the official data have not been published yet, the past quarter GDP growth (backcast).
Ideally, if the true data sources and compilation methods for GDP were known, we could simply attempt to reverse engineer the process performed quarterly by the Indonesian statistical office but at a higher frequency. Unfortunately, as discussed in Sect. 2, very little information on these methods is available from the statistical agency. This forces us (1) to build an information set that is informative for describing the GDP growth process, possibly by including variables that may be outside the scope of the statistical office but may still contain useful leading information, and (2) to choose an econometric model to build our prediction.
In the next two subsections we address these issues. In particular, in Sect. 3.1, we explain how we construct the database, while in Sect. 3.2 we introduce Dynamic Factor models (DFMs). We choose to use a DFM since it is a parsimonious model, it copes with missing data and mixed frequency, and it can potentially exploit a large information set. Furthermore, being framed in a Kalman filter framework, it allows a formal reading of the news flow coming from successive data releases. Finally, DFMs proved to be very successful in nowcasting several economies, including emerging ones.Footnote 4
3.1 What variables to select?
There is a wide set of potentially useful monthly and quarterly series that could help extract information on the state of the Indonesian economy. However, the limited number of time series observations available for our target quarterly y-o-y GDP growth (see Sect. 2), constrains our choice of both the variables and the model.
In principle, DFMs are consistently estimated when the number of variables is diverging to infinity, and in practice they are usually estimated on relatively large datasets. However, when the time series observation is severely limited, including too many variables is likely to introduce estimation uncertainty, which ultimately worsens the prediction performance. This is a particular source of concern when dealing with Indonesian data, as only few series display a marked comovement with the GDP quarterly dynamics and hence the risk is to introduce excessive noise in the model estimation. Furthermore, the literature on nowcasting with large-dimensional DFMs has reached the conclusion that, unless one needs to monitor the data flow with many data releases, there is no gain from too large a database as long as the variables on which the model is estimated are appropriately selected (Bańbura et al. 2013; Luciani 2014a, b). This then leads to questions on how many variables should be selected and on what basis.
A possible strategy to draw from a large pool of variables could be to rely on a purely mechanical statistical selection procedure. For example, Bai and Ng (2008) suggest selecting with the Least Angle Regressions (LARS algorithm) only those variables that are really informative for forecasting the target variable, while Camacho and Perez-Quiros (2010) suggest first selecting a core group of variables and then evaluating if other possible predictors are useful.
An alternative strategy, pioneered by Bańbura et al. (2013) and followed by Luciani and Ricci (2014), Giannone et al. (2014), and Bragoli et al. (2015), is to exploit the “revealed preferences” of professional forecasters who follow the Indonesian economy on the Bloomberg platform. These analysts subscribe to the Bloomberg news alert for specific data releases of the variables that they monitor and use them to form their expectations on current and future fundamentals of Indonesia. As Bloomberg constantly ranks the analysts’ demand for these alerts by constructing a relevance index for each macroeconomic indicator, we can select variables based on this relevance index.Footnote 5
We adopt the latter approach, as the automatic selection approach risks leading to an unstable choice of variables in a real-time scenario.Footnote 6 This instability would not only be difficult to justify from an economic standpoint, but it would also complicate the interpretation of the forecasts’ revisions. Moreover, we also tried the automatic selection approach, and the performance of our model was worse in this case than when we used the revealed preference approach (see Appendix A.1).
It turns out that for Indonesia, only a relatively small number of macroeconomic series are tracked in real time by the markets (see Table 1). On the one hand, there are indicators describing macroeconomic developments (e.g., GDP itself, car sales, exports, imports and manufacturing PMI). Moreover, given their direct influence on the foreign exchange and fixed income markets, analysts also monitor indicators that directly describe the monetary policy stance. These are the central bank reference interest rate as well as key monetary aggregates.
Starting from the set of indicators in Table 1 followed by business analysts we constructed our database as follows:
-
(1)
We excluded all indicators that either had too few observations, or we could not retrieve. This is the case for PMI, for which data are available only starting in June 2012, and for Danareksa Consumer Confidence and Motorcycle Sales for which we were not able to retrieve data.Footnote 7
-
(2)
We screened each of the remaining indicators to understand whether they are followed by analysts because they convey information on the state of the real economy, or they are directly related to the stance of the Central Bank and its balance sheet. Therefore, because the Bank of Indonesia has an inflation target, we discarded CPI, and we also removed Foreign Reserves, and Net Foreign Assets as they are mainly related to the foreign exchange policy. However, we included monetary “instruments” like the bank reference rate and the M2, as proxies for the monetary stance, which itself should be forward-looking.
-
(3)
We then excluded those variables that are the sum of other variables in the database or are too similar to other series. So we kept Imports and Exports, but we excluded Current Account; and we kept M2, but we discarded M1.
-
(4)
Finally, we used our “expert judgment” and added a few indicators that we think provide some extra information about the Indonesian economy. To capture information regarding the increasing role of construction activity for the Indonesian economy, we included domestic cement consumption. To account for spillovers from the foreign sector into the domestic economy, we included a variable from the very timely Market PMI manufacturing survey. In particular, we included the aggregate for emerging economies, which is dominated by developments in China as well as countries in Asia. Finally, we also included a sectoral breakdown of the imports series to better capture the possibly different lead/lag characteristics of each of these series with GDP growth.
By following this strategy, we end up with a dataset of 10 macroeconomic indicators plus GDP (see Table 2). While GDP and Business Tendency Index are quarterly series, the remaining are at monthly frequency. The column “Delay” reports the publication delay expressed in number of days in our stylized calendar and shows substantial differences between series in terms of their publication delay. For example, the PMI for developing economies is published just 4 days after the reference month, while data on imports are released a month after the reference month.Footnote 8
A notable exclusion from our database is Industrial Production, particularly so because this variable is often used by analysts to assess high-frequency movements in economic activity in emerging market economies.Footnote 9 The underlying assumption is that movements in the industrial sector are a good approximation of aggregate economic activity. While the assumption may hold well for some economies, in others, the cyclical component of the index is not found to be sufficiently synchronized with GDP (Fulop and Gyomai 2012). The correlation between Indonesia’s GDP growth and the y-o-y growth rate of industrial production is in fact relatively weak (standing at 0.35 in the full sample) and displays hardly any comovement since 2010 (see Fig. 4). This is likely related to the decreasing contribution to GDP growth from the manufacturing sector, off-set by other sectors like construction and services, as the share of industrial production has been declining since 2002 and now accounts for about one-third of GDP.
3.2 Dynamic factor models
Factor models are based on the idea that macroeconomic fluctuations are the result of a few macroeconomic shocks that affect the whole economy and a number of sectoral/regional shocks that affect a part of the economy. Therefore, each variable in the dataset can be decomposed into a common part and an idiosyncratic part, where the common part is assumed to be characterized by a small number of common factors (\({\mathbf {f}}_t\)) that are time series processes meant to capture the comovement in the data (i.e., the business cycle).
Formally, let \(x_{it}\) be the i th stationary variable observed at month t, then
where \({\mathbf {f}}_t\) is an \(r\times 1\) vector (with \(r\ll n\)) containing the common factors, and \( \xi _{it}\) is the i th idiosyncratic component. The vector of common factors evolve over time as a VAR(p) process driven by the common shocks \({\mathbf {u}}_{t}\sim \mathcal {N}(0,{\mathbf {I}}_r)\), while each idiosyncratic component follows an independent AR(1) model driven by the idiosyncratic shocks \(e_{it}\):Footnote 10
Equations (1)–(3) define the DFM used in this paper. This is the model studied in Doz et al. (2011, 2012), which is a special case of the model studied in Forni et al. (2009). In this model, the common shocks and the idiosyncratic shocks are assumed to be uncorrelated at all leads and lags, while the idiosyncratic shocks are allowed to be cross-sectionally correlated, albeit by a limited amount (approximate factor structure). For a formalization of the assumptions and for a more comprehensive treatment of the DFM, we refer the reader to the aforementioned references and to the survey by Luciani (2014b).
Model (1)–(3) can be estimated by Principal Components (Stock and Watson 2002a; Bai 2003), by using a two-step estimator based on the Kalman Filter and Principal Components (Doz et al. 2011), or by maximum likelihood techniques through the EM algorithm (Doz et al. 2012). In this paper, we will use maximum likelihood, and in particular we will use the EM algorithm proposed by Bańbura and Modugno (2014), which can handle both mixed frequencies and missing data (see Appendix A.2).Footnote 11
4 Empirics
4.1 The forecasting exercise
To evaluate the performance of our model, we perform a pseudo real-time out-of-sample exercise. Predictions of Indonesian GDP growth are produced according to a recursive scheme, where the first sample starts in July 2002 and ends in December 2007, while the last sample starts in July 2002 and ends in December 2014. The model is estimated at the beginning of each quarter using only information available as of the first day of the quarter, and then the parameters are held fixed until the next quarter. For the estimation, we include two factors (\(r = 2\)) and two lags (\(p = 2\)) in the VAR model governing the evolution over time of the factors. An extensive discussion of why we included two factors can be found in Appendix A.3.
To perform our, exercise we construct real-time vintages by replicating the pattern of data availability implied by the stylized calendar (Table 2), and every time new data are released, we update the prediction based only on information actually available at that time. We label this exercise pseudo real time since we do not have access to the actual set of data vintages as they were released over time. As to GDP growth, revisions are very infrequent and relate to changes in national accounts methods rather than to the arrival of more complete information within a year. That is, quarterly figures are treated as the final number until a base change occurs. In those occasions the statistical office can revise substantially the growth profile of GDP, as documented in Sect. 2. Similar changes may occur also for the other variables in our database, but no information regarding the existence of well defined revision schedule is available.
4.2 Comparison against statistical benchmark
To judge the performance of our model and to evaluate the information contained in our dataset, we start by comparing our model with four benchmark univariate models.
Our first benchmark is the naive forecast, obtained from the random walk model on GDP growth. Let \(y^Q_t\) be the y-o-y GDP growth observed quarterly, that is, at month \(t=3,6,9,\ldots \), then the random walk model is: \(y^Q_t=y^Q_{t-1} +\varepsilon _t\). The second benchmark is a forecast from an autoregressive model of order two on GDP growth: \(y^Q_t=\rho _1 y^Q_{t-1} +\rho _2 y^Q_{t-2} +\varepsilon _t\). Given the high persistence in our target series introduced by the y-o-y transformation, these univariate benchmarks are inherently tough competitors to match in our real-time exercise.
Our third benchmark is a bridge model (Parigi and Schlitzer 1995). A Bridge model predicts GDP growth by using past GDP and one or more monthly indicators.Footnote 12 Formally, let \(x_{it}\) be a monthly variable observed at month t, then the Bridge model is then defined as follows:
where \(x^Q_t=\sum ^{3}_{j=1} \frac{1}{3} x_{t-j}\) is the monthly indicator aggregated at the quarterly frequency by a simple average. Equation (4) is estimated by OLS.
Our last benchmark is a MIDAS model. MIDAS models (Ghysels et al. 2007; Andreou et al. 2013) are similar to Bridge models, but in MIDAS models, first, the aggregated variables \(x_{t}^Q\) is constructed using a lag polynomial, and, second, for each forecast horizon it is estimated a different model:
where \(h=\{-3,0,3\}\), \(x_{t}^Q=\sum _{m=0}^{11} \gamma (m,\theta )L^{m}x_t\), and \(\gamma (m,\theta )=\frac{ \exp (\theta _{1}m)}{\sum _m \exp (\theta _{1}m)}\). Equation (5) together with the polynomial \(\gamma (L,\theta )\) is estimated by nonlinear least squares.
In this paper, the predictions from the Bridge and from the MIDAS models are obtained by first estimating a model for each monthly indicator in the database—with the exception of PMI developing countries that was excluded because there are too few observations for this indicator—and then by averaging the prediction. Furthermore, whenever we have a missing observations in \(x_t\), we filled it by using an AR model.
Table 3 shows the root-mean-squared error (RMSE) at the end of each month of the DFM, an AR(2) model, a Random Walk, and the Bridge and the MIDAS models. The table is divided into three parts: the first part, labeled “Forecast”, reports the RMSE of the prediction of the next quarter; the second, labeled “Nowcast,” reports the RMSE of the prediction of the current quarter; and, the last, labeled “Backcast,” reports the MSE of the prediction of the previous quarter.Footnote 13
As we can see from the fact that the RMSEs in Table 3 are decreasing with each month, the DFM is able to correctly revise its GDP prediction as more data becomes available. Furthermore, compared with the univariate benchmarks, the RMSE of the DFM is consistently lower, up to a maximum reduction at the end of the first month after the reference quarter (i.e., row “backcast” in Table 3) of 55% compared with the Autoregressive model, of 15% compared with the Bridge model, and of 29% compared to the MIDAS model. This is an important finding, as it tells us that there is valuable additional information in the Indonesian high-frequency data that can be used to predict GDP growth.
In Table 4, we investigate which data release carries more information for the prediction with the DFM. Identifying such variables is particularly important since it can help policymakers understand which series to track while monitoring the Indonesian economy. More precisely, Table 4 shows the RMSE associated with each data release. We can see that some variables are particularly relevant for correctly updating the prediction of y-o-y GDP growth. Among these, the most important one are exports and imports, which account for the largest reduction in RMSE when the data are released.Footnote 14 Other relevant ones are GDP of the previous quarter, and Cement. Notice also that upon the release of some variables the “average” forecasting performance reported in Table 4 appears to deteriorate. Thus, it would be tempting to drop these variables in order to “improve” the overall forecasting performance. However, each of the variables may also improve the estimation of the model by exploiting the commonality in the data and hence make our forecasts more robust to one-off changes in a particular variable.
We conclude this section with a caveat that will apply to most of the empirical exercise. As we argued in Sect. 2, we had to use data only from 2002 onwards, and this has limited us in two ways. First, as discussed in Sect. 3, we had to restrict the number of variables to include in the model. Second, having only forecasts for 28 quarters, the robustness of our RMSE may be an issue. This is even more relevant when we attempt to assess the statistical significance of different forecast performances. To this end, in Table 3 we report Diebold and Mariano (1995) test of equal predictive accuracy. The DM statistics is asymptotically normally distributed, and although we have just 28 predictions, the p values in Table 3 was computed using the normal approximation.
4.3 Comparison against market benchmark
Another way to evaluate the forecasting performance of our model is to compare the prediction obtained with the DFM with the prediction of market operators. In Fig. 5, we compare our predictions with those of the Bloomberg Survey (BS). The BS consists of the median GDP prediction provided independently by a variable number of specialists a few days before GDP is released. Therefore, because the Bloomberg Survey is released a few days before previous quarter GDP, according to our terminology the BS prediction is a backcast, and we will compare it with our last prediction before GDP is released.
When comparing our prediction with Bloomberg, we have to be careful with respect to data revisions. The black line of the left plot in Fig. 5 represents the last vintage available of the GDP series, while the dashed black line in the right plot of Fig. 5 represents the first release of GDP growth.Footnote 15 As discussed in Sect. 2, these two series are quite different due to the fact that the Statistical office has changed both base year and the aggregation method, and, in particular from 2011 onwards the two series differ substantially. This fact is crucial when we attempt to make comparisons with truly historical forecasts. In particular, the analyst interviewed by Bloomberg were targeting the first release GDP growth (right plot), while the DFM has been estimated using—and therefore has been designed to target—the latest release of y-o-y quarterly GDP growth (left plot). Therefore, if we want to understand how good/bad the prediction produced by the DFM are compared to those of Bloomberg, we have to take into account that there is a difference in the target variable.
As we can see from Fig. 5, the Bloomberg Survey is tracking the first release well, with a RMSE of 0.249. Similarly, we can also see that our prediction is good; indeed, our RMSE is 0.287 which is just 15% worse than that of the Bloomberg Survey.
4.4 Comparison against institutional benchmarks
Finally, by following Luciani and Ricci (2014) we can construct predictions for the current annual growth rate and compare them with those published by policy institutions.Footnote 16 In detail, we compare our predictions with those published by the ADB in the Asian Development Outlook, those published by the IMF in the World Economic Outlook, and the prediction by Consensus Forecast (CF).Footnote 17 The ADB publishes its predictions of current annual GDP growth twice a year, in April and in late September, approximately; the IMF also publishes its prediction twice a year but these are released on April and October. Predictions by CF are available each month.
The northwest (NW) panel in Fig. 6 shows annual GDP growth together with the prediction of current annual GDP growth obtained at the end of each month with the DFM. The other panels of Fig. 6 show predictions from ADB, IMF, and CF together with annual GDP growth. Note, however, that the annual GDP growth reported in the NW panel is different from that reported in the other panels. In the NW panel, we are reporting annual growth computed on the basis of the last vintage of available data (and before 2010 reconstructed as described in Sect. 2), while the other panels report annual growth computed on the basis of the real-time release of the GDP series. We do the latter because the ADB, the IMF, and CF predicted the series with this information at hand.
From Fig. 6 and the RMSE values in Table 5, we can see that the DFM is predicting annual GDP growth quite well, and, in particular, it correctly revises its prediction as more data become available during the calendar year. Furthermore, the prediction of the DFM is comparable to that of CF, and slightly superior to that of the ADB and the IMF.
We conclude this section by stressing two caveats on the results just presented: First, by using annual data, we have only 7 predictions, that means that the RMSE reported in Table 5 hinges on very few observations. Second, our exercise is pseudo real time in the sense explained in Sect. 4.1, as we avail of a better information set compared to the one exploited in real time by institutional forecasters.
5 Conclusion
In this paper, we have applied state-of-the-art techniques for nowcasting Indonesia’s GDP growth. Our approach is based on a Dynamic Factor model, to efficiently exploit monthly and quarterly variables and to properly account for the sequence of macroeconomic data releases.
We find that relying on market “revealed preferences” for certain indicators in the Indonesian economy is an effective guide for choosing what variables to include in our information set. To this end, we have relied on the Bloomberg platform, which tracks the relevance of each series for its subscribers, and also on our “expert judgment” by including a few indicators that we think provide extra information on the Indonesian economy. Based on this, despite using a relatively narrow set of variables, when focusing on the year-on-year growth rate, the Dynamic Factor model nowcast error falls by 55% compared with the benchmark AR model. Lacking a full time series of GDP revisions as well as for the information set used in the Dynamic Factor model we cannot assess how well our model predictions perform compared with those of experts’ forecast surveyed by Bloomberg. Still, our “pseudo-real-time” forecasting performance is comparable with the one achieved in a truly “real-time” setting by the median Bloomberg survey.
Furthermore, since our model can be used to forecast further ahead, we also compute calendar-year annual growth rates. This exercise allows us to compare the tracking of our Dynamic Factor model forecasts on a smoother and medium-term indicator of Indonesia’s growth, arguably more important for policy decisions than the (potentially erratic) quarterly growth rate. In this case our model compares well with the forecasts produced by the average of private sector expectations (Consensus Forecasts) as well as by the IMF World Economic Outlook.
Finally, our exploration into Indonesia’s data sheds light on a severe lack of valuable high-frequency statistics on economic growth, as well as on possible information gaps in the statistical framework underlying national accounts. Despite such limitations, the Dynamic Factor proposed in this paper constitutes a useful tool for policy makers and private sector experts aiming to nowcast Indonesia’s growth trajectory.
Notes
GDP for 2013 and 2014 based on the 2000 base year are preliminary figures projected by the statistical office: https://www.bps.go.id/linkTabelStatis/view/id/1217.
A detailed explanation of the System of National Accounts (SNA) can be found at http://unstats.un.org/unsd/nationalaccount/sna.asp.
In fact, even nominal GDP data for the overlapping time periods are not comparable between the different bases for GDP series. Data are available in Table VII.1 at http://www.bi.go.id/en/statistik/seki/terkini/riil/Contents/Default.aspx.
A non-exhaustive list of countries and papers is: the USA (Bańbura et al. 2011, 2013; Giannone et al. 2008), the Euro Area (Angelini et al. 2011; Bańbura and Rünstler 2011), Germany (Marcellino and Schumacher 2010), France (Barhoumi et al. 2010), Ireland (D’Agostino et al. 2012), Norway (Aastveit and Trovik 2012; Luciani and Ricci 2014), China (Giannone et al. 2014), Brazil (Bragoli et al. 2015), New Zealand (Matheson 2010), the Global Economy (Matheson 2013), and Latin America (Liu et al. 2012).
The implicit assumption here is that since based on their expectations on future fundamentals analysts allocate their investments, they know which series to monitor to form appropriate expectations on GDP growth.
As shown by De Mol et al. (2008) since there is a lot of comovement among macroeconomic data, the set of indicators selected with statistical criteria is extremely unstable.
As the index compiled by Danareksa was not available to us, we experimented with the household consumer confidence index compiled by the Bank of Indonesia. The latter, however, displays a trending pattern that appears difficult to reconcile with the state of the economy, and we hence discarded it.
For the policy rate, we adopted the assumption that it is observed the first day of the month following the reference month. For example, the policy rate for January is observed on February 1. Of course this is an approximation because we know what the policy rate is everyday in January. In principle, we could have accounted for daily observations in the interest rate since DFMs allow us to do so (Modugno 2014). However, Bańbura et al. (2013) have shown that including data at the daily frequency is not particularly useful for nowcasting GDP, so we adopted the convention that the interest rate is monthly and is observed on the first day after the reference month.
It is worth noting here that the use of y-o-y transformations likely add persistence also in the idiosyncratic component so that a higher order autoregressive process might be more appropriate. However, increasing the order of the autoregressive process of the idiosyncratic component from an AR(1) to an AR(2) implies adding eleven extra states to the model. In our case, where we have few time series observations, and the quality of the data is doubtfull, adding eleven extra states in the Kalman Filter is feasible, but for sure costly in terms of computation accurancy. In summary, although an AR(1) may be not enough for capturing the whole dynamic in the idiosyncratic process, the cost of adding an extra lag is way higher than the benefits in terms of forecasting accuracy.
As pointed out by Baffigi et al. (2004), differently from DFMs, bridge models are not concerned with particular assumption underlying the DGP of the data, but rather, the inclusion of specific explanatory indicators is based on the simple statistical fact that they embody timely updated information about the target GDP growth series.
Note that the AR(2) and the RW do not update the prediction beyond month 2 of the quarter, but update it between month 1 and month 2 due to the GDP data release. Also, the fact that the RMSE of the AR(2) and the RW in “Backcast” month 1, and “Nowcast” month 2 is different is an artifact due to the fact that when we “Backcast” our target variable is GDP from Q4 2007 to Q3 2014, whereas when we “Nowcast” our target variable is GDP from Q1 2008 to Q4 2014. Of course, had the evaluation period been longer, we would not have this problem.
The average share of exports to GDP in Indonesia in 2010–2014 was about 20% and even slightly lower for imports (at 19%). Trade data, however, provide information about activity across sectors in the economy, and the results in Table 4 show that this is useful information for predicting GDP.
Note that this is all the “real-time” information that we have available since, as mentioned in Sect. 4.1, we do not have any “real-time” information for the other variables in the dataset.
Let \(X^y_q=100\times \log (\hbox {GDP}^y_q)\) be GDP of the q th quarter of year y, and let \(Z^y=100\times \log (\hbox {GDP}^y)\) be GDP of year y. Then, by definition \(x^y_q=X^y_q-X^{y-1}_{q}\) is the y-o-y growth rate, while \(z^y=Z^y-Z^{y-1}\) is the annual growth rate. Following Mariano and Murasawa (2003), we make use of the approximation \(Z^y\approx (X^y_1+X^y_2+X^y_3+X^y_4)/4\), which allow us to write the annual growth rate as a function of y-o-y growth rates: \(z^y=Z^y-Z^{y-1}\approx (X^y_1+X^y_2+X^y_3+X^y_4)/4 - (X^{y-1}_1+X^{y-1}_2+X^{y-1}_3+X^{y-1}_4)/4 = (x^y_4+x^y_3+x^y_2+x^y_1)/4\).
Consensus Economics Inc. forecasts comprise quantitative predictions of private sector forecasters. Each month survey participants are asked for their forecasts of a range of macroeconomic and financial variables for the major economies.
By using the LARS algorithm as in Bai and Ng (2008), we selected 10 out of all available indicators that were retrieved for Indonesia.
With the exception of few technical and minor details, this is essentially the same model used by Giannone et al. (2008). In practice, every time we update the prediction we use a balanced panel to estimate the factor with PCA. Then we fit an AR(2) on the estimated factor and use the Kalman Filter to account for missing values at the end of the sample. Finally we estimate Eq. (4) with OLS. An alternative way to perform this exercise would be to use the collapsed dynamic factor model of Bräuning and Koopman (2014).
References
Aastveit KA, Trovik T (2012) Nowcasting Norwegian GDP: the role of asset prices in a small open economy. Empir Econ 42(1):95–119
Andreou E, Ghysels E, Kourtellos A (2013) Should macroeconomic forecasters use daily financial data and how? J Bus Econ Stat 31(2):240–251
Angelini E, Camba-Mendez G, Giannone D, Reichlin L, Rünstler G (2011) Short-term forecasts of euro area GDP growth. Econom J 14(1):C25–C44
Baffigi A, Golinelli R, Parigi G (2004) Bridge models to forecast the euro area GDP. Int J Forecast 20(3):447–460
Bai J (2003) Inferential theory for factor models of large dimensions. Econometrica 71:135–171
Bai J, Ng S (2008) Forecasting economic time series using targeted predictors. J Econom 146(2):304–317
Bańbura M, Modugno M (2014) Maximum likelihood estimation of factor models on data sets with arbitrary pattern of missing data. J Appl Econom 29(1):133–160
Bańbura M, Rünstler G (2011) A look into the factor model black box: publication lags and the role of hard and soft data in forecasting GDP. Int J Forecast 27(2):333–346
Bańbura M, Giannone D, Reichlin L (2011) Nowcasting. In: Clements MP, Hendry DF (eds) Oxford handbook on economic forecasting. Oxford University Press, New York
Bańbura M, Giannone D, Modugno M, Reichlin L (2013) Now-casting and the real-time data-flow. In: Elliott G, Timmermann A (eds) Handbook of economic forecasting, vol 2. Elsevier-North Holland, Amsterdam, pp 195–237
Barhoumi K, Darné O, Ferrara L (2010) Are disaggregate data useful for factor analysis in forecasting French GDP? J Forecast 29(1–2):132–144
Bragoli D, Metelli L, Modugno M (2015) The importance of updating: evidence from a Brazilian nowcasting model. J Bus Cycle Meas Anal 2015(1):5–22
Bräuning F, Koopman SJ (2014) Forecasting macroeconomic variables using collapsed dynamic factor analysis. Int J Forecast 30(3):572–584
Camacho M, Perez-Quiros G (2010) Introducing the euro-sting: short-term indicator of euro area growth. J Appl Econom 25(4):663–694
D’Agostino A, McQuinn K, O’Brien D (2012) Now-casting Irish GDP. OECD J Bus Cycle Meas Anal 2012(2):21–31
D’Agostino A, Giannone D, Lenza M, Modugno M (2016) Nowcasting business cycles: a Bayesian approach to heterogeneous dynamic factor models. In: Koopman SJ, Hillebrand E (eds) Dynamic factor models, vol 35. Advances in econometrics. Bingley, Emerald
De Mol C, Giannone D, Reichlin L (2008) Forecasting using a large number of predictors: is bayesian shrinkage a valid alternative to principal components? J Econom 146:318–328
Diebold FX, Mariano RS (1995) Comparing predictive accuracy. J Bus Econ Stat 13:253–263
Doz C, Giannone D, Reichlin L (2011) A two-step estimator for large approximate dynamic factor models based on kalman filtering. J Econom 164(1):188–205
Doz C, Giannone D, Reichlin L (2012) A quasi maximum likelihood approach for large approximate dynamic factor models. Rev Econ Stat 94(4):1014–1024
Forni M, Hallin M, Lippi M, Reichlin L (2003) Do financial variables help forecasting inflation and real activity in the Euro Area? J Monet Econ 50(6):1243–1255
Forni M, Giannone D, Lippi M, Reichlin L (2009) Opening the black box: structural factor models versus structural VARs. Econom Theory 25(5):1319–1347
Fulop G, Gyomai G (2012) Transition of the OECD CLI system to a GDP-based business cycle target. OECD composite leading indicators background note
Ghysels E, Sinko A, Valkanov R (2007) Midas regressions: further results and new directions. Econom Rev 26(1):53–90
Giannone D, Reichlin L, Small D (2008) Nowcasting: the real-time informational content of macroeconomic data. J Monet Econ 55(4):665–676
Giannone D, Agrippino SM, Modugno M, Reichlin L (2014) Nowcasting China. Mimeo
Jungbacker B, Koopman SJ, van der Wel M (2011) Maximum likelihood estimation for dynamic factor models with missing data. J Econ Dyn Control 35(8):1358–1368
Kasri R, Kassim SH (2009) Empirical determinants of saving in the Islamic banks: evidence from Indonesia. JKAU Islamic Econ 22(2):181–201
Kubo A (2009) Monetary targeting and inflation: evidence from Indonesia’s post-crisis experience. Econ Bull 29(3):1805–1813
Liu P, Matheson T, Romeu R (2012) Real-time forecasts of economic activity for Latin American economies. Econ Model 29(4):1090–1098
Luciani M (2014a) Forecasting with approximate dynamic factor models: the role of non-pervasive shocks. Int J Forecast 30(1):20–29
Luciani M (2014b) Large-dimensional dynamic factor models in real-time: a survey. Université libre de Bruxelles, Bruxelles
Luciani M, Ricci L (2014) Nowcasting Norway. Int J Cent Bank 10:215–248
Maćkowiak B (2007) External shocks, US monetary policy and macroeconomic fluctuations in emerging markets. J Monet Econ 54(8):2512–2520
Marcellino M, Schumacher C (2010) Factor MIDAS for nowcasting and forecasting with ragged-edge data: a model comparison for German GDP. Oxf Bull Econ Stat 72(4):518–550
Mariano RS, Murasawa Y (2003) A new coincident index of business cycles based on monthly and quarterly series. J Appl Econom 18(4):427–443
Matheson TD (2010) An analysis of the informational content of New Zealand data releases: the importance of business opinion surveys. Econ Model 27(1):304–314
Matheson TD (2013) New indicators for tracking growth in real time. J Bus Cycle Meas Anal 2013(1):51–71
Modugno M (2014) Now-casting inflation using high frequency data. Int J Forecast 29(4):664–675
Parigi G, Schlitzer G (1995) Quarterly forecasts of the italian business cycle by means of monthly economic indicators. J Forecast 14(2):117–141
Raghavan M, Dungey M (2015) Should ASEAN-5 monetary policy-makers act pre-emptively against stock market bubbles? Appl Econ 47(11):1086–1105
Stock JH, Watson MW (2002a) Forecasting using principal components from a large number of predictors. J Am Stat Assoc 97:1167–1179
Stock JH, Watson MW (2002b) Macroeconomic forecasting using diffusion indexes. J Bus Econ Stat 20(2):147–162
Acknowledgements
We would like to thank Dennis Sorino for excellent research assistance. This paper was written while Matteo Luciani was chargé de recherches F.R.S.-F.N.R.S., and he gratefully acknowledges their financial support. The views expressed in this paper are those of the authors and do not necessarily reflect the views and policies of the Asian Development Bank, of the Banca d’Italia or the Eurosystem, and of the Board of Governors or the Federal Reserve System.
Author information
Authors and Affiliations
Corresponding author
Appendix
Appendix
1.1 Robustness
In this Appendix we show robustness checks with respect to the composition of the dataset.
The first robustrness check aims to answer the question “what if we had selected variables in a different way?”. As discussed in Sect. 3.1, we constructed the database by selecting variables from the set of indicators that market analysts are monitoring, however, a priori there are other equally valid selection criteria. In particular, in performing this robustness check we considered (a) the option of using all the indicators in Table 1 (labeled Bloomberg selection), and (b) the option of selecting indicators automatically by using a statistical technique (labeled as automatic selection).Footnote 18
In Table 6, we show the RMSE for a DFM parameterized as described in Sect. 4.1 but estimated on the two datasets just described. As we can see, our database clearly delivers the best performance, though in forecasting the performance of the DFM estimated over the indicators followed by Bloomberg is comparable. The performance of the DFM is particularly disappointing when the indicators are selected with LARS. We believe that this is a consequence of having too few observations, and some missing values in the time series of our indicators.
The second robustness check aims to answer the question “what if we do not select indicators but include all variables that we are able to retrieve?”. To this end we use a bridge model as in Eq. (4), where the predictor \(x_t^Q\) is a factor extracted from a panel of monthly variables by using principal component.Footnote 19 The last column of Table 6 reports the RMSE for this model and the results confirm that including all variables does not appear to be a good strategy.
1.2 Technical aspects related to y-o-y transformations
In Sect. 3.2 we mention that to estimate the model we use the EM algorithm proposed by Bańbura and Modugno (2014), which can handle both mixed frequencies and missing data. While we refer the reader to Bańbura and Modugno (2014) and Bańbura et al. (2011, 2013) for detailed information on this algorithm and for a formal treatment, in this Appendix we provide basic information related to the use of year over year growth rates.
The first issue is to understand what is the relationship between monthly y-o-y growth rates, and quarterly y-o-y growth rates. Let t denotes months, and let \(Y^Q_t\) be the log-level of a quarterly variable. The y-o-y growth rate is then equal to \(Y^Q_t-Y^Q_{t-12}\) and we will denote it as \(y_t^Q\). Then, let \(Y^M_t\) be the monthly log-level of Y, and let \(y_t^M=Y^M_t-Y^M_{t-12}\) be the monthly y-o-y growth rate. In order to link \(y_t^Q\) and \(y_t^M\) we follow Mariano and Murasawa (2003), and we make use of the approximation \(Y_t^Q \approx Y_t^M + Y_{t-1}^M+Y_{t-2}^M\) which, after some simple algebra, allows us to write:
The second issue is how to write the factor model when we have a vector of monthly and quarterly y-o-y growth rates. Bańbura and Modugno (2014) solved this by treating quarterly series as monthly series with missing observations, and by assigning the quarterly observation to the 3rd month. Suppose for simplicity that we have only one monthly variable \(y_{1t}^M\), one quarterly variable \(y_{2t}^Q\), and one (monthly) factor \(f_t\), then by using (6) we have:
1.3 Why two factors?
The literature on factor models has shown that it suffices to include a small number of factors for forecasting (e.g., Stock and Watson 2002b; Forni et al. 2003). Furthermore, recent literature on small-medium DFMs (Bańbura et al. 2013; Luciani and Ricci 2014; Giannone et al. 2014; Bragoli et al. 2015) often includes one factor only. Therefore, a natural choice would be to follow the literature and to set \(r=1\).
However, these factor models are estimated for q-o-q growth rates, while we are estimating a model for y-o-y growth rates. So if the model for q-o-q growth rates has one factor, then the corresponding model for y-o-y growth rates has four factors: let \(X_t\) be a non-stationary variable in log-levels, and let \(x_t^y=X_t-X_{t-4}\) be the y-o-y growth rates and \(x_t^q=X_t-X_{t-1}\) be the q-o-q growth rates, so that \(x_t^y=x_t^q+x_{t-1}^q+x_{t-2}^q+x_{t-3}^q\). Then, if the true model is \(x_t^q=\lambda f_t+e_t\), we have \(x_t^y=\lambda (1+L+L^2+L^3)f_t+(1+L+L^2+L^3)e_t\), which can be rewritten as \(x_t^y=\lambda {\mathbf {F}}_t+(1+L+L^2+L^3)e_t\) where \({\mathbf {F}}_t\) is a \(4\times 1\) singular vector.
In light of this, we should set \(r=4\), and, indeed, by looking at the eigenvalues of the covariance matrix of the first subsample (July 2002–December 2007), we can see clearly three/four diverging eigenvalues. However, among these three/four eigenvalues, the first two clearly dominate—the first eigenvalue accounts for 70% of the total variance, the second for 20%, the third for 5%, and the fourth for 3%—thus suggesting that the other two mainly carry noise, which is why we choose to set \(r=2\). Furthermore, as a robustness check we estimated the DFM by setting either \(r=1\) or \(r=4\), and found (results not shown here) that including two factors proved optimal.
There are at least three possible alternatives to the strategy of setting \(r=2\). The first would be to estimate a model with four common factors driven by only one common shock, so that the vector \(\mathbf f_t\) in Eq. (2) is \(4\times 1\), while the vector \(\mathbf u_t\) is \(1\times 1\). A second possibility would be to impose the specific lagged structure described above (see, e.g., Camacho and Perez-Quiros 2010), and a third possibility would be to estimate dynamic factor loadings by using the Dynamic Heterogeneous Factor Models of (D’Agostino et al. 2016).
The reason we prefer to use the model with \(r=2\) to all the alternatives is that we want to keep the model as simple as possible, so that it easy to communicate the prediction obtained with our model to policy makers. Furthermore, all three alternatives complicate the state space representation and in our case where we have few time series observations, and the quality of the data is doubtful, while complicating the Kalman Filter is feasible it is costly in terms of computation accuracy.
Rights and permissions
About this article
Cite this article
Luciani, M., Pundit, M., Ramayandi, A. et al. Nowcasting Indonesia. Empir Econ 55, 597–619 (2018). https://doi.org/10.1007/s00181-017-1288-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00181-017-1288-4