Keywords

1 Introduction

The currency has a very important role for the Indonesian economy. Although non-cash payment system has grown rapidly, currency or cash payment is still more efficient for individual payment for small nominal value. Forecasting inflow and outflow can be an option to maintain the stability of currency. The prediction of the amount of currency demand in Indonesia is often referred as the autonomous liquidity factor, so in predicting the demand for currency by society will be difficult [1]. The development of inflow and outflow of currency both nationally and regionally has certain movement patterns influenced by several factors, such as government money policy. Moreover, it is also influenced by trend, seasonal and calendar variation effect caused by Eid ul-Fitr that usually occurred at different date in each year [2]. Decomposition of time series data into sub patterns can ease the process of time series analysis [3]. Hence, a forecasting method that could capture and reconstruct each component pattern in the data was needed.

This study proposes a forecasting method that combining Singular Spectrum Analysis as decomposition method and Neural Network (known as SSA-NN) for forecasting inflow and outflow data in both scheme, i.e. individual and aggregate forecasting. The SSA method was applied to decompose and reconstruct the time series patterns in inflow and outflow data which including trend, cyclic, and seasonal into several additive components, while the NN method is used to handling non-linear pattern which contained in inflow and outflow data. In addition, this study also focuses to learn whether the SSA-NN could handle calendar variation effect in time series, particularly the effect of Eid ul-Fitr to inflow and outflow data.

As widely known, SSA is a forecasting technique that combines elements of classical time series analysis, multivariate statistics, multivariate geometry, dynamic systems, and signal processes [4]. SSA can decompose common patterns in time series data, trend, cycle, and seasonal factors into some additive components separated by trend, oscillatory, and noise components. SSA was first introduced by Broomhead and King [5] and followed by many studies that applied this method [6, 7]. SSA has a good ability in characterizing and prediction of time series [8]. Furthermore, it is also known that SSA method can be used for analyzing and forecasting short time series data with various types of non-stationary and produce more accurate forecast [9, 10].

In the past decades, many researchers have increasingly developed SSA by combining this method with other forecasting methods. Hybrid SSA model tends to be more significant and provides better performance than other methods [12]. A combination of SSA and NN could forecast more accurately and it could effectively reconstruct the data [13, 14]. A comparative study was done by Barba and Rodriguez [15] also showed that SSA-NN produced better accuracy for multi-step ahead forecasting of traffic accident data. The rapid research development about the combinations of SSA showed that this method can improve forecasting performance and could be a potential and competitive method for time series forecasting [16,17,18].

In this study, two types of data are used, i.e. simulation and real data about the monthly inflow and outflow data of 12 banknotes denomination from January 2003 to December 2016. These data are secondary data obtained from Bank Indonesia. The data are divided into two parts, i.e. training data (from January 2003 to December 2014) and testing data (from January 2015 to December 2016). The forecast accuracy of the proposed method is compared to ARIMAX model by using RMSE, MAE and MAPE criteria. The results of the simulation study showed that the hybrid SSA-NN with aggregate forecasting scheme yielded more accurate forecast than individual forecasting scheme. Moreover, the results at real data showed that the hybrid SSA-NN yielded as good as ARIMAX model for forecasting of 12 inflow and outflow denominations. It indicated that the hybrid SSA-NN could not handle calendar variation pattern in all data series. Generally, these results in line with M3 competition results which concluded that more complex methods do not always yield better forecast than the simpler one.

The rest of paper is organized as follows: Sect. 2 reviews the methodology, i.e. ARIMAX, Singular Spectrum Analysis, and Neural Networks as forecasting method; Sect. 3 presents the results and analysis; and Sect. 4 presents the conclusion from this study.

2 Materials and Methods

2.1 ARIMAX

The ARIMAX is an ARIMA model with the addition of exogenous variables. This model has a similar form with linear regression that has additional variables such as trend, seasonal, and calendar variation factors, or other explanatory variables. The ARIMAX model which consists of linear trend (represented by \( t \) variable), additive seasonal (represented by \( M_{i,t} \) variables), and calendar variation pattern (represented by \( V_{j,t} \) variables) is written as follows:

$$ Y_{t} = \beta_{0} + \beta_{1} t + \sum\nolimits_{i = 1}^{I} {\gamma_{i} M_{i,t} } + \sum\nolimits_{j = 1}^{J} {\delta_{j} V_{j,t} } + N_{t} $$
(1)

where \( M_{i,t} \) is dummy variables for \( I \) seasonal effects, \( V_{j,t} \) is dummy variables for \( J \) calendar variation effects, and \( N_{t} \) is noise variable that follows ARMA model. The identification of calendar variation effect, particularly about the duration effect, can be done graphically by using time series plot [19].

2.2 Singular Spectrum Analysis (SSA)

SSA is a forecasting method that combines elements of classical forecasting, multivariate statistics, multivariate geometry, dynamic systems, and signal processing. SSA method does not require the fulfillment of statistical assumptions such as stationary, and ergodicity. The main objective of the SSA method is to decompose the original time series into several additive components, such as trend, oscillatory, and noise components [4]. In general, SSA has two main stages as follows:

a. Decomposition (Embedding and Singular Value Decomposition)

The procedure in embedding is to map the original time series data into a multidimensional sequence of lagged vector. Let’s assume \( L \) is an integer number represents window length with \( 1 < L < n \), the formation of lagged vectors where \( K = n - L + 1 \) is

$$ Y_{i} = (f_{i} ,f_{i + 1} , \ldots ,f_{i + L - 1} )^{T} , 1 \le i \le K $$
(2)

which has a dimension of \( L \). If the dimensions of \( Y_{i} \) are emphasized, then \( Y_{i} \) is referred as \( L \)-lagged vectors. The path matrix of the \( F \) series is illustrated as follows:

$$ {\text{Y}} = [Y_{i} : \ldots :Y_{K} ] = \left[ {\begin{array}{*{20}c} {f_{1} } & {f_{2} } & {f_{3} } & \cdots & {f_{K} } \\ {f_{2} } & {f_{3} } & {f_{4} } & \cdots & {f_{K + 1} } \\ {f_{3} } & {f_{4} } & {f_{5} } & \cdots & {f_{K + 2} } \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ {f_{L} } & {f_{L + 1} } & {f_{L + 2} } & \cdots & {f_{n} } \\ \end{array} } \right] $$
(3)

Let \( {\mathbf{S}} = {\mathbf{YY}}^{{\mathbf{T}}} \) and \( \lambda_{1} ,\lambda_{2} , \ldots ,\lambda_{L} \) be the eigenvalues of the matrix \( {\mathbf{S}} \) Where \( \lambda_{1} \ge \lambda_{2} \ge \ldots \ge \lambda_{L} \ge 0 \) and \( U_{1} ,U_{2} , \ldots ,U_{L} \) are eigenvectors of the matrix \( {\mathbf{S}} \) corresponding to the eigenvalues. Note that \( d = { \hbox{max} }\left\{ i \right\} \) so that \( \lambda_{i} > 0 \) is the rank of the matrix \( {\mathbf{Y}} \). If \( {\mathbf{S}} = V_{i} = {\mathbf{Y}}^{{\mathbf{T}}} U_{i} /\sqrt {\lambda_{i} } \) for \( i = 1,2, \ldots ,d \), then SVD of the path matrix \( {\mathbf{Y}} \) can be written as

$$ {\mathbf{Y}} = {\mathbf{Y}}_{1} + {\mathbf{Y}}_{2} + \ldots + {\mathbf{Y}}_{d} $$
(4)

where \( {\mathbf{Y}}_{i} { = }\sqrt {\lambda_{i} } U_{i} V_{i}^{{\mathbf{T}}} \). Matrix \( {\mathbf{Y}}_{i} \) has rank 1 and often called as an elementary matrix. The set \( \left( {\sqrt {\lambda_{i} } ,U_{i} ,V_{i} } \right) \) is called \( i \)-th eigentriple to SVD.

b. Reconstruction (Grouping and Diagonal Averaging)

After SVD equation is obtained, the grouping procedure will partition the set of indices \( \left\{ {1,2, \ldots ,d} \right\} \) into \( m \) subsets of mutually independent, \( I_{1} ,I_{2} , \ldots ,I_{m} \). Let \( I = \left\{ {i_{1} ,i_{2} , \ldots ,i_{p} } \right\} \), the resulting \( {\mathbf{Y}}_{i} \) matrix corresponds to group \( I \) defined as a matrix with \( {\mathbf{Y}}_{I} = {\mathbf{Y}}_{{i_{1} }} + {\mathbf{Y}}_{{i_{2} }} + \ldots + {\mathbf{Y}}_{{i_{p} }} \). This matrix is calculated for groups \( I = I_{1} ,I_{2} , \ldots ,I_{m} \) and this step will lead to decomposition form as follows:

$$ {\mathbf{Y}} = {\mathbf{Y}}_{{I_{1} }} + {\mathbf{Y}}_{{I_{2} }} + \ldots + {\mathbf{Y}}_{{I_{m} }} . $$
(5)

Set \( m \) selection procedures, \( I_{1} ,I_{2} , \ldots ,I_{m} \) are called eigentriple groupings. If \( m = d \) and \( I_{j} = \left\{ j \right\} \), \( j = 1,2, \ldots ,d \), then the corresponding grouping is called elementary.

Let \( {\mathbf{Z}} \) be \( L \times K \) matrix with element \( z_{ij} , 1 \le i \le L, 1 \le j \le K \) for \( L \le K \). Let’s assume the values of \( L^{*} = { \hbox{min} }\left\{ {L,K} \right\} \), \( K^{*} = { \hbox{max} }\left\{ {L,K} \right\} \) and \( n = L - K - 1 \). If \( L < K \) then \( z_{ij}^{*} = z_{ij} \), and if \( L > K \) then \( z_{ij}^{*} = z_{ji} \). Diagonal averaging moves the \( {\mathbf{Z}} \) matrix to the series \( g_{1} ,g_{2} , \ldots ,g_{n} \) by following formula:

$$ g_{k} = \left\{ {\begin{array}{*{20}c} {\frac{1}{k}\sum\nolimits_{m = 1}^{k} {z_{m,k - m + 1}^{*} } } & {{\text{for }}\,1 \le k < L^{*} } \\ {\frac{1}{{L^{*} }}\sum\nolimits_{m = 1}^{{L^{*} }} {z_{m,k - m + 1}^{*} } } & {{\text{for }}\,L^{*} \le k < K^{*} } \\ {\frac{1}{n - k + 1}\sum\nolimits_{{m = k - K^{*} + 1}}^{{n - K^{*} + 1}} {z_{m,k - m + 1}^{*} } } & {{\text{for }}\,K^{*} \le k < n} \\ \end{array} } \right. $$
(6)

This equation corresponds to the average matrix element over the ‘antidiagonals’ \( i + j = k + 1 \). If the averaging diagonal is applied to the matrix \( {\text{Y}}_{{I_{k} }} \), then this process will obtain a reconstructed series \( F^{\left( k \right)} = \left( {f_{1}^{\left( k \right)} , f_{1}^{\left( k \right)} , \ldots ,f_{1}^{\left( k \right)} } \right) \). Therefore, the initial series \( f_{1} ,f_{2} , \ldots ,f_{n} \) are decomposed into a sum of the \( m \) reconstructed series, i.e.

$$ f_{j} = \sum\nolimits_{k = 1}^{m} {f_{j}^{\left( k \right)} } ,\,j = 1,2, \ldots ,n $$
(7)

2.3 Neural Networks

The most commonly used form of neural network architecture (NN) is Feedforward Neural Networks (FFNN). In statistical modeling, FFNN can be viewed as a flexible class of nonlinear functions. NNs has several unique characteristics features such as its adaptability, nonlinearity, arbitrary function mapping ability – make this method quite suitable and useful for forecasting tasks [20]. In general, this model works by accepting a vector from input \( x \) and then compute a response or output \( \hat{y}\left( x \right) \) by processing (propagating) \( x \) through interrelated process elements. In each layer, the inputs are transformed into layers using a nonlinear form, and it will be processed forward to the next layer. Finally, the output values \( \hat{y} \), which can be either scalar or vector values, are calculated on the output layer [21]. FFNN architecture with a hidden layer consisting of q unit neurons and output layer consisting only of one unit of neuron is shown as Fig. 1.

Fig. 1.
figure 1

FFNN architecture with one layer hidden, p input unit, q unit of neuron in hidden layer, and one output neuron unit

The response or output \( \hat{y} \) values are calculated by:

$$ \hat{y}_{(k)} = f^{0} \left[ {\sum\limits_{j = 1}^{q} {\left[ {w_{j}^{0} f_{j}^{h} \left( {\sum\limits_{i = 1}^{p} {w_{ji}^{h} x_{i(k)} + b_{j}^{h} } } \right) + b^{0} } \right]} } \right]. $$
(1)

where \( f_{j}^{h} \) is activation function in the j-th neuron in the hidden layer, and \( f^{0} \) is the activation function of the neuron in the output layer.

2.4 Hybrid Singular Spectrum Analysis and Neural Network

In general, SSA method is able to decompose a data series into trend, seasonal, and noise patterns. From the decomposition of data patterns, the forecasting will be done using NN with inputs are the lags of component or known as Autoregressive Neural Networks (ARNN). Forecasting can be used either individual or aggregate scheme. Individual forecasting is done by forecasting every major component formed without combining as a trend and seasonal. Specifically, the noise components will be always modelled in aggregate scheme.

Aggregate forecasting is done by summing the components that have same pattern. Thus, the forecast value is calculated from three main patterns, i.e. trend, seasonal, and noise. Then, the results of forecasting with ARNN on individual patterns will be summed to get the forecast of main series (forecast aggregation). These procedure stages are shown in Figs. 2 and 3 for individual and aggregate scheme, respectively.

Fig. 2.
figure 2

SSA-NN forecasting using individual forecasting

Fig. 3.
figure 3

SSA-NN forecasting using aggregate forecasting

The algorithm of SSA-NN has several steps as follows:

  1. a.

    Data series decomposition with SSA

    1. i.

      Embedding

    2. ii.

      Singular Value Decompotition (SVD)

    3. iii.

      Grouping

    4. iv.

      Diagonal Averaging

  2. b.

    Modeling the decomposition results using the NN method.

    1. i.

      Determine the input variables in NN based on the significant lags of the Partial Autocorrelation Function or PACF of stationary data [22].

    2. ii.

      Conduct a nonlinearity test using Terasvirta test.

    3. iii.

      Determine the number of units on hidden layer using cross validation method.

    4. iv.

      Estimate parameters/weights of NN by using backpropagation algorithm.

    5. v.

      Forecast the testing data.

  3. c.

    Summarize the results of the forecast at each component to get the forecast of testing data.

  4. d.

    Calculate the level of forecasting errors for testing data.

  5. e.

    Forecast data by using the corresponding NN model for each data component.

2.5 Model Evaluation

Cross-validation is used for model evaluation, which focusing only on the forecast results for out-sample or testing data [23]. The model evaluation will be done based on the accuracy of the forecast by using RMSE, MAE and MAPE which shown in following equation [24], where C is the forecast period:

$$ RMSE = \sqrt {\frac{1}{C}\sum\limits_{c = 1}^{C} {(Y_{n + c} - \hat{Y}_{n} (c))^{2} } } $$
(2)
$$ MAE = \frac{1}{C}\sum\limits_{c = 1}^{C} {|Y_{n + c} - \hat{Y}_{n} (c)|} $$
(3)
$$ MAPE = \frac{1}{C}\sum\limits_{c = 1}^{C} {\left| {\frac{{Y_{n + c} - \hat{Y}_{n} (c)}}{{Y_{n + c} }}} \right|} \times 100\% . $$
(4)

3 Results

3.1 Simulation Study

Inflow and outflow currency data are suspected to contain trend, seasonal patterns and influenced by certain calendar variations. To gain better knowledge and understanding about the proposed SSA-NN method, a simulation study was conducted by assuming the data are observed on the period from January 2001 to December 2016 or have 192 observations. In this simulation study, data were generated for each component of trend, seasonal, calendar variation patterns as well as random and non-random noise (has nonlinear pattern) as follows:

  1. a.

    Trend, \( T_{t} = 0.2 t \)

  2. b.

    Seasonal,

    $$ \begin{aligned} M_{t} & = 20 M_{1,t} + 23.7 M_{2,t} + 25 M_{3,t} + 23.7 M_{4,t} + 20 M_{5,t} + 15 M_{6,t} + \\ & 10 M_{7,t} + 6.3 M_{8,t} + 5 M_{9,t} + 6.3 M_{10,t} + 10 M_{11,t} + 15 M_{12,t} \\ \end{aligned} $$
  3. c.

    Calendar Variation,

    $$ V_{t} = 65 V_{1,t} + 46 V_{2,t} + 47 V_{3,t} + 18 V_{4,t} + 28 V_{1,t + 1} + 23 V_{2,t + 1} + 41 V_{3,t + 1} + 60 V_{4,t + 1} $$
  4. d.

    Linear Noise Series (white noise assumption is fulfilled),

    \( N_{1,t} = a_{t} \), where \( a_{t} \,\sim\,IIDN(0,1) \)

  5. e.

    Nonlinear Noise Series which follow ESTAR(1) model,

    \( N_{2,t} = 6.5 N_{2,t - 1} .\exp \left( { - 0.25 N_{t - 1}^{2} } \right) + a_{t} \), where \( a_{t} \,\sim\,IIDN(0,1) \).

There are two scenarios of simulation series that following equation,

$$ Y_{t} = T_{t} + M_{t} + V_{t} + N_{t} $$

where the scenario 1 consisting of trend, seasonal, calendar variation and noise that fulfill white noise, and the scenario 2 containing of trend, seasonal, calendar variation and noise that follow nonlinear ESTAR model. Both scenarios are used to evaluate the performance of SSA in handling all these patterns, particularly calendar variation effect pattern. The time series plot of simulation data are shown in Fig. 4.

Fig. 4.
figure 4

Time series plot of the scenario 1 and 2 of simulation data

Decomposition results with SSA indicate that the effects of calendar variation could not be decomposed on their own. The results show that the effects of calendar variations on aggregate forecasting are captured into seasonal components and partly as noise components. Furthermore, both individual and aggregate data are modeled by using NN and it will be summed to obtain the forecast of data-testing. The model evaluation of individual and aggregate forecasting is shown in Table 1.

Table 1. Model evaluation using individual and aggregate forecasting in simulation data

Table 1 shows that in this simulation study, an aggregate forecasting has better results than an individual forecasting. It can be seen from the RMSE, MAE and MAPE of aggregate method are smaller than the individual method, both in scenario 1 and 2. Based on Table 1, it could be concluded that SSA-NN method yield better forecast on data containing random noise than data containing nonlinear noise.

3.2 Inflow and Outflow Data

The data that be used as case study are the monthly inflow and outflow data of banknotes per denomination from January 2003 to December 2016. These data are secondary data that be obtained from Bank Indonesia. The data are divided into training data (from January 2003 to December 2014) and testing data (from January 2015 to December 2016). The description of the data is shown at Table 2.

Table 2. Research variable (in billion IDR)

The pattern of inflow and outflow of currency at Indonesia (National) from January 2003 until December 2016 shown in Fig. 5. The national inflow and outflow in Indonesia has generally fluctuated, although it declined in 2007 due to the implementation of Bank Indonesia’s new policy on deposits and payments to banks. While starting in 2011, the inflow and outflow data increase due to imposition of deposits and withdrawals. In general, the increasing value of national inflow and outflow is high in certain months occurred as the effect of calendar variations, i.e. Eid ul-Fitr. The Eid ul-Fitr is suspected to affect certain months in both inflow and outflow data. In addition, Eid ul-Fitr that occur on different week will also give different impact on the increasing amount of inflow and outflow.

Fig. 5.
figure 5

Inflow (a) and outflow (b) in Indonesia (billion IDR)

The effect of Eid ul-Fitr influences the amount of inflow of Bank Indonesia in the one month after Eid ul-Fitr. This is related to people’s habit to save money after carrying out Eid ul-Fitr holiday. In general, Eid ul-Fitr that occurs at the beginning of the month will result in a sharper increase in inflow. Otherwise, Eid ul-Fitr that occurs at the end of the month will yield the highest inflow in one month after this holiday. Additionally, the outflow was also affected by the occurrence of Eid ul-Fitr due to people tend to withdraw money to fulfill their needs during Eid ul-Fitr. In the month of Eid ul-Fitr, the highest outflow will happened when Eid ul-Fitr occurs at the end of the month. As for one month before Eid ul-Fitr, the highest outflow occurs when Eid ul-Fitr occurs at the beginning of the month.

3.3 Forecasting Inflow and Outflow Data Using ARIMAX

Forecasting inflow and outflow with ARIMAX method is using components as exogenous variables, which are trend, seasonal, and calendar variations component effects. These components are represented by the dummy variables as in Eq. (1).

The steps in ARIMAX method is to regress first the effects of trend, seasonal and calendar variation and then applying the ARIMA model on the residuals of this regression if these residuals not fulfill the white noise assumption. Based on model estimation at every denomination, the best ARIMA model for each residual time series regression is shown in Table 3. The ARIMAX equation model is obtained by combining time series regression model and the best ARIMA model from each of the inflow and outflow fractions. For example, the ARIMAX model for inflow Rp100.000,00 data can be written as:

Table 3. The best ARIMA model for each series
$$ \begin{aligned} Y_{7,t} & = 1.5 t + 2956.4 M_{1,t} - 1434.4 M_{2,t} - 624.8 M_{3,t} - 38.2 M_{4,t} - 213.4 M_{5,t} + \\ & 74.7 M_{6,t} - 595.2 M_{7,t} + 2941.3 M_{8,t} - 3925.5 M_{9,t} + 1433.0 M_{10,t} - \\ & 11.8 M_{11,t} + 428.9 M_{12,t} + 5168.9 V_{1,t} + 11591.4 V_{2,t} + 9110.3 V_{3,t} - \\ & 3826.1 V_{4,t} - 6576.5 V_{1,t + 1} - 13521.5 V_{2,t + 1} - 13059.2 V_{3,t + 1} + \\ & 3102.7 V_{4,t + 1} + \frac{{\left( {1 - 0.80 B} \right)\left( {1 - 0.17 B^{12} } \right)}}{{\left( {1 - B} \right)\left( {1 - B^{12} } \right)}}a_{t} . \\ \end{aligned} $$

3.4 Forecasting Inflow and Outflow Data Using SSA-NN

In the previous study, it showed that a NN model could not capture the trend and seasonal patterns well [25]. To overcome it, the proposed SSA-NN firstly reconstruct the components of data using SSA. Each fraction of the inflow and outflow is decomposed by determining the L value of half of the data (L = 84). The SVD process obtained 50 eigentriples values. The grouping step is done by determining the value of effect grouping (r) to limit the number of eigentriples to grouping the trend and seasonal components. The r value is obtained from the sum of the singular values that have the graph show the noise component. Due to the simulation data show that aggregate forecasting gives better results than individual forecasting, then the forecasting inflow and outflow use aggregate forecasting. To do this, it is necessary to grouping the component pattern, i.e. trend, seasonal, and noise components.

Based on the results of principal component, there are 12 main components in inflow Rp 100.000,00. In Fig. 6, the components that tend to slowly increase or decrease are trend components, while components that follow periodic patterns and have corresponding seasonal periods are grouped into seasonal components, and other components are grouped into noise. Subsequent groupings are made to identify incoming inputs on the NN model. The result of reconstruction of each components of inflow Rp 100.000,00 is shown in Fig. 7.

Fig. 6.
figure 6

Principal component plot of inflow Rp 100.000,00

Fig. 7.
figure 7

Grouping trend, seasonal, and noise components of inflow Rp 100.000,00

Then, each component in Fig. 7 are modeled by NN. Based on the best model with the smallest value of goodness of fit criteria, the final NN model for each component can be written as

$$ \hat{Y}_{7,t} = \hat{T}_{7,t}^{*} + \hat{S}_{7,t}^{*} + \hat{N}_{7,t}^{*} $$

where \( \hat{T}_{7,t}^{*} \) is standardized value of Tt, \( \hat{S}_{7,t}^{*} \) is standardized value of St, and \( \hat{N}_{7,t}^{*} \) is standardized value of Nt. The architecture model of NN for denomination currency of Rp100.000,00 for trend and noise components are shown in Fig. 8.

Fig. 8.
figure 8

NN architecture for trend (a) and noise (b) components of inflow Rp100.000,00

The SSA-NN modeling was performed on each denomination of inflow and outflow. In overall, hybrid SSA-NN model could capture well the pattern of calendar variation present in training data. However, in several denominations, the forecast value of testing data using SSA-NN model could not capture the calendar variation pattern (Fig. 9). The model evaluation using hybrid SSA-NN in each fraction presented in Table 4.

Fig. 9.
figure 9

Comparison of forecast value using SSA-NN and ARIMAX

Table 4. Model evaluation of hybrid SSA-NN model in each denomination

The forecast accuracy comparison between SSA-NN and ARIMAX methods for each denomination of inflow and outflow are shown in Fig. 9. Moreover, it is also necessary to analyze the reduced forecasting error for SSA-NN method compared to ARIMAX method. The comparison results of these methods are shown at Table 5.

Table 5. RMSE ratio between SSA-NN and ARIMAX methods

The ratio value less than one indicates that SSA-NN with aggregate forecasting scheme is better and capable for reducing forecast error than ARIMAX based on RMSE criteria. In general, the results show that hybrid SSA-NN method give better results for predicting 6 out of 12 denominations of inflow and outflow. It is indicated by the RMSE ratio value that smaller than 1, which mean SSA-NN produces smaller forecast error than ARIMAX. Moreover, these results in line with M3 competition results, conclusion, and implication, i.e. more complex methods do not necessary yield better forecast than the simpler one [26].

4 Conclusion

The results of simulation study showed that the proposed hybrid SSA-NN with aggregate forecasting scheme by grouping trends, seasonal, and noises yielded more accurate forecast than individual forecasting scheme. These results also showed that hybrid SSA-NN gave better performance in modeling series with random noise than nonlinear noise. Furthermore, the empirical study proved that Eid ul-Fitr had significant effect on the amount of inflow and outflow. The results for inflow and outflow data showed that hybrid SSA-NN could capture well the trend and seasonal pattern. Otherwise, this hybrid SSA-NN could not capture well the effects of calendar variations. Hence, it could be concluded that hybrid SSA-NN is a good forecasting method for time series which contain trends and seasonal only. Moreover, the results of forecast value comparison indicated that hybrid SSA-NN model performed as good as ARIMAX model, i.e. 6 of 12 denominations were better to be forecasted by the hybrid SSA-NN method, and the rests were more accurate to be forecasted by ARIMAX model. These results in line with the M3 competition conclusion, i.e. more complex methods do not necessary yield better forecast than the simpler one [26]. Hence, further research is needed to handle all patterns simultaneously, i.e. trend, seasonal, and calendar variation effects, by proposing new hybrid method as combination of SSA-NN and ARIMAX methods.