Forecasting tourist arrivals using dual decomposition strategy and an improved fuzzy time series method

Liang, Xiaozhen; Wu, Zhikun

doi:10.1007/s00521-021-06671-7

S.I. : Neuro, fuzzy and their Hybridization
Published: 17 January 2022

Volume 35, pages 7161–7183, (2023)

Neural Computing and Applications

Abstract

Tourist arrivals forecasting has become an increasingly hot issue due to its important role in the tourism industry and hence the whole economy of a country. However, owing to the complex characteristics of tourist arrivals series, such as seasonality, randomness, and non-linearity, forecasting tourist arrivals remains a challenging task. In this paper, a hybrid model of dual decomposition and an improved fuzzy time series method is proposed for tourist arrivals forecasting. In the novel model, two stages are mainly involved, i.e., dual decomposition and integrated forecasting. In the first stage, a dual decomposition strategy, which can overcome the potential defects of individual decomposition approaches, is designed to fully extract the main features of the tourist arrivals series and reduce the data complexity. In the second stage, a fuzzy time series method with fuzzy C-means algorithm as the discretization method is developed for prediction. In the empirical study, the proposed model is implemented to predict the monthly tourist arrivals to Hong Kong from USA, UK, and Germany. The results show that our hybrid model can obtain more accurate and more robust prediction results than benchmark models. Relative to the benchmark fuzzy time series models, the hybrid models using traditional decomposition methods and strategies, as well as the traditional single prediction models, our proposed model shows a significant improvement, with the improvement percentages at about 80, 70, and 50%, respectively. Therefore, we can conclude that the proposed model is a very promising tool for forecasting future tourist arrivals or other related fields with complex time series.

1 Introduction

From a worldwide perspective, tourism makes a great contribution to economic growth [1, 2]. In China, for example, according to the China National Tourism Administration, the total revenue of the tourism industry was 6.63 trillion yuan in 2019, an increase of 11 percent over 2018 and more than 11% of China's GDP. Therefore, forecasting tourist arrivals plays an important role in forecasting future economic growth. Moreover, tourist arrivals forecasting can provide a valuable reference for subsequent strategic planning and policy formulation [3, 4]. Accurate forecasts of tourist arrivals can make the operation of travel agencies more effective and help tourist destinations to be better managed, which is very important to the sustainable development of the whole tourism industry and even the entire economy. In general, the study of tourist arrivals forecasting is of great significance to the whole society, both politically and economically. However, due to the complex characteristics of tourist arrivals series (e.g., seasonality, randomness, and non-linearity), tourist arrivals forecasting is still a difficult problem.

To solve this problem, a growing number of researchers are paying attention to the analysis and prediction of tourist arrivals. Meanwhile, numerous models have been formulated and designed to forecast tourist arrivals. According to related literature [5], single forecasting models that were widely used to forecast tourist arrivals can fall into two main types, i.e., econometric models and artificial intelligence (AI) models. The econometric models, such as autoregressive moving average (ARMA) [6], autoregressive integrated moving average (ARIMA) [7], exponential smoothing (ES) [8], and generalized autoregressive conditional heteroskedasticity (GARCH) [9], are more suitable for forecasting a relatively stable time series [10]. When forecasting data such as tourist arrivals with non-linear characteristic and rapid changes, it has been pointed out that econometric models perform poorly in achieving effective prediction results [11]. As for the AI models, the development of AI techniques has greatly promoted their application in various fields, including air quality early warning [12], the prediction of crude oil price [13], and electricity price [14]. The commonly used AI models for forecasting tourist arrivals include artificial neural networks (ANNs) [15], extreme learning machine (ELM) [16], and support vector machine (SVM) [17]. Compared with the econometric models, AI models are more effective due to their strong robustness and fault tolerance. All these forecasting methods have significantly promoted the sustainable development of world tourism industry.

However, almost every single forecasting model has its pros and cons, and even AI models are unlikely to achieve satisfactory performance in all scenarios. For example, because of their poor extrapolation, narrow prediction scale, and high requirements on data quantity and quality, econometric models are unsuitable for data with high fluctuation and noise [18]; for ANNs, the prediction performance is affected by the randomly generated initial weights and thresholds [19]. For this reason, researchers have turned their attention to developing hybrid forecasting models that combine existing single methods. Numerous studies have shown that hybrid forecasting models can achieve better results than single models, and they have become the mainstream forecasting approach [20].

In order to develop hybrid models for forecasting, some decomposition methods, such as variational mode decomposition (VMD) [21], empirical mode decomposition (EMD) [22], and wavelet transform (WT) [23], have been employed to extract the main features of raw series. Our previous work [24] has proved that data preprocessing with an effective decomposition method can significantly improve prediction performance. Specifically, data preprocessing strategies can fall into two types. One refers to “decomposition & de-noising” strategy [25]. Under this strategy, the noisy information of the original series is first removed, then the forecasting model is established by using the filtered time series. The other refers to “divide & conquer” strategy [26]. Under this strategy, raw series is first decomposed into several components, which then can be predicted using a determined prediction model respectively, and finally, the predicted values of all components are integrated to get the final results. In terms of tourist arrivals forecasting, Jiang and Ma [27] used fast ensemble EMD (FEEMD) method for data preprocessing to build a hybrid model, which performs well in forecasting future tourist arrivals. Similarly, by using WT for data preprocessing and kernel-based ELM and ARMA for forecasting, Yang et al. [28] developed a hybrid model for daily tourist arrivals forecasting, and the empirical results based on three real tourism markets show that the developed model has good linear and non-linear prediction abilities. In the above studies, the hybrid forecasting models can improve the prediction accuracy and thus perform better than all the considered benchmark models. Nevertheless, data preprocessing only using a single decomposition method in the hybrid model may not be able to fully extract the main features of the tourist arrivals series. 
Furthermore, inherent defects in some data decomposition methods, such as mode mixing and the endpoint effect, may also limit their application in feature extraction [29]. In fact, incomplete feature extraction and the inherent defects of decomposition methods make it difficult for a hybrid model to achieve satisfactory prediction results. Therefore, to improve the prediction performance, it is worthwhile to further improve the data preprocessing techniques.

In addition, there is a problem that the commonly used single forecasting models have a poor interpretation of the prediction results. The fuzzy time series (FTS) model which divides the universe of discourse based on historical data features can solve this problem well. However, most of the traditional FTS models divide the universe of discourse with equal widths and ignore the potential features of the data, which makes the prediction results still unsatisfactory [30]. To address this issue, scholars developed some novel methods for dividing the universe of discourse, such as genetic algorithms and clustering algorithms. Therefore, from the perspective of strengthening the interpretation of the results and improving model accuracy, it is of great value to further explore how to divide the universe of discourse of FTS by fuzzy C-means (FCM) algorithm.

To sum up, the above analysis shows that the existing studies are insufficient to comprehensively improve the forecasting effectiveness. Thus, a novel tourist arrivals forecasting model that significantly improves the forecasting effectiveness is urgently needed, both for the tourism industry and for sustainable economic and social development.

This paper proposes a novel hybrid forecasting model of tourist arrivals using dual decomposition strategy and an improved fuzzy time series method. Two stages are included in this hybrid model: dual decomposition, and integrated forecasting. In the first stage, the seasonal adjustment method (i.e., X12-ARIMA [31]) is employed to decompose the tourist arrivals data to extract its significant seasonal characteristics, and then an improved empirical mode decomposition method (i.e., ICEEMDAN [29]) is applied to decompose the remaining component sequences for reducing data complexity. Then in the second stage, the FTS model with the universe of discourse divided by the FCM algorithm, i.e., FCM-FTS method, is used to model and predict each component sequence after the second decomposition, and the predicted values of all the components are linearly summed up to get the final results.

The main contributions of this paper can be summarized as below:

(1) Most importantly, we develop a hybrid forecasting model with high accuracy and high robustness, and its effectiveness has been verified in forecasting Hong Kong’s inbound tourist arrivals. According to the experimental results, our hybrid model can decompose and extract the complex features of the raw series, thus obtaining more accurate and more robust prediction results. Hence, it is a very effective tool to predict real tourism markets and can provide a valuable reference for tourism decision-making.

(2) Our hybrid forecasting model has two major differences from the traditional hybrid approaches. Firstly, a different strategy for data preprocessing is presented. Most earlier research adopts a single decomposition approach to decompose the raw series, and such a preprocessing strategy may not fully extract the main features of the series. Therefore, this paper presents a dual decomposition strategy based on X12-ARIMA and ICEEMDAN, which can overcome the drawbacks of the traditional data preprocessing strategies and further improve the prediction performance. Secondly, an effective clustering algorithm, i.e., FCM, is adopted to optimize the domain partition module of the FTS model, which improves the performance of that model.

(3) In terms of numerical experiments, this paper compares the proposed hybrid model not only with five commonly used single forecasting models but also with six other hybrid forecasting models using different data preprocessing strategies, which comprehensively demonstrates the superiority of our model. In addition, the benchmark models considered represent currently popular modeling strategies and ideas, similar to those in high-quality papers published in international journals in recent years. On the basis of the comparative study, this paper verifies and demonstrates in detail the significance of the components of our hybrid model, such as the validity of X12-ARIMA and ICEEMDAN, as well as the superiority of the dual decomposition strategy and the FCM method. Moreover, this paper also verifies the robustness of our hybrid model. To sum up, we demonstrate that the developed tourist arrivals forecasting model is superior to the benchmarks and has practical value for real tourism markets.

(4) To verify the model prediction performance, this paper provides a scientific evaluation and an in-depth discussion of the prediction results. We use six typical criteria, including average error (AE), mean absolute percentage error (MAPE), root mean square error (RMSE), mean absolute error (MAE), Theil inequality coefficient (TIC), and index of agreement (IA), to evaluate the performance of the forecasting models. Moreover, we further demonstrate the superiority of the proposed model through a discussion from five aspects: (a) the model robustness according to the prediction performance in different years; (b) the significance of the model from the perspective of statistics; (c) the forecasting effectiveness based on the comparative studies; (d) the improvement percentage relative to the benchmarks; and (e) the grey relational analysis of all the models involved.

The rest of the paper is arranged as follows. Section 2 introduces the main methods involved and the overall framework of the developed hybrid forecasting model. Section 3 mainly describes the data, conducts the comparative experiments, and analyzes the prediction results. Section 4 presents the related discussions. Finally, Sect. 5 concludes the study.

2 Methods

This section presents a hybrid model of dual decomposition and an improved fuzzy time series method for tourist arrivals forecasting. Specifically, Sects. 2.1–2.5 describe the relevant methods for decomposition and prediction, and Sect. 2.6 provides the overall process of our hybrid model. Table 3 in Appendix 1 lists the nomenclature used in this paper.

2.1 X12-ARIMA

X12-ARIMA [31] is a popular seasonal adjustment method developed by the United States Census Bureau, which mainly includes two functional modules: regARIMA module and X-11 seasonal decomposition module. In particular, the regARIMA module can carry out various types of data preprocessing, such as outlier detection and correction, estimation, and elimination of the influence of calendar factors [32]. The X-11 seasonal decomposition module decomposes the preprocessed data through multiple iterations of moving average method to form a seasonal factors series and a seasonally adjusted series. For the purpose of this paper, we just introduce the basic algorithm for the X-11 seasonal decomposition module.

It is assumed that the monthly series can be decomposed into a seasonal factor (i.e., S), a trend-cycle factor (i.e., TC), and an irregular factor (i.e., I). Two main steps are involved in the X-11 seasonal decomposition module:

Step 1 Estimation of the initial components

Firstly, the 2 × 12 moving average method is applied to estimate the initial TC component sequences. Then, this TC component is subtracted from the raw time series to obtain the initial estimation of the seasonal-irregular component (i.e., SI). Next, the 3 × 3 moving average method is applied to estimate the initial seasonal component, which then is normalized by a 2 × 12 moving average. Finally, the normalized seasonal component is subtracted from the raw series to obtain the initial estimation of the seasonally adjusted series (i.e., SA).

Step 2 Final seasonal adjustment

Firstly, the Henderson moving average method is used to obtain the second estimation of the TC component from the initially estimated SA series. Then, this new TC component is subtracted from the raw series to obtain the second estimation of the SI component. Next, the 3 × 5 moving average method is applied to estimate a new seasonal component, which then is normalized by a 2 × 12 moving average. Finally, the normalized seasonal component is removed from the raw series to obtain the final SA series.

It is worth noting that the selection of the number of terms in the moving average is critical in the X-11 seasonal decomposition module. The higher the number of terms, the more irregular factors can be eliminated. But as the number of terms increases, more information is lost. For monthly series that change periodically on a 12-month basis, a centered 12-term moving average can be considered for obtaining the initial TC component and the normalized seasonal component. However, if the series to be decomposed is also an economic flow time series (such as the monthly tourist arrivals), a 2 × 12 moving average is required to ensure that each element of the newly generated sequence after using the moving average is aligned with that of the raw series. For other parts of the module, the number of terms in the moving average is specified with reference to the standard X-11 procedure [33].
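The first two operations of Step 1 can be sketched as follows. This is a minimal illustration rather than the X12-ARIMA program itself: it assumes an additive decomposition (the text speaks of subtracting components), it leaves the first and last six months undefined instead of extending the series, and the function names `ma_2x12` and `initial_si` are hypothetical.

```python
def ma_2x12(series):
    """Centered 2x12 moving average (13 terms: 1/24, eleven times 1/12, 1/24).

    Positions where the 13-term window does not fit are returned as None.
    """
    weights = [1 / 24] + [1 / 12] * 11 + [1 / 24]
    n = len(series)
    out = [None] * n
    for t in range(6, n - 6):
        window = series[t - 6 : t + 7]
        out[t] = sum(w * v for w, v in zip(weights, window))
    return out


def initial_si(series):
    """Initial seasonal-irregular estimate: raw series minus initial TC (additive assumption)."""
    tc = ma_2x12(series)
    return [None if m is None else y - m for y, m in zip(series, tc)]


# Toy monthly series: linear trend plus a 12-month pattern that sums to zero.
seasonal = [5, -3, 2, -4, 1, 0, 3, -2, -1, 4, -6, 1]
series = [100 + 2 * t + seasonal[t % 12] for t in range(36)]
tc = ma_2x12(series)
si = initial_si(series)
```

Because the 13 weights are symmetric and give each calendar month a total weight of 1/12, the filter reproduces a linear trend exactly and cancels a stable 12-month pattern, which is why the output stays aligned with the months of the raw series.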

2.2 ICEEMDAN

Traditional empirical mode decomposition methods, including empirical mode decomposition (EMD) [22], ensemble EMD (EEMD) [34], and complete ensemble EMD with adaptive noise (CEEMDAN) [35], suffer from problems such as mode mixing, residual noise, and redundant or spurious components after decomposition. To address these problems, Colominas et al. [29] proposed an improved complete ensemble EMD with adaptive noise (ICEEMDAN), which has a higher ability to extract the components of complex time series with different time scale features. The following are the main steps and relevant formulas of this algorithm:

Step 1 Calculate the first residue of the original series using the following equation:

$$r_{1} = \frac{1}{I}\sum\limits_{i = 1}^{I} {M\left( {x + \beta_{1} E_{1} \left( {w^{i} } \right)} \right)} ,$$

(1)

where $E_{k}(\cdot)$ is an operator, which uses the EMD method to decompose a series into several intrinsic mode functions (IMFs) and one residual, with the $k$-th IMF component (i.e., the $k$-th mode) as output; $M(\cdot)$ also represents an operator, which produces the local mean (i.e., the mean of the upper and lower envelopes) of a series; $x$ represents the original time series; $w^{i}$ indicates a realization of white noise, whose mean value is zero and variance is one, $i = 1,2,...,I$, and $I$ is the number of times that white noise is added; $\beta_{k}$ is the parameter that controls the energy of the white noise in each iteration, $k = 1,2,...,K$, and $K$ is the maximum number of iterations. Mode mixing is defined as either a single IMF consisting of components of widely disparate scales or a component of a similar scale residing in different IMFs [34]. The purpose of including white noise in this equation is to avoid the mode mixing problem so that the components of the complex time series with different time scales can be identified and extracted more accurately.

Step 2 Subtract the first residue from the original series to get the first mode $d_{1}$:

$$d_{1} = x - r_{1} .$$

(2)

Step 3 Obtain the second residue of the original time series, i.e., $r_{2}$, in the same way as in step 1, and finally obtain the second mode $d_{2}$ by the following equation:

$$d_{2} = r_{1} - r_{2} = r_{1} - \frac{1}{I}\sum\limits_{i = 1}^{I} {M\left( {r_{1} + \beta_{2} E_{2} \left( {w^{i} } \right)} \right)} .$$

(3)

Step 4 Obtain the k-th residue and k-th mode by the following equation:

$$d_{k} = r_{k - 1} - r_{k} = r_{k - 1} - \frac{1}{I}\sum\limits_{i = 1}^{I} {M\left( {r_{k - 1} + \beta_{k} E_{k} \left( {w^{i} } \right)} \right)} .$$

(4)

Step 5 Return to step 4 for next k until the residue can no longer be decomposed or $K$ is reached.
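The recursion in Eqs. (1)–(4) can be sketched structurally as below. This is not a working ICEEMDAN implementation: the operators $M(\cdot)$ and $E_{k}(\cdot)$ are passed in as callables, and the two stand-ins used in the example (`smooth3` and an identity "mode" function) are hypothetical placeholders chosen only so the code runs. The stopping test of Step 5 is reduced to a fixed list of `betas`.

```python
import random


def iceemdan_recursion(x, local_mean, emd_mode, betas, noises):
    """Apply r_k = mean_i M(r_{k-1} + beta_k * E_k(w^i)) and d_k = r_{k-1} - r_k.

    local_mean(series) plays the role of M(.), emd_mode(noise, k) the role of E_k(.).
    Returns the list of modes d_1..d_K and the final residue r_K.
    """
    residue = list(x)  # r_0 is the original series
    modes = []
    for k, beta in enumerate(betas, start=1):
        acc = [0.0] * len(x)
        for w in noises:
            e_k = emd_mode(w, k)
            perturbed = [r + beta * e for r, e in zip(residue, e_k)]
            acc = [a + v for a, v in zip(acc, local_mean(perturbed))]
        new_residue = [a / len(noises) for a in acc]
        modes.append([old - new for old, new in zip(residue, new_residue)])
        residue = new_residue
    return modes, residue


def smooth3(s):
    """Stand-in for M(.): 3-point moving average with repeated endpoints (NOT an envelope mean)."""
    padded = [s[0]] + list(s) + [s[-1]]
    return [(padded[i] + padded[i + 1] + padded[i + 2]) / 3 for i in range(len(s))]


rng = random.Random(0)
x = [0.1 * t + (t % 5) for t in range(40)]
noises = [[rng.gauss(0, 1) for _ in x] for _ in range(10)]  # I = 10 realizations
# Stand-in for E_k(.): returns the noise unchanged instead of its k-th EMD mode.
modes, residue = iceemdan_recursion(x, smooth3, lambda w, k: w, [0.2, 0.1, 0.05], noises)
```

Whatever operators are plugged in, the modes and the final residue add back to the original series, because each mode is the difference of two consecutive residues.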

2.3 Fuzzy C-means clustering

The fuzzy C-means (FCM) algorithm is one of the commonly used clustering methods [36]. The basic idea of FCM algorithm is to continuously update the cluster centers of all data and the membership degrees of each data point belonging to all cluster centers through iterative calculation, until the dissimilarity index function and the iteration error reach the preset minimum value. The following are the main steps and related formulas of FCM algorithm:

Step 1 Calculate the number of cluster centers:

$$c = \left[ \frac{x_{\max } - x_{\min }}{\frac{1}{n_{1} - 1}\sum\limits_{t = 2}^{n_{1}} \left| x_{t} - x_{t - 1} \right|} \right],$$

(5)

where $x_{t} \in R$ $(t = 1,2, \cdots ,n_{1})$ is an element of the original series $x$, and $n_{1}$ is the number of elements in $x$; $c$ is the number of cluster centers, $c \in \left\{ {2,3,...,n_{1} - 1} \right\}$; $x_{\max }$ and $x_{\min }$ represent the maximum and minimum values in the original series, respectively; and $[\cdot]$ represents the rounding operation.

Step 2 Initialize the cluster centers. Randomly select $c$ samples in $x$ as the initial cluster centers $V(0) = \left\{ {v_{01} ,v_{02} ,...,v_{0c} } \right\}$.

Step 3 Calculate the membership matrix:

$$u_{ij} = \left( {\sum\limits_{r = 1}^{c} {\frac{{d_{ij} }}{{d_{rj} }}} } \right)^{ - 1} ,$$

(6)

where $d_{ij}$ is the Euclidean distance from the element $x_{j}$ to the cluster center $v_{i} , \, i = 1,2,...,c, \, j = 1,2,...,n_{1}$.

Step 4 Iterate new cluster centers:

$$v_{i} = \frac{\sum\limits_{j = 1}^{n_{1}} u_{ij}^{m} x_{j}}{\sum\limits_{j = 1}^{n_{1}} u_{ij}^{m}},$$

(7)

where $m$ is the weighted index of membership degree, which is used to adjust the fuzzy degree of the clustering results, generally $m = 2$.

Step 5 Repeat steps 3 and 4 iteratively until the condition $\left\| {V\left( {k + 1} \right) - V\left( k \right)} \right\| < \varepsilon$ is satisfied ($\varepsilon$ is the iteration stop threshold) or the maximum iterations are reached.
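Steps 1–5 can be sketched for a one-dimensional series as follows. Several points are assumptions rather than details taken from the paper: the membership update uses the standard FCM exponent $2/(m-1)$ on the distance ratio (so the ratio is squared when $m = 2$), whereas Eq. (6) is printed without an exponent; rounding in Eq. (5) uses Python's built-in `round`; the stopping norm is the largest absolute shift of any center; and all function names are hypothetical.

```python
import random


def n_clusters(x):
    """Eq. (5): range of the series divided by its mean absolute first difference, rounded."""
    mean_abs_diff = sum(abs(b - a) for a, b in zip(x, x[1:])) / (len(x) - 1)
    return round((max(x) - min(x)) / mean_abs_diff)


def memberships(x, centers, m):
    """Membership matrix u[i][j] of point x[j] in cluster i (standard FCM exponent assumed)."""
    p = 2.0 / (m - 1.0)
    u = [[0.0] * len(x) for _ in centers]
    for j, x_j in enumerate(x):
        d = [abs(x_j - v) for v in centers]
        zeros = [i for i, d_i in enumerate(d) if d_i == 0.0]
        if zeros:  # the point sits exactly on one or more centers
            for i in zeros:
                u[i][j] = 1.0 / len(zeros)
        else:
            for i, d_i in enumerate(d):
                u[i][j] = 1.0 / sum((d_i / d_r) ** p for d_r in d)
    return u


def fcm_1d(x, c, m=2.0, eps=1e-6, max_iter=100, seed=0):
    """Fuzzy C-means on a list of floats; returns (centers, membership matrix)."""
    centers = random.Random(seed).sample(list(x), c)  # Step 2: random initial centers
    for _ in range(max_iter):
        u = memberships(x, centers, m)  # Step 3
        new_centers = [
            sum(u_ij ** m * x_j for u_ij, x_j in zip(row, x)) / sum(u_ij ** m for u_ij in row)
            for row in u
        ]  # Step 4, Eq. (7)
        shift = max(abs(a - b) for a, b in zip(centers, new_centers))
        centers = new_centers
        if shift < eps:  # Step 5
            break
    return centers, memberships(x, centers, m)


data = [1.0, 1.1, 0.9, 1.05, 10.0, 10.2, 9.8, 10.1]
centers, u = fcm_1d(data, c=2)
```

On this toy data the two centers settle near the means of the two groups, and each column of the membership matrix sums to one.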

2.4 Fuzzy time series algorithm

On the basis of the fuzzy set theory and other concepts proposed by Zadeh [37], Song and Chissom [38, 39] established the fuzzy time series (FTS) model, which was successfully used to predict the enrollment data for the University of Alabama. Subsequently, the traditional FTS model and its variants were widely applied for forecasting in other fields (e.g., temperature, stock index, and network traffic) and have achieved good forecasting results [40, 41]. The basic definitions of FTS are as below:

Definition 1 It is assumed that $U$ is a given universe of discourse, which can be divided into $n_{2}$ subintervals in order, then $U = \left\{ {u_{1} ,u_{2} , \cdots ,u_{{n_{2} }} } \right\}$. Define $A$ as the fuzzy set on the universe $U$, expressed as:

$$A = \frac{{f_{A} \left( {u_{1} } \right)}}{{u_{1} }} + \frac{{f_{A} \left( {u_{2} } \right)}}{{u_{2} }} + \cdots + \frac{{f_{A} \left( {u_{{n_{2} }} } \right)}}{{u_{{n_{2} }} }},$$

(8)

where $f_{A} ( \, )$ is the membership function of fuzzy set $A$, $f_{A} ( \, ) \in \left[ {0,1} \right]$; $f_{A} (u_{i} )$ represents the membership degree of the interval $u_{i} (1 \le i \le n_{2} )$ with respect to the fuzzy set $A$.

Definition 2 Let the original time series $Y = \left\{ {y_{t} } \right\} = \left\{ {Y(t)} \right\}(t = 1,2,...)$ be a subset of the real number field R. Define a set of fuzzy sets $f_{i} (t)\;(i = 1,2,...)$ on the series $Y$, and the series $F\left( t \right) = \left\{ {f_{1} \left( t \right),f_{2} \left( t \right), \cdots } \right\}$, then $F = \left\{ {F(t)} \right\}(t = 1,2,...)$ is a fuzzy time series defined on $Y$.

Definition 3 Suppose there is a fuzzy logical relationship (FLR), i.e., $R\left( {t,t - 1} \right)$, between $F\left( t \right)$ and $F\left( {t - 1} \right)$, which satisfies:

$$F\left( t \right) = F\left( {t - 1} \right) \odot R\left( {t,t - 1} \right),$$

(9)

then $F\left( t \right)$ is said to be determined only by $F\left( {t - 1} \right)$ ($\odot$ is a combination operator). If $F\left( {t - 1} \right) = A_{i}$ and $F\left( t \right) = A_{j}$, the FLR can also be expressed as $A_{i} \to A_{j}$, where $A_{i}$ and $A_{j}$ are called the left-hand side (LHS) and right-hand side (RHS) of the FLR, respectively.

Definition 4 All the single FLRs with the same LHS can be composed into the same fuzzy logical relationship set (FLRS). For example, the three FLRs ($A_{l} \to A_{r1}$, $A_{l} \to A_{r2}$, $A_{l} \to A_{r3}$) with the same LHS can be composed into one FLRS, which is expressed as $A_{l} \to A_{r1} ,\;A_{r2} ,A_{r3}$.
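Definitions 3 and 4 can be illustrated with a short sketch that pairs consecutive fuzzified labels into first-order FLRs and then groups them by LHS. The label sequence and the function names are hypothetical and serve only as an example.

```python
from collections import defaultdict


def build_flrs(labels):
    """Pair consecutive fuzzified labels into first-order FLRs (LHS, RHS)."""
    return list(zip(labels, labels[1:]))


def group_flrs(flrs):
    """Group FLRs with the same LHS into one FLRS, keeping each distinct RHS once."""
    groups = defaultdict(list)
    for lhs, rhs in flrs:
        if rhs not in groups[lhs]:
            groups[lhs].append(rhs)
    return dict(groups)


labels = ["A1", "A2", "A1", "A3", "A1", "A2"]
flrs = build_flrs(labels)
flr_sets = group_flrs(flrs)
# flr_sets == {"A1": ["A2", "A3"], "A2": ["A1"], "A3": ["A1"]}
```

Here the FLRs $A_{1} \to A_{2}$ and $A_{1} \to A_{3}$ share the same LHS and are therefore merged into the single FLRS $A_{1} \to A_{2}, A_{3}$.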

2.5 FCM-FTS model

For fuzzy time series, the unsupervised discretization method was generally used to obtain the equal-width intervals, which is simple and convenient. However, equal-width interval partitioning method is not very interpretable for the intervals and the forecasting results are not accurate enough [42]. The FCM clustering algorithm partitions the universe of discourse according to data characteristics, which is more objective. Furthermore, this algorithm can explain the actual meaning of each sub-interval by the explanation of the clustering center, which is more scientific and reasonable than the equal-width interval partitioning method. In this paper, the FTS model optimized by Chen [43] with the FCM algorithm partitioning the universe of discourse, i.e., FCM-FTS model, is applied for prediction. The specific steps are as follows:

Step 1 Test the stationarity of the time series to be predicted with the augmented Dickey-Fuller (ADF) test [44]. If the series is stationary, go directly to step 2. Otherwise, make the series stationary by differencing it [7].

Step 2 Divide the universe $U$ into $n_{2}$ intervals by the FCM clustering algorithm, then $U = \left\{ {u_{1} ,u_{2} ,...,u_{{n_{2} }} } \right\}$.

Step 3 Define the fuzzy sets for the raw time series by determining the fuzzy membership function. Construct the fuzzy set $A_{i}$ based on the intervals, with the fuzzy membership function $f_{{A_{i} }} (u_{j} )$ defined as follows [45]:

$$f_{{A_{i} }} (u_{j} ) = \left\{ {\begin{array}{*{20}l} {1,\;\;\;i = j} \hfill \\ {0.5,\;\;\;i = j + 1} \hfill \\ {0,\;\;\;{\text{others}}.} \hfill \\ \end{array} } \right.$$

(10)

Step 4 Fuzzify the actual values. Fuzzify a raw value to $A_{i}$ when the highest degree of membership of that raw value is in $A_{i}$ [43].

$$\text{fuzzify}(\text{actual}_{t}) = A_{i} \quad \text{if } f_{\text{actual}_{t}} (A_{i}) = \max \left[ f_{\text{actual}_{t}} (A_{z}) \right], \quad z = 1,2,\ldots,M,$$

(11)

where $f_{{{\text{actual}}_{t} }} (A_{z} )$ denotes the degree of membership of the actual value at t under $A_{z}$, and $M$ denotes the number of the fuzzy sets.

Step 5 Establish and group the FLRs. According to Definitions 3 and 4 in Sect. 2.4, the first-order FLRs and FLRSs are constructed for all fuzzy sets of the fuzzy time series.

Step 6 Determine and standardize the weight matrix. The weights can be calculated and standardized based on step 5, and then the centroid defuzzification method can be used to further calculate the defuzzification matrix.

$$\begin{gathered} W\_s(t) = (W_{1}^{\prime}, W_{2}^{\prime}, \ldots, W_{k}^{\prime}), \hfill \\ W_{i}^{\prime} = W_{i} / \sum\limits_{j = 1}^{k} {W_{j} } , \hfill \\ \end{gathered}$$

(12)

where $W_{i}$ is the unstandardized weighting matrix element, and $W_{i}^{\prime}$ denotes the standardized one. $W\_s$ represents the standardized weighting matrix.

Step 7 Obtain the forecasting results. Multiply the defuzzified matrix by standardized weighting matrices to obtain the rudimentary forecasting results:

$$\hat{F}(t) = D(t - 1) \times W\_s(t - 1),$$

(13)

where $\hat{F}(t)$ denotes the forecasting result and $D$ denotes the defuzzified matrix.
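Steps 4, 6, and 7 can be sketched for a single forecast as below. The details are assumptions made for illustration: the unstandardized weight $W_{i}$ is taken to be the number of times a given RHS follows the current LHS, the defuzzified value of each fuzzy set is taken to be its cluster center, a value is fuzzified to the set with the nearest center, and an LHS with no recorded FLR falls back to its own center. All names and the toy numbers are hypothetical.

```python
from collections import Counter


def fuzzify(value, centers):
    """Step 4 (assumed form): assign the label whose center is nearest to the value."""
    return min(centers, key=lambda label: abs(value - centers[label]))


def standardized_weights(labels, lhs):
    """Eq. (12): count each RHS that follows `lhs`, then divide by the total count."""
    counts = Counter(b for a, b in zip(labels, labels[1:]) if a == lhs)
    total = sum(counts.values())
    return {rhs: n / total for rhs, n in counts.items()}


def forecast_next(labels, centers):
    """Eq. (13): weighted sum of defuzzified RHS values for the most recent label."""
    lhs = labels[-1]
    weights = standardized_weights(labels, lhs)
    if not weights:  # no FLR has this LHS yet (fallback is an assumption)
        return centers[lhs]
    return sum(centers[rhs] * w for rhs, w in weights.items())


centers = {"A1": 10.0, "A2": 20.0, "A3": 30.0}
history = [11.0, 19.0, 9.5, 31.0, 10.2, 21.0, 10.0]
labels = [fuzzify(v, centers) for v in history]
prediction = forecast_next(labels, centers)
```

In this toy history the last label is $A_{1}$, which was followed by $A_{2}$ twice and by $A_{3}$ once, so the standardized weights are 2/3 and 1/3 and the forecast is $20 \times 2/3 + 30 \times 1/3$.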

2.6 Overall process of the proposed model

To forecast tourist arrivals, we propose a novel hybrid model of X12-ARIMA, ICEEMDAN, FCM, and FTS, namely X12-ARIMA-ICEEMDAN-FCM-FTS model. This hybrid model includes two stages, i.e., dual decomposition and integrated forecasting. Figure 1 shows the overall process of our hybrid forecasting model, with four main steps involved as follows:

2.6.1 Stage 1: Dual decomposition

Step 1: Considering the seasonal characteristics of the tourist arrivals data, first the original time series is decomposed by X12-ARIMA method, extracting the seasonal component and obtaining the seasonally adjusted series.

Step 2: ICEEMDAN is then used to decompose the seasonally adjusted series into n-1 intrinsic mode functions (${\text{IMF}}_{1}$, ${\text{IMF}}_{{\text{2}}}$,…, ${\text{IMF}}_{{n - 1}}$) with different time scale features and one smooth residual series (Residue), in order to reduce the data complexity.

2.6.2 Stage 2: Integrated forecasting

Step 3 The FCM-FTS method is used to model and predict the seasonal factors series, n-1 IMFs component series, and the residual series, respectively.

Step 4 Finally, the predicted values for all the components, respectively noted as SEA', ${\text{IMF}}_{1}$', ${\text{IMF}}_{{\text{2}}}$',…, ${\text{IMF}}_{{n - 1}}$', and Residue', are linearly summed up to get the final prediction results.
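The integration in Steps 3 and 4 amounts to forecasting each component on its own and adding the results. The sketch below shows only that structure: the component values are made-up numbers, and `last_value` is a naive stand-in for the FCM-FTS forecaster, not the method used in the paper.

```python
def hybrid_forecast(components, forecaster):
    """Forecast every component series separately and sum the one-step forecasts."""
    return sum(forecaster(series) for series in components.values())


def last_value(series):
    """Naive stand-in forecaster: repeat the most recent observation."""
    return series[-1]


components = {
    "SEA": [1.0, 2.0, 3.0],        # seasonal factors from the first decomposition
    "IMF1": [0.5, -0.5, 0.2],      # one IMF from the second decomposition
    "Residue": [100.0, 101.0, 102.0],
}
final_prediction = hybrid_forecast(components, last_value)
```

Any one-step forecaster with the same call signature could be substituted for `last_value` without changing the summation step.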

3 Experiment

In this section, we used the developed hybrid model to forecast Hong Kong’s inbound tourist arrivals from three countries (i.e., USA, UK, and Germany) for illustration and verification purposes. In particular, several related experiments were carried out with multiple control groups set up, and comparison and analysis were made from various aspects to verify the performance of our proposed model, in which the main parameters involved can be seen in Table 4 (in Appendix 1). Furthermore, final prediction results were taken as the average of 100 runs to avoid the influence of random factors.

3.1 Data description

The monthly tourist arrivals to Hong Kong from USA, UK, and Germany (simply noted as GER) are selected as data samples, as shown in Fig. 2. For each series, there are 168 observations, covering the period from January 2006 to December 2019, which can be obtained from Wind Database (http://www.wind.com.cn/). Meanwhile, to evaluate the model robustness, the samples are rolled backward for one year at a time, thus each sample can produce three subsamples with the same number of observations, covering the periods from January 2006 to December 2017, January 2007 to December 2018, and January 2008 to December 2019, respectively. The sample data are shown in detail in Table 5 (in Appendix 1). In addition, a link to the supplementary material related to this article (including the data and the code) can be found in Appendix 2.

In addition, the experiments conducted in this paper all perform one-step-ahead predictions. The data of each subsample can be divided into training set for model training and testing set for evaluating model performance. In particular, the data of the preceding 11 years (132 observations) are used as training set, while the following year (12 observations) as testing set. Finally, the monthly tourist arrivals in 2017, 2018, and 2019 are predicted, respectively. According to the results of the three forecasting years, the final prediction performance of the proposed model is evaluated.
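The rolling subsample scheme can be written down as index ranges. The sketch assumes observation 0 is January 2006 and observation 167 is December 2019, as described above; the function name is hypothetical.

```python
def rolling_subsamples(n_obs=168, window=144, step=12, n_test=12):
    """Return (train_indices, test_indices) for each 12-year window rolled forward by one year."""
    splits = []
    for start in range(0, n_obs - window + 1, step):
        end = start + window
        splits.append((range(start, end - n_test), range(end - n_test, end)))
    return splits


splits = rolling_subsamples()
for train, test in splits:
    first_year = 2006 + train[0] // 12
    test_year = 2006 + test[0] // 12
    print(f"train from Jan {first_year} ({len(train)} obs), test year {test_year} ({len(test)} obs)")
```

Each of the three subsamples has 132 training observations and 12 test observations, and the test years are 2017, 2018, and 2019.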

3.2 Evaluation criteria

Considering that there is no universally applicable standard for prediction model error evaluation [46], we choose six popular criteria (i.e., AE, MAPE, RMSE, MAE, TIC, and IA) to evaluate the model prediction performance, as listed in Table 1. For every criterion except IA, a smaller value (for AE, a smaller absolute value) means that the prediction is more accurate; for IA, a larger value is better.

Table 1 Evaluation criteria
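The body of Table 1 is not reproduced in this text, so the sketch below uses the commonly cited definitions of the six criteria; these formulas, the sign convention of AE (predicted minus actual), and MAPE expressed in percent are assumptions and may differ from the table in the paper.

```python
import math


def evaluate(actual, predicted):
    """Compute AE, MAE, MAPE (in %), RMSE, TIC, and IA using commonly cited definitions."""
    n = len(actual)
    errors = [p - a for a, p in zip(actual, predicted)]
    mean_actual = sum(actual) / n
    sq_sum = sum(e * e for e in errors)
    rmse = math.sqrt(sq_sum / n)
    return {
        "AE": sum(errors) / n,
        "MAE": sum(abs(e) for e in errors) / n,
        "MAPE": 100 * sum(abs(e / a) for e, a in zip(errors, actual)) / n,
        "RMSE": rmse,
        # Theil inequality coefficient: RMSE scaled by the root mean squares of both series
        "TIC": rmse / (math.sqrt(sum(a * a for a in actual) / n)
                       + math.sqrt(sum(p * p for p in predicted) / n)),
        # Index of agreement (Willmott): 1 means perfect agreement
        "IA": 1 - sq_sum / sum((abs(p - mean_actual) + abs(a - mean_actual)) ** 2
                               for a, p in zip(actual, predicted)),
    }


actual = [100.0, 200.0, 300.0]
scores = evaluate(actual, [110.0, 190.0, 330.0])
perfect = evaluate(actual, actual)
```

For a perfect forecast the five error criteria are zero and IA equals one, which matches the reading of the criteria given in the text.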
