An effective neural network and fuzzy time series-based hybridized model to handle forecasting problems of two factors

Singh, Pritpal; Borah, Bhogeswar

doi:10.1007/s10115-012-0603-9

Regular Paper
Published: 12 January 2013

Volume 38, pages 669–690, (2014)

Knowledge and Information Systems


Pritpal Singh¹ &
Bhogeswar Borah¹


Abstract

Fuzzy time series forecasting methods have been applied in several domains, such as stock market prices, temperature, sales, crop production and academic enrollments. In this paper, we introduce a model to deal with forecasting problems of two factors. The proposed model is designed using fuzzy time series and an artificial neural network. In a fuzzy time series forecasting model, the length of the intervals in the universe of discourse always affects the forecasting results. Therefore, an artificial neural network-based technique is employed to determine the intervals of the historical time series data sets by clustering them into different groups. The historical time series data sets are then fuzzified, and high-order fuzzy logical relationships are established among the fuzzified values based on the fuzzy time series method. The paper also introduces some rules for interval weighing to defuzzify the fuzzified time series data sets. Experimental results show that the proposed model exhibits higher accuracy than existing two-factors fuzzy time series models.


1 Introduction

Fuzzy time series forecasting methods have been applied in several domains, such as stock market prices, temperature, sales, crop production and academic enrollments. Song and Chissom [28–30] were the first to apply fuzzy time series theory to forecasting problems. They presented a fuzzy time series model based on fuzzy relational equations involving the max–min composition operation, and applied the model to forecast enrollments at the University of Alabama. In 1996, Chen [4] replaced the complicated max–min operations with simplified arithmetic operations, and this method produced better results. Later, many studies improved the existing methods in terms of effective lengths of intervals, fuzzy logical relations and defuzzification techniques.

Hwang et al. [12] used the differences of the available historical data as the fuzzy time series instead of using the raw numeric values directly. Unlike the Song–Chissom and Chen approaches, the model proposed by Sah and Degtiarev [24] utilizes variations of the available historical data as the fuzzy time series. Huarng tried to improve the forecasting accuracy by determining effective lengths of intervals [10] and by using heuristic approaches [11]. Lee and Chou [17] forecasted university enrollments by defining more appropriately the supports of the fuzzy numbers that represent the linguistic values of the linguistic variables.

Cheng et al. [6] used entropy minimization to create the intervals. They also used trapezoidal membership functions in the fuzzification process. Chang [2] presented a cardinality-based fuzzy time series forecasting model, which builds weighted fuzzy rules by calculating the cardinality of fuzzy relations. To obtain fewer intervals, Cheng [7] proposed a model that uses a fuzzy clustering technique to partition the data effectively. Kai et al. [13] applied the K-means clustering algorithm to partition the universe of discourse into different groups. Singh and Borah [26] forecasted university enrollments with a newly proposed algorithm that divides the universe of discourse of the historical time series data into intervals of different lengths.

Chen and Hwang [5] forecasted the daily average temperature of Taipei based on two-factors fuzzy time series. In this model, the first factor is the daily temperature, whereas the second factor is the daily cloud density. They proposed two algorithms, Algorithm-B and Algorithm-B$^*$. Their experimental results show that the accuracy rate of Algorithm-B$^*$ is better than that of Algorithm-B. Lee et al. [20] proposed a new method to forecast the daily average temperature of Taipei and the Taiwan Futures Exchange (TAIFEX). In this model, high-order fuzzy logical relationships are constructed to increase the forecasting accuracy. Chang and Chen [3] forecasted the daily temperature using fuzzy C-means and fuzzy rules interpolation techniques. In this model, rules are constructed based on the fuzzy C-means clustering algorithm, and fuzzy inference is then performed based on the multiple fuzzy rules interpolation scheme. Based on two-factors high-order fuzzy time series and automatic clustering techniques, Wang and Chen [32] proposed a new method to predict the daily average temperature and the TAIFEX. Lee et al. [18, 19] presented a new method for temperature prediction and TAIFEX forecasting based on two-factors high-order fuzzy logical relationships by hybridizing genetic algorithms with the fuzzy time series method.

In this paper, we present a new model to deal with forecasting problems of two factors. The proposed model is designed using fuzzy time series and an artificial neural network (ANN). High-order fuzzy logical relationships are also employed in the design of the model. Hence, we call this model the “Two-factors high-order neuro-fuzzy hybridized model.” The main purpose of designing such a hybridized model is explained next.

For the fuzzification of time series data sets, the determination of the lengths of the intervals is very important. In most of the models discussed above [4, 11, 12, 28, 30], the lengths of the intervals were kept the same, and no specific reason is given for using fixed lengths. Huarng [10] shows that the lengths of the intervals always affect the results of forecasting. Therefore, an ANN-based technique is adopted in this model to create effective lengths of intervals for the historical time series data sets.
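To make the notion of fixed-length intervals concrete, the following is a minimal sketch of an equal-length partition of a universe of discourse. The function names, the bounds 23 and 32, and the choice of nine intervals are illustrative assumptions and are not taken from the cited models.

```python
def equal_length_intervals(lower, upper, k):
    """Split [lower, upper] into k intervals of identical length."""
    if k < 1 or upper <= lower:
        raise ValueError("need k >= 1 and upper > lower")
    width = (upper - lower) / k
    return [(lower + i * width, lower + (i + 1) * width) for i in range(k)]


def interval_index(value, intervals):
    """Return the index of the interval containing value.

    Intervals are treated as half-open [a, b), except the last one,
    which is closed on the right so that the upper bound is covered.
    """
    for i, (a, b) in enumerate(intervals):
        if a <= value < b:
            return i
    if value == intervals[-1][1]:
        return len(intervals) - 1
    raise ValueError("value lies outside the universe of discourse")


# Hypothetical universe of discourse for daily temperatures (assumed bounds).
intervals = equal_length_intervals(23, 32, 9)
print(intervals[0])                     # first interval
print(interval_index(26.1, intervals))  # interval that 26.1 falls into
```

With such a partition every interval has the same width regardless of how the observations are distributed, which is the limitation that motivates a data-driven choice of intervals.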

Song and Chissom [28] adopted the following method to forecast enrollments of the University of Alabama:

$$\begin{aligned} Y(t)=Y(t-1)\circ R \end{aligned}$$

(1)

where $Y(t-1)$ is the fuzzified enrollment of year $(t-1)$, $Y(t)$ is the forecasted enrollment of year $t$ represented by a fuzzy set, “$\circ $” is the max–min composition operator and $R$ is the union of fuzzy relations. Computing the union of fuzzy relations in this method takes a lot of time [5]. Therefore, to improve the efficiency of the proposed model, some rules for interval weighing are proposed to defuzzify the fuzzified time series data sets. The proposed model exhibits higher accuracy than the existing models [3, 5, 18–20, 32].

The rest of the paper is organized as follows: In Sect. 2, the basic concepts of fuzzy time series are briefly explained. Section 3 presents the application of ANN for creating intervals of historical time series data sets. In Sect. 4, a new forecasting model based on the hybridization of ANN with fuzzy time series is proposed. The performance of the model is assessed and presented in Sect. 5. Conclusions and directions for future work are discussed in Sect. 6.

2 Fuzzy sets and fuzzy time series: a brief overview

In 1965, Zadeh [35] introduced the theory of fuzzy sets. According to Zadeh, “A fuzzy set is a class of objects with a continuum of grades of membership. Such a set is characterized by a membership function which assigns to each object a grade of membership ranging between zero and one.” He also presented fuzzy arithmetic theory and its application [36–38]. Based on fuzzy set theory, Song and Chissom [28–30] introduced the fuzzy time series concept. Here, we briefly review some concepts of fuzzy time series from [28–30].

Definition 1

(Fuzzy Set) A fuzzy set is a class with varying degrees of membership in the set. Let $U$ be the universe of discourse, which is discrete and finite, then fuzzy set $A$ can be defined as follows:

$$\begin{aligned} A=\left\{ \mu _A(x_1)/x_1+\mu _A(x_2)/x_2+ \cdots \right\} =\Sigma _{i}\mu _A(x_i)/x_i \end{aligned}$$

(2)

where $\mu _A$ is the membership function of $A,\,\mu _A: U\, \rightarrow \left[0,1\right]$, and $\mu _A(x_i)$ is the degree of membership of the element $x_i$ in the fuzzy set $A$. Here, the symbol “+” indicates the operation of union and the symbol “/” indicates the separator rather than the commonly used summation and division in algebra, respectively.
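As an illustration of Eq. (2), the sketch below stores a discrete fuzzy set as a mapping from elements to membership grades and prints it in the $\mu _A(x_i)/x_i$ notation. The element labels `u1` to `u3`, the grades, and the helper names are made up for illustration. The union is implemented as the pointwise maximum, which is the standard (Zadeh) definition and is assumed here rather than stated in the text.

```python
def make_fuzzy_set(grades):
    """Build a discrete fuzzy set {element: grade}, checking grades lie in [0, 1]."""
    for x, mu in grades.items():
        if not 0.0 <= mu <= 1.0:
            raise ValueError(f"grade for {x!r} outside [0, 1]: {mu}")
    return dict(grades)


def fuzzy_union(a, b):
    """Standard fuzzy union: pointwise maximum of the membership grades."""
    elements = sorted(set(a) | set(b))
    return {x: max(a.get(x, 0.0), b.get(x, 0.0)) for x in elements}


def to_zadeh_notation(a):
    """Render the set as mu(x1)/x1 + mu(x2)/x2 + ..., following Eq. (2)."""
    return " + ".join(f"{mu}/{x}" for x, mu in a.items())


# Hypothetical universe U = {u1, u2, u3} with made-up membership grades.
A = make_fuzzy_set({"u1": 1.0, "u2": 0.5, "u3": 0.0})
B = make_fuzzy_set({"u1": 0.5, "u2": 1.0, "u3": 0.5})
print(to_zadeh_notation(A))                  # 1.0/u1 + 0.5/u2 + 0.0/u3
print(to_zadeh_notation(fuzzy_union(A, B)))  # 1.0/u1 + 1.0/u2 + 0.5/u3
```

Note that “/” in the printed string only separates a grade from its element, and “+” only joins the singletons; neither denotes arithmetic.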

When $U$ is continuous and infinite, then the fuzzy set $A$ of $U$ can be defined as:

$$\begin{aligned} A=\left\{ \int \mu _A(x_i)/x_i\right\} ,\forall x_i \in U \end{aligned}$$

(3)

where the integral sign stands for the union of the fuzzy singletons, $\mu _A(x_i)/x_i$.

The fuzzy time series concept was proposed in [29]. The main difference between a traditional time series and a fuzzy time series is that the values of the former are crisp numerical values, while the values of the latter are fuzzy sets. The crisp numerical values can be represented by real numbers, whereas in fuzzy sets, the values of observations are represented by linguistic values. The definitions of fuzzy time series are briefly reviewed as follows:

Definition 2

(Fuzzy time series) Let $Y(t)\,(t=0, 1, 2, \ldots )$, a subset of the real numbers $R$, be the universe of discourse on which fuzzy sets $\mu _i(t)\,(i=1, 2, \ldots )$ are defined, and let $F(t)$ be a collection of $\mu _i(t)\,(i=1, 2, \ldots )$. Then, $F(t)$ is called a fuzzy time series on $Y(t)\,(t=0, 1, 2, \ldots )$.

From Definition 2, we can see that $F(t)$ is a function of time $t$ and that $\mu _i(t)\,(i=1, 2, \ldots )$ are the linguistic values of $F(t)$, represented by fuzzy sets. The values of $F(t)$ can be different at different times because the universe of discourse can be different at different times. Fuzzy time series can be divided into two categories: time-invariant fuzzy time series and time-variant fuzzy time series.

If $F(t)$ is caused by $F(t-1)$, that is, $F(t-1)\rightarrow F(t)$, then this relationship can be represented as follows:

$$\begin{aligned} F(t)=F(t-1)\circ R(t,t-1) \end{aligned}$$

(4)

where $R(t, t-1)$ is the fuzzy relationship between $F(t)$ and $F(t-1)$. Here, “$R$” is the union of fuzzy relations and “$\circ $” is max–min composition operator. It is also called the first-order model of $F(t)$.
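The max–min composition in Eq. (4) can be sketched as follows, assuming that $F(t-1)$ is given as a vector of membership grades over $k$ intervals and that $R(t, t-1)$ is a $k \times k$ matrix of grades. Under this assumption, component $j$ of $F(t)$ is $\max _i \min (F(t-1)_i, R_{ij})$. The function name and the numeric values are hypothetical.

```python
def max_min_compose(f_prev, relation):
    """Return F(t) = F(t-1) o R, where o is the max-min composition."""
    n_rows = len(relation)
    if len(f_prev) != n_rows:
        raise ValueError("len(f_prev) must equal the number of rows of R")
    n_cols = len(relation[0])
    return [
        # For output component j: take min along each row i, then max over i.
        max(min(f_prev[i], relation[i][j]) for i in range(n_rows))
        for j in range(n_cols)
    ]


# Made-up membership grades of F(t-1) and a made-up fuzzy relation R.
f_prev = [1.0, 0.5, 0.0]
R = [
    [0.5, 1.0, 0.5],
    [0.0, 0.5, 1.0],
    [0.0, 0.0, 0.5],
]
print(max_min_compose(f_prev, R))  # [0.5, 1.0, 0.5]
```

The cost of evaluating this composition grows with the size of $R$, and $R$ itself must first be assembled as a union of individual fuzzy relations.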

Definition 3

(Fuzzy time-variant and time-invariant series) Let $F(t)$ be a fuzzy time series, and $R(t, t-1)$ be a first-order model of $F(t)$. If $R(t, t-1)=R(t-1, t-2)$ for any time $t$, and $F(t)$ only has finite elements, then $F(t)$ is referred to as a time-invariant fuzzy time series. Otherwise, it is referred to as a time-variant fuzzy time series.

3 ANN and its application for creation of intervals

An ANN is a computational model inspired by the human brain [1, 27]. It is composed of a large number of interconnected nodes, or neurons, which usually operate in parallel and are configured in regular architectures. Researchers employ ANNs in various forecasting problems (such as electric load forecasting [31], short-term precipitation forecasting [16] and long-range summer monsoon rainfall forecasting [25]) because of their capability to extract relationships between input and output data.

Data clustering is a popular approach for automatically finding classes, concepts, or groups of patterns [9]. Time series data are pervasive across all human endeavors, and their clustering is one of the most fundamental applications of data mining [14, 23]. Many data clustering algorithms [8, 22, 33] have been proposed in the literature, but their applications are limited to the extraction of patterns that represent points in multi-dimensional spaces of fixed dimensionality [34]. In our proposed model, a distance-based clustering algorithm, the self-organizing feature map (SOFM), is employed to determine the intervals of the historical time series data sets by clustering them into different groups. The SOFM, developed by Kohonen [15], is a class of neural networks with neurons arranged in a low-dimensional (often two-dimensional) structure and trained by an iterative unsupervised, or self-organizing, procedure [21]. The SOFM converts patterns of arbitrary dimensionality into the responses of one-dimensional or two-dimensional arrays of neurons, that is, it converts a wide pattern space into a feature space. A neural network performing such a mapping is called a feature map. The training process of the SOFM consists of the following steps [27]:

step 1

Initialize the weights ($W_{uv}$) and learning rate ($\alpha $).

step 2

While the stopping condition is false, perform Steps 3–9.

step 3

For each input vector (X), perform Steps 4–6.

step 4

For each $v=1$ to m, compute the square of the Euclidean distance as:

$$\begin{aligned} D(v)=\sum _{u=1}^{n}(X_u-W_{uv})^2 \end{aligned}$$

(5)

step 5

Obtain the winning unit index (J), so that $D(J)$ is minimum.

step 6

Calculate weights of winning unit as:

$$\begin{aligned} W_{uv}(new)=W_{uv}(old)+\alpha [X_u-W_{uv}(old)] \end{aligned}$$

(6)

step 7

Reduce the learning rate ($\alpha $) by using the following formula:

$$\begin{aligned} \alpha (t+1)=0.5\alpha (t) \end{aligned}$$

(7)

step 8

Reduce radius of topological neighborhood network.

step 9

Test for stopping condition of the network.

Based on the above-mentioned algorithm, the historical time series data sets are partitioned into intervals of different lengths. These intervals are presented in Sect. 4.
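The training steps above can be sketched in plain Python as follows. This is a minimal illustration under several assumptions that the text does not specify: the units form a one-dimensional chain, the weights are initialized at random within the data range, the stopping condition is a fixed number of epochs, the neighbors of the winner within the current radius are updated together with the winner, and the radius shrinks by one per epoch. The helper `cluster_ranges`, which reports the minimum and maximum observation of each cluster, and the sample temperatures are also hypothetical and are not the interval construction or data of the paper.

```python
import random


def squared_distance(x, w):
    """Step 4: squared Euclidean distance between input x and weights w, Eq. (5)."""
    return sum((xu - wu) ** 2 for xu, wu in zip(x, w))


def winner(x, weights):
    """Step 5: index J of the unit with the minimum distance D(J)."""
    dist = [squared_distance(x, w) for w in weights]
    return dist.index(min(dist))


def train_sofm(data, m, alpha=0.5, epochs=10, radius=1, seed=0):
    """Train a 1-D chain of m units on n-dimensional input vectors."""
    rng = random.Random(seed)
    n = len(data[0])
    lo = [min(x[u] for x in data) for u in range(n)]
    hi = [max(x[u] for x in data) for u in range(n)]
    # Step 1: initialize the weights (assumed: uniform within the data range).
    weights = [[rng.uniform(lo[u], hi[u]) for u in range(n)] for _ in range(m)]
    for _ in range(epochs):  # Steps 2 and 9: assumed fixed-epoch stopping rule
        for x in data:       # Step 3
            j = winner(x, weights)
            # Step 6: update the winner and its neighbors within radius, Eq. (6).
            for v in range(max(0, j - radius), min(m, j + radius + 1)):
                for u in range(n):
                    weights[v][u] += alpha * (x[u] - weights[v][u])
        alpha *= 0.5                 # Step 7: halve the learning rate, Eq. (7)
        radius = max(0, radius - 1)  # Step 8: shrink the neighborhood
    return weights


def cluster_ranges(data, weights):
    """Group scalar observations by winning unit and report (min, max) per group."""
    groups = {}
    for x in data:
        groups.setdefault(winner(x, weights), []).append(x[0])
    return sorted((min(g), max(g)) for g in groups.values())


# Made-up scalar observations, each wrapped as a one-dimensional input vector.
temps = [[26.1], [26.4], [27.0], [29.5], [29.8], [30.2]]
weights = train_sofm(temps, m=3)
print(cluster_ranges(temps, weights))
```

Because the clusters follow the density of the observations, the resulting ranges generally differ in width, in contrast to a fixed-length partition.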

4 Proposed ANN and fuzzy time series hybridized model

In this section, we introduce a new forecasting model based on the hybridization of ANN with fuzzy time series. The architecture of the proposed model consists of six phases, as shown in Fig. 1. To verify the model, the historical data sets of the daily average temperature and the daily cloud density from June 1996 to September 1996 in Taipei, Taiwan [5] are used, which are shown in Tables 1 and 2, respectively. In these data sets, the daily average temperature is called the main factor, and the daily average cloud density is called the second factor.

Table 1 Historical data of the daily average temperature from June 1996 to September 1996 in Taipei (Unit: $^\circ \text{ C}$)
