Hesitant probabilistic fuzzy set based time series forecasting method

Gupta, Krishna Kumar; Kumar, Sanjay

doi:10.1007/s41066-018-0126-1


Original Paper
Published: 17 August 2018

Volume 4, pages 739–758, (2019)

Granular Computing


Abstract

Uncertainties due to randomness and fuzziness coexist in a system. Recently, probabilistic fuzzy sets have gained the attention of researchers because they handle both types of uncertainty in a single framework. In this paper, we introduce hesitant probabilistic fuzzy sets in time series forecasting to address non-stochastic non-determinism along with both types of uncertainty, and we propose a hesitant probabilistic fuzzy set based time series forecasting method. We also propose an aggregation operator that uses membership grades, weights and immediate probabilities to aggregate hesitant probabilistic fuzzy elements into fuzzy elements. The advantages of the proposed forecasting method are that it includes both types of uncertainty and non-stochastic hesitation in a single framework and that it enhances the accuracy of the forecasted outputs. The proposed method has been implemented to forecast the historical student enrolment data of the University of Alabama and the share market prices of State Bank of India (SBI) at the Bombay Stock Exchange (BSE), India. The effectiveness of the proposed method has been examined and tested using error measures.


1 Introduction

Time series forecasting has long been an important area of research. It has found applications in many fields, including finance, engineering, medicine and management science. Regression analysis, autoregressive integrated moving average, simple moving average and simple exponential smoothing are a few statistical models that are commonly used in conventional time series forecasting. These models address the probabilistic uncertainty of time series data, but fail to include the non-probabilistic uncertainty that arises from inaccuracies in measurement and linguistic representation. Fuzzy sets (Zadeh 1965) were therefore introduced into time series forecasting to overcome the limitations of conventional methods and to arrive at realistic results, with higher accuracy in the forecasted output, under a linguistic representation of time series data.

Song and Chissom (1993a, b, 1994) proposed time series forecasting models based on fuzzy sets (Zadeh 1965) to deal with the uncertainty in time series forecasting that arises from vague, inaccurate and linguistic representation of time series data. Chen (1996) proposed simple arithmetic operators in place of the complex max–min composition operators used by Song and Chissom (1993a, b, 1994). Afterwards, many researchers (Chen and Hwang 2000; Huarng 2001; Song 2003; Lee and Chou 2004; Liu 2007; Cheng et al. 2008, 2016; Huarng and Yu 2006; Chen et al. 2009; Chen and Tanuwijaya 2011) proposed various fuzzy time series forecasting models that innovate either in partitioning the universe of discourse or in the fuzzy logical relations to enhance forecast accuracy. Chen and Chen (2014), Chen and Chen (2015), Chen and Phuong (2017) and Wang and Mishra (2018) proposed various forecasting methods using granular computing and adaptive and intelligent fuzzy time series forecasting models. Chen and Chen (2011), Chen (2014), Ye et al. (2016), Yolcu et al. (2016), Kocak (2017) and Efendi et al. (2018) proposed high-order fuzzy time series forecasting methods based on fuzzy logical relations for stock trading. Recently, Bas et al. (2018) proposed ridge regression for forecasting using type-1 fuzzy functions. Granular computing is intended as a convergence of numerous modeling approaches (Pedrycz and Chen 2011, 2015a, b). Various researchers (Livi and Sadeghian 2016; Wilke and Portmann 2016; Liu and Cocea 2017; D’Aniello et al. 2017) have used the granular computing approach in modeling and computing with uncertainty, human-data interaction, machine learning and approximate reasoning. Deng et al. (2016) and Maciel et al. (2016) proposed time series forecasting models using multi-granularity and granular analytics.

Although fuzzy time series methods have achieved great success in forecasting under non-probabilistic uncertainty, they fail to handle non-determinism. Non-determinism in fuzzy time series forecasting occurs due to hesitation. This hesitation is non-probabilistic: it arises because a fuzzy set uses a single function for both membership and non-membership, and it cannot be handled by a random probability distribution. To deal with this non-probabilistic non-determinism in fuzzy time series forecasting, many researchers (Joshi and Kumar 2012a, b; Gangwar and Kumar 2014; Kumar and Gangwar 2015, 2016; Wang et al. 2016) developed time series forecasting models based on intuitionistic fuzzy sets (IFS) (Atanassov 1986). Another kind of non-probabilistic non-determinism in fuzzy time series forecasting occurs when time series data can be fuzzified using multiple valid fuzzification methods. Since the difficulty of establishing a common membership grade is not due to a margin of error or a distribution of possible values, this non-determinism cannot be handled using IFS or type-2 fuzzy sets. Torra and Narukawa (2009) and Torra (2010) introduced the hesitant fuzzy set (HFS) as a new generalization of fuzzy sets to address this particular non-determinism. HFS provides an effective tool to avoid compromising among the membership grades of a time series datum when it is fuzzified using multiple fuzzy sets. Bisht and Kumar (2016) proposed an HFS-based fuzzy time series forecasting model and claimed superior performance in financial time series forecasting. Recently, hesitant fuzzy linguistic sets have also been used by many researchers (Chen and Hong 2014; Lee and Chen 2015a, b; Joshi and Kumar 2018a, b; Joshi et al. 2018) in decision-making problems.

Probabilistic and non-probabilistic uncertainties are two conceptually different kinds of uncertainty that occur simultaneously in a system. One of the main advantages of fuzzy time series forecasting methods is their ability to handle non-stochastic uncertainty. However, these forecasting models are not able to handle stochastic uncertainty. Meghdadi (2001) introduced the probabilistic fuzzy set (PFS) to consider both uncertainties in a single framework. Because PFS combines the interpretability of fuzzy sets with statistical properties, Liu and Li (2005) proposed a probabilistic fuzzy logic system for modeling and control problems. Applications of PFS were explored by many researchers (Almeida et al. 2009; Hinojosa et al. 2011; Li and Huang 2012; Huang et al. 2012; Fialho et al. 2016) in various fields where probabilistic uncertainty plays a role as important as non-probabilistic uncertainty. Xu and Zhou (2017) associated probabilities with the elements of an HFS and defined the hesitant probabilistic fuzzy set (HPFS).

The motivation and contribution of this paper are to propose a novel time series forecasting method that includes both stochastic and non-stochastic uncertainties in a hesitant fuzzy environment. HPFS has found many applications in decision-making problems (Zhou and Xu 2017; Li and Wang 2017; Ding et al. 2017) because it includes both types of uncertainty, and we develop a novel time series forecasting method using HPFS for the same reason. One advantage of the proposed forecasting method is its ability to handle uncertainties caused by randomness and fuzziness simultaneously, while also allowing more than one fuzzification method to be used. Another advantage is that it addresses the non-statistical non-determinism (hesitation) that arises from the presence of multiple valid fuzzification methods for time series data. In this paper, non-determinism is included using two different methods of discretizing the universe of discourse, with equal and unequal length partitions. The HPFS is constructed using a probability distribution function that associates probabilities with the possible membership grades of time series data in multiple fuzzy sets. We propose an aggregation operator to aggregate the hesitant probabilistic fuzzy elements (HPFEs) using weights and immediate probabilities. The performance of the proposed forecasting method is tested using the benchmark problem of the University of Alabama enrolment data. As statistical uncertainty is an inherent characteristic of financial time series data, the performance of the proposed method is also tested on the financial time series of the SBI share price at BSE, India.

The rest of the paper is organized as follows. Basic definitions of fuzzy set, PFS, fuzzy time series, HFS and HPFS are presented in Sect. 2. In Sect. 3, we define the max–min composition operator and the aggregation operator, and include a few examples to illustrate the max–min composition operator and the aggregation process for HPFS. This section also includes the algorithm of the proposed HPFS-based time series forecasting method. The efficiency of the proposed forecasting method is tested using the datasets of University of Alabama enrolments and SBI share prices in Sect. 4. This section also compares the forecasted enrolments and SBI prices with those of a few other existing forecasting methods. Finally, conclusions are presented in Sect. 5.

2 Preliminaries

In this section, we review the definitions of fuzzy set (Zadeh 1965), PFS (Liu and Li 2005) and fuzzy time series (Song and Chissom 1993a, b; Chen 1996). Definitions of HFS (Torra and Narukawa 2009; Torra 2010), HPFS (Xu and Zhou 2017) are also reviewed in this section.

Definition 1

(Zadeh 1965) Let $U=\{ {u_1},{u_2},{u_3}, ... ,{u_n} \}$ be a finite and fixed universe of discourse. A fuzzy set A on $U=\{ {u_1},{u_2},{u_3}, ... ,{u_n} \}$ is defined as follows:

$$A\;=\;\left\{ {\left\langle {u,\;{\mu _A}(u)} \right\rangle \left| {\forall u \in U} \right.} \right\}$$

(1)

Here ${\mu _A}\;:\;U\; \to \;\left[ {0,\;1} \right]$ and ${\mu _A}(u)$ represents degree of membership of u in A.

Definition 2

(Liu and Li 2005) A probabilistic fuzzy set (PFS) $\tilde {A}$ for a variable $u \in U$ and its fuzzy membership grade $\mu \in [0,1]$ can be expressed by a probability space $({V_u},\wp ,P)$. Here ${V_u}$ is the set of all possible events $\{ \mu \in [0,1]\}$ and $\wp$ is a $\sigma$-field. The probability $P$ is defined on $\wp$ and, for all element events ${E_i}$ in ${V_u}$, satisfies the following conditions:

$$P({E_i}) \geqslant 0,\quad P\left( {\sum {{E_i}} } \right)=\sum {P({E_i})} ,\quad P({V_u})=1$$

(2)

Here ${E_i}$ corresponds to an event $\mu ={\mu _i} \subseteq [0,1]$ and $P({E_i})$ is the probability of the event ${E_i}$. $\tilde {A}$ can also be expressed as the union of finite sub probability spaces as follows:

$$\tilde {A}\; \equiv \;\bigcup\limits_{{u \in U}} {({V_u},\wp ,P)} .$$

(3)

Definition 3

(Song and Chissom 1993a, b; Chen 1996) Let $Y(t)\;(t=\ldots ,0,1,2,\ldots )$ be a subset of the real numbers. A fuzzy time series $F(t)$ on $Y(t)$ is a collection of fuzzy sets ${f_i}(t)\;(i=1,2,\ldots )$. If there exists a fuzzy relation $R(t-1,t)$ such that $F(t)=F(t-1) \circ R(t-1,t)$, where $\circ$ is the max–min composition operator, then the relation $F(t-1) \to F(t)$ indicates that $F(t)$ is caused only by $F(t-1)$. This is called the first-order model of the fuzzy time series $F(t)$. If $F(t)$ is caused by $F(t-1),F(t-2),\ldots ,F(t-n)$, then the fuzzy relationship is an $n$th-order fuzzy time series $F(t-n),\ldots ,F(t-2),F(t-1) \to F(t)$.

Definition 4

(Torra and Narukawa 2009; Torra 2010) Let $U=\{ {u_1},{u_2},{u_3}, ... ,{u_n} \}$ be a fixed set and ${h_A}\;:\;U\; \to P\;[0,\;1]$ be a function from U to the collection of subsets of $[0,\;1]$. An HFS H on U is a mathematical object of following form:

$$H\;=\;\left\{ {\left\langle {u,\;{h_A}(u)} \right\rangle \left| {\forall u \in U} \right.} \right\}$$

(4)

Here ${h_A}(u)$ is a collection of membership degrees in $[0,\;1]$ of an element $u \in U$ to the set $H$. The elements of an HFS are called hesitant fuzzy elements (HFEs). The basic operations of complement, union and intersection on HFEs are defined as follows:

${h_1}^{c}\;=\;\left\{ {1 - \gamma \left| {\gamma \in {h_1}} \right.} \right\}$
${h_1} \cup {h_2}\;=\;\left\{ {{\gamma _1} \vee {\gamma _2}\left| {{\gamma _1} \in {h_1},\;} \right.{\gamma _2} \in {h_2}} \right\}$
${h_1} \cap {h_2}\;=\;\left\{ {{\gamma _1} \wedge {\gamma _2}\left| {{\gamma _1} \in {h_1},\;} \right.{\gamma _2} \in {h_2}} \right\}$

Here $\vee$ and $\wedge$ are max and min operators.
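The three HFE operations can be sketched in Python as follows. This is a minimal illustration, assuming an HFE is stored as a plain set of floats; the function names and the sample values are hypothetical and do not come from the paper.

```python
from itertools import product


def hfe_complement(h):
    """Complement of an HFE: {1 - g | g in h} (rounded to suppress float noise)."""
    return {round(1 - g, 10) for g in h}


def hfe_union(h1, h2):
    """Union of two HFEs: pairwise maximum over all combinations."""
    return {max(g1, g2) for g1, g2 in product(h1, h2)}


def hfe_intersection(h1, h2):
    """Intersection of two HFEs: pairwise minimum over all combinations."""
    return {min(g1, g2) for g1, g2 in product(h1, h2)}


# Hypothetical HFEs used only for illustration.
h1 = {0.2, 0.5}
h2 = {0.3, 0.6}

print(sorted(hfe_complement(h1)))
print(sorted(hfe_union(h1, h2)))
print(sorted(hfe_intersection(h1, h2)))
```

Because the results are sets, duplicate values produced by different pairs collapse into a single membership degree.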

Definition 5

(Xu and Zhou 2017) Let $R$ be a fixed set. An HPFS on $R$ is expressed as ${H_p}=\left\{ {\left\langle {\bar {h}({\gamma _i}/{p_i})} \right\rangle /{\gamma _i},\;{p_i}} \right\}$, where $\bar {h}({\gamma _i}/{p_i})$ is a set of elements of the form ${\gamma _i}/{p_i}$ with ${\gamma _i} \in R,\;0 \leqslant {\gamma _i} \leqslant 1,\;i=1,2,\ldots ,\# \bar {h}$, and $\# \bar {h}$ is the number of possible elements in $\bar {h}({\gamma _i}/{p_i})$. Here ${p_i} \in [0,1]$ is the hesitant probability of ${\gamma _i}$ and $\sum\nolimits_{{i=1}}^{{\# \bar {h}}} {{p_i}} =1$.

3 Proposed hesitant probabilistic fuzzy time series forecasting method

In the proposed forecasting method, time series data are fuzzified using two different fuzzification methods to construct an HFS. The HFS is converted into an HPFS using a suitable probability distribution function. The proposed method uses a novel aggregation operator based on immediate probabilities to aggregate HPFEs into a fuzzy set, and a max–min composition operator on fuzzy logical relations. Both operators are defined as follows:

3.1 Max–min composition operator for fuzzy set

Let ${R_1}$ and ${R_2}$ be two fuzzy relations on fuzzy sets ${A_1}$, ${A_2}$, ${A_3}$ such that ${R_1} \subseteq {A_1} \times {A_2}$ and ${R_2} \subseteq {A_2} \times {A_3}$. The max–min composition (${R_1} \circ {R_2}$) of the two relations ${R_1}$ and ${R_2}$ is expressed by the relation from ${A_1}$ to ${A_3}$ as follows:

For $(a,\,b) \in {A_1} \times {A_2},\,(b,\,c) \in {A_2} \times {A_3}$

$$\begin{aligned} {\mu _{{R_1} \circ {R_2}}}(a,\,c) &= \max\limits_{b} \left[ {\min ({\mu _{{R_1}}}(a,\,b),\,{\mu _{{R_2}}}(b,\,c))} \right] \\ &= \mathop \vee \limits_{b} \left[ {{\mu _{{R_1}}}(a,\,b) \wedge {\mu _{{R_2}}}(b,\,c)} \right] \end{aligned}$$

(5)

Here $\vee ,\; \wedge$ are maximum and minimum operations, respectively.

Max–min composition operation is illustrated by following example.

Example 1

If

$${R_1}={\left[ {\begin{array}{ccc} {0.21}&{0.51}&{0.19} \\ {0.37}&{0.92}&{0.76} \\ {0.71}&{0.97}&{0.39} \\ {0.91}&{0.42}&{0.81} \end{array}} \right]_{4 \times 3}}\quad {\text{and}}\quad {R_2}={\left[ {\begin{array}{ccc} {0.29}&{0.97}&{0.86} \\ {0.35}&{0.75}&{0.49} \\ {0.17}&{0.29}&{0.68} \end{array}} \right]_{3 \times 3}}$$

are two fuzzy relations, then ${R_1} \circ {R_2}=\left[ {{c_{ij}}} \right]$ $(i=1,2,3,4\;{\text{and}}\;j=1,2,3)$, that is,

$${R_1} \circ {R_2}={\left[ {\begin{array}{ccc} {{c_{11}}}&{{c_{12}}}&{{c_{13}}} \\ {{c_{21}}}&{{c_{22}}}&{{c_{23}}} \\ {{c_{31}}}&{{c_{32}}}&{{c_{33}}} \\ {{c_{41}}}&{{c_{42}}}&{{c_{43}}} \end{array}} \right]_{4 \times 3}}$$

$$\begin{aligned} {c_{11}} &= \max [\min (0.21,\,0.29),\;\min (0.51,\,0.35),\;\min (0.19,\,0.17)] \\ &= \max [0.21,\,0.35,\,0.17]=0.35 \end{aligned}$$
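The remaining entries of ${R_1} \circ {R_2}$ follow the same pattern. The sketch below computes the whole composition in plain Python with the matrices of Example 1; the function name `max_min_composition` is a hypothetical helper, not part of the paper.

```python
def max_min_composition(r1, r2):
    """Max-min composition: c[i][j] = max over k of min(r1[i][k], r2[k][j])."""
    inner = len(r2)
    if any(len(row) != inner for row in r1):
        raise ValueError("column count of r1 must equal row count of r2")
    cols = len(r2[0])
    return [
        [max(min(row[k], r2[k][j]) for k in range(inner)) for j in range(cols)]
        for row in r1
    ]


# Fuzzy relations from Example 1.
R1 = [
    [0.21, 0.51, 0.19],
    [0.37, 0.92, 0.76],
    [0.71, 0.97, 0.39],
    [0.91, 0.42, 0.81],
]
R2 = [
    [0.29, 0.97, 0.86],
    [0.35, 0.75, 0.49],
    [0.17, 0.29, 0.68],
]

C = max_min_composition(R1, R2)
for row in C:
    print(row)
```

The first printed entry is the value $c_{11}=0.35$ derived by hand in Example 1.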

3.2 Aggregation operator

Let ${H_p}$ be an HPFS on reference set $U$ and let ${h_{{H_p}}}:U \to P[0,1]$ be a function that determines HPFEs. A mapping $O:P[0,1] \to [0,1]$, defined as follows, is an aggregation operator that gives a fuzzy set ${H_{{A_i}}}=\left\{ {\left\langle {u,\;O({h_{{H_p}}}(u))} \right\rangle \left| {\forall u \in U} \right.} \right\}$:

$$O\left\{ {{\mu _{{A_i}}}} \right\}=\frac{{\prod\nolimits_{{i=1}}^{n} {{{(1+(1 - {\omega _i}){\mu _{{A_i}}})}^{{{\hat {p}}_{{A_i}}}}}} - \prod\nolimits_{{i=1}}^{n} {{{(1 - {\mu _{{A_i}}})}^{{{\hat {p}}_{{A_i}}}}}} }}{{\prod\nolimits_{{i=1}}^{n} {{{(1+(1 - {\omega _i}){\mu _{{A_i}}})}^{{{\hat {p}}_{{A_i}}}}}} +(1 - {\omega _{im}})\prod\nolimits_{{i=1}}^{n} {{{(1 - {\mu _{{A_i}}})}^{{{\hat {p}}_{{A_i}}}}}} }}$$

(6)

Here ${\hat {p}_{{A_i}}}$ and ${\omega _i}$ are immediate probability (Yager et al. 1995) and weights of the membership grades ${\mu _{{A_i}}}$ and are defined as follows:

$${\hat {p}_{{A_i}}}\;=\;\frac{{{\omega _i}.{p_{{A_i}}}({\mu _{{A_i}}})}}{{\sum\nolimits_{{i=1}}^{n} {{\omega _i}.{p_{{A_i}}}({\mu _{{A_i}}})} }}\,$$

(7)

$${\omega _{\text{i}}}=\;\frac{{{d_i}}}{{\sum\nolimits_{{i=1}}^{n} {{d_i}} }}$$

(8)

The immediate probabilities and the weights of the membership grades satisfy $\sum\nolimits_{{i=1}}^{n} {{{\hat {p}}_{{A_i}}}} =1$ and $\sum\nolimits_{{i=1}}^{n} {{\omega _i}} =1$. ${\omega _{im}}$ is the weight of the maximum membership grade among the ${\mu _{{A_i}}}$. ${p_{{A_i}}}({\mu _{{A_i}}})$ is the probability of the membership grade ${\mu _{{A_i}}}$ and is calculated using the following Gaussian probability distribution function (Huang et al. 2012):

$${p_{{A_i}}}({\mu _{{A_i}}})=\frac{{{\xi _i}}}{{\sqrt {2\pi } {\varsigma _i}}}\left( {{e^{ - \frac{{{{({u_k} - ({\mu _{{A_i}}} - 1){\xi _i} - {m_i})}^2}}}{{2{\varsigma _i}^{2}}}}}+{e^{ - \frac{{{{({u_k} - (1 - {\mu _{{A_i}}}){\xi _i} - {m_i})}^2}}}{{2{\varsigma _i}^{2}}}}}} \right),\quad (i=1,2,\ldots ,n)$$

(9)

Here ${\mu _{{A_i}}}$, ${\xi _i}$, ${\varsigma _i}$ and ${m_i}$ are the membership grades, width, standard deviation, and mean of the fuzzy sets ${A_i}$, respectively.

The aggregation operator (Eq. 6) satisfies the following property:

$$\min \{ {\mu _{{A_i}}}\} \leqslant O\{ {\mu _{{A_i}}}\} \leqslant \max \{ {\mu _{{A_i}}}\} ;\quad \forall {\mu _{{A_i}}} \in [0,1]$$

The following example illustrates the construction of an HPFS and the aggregation of HPFEs using the proposed aggregation operator.

Example 2

Let $H=\left\{ {\left\langle {1219,\{ 0.429,0.473\} } \right\rangle ,\left\langle {1123,\{ 0.739,0.774\} } \right\rangle } \right\}$ be an HFS on reference set $U=\{ 1219,1123\}$, constructed using two fuzzy sets ${A_1}=[732,1042,1352]$ and ${A_2}=[732,1051,1370]$. Using the weights ${\omega _1}=0.493$ and ${\omega _2}=0.507$, the standard deviation of (732, 1042, 1352) and Eq. (9), the probability of ${\mu _{{A_1}}}(1219)=0.429$ is calculated as follows:

$${p_{{A_1}}}(0.429)=\frac{{310}}{{\sqrt {2 \times 3.14} \times 253.11}}\left( {{{\text{e}}^{ - \frac{{{{\left( {1219 - (0.429 - 1) \times 310 - 1042} \right)}^2}}}{{2 \times 64066.67}}}}+{{\text{e}}^{ - \frac{{{{\left( {1219 - (1 - 0.429) \times 310 - 1042} \right)}^2}}}{{2 \times 64066.67}}}}} \right)=0.673$$

Similarly, other probabilities of membership grades are calculated and following HPFS is obtained.

$$H_{p}=\left\{ {\left\langle {1219,\{ 0.429(0.673), 0.473(0.686)\} } \right\rangle , \left\langle {1123, \{ 0.739(0.897), 0.774(0.908)\} } \right\rangle } \right\}$$
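The probability calculation for ${\mu _{{A_1}}}(1219)=0.429$ can be checked with a short Python sketch. It rests on several assumptions that are not stated explicitly in the text: the exponent is taken as the squared deviation of a Gaussian density, the width ${\xi _1}$ is the half-base of the triangle (310), and the standard deviation is the population standard deviation of the three points defining ${A_1}$. The function name is hypothetical.

```python
import math
import statistics


def membership_probability(u_k, mu, width, mean, std):
    """Probability of a membership grade, Gaussian form with squared deviations (assumed)."""
    scale = width / (math.sqrt(2 * math.pi) * std)
    left = u_k - (mu - 1) * width - mean
    right = u_k - (1 - mu) * width - mean
    two_var = 2 * std ** 2
    return scale * (math.exp(-left ** 2 / two_var) + math.exp(-right ** 2 / two_var))


# Fuzzy set A1 = [732, 1042, 1352] from Example 2.
a1 = (732, 1042, 1352)
std_a1 = statistics.pstdev(a1)   # population standard deviation, about 253.11
width_a1 = a1[1] - a1[0]         # half-base of the triangle, 310 (assumption)

p = membership_probability(u_k=1219, mu=0.429, width=width_a1, mean=a1[1], std=std_a1)
print(round(p, 2))
```

Under these assumptions the result is roughly 0.67, close to the 0.673 reported in Example 2; the small gap is consistent with rounding $\pi$ to 3.14 in the hand calculation.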

Using Eq. (7), the corresponding immediate probabilities of the membership grades are computed as follows:

$${\hat {p}_{{A_1}}}(0.429) =\frac{{0.493 \times 0.673}}{{(0.493 \times 0.673+0.507 \times 0.686)}}=0.488\quad {\text{and}} \quad {\hat {p}_{{A_1}}}(0.473) =\frac{{0.507 \times 0.686}}{{(0.493 \times 0.673+0.507 \times 0.686)}}=0.512,$$

$${\hat {p}_{{A_2}}}(0.739) =\frac{{0.493 \times 0.897}}{{(0.493 \times 0.897+0.507 \times 0.908)}}=0.49\quad {\text{and}}\quad {\hat {p}_{{A_2}}}(0.774) =\frac{{0.507 \times 0.908}}{{(0.493 \times 0.897+0.507 \times 0.908)}}=0.51$$
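Equations (7) and (8) can be sketched in Python as follows. The interval lengths 620 and 638 are an assumption: they are the base lengths of ${A_1}$ and ${A_2}$, chosen because they reproduce the weights 0.493 and 0.507 quoted in Example 2. The function names are hypothetical.

```python
def interval_weights(lengths):
    """Eq. (8): each weight is the interval length divided by the total length."""
    total = sum(lengths)
    return [d / total for d in lengths]


def immediate_probabilities(weights, probabilities):
    """Eq. (7): normalise the products weight * probability so they sum to 1."""
    products = [w * p for w, p in zip(weights, probabilities)]
    total = sum(products)
    return [x / total for x in products]


# Assumed interval lengths: 1352 - 732 = 620 and 1370 - 732 = 638.
weights = interval_weights([620, 638])
print([round(w, 3) for w in weights])

# Rounded weights and probabilities exactly as quoted in Example 2.
imm_1219 = immediate_probabilities([0.493, 0.507], [0.673, 0.686])
imm_1123 = immediate_probabilities([0.493, 0.507], [0.897, 0.908])
print([round(p, 3) for p in imm_1219])
print([round(p, 3) for p in imm_1123])
```

By construction each list of immediate probabilities sums to 1, which matches the condition stated after Eq. (8).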

Using the weights, the immediate probabilities of the membership grades and Eq. (6), the HPFEs are aggregated to ${H_{{A_1}}}$ and ${H_{{A_2}}}$ as follows:

$${H_{{A_1}}}=\frac{{{{(1+(1 - 0.493) \times 0.429)}^{0.488}} \times {{(1+(1 - 0.507) \times 0.473)}^{0.512}} - {{(1 - 0.429)}^{0.488}} \times {{(1 - 0.473)}^{0.512}}}}{{{{(1+(1 - 0.493) \times 0.429)}^{0.488}} \times {{(1+(1 - 0.507) \times 0.473)}^{0.512}}+(1 - 0.507)\,{{(1 - 0.429)}^{0.488}} \times {{(1 - 0.473)}^{0.512}}}}=0.453$$

$${H_{{A_2}}}=\frac{{{{(1+(1 - 0.493) \times 0.739)}^{0.49}} \times {{(1+(1 - 0.507) \times 0.774)}^{0.51}} - {{(1 - 0.739)}^{0.49}} \times {{(1 - 0.774)}^{0.51}}}}{{{{(1+(1 - 0.493) \times 0.739)}^{0.49}} \times {{(1+(1 - 0.507) \times 0.774)}^{0.51}}+(1 - 0.507)\,{{(1 - 0.739)}^{0.49}} \times {{(1 - 0.774)}^{0.51}}}}=0.758$$

Finally, the aggregated fuzzy set ${H_A}=\left\{ {\left\langle {1219,0.453} \right\rangle ,\left\langle {1123,0.758} \right\rangle } \right\}$ is obtained.
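The aggregation in Example 2 can be reproduced with the following Python sketch of Eq. (6). The function name `aggregate_hpfe` is hypothetical, and ${\omega _{im}}$ is read here as the weight attached to the largest membership grade, which is the reading that matches the numbers in the example.

```python
import math


def aggregate_hpfe(grades, weights, imm_probs):
    """Eq. (6): aggregate the membership grades of one HPFE into a single grade."""
    # Weight attached to the largest membership grade (omega_im).
    w_max = weights[max(range(len(grades)), key=grades.__getitem__)]
    plus = math.prod(
        (1 + (1 - w) * mu) ** p for mu, w, p in zip(grades, weights, imm_probs)
    )
    minus = math.prod((1 - mu) ** p for mu, p in zip(grades, imm_probs))
    return (plus - minus) / (plus + (1 - w_max) * minus)


weights = [0.493, 0.507]

# Membership grades and immediate probabilities quoted in Example 2.
h_1219 = aggregate_hpfe([0.429, 0.473], weights, [0.488, 0.512])
h_1123 = aggregate_hpfe([0.739, 0.774], weights, [0.49, 0.51])

print(round(h_1219, 3), round(h_1123, 3))
```

Both aggregated grades fall between the smallest and largest input grade, in line with the boundedness property stated for Eq. (6).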

3.3 Algorithm of proposed HPFS-based time series forecasting method

The proposed HPFS-based time series forecasting method uses the following algorithm.

Algorithm for proposed HPFS-based time series forecasting method

1. Define the universe of discourse and include hesitancy by constructing different types of fuzzy sets to fuzzify the time series data.
2. Assign weights to the membership grades of the time series data in the different types of fuzzy sets.
3. Assign probabilities to the membership grades using the Gaussian probability distribution function.
4. Calculate the immediate probabilities of the membership grades and determine the aggregated fuzzy set using the aggregation operator.
5. Use max–min operations on the FLRs to obtain fuzzified outputs and defuzzify them into numerical forecasts using the centroid average formula.

Each step of the above algorithm is described in detail as follows:

Step 1: Define the universe of discourse as $U=[{D_{\min }} - \sigma ,\;{D_{\max }}+\sigma ]$, where ${D_{\min }}$ and ${D_{\max }}$ are the minimum and maximum observed values and $\sigma$ is the standard deviation of the data. Fuzzify the time series data using more than one valid fuzzification method. In this paper, the universe of discourse is partitioned into equal and unequal intervals, and the lengths of the unequal intervals are determined using the CPDA approach. The collection $\left\langle {u,\,{\mu _1}(u),{\mu _2}(u)} \right\rangle$ is an HFE, where ${\mu _1}(u)$ and ${\mu _2}(u)$ are the membership grades of a time series datum $u$ in the fuzzy sets with equal intervals ${A_{{e_i}}}$ and unequal intervals ${A_{u{e_i}}}$, respectively.
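Step 1 can be sketched in Python as follows. This is only an illustration under stated assumptions: the fuzzy sets are taken to be triangular (as Step 2 suggests), the population standard deviation is used for $\sigma$ because the text does not say which estimator is meant, the interval partitioning itself (including CPDA) is not implemented, and the sample data passed to `universe_of_discourse` are hypothetical. The two triangles are the fuzzy sets ${A_1}$ and ${A_2}$ of Example 2.

```python
import statistics


def universe_of_discourse(data):
    """U = [D_min - sigma, D_max + sigma]; sigma is the population std (assumption)."""
    sigma = statistics.pstdev(data)
    return min(data) - sigma, max(data) + sigma


def triangular_membership(u, a, b, c):
    """Membership grade of u in the triangular fuzzy set with corners a <= b <= c."""
    if u <= a or u >= c:
        return 0.0
    if u <= b:
        return (u - a) / (b - a)
    return (c - u) / (c - b)


def hesitant_element(u, fuzzy_sets):
    """Collect the membership grades of u in several fuzzy sets as one HFE."""
    return [round(triangular_membership(u, *fs), 3) for fs in fuzzy_sets]


# Fuzzy sets A1 and A2 from Example 2.
fuzzy_sets = [(732, 1042, 1352), (732, 1051, 1370)]
print(hesitant_element(1219, fuzzy_sets))
print(hesitant_element(1123, fuzzy_sets))

# Hypothetical data, used only to show the shape of the universe of discourse.
low, high = universe_of_discourse([10, 20, 30])
print(low, high)
```

The two printed HFEs agree with the membership grades listed for 1219 and 1123 in Example 2.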

Step 2: Compute the weights ${\omega _i}$ of the triangular membership functions using the following expression.

$${\omega _i}\;=\;\frac{{{d_i}}}{{\sum\nolimits_{{i=1}}^{n} {{d_i}} }}$$

Here ${d_i}$ is the length of the interval corresponding to fuzzy set ${A_i}$.

Step 3: Take the membership grade as a random variable and use the following probability distribution function (Huang et al. 2012) to associate probabilities.

$${p_{{A_i}}}({\mu _{{A_i}}})=\frac{{{\xi _i}}}{{\sqrt {2\pi } {\varsigma _i}}}\left( {{{\text{e}}^{ - \frac{{{{({u_k} - ({\mu _{{A_i}}} - 1){\xi _i} - {m_i})}^2}}}{{2{\varsigma _i}^{2}}}}}+{{\text{e}}^{ - \frac{{{{({u_k} - (1 - {\mu _{{A_i}}}){\xi _i} - {m_i})}^2}}}{{2{\varsigma _i}^{2}}}}}} \right),\quad (i=1,2,\ldots ,n)$$

Here ${\mu _{{A_i}}}$, ${\xi _i}$, ${\varsigma _i}$ and ${m_i}$ are, respectively, the membership grades, width, standard deviation and mean of the data that lie in fuzzy sets ${A_i}$.

Step 4: Compute the immediate probability ${\hat {p}_{{A_i}}}$ of the membership grades for fuzzy sets ${A_i}$ using the following expression.

$${\hat {p}_{{A_i}}}\;=\;\frac{{{\omega _i}.{p_{{A_i}}}({\mu _{{A_i}}})}}{{\sum\nolimits_{{i=1}}^{n} {{\omega _i}.{p_{{A_i}}}({\mu _{{A_i}}})} }}$$

Apply aggregation operator (Eq. 6) to have aggregated fuzzy set. Time series data is again fuzzified using following simple algorithm.

Step 5: Fuzzy logical relations (FLRs) are defined on the fuzzy sets obtained by aggregating the HPFEs and are denoted as ${H_{{A_i}}} \to {H_{{A_j}}}$, where ${H_{{A_i}}}$ is the fuzzy production of year $n$ (the current state) and ${H_{{A_j}}}$ is the fuzzy production of year $n+1$ (the next state).

Use max–min operations (Eq. 5) on the FLRs to obtain fuzzified outputs and defuzzify them into numerical forecasts using the following centroid average formula:

$${\text{Numerical forecast}} =\frac{{\sum\nolimits_{{i=1}}^{n} {{f_i}{c_i}} }}{{\sum\nolimits_{{i=1}}^{n} {{f_i}} }}$$

(10)

Here ${f_i}$ is fuzzified output and ${c_i}$ is average of centroids for equal and unequal intervals.
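A minimal Python sketch of the centroid average in Eq. (10) is given below. The fuzzified outputs and centroid values are hypothetical numbers chosen only to make the arithmetic easy to follow; in the method they would come from the max–min step and from the equal and unequal interval partitions.

```python
def centroid_average(fuzzified_output, centroids):
    """Eq. (10): weighted average of the centroids, weighted by the fuzzified output."""
    total = sum(fuzzified_output)
    if total == 0:
        raise ValueError("fuzzified output sums to zero; forecast is undefined")
    return sum(f * c for f, c in zip(fuzzified_output, centroids)) / total


# Hypothetical fuzzified output f_i and averaged centroids c_i.
f = [0.0, 0.5, 0.5]
c = [100.0, 200.0, 300.0]

forecast = centroid_average(f, c)
print(forecast)
```

With equal weight on the second and third centroids, the forecast lands midway between them.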

Root mean square error (RMSE) and average forecasting error (AFE) are the common error measures in fuzzy time series forecasting. RMSE, AFE, the correlation coefficient and the coefficient of determination are used to evaluate the performance of the forecasting model. These measures are defined as follows:

$${\text{RMSE}}= \sqrt {\frac{{\sum\nolimits_{{i=1}}^{n} {{{({O_i}-{F_i})}^2}} }}{n}}$$

(11)

$${\text{Forecasting error}} ({\text{in}} \% ) =\;\frac{{\left| {{F_i}-{O_i}} \right|}}{{{O_i}}} \times 100$$

(12)

$${\text{AFE}} ({\text{in}} \% ) =\frac{{{\text{sum of forecasting error}}}}{n}$$

(13)

$${\text{Coefficient of correlation}} (R)=\frac{{n\sum {{O_i}{F_i}\;-\;\left( {\sum {{O_i}} } \right)\left( {\sum {{F_i}} } \right)} }}{{\sqrt {n\left( {\sum {{O_i}^{2}} } \right)-{{\left( {\sum {{O_i}} } \right)}^2}} \sqrt {n\left( {\sum {{F_i}^{2}} } \right)-{{\left( {\sum {{F_i}} } \right)}^2}} }}$$

(14)

$${\text{Coefficient of determination}} = {R^2}$$

(15)

Here ${O_i}$ and ${F_i}$ denote the observed and forecasted time series data, $n$ is the number of data points and $\sigma$ is the standard deviation of the given data. Positive and negative values of $R$ indicate positive and negative linear correlation, respectively, between the forecasted and observed time series data. $R$ lies between −1 and 1, and ${R^2}$ is non-negative.
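The measures in Eqs. (11) to (15) can be sketched in Python as follows. The observed and forecasted values are hypothetical and are not taken from the enrolment or SBI data; the function names are illustrative.

```python
import math


def rmse(observed, forecast):
    """Eq. (11): root mean square error."""
    n = len(observed)
    return math.sqrt(sum((o - f) ** 2 for o, f in zip(observed, forecast)) / n)


def afe(observed, forecast):
    """Eqs. (12)-(13): mean of the percentage forecasting errors."""
    errors = [abs(f - o) / o * 100 for o, f in zip(observed, forecast)]
    return sum(errors) / len(errors)


def correlation(observed, forecast):
    """Eq. (14): coefficient of correlation R."""
    n = len(observed)
    so, sf = sum(observed), sum(forecast)
    sof = sum(o * f for o, f in zip(observed, forecast))
    soo = sum(o * o for o in observed)
    sff = sum(f * f for f in forecast)
    return (n * sof - so * sf) / (
        math.sqrt(n * soo - so ** 2) * math.sqrt(n * sff - sf ** 2)
    )


# Hypothetical observed and forecasted series.
observed = [100.0, 200.0, 300.0, 400.0]
forecast = [110.0, 190.0, 310.0, 390.0]

r = correlation(observed, forecast)
print(rmse(observed, forecast), afe(observed, forecast), r, r ** 2)
```

Every forecast here is off by exactly 10 units, so the RMSE is 10, while the AFE is smaller for the larger observations because each error is scaled by its observed value.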

4 Experimental results

In this section, we apply the proposed fuzzy time series forecasting method in the hesitant probabilistic fuzzy environment to forecast the enrolments of the University of Alabama (Song and Chissom 1993b) and the SBI share prices at BSE, India (Joshi and Kumar 2012a).

4.1 Forecasting historical enrolments data with proposed method

In this subsection, the proposed HPFS-based forecasting method is implemented on the historical enrolment data of the University of Alabama (Table 1).

Table 1 Historical enrolments data of University of Alabama
