Hybrid wavelet–artificial intelligence models in meteorological drought estimation

Taylan, Emine Dilek; Terzi, Özlem; Baykal, Tahsin

doi:10.1007/s12040-020-01488-9

Hybrid wavelet–artificial intelligence models in meteorological drought estimation

Published: 13 February 2021

Volume 130, article number 38, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Earth System Science Aims and scope Submit manuscript

Hybrid wavelet–artificial intelligence models in meteorological drought estimation

Download PDF

551 Accesses
7 Citations
7 Altmetric
1 Mention
Explore all metrics

Abstract

In this study, wavelet transform (W), which is one of the data pre-processing techniques, adaptive neural-based fuzzy inference system (ANFIS), support vector machine (SVM) and artificial neural networks (ANNs) were used to develop the drought estimation models of Çanakkale, Turkey. For these models, 3-, 6-, 9- and 12-months drought indices were calculated by standard precipitation index (SPI) and by using precipitation data of Çanakkale, Gökçeada and Bozcaada stations between 1975 and 2010 years. Firstly, ANFIS, SVM and ANNs models were developed to estimate calculated drought indices. Then SPI values of Gökçeada and Bozcaada stations were divided into sub-series by wavelet transform technique and these sub-series were used as input in W-ANFIS, W-SVM and W-ANNs models. When the developed models were compared, it was determined that the hybrid models developed by using preprocessing technique performed better. Among these models, it was observed that the W-ANFIS model gave the best results for 6-months period.

Research Highlights

Calculating of 3-, 6-, 9- and 12- months meteorological drought index with SPI
Developing ANFIS, SVM and ANNs drought models using SPI values
Decomposition of SPI values into sub-series by wavelet transform technique and developing hybrid drought models (W-ANFIS, W-SVM and W-ANNs) using subseries of SPI values
Comparing ANFIS, SVM and ANNs models with hybrid models
Obtaining appropriate results with hybrid models in meteorological drought estimation

Predicting hydrological droughts using ERA 5 reanalysis data and wavelet-based soft computing techniques

Article 30 November 2023

Hybrid wavelet packet machine learning approaches for drought modeling

Article 12 May 2020

Application of Hybrid ANN Techniques for Drought Forecasting in the Semi-Arid Region of India

Article 24 August 2023

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Drought has the most extensive impact among natural disasters occurring as a result of meteorological events. Drought is a natural phenomenon which causes water resources to be adversely affected and the hydrological balance deteriorates when precipitation falls significantly below normal. Meteorological drought has serious effects on agriculture, water resources and socio-economic conditions. Precise estimation of drought is important to reduce environmental damage. The standard precipitation index (SPI) which is one of the meteorological drought indices and developed by McKee et al. (1993) is the commonly used index which only needs precipitation data (Wang et al. 2015; Zarch et al. 2015; Paulo et al. 2016). However, it is seen that artificial intelligence methods, which have been frequently used to estimate hydrologic parameters in recent years, also give successful results in drought studies (Tripathi et al. 2006; Bacanli et al. 2009; Chen et al. 2010; Gocić et al. 2015; Tigkas et al. 2015). Mokhtarzad et al. (2017) developed artificial neural networks (ANNs), adaptive neural-based fuzzy inference system (ANFIS) and support vector machine (SVM) models to predict drought for Bojnourd meteorology station. They used temperature, humidity and seasonal precipitation values as input parameters and 3 months SPI values as output parameter for modelling. As a result of the developed models, they stated that SVM model gave better results than ANN and ANFIS models. Choubin et al. (2014) used eight climate indices, neuro-fuzzy model and SPI for forecasting drought conditions in Iran. They used the principal component analysis to identify these indices and the indices account for 81% of the variance. They suggested that neuro-fuzzy model could be used for drought forecasts. Jalalkamali et al. (2015) extracted an optimum model to forecast meteorological drought using SPI for Yazd Province in Iran. For this aim, they used multilayer perceptron ANN, ANFIS, SVM and ARIMAX multivariate time series. ARIMAX models demonstrated a better performance especially for a 9-month period, especially. Nguyen et al. (2017) searched the applicability of ANFIS to forecast drought by using SPI and standardized precipitation evapotranspiration index (SPEI) in Vietnam. They stated that the efficiency of the ANFIS models developed for SPI/SPEI was similar and ANFIS model developed for SPEI had better performance than SPI for a 12-month period, particularly.

In addition, the use of wavelet transform (W), one of the data preprocessing techniques, has increased in cases where artificial intelligence methods are insufficient in estimation studies (Nourani et al. 2009; Singh 2011; Belayneh and Adamowski 2012; Kisi and Cimen 2012; Mehr et al. 2014; Nourani et al. 2015). Santos and Silva (2014) proposed wavelet-neural network (W-ANN) models to forecast daily streamflow for 1-, 3-, 5- and 7-days ahead. They said that the proposed models had significantly better results than the ANN model. Khan et al. (2018) used standard index of annual precipitation (SIAP) for meteorological drought and standardized water storage index (SWSI) for hydrological drought. They developed and compared ANN and W-ANN models to estimate monthly drought. The results of the comparisons displayed that W-ANN models gave higher correlation coefficients than ANN models. Shirmohammadi et al. (2013) investigated the ability of ANN, ANFIS, W-ANN and W-ANFIS models to forecast meteorological drought in Iran. They derived that wavelet transform could improve modelling of meteorological drought and ANFIS models better performed than ANN models. Fung et al. (2018) developed support vector regression (SVR), boosting-support vector regression (BS-SVR) and wavelet-boosting-support vector regression (W-BS-SVR) models for drought prediction in Malaysia. They calculated standard rain evapotranspiration index (SPEI) values for 1-, 3- and 6-months using monthly precipitation, average temperature and evapotranspiration. They used the calculated SPEI values as input parameter for SVR, BS-SVR and D-BS-SVR models. Comparing the results, they found that W-BS-SVR models showed better performance than the other two models.

The aim of this study is to investigate the effects of wavelet transform on the success of ANFIS, SVM and ANNs models in meteorological drought estimation. For this purpose, the W-ANFIS, W-SVM and W-ANN models were developed and compared to the ANFIS, SVM and ANNs models in Çanakkale Province, Turkey.

2 Study region and data

Located in the northwestern part of Turkey, the study area covers the Çanakkale province, Gökçeada and Bozcaada islands and also the Dardanelles, which is an internationally important strait connecting Europe with Asia and also connecting the Sea of Marmara with the Aegean Sea.

The Çanakkale province is located between 25°35′–27°45′E and 39°30′–40°45′N and has an area of 9,737 km² (figure 1). Çanakkale has semi-humidity climatic conditions. Çanakkale has a total annual rainfall of 591.5 mm per year. The maximum amount of rainfall measured in 24 hrs was 137.8 mm (November 5th, 1956). For long years, the average temperature has been 15.0°C and the temperature has an increasing trend. The daily maximum temperature measured was 39.0°C (July 30th, 2007) whereas the daily minimum temperature measured was –11.8°C (February 14th, 2004) up to present. The average wind speed is 3.9 m/s in Çanakkale. The maximum wind speed measured was 139.3 km/hr (February 15th, 1991) up to present (GDTM 2017).

Besides the city of Çanakkale, the study area covers also two substantial islands of Turkey, which are Gökçeada and Bozcaada. Gökçeada, the largest island of Turkey, has a surface area of 289 km² and has a rough terrain. The island has a dam and three ponds. Bozcaada is the 3rd largest island of Turkey. The area of the island is 40 km² and the distance to the mainland is 6 km. There is Mediterranean climate in Bozcaada (GMKA 2017).

In this study, the precipitation data were used for three stations located in the Çanakkale province. These are Çanakkale, Gökçeada and Bozcaada stations. The precipitation data were obtained from the Turkish State Meteorological Service. Some statistical parameters of the precipitation data of these stations are given in table 1.

Table 1 The statistical parameters of the precipitation data.

Full size table

The homogeneity test was made to the precipitation data of Çanakkale, Bozcaada and Gökçeada stations. The double mass curve method was used for the homogeneity of the data of the three stations. The cumulative graphs were drawn using annual precipitation averages. When the changes in slope were examined, no breaks were observed in the slopes of three stations (figure 2).

3 Methods

3.1 Standardized precipitation index (SPI)

The standard precipitation index (SPI), which is a meteorological drought index and based on only precipitation data, was developed by McKee et al. (1993). SPI, which is obtained by dividing standard deviation of the difference of precipitation values from the mean value, has two advantages. Firstly, it is relatively easy to evaluate because it uses only precipitation data (Cacciamani et al. 2007; Belayneh et al. 2014). Secondly, SPI makes it possible to define drought on multiple time scales (Tsakiris and Vangelis 2004; Mishra and Desai 2006; Cacciamani et al. 2007). For drought analysis with SPI, at least 30 years of continuous data is required (Cacciamani et al. 2007; Belayneh et al. 2014). In the calculation of SPI, the long-term precipitation record for the desired period is fitted to the probability distribution and then converted to the normal distribution; therefore, the mean SPI value is zero (McKee et al. 1993; Edwards and McKee 1997). A positive value is obtained if the value of SPI is greater than the mean value (median for normal distribution) of the precipitation, and a negative value, if it is lower. As SPI is normalized, the wet and dry climates can be shown in the same way.

A drought event occurs when the index continuously reaches –1.0 value or less according to the SPI. Drought continues until the SPI value is zero (Tigkas et al. 2015). SPI drought categories are given in table 2.

Table 2 SPI categories (Tigkas et al. 2015).

Full size table

3.2 Discrete wavelet transform

The wavelet function ψ(t), named the mother wavelet, has shock characteristics and can quickly reduce to zero. $ \varPsi_{a,b}^{\left( t \right)} $ can be determined by compressing and expanding ψ(t), which is defined mathematically as $ \int_{ - \infty }^{ + \infty } \varPsi \left( t \right)dt = 0 $.

$$ \varPsi_{a,b}^{\left( t \right)} = \left| a \right|^{ - 1/2} \varPsi^{{\left( {\frac{t - b}{a}} \right)}} $$

(1)

if a and b are real numbers in equation (1), it can be stated as $ \varPsi_{a,b}^{\left( t \right)} $ is the consecutive wavelet, a is the periodicity factor and b is the time factor (Wang and Ding 2003; Tiwari and Chatterjee 2011).

In the wavelet transform, a function of variables a and b, parameter a can be defined as the expansion (a > 1) or contraction (a < 1) factor for different scales of the wavelet function. Parameter b can be defined as a shift of ψ(t). Continuous wavelet transforms of f(t) for f (t) ∈ L²(R) can be described as:

$$ w_{f} \left( {a,b} \right) = \left| a \right|^{ - 1/2} \int_{ - \infty }^{ + \infty } f \left( t \right)\varPsi^{*} \left( {\frac{t - b}{a}} \right)dt .$$

(2)

For the different scales of a and b, the relationships between signal and wavelet function are investigated by wavelet transform. From this, a scattering map with a wavelet coefficient $ w_{f} \left( {a,b} \right) $ is derived. Selecting the scales and positions as the forces of the two will make the analysis more accurate. Thus WMN can be obtained from equation (3) (Mallat 1989).

$$ \varPsi_{m,n} \left( {\frac{t - b}{a}} \right) = a_{0}^{{ -\, \frac{m}{2}}} \varPsi^{*} \left( {\frac{{t - nb_{0} a_{0}^{m} }}{{a_{0}^{m} }}} \right), $$

(3)

where m is the wavelet expansion and n is the wavelet translation. Parameters a₀ and b₀ are generally numbers, which are assumed to be 2 and 1, respectively (figure 3) (Tiwari and Chatterjee 2011).

The power-of-two logarithmic scaling of the translations and expansions are called a dyadic grid arrangement (Mallat 1989; Tiwari and Chatterjee 2011). $ w_{f} \left( {m,n} \right) $ occurs at a different time t, becomes as following equation,

$$ w_{f} \left( {m,n} \right) = 2^{ - m/2} \sum\limits_{t = 0}^{N - 1} f \left( t \right)\varPsi^{*} \left( {2^{ - m} i - n} \right) ,$$

(4)

where w = f(m, n) can be calculated as the wavelet coefficient with a = 2m and b = 2m. $ f\left( t \right) $ is a finite time series. n is the time translation parameter and is less than (2M–m–1). N is equal to the power of two and is calculated by N = 2M. However, 2m is the largest wavelet scale in the range < 1< m < M (when m = M).

Also, $ \bar{W} $ is the mean signal. Thus, the inverse discrete can be calculated as below:

$$ f\left( t \right) = \bar{W}\left( t \right) + \sum\limits_{m = 1}^{M} {W_{m} } \left( t \right),$$

(5)

where $ \bar{W}\left( t \right) $ is the approximation sub-signal and $ W_{m} \left( t \right) $ are detailed sub-signals (Tiwari and Chatterjee 2011).

Discrete wavelet transform processes with scaling function (low-pass filter) and wavelet function (high-pass filter). After the original data is processed in these functions, they are divided into approximation and detail components. Then, this procedure is repeated with successive approximations being decomposed in order (Tiwari and Chatterjee 2011).

3.3 Adaptive neural-based fuzzy inference system (ANFIS)

There are two critical processes in ANFIS proposed by Jang (1993). These are: (1) the determination of the rules and (2) the estimation of the parameters of the input and output membership functions by the backpropagation learning algorithm.

In the ANFIS structure, the fuzzy rule base is combined with artificial neural networks. Thus, the new fuzzy neural network is more advantageous in that organization of membership functions and the identification of fuzzy rules can be done automatically for a problem.

ANFIS aims to determine the parameters of the Sugeno type of fuzzy inference system using a hybrid learning model. The Sugeno fuzzy model was proposed by Takagi, Sugeno and Kang for a system approach that produces fuzzy rules from an input–output dataset. A typical rule structure in Sugeno fuzzy model consisting of two inputs x, y and one output z is in the following forms:

$$ \text{R}^{1}\!\! :{\text{If}}\; {x}\; {\text {is}}\; A_{1} \;{\text{AND}} \; {y}\;{\text {is}} \; {B_{1}} ,\;{\text{THEN}} \; {z} = f_{1} = p_{1} x + q_{1} y + r_{1} $$

(6)

$$ \text{R}^{2}\!\! :{\text{If}} \; {x} \; {\text {is}}\; {A_{2}} \;{\text{AND}}\; {y} \; {\text {is}}\; {B_{2}} ,\;{\text{THEN}}\; {z} = f_{2} = p_{2} x + q_{2} y + r_{2} $$

(7)

$$ \text{R}^{n}\!\! :{\text{If}} \; {x}\; {\text {is}}\; {A_{n}} \;{\text{AND}}\; {y}\; {\text {is}} \; {B_{n}} ,\;{\text{THEN}} \; {z} = f_{n} = p_{n} x + q_{n} y + r_{n} $$

(8)

where, A_i shows membership degree of inputs and the output level of each rule is weighted by w_i multiplied value which is power of the rule equation (9).

$$ w_{i} = f_{1} \left( {x_{1} } \right)*f_{2} \left( {x_{2} } \right). $$

(9)

Here, the output of each node gives the firing level of the rule to which it belongs. Then, firing strengths are normalized by the equation below,

$$ \overline{{w_{i} }} = \frac{{w_{i} }}{{w_{1} + w_{2} }}. $$

(10)

The latest total output value is calculated by equation (11)

$$ f = \mathop \sum \limits_{i} \overline{{w_{i} f_{i} }} = \frac{{\mathop \sum \nolimits_{i} w_{i} f_{i} }}{{\mathop \sum \nolimits_{i} w_{i} }} . $$

(11)

More detailed information can be obtained from Chang and Chang (2001).

3.4 Support vector machines (SVM)

Support vector machines (SVM) is a non-parametric classification method based on statistical learning theory (Vapnik 1995). SVM is widely used in different classification and regression problems with high-dimensional data sets and for the linearly separable case (Mountrakis et al. 2011; Kavzoglu et al. 2014). In artificial intelligence methods with overfitting problem, empirical risk minimization (ERM) theory is employed. SVM based on structural risk minimization (SRM) theory can display better performance than artificial intelligence. Unlike ERM, which minimizes error in training data, SRM minimizes the upper limit of expected risk. The other feature of SVM for determining the data structure is the transformation of the original data from the input space to a new feature space with the kernel function which is a new mathematical paradigm (Shahbazi et al. 2011). Therefore, to find the linear function ($ f\left( {x_{i} } \right) $),

$$ y_{i} = f\left( {x_{i} } \right) = w\phi_{i} \left( {x_{i} } \right) + b, $$

(12)

where $ \phi_{i} $ is a non-linear transformation function mapping input space into a higher dimension feature space, w_i represents a weight vector and b = bias. The linear function represents a non-linear relation between inputs $ (x_{i} ) $ and outputs $ (y_{i} ) $.

Firstly, SVM was developed for classification problems. Then, in recent years SVM has been based on regression. SVM applies regression by using an ε-sensitive loss function $ ||y - f\left( x \right)||_{\varepsilon } = \hbox{max} \left\{ {0,||y - f\left( x \right)|| - \varepsilon } \right\} $. This function is related to bigger errors than a certain threshold. SVM converts to the following forms:

$$ {\text{Minimize}}:\,\frac{1}{2}\left| {\left| w \right|} \right|^{2} + C\frac{1}{L}\left( {\sum\limits_{i}^{N} {\left( {\xi_{i} + \xi_{i}^{*} } \right)} } \right) $$

(13)

$$ {\text{Subject to}}\, \left\{ {\begin{array}{*{20}c} {w\phi \left( {x_{i} } \right) + b - y_{i} \le \varepsilon + \xi_{i} } \\ {y_{i} - w\phi \left( {x_{i} } \right) - b \le \varepsilon + \xi_{i}^{*} } {\xi_{i} ,\xi_{i}^{*} \ge 0} \\ {i = 1,2,3, \ldots ,N} \\ \end{array} } \right. $$

(14)

where L is the number of data points in the training dataset; C is model parameter; x_i is feature space data points; $ \xi_{i} $ and $ \xi_{i}^{*} $ are positive slack variables (Shahbazi et al. 2011).

3.5 Artificial neural networks (ANNs)

Artificial neural networks (ANNs) are defined as complex systems which are formed by interconnecting each other with different connection geometries of artificial neuron which are inspired by neurons in the brain. ANNs, which can be described as computing processes, can be likened to a black box that produces outputs for given inputs (Kohonen 1988).

An artificial neuron consists of five main parts: inputs, weights, summation function, transfer function and output. Inputs are information entering to a neuron from other neurons or external sources. The weights are values indicating the effect of another processing element on this processing element in the input set or in a previous layer. In figure 4, the effect of the input on neuron is determined by the weight. The summation function calculates the effect of all inputs by using weights on this process element. The function determines the net input of a neuron. Sum of net input (net) collected in neuron is obtained as:

$$ net = \sum\limits_{i = 1}^{n} {w_{ij} } x_{i} + b ,$$

(15)

where x_i is the input value of the i neuron; w_ij is weight coefficients, n is the total input numbers on neuron; b is the threshold value and $ \sum$ is summation function. The transfer function determines output of neuron by processing the net input obtained from the summation function. In general, the sigmoid function has been used as the transfer function (f (.)) in the multilayer perceptron model. The output of the neuron calculated using the sigmoid function is shown as follows:

$$ y_{i} = f\left( {net} \right) = \frac{1}{{1 + e^{ - net} }}. $$

(16)

Output (y_i) obtained from the neuron is transmitted to another neuron or as output of neural network (Oztemel 2003).

Artificial neural networks contain many neurons that are connected to each other. The connection of neurons is not random. In general, the network is formed in three layers by the neurons. The inputs are located in the input layer, while the outputs are obtained in the output layer. There are hidden layers between the input and output layers. Since the outputs of hidden layers cannot be observed directly, the numbers of hidden layers can be one or more (figure 5) (Kartalopoulos 1996). In figure 5, it shown that there are weighted connections between layers.

4 Results and discussion

Using precipitation data of Çanakkale, Bozcaada and Gökçeada stations located in the northwest of Turkey, the drought indices between the years 1975 and 2010 were determined by standardized precipitation index (SPI) method. These indices were used in developing various models with different artificial intelligence (AI) techniques, which are wavelet transform (W), adaptive neural-based fuzzy inference system (ANFIS), support vector machine (SVM) and artificial neural networks (ANNs) methods. The modelling stage consists of two parts: (1) modelling of SPI series generated according to historical records and (2) modelling of SPI series by using SPI subsets obtained by wavelet transform technique.

In the first part, the ANFIS, SVM and ANNs models were developed to estimate 3-, 6-, 9- and 12-months SPI series of Çanakkale station. Various model combinations were tried using the SPI series of Bozcaada and Gökçeada stations as input parameters. The data between 1975 and 2003 years were allocated to training set, whereas the rest of data were used in testing set. The ANFIS, SVM and ANNs models with the optimum performance are given in table 3. The performance of the models was determined by using determination coefficients (R²) and root mean squared error (RMSE) in equations (17) and (18).

$$ R^{2} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{N} \left( {D_{{i\left( {\text{real}} \right)}} - D_{{i\left( {\text{model}} \right)}} } \right)^{2} }}{{\mathop \sum \nolimits_{i = 1}^{N} \left( {D_{{i\left( {\text{real}} \right)}} - D_{{i\left( {\text{mean}} \right)}} } \right)^{2} }} $$

(17)

$$ {\text{RMSE}} = \sqrt {\frac{1}{N}\mathop \sum \limits_{i = 1}^{N} \left( {D_{{i\left( {\text{real}} \right)}} - D_{{i({\text{model}) }}}}\right) ^{2} }, $$

(18)

where N is the total data number; D_i(real) is the SPI value; D_i(model) is the model result; and D_i(mean) is the mean SPI value.

Table 3 The results of ANFIS, SVM and ANNs models.

Full size table

In addition, the two-sample Kolmogorov–Simirnov (K–S) test was applied to select the model suitability with the calculated 3-, 6-, 9- and 12-months SPI values (tables 3 and 4). K–S test was performed at significance level 5%. H₀ and H_a hypotheses for this test were given as the two samples follow the same distribution and the distributions of the two samples were different, respectively. As the computed p-value is greater than the significance level 5%, the null hypothesis H₀ cannot be rejected.

Table 4 The results of the W-ANFIS, W-SVM and W-ANNs models.

Full size table

As can be seen from table 3, the performances of the models obtained for 6- and 9-months are higher than those obtained for 3- and 12-months according to these AI techniques. However, SVM model gave the best performance for 9- and 12-months, while the highest R² value was obtained as 0.846 for 6-month in ANNs model.

In the second part, the wavelet transform technique was used as the pre-processing technique in ANFIS, SVM and ANNs models, by considering that increasing input numbers (sub-series) could improve the performance of model. In the preprocessing stage, the 3-, 6-, 9- and 12-months SPI values of Gökçeada and Bozcaada stations were decomposed into eight detailed components (2-4-8-16-32-64-128-256) and one approximation component by using the discrete wavelet transform. Haar, Daubechies (db) and DMeyer (Dmey) wavelets, which are the mostly used wavelets in the discrete wavelet transform technique, were selected to form sub-series. Then, the correlation values were calculated between the sub-series and the SPI values of Çanakkale station. It was found the highest correlation values for the wavelets W1, W2 and W3 for types of Haar, db and Dmey wavelets. Together with the wavelets, the summations W1+W2, W1+W2+W3 were also tried as inputs to develop various hybrid W-ANFIS, W-SVM and W-ANNs models. Since the hybrid models developed with Dmey wavelet gave the highest R² values for W-ANFIS, W-SVM and W-ANNs models, the results of only these models developed by using Dmey wavelet were given in table 4.

According to table 4, the performance of the models improved with wavelet transform technique is higher than the models mentioned in table 3. When the ANFIS models were analyzed, it was seen that R² values increased and error values decreased for the models of 3-, 6- and 9-months, except for the models of a 12-month period. Similarly, the performances of the ANNs and SVM models slightly increased after the application of wavelet transform technique. On the other hand, the W-SVM model for 9-months seemed to have performed lower than the SVM model. When all of the models were analyzed, the highest performance improvement with wavelet transform technique was observed at the ANFIS model of a 9-month period, but the highest R² value (0.874) was obtained from W-ANFIS model for 6-month for testing set. The inputs of 6-months W-ANFIS model were W1+W2 subsets of Gökçeada and W1+W2+W3 subsets of Bozcaada. Accordingly, scatter diagrams of the ANFIS and W-ANFIS models for 6-month period are given in figures 6 and 7. Analyzing the scatter diagrams, it can be inferred that SPI values are in line with the results of the ANFIS and W-ANFIS models. Also, time series of SPI, ANFIS and W-ANFIS results were given in figure 8 for showing compatibility of the models.

Also, the cumulative distribution function graphs of ANFIS and W-ANFIS models for 6-month SPI values were given in figure 9 for the training and test sets.

According to the two-sample K–S test, H₀ hypothesis was found appropriate. The p value of 6-month ANFIS and W-ANFIS models were found to be 0.740 and 0.856 for testing set, respectively. These values were supported by the high R² values as seen in tables 3 and 4. Also, p values of 9-month ANFIS and W-ANFIS models were found high. Even though the performance of both 9-month and 6-month ANFIS and W-ANFIS models are similar values, the R² value for 6-month W-ANFIS model was considerably greater than other models which were showing better performance of 6-month W-ANFIS model.

5 Conclusions

In recent years, drought has been one of the most important problems in the world for the living and economy. The measures to be taken to observe and decrease the effects of drought, which is defined as the incapability of meeting the water demand due to the scarcity of water resource, have a great importance. Therefore, in water resources studies, not only artificial intelligence techniques that use examples to solve a specific problem and do not need expert knowledge, but also the use of data pre-processing methods in increasing the performance of these techniques have become crucial in recent years.

In this study, drought estimation has been performed for the Çanakkale province. Firstly, drought series have been obtained for the 3-, 6-, 9- and 12-months periods with SPI. Secondly, ANN, ANFIS and SVM models have been developed to estimate the drought series for 3-, 6-, 9- and 12-months periods. Later, by decomposing all these series to sub-series with wavelet transform technique, new input sets have been obtained; and finally, W-ANN, W-ANFIS and W-SVM models have been developed.

SVM model has been more successful in the drought forecast for 9- and 12-months period, whereas ANFIS and ANNs models have performed better in the estimation of 3- and 6-months SPI values, respectively. W-ANFIS models developed by wavelet transform technique gave better results in 3-, 6- and 9-months estimations. For 12-months SPI value estimation, W-SVM model has the best result. Additionally, it is observed that wavelet transform technique has improved the performance of ANFIS, ANN and SVM models in the drought forecasting of Çanakkale province. However, it is seen that improvement of model performance is better in ANFIS than the others especially, when data decomposition is applied to all three artificial intelligence methods. According to RMSE, R² and K–S test results, the most appropriate drought forecasting was obtained in 6-months W-ANFIS model. In conclusion, artificial intelligence methods seem to be applicable in drought forecasting and they perform better when improved with hybrid models.

References

Bacanli U G, Firat M and Dikbas F 2009 Adaptive neuro-fuzzy inference system for drought forecasting; Stoch. Environ. Res. Risk Assess. 23(8) 1143–1154, https://doi.org/10.1007/s00477-008-0288-5.
Article Google Scholar
Belayneh A, Adamowski J, Khalil B and Ozga-Zielinski B 2014 Long-term SPI drought forecasting in the Awash River Basin in Ethiopia using wavelet neural network and wavelet support vector regression models; J. Hydrol. 508 418–429, https://doi.org/10.1016/j.jhydrol.2013.10.052.
Article Google Scholar
Belayneh A and Adamowski J 2012 Standard precipitation index drought forecasting using neural networks, wavelet neural networks, and support vector regression; Appl. Comput. Intell. Soft Comput., http://dx.doi.org/10.1155/2012/794061.
Cacciamani C, Morgillo A, Marchesi S and Pavan V 2007 Monitoring and forecasting drought on a regional scale: Emilia–Romagna region; In: Methods and Tools for Drought Analysis and Management, Springer, Dordrecht, pp. 29-48, https://doi.org/10.1007/978-1-4020-5924-7-2.
Chang L C and Chang F J 2001 Intelligent control for modelling of real time reservoir operation; Hydrol. Process. 15(9) 1621–1634, https://doi.org/10.1002/hyp.226.
Article Google Scholar
Chen S T, Yu P S and Tang Y H 2010 Statistical downscaling of daily precipitation using support vector machines and multivariate analysis; J. Hydrol. 385(1–4) 13–22, https://doi.org/10.1016/j.jhydrol.2010.01.021.
Article Google Scholar
Choubin B, Khalighi-Sigaroodi S, Malekian A, Ahmad S and Attarod P 2014 Drought forecasting in a semi-arid watershed using climate signals: A neuro-fuzzy modeling approach; J. Mount. Sci. 11(6) 1593–1605, https://doi.org/10.1007/s11629-014-3020-6.
Article Google Scholar
Edwards D C and McKee T B 1997 Characteristics of 20th-century drought in the United States at multiple time scales; Climatology Report Number 97-2, Colorado State University, Fort Collins, Colorado, https://mountainscholar.org/handle/10217/170176.
Fung K F, Huang Y F and Koo C H 2018 Improvement of SVR-based drought forecasting models using wavelet pre-processing technique; In: E3S Web of Conferences, Vol. 65, 07007p, EDP Sciences.
Gocić M, Motamedi S, Shamshirband S, Petković D and Hashim R 2015 Potential of adaptive neuro-fuzzy inference system for evaluation of drought indices; Stoch. Environ. Res. Risk Assess. 29(8) 1993–2002, https://doi.org/10.1007/s00477-015-1056-y.
Article Google Scholar
Jalalkamali A, Moradi M and Moradi N 2015 Application of several artificial intelligence models and ARIMAX model for forecasting drought using the standardized precipitation index; Int. J. Environ. Sci. Technol. 12(4) 1201–1210, https://doi.org/10.1007/s13762-014-0717-6.
Article Google Scholar
Jang J S R 1993 ANFIS: Adaptive network based fuzzy inference system; IEEE Trans Syst. Man Cybernetics, pp. 665–685.
Kartalopoulos S V 1996 Understanding neural networks and fuzzy logic: Basic concepts and applications; Wiley-IEEE Press.
Kavzoglu T, Sahin E K and Colkesen I 2014 Landslide susceptibility mapping using GIS-based multi-criteria decision analysis, support vector machines, and logistic regression; Landslides 11(3) 425–439; https://doi.org/10.1007/s10346-013-0391-7.
Khan M, Muhammad N and El-Shafie A 2018 Wavelet-ANN vs. ANN-based model for hydrometeorological drought forecasting; Water 10(8) 998, https://doi.org/10.3390/w10080998.
Article Google Scholar
Kisi O and Cimen M 2012 Precipitation forecasting by using wavelet-support vector machine conjunction model; Eng. Appl. Artificial Intell. 25(4) 783–792, https://doi.org/10.1016/j.engappai.2011.11.003.
Article Google Scholar
Kohonen T 1988 An introduction to neural computing; Neural Networks 1(1) 3–16, https://doi.org/10.1016/0893-6080(88)90020-2.
Article Google Scholar
Mallat S G 1989 A theory for multiresolution signal decomposition: The wavelet representation; IEEE Trans. Pattern Analysis & Machine Intelligence 7 674–693, https://doi.org/10.1109/34.192463.
Article Google Scholar
McKee T B, Doesken N J and Kleist J 1993 The relationship of drought frequency and duration to time scales; In: Proceedings of the 8th Conference on Applied Climatology, Boston, MA: American Meteorological Society 17(22) 179–183.
Mehr A D, Kahya E, Bagheri F and Deliktas E 2014 Successive-station monthly streamflow prediction using neuro-wavelet technique; Earth Sci. Inform. 7(4) 217–229, https://doi.org/10.1007/s12145-013-0141-3.
Article Google Scholar
Mishra A K and Desai V R 2006 Drought forecasting using feed-forward recursive neural network; Ecol. Model. 198(1–2) 127–138, https://doi.org/10.1016/j.ecolmodel.2006.04.017.
Article Google Scholar
Mokhtarzad M, Eskandari F, Vanjani N J and Arabasadi A 2017 Drought forecasting by ANN, ANFIS, and SVM and comparison of the models; Environ. Earth Sci. 76(21) 729, https://doi.org/10.1007/s12665-017-7064-0.
Article Google Scholar
Mountrakis G, Im J and Ogole C 2011 Support vector machines in remote sensing: A review; ISPRS J. Photogramm. Remote Sens. 66(3) 247–259, https://doi.org/10.1016/j.isprsjprs.2010.11.001.
Article Google Scholar
Nguyen V, Li Q and Nguyen L 2017 Drought forecasting using ANFIS – a case study in drought prone area of Vietnam; Paddy Water Environ. 15(3) 605–616, https://doi.org/10.1007/s10333-017-0579-x.
Article Google Scholar
Nourani V, Alami M T and Vousoughi F D 2015 Wavelet-entropy data pre-processing approach for ANN-based groundwater level modeling; J. Hydrol. 524 255–269, https://doi.org/10.1016/j.jhydrol.2015.02.048.
Article Google Scholar
Nourani V, Komasi M and Mano A 2009 A multivariate ANN-wavelet approach for rainfall–runoff modeling; Water Resour. Manag. 23(14) 2877, https://doi.org/10.1007/s11269-009-9414-5.
Article Google Scholar
Oztemel E 2003 Artificial neural networks; Papatya Publications, Istanbul (in Turkish).
Google Scholar
Paulo A, Martins D and Pereira L S 2016 Influence of precipitation changes on the SPI and related drought severity. An analysis using long-term data series; Water Resour. Manag. 30(15) 5737–5757, https://doi.org/10.1007/s11269-016-1388-5.
Santos G C A and Silva G B L 2014 Daily streamflow forecasting using a wavelet transform and artificial neural network hybrid models; Hydrol. Sci. J. 59(2) 312–324, https://doi.org/10.1080/02626667.2013.800944.
Article Google Scholar
Shahbazi A R N, Zahraie B, Sedghi H, Manshouri M and Nasseri M 2011 Seasonal meteorological drought prediction using support vector machine; World Appl. Sci. J. 13(6) 1387–1397.
Google Scholar
Shirmohammadi B, Moradi H, Moosavi V, Semiromi M T and Zeinali A 2013 Forecasting of meteorological drought using Wavelet-ANFIS hybrid model for different time steps (case study: Southeastern part of east Azerbaijan province, Iran); Nat. Hazards 69(1) 389–402, https://doi.org/10.1007/s11069-013-0716-9.
Article Google Scholar
Singh R M 2011 Wavelet-ANN model for flood events; Proceedings of the International Conference on Soft Computing for Problem Solving (SocProS 2011). December 20–22, pp. 165–175. Part of the Advances in Intelligent and Soft Computing book series (AINSC, volume 131).
GDTM website 2017 http://www.izmir.mgm.gov.tr/files/iklim/canakkale_iklim. pdf (in Turkish).
GMKA website 2017 https://www.gmka.gov.tr/dokumanlar/yayinlar/GMKA-Bozcaada-Gokceada_Degerlendirme_ Raporu.pdf (in Turkish).
Tigkas D, Vangelis H and Tsakiris G 2015 DrinC: A software for drought analysis based on drought indices; Earth Sci. Inform. 8(3) 697–709, https://doi.org/10.1007/s12145-014-0178-y.
Article Google Scholar
Tiwari M K and Chatterjee C 2011 A new wavelet–bootstrap–ANN hybrid model for daily discharge forecasting; J. Hydroinform. 13(3) 500–519, https://doi.org/10.2166/hydro.2010.142.
Article Google Scholar
Tripathi S, Srinivas V V and Nanjundiah R S 2006 Downscaling of precipitation for climate change scenarios: A support vector machine approach; J. Hydrol. 330(3–4) 621–640, https://doi.org/10.1016/j.jhydrol.2006.04.030.
Article Google Scholar
Tsakiris G and Vangelis H 2004 Towards a drought watch system based on spatial SPI; Water Resour. Manag. 18(1) 1–12, https://doi.org/10.1023/B:WARM.0000015410.47014.a4.
Article Google Scholar
Vapnik V 1995 The nature of statistical learning theory; Springer-Verlag, New York.
Book Google Scholar
Wang W and Ding J 2003 Wavelet network model and its application to the prediction of hydrology; Nature Sci. 1(1) 67–71.
Google Scholar
Wang W, Zhu Y, Xu R and Liu J 2015 Drought severity change in China during 1961–2012 indicated by SPI and SPEI; Nat. Hazards 75 2437–2451, https://doi.org/10.1007/s11069-014-1436-5.
Article Google Scholar
Zarch M A A, Sivakumar B and Sharma A 2015 Droughts in a warming climate: A global assessment of standardized precipitation index (SPI) and reconnaissance drought index (RDI); J. Hydrol. 526 183–195, https://doi.org/10.1016/j.jhydrol.2014.09.071.
Article Google Scholar

Download references

Acknowledgement

Onur Özcanoğlu, who was one of the potential authors of the manuscript, passed away during the preparation of this manuscript. The other authors are grateful to him for his great contributions to this paper.

Author information

Authors and Affiliations

Department of Civil Engineering, Faculty of Engineering, Suleyman Demirel University, Isparta, Turkey
Emine Dilek Taylan
Department of Civil Engineering, Faculty of Technology, Isparta Applied Sciences University, Isparta, Turkey
Özlem Terzi
Department of Civil Engineering, Graduate School of Natural and Applied Sciences, Suleyman Demirel University, Isparta, Turkey
Tahsin Baykal

Authors

Emine Dilek Taylan
View author publications
You can also search for this author in PubMed Google Scholar
Özlem Terzi
View author publications
You can also search for this author in PubMed Google Scholar
Tahsin Baykal
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Emine Dilek Taylan: Data analysis, SPI analysis, model evaluating, and editing, writing and reviewing the manuscript. Özlem Terzi: Data analysis, SVM modelling, ANNs modelling, model evaluating, and editing, writing and reviewing the manuscript. Tahsin Baykal: Data analysis, wavelet transform analysis, ANFIS modelling, model evaluating, and editing, writing and reviewing the manuscript. Onur Özcanoğlu: ANFIS modelling and writing the manuscript.

Corresponding author

Correspondence to Emine Dilek Taylan.

Additional information

Communicated by Kavirajan Rajendran

Rights and permissions

Reprints and permissions

About this article

Cite this article

Taylan, E.D., Terzi, Ö. & Baykal, T. Hybrid wavelet–artificial intelligence models in meteorological drought estimation. J Earth Syst Sci 130, 38 (2021). https://doi.org/10.1007/s12040-020-01488-9

Download citation

Received: 09 December 2019
Revised: 13 May 2020
Accepted: 28 July 2020
Published: 13 February 2021
DOI: https://doi.org/10.1007/s12040-020-01488-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.