1 Introduction

Drought is the most frequent natural disasters adversely affecting most parts of the world (Wilhite and Glantz 1985; Lin et al. 2017). Conceptually, drought is characterized through the prolonged and abnormal state of water deficit or remarkable shortfall in precipitation, consequently, it affects the water resources (Wilhite et al. 2014). In general, drought can be grouped in four broader categories including hydrological, meteorological, agricultural, and socio-economic droughts (Mishra et al. 2007; Mishra and Singh 2010). A variety of drought indices (DI) has been derived for monitoring and defining the drought parameters like duration, severity, intensity, and areal extent (Mishra and Singh 2011). Some of the widely used DI for meteorological drought quantification and forecasting include the Standardized Precipitation Index (SPI) (McKee et al. 1993), Effective Drought Index (EDI) (Byun and Wilhite 1999), Standardized Precipitation Evapotranspiration Index (SPEI) (Vicente-Serrano et al. 2010). The computation of the EDI assigns different weights to recent and historical precipitation (Byun and Wilhite, 1999), which is advantageous over the other DI (Kim et al. 2009; Byun and Kim 2010). In related contrast, several studies have reported the robustness and effectiveness of the EDI over the many DI (Morid et al. 2006; Jain et al. 2015b; Deo et al. 2017a). Therefore, scientists have drawn attention to the drought forecasting for several purposes such as planning and management of freshwater resources, policymaking, drought preparedness, risk management, and mitigation (Wilhite 1986; Mishra and Singh 2011; Kisi et al. 2019).

The spatio-temporal analysis of droughts helps to recognize the overall impact of drought on a regional scale with different thresholds (Mishra and Singh 2009). The information obtained from regional drought analysis is vital for short and long-term planning and management of water resources systems. Also, the frequency distribution of drought occurrence is utilized to construct the drought severity–area–frequency (SAF), severity–duration–frequency (SDF), and severity–area–duration (SAD) curves. These curves are beneficial for evaluating the droughts in a region. Various studies have been conducted by the researchers on spatial and temporal patterns of drought at a regional scale by employing the SAF, SDF, and SAD curves (Kim et al. 2002; Mishra and Desai 2005b; Vicente-Serrano 2006; Mishra and Singh 2009; Jasim and Awchi 2020). Presently, the spatio-temporal patterns of drought have been analyzed through the application of remote sensing and geographical information system using various DI (Akhtari et al. 2009; Jain et al. 2015a; Patel and Yadav 2015; Manikandan and Tamilmani 2015; Thomas et al. 2015; Rahman and Lateh 2016; Zarei et al. 2016; Mitra and Srivastava 2017).

However, the accurate forecasting of the drought is a critical subject in recent times, and the extensive work has been conducted on modeling numerous characteristics of drought, but the major challenge is to implement suitable methods for forecasting the onsets and terminations points of the droughts. Over the last 30 years, regression models (Kumar and Panu 1997; Leilah and Al-Khateeb 2005), stochastic models (Mishra and Desai 2005a; Fernández et al. 2009; Durdu 2010; Bahrami et al. 2019), probability models (Cancelliere et al. 2007) and neural network models (Mishra and Desai 2006; Morid et al. 2007; Mishra et al. 2007; Bacanli et al. 2009; Özger et al. 2012) have been applied for this purpose. Zhang et al. (2017a, b) applied Autoregressive Integrated Moving Average (ARIMA), Artificial Neural Network (ANN), and Wavelet-ANN (W-ANN) models to forecast the SPI-6 and SPI-12 in the north of Haihe River basin, China. They found that the WANN performed superior to the alternatives in forecasting the multi-scalar SPI in the study basin. Rafiei-Sardooi et al. (2018) forecasted SPI-3 and SPI-12 in the Jiroft Plain of south Kerman Province (Iran) using Neuro-Fuzzy (NF) and ARIMA models. They found that the NF model significantly outperformed the ARIMA model for both time scales in the study region. Soh et al. (2018) examined the potential of Wavelet-ARIMA-ANN and Wavelet-Adaptive Neuro-Fuzzy Inference System (W-ANFIS) to predict the SPEI-1, SPEI-3, and SPEI-6 in the Langat River basin, Malaysia. They found that the prediction accuracy of both models was improved with the length of time scale (wavelet-ARIMA-ANN for SPEI-3 and SPEI-6, and W-ANFIS for SPEI-1) in the study basin.

In recent times, the extensive and successful application of simple and hybrid meta-heuristic or artificial intelligence (AI) models such as Support Vector Regression (SVR), ANN, W-ANN, ANFIS, Multilayer Perceptron Neural Network (MLPNN), Genetic Algorithm (GA), Gene Expression Programming (GEP), Extreme Learning Machine (ELM), Co-Active Neuro-Fuzzy Inference System (CANFIS), Support Vector Machine (SVM), Adaptive Neuro-Fuzzy Inference System (ANFIS), W-SVM, and k-Nearest Neighbor (KNN) have been found in meteorological drought prediction on regional and global scales (Barua et al. 2012; Bari Abarghouei et al. 2013; Belayneh et al. 2014; Jalalkamali et al. 2015; Maca and Pech 2016; Djerbouai and Souag-Gamane 2016; Ali et al. 2017; Mokhtarzad et al. 2017; Zhang et al. 2017b; Ibrahimi and Baali 2018; Komasi et al. 2018; Bahrami et al. 2019; Khan et al. 2020). Memarian et al. (2016) examined the potential of CANFIS for meteorological drought forecasting in Birjand (Iran) through SPI and climatic parameters. They found better predictability of the CANFIS model over the study area. Achour et al. (2020) analyzed and forecasted drought using the multi-scalar SPI (i.e., 3, 6, 9, and 12 months) by the MLPNN model in Algeria. They found that the MLPNN model with the Levenberg–Marquardt algorithm provides suitable results over the study region. Zhang et al. (2020) applied ARIMA, W-ANN, and SVM techniques to forecast drought in the Sanjiang Plain of China using 12-month scale SPEI. Results showed that the ARIMA model outperformed the other two models in the study region. However, the performance of AI-based models could be improved by tuning its parameters to achieve robust accuracy.

Recently, the new bio-inspired algorithms, i.e. Grey Wolf Optimizer (GWO), Spotted Hyena Optimizer (SHO), Particle Swarm Optimization (PSO), Genetic Algorithm (GA), Ant Colony Optimization (ACO) have more attention to solve optimization problems and improve the AI models. Improving the AI model could increase the prediction accuracy even in the drought prediction field. Aghelpour et al. (2020) utilized three indices; Palmer Drought Severity Index (PDSI), SPI, and multivariate-SPI (MSPI) to forecast the agricultural drought by employing the ANFIS, ANFIS-ACO, ANFIS-GA, and ANFIS-PSO at 31 synoptic stations positioned in Iran. Results revealed the better suitability of ANFIS-ACO and ANFIS-GA models over the study stations, and the bio-inspired algorithms improve the accuracy of the ANFIS model. The GWO (Mirjalili et al. 2014) is one of the modern nature-inspired algorithms widely used for improving AI models, even in the field of hydrology. Dehghani et al. (2019) proposed the ANFIS-GWO model for immediate-short-term to long-term influent flow prediction from a wastewater treatment plant located in Isfahan city, Iran. The results obtained support the superiority of hybrid GWO models. Tikhamarine et al. (2020b) applied the GWO algorithm to improve AI model accuracy for monthly streamflow prediction. In this work, the GWO algorithm was combined with ANN, SVM, and MLR models to predict the monthly streamflow of the Nile River at the Aswan High Dam, Egypt, and the results obtained support the ability of hybrid GWO models. Moreover, Maroufpoor et al. (2020) used the GWO algorithm to improve the ANN model for estimating the daily reference evapotranspiration and their results were very promising.

According to the reported literature and author knowledge, only limited work has been conducted on the prediction of meteorological drought (MD) by employing the hybrid meta-heuristic approaches. Moreover, the benefits of the GWO algorithm get attention to use it for meteorological drought prediction and compared with another new algorithm (i.e. Spotted Hyena Optimizer: SHO). The novelty of the current work is to investigate the potential capability of support vector regression (SVR) embedded with novel bio-inspired algorithms (i.e. Grey Wolf Optimizer and Spotted Hyena Optimizer) for meteorological drought prediction in Kumaon (six stations) and Garhwal (seven stations) regions with effective drought index. These methods were not previously implemented for this issue. For validation purposes, the performance of these models was evaluated by using five statistical indicators and through graphical interpretation.

2 Study location and data collection

The Uttarakhand State is divided into (i) Kumaon region including six districts namely Almora, Bageshwar, Champawat, Nainital, Pithoragarh and Udham Singh Nagar (Pantnagar), and (ii) Garhwal region including seven districts namely Chamoli, Dehradun, Haridwar, Pauri Garhwal, Rudraprayag, Tehri Garhwal, and Uttarkashi (Fig. 1). The geographical co-ordinate of Uttarakhand State lies between 28°43′ N and 31°28′ N latitudes, and 77°34′ E and 81°03′ E longitudes with varying elevation from 145 to 7796 m above MSL and comprises 53,812.47 km2 area. The region of Uttarakhand with high altitude (> 4572 m) is cold throughout the year and unreachable due to heavy snowfall (Nandargi et al. 2016). The annual rainfall ranges from 260 to 3955 mm, mostly received (60–85%) in monsoon season (June–September month) over the State (Basistha et al. 2008; Nandargi et al. 2016; Malik et al. 2019b, 2020; Malik and Kumar 2020). The heavy rainfall is received in the eastern edges of the Himalayan ranges and comparatively dry in the western region. The temperature varies between sub-zero -43 °C, and during the winter season ranges from 0 to 15 °C in the Uttarakhand State (Nandargi et al. 2016; Malik et al. 2019b, 2020; Malik and Kumar 2020). Over the state, the summer season extends from April to June, while the winter season covers from October to February months. Table 1 explores the information on geographical coordinates and rainfall data availability year (month) for Kumaon and Garhwal regions. The monthly rainfall data for study stations were obtained from CRC (Crop Research Centre) observatory set at G.B. Pant University of Agriculture and Technology, Pantnagar, and IMD (Indian Meteorological Department), Pune.

Fig. 1
figure 1

Study location map of Kumaon and Garhwal regions (Uttarakhand)

Table 1 Particulars of study stations and rainfall data availability

3 Methodology

3.1 Effective drought index

EDI is a globally accepted standard metric for meteorological drought quantification, introduced by Byun and Wilhite (1999). The computation of EDI includes the effective precipitation (EP), which is the precipitation amount with a time-dependent reduction function, and precipitation required for return to normal (PRN) conditions. EDI was first developed for monitoring drought duration and severity on the daily time scale (Byun and Wilhite 1999; Kim et al. 2009; Roudier and Mahe 2010). Nowadays, its principles are prolonged on monthly time scales to investigate the drought duration, drought severity, and drought intensity (Smakhtin and Hughes 2007; Morid et al. 2007; Dogan et al. 2012; Jain et al. 2015b; Deo et al. 2017a). The EDI is computed using the Eqs. (14):

$${EP}_{i}={P}_{1}+\frac{({P}_{1}+{P}_{2})}{2}+\frac{({P}_{1}+{P}_{2}+{P}_{3})}{3}+\dots +\frac{\left({P}_{1}+{P}_{2}+\dots +{P}_{12}\right)}{12}$$
(1)

where P = monthly rainfall, EP = effective precipitation for the ith duration of summation (DS).

The deviation of EP (DEP) form the mean of EP (MEP) is computed using Eq. (2):

$$DEP=EP-MEP$$
(2)

The precipitation needed for a return to normal \({(PRN}_{j})\) for the jth DS is determined using Eq. (3):

$${PRN}_{j}=\frac{{DEP}_{j}}{\sum_{N=1}^{j}(1/N)}$$
(3)

Finally, EDI is calculated as (Byun and Wilhite 1999):

$$EDI=\frac{{PRN}_{j}}{SD({PRN}_{j})}$$
(4)

where \(SD(PRN)\) indicates the standard deviation of PRN for the jth month in the record time.

3.2 Support vector regression (SVR)

The SVR is the extended version of SVM (support vector machine) created on regression functions. The SVM is an intelligent model dependent on the structural risk minimization (SRM) principle developed by Vapnik (1995). Smola (1996) added regression functions to the original version of the SVM model and created a new version of SVM to solve regression and prediction problems called SVR. The relationship among the input and output parameters is recognized (Su et al. 2018; Tikhamarine et al. 2019) using Eq. (5):

$$y=f\left(x\right)=w\times \phi \left(x\right)+b$$
(5)

where \(y\) is the output variable, \(x\) is the input variable, \(f\) is the regression function, ϕ is the mapping function, \(w\) and \(b\) are the weight and the bias, and both defined using Eqs. (67) (Su et al. 2018):

$$\mathrm{Minimize}:\frac{1}{2}{\Vert w\Vert }^{2}+C\sum_{i=1}^{N}({\xi }_{i}+{\xi }_{i}^{*})$$
(6)
$$\mathrm{Subject to}:\left\{\begin{array}{c}{y}_{i}-f\left(x\right)\le \varepsilon +{\xi }_{i}\\ f\left(x\right)-{y}_{i}\le \varepsilon +{\xi }_{i}^{*}\\ {\xi }_{i},{\xi }_{i}^{*}\ge 0, i=\mathrm{1,2},3,\dots .,N\end{array}\right.$$
(7)

where C is the penalty parameter, \({\xi }_{i}\) and \({\xi }_{i}^{*}\) are corresponding the slack variables and ε is the boundary value.

Employing the Lagrangian multipliers, the optimization problem largely converts to quadratic programming and a nonlinear regression function solution can be defined as below:

$$f\left(x\right)=\sum_{i=1}^{N}\left({\alpha }_{i}-{\alpha }_{i}^{*}\right)K\left(x,{x}_{i}\right)+b$$
(8)

where \(K\left(x,{x}_{i}\right)\) is the Kernel function and \({\alpha }_{i}\), and \({\alpha }_{i}^{*}\) are the dual variables.

There are several kernel functions such as linear, radial basis function (RBF), polynomial, and sigmoid (Ansari and Gholami 2015; Zhang et al. 2017a; Tikhamarine et al. 2020a). In this study, the RBF kernel function was used to optimized the SVR model, and written as:

$$K\left(x,{x}_{i}\right)=exp\left(-\gamma {\Vert x-{x}_{i}\Vert }^{2}\right)$$
(9)

In which, \(\gamma\) denotes the kernel parameter.

3.3 Meta-heuristic algorithms

3.3.1 Grey Wolf Optimizer (GWO)

Mirjalili et al. (2014) developed the GWO algorithm based on the hunting mechanisms of grey wolves and their strict hierarchy to search and hunt for prey. The grey wolf hierarchy includes four groups: alpha (α), beta (ß), delta (δ), and omega (ω). The leader is the alpha wolf and first in charge of decision-making in the pack and followed by beta, delta, and omega wolves, respectively. The mechanisms of wolves’ hunting are: tracking, encircling, and attacking the prey (Fig. 2). For more information about the GWO algorithm, readers can refer to Mirjalili et al. (2014).

Fig. 2
figure 2

source: Mirjalili et al. 2014)

Hunting behavior of grey wolves: a chasing, approaching, and tracking prey; b–d pursuit, harassing, and encircling; e stationary situation and attack (

3.3.2 Spotted Hyena Optimizer (SHO)

Spotted Hyena Optimizer (SHO) is a new optimization algorithm suggested by Dhiman and Kumar (2017). The SHO algorithm mimics the hunting behavior of the Spotted Hyenas in nature. In this algorithm, the optimal solution is found based on four steps of prey hunting; encircling, hunting, attacking prey (exploitation), and search for prey (exploration). Figure 3 illustrates the most important steps of the aforementioned approach. Readers can refer to Dhiman and Kumar (2017) for detailed information about the working functions of the SHO algorithm.

Fig. 3
figure 3

source: Dhiman and Kumar 2017)

Hunting algorithm performed in the algorithm of Spotted Hyena Optimizer (SHO): a the feasible next places of the members along with position vectors, and b attacking the prey (

In this paper, SVR was selected and developed as an artificial intelligence model for meteorological drought prediction. Two meta-heuristic algorithms (i.e. GWO and SHO) were selected and compared to each other to find optimal SVR parameters to increase model performance. In fact, the performance of the SVR usually depends on the correct identification of its parameters. Consequently, the optimal parameters could increase the accuracy of SVR to achieve robust performance. Furthermore, a high optimization algorithm is needed to solve this optimization problem. To build the SVR–GWO hybrid model, the GWO algorithm was combined with the SVR model to find ideal SVR parameters, while the SHO was combined with the SVR to build the SVR–SHO model. The steps and construction processes for SVR–GWO (Mirjalili et al. 2014) and SVR–SHO (Dhiman and Kumar 2017) are concise in Fig. 4. The steps involved in Fig. 4 are outlined below:

Fig. 4
figure 4

Flowchart of the proposed hybrid a SVR–GWO, and b SVR–SHO models

Step 1: Initialize the population of spotted hyenas and position of grey wolves.

Step 2: Select the initial parameters of SHO and GWO, and describe the maximum number of iterations.

Step 3: Calculate the fitness value of each search agent for SHO and GWO.

Step 4: Explore the best search agent in the given search space.

Step 5: Outline the cluster of optimal solutions until the acceptable results are found.

Step 6: Update the positions of the search agents in SHO (Dhiman and Kumar 2017) and GWO (Mirjalili et al. 2014).

Step 7: Check the search agent that is bypassing the search space and then modify it.

Step 8: Calculate the update search agent fitness value and update the vector if there is a better solution than the previous optimal solution.

Step 9: Update the group of spotted hyenas and grey wolves to updated search agent fitness values.

Step 10: Stop the SHO and GWO algorithm, if the stopping criteria are satisfied, otherwise back to step 5.

Step 11: Return the best optimal solution, after stopping criteria is satisfied, which is obtained so far.

3.4 Optimal input parameter nomination

After the computation of EDI at the study stations, the prediction of the meteorological drought was proceeded by using hybrid SVR–GWO, and SVR–SHO models. The optimal inputs were nominated by employing autocorrelation function (ACF) and partial-ACF (PACF) on EDI time-series of each study station at 5% level of significance (Tiwari and Adamowski 2013; Deo et al. 2017b; Malik et al. 2019c). The ACF and PACF were computed using Eqs. (10 and 11):

$${ACF}_{k}=\frac{\sum_{t=1}^{N-k}({Y}_{t}-\stackrel{-}{Y}) ({Y}_{t+k}-\stackrel{-}{Y})}{\sum_{t=1}^{N}({Y}_{t}-\stackrel{-}{Y})^{2}}$$
(10)
$$P{ACF}_{k,k}=\frac{ACF-\sum_{j=1}^{k-1}{PACF}_{k-1,j} {ACF}_{k-1}}{1-\sum_{j=1}^{k-1}{PACF}_{k-1,j} {ACF}_{k-1} }$$
(11)

where k displays the lag through the sequence \({Y}_{t}\), \(\stackrel{-}{Y}\) shows the average of the entire EDI data series, and N is the number of data points in the EDI series.

The PACF values for kth lags were tested at 5% level of significance by constructing the upper and lower critical limits (UCL and LCL) (Anderson 1942; Ljung and Box 1978) using Eq. (12):

$${UCL}/{LCL}=\frac{1.96}{\sqrt{\mathrm{N}}}$$
(12)

The final nomination of input (lags) for output (target) prediction was done based on the significant value of PACF at 5% confidence interval (cross the UCL and LCL) for all the study stations located in both regions. In statistical hypothesis testing, at 5% confidence interval (critical limit =  ± 1.96) given by Fisher (1925), and widely employed in numerous fields of water resources engineering (Sudheer et al. 2002; Landeras et al. 2009; Narasimha Murthy et al. 2019; Malik and Kumar 2020; Akar and Aksoy 2020; Malik et al. 2020). Table 2 explores the details of input variables (lags) used for output (EDI) prediction by constructing the SVR–GWO and SVR–SHO models at study stations. Figure 5a–f demonstrates the PACF results on EDI data series for Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar stations of the Kumaon region, which indicates the significant (optimal) lags 1, 2, 5, and 6 for Almora Station (Fig. 5a), the lags 1, 2, and 6 for Bageshwar Station (Fig. 5b), the lags 1, 2, 5 and 6 for Champawat Station (Fig. 5c), the lags 1, 2, 4, 5, and 11 for Nainital Station (Fig. 5d), lags 1, and 9 for Pithoragarh (Fig. 5e), and the lags 1, and 2 for Pantnagar Station (Fig. 5f). Similarly, Fig. 6a–g illustrates the PACF results on EDI data series for Chamoli, Dehradun, Haridwar, Pauri Garhwal, Rudraprayag, Tehri Garhwal, and Uttarkashi stations of Garhwal region, which indicates the significant (optimal) lags 1, 7, and 9 for Chamoli Station (Fig. 6a), the lag 1 for Dehradun and Rudraprayag stations (Fig. 6b, e), the lags 1, and 2 for Haridwar and Tehri Garhwal stations (Fig. 6c, f), the lags 1, 2, 5, and 10 for Pauri Garhwal Station (Fig. 6d), and lags 1, and 7 for Uttarkashi Station (Fig. 6g). Table 3 summarises the details of training (70%) and testing (30%) EDI data percentages utilized for one month ahead meteorological drought prediction over the study regions.

Table 2 Optimized hybrid SVR models for EDI prediction in Kumaon and Garhwal regions
Fig. 5
figure 5

PACF analysis of EDI at a Almora, b Bageshwar, c Champawat, d Nainital, e Pithoragarh, and f Pantnagar stations in Kumaon region

Fig. 6
figure 6

PACF analysis of EDI at a Chamoli, b Dehradun, c Haridwar, d Pauri Garhwal, e Rudraprayag, f Tehri Garhwal, and g Uttarkashi stations in Garhwal region

Table 3 Calibration and validation datasets percentage for hybrid SVR models at study regions

3.5 Statistical indicators

The formulated hybrid SVR–GWO and SVR–SHO models in Kumaon and Garhwal regions for EDI prediction were evaluated by utilizing statistical indicators (mean absolute error: MAE, root mean square error: RMSE, Nash–Sutcliffe efficiency: NSE, coefficient of correlation: COC, and Willmott Index: WI), and through graphical inspection (time variation graph, and scatter-plot). The mathematical expression of MAE, RMSE, NSE, COC, and WI are as follows:

  1. 1

    Mean absolute error (Legates and McCabe 1999; Willmott and Matsuura 2005)

    $$MAE=\frac{1}{N}\sum_{i=1}^{N}|{EDI}_{pre,i}-{EDI}_{cal,i}| \left(0 <{ MAE }<{ \infty }\right)$$
    (13)
  2. 2

    Root mean square error (Legates and McCabe 1999; Willmott and Matsuura 2005)

    $$RMSE=\sqrt{\frac{1}{N} \sum_{i=1}^{N}({EDI}_{cal, i }- {EDI}_{pre,i}{)}^{2}} \left(0 <\mathrm{ RMSE }<{ \infty }\right)$$
    (14)
  3. 3

    Nash–Sutcliffe efficiency (Nash and Sutcliffe 1970)

    $$NSE=1-\left[\frac{\sum_{i=1}^{N}({EDI}_{cal,i }- {EDI}_{pre,i}{)}^{2}}{\sum_{i=1}^{N}({{EDI}_{cal,i }- \stackrel{-}{{EDI}_{cal }})}^{2}}\right] \left(-\infty <\mathrm{ NSE }< 1\right)$$
    (15)
  4. 4

    Coefficient of correlation (Taylor 2001; Moriasi et al. 2015)

    $$COC=\frac{\sum_{\mathrm{i}=1}^{\mathrm{N}}\left({EDI}_{cal,i }- \stackrel{-}{{EDI}_{cal }}\right) \left({EDI}_{pre,i} - \stackrel{-}{{EDI}_{pre}}\right) }{\sqrt{\sum_{\mathrm{i}=1}^{\mathrm{N}}({{EDI}_{cal,i }- \stackrel{-}{{EDI}_{cal }})}^{2} \sum_{\mathrm{i}=1}^{\mathrm{N}}({{EDI}_{pre,i} - \stackrel{-}{{EDI}_{pre}})}^{2} } } \left(-1 <\mathrm{ COC }< 1\right)$$
    (16)
  5. 5

    Willmott index (Willmott 1981)

    $$\mathrm{WI}=1-\left[\frac{\sum_{i=1}^{N}({EDI}_{pre,i} - {EDI}_{cal, i}{)}^{2}}{\sum_{i=1}^{N}({\left|{EDI}_{pre,i} - \stackrel{-}{{EDI}_{cal }}\right|+\left|{EDI}_{cal,i }- \stackrel{-}{{EDI}_{cal }}\right|)}^{2}}\right]\left(0 <\mathrm{ WI }\le 1\right),$$
    (17)

where \({EDI}_{cal}\), \({EDI}_{pre}\), \(\stackrel{-}{{EDI}_{cal}}\), and \(\stackrel{-}{{EDI}_{pre}}\) represents the calculated, predicted, mean of calculated, and mean of predicted EDI values for N observations in the ith dataset. A model with lower MAE and RMSE, and higher NSE, COC, and WI were considered relatively better for MD prediction at study stations.

4 Results and discussion

4.1 Drought prediction using hybrid SVR models

After the selection of significant lags/inputs for both regions as listed in Table 2 were utilized for 1 month ahead EDI prediction by employing the hybrid SVR–GWO and SVR–SHO models. These hybrid SVR models were calibrated and validated using 70% and 30% of EDI time-series data and evaluated through statistical indicators (MAE, RMSE, NSE, COC, and WI) and by visual inspection (time variation graph, and scatter-plot). Table 4 summarizes the validation results of SVR–GWO and SVR–SHO models at Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Udham Pantnagar stations. The MAE, RMSE, NSE, COC and WI of the SVR–GWO, respectively, ranges 0.363 to 0.622, 0.538 to 0.964, 0.575 to 0.860, 0.763 to 0.930 and 0.865 to 0.958 while those of the SVR–SHO varies from 0.379 to 0.623, 0.567 to 0.965, 0.553 to 0.765, 0.745 to 0.931, and 0.847 to 0.931, respectively. The ranges clearly reveal that the SVR–GWO is superior to the SVR–SHO in drought prediction based on EDI. It is observed from Table 4 that the SVR–GWO model ranked first at Almora, Bageshwar, Champawat, Nainital, Pithoragarh and Pantnagar stations with minimum MAE (RMSE) = 0.418, 0.445, 0.420, 0.379, 0.363, 0.622 (0.603, 0.652, 0.659, 0.565, 0.538, 0.965) and higher NSE/COC (WI) = 0.575, 0.721, 0.646, 0.697, 0.860, 0.660/ 0.763, 0.850, 0.817, 0.835, 0.930, 0.819 (0.865, 0.916, 0.891, 0.906, 0.958, 0.895), respectively. The hybrid SVR–GWO model is closely followed by the SVR–SHO model. The comparison of two methods and all six stations in Table 4 exposed the best prediction of EDI at Pithoragarh station.

Table 4 Statistical indicators results of hybrid SVR models during validation period at Kumaon and Garhwal regions

Similarly, Table 4 outlines the prediction results of SVR–GWO, and SVR–SHO models during the validation period for Chamoli, Dehradun, Haridwar, Pauri Garhwal, Rudraprayag, Tehri Garhwal, and Uttarkashi stations. The MAE, RMSE, NSE, COC and WI of the SVR–GWO vary from 0.253 to 0.402, from 0.363 to 0.658, from 0.558 to 0.805, from 0.754 to 0.899 and from 0.854 to 0.943 while the ranges of the SVR–SHO are 0.269 to 0.417, 0.379 to 0.667, 0.546 to 0.730, 0.755 to 0.893 and 0.854 to 0.923, respectively. In overall, the SVR–GWO has lower arrays of MAE and RMSE, and higher ranges of NSE, COC, and WI compared to SVR–SHO. As clearly seen from Table 4, the SVR–GWO model had the lowest MAE and RMSE and highest NSE, COC, and WI indicators at seven study stations located in the Garhwal region. Furthermore, a comparison across all two methods and seven stations in Table 4 shows that the SVR–GWO model (MAE = 0.253, RMSE = 0.363, NSE = 0.753, COC = 0.870, and WI = 0.929) yielded best prediction of EDI at Uttarkashi station.

The goodness-of-fit of two EDI predictive models was further assessed by plotting the time variation graphs and scatter plots of calculated vs predicted EDI values during the validation phase for six stations situated in the Kumaon region (Fig. 7a–f). These figures confirm the results presented in Table 4, illustrating the better performance of the SVR–GWO model with a coefficient of determination (R2) = 0.583, 0.723, 0.668, 0.698, 0.865, and 0.671 for Almora, Bageshwar, Champawat, Nainital, Pithoragarh, and Pantnagar stations compared to SVR–SHO, respectively. Also, the less degree of scattering was visible between predicted and calculated EDI observations for SVR–GWO models at six stations as illustrated in Fig. 7a–f. Based on the coefficient of determination of the regression line and degree of scattering, the SVR–GWO model appeared to have a better accuracy compared to the SVR–SHO model. Moreover, the SVR–GWO model with designated lags (Table 4) could be cast for prediction of MD based on EDI at six study stations of the Kumaon region.

Fig. 7
figure 7figure 7

Comparison of calculated and predicted EDI values by hybrid SVR models for validation period at a Almora, b Bageshwar, c Champawat, d Nainital, e Pithoragarh, and f Pantnagar stations in Kumaon region

Figure 8a–g demonstrates the time variation graphs and scatter plots between calculated and predicted EDI values by SVR–GWO, and SVR–SHO models during the validation period at seven stations positioned in the Garhwal region. It is noticed from these figures that the regression line has R2 = 0.717, 0.569, 0.577, 0.807, 0.666, 0.647, and 0.757 for the SVR–GWO model at Chamoli, Dehradun, Haridwar, Pauri Garhwal, Rudraprayag, Tehri Garhwal, and Uttarkashi stations which are higher than those of the SVR–SHO, respectively. Likewise, the degree of scattering was less between predicted and calculated EDI observations for the SVR–GWO model in the Garhwal region. Also, these figures clearly expose the SVR–GWO model with elected lags that could be utilized for MD prediction through EDI at seven stations of the Garhwal region.

Fig. 8
figure 8figure 8

Comparison of calculated and predicted EDI values by hybrid SVR models for validation period at a Chamoli, b Dehradun, c Haridwar, d Pauri Garhwal, e Rudraprayag, f Tehri Garhwal, and g Uttarkashi stations in Garhwal region

4.2 Discussion

To address the negative consequences of droughts on the ecosystem, the researchers must formulate the versatile models for better prediction or forecasting of drought, exclusively in the nations/ regions that are sensitive to climate change. In this study, two-hybrid SVR models have been used to predict the EDI in Kumaon (six stations) (and Garhwal seven stations) regions of Uttarakhand State utilizing data for the period of 1901–2015 (115-year), 1961–2016 (56-year), 1955–2015 (61 years), and 1951–2015 (65 years). Two types of meta-heuristic algorithms, i.e., Grey Wolf Optimizer and Spotted Hyena Optimizer were examined with the SVR technique. The results of the appraisal are evident in the better feasibility and predictability of the hybrid SVR–GWO model in Kumaon and Garhwal regions. Furthermore, with respect to the MAE (RMSE) of SVR–GWO model the prediction accuracy of SVR–SHO model was enhanced by 3.9% (2.6%) for Almora, 5.7% (3.0%) for Bageshwar, 6.3% (2.7%) for Champawat, 0.3% (0.4%) for Nainital, 28.3% (22.7%) for Pithoragarh, and 0.2% (0.1%) for Pantnagar stations placed in Kumaon region. Similarly, the prediction accuracy of SVR–SHO model was enhanced by 2.5% (1.0%) for Chamoli, 0.3% (0.4%) for Dehradun, 1.0% (1.3%) for Haridwar, 10.6% (8.3%) for Pauri Garhwal, 1.2% (0.9%) for Rudraprayag, 0.3% (0.2%) for Tehri Garhwal, and 5.9% (4.2%) for Uttarkashi stations in Garhwal region with respect to the MAE (RMSE) of SVR–GWO model. This analysis also confirms the superiority of the SVR–GWO model over the SVR–SHO model in both study regions.

The findings of the current research were compared with the other studies, conducted on meteorological drought (MD) prediction by employing the numerous hybrid meta-heuristic models (Minimax Probability Machine Regression: MPMR, Ensemble-ANFIS: E-ANFIS, M5T, Weighted Wavelet_Fuzzy_SVR: WW_F_SVR, Wavelet_Boosting_SVR: W_B_SVR, Multi-Input_Wavelet_Fuzzy_SVR: MI_W_F_SVR, and ANFIS coupled with Butterfly Optimization Algorithm: BOA, Ant Colony Optimizer: ACO, Genetic Algorithm: GA, and Particle Swarm Optimization: PSO) formulated based on several standardized drought indices are summarized in Table 5.

Table 5 Summary of application of meta-heuristic models in meteorological drought prediction over the globe in the last two decades

Overall, the results suggested that the applied methods, especially the SVR–GWO model, for regional drought prediction and proactive development strategies to combat drought disasters in the study region. Moreover, the forecasting of drought is a complex process, and endeavoring these forecasts blind is not advisable. The rewards of diverse methods depend on the basic properties or skills of models/ algorithms and the data series. So, it may be more practical and conclusive to select a suitable approach rendering to the characteristics of the data in the different study regions.

5 Conclusion

In the current research, the efficacy of two-hybrid methods; support vector regression with Grey Wolf Optimizer, and Spotted Hyena Optimizer was assessed to predict the meteorological drought by consuming EDI in Kumaon, and Garhwal regions of Uttarakhand State, India. The following conclusions were obtained from their comparison:

  1. i.

    The hybrid SVR–GWO model outperformed the SVR–SHO for meteorological drought prediction at all stations in both study regions, according to various statistical indicators (i.e., MAE, RMSE, NSE, COC, and WI), and through graphical scrutiny (i.e., time variation graph, and scatter-plot).

  2. ii.

    Comparison of two-hybrid SVR models at six stations of the Kumaon region and seven stations of the Garhwal region demonstrated that the SVR–GWO model produced the best prediction of EDI at Pithoragarh and Uttarkashi stations.

  3. iii.

    The obtained results from this research would support the hydrologists, agriculturalists, and water resources managers to formulate the drought mitigation strategy for sustainable development and management of water resources in the study region.