Introduction

Watershed prioritization is crucial for developing watershed management planning and better land management. It is essential to determine the watershed priority in the semi-arid and arid regions as it helps manage water in ungauged rivers. Spatial prioritization and watershed health help to understand watershed conditions, and it also helps derive better management strategies in the data-scarce region (Alilou et al. 2019).

Remote sensing (RS) and geographical information systems (GIS) are promising tools for extracting crucial parametrical information from watersheds. The RS and GIS technique helps establish interrelationship with parameters and decide the priority of watershed using the WSA technique (Malik et al. 2019). Many researchers have attempted to prioritize watersheds (Thakkar and Dhiman 2007; Patel et al. 2012, 2015; Samal et al. 2015; Kadam et al. 2017; Memon et al. 2020; Kulimushi et al. 2021a; Sengupta et al. 2021) and utilized different analysis techniques for watershed prioritization such as multicriteria decision analysis (MCDA) (Chowdary et al. 2013; Jaiswal et al. 2015; Samal et al. 2015; Vulević and Dragović 2017), Sediment Yield Index (Khan et al. 2001), Weighted Sum Analysis (WSA) (Aher et al. 2014; Memon et al. 2020), and Principal Component Analysis (PCA) (Farhan et al. 2017; Meshram and Sharma 2017). (Patel et al. 2022) has utilized AHP- and TOPSIS-based subwatershed prioritization to facilitate soil erosion susceptibility analysis of the Ami river basin. Furthermore, (Yadav et al. 2018) has prioritized subwatersheds for potential groundwater zone using topographic sheet and CARTOSAT data to ameliorate the flash flood and droughts, and (Kulimushi et al. 2021b) has utilized compound parameter index (Ci) and Erosion Hazard Rate Index (EHRI) derived from Analytical Hierarchy Process (AHP) for watershed priority assessment. Out of all prioritization, the WSA technique is the most familiar and prominent method for prioritizing the watersheds in the present era; however, the recent development of ML techniques helps to classify, predict, and forecast the laboratory and computer-simulated data for decision-making. Artificial Neural Network (ANN), support vector machine (SVM), and Random Forest are ML models applied in various engineering applications.

ML is a probabilistic model frequently applied for pattern recognition applications. It has been used for applications like fault diagnosis (Kankar et al. 2011), motor current signature analysis (Singh et al. 2014), compressive strength prediction (Sonebi et al. 2016), EEG (Upadhyay et al. 2016), surface roughness prediction (Patel et al. 2020), fault severity analysis (Akhenia et al. 2021), tool wear rate prediction (Shah et al. 2022), and many more applications, regardless of field.

For evaluating any type of ML model and for knowing how well the model predicts without biasedness, cross-validation is used. In k-fold cross-validation, the data is partitioned randomly into the k number of approximately equal sets, from which the model is trained on k-1 sets and tested on the remaining set. The procedure repeated k number of times, and the final result is the average of all the results obtained by repetition. The main benefit of applying this strategy is that each sample is used for training k-1 times and one time for testing, and the average results are obtained. The regression model is used to develop a mathematical function based on the experimental data in which parameters act as features. It is observed that in a feature vector, prediction capability can be improved with identifications of relevant and irrelevant features. To discard irrelevant features, a feature selection strategy can be used to determine the utility of individual features in a feature vector (Vakharia et al. 2015a, b). Other feature ranking techniques like mutual information (Dave et al. 2020), information gain (Naseriparsa et al. 2014), and ReliefF (Kononenko 1994) are also used to improve the accuracy and prediction of classification and regression problems.

With the availability of data, artificial intelligence techniques have been applied in a variety of civil engineering applications. According to literature research, it is found that machine learning strategies are extremely helpful in predicting and validating the experimental results. Furthermore, for decision-making, a particular algorithm may not be suitable; therefore, in such situations, detailed investigations are needed to assess the utility of ML algorithms for particular applications (Vakharia et al. 2022).

In the present paper, RS and GIS techniques are utilized to derive morphometric parameters and topo-hydrological parameters. Initially, the values and ranks to all 14 morphometric and 3 topo-hydrological parameters are assigned, and the CF values are calculated using well-known WSA techniques. After that, two feature ranking techniques, ReliefF and Pearson correlation coefficient, are applied to determine the weightage of morphometric and topological parameters. The weightage obtained from WSA, ReliefF, and PCC will be used to calculate the CF values. Two well-known ML models, SVM and MLP, with Tenfold cross-validations, are applied to identify the best CF values. The accuracy of ML models is examined with mean absolute error (MAE), correlation coefficient (Cr), and root mean square error (RMSE). Based on better prediction capability, decision-making is done for suitable watershed prioritization. As per the literature survey, most authors applied various techniques like parameter averaging, WSA, AHP, MCDM, and TOPSIS. To the authors’ best knowledge, decision-making for watershed prioritization using feature ranking and ML techniques is not explored till now. Therefore, to fill the gap, a novel approach for identifying relevant morphometric parameters and consequently the CF values for watersheds prioritization for the decision-making is executed in the present research. The methodology proposed will be helpful in providing an integrated solution to the decision-maker to restore the watersheds against any uncertain critical conditions. Figure 1 shows the watershed prioritization methodology based on feature ranking and ML.

Fig. 1
figure 1

Proposed methodology for watershed prioritization

Prediction using machine learning

In recent years, much attention has been focused on machine learning. One of the key causes that are driving its significance is the fact that machine learning provides a unified framework for introducing intelligent decision-making into many research areas. This is one of the primary driving forces behind its popularity in the industry as well as academics. Machine learning models can learn patterns in data whether the learning is supervised or unsupervised. The model is developed based on input data, which will be further used to predict unseen data.

Support vector machine

Support vector machine, more commonly referred to as SVMs, is an intelligent computational model developed for performing classification and regression analysis. In regression analysis, underlying relationships between the dependent variable Y (CF) and the independent variable X (morphological parameters) are determined. The relationship is generally mapped by using a kernel function, which described how the estimated Y deviates from the calculated values. Let the training data be represented as

$$\tau =\left\{\left({X}_{1},{Y}_{1}\right),\left({X}_{2},{Y}_{2}\right),\dots \dots ,\left({X}_{n},{Y}_{N}\right)\right\}\in \chi \times \mathfrak{R}$$
(1)

where χ represent the space for input patterns. If the kernel function is assumed to be linear, it can be expressed as

$$h\left(x\right)=\langle W,X\rangle +b$$
(2)

SVM is formulated as a convex optimization problem as

$$\mathrm{min }\frac{1}{2}{\Vert W\Vert }^{2}$$
(3)
$$\mathrm{s}.\mathrm{ t}.{Y}_{i}-\left(\langle W,{X}_{i}\rangle +b\right)\le \varepsilon ,\forall i$$
(4)
$$\left(\langle W,{X}_{i}+b\rangle -Yi\le \varepsilon ,\forall i\right)$$
(5)

Here, w represents the hyperplane orientation, and b is a scalar quantity known as intercept. When margin is considered Eqs. 3, 4, and 5 can be further modified as (Vapnik 1995)

$$\mathrm{min }\frac{1}{2}{\Vert W\Vert }^{2}+C\sum_{i=1}^{N}\left({\xi }_{i}+{\xi }_{i}^{*}\right)$$
(6)
$$\mathrm{s}.\mathrm{ t}.{Y}_{i}-\left(\langle W,{X}_{i}\rangle +b\right)\le \varepsilon +{\xi }_{i},\forall i$$
(7)
$$\begin{array}{c}\left(\langle W,{X}_{i}\rangle +b\right)-{Y}_{i}\le \varepsilon +{\xi }_{i}^{*},\forall i\\ {\xi }_{i},{\xi }_{i}^{*}\ge 0\end{array}$$
(8)

ξi,ξi* represents the slack variables which are introduced to deal with infeasible constraints. In present study, the penalty parameter “C” is chosen as 10 and radial basis kernel function is used to effectively map the nonlinearity in morphological parameters (Fig. 2a).

Fig. 2
figure 2

Prediction using a support vector machine and b multilayer perceptron

Multilayer perceptron (MLP)

Multilayer perceptron (MLP) is another type of ANN architecture which consists of several layers and functions as a global approximator for nonlinear input–output mapping. MLPs are composed of neurons called perceptron. In our model, morphological features which is represented as {g = g1, g2, g3, , gn) become an input to perceptron. The input morphological features are passed through a function u which computes the weighted sum of input features as follows:

The output of the perceptron is calculated by an activation function f, which is represented as

$$Y=f\left(u\left(X\right)\right)=\left\{\begin{array}{c}1, if u\left(X\right)>\theta \\ 0, otherwise,\end{array}\right.$$
(9)

The equation w1g1 + w2g2 + ⋯ + wngn − θ = 0 represents the equation of hyperplane. Output = 1 reflects that input morphological parameters lies above the hyperplane, and Output = 0 reflects that input morphological parameters lie below the hyperplane. Hence for output, it gives a set of continuous values and mean square error as a loss function. (Fig. 2b) shows the architecture of MLP with one hidden layer. In the present study, authors have used only one hidden layer based on prior experience.

Parameter estimations through GIS and remote sensing

To determine the CF for any catchment area of the River, a Rel-River catchment situated in Banaskantha district of Gujarat, India, is considered in the present study as shown in (Fig. 3a). Basin lies between 24° 50′ N to 24° 75′ N latitude and 72° 00′ E to 72° 45′ E longitude and has an area of 431 km2 (Memon et al. 2020; Darji et al. 2021). The basin area is partitioned into 51 microwatersheds, and Cartosat DEM 10 m resolution is used to calculate slope and watershed boundary. ArcHydro toolset is used for calculating and deriving the boundary of the watershed. Drainage boundaries are derived using Survey of India (SOI) toposheets (42D02, 42D03, 42D06), and the ordering of the drainages is given according to the proposed technique by (Strahler 1964) (Fig. 3b). ArcGIS 10.5 software was used to generate the boundary of watersheds, digitize drainages, and derive different morphometric parameters. Morphometric parameters such as linear aspects, areal aspects, and relief aspects (Horton 1932, 1945; Miller 1953; Schumm 1956; Melton 1957; Faniran 1968; Strahler 1997; Nooka Ratnam et al. 2005) and topo-hydrological parameters such as Stream Power Index (SPI) (Whipple and Tucker 1999), Sediment Transport Index (STI) (Moore and Burch 1986), and Topographic Wetness Index (TWI) (Beven and Kirkby 1979) are considered for the prioritization of watersheds (Table 1). The ranking of the watersheds is calculated based on the WSA technique established by (Aher et al. 2014). Every microwatershed has different and unique characteristics, and watershed priority is carried out to identify the vulnerable areas for erosion. The weights have been calculated based on the correlation between parameters and their values. Its relation does a ranking of the parameters to soil erosion. Based on the literature, the linear factor showed a positive correlation with soil erosion, so maximum priority is given to the higher value, i.e., rank 1. In contrast, shape factors negatively correlated with soil erosion, and the ranking for these parameters was given in reverse order.

Fig. 3
figure 3

Study area. a Rel river basin map. b Rel river drainage map

Table 1 Morphometric and topo-hydrological parameters considered in study

The final ranking and prioritization of watersheds have been done using CF values. The CF values are calculated with the help of morphological parameters weightage obtained from WSA, ReliefF, and Pearson correlation coefficient methods.

WSA techniques

The WSA is a statistical approach applied to calculate CF values. The CF values are estimated using cross-correction analysis by morphometric parameters and the assigned weightage. The statistical expression for CF is written as follows (Aher et al. 2014):

$$CF (WSA) = RMP \times \boldsymbol{ }WMP$$
(10)

where CF (WSA) represents compound factor, RMP represents the rank (preliminary priority) estimated from morphometric and topo-hydrological parameters, and WMP represents the weight of morphometric and topo-hydrological parameters obtained using a cross-correlation study (Table 2, 3, and 4). Weights are estimated with a ratio of morphometric and topo-hydrological parameters with a sum of the correlation coefficient value of each parameter (Table 5). Based on the weightage of parameters, a model is constructed for sorting watershed prioritization, which is computed as follows:

Table 2 Stream order of different watersheds
Table 3 Parameter values
Table 4 Preliminary priority ranking of linear, areal, and shape parameters
Table 5 Weight assigned by WSA techniques
$$\begin{array}{c}\mathrm{CF }(\mathrm{WSA})= (0.055447) \times {R}_{b} + (0.087678) \times {D}_{d} + (0.114486)\\ \times {F}_{u} + (0.111822) \times {R}_{t} + (-0.08424) \times {L}_{o}\\ \begin{array}{c}+ (0.025944) \times {R}_{f} + (-0.01748) \times {B}_{s} + (0.028237)\\ \times {R}_{e} + (-0.00423) \times {C}_{c} +(0.112984) \times {I}_{f}\\ \begin{array}{c}+(0.007042) \times {R}_{c} + (0.089087) \times C + (0.047331)\\ \times {R}_{h} + (0.115931) \times {R}_{n} + (0.090761) \times \mathrm{ STI}\\ + (0.105501) \times \mathrm{ SPI }+(0.113691) \times \mathrm{ TWI}\end{array}\end{array}\end{array}$$
(11)

As observed in Table 7 and Eq. 11, the highest CF value is 51.13, and it is for watershed 49, the second-highest value is 47.48, and it is for watershed 48, and so on for other watersheds, as shown in (Fig. 4a).

Fig. 4
figure 4

Watershed prioritized maps using a CF (WSA), b CF (RF), and c CF (PCC)

ReliefF

ReliefF is a feature ranking method for selecting important features for classification and regression problems. In this method, predictors that give different values for the same response values are penalized, and predictors that give different values for different response values are rewarded. Final predictor weights are computed using intermediate weights.

$${P}_{w}= \frac{{W}_{Rp}}{{W}_{R}}-\frac{{p}_{R}-{W}_{Rp}}{n-{W}_{R}}$$
(12)

where n is several instances, WRP is weighted with different response values and different values for predictor, WR is weighted with different values for response R, and PR is weights with different values for predictor p (Robnik-Šikonja et al. 1997).

The ReliefF method is used to assign the ranks to the morphometric parameters. If ranked first with the weightage of 0.058, whereas Fu is ranked second with the weightage of 0.056. Other subsequent values, ranks, and weights are mentioned in Table 6. The final equation of compound factor obtained using the ReliefF method is as follows:

Table 6 Weightage of parameters based on WSA, ReliefF, and Pearson correlation coefficient
$$\begin{array}{c}\mathrm{CF }\left(\mathrm{RF}\right)= \left(0.00532\right)\times {R}_{b} + \left(0.02866\right)\times {D}_{d} + \left(0.05694\right)\times {F}_{u}\\ + \left(0.04566\right)\times {R}_{t} + \left(0.02452\right)\times {L}_{o} + \left(-0.03597\right)\\ \begin{array}{c}\times {R}_{f}+ \left(-0.03246\right)\times {B}_{s} + \left(-0.03282\right)\times {R}_{e}\\ + \left(-0.0182\right)\times {C}_{c} +\left(0.05803\right)\times {I}_{f} +\left(-0.0198\right)\times {R}_{c}\\ \begin{array}{c}+ \left( 0.02878\right)\times C + \left(0.01382\right)\times {R}_{h} + \left(0.05113\right)\times {R}_{n}\\ + \left(0.02103\right)\times \mathrm{ STI }+ \left(0.02622\right)\times \mathrm{ SPI }+\left(0.03788\right)\\ \times \mathrm{ TWI}\end{array}\end{array}\end{array}$$
(13)

As observed in Table 7 and Eq. 13, the highest CF value is 14.37, corresponding to watershed 51, and the second-highest value is 12.78, corresponding to watershed 50; likewise, other subsequent CF values are calculated, which are shown in Table 7 and Fig. 4b.

Table 7 Final priority ranking based on CF (WSA), CF (RF), and CF (CC)

Pearson correlation coefficient (PCC)

The Pearson correlation coefficient (Cr) is generally used to measure and rank the relationship between the calculated and predicted values. A value near + 1 indicates a perfect correlation from the model output, whereas the value of − 1 indicates negative correlations. If a value is closer to 0, it means that there is no direct correlation between the calculated and predicted parameters (Gibbons 2014).

$$\overline{{X }_{p}}=\sum_{i=1}^{m}\frac{{X}_{\left(p,i\right)}}{m};\overline{{Y }_{q}}=\sum_{j=1}^{m}\frac{{Y}_{\left(q,j\right)}}{m}$$
(14)
$${\mathrm{PCC} }_{\left(p,q\right)}=\frac{\sum_{i=1}^{m}\left({X}_{\left(p,i\right)}-\overline{{X }_{p}}\right)\left({Y}_{\left(q,i\right)}-\overline{{Y }_{q}}\right)}{\sqrt{\sum_{i=1}^{m}{\left({X}_{\left(p,i\right)}-\overline{{X }_{p}}\right)}^{2}\sum_{j=1}^{m}{\left({Y}_{\left(q,j\right)}-\overline{{Y }_{q}}\right)}^{2}}}$$
(15)

Here, \({\mathrm{PCC}}_{\left(p,q\right)}\) gives the value of the Pearson linear correlation coefficient, m is the length of parameters, and \(\overline{{X }_{p}}\) and \(\overline{{Y }_{q}}\) give the values of a mean of each parameter.

The Pearson correlation coefficient method was used to assign the ranks to the morphometric parameters. If is assigned rank 1 with a weightage of 0.93, whereas Fu is assigned rank 2 with a weightage of 0.9197. Other subsequent values, ranks, and weights are cited in Table 6. The equation of compound factor based on the weightage of correlation coefficient ranked features is as follows:

$$\begin{array}{c}\mathrm{CF }(\mathrm{PCC}) = \left(0.3689\right)\times {R}_{b} + \left(0.8069\right)\times {D}_{d} + \left(0.9197\right)\times {F}_{u}\\ + \left(0.8659\right)\times {R}_{t} + \left(-0.7855\right)\times {L}_{o} + \left(0.1328\right)\times {R}_{f}\\ \begin{array}{c}+ \left(-0.071\right)\times {B}_{s} + \left(0.1487\right)\times {R}_{e} + \left(-0.0267\right)\times {C}_{c}\\ +\left(0.9373\right)\times {I}_{f} +\left(0.0484\right)\times {R}_{c} + \left( 0.8161\right)\times C\\ \begin{array}{c}+ \left(0.2701\right)\times {R}_{h} + \left(0.9104\right)\times {R}_{n} + \left(0.7203\right)\times \mathrm{ STI}\\ + \left(0.7744\right)\times \mathrm{ SPI }+\left(0.8488\right)\times \mathrm{ TWI}\end{array}\end{array}\end{array}$$
(16)

As observed in Table 7 and Eq. 16, the highest CF value is 405.67, and it is for watershed 51, the second-highest value is 375.3, and it is for watershed 49. The watershed map constructed is shown in Fig. 4c.

Results and discussion

The present paper calculates and derives the basic morphometric parameters using RS and GIS techniques. Various parameters are mentioned in Tables 2 and 3, and the ranks are assigned based on the relation of parameters with soil erodibility, which is shown in Table 4. In the present study, the prioritization of the watershed was decided based on CF values calculated through WSA, ReliefF, and the Pearson correlation coefficient method. Validation and comparison of CF values have been done using MLP and SVM models.

Watershed priority based on CF values

The watershed CF values are calculated using WSA, ReliefF, and the correlation coefficient method. For prioritization, the lowest value of CF is given the priority rank 1, the next lower value is given the priority rank of 2, and so on for all the 51 microwatersheds; CF values are calculated through the three methods mentioned above. Furthermore, the prioritized watersheds are categorized into five categories, i.e., very high, high, moderate, low, and very low. Finally, the categories of watershed maps are utilized for the decision-making system.

As observed in Table 7, the lowest CF values based on WSA are 1.34, hence, it is assigned rank 1, and the same ranking is assigned, which will be useful for prioritizing the watersheds (Table 7). The CF values calculated and the prioritized watersheds based on WSA, ReliefF, and Pearson correlation coefficient are shown. For CF values obtained through ReliefF, all the 51 microwatersheds of the Rel River catchment are classified into five priority categories such as (i) very high (− 1.355 to ≤ 1.789), (ii) high (1.789 to ≤ 4.933), (iii) medium (4.933 to ≤ 8.078), (iv) low (8.078 to ≤ 11.223), and (v) very low (> 11.223) as given in Table 8. It is observed through Table 8 that the 8 microwatersheds belong to very high category (WS no. 1, 3, 4, 5, 7, 8, 9, and 11), 8 microwatersheds under a high category (WS no. 2, 6, 10, 12, 14, 15, 17, and 19), 14 microwatersheds under medium (WS no. 16, 18, 20, 21, 23, 26, 30, 34, 35, 39, 41, 42, 44, and 45), 12 microwatersheds under a low category (WS no. 22, 24, 25, 27, 31, 32, 33, 38, 40, 46, 47, and 51), and 9 microwatershed under the very low category (WS no. 13, 28, 29, 36, 37, 43, 48, 49, and 50). The CF values obtained from Eqs. 11, 13, and 16 watershed are constructed, shown in Fig. 4a–c.

Table 8 Priority category based on CF (RF) values

ML model predictions

Two ML algorithms, MLP and SVM, have been utilized to predict the CF values for the prioritization of watersheds. The training and Tenfold cross-validation are performed on all the 51 microwatersheds, and the derived parameters are normalized in the range [− 1 to + 1] to minimize the biasing error. Results of the prediction are shown in Fig. 5a–f. The CF (WSA) values predicted through SVM and MLP are shown in Fig. 5a, b. It is observed that SVM accurately predicts the CF values with a very less average error of 0.04 whereas there is a slight deviation in prediction results with MLP with an average error of 0.90 for all the watershed values considered. Similarly when CF (RF) is predicted, then, an average error of 0.02 and 0.28 is observed from SVM and MLP, respectively, as shown in Fig. 5c, d. Further, the average error of 0.36 and 7.65 is observed when CF (PCC) is predicted, as shown in Fig. 5e, f. The results can infer that the best CF values for watershed prioritization should be obtained from ReliefF Eq. 13 as there are very few mean errors observed from both SVM and MLP. Moreover, to evaluate the suitability of ML algorithms, three performance metrics: mean absolute error, correlation coefficient, and root mean square error, are calculated, and the results are discussed below.

Fig. 5
figure 5

a–f Tenfold cross validation prediction results using SVM and ANN

Correlation coefficient

The correlation coefficient (R) is calculated to find the relationship between the dependent and independent variables. When R is observed as + 1, it indicates a perfect correlation between the two variables, while − 1 indicates a negative correlation between two variables. The \(\eta\) shows the total number of observations, \(\sum a\) shows the mean value of all the first variables in the data set, and \(\sum b\) represents the mean value of all the second variables in the data set. It is mathematically calculated by

$$R= \frac{\eta \sum ab-\sum a\sum b}{\sqrt{\eta \sum {a}^{2}}- \sum {a}^{2}[\eta \sum {b}^{2}-{\left(\sum b\right)}^{2}}$$
(17)

Mean absolute error

Mean absolute error (MAE) has been used to analyze the performance of the ML model. The difference between the predicted value and the actual value is calculated by

$$MAE= \frac{1}{n} \sum_{i=1}^{n}\left|{\mathrm{\boldsymbol{\odot}}}_{i}-\mathrm{\boldsymbol{\odot}}\right|$$
(18)

Here, ◉i shows the predicted value by the machine learning model, and ◉ shows the actual value from experiments.

Root mean square error

Root mean square error (RMSE) represents the standard deviation of the residuals. RMSE is commonly used in hydrology, forecasting, and regression analysis to verify experimental results.

$$RMSE= \frac{1}{n} \sqrt{\sum_{i=1}^{n}{\left({\mathrm{\boldsymbol{\odot}}}_{i}-\mathrm{\boldsymbol{\odot}}\right)}^{2}}$$
(19)

where i is variable, n represents data points, ◉i represent actual observations, and ◉ represents predicted observation.

Tables 9 and 10 show the results of SVM Tenfold and MLP Tenfold for CF (WSA), CF (RF), and CF (CC) values. The prediction of ML models is compared with three performance metrics correlation coefficient, MAE, and RMSE values. Initially, the performance of the ML model is evaluated through a correlation coefficient. It is observed that there is no significant deviation in performance metrics from both the SVM and MLP models. The correlation coefficient was observed as 1, which signifies a very good correlation between the calculated and predicted values.

Table 9 Statistical evaluation of SVM Tenfold model

Figure 6a, b shows the SVM Tenfold and MLP Tenfold results to compare the prediction of CF values using MAE and RMSE. From Fig. 6, it is clear that the minimum MAE is 0.0236 for CF (RF) using the SVM Tenfold model compared to the minimum MAE of 0.2856 for CF (RF) using the MLP Tenfold model. Furthermore, the minimum RMSE is 0.032 for CF (RF) using the SVM Tenfold model compared to the minimum RMSE of 0.440 for CF (RF) using the MLP Tenfold model. The maximum MAE is 0.3684 for CF (CC) using an SVM Tenfold compared to the maximum MAE of 7.6514 for CF (CC) using MLP Tenfold. Also, the maximum RMSE observed is 0.4361 for CF (CC) using SVM Tenfold, compared to the maximum RMSE of 12.26 for CF (RF) using the MLP Tenfold model. Results show that the SVM Tenfold gives better prediction capability as compared to MLP Tenfold for the prediction of CF values with CF (RF), followed by CF (WSA) and CF (PCC), respectively.

Fig. 6
figure 6

Tenfold cross validation prediction results using a SVM and b ANN

Figure 7a–c shows the variation in MAE and RMSE for the CF (RF), CF (WSA), and CF (PCC) predicted using the SVM Tenfold and MLP Tenfold models. The minimum MAE observed is 0.0236 with CF (RF), whereas the minimum MAE is 0.3684 for CF (PCC) from SVM Tenfold. The maximum MAE is 0.2856 with CF (RF) compared to 7.6514 with CF (CC) from MLP Tenfold. Furthermore, the minimum RMSE for predicting CF (RF) is 0.032 compared to 0.4361 in CF (CC) using SVM Tenfold. The maximum RMSE for predicting CF (RF) is 0.4402 compared to 12.2689 in CF (CC) using MLP Tenfold. It is observed that CF values calculated from ReliefF exhibit a higher correlation coefficient as well as very low MAE and RMSE values with both SVM and ANN models (Tables 9 and 10); therefore, CF values calculated from ReliefF are used for constructing watershed prioritization. The final watershed priority category map of 51 microwatersheds is shown in Fig. 8. It is observed that the percentage area of microwatersheds under the very high category is 13.45%, for a high category, it is 15.72%, for medium category is 30.24%, low category is 18.81%, and for very low category, it is 21.78%. This information is beneficial in implementing water management strategies regarding soil and water conservation measures. The result also shows its vulnerability to erosion and runoff potential. This watershed is mainly situated in the upstream part of the catchment or basin and high slopes and elongated basins.

Fig. 7
figure 7

MAE and RMSE variations with SVM and MLP for a CF (RF) and b CF (WSA) and CF (CC)

Table 10 Statistical evaluation of MLP Tenfold model
Fig. 8
figure 8

Watershed priority map based on CF (RF) values

Conclusion

In the methodology proposed, the utility of feature ranking is initially explored in detail for estimating the weightage of various morphological and topological parameters. Afterward, the CF values for various watersheds are calculated from WSA, ReliefF, and Pearson correlation coefficient, obtained from the assigned weightage of various morphological and topological parameters. To determine the best CF values, ML algorithms are explored in detail. Finally, the performance of models is estimated after comparing the three parameters, i.e., mean absolute error, correlation coefficient, and root mean square error. The findings are mentioned below:

  1. 1.

    The SVM model gives better results for predicting various watersheds as compared to MLP model.

  2. 2.

    The comparison of MAE and RMSE for predicting CF (RF), CF (WSA), and CF (CC) reveals that CF (RF) is the best model for the prediction of CF values as compared to other calculations; hence, it is utilized for prioritization of watersheds.

  3. 3.

    It is suggested that watersheds 1, 3, 4, 5, 7, 8, 9, and 11 have a very high vulnerability to soil erosion, followed by the rest of the watersheds in the study region. Hence, appropriate soil and water conservation measures should be adopted to protect against degradation.

Based on a study conducted, the integrated framework of RS, GIS, feature ranking, and machine learning seems to be an efficient water resource management technique for watershed ranking and prioritization. Tenfold cross-validation and feature ranking is a novel approach for accurately predicting CF for watershed ranking and prioritization. The proposed methodology should be extended to investigate the influence of more RS and GIS techniques as well as exploration of more morphometric and topological parameters and analyze its effect on constructing various watersheds for soil and water conservation. It should be noted that the performance of ML models for prediction is dependent on the calculated or extracted parameters.