1 Introduction

Rivers originated in Himalayan and Tibetan Plateau could supply almost 25% sediment load (Raymo and Ruddiman 1992). The lower parts of Himalayan sub-basin are now facing soil loss problems (Jain et al. 2001). The Shivalik formation (lower Himalaya), which is the source of Ganges river system, has weak geological combinations and, thus, susceptible to degradation environment and soil health. Therefore, it is vital to understand soil erosion, which will help to control erosion and ecological restoration. Soil is a natural resource, and natural and anthropogenic processes detach top surface of soil and aggravate soil erosion (Parveen et al. 2012). The natural resources and agricultural production are severely affected by the rapid action of soil erosion. The topography, low rainfall or prolonged dry periods, inept land use/land cover (Kumar et al. 2018a; Kumar et al. 2018; Bertalan et al. 2018), and natural calamities are different factors that determine the rate and process of erosion in a basin (Gitas et al. 2009). Kosmas et al. (1997) elaborated that basic properties of soil layer (upper soil thickness, silt, and organic matter percentage) are also responsible for severe erosion. Li and Fang (2016) explained that climate change directly alters trend and pattern of rainfall, and consequently due to intense rainfall, rate of runoff increases and accelerates soil erosion.

Remote Sensing and Geographic Information System (GIS) have been implemented in various available erosion models that predict soil loss (Borrelli et al. 2017; Uddin et al. 2016; Panagos et al. 2015a; Brady and Weil 2008). Currently, applications of remote sensing and GIS are a most appealing tool for natural resource management, complex hydrological problems and others (Rawat et al. 2020, 2019; Phinzi et al. 2020; Schlosser et al. 2020; Rawat et al. 2020; Barman et al. 2021; Singh and Singh 2018; Maliqi and Singh 2019; Bertalan et al. 2019; Choudhari et al. 2018; Abriha et al. 2018; Murmu et al. 2019; Kumar et al. 2017, 2018b; Thakur et al. 2016; Yadav et al. 2014). The uncertainty in digital elevation models (DEMs) may influence the results of hydrological investigations (Mondal et al. 2017). Many researchers have evaluated the accuracy of different DEMs (Szabó et al. 2015; Degetto et al. 2015; Rawat et al. 2019).

The Universal Soil Loss Equation (USLE; Wischmeier and Smith 1978), Chemicals, Runoff and Erosion from Agricultural Management Systems (CREAMS) (Knisel et al. 1980), Agricultural Non-Point Source Pollution (AGNPS) (Young et al. 1989), Revised Universal Soil Loss Equation (RUSLE) (Renard et al. 1991), and Modified Universal Soil Loss Equation (MUSLE) (Williams et al. 1975) are empirical models employed for soil erosion estimation. However, Morgan-Morgan and Finney (MMF) (Morgan et al. 1984), European Soil Erosion Model (EUROSEM) (Morgan et al. 1992), Griffith University Erosion System Template (GUEST) (Ciesiolka et al. 1995), Limburg Soil Erosion Model (LISEM) (De et al. 1998), and Water Erosion Prediction Project (WEPP) (Laen et al. 1991) are the physical process-based models that are also used to estimate soil loss.

Among these models, empirical model RUSLE is found to be useful for estimation of soil erosion rate (Borrelli et al. 2017; Thomas et al. 2018; Terranova et al. 2009; Lu et al. 2004; Nyakatawa et al. 2001; Biesemans et al. 2000). The output from RUSLE was found to be more close to observation (Mondal et al. 2017) and showed lower variability from observation using the fine resolution DEMs. Since the assessment of soil erosion by RUSLE is quite efficient from the prioritization aspect, it becomes more helpful for policymakers to understand the erosion prone area and remedial measures (Wijesundara et al. 2018). The coupling of RUSLE and GIS interface has several advantages and has shown good results to model soil loss (Perovic et al. 2013). Therefore, RUSLE has been successfully applied worldwide to asses soil erosion (Panagos et al. 2019; Koirala et al. 2019; Borrelli et al. 2017; Panagos et al. 2015a, b; Terranova et al. 2009; Lu et al. 2004; Uddin et al. 2016; Waikar et al. 2014), and a limited research has been conducted in the regions, which have complex topography (e.g., river system of Himalayan region).

The major goal of the study was to assess the spatial distribution of soil loss in a humid subtropical trans-boundary river basin. The objectives to fulfill the goal were (i) to assess soil loss by RUSLE model using DEMs of different resolutions; (ii) to point out the relevance of water retention of soil types in erosion process; (iii) to prioritize erosion zones in the Ghaghara river basin for conservation measures; and (iv) to determine efficiency of hypsometric analysis for recognition of basin’s age due to subsequent soil erosion in the basin. With these objectives, an integrated framework was developed for estimation of soil loss.

2 Materials and methods

2.1 Study area

The Ghaghara river basin is a trans-boundary river also known as Karnali in Nepal (Fig. 1). It originates near Lake Mansarovar (30.60° N, 81.48° E) with a catchment area of 127,950 km2. It meets with its tributary Sarda at Brahmaghat in India from where it is known as the Ghaghara river. It joins to the river Ganga at Doriganj situated downstream of Chhapra town, Bihar. The other important tributaries are Sarju, Rapti, and Little Gandak. There are two streams, Seti River and Bheri River, which drain into the Ghaghara river basin in Nepal. The river basin is heterogeneous from source to mouth and has the longest distance river in Nepal (~ 507 km). The dominant land use/land cover (LULC) type is cropland followed by mixed forest and grassland. The quaternary age sediments are dominant in the basin as Pleistocene age has older alluvium (yellow to brown color), and Holocene age has newer alluvium (gray to black color). The soil types are mostly clay loam, loam, and glacier type. The geomorphology of the Ghaghara river basin contains structural origin, which is high, moderate, and low dissected hills and valleys. In alluvial plain, the Ghaghara river basin shows a meandering pattern with several oxbow lakes and it is a humid subtropical and receives an average annual rainfall of 1041 mm (annual). The temperature distribution is 47 °C (max) in summer to 2 °C (min) in winter. The higher elevation zone is occupied by northeast and northwest, while the lower elevation zone occupies their lower alluvium zone (South-east). It has a higher discharge than the Ganga before its confluence near Maharajganj, Chhapra district of Bihar. It is a unique river with respect to fluctuation of discharge (very high discharge during monsoon and very low discharge during dry season), high sediment load, and channel instability.

Fig. 1
figure 1

Location map of the Ghaghara river basin is showing maximum and minimum elevation with major streams originated near the Mansarovar lake

2.2 Methodology

The soil erosion from the Ghaghara river basin was calculated using digital elevation models (DEMs), soil, and LULC map. These datasets have been used as input into RUSLE model for estimation of soil loss. The DEM was also used for computation of the basin's morphometric parameters, whereas soil parameters were used to calculate soil water holding capacity. The rainfall data-sets were used for computation of rainfall erosivity. Besides these geomorphic stages of basin, the hypsometric integral and hypsometric curve were estimated using the CalHypso tool avialable in QGIS. The process of work can be seen in the methodology chart in Fig. 2.

Fig. 2
figure 2

Adopted methodology for assessment of soil loss in the Ghaghara river basin

2.2.1 Topographic data

DEMs from SRTM (90 m and 30 m), MERIT 90 m, and ALOS 30 m were downloaded from webportal (http://srtm.csi.cgiar.org/), (http://hydro.iis.u-tokyo.ac.jp/~yamadai/MERIT_DEM/) and (https://www.eorc.jaxa.jp/ALOS/en/aw3d30/data/index.htm), and the stream delineation process from these four DEMs was performed using D-8 algorithm present in Arc Hydro Tool of ArcGIS. DEM was also used to generate the slope length factor (LS-factor) map for the study.

2.2.2 Soil types and meteorological datasets

Soil map of the study region was collected from Food and Agriculture Organization (FAO) soil datasets of 1:50, 00,000 scale (FAO 1977). The classes extracted from soil data are clay_loam (Bd29-3c-3661), loam (Bd34-2bc-3663), loam (Bd35-1-2b-3664), loam (Be74-2a-3675), loam (Be84-2a-3685), loam (Bk39-2a-3694), loam (Bk40-2a-3695), loam (I-Bh-U-c-3717), loam (I-X-2c-3731), loam (Jc50-2a-3743), loam (Je75-2a-3759), clay_loam (Rd30-2b-3851), and glacier-6998 (“Appendix- Table 6”). The rainfall was used for calculating the rainfall-runoff erosivity factor (R) to estimate the soil loss by RUSLE model.

2.2.3 Land use/land cover (LULC)

The LULC map from MODIS (https://archive.usgs.gov/archive/) of 0.5 km resolution was collected and used for the computation of P-factor MODIS of year 2015 has 15 LULC types: water, evergreen needle and broad leaf forest, deciduous needle, and broadleaf forest, mixed forest, shrubland, savanna, grassland, wetlands-mixed, agricultural land, built-up, cropland/woodland mosaic, snow or ice, and barren or sparsely vegetated. The cropland has highest areal extent followed by mixed forest and grassland, while the deciduous needle leaf forest has lowest areal extent in the year 2015 (“Appendix-Table 7”).

2.2.4 Estimation of morphometric parameters

The entire region was delineated into 30 sub-basins for detailed and sub-basin level morphometric analysis. The basic, linear, and shape morphometric parameters (Table 1) were estimated using the stream order generated during the delineation of basin. The method used for the ordering of the streams was developed by Strahler order. The numbers of streams (N) of a different order, stream length (Lu), area (A), perimeter (P), basin length (Lb ) are the basic parameters. Strahler (1964) suggested the smallest and un-branched streams are the first-order stream and when two first-order streams confluence, generate second-order stream, and when two second-order streams join, they form third-order and so on. Following the rule suggested by Strahler (1964), when two different order streams join together, the higher order should be counted.

Table 1 Morphometric parameters, their mathematical expressions and references are presented in table, which are used for assessment of soil erosion in the study area

The total length of an individual stream in each order is the stream length (Lu) of that order (Horton 1945). The drainage area (A) is the total area where the fluvial generated stream or systems of streams are drained in the corresponding space, and this area provides more detailed information about the basin. The total runoff and sediment load can be easily understood and estimated using the drainage area (A) (Pradhan et al. 2018). Further, basin length (Lb) helps to calculate the other linear parameters like form factor, shape factor, and elongation ratio. Ratnam et al. (2005) method was used to calculate the basin length for the study area.

Stream length ratio (RL), bifurcation ratio (Rb), drainage density (Dd), stream frequency (Sf), and length of overland flow (Lo) are the linear parameters. These linear parameters were calculated using the basic parameters information. The RL is a dimensionless ratio, which is simply a ratio of stream length, and this method was proposed by Horton (1945). The Rb is the ratio of stream number, and formula used here to calculate the Rb is given by Schumm (1956). Horton (1945) described the Dd ratio of stream length, and area of a sub-basin and Sf as ratio of stream number and area a sub-basin. Horton (1945) explained the length of overland flow (Lo) as one half of the drainage density of the sub-basin.

Furthermore, the shape parameters are form factor (Ff), elongation ratio (Re), circulatory ratio (Rc), shape factor (Bs), compactness constant (Cc) which have been calculated. The definition of the (Ff) was proposed by Horton (1945), which is the ratio of area (A) to the square of the basin length. The elongation ratio was estimated using the formula given by Schumm (1956) and is the ratio of diameter of a circle having the same area as of the basin and the basin length.

The circulatory ratio given by Strahler (1964) is the ratio of basin area (A) to the square of basin perimeter for which the area is calculated. The shape factor was estimated following the Horton (1932) rule, and it is the ratio of square of basin length (Lb) to area (A) of that basin. The compactness constant is proposed by Horton (1945) and also expressed by Gravelius (1914) as ratio of basin perimeter (P) divided by the circumference of a circle which has the same basin area (A). The linear and shape parameters were used to estimate the compound factor and then prioritization of sub-basin.

2.2.5 Compound factor and prioritization of sub-basin

The linear and shape parameters of morphometry play an important role to prioritize the basin based on compound factor. The linear parameters are directly concerned with erosion activity, and this means that the higher values of a linear factor in a sub-basin explain the high degree of erosion for that area. However, the shape parameters have the opposite concern to the linear parameters. Therefore, the sub-basin which has a lower value of shape factor that indicates the higher erosion in area. Thus, for our study area rank 1 is given to the sub-basin, which has a higher value of the linear parameters and lower value of the shape parameters, and rank 30 is given to the sub-basin, which has the lowest linear and highest shape factor, and so on. The compound factor has been estimated by the addition of all rank of shape and linear parameters, and after this, we have divided the total sum by the total no. of parameters. Based on results of compound factor, prioritization of sub-basin was performed by giving the first rank to the sub-basin, which has a lower value of the compound factor, and similarly, the last rank is assigned to the sub-basin which has a higher value of compound factor (Maurya et al. 2016).

2.2.6 RUSLE model

RUSLE is an advanced form of the USLE developed by Wischmeier and Smith (1978). This model was used at many places to estimate the erosion of the upper surface soil due to rainfall and other factors such as soil and length-slope factor (Pradhan et al. 2018; Panagos et al. 2015a).

The equation 1 of the RUSLE is given as follows:

$$A=R\times K\times \mathrm{LS}\times C\times P$$
(1)

where A: annual sediment yield (t ha−1year−1); R: rainfall erosivity factor (MJmmha−1 h−1year−1); K: soil erodibility factor (thahha−1 MJ−1 mm−1); LS: slope length and steepness factor; C: cover management factor (C); and P: conservation practice factor. LS, C, and P are the dimensions less factor.

The R factor was estimated using the annual rainfall based on Yu and Rosewell (1996) method. The K-factor was calculated using the soil properties and sand, silt, clay, and organic carbon by Sharpley and Williams (1990) formula. The slope length (L) and steepness factor (S) together constitute the topographic factor which was calculated based on Desmet and Govers (1996) in SAGA GIS. The LS-factor was calculated based on DEMs of SRTM 30 m, ALOS 30 m, MERIT 90 m, and SRTM 90 m, respectively. The value of C-factor was assigned by choosing representative values of crop management factor from Tables 5, 10, and 11 given in Wischmeier and Smith (1978). Here, the value of P-factor was selected using LULC map and from the published sources (Liu et al. 2015; Jain et al. 2001; Renard et al. 1996). Finally, all RUSLE factors are multiplied in order to determine soil loss in the Ghaghara river basin. Total four soil erosion rate maps were prepared using the four different LS-factors, which were generated by four different DEMs and keeping the same R-factor, K-factor, C-factor, and P-factor. Further, these RUSLE results were evaluated using the observed data of soil loss in the basin by the Central Water Commission (CWC), New Delhi, India.

2.2.7 ROSETTA Model

The ROSETTA from the Hydrus model was used to calculate soil hydraulic properties for the Ghaghara river basin. The water retention curve was prepared using ROSETTA, and this curve describes the water holding capacity of soil under differential pressure head (Maurya et al. 2016). The soils' hydraulic parameters are essential to study water resources. Still, usually, these data are not available on the appropriate scale (Schaap et al. 2001). ROSETTA is commonly used software to achieve the hydraulic parameters of the soil from the sand, silt, and clay proportional values (Pradhran et al. 2018). The ROSETTA model applies the hierarchical pedotransfer functions (PTFs) and a neural network algorithm with bootstrapping (Schaap et al. 2001). The hierarchy in pedotransfer functions (PTFs) allows the ROSETTA model to predict Van Genuchten (1980) water retention curves. The formula (2) for the water retention curve is as follows:

$${\theta }_{h}={\theta }_{\mathrm{r}}(\mathrm{residual water content })+\frac{[{\theta }_{\mathrm{s }}-{\theta }_{\mathrm{r}} ]}{({1+{\left(\alpha \mathrm{h}\right)}^{\mathrm{n}})}^{\mathrm{m}}}$$
(2)

where \(\theta \left(h\right)\) is the water content (\({\mathrm{cm}}^{3}/{\mathrm{cm}}^{3}\)) at soil water pressure head h (cm), α is the scaling parameter, n is the curve shape factor, and\(\,m=1-1/n\);\({\uptheta }_{\mathrm{s }}:\mathrm{ saturated water content}\) \({\uptheta }_{\mathrm{r}}:\mathrm{residual water content}\)

There are 33 kPa and 1500 kPa pressure head (H) defined in the ROSSTEA model to prepare the water retention curve using the sand, silt, and clay value of soil. The 33 kPa indicates the upper limit of water availability by plants known as field capacity, and the 1500 kPa indicates the lower limit of water availability by plants known as a wilting point, respectively (Adhikary et al. 2008). We have estimated field capacity of soil at 33 kPa for all soil types in the study catchment.

2.2.8 Multi-Criteria Evaluation (MCE) and weighting assignment

Multi-criteria evaluation (MCE) is a 5 steps pair-wise comparison relation process as (i) pair-wise ranking, (ii) decision matrix, (iii) eigenvalue/vector, (iv) consistency ratio, and (v) priority of criteria. Gupta and Srivastava (2010) explain the MCE, which is AHP based on assigning the weighting of each factor, keeping the view that which parameters are most influencing soil erosion and which parameters have the least importance cause the soil erosion in the basin. In the MCE, which supports the multi-criteria decision making and developed by Saaty (1980), ranking ranges 1 to 9 where 1 indicates less important parameter and 9 refers to the most responsible factor for soil erosion (Saaty et al. 1980; Srivastava et al. 2012). The ratio scales were generated using paired comparisons of criteria in AHP with some inconsistencies in judgments. Then, priorities (weightings) of criteria and consistency ratios are calculated (Goepel et al. 2018). The priorities (weightings) of the criteria are derived using the principal eigenvector (e) of the matrix M (3), as follows:

$${M}_{e}={\lambda }_{\mathrm{max}}\times e$$
(3)

According to Saaty (1980), the value of consistency ratio (CR ≤ 0.1). The CR and CI can be estimated using the following formula given below (4) and (5):

$$\mathrm{CR}=\frac{\mathrm{CI}}{\mathrm{RI}}$$
(4)
$$\mathrm{CI}=\frac{(\lambda \mathrm{max}-n)}{(n-1)}$$
(5)

where \(\lambda \mathrm{max}\) = largest eigenvalue and e = principle eigenvector

$$\mathrm{CI}=\left(Consistency\, index\right), \mathrm{RI}=(\mathrm{Random Inconsistency})$$

where n is the number of variables.

2.2.9 Final prioritized map

Further, results from field capacity (water holding capacity), compound factor, and RUSLE were used to generate a prioritized map for the distribution of erosion patterns in the basin. The prioritized map was prepared using the above three factors and the weighing overlay method on the GIS platform. Final erosion map was classified into five zones to the erosion variability in the region as very low, low, moderate, high, and very high zone in respect of erosion. Although four RUSLE maps were prepared from the four different LS-factors (from four different DEMs), final prioritized map of soil erosion was prepared using the RUSLE results (SRTM 30 m DEM), which was more closed to observed data in the basin.

2.2.10 Hypsometry of sub-basins

The hypsometry simply relates to the measurement of land elevation which aims to develop a dimensionless ratio of cross section area of the basin to its elevation (Dowling et al. 1998). Strahler (1952) stated that the hypsometric analysis could help to identify the erosion status at a different level. The hypsometric integral (HI) and hypsometric curve (HC) are the two special outcomes of the hypsometry (Singh and Singh 2018), which serves as an indicator of sub-basin condition (Ritter et al. 2002). The HI and HC have calculated using the DEM with QGIS (Quantum GIS 2019) using the CalHypso extension for the hypsometry at basin and sub-basin level. The basic method behind the calculation of hypsometric integral (HI) is given by Pike and Wilson (1971), which is simply an elevation-relief ratio. The following relationship is as given below Eqn. (6):

$$\mathrm{HI}=({\mathrm{Elev}}_{\mathrm{mean}}-{\mathrm{Elev}}_{\mathrm{min}})/({\mathrm{ Elev}}_{\mathrm{max}}-{\mathrm{Elev}}_{\mathrm{min}})$$
(6)

where \({\mathrm{Elev}}_{\mathrm{mean}}\): weighted mean elevation of the sub-basin; \({\mathrm{Elev}}_{\mathrm{max}}\): Maximum elevation within the sub-basin; and \({\mathrm{Elev}}_{\mathrm{min}}\): Minimum elevation within the sub-basin.

3 Results

Among these soil classes, loam (Be84-2a-3685) is dominant soil type with 41,275 km2 (32.12%) followed by loam (I-Bh-U-c-3717) and loam (Bd34-2bc-3663). The lowest coverage area soil is loam (Be74-2a-3675), having 27.56 km2 of total area. The dominant land use/land cover is cropland and illustrated in Fig. 3.

Fig. 3
figure 3

Land use/land cover (LULC) map is showing a class distribution in the study area

3.1 Morphometric analysis and prioritization of sub-basin

The results of morphometric analysis showed total fifth order of streams (first order to fifth order) for quantitative evaluation of basin and also to interpret the morphodynamic characteristics (“Appendix-Table 8”). The highest number of streams (465) was occupied by the first-order stream, while the lowest number (46) by the fifth-order stream. Among the sub-basin, a maximum of 64 streams laid in the sub-basin 18, while the minimum 7 streams were laid in the sub-basin 30. The results showed that first-order streams were of a smaller length stream, but their total length was greater than other higher-order streams for each sub-basin. The sub-basin 18 was occupied by the higher stream length (1186.93 km), while 30 sub-basins occupied by the lower stream length (117.56 km). The sub-basin no. 18 had the highest drainage area (9240.95 km2 \({\mathrm{km}}^{2}\)), while the sub-basin 30 had the lowest (958.71 km2) drainage area. The highest (1240 km) and lowest (349.76 km) value of perimeter was found in sub-basin 18 and sub-basin 30, respectively. Similarly, the basin length (Lb) \(({\mathrm{L}}_{\mathrm{b}})\) was highest (234.67 km) for the sub-basin 18 and lowest (64.79 km) for sub-basin 30.

The RLvalue for the basin was ranged from 5.513 (max.) to 0.524 (min.) for sub-basins 25 and 1, respectively, whereas value of Rb ranged from 4.392 (max) to 1.205 (min) for the sub-basins 13 and 28, respectively (“Appendix-Table 9”; Fig. 4 a-e). The overall Rb for the basin was 1.821. The other linear parameters as Dd were varied over 0.182 (max) to 0.072 (min) for sub-basins 25 and 6, respectively. The value of Lo was max (0.091) for sub-basin 25 and min (0.036) for sub-basin no. 6. The Sf showed maximum value (0.012) for the sub-basin 28 and minimum value (0.005) for sub-basins 23 and 24.

Fig. 4
figure 4

a Sub-basin-wise stream length ratio in the Ghaghara river basin, b sub-basin-wise bifurcation ratio, c sub-basin-wise drainage density, d sub-basin-wise length of over land flow, and e sub-basin-wise stream frequency in the Ghaghara river basin

The results of shape morphometric parameters showed that form factor (Ff) was varied from 0.228 (maximum) to 0.168 (minimum) for sub-basins 30 and 18, while elongation ratio (Re) was 0.539 (maximum) to 0.462 (minimum) for the sub-basins 30 and 18 (Fig. 5 a, b, c, d and e). The circulatory ratio (Rc) was ranged from 0.288 (maximum) to 0.058 (minimum) for sub-basins 17 and 28, while the shape factor was from 5.959 (maximum) to 4.379 (minimum) for the sub-basins 18 and 30. The compactness factor (Cc) for the study area varied from 4.158 (maximum) to 1.864 (minimum) for the sub-basins 28 and 17. The prioritization for the Ghaghara river basin was done, and sub-basin 19 was assigned as first rank following sub-basins 13 and 9, while the last rank was assigned to sub-basin 26 (“Appendix-Table 10”).

Fig. 5
figure 5

a Sub-basin-wise form factor in the Ghaghara river basin, b sub-basin-wise elongation ratio, c sub-basin-wise circulatory ratio, d sub-basin-wise shape factor, and e sub-basin-wise compactness factor in the Ghaghara river basin

3.2 RUSLE

The rainfall-erosivity factor (R) for the Ghaghara river basin was ranged from 298.8 (north-east and north-west part of the basin) to the 11,171.8 MJmmha−1 h−1y−1 (middle and lower part of the basin) (Fig. 6a). The soil erodibility factor (K) ranges between 0 and 1, where K-factor near to 0 means less susceptibility for erosion and K-factor near to 1 is represented higher susceptibility for soil erosion. The value of K-factor estimated for the basin ranges from 0.014 to 0.022 (thahha−1 MJ−1 mm−1) (Fig. 6b). The upper portion of the basin was occupied by the lower K-factor, while the middle and central portion was occupied by the higher erodibility factor in the basin. The cover management factor (C-factor) in the Ghaghara river basin ranged from 0 to 0.2. The higher value of C-factor was found in central and lower region of the basin, while the upper part of the basin was occupied by lower value of C-factor in the basin (Fig. 6c). The P-factor value in the Ghaghara river basin ranged from 0 to 1 (Fig. 6d). The moderate P-factor value occupied the lower central portion of the basin. The upper portion was mostly shown the higher value of P-factor nearly 1 or close to 1, but some portion of the upper basin was also occupied by the lower or zero P-factor value.

Fig. 6
figure 6

a Rainfall-runoff erosivity map (R factor), b soil erodibility factor map (K-factor), c cover management factor map (C-factor), and d support practice factor map (P-factor)

The value of LS-factor for SRTM 30 DEM varied over 0.03 to 100.66, and for ALOS 30 m DEM, it was 0.03 to 102.54 in the basin (Fig. 7a, b). The LS-factor for MERIT 90 m DEM was 0.03 to 33.34, while for SRTM 90 m DEM, it varied over 0.03 to 128 in the basin (Fig. 7c, d). The upper portion of basin augmented the higher value of LS-factor, while the lowest value of LS-factor laid in the middle and downstream portion of basin for all four DEMs.

Fig. 7
figure 7

Topographic factor maps (LS-factor) are showing high and low value of LS-factors for a SRTM 30 m DEM, b ALOS 30 m DEM, c MERIT 90 m DEM, and d SRTM 90 m DEM in the study area

3.3 Soil erosion rate and model performance

The annual rate of soil erosion was estimated by multiplication of five RUSLE factors (for different DEMs), which had shown soil loss in the basin. The erosion rates were 21.39, 18.31, 4.35, and 4.64 ton/ha/year for SRTM 30 m, ALOS 30 m, MERIT 90 m, and SRTM 90 m, respectively (Table 2). We had evaluated the RUSLE results for all DEMs using observed data, and the percentage of changes in soil erosion amount from observed were 13.98%, 26.38%, 82.51%, and 81.34%, respectively, for SRTM 30 m, ALOS 30 m, MERIT 90 m, and SRTM 90 m. The results showed that among the annual soil erosion rate estimated using different DEM, the RUSLE soil erosion rate of SRTM 30 m was more closed (13.98%) to observed data, followed by the soil erosion rate of ALOS 30 m (26.38%). The soil erosion rate by 90 m DEMs showed the poor results, whereas soil erosion from fine resolution DEM was closer to observed data.

Table 2 Annual rate of soil erosion statistics obtained from four different DEMs

3.4 Spatial variation in soil erosion rate

We had categorized the soil erosion rate into seven different ranges for better understating about the variation of the soil loss in the basin. These seven soil erosion categories were < 5, 5–10, 10–15, 15–20, 20–25, 25–30 and > 30 ton/ha/year for four different soil erosion rate maps (Fig. 8 a, b, c and d). Overall, the category < 5 ton/ha/year was maximally observed in all maps of soil erosion rate for all the DEMs and this category covered the greater upper portion of the basin for SRTM 30 m and ALOS 30 m DEMs. The category 5–10 ton/ha/year was mostly covered the lower middle portion for erosion rate map of ALOS 30 m and SRTM 30 m DEMs. The category 10–15 and 15–20 ton/ha/year covered the low portion for all DEMs. The higher erosion rate categories (20–25, 25–30 and > 30 ton/ha/year) were occupied by the south-east and the middle portion for SRTM 30 m and ALOS 30 m DEMs while middle portions for MERIT 90 m and SRTM 90 m DEMs.

Fig. 8
figure 8

The estimated soil erosion rate (ton/ha/year) using RUSLE for a SRTM 30 m DEM, b ALOS 30 m DEM, c MERIT 90 m DEM, and d SRTM 90 m DEM is showing the distribution pattern of soil erosion rate in the study area

3.5 Soil hydraulic parameters

Table 3 shows water holding capacity of soil types in the Ghaghara river basin. The soil clay_loam (Bd29-3c-3661) shows the highest value of field capacity (0.284 cm3/cm3), while the clay_loam (Rd30-2b-3851) has 0.257 cm3/cm3) and loam (Jc50-2a-3743 ) has (0.251 cm3/cm3), respectively. The other soil types showed moderate to lower capacity, while the lowest rank was accounted by glacier-6998 (0.22 cm3/cm3) (Fig. 9).

Table 3 Water holding capacity based on soil types in the Ghaghara river basin
Fig. 9
figure 9

A schematic diagram of water retention curve for different soil types in the Ghaghara river basin

3.6 Multi-criteria evaluation (MCE) and final prioritized map

For the Ghaghara river basin, to use the MCE-based methodology, three factors (compound factor from morphometric analysis, erosion of soil from RUSLE, and filed capacity (ROSETTA) from the soil had been constituted. MCE was applied to these factors to schedule soil erosion prone zone in the entire basin and so that subsequent conservation measures can be applied. Following, the procedure of AHP based MCE, the estimated consistency ratio was 0.02, which was less than 0.1 (Table 4) and according to Saaty (1980) it is considered as good and acceptable. The above three factors were explained by their decision matrix and priority criteria (weighting) using the MCE. The highest weighting was given to RUSLE model (71.6%) followed by compound factor (20.5%) and field capacity (7.8%) which had 2nd and 3rd rank, respectively. The result of the final prioritized map (Fig. 10) showed that erosion activity in the middle part of the basin ranged from high to very high (SB-3, SB-5, SB-7, SB-14, SB-26, and SB-30).

Table 4 Pair-wise comparison matrix by MCE-AHP method for weighting four factors
Fig. 10
figure 10

Prioritized map is showing areas which are needed interventions for conservation practices in the Ghaghara river basin

The northern upper (SB-1, SB-2, SB-4, SB-6, SB-8, SB-10, SB-11, SB-15, SB-17) and lower portion (SB-21, SB-22, SB-27) show moderate erosion. The SB-9, SB-16, SB-18, SB-19, SB-20, SB-23, SB-24 show low to very low erosion in the basin. The high erosion zone captured the 9.30%, while moderate erosion zone occupies the 46.72% area of the basin (Table 5).

Table 5 Percentage-wise soil erosion zone in the Ghaghara river basin

3.7 Hypsometric analysis

Hypsometric parameters (HI and HC) of the Ghaghara river basin were acquired at sub-basin level for the detailed analysis of mass movement (erosion) happening in the basin (Figs. 11, 12). The HI values are categorised into 3 zones for easy identification of the erosion prone zone, and these zones are as follows 0 to 0.3, 0.3 to 0.6, and above 0.6 , respectively. If the HI value of the sub-basin is 0.3, it means that 30% of original rock mass still exist in that sub-basin. The HI value of the Ghaghara river basin ranges from 0.032 (min) for the sub-basin number 22 to 0.664 (max) for the sub-basin number 28. The rest of the sub-basins (almost 50%) has value in between 0.3 and 0.6, while others are below the 0.3 and few are above the 0.6 level.

Fig. 11
figure 11

A schematic diagram showing hypsometric integral (HI) at sub-basin level in the study area

Fig. 12
figure 12

Sub-basin-wise hypsometric curve in the Ghaghara river basin

4 Discussion

The relation of master streams with their joining tributaries is the drainage system, which shows the topography effect on streams (Strahler 1964). The Ghaghara river basin occupies a more undulating structure from Himalayan chains (upper confluences) to Indo-Gangetic plain (downstream), resulting from different drainage patterns. The upper parts have a trellis pattern, which is a rectangular type (northern and north-east portion), while lower parts consume dendritic and sub-dendritic types. Strahler (1964) has explained the method for ordering streams based on hierarchical ranking. For the Ghaghara river basin, total fifth order of the streams was generated and stream order is inversely proportional to the stream length (Singh and Awasthi 2011). This inverse relation between stream order and stream length is indicator of variation in lithology, moderately steeper slopping pattern and high altitude flowing streams (Singh et al. 2013). Sethupathi et al. (2011) describe the role of stream length in recognizing hydrologic properties (permeability) of rock. Small streams that are large in number follow the less permeable path, while the longer streams of smaller counts follow a more permeable zone. Waikar et al. (2014) explain the role of stream length for revealing surface runoff as small length streams are characterized in slopping area and therefore finer texture, while the larger length streams are indicative of plainer zone. This is also happening in the Ghaghara river basin where larger length streams are maximally dominated in the plain area (lower and middle portion of the basin). The relative capacity of a rock to pass the fluid, discharge, and erosion condition in a basin can also be estimated by taking the ratio of stream length as suggested by Horton (1945) and which is stream length ratio (RL). In the Ghaghara basin, sub-basin 25 has the highest RL value, which is supported by permeable zone (almost sandy area) and gentler gradient than the sub-basin 1, which has the lowest RL value and depicts the opposite condition of sub-basin 25.

The above variation in the topographic condition due to RL in basin is also supported by Vittala et al. (2004). There are basically two types of bifurcation ratio (Rb)as one is low value and another is high value Rb. The low value Rb indicates those drainages which are not affected by the geologic constraints, while the high value Rb is primarily influenced by geologic constraints. Gajbhiye et al. (2014) stated that higher values of Rb is supported steeply sloping surface with narrow confined valley, therefore, higher chances of more runoff and less recharge in that area. In the Ghaghara river basin, sub-basins 13 and 17 have higher value of Rb, and these sub-basins are confined to the north-eastern part of the basin proving steeply sloping surface with a narrow confined valley.

Nag and Chakraborty (2003) stated that low Rb has characteristics of less structural disturbances in that area and lower value Rb is confined to the lower part of the Ghaghara river basin where drainages are not structurally distorted, yet they are flowing in plain of maturity. The lower part of the basin is permeable and have soft strata therefore a good chance to percolate/infiltrate the surface water to ground-water. The Dd most vital linear parameters depict the quantitative measure of average length of stream in a particular sub-basin or for the whole area. The lithological properties in terms of porosity and permeability of earth material can also be recognized using Dd therefore good in decision making for artificial recharge site. The low Dd area is generally occupied by highly registrant track, and dense vegetation (Nag 1998), and these areas in the Ghaghara river basin are captured by hilly/registrant tracks and middle part having characteristics of dense vegetation. Whereas high value Dd is found in basin where surface materials are weak and dominantly occupied by sparse vegetation cover.

The vegetation pattern and hydrologic properties of soil (permeability) have a big role in deciding surface runoff and therefore directly decide the density of drainages in that area (Dash et al. 2019). Yadav et al. (2020) also supported above that low Dd has coarse drainage, while high Dd has fine drainage texture. Moglen et al. (1998) stated that Lo is inversely related to Dd, and important linear parameter governs the hydrologic and physiographic characteristics of a drainage basin. Rama (2014) pointed out that lower Lo indicated quicker runoff process and vice versa. Here, for the Ghaghara river basin Lo varies from 0.036 to 0.091 with a mean value of 0.055, whereas upper tracks have lower value, while lower parts have a higher value.

The sub-basins in the Ghaghara river basin, those have a lower Lo laid in the hilly mountains region where faster runoff carries water from upstream to low lying area. Therefore, sub-basin those have higher value of Lo gets a chance to accommodate the more water for flooding during intense rainfall period and this is the main characteristics of lower part of the Ghaghara river basin, which causes flooding and erosion in the basin. However, the situation becomes the opposite during the dry period. The sub-basins 4, 5, 11, 15, and 28 have a higher Sf value, while other sub-basins have low to medium range.

The high Sf is the characteristics of impermeable subsurface lithology and high relief, while other hydrological properties like infiltration capacity and percolation also help to decide the Sf of the region. The sub-basins that are occupied by the highest value of Sf and impermeable lithology in the basin generally produce faster runoff. This faster runoff originates the floods downstream of the basin, and during the rainfall period, the downstream parts of the Ghaghara river basin, especially the Bahraich, Faizabad, Basti, Ambedkar Nagar, and Azamgarh, are inundated with the muddy water. One of the serious impacts of the flooding in the Ghaghara river basin is siltation, which is a complex environmental problem than landslides in the upper part of the basin. Therefore, it is a subject of keen interest to mitigate the hazard of flooding in cities/villages situated in plainer area where the Ghaghara river comes to mature stage and the chances of flooding and lateral meandering become more dangerous than to down-cutting the basin.

The form factor (Ff) is one of the important shape parameters of morphometry, which defines whether the basin is circular or elongated. There is a threshold value 0.78 for deciding the basin to be elongated (if Ff < 0.78) or circular shape (if Ff > 0.78) (Kumar et al. 2018a), and since Ff for the Ghaghara river basin is always less than 0.78, therefore, the basin is elongated in shape.

The lower part of the basin, especially sub-basins 25, 26, 28, and 30, has a higher value form factor than the upper part of the basin. Therefore, the sub-basin which has high form factor will reoprt high intensity (peak flow) within a short time, whereas more elongated sub-basins will show lower peak (low flow) of longer time. The factor which influences peak flow and low flow in a basin is basin length (Lb), as higher basin length reduces the peak discharge and vice-versa (Rao 2016). Therefore, this elongated sub-basin can be easily managed than a circular sub-basin. The elongation ratio (Re) characterized by the shape of the basin varies from 0.6 to 1.0 in the variability of climatic and geologic setup. The value of Re varies from 0 (in highly elongated shape) to unity 1.0 (in a circular shape). The value close to 1.0 depicts low relief, whereas Re close to 0.6–0.8 is usually associated with high relief (Strahler 1964). The infiltration capacity of the basin can be checked by the elongation ratio as high Re indicates high infiltration capacity, and in the Ghaghara river basin, the higher values are close to sub-basin 28 and 30 where the subsurface material (alluvium) is more permeable therefore higher infiltration capacity.

Reddy et al. (2004) stated about Re that areas with lower value are found in the zones which have a higher susceptibility to erosion and sediment load, and these areas are dominated in the low stream and upper stream of the Ghaghara river basin. The appraisal of flood prone areas can be easily performed using the shape morphometric parameter such as circulatory ratio (Rc) as the flood concerning parameters like stream length, and its frequency, geological structures, and climate, etc., have great control on Rc. The higher Rc causes chance of flooding during peak rainfall condition at outlet point of their sub-basins as this outlet becomes inlets for lower Rc sub-basin in the downstream part of river basin. The same is true for the Ghaghara river basin as the downstream sub-basins have lower Rc and receives more water from the upper part of the sub-basin having the higher Rc value.

Bali et al. (2012) reported the range of Rc from 0 to 1 and explain morphological stage using this range as lower value of Rc suggests young/mature stage of the river, while higher value indicates the old stage. For the Ghaghara river basin, the circulatory ratio always closer to the lower circulatory ratio, this suggests that the basin is in young to mature stage. The mean Rc value is 0.134, which is always less than unity and shows that the shape of the basin is not in a circular pattern, and Rc and Re both have confirmed that the Ghaghara river basin has an elongated shape of the basin.

The shape factor (Bs) which is inversely related to the form factor (Ff) defines shape irregularity of a sub-basin. The upper and middle sub-basins have higher shape factors, while the downstream sub-basin has a lower value.

Shape factor helps to quantify the head (highest) discharge at the pour point of the basin. As a tributary pattern of a circular-shaped basin is more compactly organized than an elongated-shaped basin having the same area and tributary flow joins to mainstream at roughly the same time, therefore, peak discharge will arrive faster at the outlet of circular basin. Thus, outlet gains the higher flood within a shorter duration. Choubey and Ramola (1997) described that the shape morphometric parameters have an inverse relation to the soil erosion; therefore, the compactness factor (Cc) should show the opposite trend to soil erosion, and it can be concluded that the lower the value of Cc depicts, higher erosion in that area and vice-versa. The results of compactness factor show that the sub-basins are having a lower value of Cc following the high to moderate erosion in that area and these sub-basins are situated in the upper portion.

The results of hypsometric analysis also validate the results of Cc as the upper stream sub-basins are in young to mature stage of the river, and hence, more erosion might be in process. At the same time, the middle and downstream sub-basins occupy the higher value of Cc in comparison with the upper portion of the basin and, therefore, lower erosion comparison to upstream. Since it has an inverse relationship to elongation ratio, consequently, the downstream sub-basins are more elongated. Gravelius (1914) states that if the Cc value is unity, then the basin will be perfectly circular, and for the Ghaghara river sub-basins, the value of Cc is greater than unity; therefore, it is an elongated river basin.

The compound factor and prioritization of sub-basin help to know the sub-basin susceptibility to loss of soil in the Ghaghara river basin. It is important to manage the whole basin at once step; therefore, the prioritization help to implement the conservation practices according to the sub-basin condition. Gajbhiye et al. (2014) also stated that the prioritization of sub-basin based on the ranking of sub-basins are needed for the conservation practices. The low prioritized sub-basin demands higher conservation practices (Pradhan et al. 2018) and vice-versa. The results of the compound factor showed that for the Ghaghara river basin, sub-basin no. 19 (rank 1) is more prioritized, followed by sub-basin no. 13 and 9.

The hydraulic properties of soil help to understand the erosion condition of that area, and therefore, the field capacity of the soil was estimated at different pressure heads for the Ghaghara river basin. The water retention curve depicts that soil clay_loam (Bd29-3c-3661) has more field capacity which means that this soil will retain and accumulate more water than other soil types, and therefore it will cause a severe erosion during high peak flow. Similar conditions will be followed by the clay loam (Rd30-2b-3851) and loams (I-Bh-U-c-3717 & Jc50-2a-3743 ) respectively. The soil erosion by RUSLE for all DEMs showed that upper middle and lower (south-east) portions of the basin have a high contribution to soil erosion.

It can be concluded that higher erosion zones are those areas that occupy their position mostly in hilly terrain and where headward and deep erosion process is mostly activated due to their natural topography. Mahapatra (2005) also supported the above argument that hilly terrain areas are characterized by headward and vertical erosion due to the high energy in streams. We have used outcome of RUSLE, compound factor, and water holding capacity (field capacity) to understand erosion pattern in the entire basin as erosion process is very complex and constitutes the role of various factors. Therefore, we have followed MCE-based AHP procedure in order to include the weightage of each factor. The meaning of weightage here indicated the efficiency of parameters to depict soil erosion. The RUSLE model has the highest weightage, followed by the compound factor. The lowest weightage was assigned by field capacity.

The final prioritized map was developed by overlay analysis and had been classified in to five zones to understand soil erosion in the entire basin. The final prioritized map of soil erosion shows that 9.30% of the basin falls under high erosion categories. The areas under low to very low erosion are contributing 40.41 to 3.51%, respectively, in middle and lower parts of the basin where the basin is almost in the monadnock stage (old). The percentage of moderate erosion is 46.72% in the lower, upper, and some middle parts of the basin. For more illustration of the variation of soil erosion in the basin, the total seven categories were generated and results depict the soil erosion rate < 5 ton/ha/year is covering the major portion of the basin for all DEMs. The results of soil erosion for all four DEMs depict that the finer resolution DEMs are predicting the good results and more close to observed data. Here, SRTM 30 m and ALOS 30 m DEMs show the 13.98% and 26.38% difference, respectively, from the observed data. In comparison, 90 m resolution DEMs as SRTM and MERIT show the higher differences from observed data and therefore represent the poor results. Hence, it can be concluded that the RULSE results for the SRTM 30 m DEMs followed by the ALOS 30 m are best suited for the Ghaghara river basin. The outcomes from the study are helpful in selection of appropriate DEMs and showed the effects of DEMs variability for soil erosion in basin. It can be concluded that fine resolution DEMs represent good results than the coarser DEMs in basin (Mondal et al. 2017), and hence, the prioritization of basin based on soil erosion will be helpful for recognition of vulnerable areas.

The hypsometric analyses help to know the disequilibria and landscape evolution in the Ghaghara river basin, and there are three types of hypsometric curves according to the geomorphic age of the river basin. Strahler (1952) described the convex upward shape curve for the young stage, while S shape curves for the mature stage have the upper portion concavity while the lower portion convexity. For the old stage (peneplain/monadnock) of the basin, the concave upward shape was suggested by Strahler (1952). Here, in the Ghaghara river basin most of the hypsometric curves of sub-basins are of all three types. Although not all the sub-basins are S-shaped, some are convex upward (sub-basins 28 and 30), and some others are concave upward following the equilibrium condition. The hypsometric integral (HI) values also help to grasp the geomorphic condition of the basin therefore plotted for 30 sub-basins. The HI values validate the same results as the hypsometric curves and the maximum sub-basins of the Ghaghara river basin occupy their position in the mature stage of the erosion cycle and moving toward the monadnock (old stage). Ritter et al. (2002) stated that these mature stages sub-basins would face moderate erosion but might be intense in the high runoff period or in the case of entrenched meandering. Therefore, the HC and HI are useful parameters to understand sub-basin health. The Ghaghara river basin occupies its position in the Himalayan region particularly in the lesser and Shivalik range, and these areas attain the mature stage from the young.

5 Recommendations and suggestions

The water erosion structure needs to be installed at the soil prone areas, stopping illegal sand mining by implementing the stringent rules and regulations; better understanding about the extreme event and return period of flood is required. The sustainable land use practices need to be focused in the region with no tillage activities, to control the forest clearing, and afforestation activities need to be promoted to intact soil particles. For arrest of lateral erosion, the levee formation and routing of excess water through canals at the time of peak discharge are recommended. The location-specific process-based models can be applied to allow the use of local parameters governing more erosion in that area. Moreover, the location specific model should use the input of those processes that are actually occurring in study area like landslides and mass wasting which are common processes that are occurring in the Ghaghara river basin. The high resolution of satellite data and DEM is recommended for the estimation of soil erosion to minimize the uncertainty in the results. The downstream siltation records of dams and reservoirs needed for more precise outcomes.

6 Conclusion

Estimation of soil erosion will be helpful to control soil loss in the Ghaghara river basin. Therefore, the current study has used RUSLE model for four DEMs to estimate soil loss in the basin. The other parameters, such as morphometric analysis and water holding capacity of soil, have also been calculated. Further, AHP-based MCE methodology has been adopted using three factors (soil loss from RUSLE, compound factor from morphometric analysis, and water holding capacity of soil) to identify soil erosion zone for subsequent conservation measures. The estimated annual soil erosion rates are 21.39, 18.31, 4.35, and 4.64 ton/ha/year for SRTM 30 m, ALOS 30 m, MERIT 90 m, and SRTM 90 m, respectively. The result of erosion rate for the SRTM 30 m DEM is more closed to observed data followed by the ALOS 30 m, while the coarse resolution DEMs depict the poor result. The prioritized map shows the five soil erosion areas which will be helpful in finding the erosion prone zone in the basin. The hypsometric analysis outcomes depict the geomorphic age of basin as old and mature stages form young. The study outcomes will be useful for researchers in appropriate selection of DEMs and to know the consistency in the model outputs due to different resolution DEMs from the observation data.