Introduction

Soil degradation is currently the most severe environmental problem worldwide, as it depletes topsoil nutrients, degrades agricultural land, and diminishes crop output (Pradeep et al. 2015). In a watershed, soil erosion can lead to severe land degradation issues. Recently, it has been found that researchers are actively working on analysing soil erosion and sediment yield, mainly focusing on tropical and subtropical regions (Issaka and Ashraf 2017). Globally, the annual soil erosion rate from cultivated land ranges from 22 to 100 tonnes per hectare (Girmay et al. 2020). One-third of the world's agricultural area is believed to be affected by soil erosion (Hurni and Meyer 2002). According to estimates from an assessment of soil deterioration in India, water and wind erosion affected around 1100 million hectares and 550 million hectares, respectively (Saha 2003). In the Himalayas and the Western Ghats, soil erosion affects about 45 per cent of the land area. Soil disintegration directly impacts the massive production of sediments and rapid deposition in watersheds. Human-caused soil deterioration affects approximately 1964.4 million hectares (M ha) of soil globally, with water erosion exacerbating the problem in 1903 M ha (Bhattacharyya et al. 2015). Several experts identified soil loss as a serious threat in mountainous areas. In the Brahmaputra basin highlands, this situation is particularly acute. An accurate estimation of soil erosion amount is required to determine the effect of soil erosion on a regional scale. A massive amount of silt in runoff water causes sedimentation of river beds and flood disasters; as a result, effective control and supervision should be a critical component of watershed planning and management (Xu et al. 2015). Because of its adaptability under a variety of ecosystem types and management scenarios, the RUSLE model is used in several studies to estimate net and average soil loss and sediment discharge at river outlets for various watersheds (Jain and Kothyari 2000; Ganasri and Ramesh 2016; Alewell et al. 2019; Li et al. 2020; Yadav et al. 2022). Manual or mechanical methods of assessing the above components with adequate precision for calculating soil loss are time-consuming and uneconomical because of the delicate interaction between climate, LULC, and other anthropogenic activities involved in a watershed (Duru et al. 2018). Many empirical models based on geomorphological factors were constructed to calculate previous soil loss and quantify sediments output. Several approaches are widely used, including the sediment yield index (SYI), the Universal Soil Loss Equation (USLE), and the Revised Universal Soil Loss Equation (RUSLE) (Renard 1997; Wischmeier and Smith 1978). As the USLE model and its revised form (RUSLE) can only measure soil loss and do not account for sediment transportation or deposition, the sediment delivery ratio (SDR) model is frequently used in conjunction with the USLE or RUSLE to assess the soil loss and sediment yields at the same time (Jain and Kothyari 2000). Using traditional approaches like a hydrographic survey to estimate sediment deposition in a watershed is time-consuming. It takes a lot of time and human resources, and it is not even cost-effective. Many researchers have used the RUSLE-SDR model to evaluate soil erosion and sediment production in various sites due to its high efficiency and ease of application (Gelagay 2016; Poirier et al. 2016; Yan et al. 2018). However, reliable measurement and erosion prediction become challenging because of the number of very sensitive and complex variables in the process of soil erosion. The ability to assess sediment yield has ramifications since it could be a useful tool in guiding future river management efforts. Advanced data-driven technologies, such as machine learning approaches and the generalized reduced gradient (GRG) nonlinear method, have been demonstrated to be effective in modeling environmental and natural resource modeling(Wenzel Jr and Melching 1987; Mirzaei et al. 2018; Hosseini et al. 2020; Niazkar and Zakwan 2021). For sediment rating curves and sediment yield in a watershed, the GRG nonlinear technique appears to outperform standard models (Vale et al. 2016; Zakwan and Ahmad 2021; Jabbar and Yadav 2022).

The Brahmaputra River watershed is one of the largest watersheds and carries massive sediment. The largest river island, Majuli, is formed because of the sediment deposition over the decades. As mentioned, the Brahmaputra basin yields a large amount of sediment every year. The formation of sand bars occurs throughout its course, and the river has become a braided river. Therefore, an assessment of the sediment yield at the outlet of each sub-watershed will provide a clear view of the sediment transport scenario. In the literature, it is found that spatially distributed models like the RUSLE-SDR model can be used accurately to assess soil loss as well as sediment yield from a watershed. Similar to other erosion models, RUSLE-SDR is a multi-input model. Still, some of its input parameters may not have all spatial data at the watershed scale for sediment yield estimation. And also, some inputs may not have good quality satellite images, requiring high computational techniques. Therefore, the output may not necessarily have satisfactory results. Moreover, obtaining reliable results from the RUSLE-SDR model depends on skilled professional opinions regarding data selection and methods for the computation of model inputs. Therefore, a numerical model that provides a simple technique and ease for estimation may help the engineers and planners quickly assess the same. Considering this aspect, it is intended to develop a numerical model based on the RUSLE-SDR model in this study. So, to develop the model, the sediment yields at different watershed levels are estimated considering outlets at various locations in the same river using the RUSLE-SDR model. After generating the data for sediment yield for about 14 years, a numerical model is developed using GRG nonlinear optimization technique with satisfactory performance statistics. The detailed study area, the methodology adopted, and the step-by-step procedure are discussed further in the following texts.

Study area and input data

Study area

The current study focuses on the Upper Brahmaputra River Basin (UBRB) up to Majuli River Island. It lies between longitude 84° 30' E to 95° 30' E and latitude 25° 00' N to 31° 30' N, as shown in Fig. 1. The study area covers a total area of about 372,385 km2. It is located at an elevation between 10 and 7312 m above mean sea level (Fig. 3a). Seasonal flow, silt transport, and channel structure are the essential characteristics of the Brahmaputra River (Mani et al. 2003). Due to frequent inundation and eroded floodplains damaging land and crops in the Brahmaputra valley and north bank tributaries, the Upper Brahmaputra valley is the worst flood-affected area (Hazarika et al. 2015). The tremendous inundation of the river Brahmaputra every year significantly impacts Majuli. The river flows in a firmly braided channel with multiple laterals, mid-channel bars, and islands in the Brahmaputra valley. The majority of them are transient, drowned during large rainy season flows, significantly varying geometry and locations. Majuli, the world's largest river island, is facing extinction due to the river's severe erosion problem (Mani et al. 2003). The Brahmaputra River has an average discharge of 20,000 m3/s. The basin's climate is monsoon-driven, with a distinct rainy season that lasts from June to October and accounts for 60–70% of yearly precipitation (Immerzeel 2008). Immerzeel (2008) classified the Brahmaputra watershed into three physiographic zones: the Tibetan Plateau (TP), the Himalayan Belt (HB), and the Floodplain (FP). The Upper Brahmaputra basin, which includes China, Bhutan, and India, consists of TP and HB.

Fig. 1
figure 1

Study Area map showing Brahmaputra River watershed up to Majuli River Island

Input data

The dataset essential for evaluating RUSLE factors and generating erosion hazard maps is prepared from various data sources. The digital elevation model (DEM) is obtained from https://lpdaac.usgs.gov/. The DEM is in 1 Arc Second (30 m) resolution (Fig. 2a). A soil map is downloaded from FAO DSMW (Fig. 2b). It is on a 1:5,000,000 scale and is classified into eleven categories following the UNESCO-FAO soil classification scheme (Table 1). The land use/land cover (LULC) maps for 2001–14 are MODIS land cover type (MCD12Q1). This product is downloaded from USGS earth explorer. Then, it is imported into Arc GIS and is classified into eight new LULC classes (Fig. 2c). It can be seen that forests and grasslands have occupied most parts of the watershed (Table 2). In some parts of the Himalayan region, all rainfall data are not available. Therefore, precipitation data are obtained from Climate Forecast System Reanalysis (CFSR). Daily rainfall data from 2001 to 2014 for 357 stations are prepared for the basin (Fig. 2d).

Fig. 2
figure 2

a DEM map, b soil map, c land use/land cover map for 2014, d GRIDED rainfall station map for the upper Brahmaputra River basin

Table 1 Soil classes for Brahmaputra River basin
Table 2 Land use/land cover classes for Brahmaputra River basin

Methodology

In this context, we have discussed the distributed model RUSLE and SDR through which we have calculated the sediments amount at the outlet of each watershed. We have also discussed the newly developed SRWA model and the GRG nonlinear optimization technique.

RUSLE model

The Revised Universal Soil Loss Equation (RUSLE) parameters are generally used # calculate average annual soil erosion in forest and agricultural environments. Wischmeier and Smith (1978) were responsible for introducing and improving the soil erosion factor. Five primary input factors, R, K, LS, C, and P, need the use of the RUSLE model. Each parameter is saved as a raster file, and the annual average soil loss is calculated in Arc GIS using the raster calculator, as shown in Eq. (1).

$$A = R \times K \times LS \times C \times P,$$
(1)

where A = Average annual soil erosion rate in t/ha/yr, R = Rainfall erosivity factor in MJ/mm/ha/hr/yr, K = Soil erodibility factor in t/hr/MJ/mm, L = slope length factor in m, and S = Slope steepness factor in (per cent), C = Crop management factor without unit, and P = Conservation practise factor without unit.

Rainfall erosivity factor (R)

Rain's impact on runoff and erosion is encapsulated by the R factor. One of the elements in the RUSLE model is the rainfall erosivity factor, which determines how much rain can erode the soil. This indicates that both the environmental circumstances and the data utilized in rainfall erosivity modeling may have an impact. The quality of available data for the selected research area is currently quite variable. As a result, using (Singh et al. 1981)'s method, the R factor is derived from the interpolated map:

$$R = 79 + 0.363 \times P,$$
(2)

where R = Rainfall erosivity factor (MJ/mm/ha/hr/yr), and P = Average annual rainfall (mm).

Soil erodibility factor (K)

Soil erodibility factor (K) specifies how vulnerable different soil types are to erosion and runoff rate (Renard 1997). The K factor indicates the cohesiveness or adhesiveness of soil particles and how prone a particular soil is to erosion on a given slope. The values for specific textural soil classes are derived from the soil erodibility factor (Wischmeier and Smith 1978). The FAO DSMW soil map is used to create the soil texture map for the research area. In the raster calculator class, the K–factor value is estimated using the formula shown below in Eq. (3):

$$K = 0.1317 \times F_{{{\text{csand}}}} \times F_{{{\text{cl}} - {\text{si}}}} \times F_{{{\text{orgC}}}} \times F_{{{\text{hisand}}}},$$
(3)

where \(F_{{{\text{csand}}}}\) = Factor that lowers the K indicator in soils with high coarse-sand content and raises it in soils with little sand. \(F_{{{\text{cl}} - {\text{si}}}}\) = Factor that yields low soil erodibility factors for soils with high clay-to-silt ratios. \(F_{{{\text{orgC}}}}\) = Factor that lowers K values in soils with high organic carbon content, whereas \(F_{{{\text{hisand}}}}\) = Factor that lowers K values in soils with extremely high sand content. The elements are calculated as follows:

$$F_{{{\text{csand}}}} = \left[ {0.2 + 0.3 \cdot \exp \left\{ { - 0.256 \cdot m_{{\text{s}}} \left( {1 - \frac{{m_{{{\text{silt}}}} }}{100}} \right)} \right\}} \right]$$
(3a)
$$F_{{{\text{cl}} - {\text{si}}}} = \left( { \frac{{m_{{{\text{silt}}}} }}{{m_{{\text{c}}} + m_{{{\text{silt}}}} }}} \right)^{2}$$
(3b)
$$F_{{{\text{orgC}}}} = \left( {1 - \frac{{0.25 \cdot {\text{orgC }}}}{{{\text{orgC}} + \exp [3.72 - 2.95 \cdot {\text{orgC}}}}} \right)$$
(3c)
$$F_{{{\text{hisand}}}} = \left( {1 - \frac{{0.7 \left( {1 - \frac{{m_{{\text{s}}} }}{100}} \right)}}{{\left( {1 - \frac{{m_{{\text{s}}} }}{100}} \right) + \exp [ - 5.5 + 22.9 \cdot \left( {1 - \frac{{m_{{\text{s}}} }}{100}} \right)}}} \right),$$
(3d)

where \(m_{{\text{s}}}\) = sand fraction content of 0.05–2.00 mm diameter (%), \(m_{{{\text{silt}}}}\) = silt fraction content of 0.005–0.05 mm diameter (%), \(m_{{\text{c}}}\) = clay fraction content (< 0.002 mm diameter) (%), and \({\text{ orgC}}\) = organic carbon (SOC) (%) (Wawer et al. 2005).

Slope length and steepness factor (LS)

The slope steepness directly impacts erosion, as a steeper slope causes more significant erosion. They produce topography on the topsoil erosion, including slope length and steepness that influence surface runoff speed (Risse et al. 1993). Flow accumulation and slope of the study area are required to determine the LS factor. The equation followed by (Bewket and Teferi 2009; Kamaludin et al. 2013) is implemented to find out the LS factor (Eq. 4).

$$LS = ({\text{FA}} \times {\text{cell size}}/22.1)^{m} \times \left( {0.065 + 0.045 S + 0.0065\, S^{2} } \right),$$
(4)

where LS = Slope length and Slope steepness factor, \({\text{FA}}\) = Flow accumulation expressed as a no. of grid cells; S = slope gradient in percentage; cell size = Grid cell size or the resolution of a DEM map, and m = exponent that depends upon slope steepness which is listed in Table 3 (Wischmeier and Smith 1978).

Table 3 Values of exponent m for slope class (Wischmeier and Smith 1978)

Cover and management factor (C)

This factor is defined as the conversion of soil erosion from cultivated land to equal soil erosion from clean tilled and continuous fallow land under particular conditions (Singh and Phadke 2006). The vegetation cover in an area provides resistance to surface flow as well as act as a shield for splash erosion due to rain. Therefore, crop management factor provides an idea about the soil losses that may incurred due to vegetation cover. Because some parts of the basin are located in China and Tibet, the C factor is unavailable for all crops. As a result, Table 4 shows the C values derived in the current study and those are calculated by Ganasri and Ramesh (2016).

Table 4 Cover management factors (C) for each land use/land cover type (Ganasri and Ramesh 2016)

Conservation practice factor (P)

The influence of land use strategies that limit runoff and thus lower erosion rates for a specific soil type is reflected by the support factor (P). The P factor depicts the soil loss ratio by a support method to upslope and downslope straight row farming (Wischmeier and Smith 1978). Land use characteristics are considered when considering management measures to reduce soil loss. Because the bulk of the upper Brahmaputra basin lacks suitable management methods, it is preferable to employ Wischmeier and Smith's P factor (Wischmeier and Smith 1978). P values are used for the current study region, according to Li et al. (2011) (Table 5).

Table 5 Conservation practice factor (P) from the land use/land cover classes (Li et al. 2011)

Sediment delivery ratio (SDR)

The RUSLE model only predicts average yearly soil erosion and ignores degraded sediment transit and deposition (Thomas et al. 2018). As a result, the sediment delivery ratio (SDR) can be applied in conjunction with the RUSLE estimate of average soil erosion to calculate the amount of sediments delivered to the watershed area's outlet over a given time period (Andersson 2010). Williams and Berndt's (1977) equation is used to compute SDR in this study:

$${\text{SDR}} = 0.627 \times \left( {{\text{SCS}}} \right)^{0.403}.$$
(5)

SCS refers to the slope of the principal river channel in the watershed, which is measured in percentages. ArcGIS is used to compute the slope of the main channel. In the absence of data, the SCS raster grid uses an average upstream cell slope, which has an impact on stream channel sediment delivery capacity.

Sediment yield (SY)

Although RUSLE cannot directly calculate sediment yield, it can be calculated using a combination of RUSLE and SDR (de Rosa et al. 2016). The percentage of eroded soil that does not deposit in the watershed while being transported to the outflow and reaches the downstream area is referred to as sediment yield. Climate, local topography, lithology, basin geometry, drainage networks, and land use/land cover types all have an effect on the sediment yield mechanism in a drainage watershed (Restrepo et al. 2006). Because some areas of the study region lack comprehensive periodic measurements, sediment yield is calculated using SDR and total soil loss. The SDR, which is the ratio of annual specific sediment yield to gross soil erosion (Fistikoglu and Harmancioglu 2002), can be expressed as follows:

$${\text{SY}} = \mathop \sum \limits_{i = 1}^{n} {\text{SDR}} \times A.$$
(6)

Here, SY = Annual sediment yield (tonne), A = yearly soil loss in the ith cell of a catchment, as calculated by Eq. (1), and SDR = Sediment delivery ratio. Figure 3 shows the flowchart of methodology for the RUSLE-SDR model.

Fig. 3
figure 3

Flow Chart of the methodology of RUSLE and SDR models

SRWA model

The RUSLE and SDR models show that sediment yield is more sensitive to precipitation than other catchment parameters. As shown in Fig. 4, there is an exponent relationship between annual precipitation and sediment yield, with a correlation of 0.52. Moreover, the whole watershed is divided into seven smaller watersheds. Sediment yield is calculated in each watershed using RUSLE and SDR models together. In this watershed, LULC has minimal effect on the change in sediment yield; hence in this model, sediment yield \((S_{{\text{y}}} )\) is a function of watershed area \(\left( A \right)\) and rainfall \(\left( R \right)\). Based on this variation of sediment, watershed area, and precipitation, one new model is selected here as follows:

$$S_{{\text{y}}} = f\left( {R,A} \right).$$
(7)
Fig. 4
figure 4

Relationship between annual rainfall (mm) and sediment yield (tonnes)

It can be written in the form of a nonlinear equation as:

$$S_{{\text{y}}}^{i} = \left[ {\alpha (A^{i} )^{\beta } + p(R_{{\text{y}}}^{i} )^{q} } \right],$$
(8)

where α, β, and p, q is the model parameters constant for \({\text{A}}^{i}\) and \({\text{R}}_{{\text{y}}}^{i}\), respectively. y denotes the year, and i is the watershed number ranging from 1 to 8; \(S_{y}^{i}\) = Sediment yield in the ith watershed for yth year (tonne); Ai = Area of ith watershed (sq. km), and \(R_{{\text{y}}}^{i}\) = rainfall in the ith watershed for yth year (mm).

Generalized reduced gradient (GRG) nonlinear method

In general, a user can use an excel solver to find the best value for an objective function in the target cell of a dataset. The Excel Solver adjusts the user-identified modifiable cells to maximize/minimize another target cell's value by working on the data set directly or indirectly connected to the target cell's goal function. Optimizing sediment model parameters is a highly nonlinear problem (Luo and Xie 2010). This tool can calibrate the design parameters of the sediment yield model developed in this study using powerful search techniques. The generalized reduced gradient (GRG) solver is used in this approach (Lasdon et al. 1978; Murtagh and Saunders 1978; Smith and Lasdon 1992). Leon Lasdon developed the GRG solver, a nonlinear optimization tool. GRG and its implementation have been demonstrated to be one of the most resilient and dependable techniques for solving challenging nonlinear programming problems over a longer period (Smith and Lasdon 1992). The optimization of these models was done with Microsoft Excel's solver extension and was based on Roddy's initial configuration (Roddy 2010). The generalized reduced gradient (GRG) nonlinear technique is utilized to estimate model parameters in this investigation. The GRG model creates solutions with ranks considering the variables with cross-over and mutation probability. The parameter sets are chosen that satisfy the objective function with the best solution. The GRG-nonlinear approach uses the multi-start parameter to boost the likelihood of a globally optimal solution. Figure 5 shows the flowchart of methodology for the SRWA model. The GRG nonlinear algorithm calculates the four coefficients in Eq. (8) using the Excel solver tool in this model. Model data are imported for the three variables rainfall, sediment yield, and watershed area based on the RUSLE-SDR model. In the SRWA model, the values for the four coefficients α, β, and p, q are estimated using optimization technique in excel solver.

Fig. 5
figure 5

Flow Chart of the methodology of SRWA model

Performance evaluation criteria

Several performance evaluation criteria are established to compare the results of different approaches (Mohan 1997; Toprak et al. 2009; Luo and Xie 2010). The following assessment criteria validate the Excel solver's and other parameter estimate algorithms' efficacy. The following indices were used to evaluate the performance of various parameter estimate approaches in terms of the relationship between observed and expected discharges. Standard statistical criteria such as the "coefficient of correlation" (CC), Nash–Sutcliffe model "efficiency coefficient" (CE) (Nash and Sutcliffe 1970), and root-mean-square error (RMSE) are used to assess the performance of the model. The direction and intensity of a linear relationship between observed and simulated data are measured by the correlation coefficient (CC) (Heathman et al. 2009). Negative and positive linear correlations are represented by plus and minus signs, with values ranging from 1.0 to + 1.0. The "efficiency coefficient" (CE) is an essential statistic for accounting the model fitness. The CE goes from 0.0 to 1.0, with 1.0 being the best. Values between 0 and 1 are considered acceptable; however, values less than 0 are not acceptable and indicate poor performance between the observed and simulated data (Jabbar and Yadav 2022). The RMSE is the residuals' standard deviation (prediction errors). It ranges from an optimal 0.0 to a big positive number and is determined as the ratio of RMSE to the standard deviation of the observed data.

$${\text{Correlation coefficient }}\left( {{\text{CC}}} \right) = \frac{{N (\sum (Q_{o} Q_{p} ) - (\sum Q_{o} )\left( {\sum Q_{p} } \right)}}{{\sqrt {(N(\sum \left( {Q_{p}^{2} } \right)} - \left( {\sum \left( {Q_{p}^{2} } \right)} \right)(N(\sum \left( {Q_{o}^{2} } \right) - (\sum \left( {Q_{o}^{2} } \right))}}$$
(9)
$${\text{Nash}} - {\text{Sutcliffe Efficiency }}\left( {{\text{CE}}} \right) = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{N} \left( {Q_{o}^{i} - Q_{p}^{i} } \right)^{2} }}{{\mathop \sum \nolimits_{i = 1}^{N} \left( {Q_{o}^{i} - \overline{{Q_{o} }} } \right)^{2} }}$$
(10)
$${\text{Root \,mean\, square Error }}\left( {{\text{RMSE}}} \right) = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {Q_{o}^{i} - Q_{p}^{i} } \right)^{2} }}{N}},$$
(11)

where i = variable ranges from 1 to n, n = total number of data, N = number of non-missing data points, \(Q_{o}\) is sediment yield of SRWA model; \(Q_{p}\) is simulated sediment yield of RUSLE-SDR model, and \(\overline{{Q_{o} }}\) = mean sediment yield of SRWA model.

Results and discussion

RUSLE-based soil loss estimation

A sediment yield model (SRWA model) is developed using the estimated value obtained from the RUSLE-SDR model. The model is developed using simple numerical techniques with the help of an excel solver, although the data used in the excel solver are generated with the help of spatial techniques like Arc GIS. To calculate the sediment output at the watershed outlet, the rainfall erosivity factor (R) is calculated using daily precipitation data (mm) from 357 gauging sites from 2001 to 2014. (Fig. 2e). The total amount of rainfall divided by the total number of rainy days yields the average annual rainfall. While observing the rainfall data for the entire Brahmaputra River basin, it is found that the rainfall distribution is highly heterogeneous. In most of the area, the downstream portion of the river receives comparatively higher precipitation, whereas the extreme upstream portion receives only snowfall. The downstream region near Majuli River Island receives the maximum rainfall, 2216 mm per year. To calculate spatial interpolation of rainfall data, ArcGIS uses the inverse distance weightage (IDW) method. With the help of Eq. (2), the R factor map is prepared for every year starting from 2001 to 2014. It can be observed from Fig. 6a that rainfall erosivity is high near the downstream sections of Majuli, whereas lower values are found in the middle and upstream portions of the watershed. After preparing the rainfall erosivity factor map, with the help of the soil texture map and using Eq. (3), the soil erodibility factor (K) map is prepared. From Table 1, it is found that lithosols and acrisols are the dominant soil types found in the watershed. About 66.09% and 13.80% of the total study area are occupied by lithosols and acrisols soil classes. Lithosols soil mostly consists of stones and rocks at the surface level. In the lithosols soils, the soil texture is more porous, leading to low water holding capacity (Triharyanto 2018). Acrisols soil type possesses low inherent fertility and is subjected to erosion on exposed slopes (Dent 1980). From the calculation done using Eq. (3), the values of the K factor vary from 0.0167 to 0.0233 and are shown in Fig. 6b. From the spatial distribution map, it has been observed that the soil erodibility factor (K) is found to vary from high to low from the upper Brahmaputra watershed to lower elevated areas, respectively. The values of K are found to be significantly high in the highest elevated part which indicates that the soil is more prone to erosion subject to receive of sufficient rainfall. The soil of the lower Brahmaputra basin is combined with clay particles, which provide higher erosion resistance.

Fig. 6
figure 6

a Rainfall erosivity factor (R) map for 2014, b soil erodibility factor (K) map, c slope length and steepness factor (LS) map, d cover and management factor for 2014, e conservation practice factor (P) maps for 2014, f average annual soil erosion rate map for 2014, g slope-based SDR map at flow path draped over DEM, h annual sediment yield map for the year 2014

Another important parameter to estimate soil erosion from a basin is its slope length and the steep factor (LS). The higher the stope of a basin, the lower the resistance during the surface flow, increasing more erosion towards the end. Therefore, the (LS) factor calculation becomes significant when estimating surface erosion. LS factor is calculated using Eq. (4) with the help of a slope and flow accumulation map extracted from DEM in the GIS platform. In this study, the highest elevation areas are found near the Tibet region of the Himalayan Mountain range, and the lower part of the basin resides in Arunachal and Assam, in India. While calculating the slope factor, the DEM is preprocessed in ArcGIS by filling the sinks and other depression areas; otherwise, it will provide some errors at the end of the programme. The exponent ‘m’ mentioned in Eq. (4) varies with the slope in a basin, and its values can be found in Table 3. Finally, the LS value for the selected study area is calculated and mapped in Fig. 6c, where it can be observed that the value ranges from 0 to 51.74. The LS factor map shows that the lower LS values are found in plain areas near Majuli, where the land is mainly used for farming and settlement.

The LULC maps of 2001–14 (Fig. 2c) are used to prepare crop management factor (C) maps. From Table 2, it is observed that in 2014, grasslands (57.34%) and forest cover (21.59%) are the two LULC classes that cover maximum watershed areas. It can also be seen that the change in LULC from 2001 to 2014 is minimal (Table 2). The crop management factor (C) maps for 2001–14 are created by reclassifying each land use category using the C values from Table 4. The C factor is determined to be between 0 and 0.63. (Fig. 6d). In 2014, it was discovered that grasslands and forests cover 57.34% and 21.59% of the watershed area, respectively (Table 2). For forest and farmland, C factor values of 0.5 and 0.003 are found in the majority of basin areas, respectively. Table 5 shows how the P factor maps for the years 2001–14 are created using the spatial analysis software in GIS. Due to a lack of data on conservation activities in some areas, the P factor value across the bulk of the watershed that is covered by forest is assumed to be 1. The watershed's P factor values are observed to range from 0 to 1. (Fig. 5e). The watershed's centre section has seen the most considerable conservation values, followed by the northeast.

The average annual soil loss maps for 2001–2014 are generated using the five RUSLE factors as R, K, LS, C, and P factors (Fig. 6f). The map is prepared by multiplying map layers of all the factors on a 30 × 30 m resolution grid using GIS by a cell-by-cell analysis. The current study's findings show a complex change in soil erosion caused by variations in rainfall amount over the study period. The watershed's average annual rainfall has increased in the north-eastern and downstream regions. This variation in rainfall amount has resulted in a shift in the pattern of soil loss rate over this time period. Critical soil loss levels are found in the north-eastern portions of the basin, where the most significant amount of soil loss is identified. After analysing the data and thematic maps, it is found that the study area produces a high rate of soil erosion amount. However, the higher elevated area produces less erosion, the central and downstream portions of the watershed yield high sediment.

Sediment yield estimation

The watershed's DEM is utilized to construct the flow route, which is then used by ArcGIS Tools to calculate the average mainstream channel slope for each raster grid. Equation (5) is used to create a sediment delivery ratio map. The SDR values for the major river channel in the basin range from 0 to 1. (Fig. 6g). Because the northeast section has steeper topography, the SDR values in the upstream and northeast regions of the watershed are greater. As a result, more eroded material will be moved into the channels and delivered to the outlet of the catchment from upstream places. Annual sediment yield maps for 2001–2014 are calculated by superimposing the SDR map over the annual soil erosion map using Eq. (6). As a result, the sediment output predicted along the whole basin's mainline up to Majuli River Island in 2014 ranges from 0 to 26,809 tonnes (Fig. 6h).

SRWA model parameters estimation

The RUSLE model-based sediment yield has seen changes in rainfall variability that have influenced sediment output during these periods (Table 6). In this case, the sediment-rainfall model parameters α, β, and p, q, should be computed for four parameters to calibrate the model for the selected watershed. A single objective function developed using Eq. (8) determines the sediment-rainfall series parameters to calibrate the models to meet the continuity requirement. An efficient optimization routine is necessary to estimate parameters with an objective function. Hence, the "Excel solver," which is simple to use, is used to find the model parameters. Parameter set values for the model are estimated for two cases: first calibration period (2001–2007) and then verification period (2008–2014), considering rainfall as the only variable. As sediment yield is a function of watershed area and rainfall in that watershed, sediment yield at the outlet of a watershed can be associated with the watershed area and total rainfall in a particular year. This relationship is shown using Eq. (8).

Table 6 Sediment yield (tonne) at different watershed outlets estimated by SDR-RUSLE model

The whole watershed is divided into seven more sub-watersheds taking outlets at seven different locations within the watershed. Sediment yield is calculated at each of the watershed (Fig. 7). Parameter values for Eq. (8) are estimated using the Excel solver's optimization technique (GRG nonlinear method). The parameter values are discovered to be = 1.39045, =−5.92783, p = 1.26481, and q = 1.10855 after a few optimization trials. In Fig. 8, a graphical display of sediment yield calculated by the SRWA and SDR-RUSLE models is plotted for the calibration period (2001–2007). Figure shows the comparison between sediment yields (tonne) of SRWA model and sediment yield (tonne) RUSLE-SDR model for verification period (2008–2014). Table 7 shows the model's statistical results during the calibration and verification periods. The statistical results reveal that the correlation coefficient (CE) for the calibration and verification periods is 0.969 and 0.964, respectively. The Nash–Sutcliffe coefficient of efficiency for the same period is 0.89 and 0.87, which is regarded as excellent. Table 7 shows the statistical performance in detail, revealing a strong correlation between the recently developed SRWA model and the SDR-RUSLE model. The estimated sediment graph for the calibration and verification periods is shown in Figs. 8 and 9. Figure 10 depicts the scatter plot of data obtained by the SRWA and SDR-RUSLE models for calibration and verification periods (2001–2014).

Fig. 7
figure 7

Maps showing all the eight watersheds

Fig. 8
figure 8

Comparison between sediment yields (tonne) of SRWA model and sediment yield (tonne) RUSLE-SDR model for 2001–07 for the upper Brahmaputra River basin

Fig. 9
figure 9

Comparison between sediment yields (tonne) of SRWA model and sediment yield (tonne) RUSLE-SDR model for 2008–14 for the upper Brahmaputra River basin

Table 7 Model performance in comparison with sediment yield using the RUSLE model in the Brahmaputra River basin
Fig. 10
figure 10

Scatter plot between sediment yields (tonne) of SRWA model and sediment yield (tonne) RUSLE-SDR model for 2001–14

In this study, the newly developed SRWA model has shown satisfactory performance during the calibration and validation processes. It can be seen that the GRG solver has found the optimal solution in about 100 iterations. Based on the above results, it can be observed that the GRG solver has shown comparatively good results in terms of CE, CC, and RMSE. Also, the GRG solver finds optimal solution for all the four sediment yield parameters. The implementation of the SRWA model at a sub-watershed level is simpler than other methods for sediment yield estimation. Therefore, the output of this model has excellent results with reasonable accuracy. Thus, in a larger watershed like the Brahmaputra River Basin, SRWA model can be applied to estimate sediment yield taking outlet at different locations of the watershed.

Conclusions

The upper Brahmaputra River basin's soil erosion and sediment output are examined utilizing an integrated RUSLE-SDR model in a GIS platform. All the thematic maps like R factor, K factor, LS factor, C factor, and P factor which are required to estimate soil erosion and sediment yield have been prepared. The developed map shows that the higher elevated areas have good vegetation cover with better support practice, which reduces soil erosion. On the other hand, the lower elevated portion produces more erosion and significant sediment yields. The present study applies the RUSLE and SDR models for eight watersheds in the upper Brahmaputra River basin to Majuli River Island. The sediment yield data are created from 2001 to 2014 using the SDR-RUSLE model. The information is further divided into two periods: calibration (2001–2007) and verification (2008–2014). A new model called the SRWA model has been constructed and calibrated for 2001–2008 and verified for 2008–2014. The SRWA model is calibrated in this study using a basic optimization approach (GRG nonlinear) available in the Excel solver. Thus, the Nash–Sutcliffe coefficient of efficiency of the model for calibration and verification period is 0.89 and 0.87, respectively. Hence, the new model can estimate sediment yield at any watershed/sub-watershed using input data such as watershed area and annual rainfall by calibrating the new model with sediment yield data obtained by the SDR-RUSLE model. Although the model performs well statistically in this investigation, there is still room for improvement using a better optimization strategy. In this sediment model, SRWA estimates the sediment yield occurrence at the sub-watershed level by only considering the most effective factors. But there may be other existing and actual sediment features, which may influence sediment yield amount somewhat from the actual values. Moreover, there may be data inefficiencies and model errors. This variation may cause some uncertainties in the result of the model. Also, we have limited this work within the above three influential parameters only by taking negligible effect of other parameters, although the model may not work in other additional parameters. Besides, this SRWA model is tested in a single watershed only and may not be suitable for different types of watersheds with different climatic and watershed characteristics. Further, this model should be applied to other kinds of the watershed by taking some additional influential parameters. In many watersheds, soil and riverbank erosion are common occurrences requiring attention. The method provided here can assess a watershed's sediment yield pattern and can be extended as needed. The study mentioned in this paper can be applied to another river basin to develop a numerical model to ease the estimation of soil erosion and sediment yield, which requires policymakers to make proper management decisions.