Introduction

From a hydrological perspective, erosion is the process whereby particles are detached from the soil surface in one place, transported by surface runoff, and deposited in another distinct location (Julien 2010). The driving forces in sediment transportation are predominantly wind, water or ice. For this motive, erosion is often classified into three corresponding categories: wind, water and ice erosion (Refahi 1996). Investigation on erosion was notably conducted by Ellison (1944) with a preliminary study of the mechanical effects of rain drops on soil particles. He demonstrated that rain drops can give rise to extensive soil erosion, which in turn can be further augmented due to lack of kinetic energy absorbing vegetation. Another noteworthy study is by Wischmeier et al. (1971), who conducted analytical studies in the United States to seek a relationship focused on erosion of farmland and construction sites. In the same year many further studies were carried out but at a regional level and the results were not applicable for large areas. Ella (2005) used the WEPP model to simulate erosion and sediment on hill slope environments located in small upland watersheds. His investigation was orientated towards establishing a relationship between vegetation cover and erosion in this specific terrain. He demonstrated that vegetation cover gives rise to the reduction of runoff and soil erosion. Flanagan and Nearing (1995) undertook a research on the Piracicaba River, Brazil, and evaluated the estimation of three methods including Cs, Universal Soil Loss Equation (USLE), and WEPP by contrasting results with observed field data. That study was specifically undertaken to evaluate the amount of erosion and the temporal distribution of sediment, apart from investigating the influence of soil characteristics on soil erosion estimation using the aforementioned methods. The results show that these methods are mutually exclusive. Although these three methods are not accurate in the estimation of soil erosion, one had the highest correlation with soil characteristics. This can be attributed to the empirical parameter used (Sparovek et al. 2000). Stolpe (2005) evaluated three erosion methods including the Revised Universal Soil Loss Equation (RUSLE), the Environmental Policy Integrated Climate (EPIC) model and the WEPP model in the Mediterranean climate of Chile. He found that USLE and WEPP models provide the same accuracy in the estimation of erosion. Recently, RUSLE has been applied on landslide prevention in Greece (Rozos et al. 2013a, b). Amore et al. (2004) compared the output from WEPP and USLE in three Sicilian sub-basins at distinct spatial scales. They demonstrated that WEPP can estimate the amount of sediment similar to the measured deposited sediment in reservoirs. In addition, they confirmed that using two different scales do not have any significant effect on the estimation. In another case study, Bhuyan et al. (2002) employed three erosion models including EPIC, WEPP and the Areal Nonpoint Source Watershed Environment Response Simulation model (ANSWERS) in a 1-year period for small plots with tillage. Each model was calibrated with the most sensitive parameters using measured data. This research show that in the row and second tillage systems the ANSWERS and WEPP models are more reliable than the EPIC model. In the case of a non-tillage system, WEPP and EPIC had a superior prediction as compared to that from ANSWERS. Bhuyan et al. (2002) concluded that a more precise and realistic prediction can be achieved for soil loss using the WEPP model when compared to the two other models. In a further study, Baigorria and Romero (2007) developed erosion risk and runoff model in Peru by integrating the geospatial information system (GIS) tools into the WEPP model and named it as GEospatial Modeling of Soil Erosion (GEMSE). Later, Defersha et al. (2012) tested two physical models including WEPP and Erosion 3D, to predict sediment and runoff responses at a watershed scale. Results of this study indicated that both the WEPP and Erosion 3D models are useful tools for watershed scale application in the estimation of runoff and sediment yield. However, in another study by Defersha et al. (2012), they conclude that both models perform well in the estimation of runoff. Nevertheless, the WEPP model performs better than Erosion 3D in the estimation of soil erosion. More recently, Mahmoodabadi and Cerdà (2013) compared WEPP results with observation data at a plot scale for arid and semi-arid terrain. They conclude that the model tends to overestimate the lower, but underestimate the higher erosion rates. Singh et al. (2011) compared the WEPP result with field data and found that WEPP underestimates the erosion rates, especially when subjected to rill erosion on steep slopes.

Execution of the WEPP model was also contrasted with ANSWERS and EPIC (Bhuyan et al. 2002), SWAT (Shen et al. 2009) and Erosion 3D (Defersha and Melesse 2012). They found that the performance of the WEPP model is similar to ANSWERS and Erosion 3D, and notably superior to EPIC and SWAT in sediment yield estimation. Pandey et al. (2008) used WEPP in Indian semi-tropical conditions by calibrating and validating with historical hydrologic data in a small agricultural watershed with medium slope conditions and annual precipitation of the order of 1300 mm. Sediment yield is highly dependent on the soil erodibility. Geospatial comparison of the soil erodibility has been addressed by Adhikary et al. (2014) using four different models in a semi-arid region of Central India. The geomorphological aspect of the soil is another important of aspect in soil erosion and sediment yield (Magliulo 2012). The related studies on geomorphological aspects of watershed, GIS application in soil erosion and mass movement can be found in the studies carried out by Mhangara et al. (2012), Bathrellos et al. (2012) and Chousianitis et al. (2014). The objective of this study is to estimate runoff and soil erosion in the Emameh sub-basin in northern Iran using WEPP coupled with the ArcGIS platform.

Methodology

Study area

Emameh is a representative sub-basin for the Latyan Dam watershed located between 35°57′05″–35°49′10″N and 51°38′40″–51 31′30″E in northern Iran. (see Fig. 1). The watershed area is about 37.2 km2 at a junction to the Jajroud River. Elevation ranges from 1781 m at the outlet to 3870 m at the headwaters (reference to the international datum WGS84). The main channel length is about 13 km, displaying a mean slope of 9.2 % (Nahvi 1994). The Kamarkhani streamflow station is situated at the mouth of the Emameh sub-basin.

Fig. 1
figure 1

Layout of the study area, Emameh watershed in North-West of Tehran, Iran

For better consideration of the methodology involved in this study, a short description of the data set and workflow is provided as follows.

Firstly, Geodatabase including Soli, Landcover and Digital Elevation Model (DEM) derivations are generated using ArcGIS10.1 platform. Watershed delineation and parametrization was performed based on high-resolution satellite-based DEM data and GIS tools and techniques. DEM optimization process is then carried out to generate agree-DEM which robust automated watershed delineation process. The WEPP model is then set up based on the field measurements and estimated parameters to simulate sediment and runoff simulation in Emameh watershed. Conceptual representation of undertaking study in Emameh watershed is shown in Fig. 2. Further details of the workflow are provided in the following.

Fig. 2
figure 2

Conceptual representation of research work

GeoWEPP

GeoWEPP is the geospatial interface for WEP. It is a collaborative project conducted by the Agriculture Research Service at Purdue University, and the USDA National Soil Erosion Research Laboratory (Minkowski and Renschler 2012). GeoWEPP uses GIS data to delineate watersheds used in WEPP. GeoWEPP was developed to run using ESRI’s ArcGIS, and for this motive ArcGIS and the associated Spatial Analyst extension are required to delineate basins. To predict sediment yield and runoff at watershed scale, GeoWEPP integrates the WEPP model and Topography Parameterization (TOPAZ) software within the ArcGIS environment. In GeoWEPP, the requisite input files include land cover, landuse, slope, climate, soil, and land management data. In TOPAZ the channel networks are generated based on two key factors, namely the minimum source channel length (MSCL) and the critical source area (CSA). Both of these parameters are dependent on the resolution of the DEM. According to Minkowski and Renschler (2012), the default value for CSA is 5 ha, and 100 m for MSCL. Referring to Zhang et al. (2009) and Yüksel et al. (2008), and considering the DEM resolution which is 30 m in this case, the MSCL was selected to be 100 m and the CSA was set at 5 ha. The lower these values, the greater the time required to run the process, with the added disadvantage of larger output files. Nevertheless, in areas of slight relief, or urban areas, higher resolutions are desirable to minimize errors.

Theoretical background of WEPP

The WEPP model uses the continuity equation in a steady state for the estimation of sediment load. The erosion modeling of hillslopes using the WEPP model has been presented by Foster and Lane (1987). According to Flanagan et al. (2007), the daily sediment yield is computed using the instantaneous concentration and flow values as follows:

$$L = \, 0.0864\;C \, Q ,$$
(1)

where L is the load in tonnes/day; C instantaneous concentration, in mg l−1; and Q instantaneous discharge, in m3 s−1. The 0.0864 constant is an empirical conversion factor for the transformation of sediment yield values from seconds to days and from milligrams to tonnes.

The annual and monthly sediment yield and runoff outputs from GeoWEPP are calculated to compare with the observed sediment yield and runoff. GeoWEPP uses the steady-state sediment continuity equation (Eq. 2) to estimate net detachment in hill slopes as follows:

$$\frac{\partial G}{\partial x} = D_{i} + D_{f} ,$$
(2)

where G is sediment load (kg/m2/s) at distance x from the origin of hill slope, x is distance down slope (m), D i is inter-rill sediment delivery rate to rill (kg/m2/s), and D f is rill detachment rate (kg/m2/s), D i is calculated based on Eq. 3.

$$D_{i} = K_{i} I_{\text{e}} \sigma_{\text{ir}} SDR_{\text{RR}} F_{\text{nozzle}} \left[ {\frac{{R_{\text{s}} }}{W}} \right] ,$$
(3)

where \(K_{i }\) is adjusted inter-rill erodibility (kg s/m4), I e is effective rainfall intensity (mm/h), σ ir is inter-rill runoff rate (mm/h), SDRRR is inter-rill sediment delivery ratio, F nozzle is an adjustment factor for sprinkler irrigation nozzle impact energy variation, R s is rill spacing (m), and W is width of rill (m). Rill detachment rate D f is calculated using Eq. 4:

$$D_{f} = K_{\text{r}} \left( {\tau_{f} - \tau_{\text{c}} } \right)\left( {1 - \frac{G}{{T_{\text{c}} }}} \right) ,$$
(4)

where K r is an adjusted soil erodibility parameter (s/m), τ f is flow shear stress (kg/m/s2), τ c is adjusted critical shear stress of the rill surface (kg/m/s2), and T c is sediment transport capacity of the rill flow (kg/m/s). Tc is defined by Eq. 5:

$$T_{\text{c}} = K_{\text{tr}} q_{\text{w}} s ,$$
(5)

where K tr is constant parameter, q w is flow discharge per unit width (m2/s), and s is the slope in percent (%). Finally, the rill detachment rate is calculated using Eq. 6:

$$\frac{\partial G}{\partial x} = D_{f} = \frac{{\beta_{\text{r}} V_{f} }}{q}\left( {T_{\text{c}} - G} \right) + D_{i} ,$$
(6)

where V f is the effective fall velocity of the sediment (m/s) and β r is the raindrop induced turbulence coefficient which has a range from 0 to 1.

Dem digital elevation

The Shuttle Radar Topography Mission Digital Elevation Model (SRTM-DEM) and the Advanced Spaceborne Thermal Emission and Reflection Radiometer ER) known as Global Digital Elevation Model (ASTER-GDEM) (JSS 2012) is the best freely available elevation data set which are suitable for hydrological and geo-hazards studies (Akbi et al. 2012). In this study, ASTER-GDEM version 2 provided by NASA was used to generate the slope profile and other physical factors (see Fig. 2a). ASTER-GDEM v2 has a spatial resolution of 28 m per pixel. This data is in public domain and free of charge which was released by NASA (http://reverb.echo.nasa.gov) in October 2011. Raw DEM for the study area is shown in Fig. 3a and stream network delineated by TOPAZ in Fig. 3b.

Fig. 3
figure 3

a The ASTER-GDEM of study area. b Stream network delineated by TOPAZ

The GeoWEPP model functions as a distributed model, i.e., the amount of sediment and runoff is calculated for each pixel individually. The model outputs are obtained in grid and text file format. The workflow for DEM optimization and channel identification is presented in Fig. 4.

Fig. 4
figure 4

Workflow of DEM optimization and channel network delineation

WEPP inputs

To process hillslope geometry, meteorological characteristics, soil properties, and land cover, the WEPP model employs four input files that include slope, climate, soil, and management files.

Slope profile editing window

The slope file is generated based on the necessary hillslope parameters such as slope gradient, shape, width and orientation along its length. GeoWEPP employs TOPAZ for watershed parameterization. The sub-watersheds and channels were derived from 28 m resolution of ASTER-GDEM version 2.

Climate data

The WEPP model uses CLIGEN (Climate Generator), which is a stochastic weather generation model (Min et al. 2011) to produce climate data files containing daily rainfall, temperature, solar radiation, and wind speed obtained from synoptic stations. To generate climate data, “Rock: Clime”, the Parameter-elevation Regressions on Independent Slopes Model (PRISM) database is used which estimates rainfall and temperature based on orographic effects (Daly et al. 1994). In “Rock: Clime”, the inputs of monthly mean rainfall and temperature values can be adjusted. For this investigation, the climatological dataset including maximum and minimum air temperature, relative humidity, rainfall, solar radiation, and wind speed listed in Table 1, were obtained from Iranian Water Resources Management Company.

Table 1 Climatological data from the Emameh watershed

Soil map and landuse

Soil properties of the study area do not exist in WEPP template tables. Therefore, it has to be keyed in by the user. Accurate representation of soil properties in WEPP is essential for estimating sediment yield and runoff. Critical parameters in the soil file are: soil texture, albedo, saturation level, hydraulic conductivity, rill erodibility, inter-rill erodibility, and critical shear (Yüksel et al. 2008). These parameters are obtained from field data collection or calculated by the WEPP model. In this study, soil parameters were measured based on a soil map and previous soil sampling reports from the Emameh basin. Land units were delineated using existing soil map generated by Research Institute for Soil and Water. Required soil information for the WEPP was obtained by soil sampling in the field by taking five samples from each land unit. Figure 5 shows the soil sampling for measuring bulk density of soil in Emameh basin.

Fig. 5
figure 5

a Soil units (Unit 1.1 represent mountain with steep slope, Unit 1.2 sandy loam with moderate depth and steep slope, Unit 1.3 sandy clay loam with deep soil and moderate slope, Unit 1.4 Clay loam with very deep soil and gentle slope, Unit 4.1 silty clay and gentle slope) and b landuse class of Emameh watershed

The values of the other properties such as rill and inter-rill erodibility, critical shear, and hydraulic conductivity were calculated by WEPP model. The Green-Ampt effective hydraulic conductivity equation is used to estimate hydraulic conductivity based on rainfall amount, surface cover, and runoff. Rill and inter-rill erodibility values can be determined depending on the sediment delivery equation in WEPP v2008.5. The soil unit map is obtained from the Research Institute for Soil and Water, Iran (See Fig. 5a). The landuse for the study were generated using Land Sat TM and aerial photo and field checking using handhold GPS.

Soil file for the WEPP models were generated based on five soil samples taken from each land units (see Appendix, Table 4). The sample was analyzed in soil laboratory and the necessary parameters were obtained.

Management file

The landuse is indicated in the management file and based on growth and mortality parameters. In this study, the necessary management file data (i.e., the amount of vegetation, tuff, litter, and woody debris) were obtained by field measurements and then entered into WEPP. The management file was generated for different landuse types (i.e., rangeland and rock), for each sub-watershed, and for every year of simulation. Based on the grass cover density, rangelands were assigned in 3 classes including poor, fair and good. Poor indicates grass density <25 %, fair indicates grass density greater than 25 % and less than 75 % and good indicate grass density greater than 75 %. Classified landuse map was obtained from the Natural Resources and Watershed Management Organization, Tehran (see Fig. 5b).

Results and discussion

Generally, the erosion assessment and the hazards assessment are useful tools for urban and regional planning (Bathrellos et al. 2012, 2013; Papadopoulou-Vrynioti et al. 2013; Chousianitis et al. 2014). In this research the WEPP model coupled with the GeoWEPP were used to estimate sediment yield and runoff discharge in the Emameh sub-basin. Predicted sediment and runoff were compared with observed data. Sediment load of rivers including suspended sediment and bed load are produced by erosion, physical destruction and chemical weathering of stones, etc. The most important factors of sediment load in the Emameh watershed are heavy precipitation, the high erodibility of river beds and banks, steep slopes, poor vegetation cover, and geological formations. Bed load sediment estimation (tonne/day) was made based on the relationship between the available rating curve of discharge (m3/s) and suspended sediment concentration mg l−1 (see Fig. 6).

Fig. 6
figure 6

Sediment rating curve of Kamarkhani station

Monthly simulation of sediment yield and runoff was successfully performed in the Emameh watershed and then compared with observed data (see Table 2). By considering the monthly average sediment yield (74 kg ha) and the area of the Emameh sub-basin (3720 ha), the total annual sediment yield is estimated at 3303.36 tonnes.

Table 2 Monthly observed and predicted sediment yield and runoff in Emameh sub-basin (1998–2008)

Statistical measures are employed to assess the practical use of WEPP model. Based on two statistical measures, Table 3 illustrates the strength of the prediction power of the WEPP model for the monthly simulation of sediment and runoff at Kamarkhani hydrometric station.

Table 3 Statistical measures utilized in Emameh watershed

Statistical measures

The accuracy of the model was assessed based on two statistical indices; the Nash–Sutcliffe Efficiency (NSE) (Nash and Sutcliffe 1970), and R-squared coefficient under the essential assumption that distribution of used data follows a normal distribution. The NS Efficiency coefficient is defined by Eq. 7. NSE varies from negative infinity to unity. Value closer to unity shows closer agreement between the estimated and observed data.

$${\text{NSE}} = 1 - \left( {\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {P_{i} - O_{i} } \right)^{2} }}{{\mathop \sum \nolimits_{i = 1}^{n} \left( {O_{i} - \overline{O} } \right)^{2} }}} \right) ,$$
(7)

where P i is the predicted sediment yield and run off for each month, O i the observed sediment yield and runoff in each month, \(\overline{\text{O}}\) the mean observed sediment yield and runoff in each month.

The second statistical measure is the coefficient of determination denoted R 2. It indicates how well data points fit a statistical model. It is a statistic indicator which is frequently used in the context of statistical models whose main purpose is either the prediction of future outcomes or the testing of hypotheses, on the basis of other related information. It provides a measure of how well observed outcomes are replicated by the model, as the proportion of total variation of outcomes explained by the model (Steel and Torrie 1960). The coefficient of determination ranges from 0 to 1. The R 2 is defined by Eq. 8.

$$R^{2} = \left( {\frac{{n\mathop \sum \nolimits P_{i} O_{i} - \mathop \sum \nolimits O_{i} \mathop \sum \nolimits P_{i} }}{{\sqrt {n\mathop \sum \nolimits O_{i}^{2} - \left( {\mathop \sum \nolimits O_{i} } \right)^{2} } \times \sqrt {n\mathop \sum \nolimits P_{i}^{2} - \left( {\mathop \sum \nolimits P_{i} } \right)^{2} } }}} \right)^{2}$$
(8)

According to the Eqs. 7 and 8 capability of WEPP in runoff and sediment yield prediction were assessed based on the observed data.

R 2 shows a dispersal rate between the estimated and observed data. If they were equal, this factor would be 1. The results demonstrate a strong correlation between the estimates. The NSE, which shows the relative difference between predicted and observed data, was positive. This indicator also shows a significant agreement between the predicted and observed sediment yield in the watershed; however, some discrepancy is observed for runoff which may be associated with the numerous springs located in the upstream on the watershed that drains into the main river. The relation between the estimated and observed data is shown in Figs. 7 and 8.

Fig. 7
figure 7

The relation between estimated and observed sediment (a) and runoff (b)

Fig. 8
figure 8

Comparison between estimated and observed sediment (a) and runoff (b)

Conclusion

Results from the WEPP model shows that significant correlation exists between the estimated and observed sediment yield based on the R-squared value (0.99) and the NSE (0.92) in Emameh watershed. However, runoff shows relatively low correlations of 0.76 and 0.62 for R 2 and NSE, respectively. The low correlation is highly due to the contribution from the numerous springs in the basin that drains into the principal channel, thereby somewhat undermining the runoff estimation. In general, it is clear that the GeoWEPP model can assist watershed-related management institutions to generate quality predictions rapidly and quickly for sediment yield and runoff in textual or graphical format based on the digital database of the watershed. Moreover, the model provides information about the areas that have a high potential sediment delivery. As such, watershed management and conservation projects can be well oriented to minimize sediment yield and flood runoff.

The WEPP model is also capable of evaluating the strategies of vegetation residuals, pedogenesis system, border places and grazing management under potential water erosion.

As a result of this study and considering other referenced works, the WEPP model is found to have high degree of adaptability within diverse climatic conditions and distinct morphologies.