1 Introduction

Understanding and predicting sediment yield from catchments form an essential part of geomorphologic and ecosystem research, and are indispensable to support policy decisions dealing with off-site effects of soil erosion such as sediment deposition within river channels and reservoirs, flooding, coastal development, and contamination of floodplains and water bodies with agrochemicals and other pollutants associated with the eroded sediments (Clark 1985; Owens et al. 2005; Ramos and Martinez-Casasnovas 2004; Steegen et al. 2001; Syvitski and Milliman 2007; Verstraeten and Poesen 1999; WCD 2000; Woodward 1995). Moreover, river sediment fluxes play an important role in various natural geochemical cycles, such as the carbon cycle (Ludwig and Probst 1998; Ludwig et al. 1996). Before predictions of sediment yield can be made, understanding the integrated effect of diverse erosion and sediment transport processes in relation to land use and climate change is needed. In recent decades, significant progress has been made in the development of spatially distributed, process-based soil erosion models such as LISEM (de Roo 1998; de Roo et al. 1996), EUROSEM (Morgan et al. 1998), WEPP (Flanagan et al. 1995; Foster et al. 1995), LAPSUS (Schoorl and Veldkamp 2001) and PESERA (Kirkby et al. 2004). However, mainly because of the large data requirements and difficulty of describing the effects of all relevant erosion and sediment transport processes, operational application of these models at the catchment scale is often problematic. A first step towards understanding and prediction of sediment yield at the catchment scale, therefore, is the use of lumped regression equations (e.g. Achite and Touaibia 2000; Ali and de Boer 2008; Dendy and Bolton 1976; Grauso et al. 2008; Hadley et al. 1985; Langbein and Schumm 1958; Lixian et al. 1996; Ludwig and Probst 1998; Ludwig et al. 1996; Restrepo et al. 2006; Syvitski et al. 2005; Tamene et al. 2006; Verstraeten and Poesen 2001). In these equations, sediment yield is often related to catchment characteristics in terms of morphology, topography, lithology, climate, discharge and soil properties. Obviously, these regression relations not only serve prediction purposes but also provide insight into relations between catchment characteristics and sediment yield that can be used for the further development of more complex models.

Table 1 provides some examples of regression equations as reported in the literature for the prediction of sediment yield for various regions worldwide. An extensive overview and discussion of regression equations can be found in Ludwig and Probst (1998) and in Jansson (1982), who grouped equations according to drainage basin size and climatic conditions. The equations in Table 1 illustrate the range of variables used in regression studies. Some equations are based on only one variable, but most use several variables to predict sediment yield. The variables most often reported to be related to sediment yield can be grouped as topography (e.g. relief ratio, slope), climate (e.g. mean annual rainfall, erosivity index), soil and lithology (e.g. % of soil texture class or erodible lithology), hydrology (e.g. mean or maximum runoff discharge), vegetation cover or land use (e.g. % forest or orchards), drainage network (e.g. drainage density, stream length), and catchment morphology (e.g. basin area, form factor). Beside the variety in variables, it is interesting to observe the large differences in optimal prediction equations for sediment yield in different parts of the world. These differences highlight that (1) erosion and sediment transport processes depend on many interacting environmental factors; (2) a factor that is important in one area can be insignificant in another; and so (3) no straightforward globally effective relations are apparent, and hence region-specific relations are needed for predicting sediment yield.

Table 1 Examples of published regression relations for the prediction of sediment yield

The objective of this study is to assess the relation between sediment yield at the catchment scale and data on topography, climate, land use, catchment morphology, drainage network, lithology and soil properties, for NW Mediterranean geoecosystems, using 61 Spanish study catchments. Furthermore, the aim is to test to what extent these regional data can be used in a multiple linear regression model for the prediction of sediment yield. This study follows on from a previous study where the sediment yields of 22 Spanish catchments were related to environmental factors of climate, topography and land use (Verstraeten et al. 2003). In the latter study, it was observed that, although a relatively good regression equation was obtained (R 2 = 0.80), the model was very unstable since exclusion of outliers changed the model and greatly reduced the model performance (R 2 = 0.29). As explanation for the model instability, it was concluded that land cover information lacked detail, and that the process of gully erosion was not sufficiently represented by included variables. Therefore, beside a higher resolution for most data, in this present study, more factors that potentially can be a proxy for the occurrence of gully and bank erosion (e.g., lithology, drainage density) were included in the analysis, and the dataset was extended to 61 catchments for which measured sediment yield data were available from the literature. In addition to the first two objectives—for comparison and to test the potential of a global prediction equation as an alternative to regionally calibrated equations—the Area Relief Temperature (ART) sediment delivery model that was developed for assessment of sediment load of global rivers (Syvitski et al. 2003; Syvitski et al. 2005) was also applied to and validated for the 61 Spanish catchments.

2 Materials and methods

2.1 Sediment yield data of Spain

Sediment yield was calculated from published sedimentation rates in 61 reservoirs in Spain (Avendaño Salas et al. 1997a, b). Volumes of sediments retained in the reservoirs were measured by the Centro de Estudios Hidrográficos (CEH-CEDEX) of the Spanish Ministry of the Environment using bathymetric reservoir surveys. The methodology was extensively described in various reports (Avendaño Salas and Cobo Rayán 1997; Avendaño Salas et al. 1995; CEDEX 1992). At the moment of the survey, the reservoirs were operative between 20 and just over 100 years. Surveys were performed between 1977 and 1994. The reservoirs are located in diverse climatic, geologic and geomorphologic regions of the country (Fig. 1), but without representation of the relatively humid northwestern area. Almost half of all catchments are semiarid with an aridity index (UNEP 1997) below 0.5. Around 82% of the catchments have an aridity index between 0.29 and 0.65 and are thus considered to be semiarid to dry subhumid. The remaining catchments are humid–subhumid to humid. The measured total sediment yield (SY) varies between ∼3.5 and 5,300 × 103 t a–1, and the area-specific sediment yield (SSY) varies between 10 and 2,600 t km–2 a–1. The average surface area draining to the reservoirs is about 1,251 km2.

Fig. 1
figure 1

Location of the 61 reservoirs in Spain

2.2 Quantitative catchment properties and correlations

An extensive database of quantitative variables was prepared for each of the 61 catchments (Table 2). The selection of variables included in the analysis was based on the availability of the data for all 61 catchments. Furthermore, it was intended to include variables that are most widely reported to be related to soil erosion and sediment transport processes characterising topography, climate, land use, catchment morphology, drainage network, lithology and soil properties. The variables related to catchment morphology, drainage network and topography were extracted from the Shuttle Radar Topography Mission (SRTM) 3 arc-second Digital Elevation Model (DEM) distributed by the USGS EROS data centre (http://seamless.usgs.gov/; Rabus et al. 2003), using IDRISI software (Eastman 2003). This DEM was mosaiced and resampled to 100-m spatial resolution by bilinear interpolation. Beside the calculation of catchment area, slope gradient, minimum and maximum elevation, the DEM was used to extract a river network for all catchments by assuming a channel when the upslope contributing area is larger than 0.2 km2. This threshold roughly corresponds with the river network and drainage density indicated on topographical maps at a scale of 1:50,000. One of the reasons for the use of drainage density as an explanatory factor is that it may be a proxy for gully presence. Since no numbers or maps were available on gully density for all catchments, it was decided to extract the river network as detailed as possible based on the SRTM DEM, in the hope that this drainage density would also be representative for gully density. The threshold of 0.2 km2 provided the most detailed reasonable drainage network without deformations of the drainage network. For calculation of the climate variables, gridded (1 km) mean monthly rainfall and air temperature data, based on station data of the Spanish National Meteorological Institute (INM) for the period 1971–2000, were used. Mean annual precipitation for the catchments varies between 350 and 1,400 mm, and the mean annual temperature varies between 8°C and 18°C. The Precipitation Concentration Index (PCI; Oliver 1980), the Modified Fournier Index (MF; Arnoldus 1977), and the rainfall erosivity R factor (Rf; Renard and Freimund 1994) from the Revised Universal Soil Loss Equation were calculated as:

Table 2 Summary and source of variables used in the correlation analysis
$$ PCI = \frac{{\sum\limits_{{i = 1}}^{{12}} {p_i^2} }}{{{P^2}}} * 100 $$
(1)
$$ MF = \frac{{\sum\limits_{{i = 1}}^{{12}} {p_i^2} }}{P} $$
(2)
$$ {\hbox{Rf}} = 0.7397 * {\hbox{M}}{{\hbox{F}}^{{1.847}}},\;{\hbox{for}}\;{\hbox{MF}} < 55 \; {\hbox{mm}} $$
(3)
$$ {\text{Rf}} = 95.77 - 6.081*{\text{MF}} + 0.4770*{\text{MF}}^{2} ,\;{\text{for}}\;{\text{MF}} > 55{\text{mm}} $$
(4)

Here, p i is the average monthly precipitation (millimetre), and P is the average annual precipitation (millimetre). Land use information was extracted from the CORINE Land Cover map (CLC2000) of the European Environment Agency (EEA 2000). The CORINE map has a resolution of 100 m, and contains 44 land cover classes of which the classes that were considered most relevant for erosion prediction were selected. The variables related to lithology, soil texture and soil type were extracted from the European Soil Geographical Database at 1-km resolution (ESB 2004). This database (ESDBv2 Raster Archive) is freely available from the website of the European Soil Bureau (http://eusoils.jrc.it/).

2.3 Statistical analysis and model validation

A correlation analysis was performed to obtain insight into the explanatory value of individual variables for prediction of sediment yield and to identify the relation between the explanatory variables. Then, in order to find a predictive model of sediment yield, a stepwise linear regression analysis was performed with SAS statistical software (SAS 1999), using only those variables with a relatively high correlation coefficient from the correlation matrix. The stepwise regression combines forward and backward selection using a significance level threshold of 0.15. Since model development for absolute sediment yield (SY; t a–1) or specific sediment yield (SSY; t km–2 a–1) provides different results (de Vente et al. 2005, 2008), model equations were calibrated for both SY and SSY.

To compare the obtained linear regression equations with other previously developed prediction equations, we also applied the Area Relief Temperature sediment delivery model (ART) that was developed for assessment of prehuman sediment load of rivers worldwide (Syvitski et al. 2003, 2005). The ART model is a nonlinear regression model that was trained on a global database of 340 rivers and consists of 5 regression equations for 5 climatic zones (see also Table 1). The ART model equations predict long-term sediment load (Qs; kg s−1) based on catchment area (A), large-scale relief (height difference; HD), and mean annual air temperature (T m). The ART regression equation for temperate regions at northern latitudes (>30°) was defined by Syvitski et al. (2003) as:

$$ {\hbox{Qs}} = 6.1 * {10^{{ - 5{ }}}} * {{\hbox{A}}^{{0.55{ }}}} * {\hbox{H}}{{\hbox{D}}^{{1.12}}} * {{\hbox{e}}^{{0.07 * {\rm{Tm}}}}}\left( {{{\hbox{R}}^2} = 0.63;\;{\hbox{n}} = 162} \right) $$
(5)

Although ART was calibrated mostly for large catchments, the model is assumed to be applicable at a global scale and was also applied to smaller catchments (Syvitski and Kettner 2007). Therefore, Eq. 5 was applied to the 61 Spanish catchments, and the predicted sediment load (Qs; kg s−1) was converted to absolute sediment yield (SY; t a−1) by multiplication of Qs with 31,536, and to area-specific sediment yield (SSY; t km−2 a−1) by subsequent division with catchment area.

The prediction accuracy of all regression equations was evaluated by the proportion of explained variance (R 2), the adjusted R 2, the Model Efficiency (ME) and the Relative Root Mean Square Error (RRMSE). The ME and RRMSE are calculated as (Nash and Sutcliffe 1970):

$$ ME = 1 - \frac{{\sum\limits_{{i = 1}}^n {{{\left( {{O_i} - P{d_i}} \right)}^2}} }}{{\sum\limits_{{i = 1}}^n {{{\left( {{O_i} - {O_{{mean}}}} \right)}^2}} }} $$
(6)
$$ RRMSE = \frac{{\sqrt {{\frac{1}{n}\sum\limits_{{i = 1}}^n {{{\left( {{O_i} - P{d_i}} \right)}^2}} }} }}{{\frac{1}{n}\sum\limits_{{i = 1}}^n {{O_i}} }} $$
(7)

Here, n stands for the number of observations, O i is the observed value, Pd i is the predicted value and O mean is the mean observed value. The ME can range from −∝ to 1 and represents the part of the initial variance accounted for by the model. So, the closer the ME approaches 1, the more efficient the model is. The RRMSE is independent of the units, and the smaller the RRMSE value, the more accurate the model prediction.

Validation is necessary to test the robustness of the linear regression equations and explore the potential for extrapolation to other datasets. This is mostly done by splitting the dataset into two parts, one for model development and one for model validation. In this case, the models were validated using the “Jackknife” procedure (Shao and Tu 1995). In this procedure, n − 1 catchments are used for calibration of the model. This calibrated model is then used for prediction of sediment yield in the remaining catchment. This is repeated n times in order to obtain independently predicted sediment yield for each catchment. This means that 61 test models were constructed, with the same independent variables, each of which excluding one reservoir at a time, and after which the model was applied for predicting SY and SSY for the remaining reservoir. The observed sediment yield of the excluded catchment was compared to the predicted value of the corresponding test model. Model performance was evaluated by the R 2, ME and the RRMSE.

3 Results

3.1 Correlation between catchment properties and sediment yield

Table 3 shows a correlation matrix between catchment properties and area-specific (SSY) and absolute (SY) sediment yield. The nonparametric Spearman correlation coefficient was used since practically none of the variables are normally distributed. Some of the variables have a lognormal distribution (e.g. A, SL, DD, S, RR, MAP, MF), while most of the land use, lithology and soil-related variables show a skewed distribution because of many zero values for catchments where this class does not occur. Only variables with a relatively high correlation coefficient (>0.20) and those for which a relevant relation with other variables was found are included in Table 3. Only a few variables are significantly (α = 5%) correlated with sediment yield. The most important explanatory variables for both SY and SSY are catchment area (A), stream length (SL), relief ratio (RR), Modified Fournier Index (MF), RUSLE R factor (Rf), catchment perimeter (Peri) and the percentage of poorly vegetated areas (PV). The high correlation coefficients between some of the variables point towards (multi-)collinearity between some variables. For example, stream length (SL), catchment area (A) and perimeter (Peri) are strongly correlated, and also, the percentage of soils with a fine texture and the percentage of argillaceous rocks show a strong correlation. For comparison, correlations were also assessed based on the Pearson correlation coefficient. Although correlations were slightly different, this provided the same selection of main explanatory variables as based on the Spearman correlation coefficient.

Table 3 Correlation matrix of a selection of catchment properties and sediment yield for 61 Spanish catchments, indicating the Spearman correlation coefficient

Figure 2 illustrates some of the important correlations in scatter plots. Catchment area shows a strong negative correlation with SSY and with slope gradient. In other words, large catchments have lower mean slope gradients and lower SSY than small catchments. All climate variables show a positive correlation with SSY, stressing the importance of rainfall amount and seasonality. Perhaps, counterintuitively, a negative correlation was observed between mean annual precipitation and drainage density. Less surprising is that most arable land appears to be on less steep slopes, while on steep slopes, high percentages of forest are found. Although the correlation matrix shows relatively high correlations, the scatter plots of Fig. 2 reveal ambiguous relations between SSY and most land cover and lithology classes, in part due to the many zero observations for these classes. The clearest positive correlations were found between the percentage of poorly vegetated areas (PV) and both SSY and SY, and the percentage olive and almond orchards (OlAlm) and SSY (see Fig. 2 and Table 3).

Fig. 2
figure 2

Scatter plots to illustrate some of the relations found between environmental variables and sediment yield

3.2 Multiple linear regression equations

Because of the lognormal distribution of SY and SSY, the natural logarithm of area-specific sediment yield (lnSSY) and absolute sediment yield (lnSY) were used as dependent variables in multiple linear regression analysis using stepwise forward and backward selection. The models with the highest explanatory value for lnSSY and lnSY respectively are:

$$ \ln {\hbox{SSY}} = 1.353 + 0.065 * {\hbox{S}} + 0.108 * {\hbox{RR}} + 0.337 * {\hbox{PCI}} - 0.026 * {\hbox{Mat}} - 0.013 * {\hbox{Lime}} - 0.041 * {\hbox{Meta}} + 0.038 * {\hbox{Fluv}} $$
(8)

(R 2 = 0.55; Adj-R 2 = 0.49; ME = 0.55, RRMSE = 0.14; p value < 0.0001; n = 61)

$$ \ln {\hbox{SY}} = 10.18 + 0.049 * {\hbox{S}} + 0.011 * {\hbox{MF}} + 0.025 * {\hbox{OlAlm}} - 0.027 * {\hbox{Mat}} - 0.013 * {\hbox{Lime}} - 0.038 * {\hbox{Meta}} + 0.041 * {\hbox{Fluv}} + 0.003 * {\hbox{Peri}} $$
(9)

(R 2 = 0.57; Adj-R 2 = 0.50; ME = 0.57, RRMSE = 0.07; p value < 0.0001; n = 61)

Although there is a relatively high correlation between slope gradient (S) and the Relief Ratio (RR; see Table 3), there is no problem of collinearity in Eq. 8 since the tolerance and variance inflation factor are within acceptable ranges (SAS 1999). The relatively good calibration results are, however, strongly influenced by a few observations as is indicated by the studentised residual of the predictions (i.e. studentised residual >2 or <−2). Based on this criterion, the El Gergal catchment was identified as an outlier for SSY, and the catchments of Embarcaderos, Riudecañas and Oliana were considered as outliers for SY (Fig. 3). For the Oliana and El Gergal catchments, no explanation was found for the high residual of the predictions, whereas Embarcaderos and Riudecañas represent by far the largest and smallest catchments of the database, respectively, which may be an explanation as to why they appear as outliers. The models were calibrated again without the outliers using a stepwise regression with the same variables, resulting in the following equations for lnSSY and lnSY respectively:

$$ \ln {\hbox{SSY}} = 1.891 + 0.067 * {\hbox{S}} + 0.110 * {\hbox{RR}} + 0.278 * {\hbox{PCI}} - 0.029 * {\hbox{Mat}} - 0.012 * {\hbox{Lime}} - 0.042 * {\hbox{Meta}} + 0.039 * {\hbox{Fluv}} $$
(10)

(R 2 = 0.58; Adj-R 2 = 0.52; ME = 0.58; RRMSE = 0.13; p value < 0.0001; n = 60)

$$ {\text{In}}\;{\text{SY}} = 6.768 + 0.055*{\text{S}} + 0.382*{\text{PCI}} - 0.025*{\text{Mat}} - 0.012*{\text{Lime}} + 0.059*{\text{Fluv}} + 0.005*{\text{Peri}} $$
(11)

(R 2 = 0.58; Adj-R 2 = 0.53; ME = 0.58; RRMSE = 0.06; p value < 0.0001; n = 58)

Fig. 3
figure 3

Calibration results of the multiple linear regression models. Presented are the natural logarithm of predicted and observed area-specific sediment yield (SSY) and absolute sediment yield (SY) with all observations (upper) and after removal of outliers (lower)

Again, the tolerance and variance inflation factor indicate that there is no collinearity problem in these equations, so all parameters were maintained.

The main part of variance in SSY is explained by the Relief Ratio (RR), both in the equation with and without outliers (Table 4). All other parameters explain much less of the variance in SSY, with lowest contribution from the percentage of Fluvisols (Fluv), which is also the least significant model parameter, with and without outliers. For SY, the main part of variance is explained by the catchment perimeter (Peri), followed by the Modified Fournier Index (MF). Least variance in SY is explained by the percentage of olive and almond orchards (OlAlm), which together with the percentage of Fluvisols (Fluv) are the least significant model parameters. Without outliers, Peri is the single most significant parameter explaining by far the main part of variance. In the model without outliers, the MF is substituted by the PCI, but with a much lower contribution to the explained variance in SY.

Table 4 Summary of statistics per variable for the regression equations for area-specific sediment yield (SSY) and absolute sediment yield (SY)

Calibration results of both models were only slightly better without outliers than with all observations (Table 5 and Fig. 3). Therefore, it was decided to validate the models both with and without the outliers using the “Jackknife” procedure as described above (Shao and Tu 1995). When validated for all 61 observations, the R 2 and the ME decrease dramatically compared to the calibration for both SY and SSY (Table 5 and Fig. 4). There is a large scatter around the 1:1 line, and the ME for both models is negative, meaning that the model produces more variation than is present in the measured values. Without the outliers, the validation results are better but still far worse than the calibration results. Validation results for SY are slightly better than for SSY, both with and without outliers. For comparison, it was also tested if a log transformation of those variables with a lognormal distribution included in the analysis would provide different results for the regression analysis. Although prediction equations were slightly different, validation results were very similar to those described above. Since normality of explanatory variables is not required for regression analysis, and to maintain the focus of this paper, we do not present all details of this exercise here.

Table 5 Summary of calibration and validation results of the regression equations for area-specific sediment yield (SSY; t km−2 a−1) and absolute sediment yield (SY; t a−1)
Fig. 4
figure 4

Jackknife validation results of the multiple linear regression models. Presented are the natural logarithm of predicted and observed area-specific sediment yield (SSY) and absolute sediment yield (SY) with all observations (upper) and after removal of outliers (lower)

3.3 The ART sediment delivery model

The ART model (Syvitski et al. 2003, 2005) was applied directly for prediction of sediment yield in the 61 Spanish catchments. Table 6 and Fig. 5 show an evaluation of the quality of the predictions by comparison of predicted and observed sediment yield. Based on the studentised residual of the predictions with the ART model, several outliers were identified, and so, the model was also applied without these outliers (see Table 6 and Fig. 5). For SY, the catchments of Embarcaderos, Cijara and Doña Aldonza appeared as outliers, while for SSY, the outliers were Riudecañas, La Cierva, Conde de Guadalhorce, and El Gergal.

Table 6 Validation of the ART model (Syvitski et al. 2003) for area-specific sediment yield (SSY; t km−2 a−1) and absolute sediment yield (SY; t a−1) as observed in the Spanish catchments
Fig. 5
figure 5

Validation results of the ART model (Syvitski et al. 2003). Presented are the predicted and observed area-specific sediment yield (SSY) and absolute sediment yield (SY) with all observations (upper) and after removal of outliers (lower)

Both with and without outliers, the R 2 of the ART model predictions is relatively high. However, for all ART model predictions, the ME is very low, and the RRMSE is more than 1, meaning that predictions strongly deviate from the line of unity. The model generally overestimates sediment yield (SSY and SY), especially for the lower sediment yield values. Validation results for SSY are better than for SY, both with and without outliers. For SSY, removal of outliers results in a higher R 2 and ME, whereas for SY, removal of outliers results in a higher R 2 but much lower ME.

4 Discussion

Based on the calibration of the linear regression equations, it seems that predictions of absolute (SY) and area-specific sediment yield (SSY) are reasonably accurate. There is strong agreement in the environmental factors that appear in the equation for prediction of SY and SSY, and model performances are also comparable. Both regression equations represent at least one variable related to topography, climate, vegetation, lithology and soil characteristics, as such representing a range of environmental factors. Most significant model parameters were related to topographic relief (RR for SSY), catchment size and annual rainfall distribution (Peri and MF for SY). On the other hand, the poor validation results imply that the models are very unstable and are not useful for extrapolation to other catchments, although after removal of the outliers the validation results became somewhat better. This stresses the importance of performing a model validation and not trusting only calibration results, as is often done in regression studies. This was also confirmed by Verstraeten and Poesen (2001) who found that good regressions in calibration do not always imply good model validation results. Likewise, Grauso et al. (2008) reported that application of regression equations developed for a range of Italian catchments did not yield accurate results for prediction of sediment yield in 16 Sicilian catchments. Yet, still most studies presenting a regression equation include a correlation coefficient for calibration but do not apply a proper validation procedure with independent data (e.g., Delmas et al. 2009; Grauso et al. 2008; Restrepo et al. 2006; Syvitski et al. 2003; Tamene et al. 2006; Verstraeten et al. 2003).

Notwithstanding the use of more variables representing a wider range of environmental characteristics for 61 catchments, the regression in this study was not more stable and did not perform better than the regression equation calibrated for a subset of 22 catchments by Verstraeten et al. (2003). Although fewer variables were used in their analysis, some very similar variables were selected in the stepwise regression of both studies (i.e. Relief Ratio, olive and almond orchards, % Matorral Forest, poorly vegetated areas). Yet, both the regression equations for the 22 and for the present 61 catchments performed worse than the semiquantitative Factorial Scoring Model (FSM) that was applied to the same catchments and showed better calibration and validation results (i.e. R 2 = 0.72; ME = 0.72; RRMSE = 0.65; de Vente et al. 2005). FSM is an expert model where five factors (i.e. topography, lithology, vegetation cover, gullies, catchment shape) are used to characterise a watershed in the vicinity (∼5 km) of the reservoir and the main tributaries by providing a score between 1 and 3. An index is calculated by multiplying the five scores, and together with catchment area, this index is used to obtain a regression equation for the prediction of SSY (de Vente et al. 2005, 2006; Verstraeten et al. 2003).

The ART model with only three variables that was calibrated for global rivers (Syvitski et al. 2003) provides much higher R 2 values between predicted and observed sediment yield (SSY and SY) of the Spanish catchments than the linear regression equations (Eqs. 811). However, the low ME and high RRMSE of the predictions with the ART model suggest that these are not necessarily better. For most catchments, the ART model strongly overpredicts sediment yield. According to Syvitski et al. (2003, 2005), the overestimation of sediment yield can be explained by the fact that reservoirs trap substantial volumes of sediments and therefore reduce the expected sediment yield for undisturbed conditions. For the Spanish reservoirs, this might indeed explain part of the overestimated sediment yield by the ART model, because there are reservoirs upstream of most of the reservoirs used for the present study. On the other hand, important errors can also be due to factors that are not considered by the model such as human impact (including conservation measures), lithology (de Vente et al. 2005; Woodward 1995) tectonics and the rate of rock uplift (Hovius 1998). Part of these issues may be solved by the BQART model, a successor of the ART model that integrates factors on river discharge (Q), lithology and human impacts, including dams and erosion control measures (B; Syvitski and Milliman 2007). Yet, reliable and detailed information on these factors is often difficult to obtain. It should also be remembered that the ART model was calibrated mostly for large global river basins worldwide. Calibration of the ART model for the Spanish data, similar to that done for the other regression equations, may give much better results.

Although some additional variables may be required, as well as a more regionalised calibration of model parameters, it is certainly promising to observe that the ART model with only three variables has the potential for reasonable predictions of sediment yield at the global scale. Moreover, the relevance of these three variables is confirmed by the most significant model parameters in the multiple regression equations of this study. As in the ART model, factors related to topography, catchment size, and climate (RR, Peri, MFI) explained the main part of variation in sediment yield. The RR is a combination of the A and R factor of ART, whereas catchment perimeter is highly correlated with A (see Table 3). On the other hand, the ART model uses temperature instead of a rainfall-related factor. Syvitski et al. (2003) considered temperature as a more complete climate factor than rainfall alone, since temperature relates to the occurrence of convective, high-intensity rainstorms, but also to soil formation, soil erosion, runoff from melting snow, and freeze–thaw cycles (Harrison 2000; Syvitski et al. 2003). The role of mean annual temperature (Tm) was, however, not confirmed by our regression analysis, maybe because the differences in temperature between the Spanish study catchments were not large enough.

There are various possible explanations for the unstable model performance of the linear regression equations. First of all, land use information does not necessarily represent the percentage soil cover by vegetation, whereas the latter is especially important for erosion predictions (Casermeiro et al. 2004; Gyssels et al. 2005; Molina et al. 2007). Secondly, due to land use changes, land use as indicated on the CORINE Land Cover map (CLC2000) may not be representative for land cover conditions during the period for which the measured reservoir sedimentation rates are valid. Large-scale reforestations and extensions of olive and almond orchards that have taken place in the last decades (Cammeraat and Imeson 1999; García-Ruiz 2010; Poesen et al. 1997; Rojo Serrano 2003; van Wesemael et al. 2003) may provide a false view of the soil protection by vegetation during the lifetime of the reservoir. Furthermore, due to the low resolution of the soil and lithology maps, important detail is probably missed, and so, a higher resolution may increase the explanatory value of these variables. Next, drainage density is a typical example of a scale-dependent variable (Abrahams 1984; Vogt et al. 2003). In this study, we automatically extracted a river network from a DEM, using a threshold of 0.2 km2, and used this river network to calculate the drainage density. Surprisingly, a (nonsignificant, p value of 0.08) negative correlation was found between drainage density and SSY, and no correlation was observed with SY, which contrasts with various studies that considered drainage density to be strongly related to both sediment yield and mean annual precipitation (Abrahams 1984; Langbein and Schumm 1958). In agreement with our results, some studies reported a decreasing or complex correlation between drainage density and mean annual precipitation (Abrahams 1984), whereas others reported an increasing drainage density with increasing precipitation (Daniel 1981). Most studies explained the relation between mean annual precipitation and drainage density by differences in vegetation cover and runoff generation under diverse climatic conditions. In our analysis, however, drainage density did not appear as a significant variable in any of the multiple regression models (alpha 5%). A better result may be obtained when additional information on lithology, vegetation cover and climate is used to determine the threshold value in a stratified manner (Colombo et al. 2007; Vogt et al. 2003), or when drainage density is extracted from aerial photographs (Allen 1986).

Errors in the measured sediment yield and in the source data of the explanatory variables may also explain part of the poor model validation results. Furthermore, the variation in environmental conditions and in the dominant erosion and sediment transport processes between the 61 catchments may be too large to be explained by one single regression equation, or it may be that important explanatory variables were missing in the analysis. For example, mean annual runoff-related variables might explain an important part of mean annual sediment yield (Restrepo et al. 2006; Syvitski et al. 2003). Comparison of soil loss rates obtained from field plot studies with reservoir sedimentation rates in Mediterranean environments also suggests that gully erosion, mass movements and bank erosion are mainly responsible for sediment export at the catchment scale (de Vente et al. 2007, 2008; Poesen and Hooke 1997; VanMaercke et al. 2011). These processes operate at distinct locations in the landscape, and factors controlling these processes are difficult to parameterise by one single value for the entire catchment.

Many of the possible improvements to deal with the abovementioned problems require a major amount of additional processing and input of data that are often not easily available for large watersheds. Yet, at least equally important as data quality is the model concept used. Apart from different variables used for prediction, some models multiply factors (ART and FSM), while other models use the sum of variables (regression). Based on the current information, it seems that those models accounting for the interaction between environmental factors and for nonlinear relations through multiplying factors and by using power functions, such as the FSM and ART models, potentially perform better than linear models such as the linear regression models (de Vente 2009).

Although the linear regression model was not stable, the correlation analysis and regression equations do provide more insight into relevant relations between environmental factors and sediment yield, and increase our understanding of sediment dynamics at the catchment scale. This helps to identify important factors that constrain erosion and sediment transporting processes, and to select factors to be considered in more sophisticated model approaches. For example, the negative relation between catchment area and SSY that was found in this study is often reported in the literature, and is explained by the fact that during transport of sediments through a catchment, increasingly more sediment is trapped on footslopes, alluvial plains and other sinks, while erosion rates decrease due to decreasing hillslope gradients downstream (de Vente et al. 2007; Walling 1983). This theory is supported by the negative correlation between catchment area and mean hillslope gradient and the positive correlation between slope gradient and SSY observed in this study. Furthermore, our results suggest a positive correlation between mean annual precipitation and sediment yield, although the increase in sediment yield with annual rainfall is not as strong as was reported for Moroccan catchments (Woodward 1995). This uninterrupted, positive relation deviates from the relations suggested in earlier studies where SSY often first increases and then decreases with increasing mean annual rainfall (Langbein and Schumm 1958; Walling and Kleo 1979; Wilson 1973). This trend is still frequently referred to and was originally explained by low runoff rates and the absence of a protective vegetation cover in areas with very low precipitation amounts, and the presence of a vegetation cover protecting the soil beyond a certain threshold of annual precipitation (Langbein and Schumm 1958). The deviation from these trends in Spain and Morocco may be due to widespread human disturbances that impede a direct relation between rainfall amounts and vegetation cover that is expected in areas with a more natural vegetation cover.

For practical applications and to support policy decisions in a given region, regression equations form an attractive option because they are relatively easily applicable and it is tempting to apply an existing equation. However, the results of this study confirm that the use of regression equations beyond the region and conditions for which the model was developed is not a guarantee of good results. Furthermore, the regression analysis is based on lumped variables, which allows prediction of sediment yield but does not identify where the sediments originate, where erosion problems are most severe, or which erosion and sediment transport processes are dominant. To answer these questions, other spatially distributed approaches are required.

5 Conclusions

Correlation analysis reveals that, for 61 Spanish catchments, drainage area, catchment perimeter, stream length, relief ratio, the Modified Fournier Index and the percentage poorly vegetated areas have the highest correlation coefficients with area-specific (SSY) and absolute (SY) sediment yield. However, a combination of topographic, climatic, lithologic, vegetation and soil variables is required to provide the best linear regression equation to predict sediment yield. Although calibration results after a stepwise regression are reasonably good, validation of these equations shows that they are very unstable and cannot be used for extrapolation. On the other hand, application of the ART sediment delivery model (Syvitski et al. 2003) for prediction of sediment yield in the Spanish catchments shows a relatively high correlation between predicted and observed values, although predictions systematically overestimate observed sediment yield. Beside measurement errors in sediment yield and in the explanatory variables, there are various explanations for the poor model validation results in general: (1) not all controlling erosion and sediment transport processes are well represented by the available data; (2) land use is not representative of vegetation cover; (3) the land use, lithology and soil data are of a too low resolution; (4) land use is not a static property and will have changed during the lifetime of the reservoirs; and (5) nonlinear relations between sediment yield and environmental variables, and nonlinear interactions between those variables are not (sufficiently) accounted for by the models.