1 Introduction

Soil erosion exerts significant impacts on societal development. Severe soil erosion leads to soil nutrient loss, water quality degradation, river siltation, and reservoir capacity loss (Liebe et al. 2005; Khaba and Griffiths 2017; Emamgholizadeh et al. 2018; Gonzalez 2018). Quantifying the sources of sediments is critical for developing appropriate and site-specific conservation measures that effectively control soil erosion within a watershed (Zhang et al. 2016a). Information on sediment sources is unfortunately limited, because most types of soil erosion and transport processes are spatially complex and difficult to measure at a watershed scale. Measuring the contributions from different sources of sediment is primarily completed using three methods. The first is a conventional method, such as erosion pins (Davis and Gregory 1994; Bak et al. 2013; Boardman et al. 2015), mapping (Cao and Coote 1993), remote sensing (Thakur et al. 2012), and photogrammetry (Barker et al. 1997), but these approaches are mainly used in a relatively small watershed with the same type of dominant soil erosion. Using the conventional method, it is almost impossible to directly measure multiple erosion types within a watershed. The second is the fingerprinting technique, which can estimate sediment source contributions using various natural sediment tracers, or fingerprints, by linear mixing models (Collins et al. 1996; Krause et al. 2003; Stone et al. 2014; Zhang et al. 2016a). The fingerprinting method is suitable for any watershed size and complexity, so long as distinct fingerprint properties exist in different sediment sources (Zhang et al. 2016a). The third is the application of artificial tracers (Haddadchi et al. 2014; Zhang et al. 2017), which is mainly suitable for small watersheds or erosion plots but not for large watersheds.

In early fingerprinting method studies, typically a single fingerprint property was used to identify a sediment source (Lance et al. 1986; Roo 1991; Olley et al. 1993). When collected sediment has more than two sources or the spatial variability of the source is large, the discrimination of a single property is limited. For more complex scenarios, a composite fingerprint with multiple properties of distinct types can be used to identify sediment sources, which could vastly improve the partitioning of a sediment source (Collins et al. 1997, 1998, 2010; Koiter et al. 2013; Laceby and Olley 2015; Zhou et al. 2016). An “optimal” composite fingerprint that was selected to obtain maximized discriminant ability with minimum tracers was proposed by Collins et al. (1997) and widely used in the literature. However, Zhang and Liu (2016) found there was weak relationship between the ability of a tracer in distinguishing sources and its power in estimating contributions of the sources to sediment mixtures (sink) if the tracer is non-conservative during transport. However, quantifying the conservativeness of a tracer in a watershed is not an easy task, if not impossible, requiring enormous amounts of sampling and chemical analyses of different particle size classes. To circumvent this weakness, an approach using multiple composite fingerprints was proposed by Zhang and Liu (2016). The new approach uses a maximum number of composite fingerprints composed of non-contradictory tracers to obtain multiple analytical or numerical solutions, and uses the mean estimates of the multiple composite fingerprints as source contributions. From a statistical point of view, the contributions of different sediment sources averaged over multiple composite fingerprints are more likely be closer to the real values than any estimates using a composite fingerprint alone, when multiple contribution proportions are approximately normally distributed. The method has been verified by contrasting with other methods (Zhang and Liu 2016), and further compared to the results from 137Cs analysis with greater confidence (Zhang et al. 2016b). Some studies showed that the method of multiple composite fingerprints could obtain reasonable results in watersheds dominated with eolian or water erosion (Liu et al. 2016; Zhang et al. 2016a).

In previous studies, the particle size fraction of < 0.063 mm is most commonly used to identify source contributions within a watershed. Furthermore, particle size correction factors are often used to addition to account for any remaining difference in grain size between the source and the downstream sediment (Collins et al. 1998; Walling 2005; Devereux et al. 2010; Hancock and Revill 2013; Palazon and Navas 2017; Koiter et al. 2018). Other particle size fractions are also used, such as < 0.002 mm fraction (Gingele and Deckker 2005), < 0.004 mm fraction (Wallbrink et al. 2003), < 0.010 mm fraction (Wilkinson et al. 2013; Laceby and Olley 2015), < 0.053 mm fraction (Zhang and Liu 2016), < 0.25 mm fraction (Evrard et al. 2013), as well as multiple particle size groups (Haddadchi et al. 2015). The multiple particle size tracking has distinct advantages to identify sediment sources, but is more time consuming and costly (Collins et al. 2017). The sediment sources of the different particle size fractions may have different transport modes, and also have different negative effects on a catchment network requiring different treatment strategies. Therefore, based on the particular situation present in a watershed, the particle size fraction used in identifying main sediment source needs to be selected carefully.

When using fingerprint technology to identify sediment sources, providing estimated uncertainty of source contributions allows for more informed decisions on sediment and water management (Minella et al. 2008). Thus, it is critical to pay more attention to uncertainty assessment (Walling 2013). In recent years, both statistical and analytical methods were used to assess uncertainty. The former includes Monte Carlo simulations and Bayesian uncertainty framework, and the latter includes Gaussian first-order approximations. Monte Carlo simulations are mainly used to repeatedly simulate, with random sampling, the distributions of each input tracer in sediment mixtures and from each source, and to generate the probability distribution for the output (Motha et al. 2003; Nagle et al. 2007; Collins et al. 2010; McKinley et al. 2013; Stone et al. 2014; Zhang and Liu 2016). Bayesian uncertainty frameworks are used to simulate probability distributions of estimated source proportions (Koiter et al. 2013; Stewart et al. 2015). In most studies, contributions of different sediment sources are calculated using average values of sediment sources and sediment mixtures in the mixing model, and the use of the averages is an important source of uncertainty in the source contribution estimates (Lamba et al. 2015). If the uncertainty band is not explicitly estimated, this situation often causes false certainty on estimated proportions. The Gaussian first-order approximation can be readily used to obtain uncertainty contributions from each potential source and mixture for the mean contribution of a certain sediment source; however, the method is rarely applied in the literature.

In the current investigation, a multiple composite fingerprinting method was used to quantify sediment source contributions in the upper reaches of the Danghe Watershed in Northwest China. Because both water and wind erosion predominate the watershed and as a result of sediment sorting during transport being severe, tracking multiple particle size groups was adopted to minimize estimation errors caused by sediment sorting and the Gaussian first-order approximation to quantify uncertainty of the mean source contributions.

2 Materials and methods

2.1 Study watershed

The area of investigation is within the mountainous region of the Danghe Reservoir Watershed located in Northwest China along the western Hexi Corridor. The Danghe watershed has a drainage area approximately 16,970 km2 and is largely composed of dunes, Gobi desert, and mountains. The mountain region comprises a majority of the overall watershed area (15,281 km2) and is the primary sediment source of the Danghe Reservoir (Niu et al. 2019). Thus, the mountain region was evaluated more closely as a focused area of interest for this investigation (Fig. 1c). To further partition the potential sediment sources, the mountain region was subdivided into north and south alluvial fans and high mountains (Fig. 1b).

Fig. 1
figure 1

Location of the study area and sampling sites (A: Map of China; B: Source are classification; C: Watershed map of the study area; D:Sampling sites; E: Sediment trap)

The alluvial fans are predominantly located on the piedmont of the mountains between 1800 and 2800 m above mean sea level. The alluvial fans located north of the Danghe River encompass a drainage area approximately 728 km2, while alluvial fans to the south incorporate 228 km2. The ground surface of the north alluvial fan is primarily a composite of hard rocks that mainly include quartz diorite, siliceous rock, and limestone, and they are weathering resistant. However, the south alluvial fan is mainly a composite of orange-red clastic rocks and a small amount of the exposed bedrocks (Proterozoic Sinian), and the former is more easily weathered and eroded than the latter. Furthermore, compared to the south alluvial fan, the north alluvial fan has an abundance of anthropogenic activities, such as agriculture and urban development. The surface materials are significantly different between the two alluvial fans.

The high mountains, defined as the area above 2800 m mean sea level, occupy a significant portion (~ 93.7%) of the research area with an area of 14,325 km2. The high mountains are characterized by rough steep terrains and alpine canyons under complex geomorphologic patterns. The geological composition of the mountains is complex with many types of rocks, such as flash feldspar, quartz diorite, monzonitic granite, mudstone, river and lake facies clastic rocks, sand, gravel, and silty clay. Therefore, the sediment mixture of mountains should be different from the alluvial fans.

The average annual rainfall is about 86–280 mm in this research area. The average annual rainfall amounts are different between alluvial fans and high mountains. Because the annual rainfall increases by 8–12 mm as the altitude increases by 100 m, the high mountains have greater amounts of precipitation than the piedmont fans. To compound the situation further, precipitation events in the high mountains tend to be more intense due to more active convection, often resulting in flash floods.

2.2 Sample collection and processing

The Danghe Reservoir Watershed upstream mountain region, which includes the north alluvial fan, south alluvial fan, and high mountains, was identified as potential sediment sources according to their associated geomorphic terrains. At the downstream boundary of the upstream mountain region, a short reach in the main river channel was selected to collect sediment samples (Fig. 1d). The idea is that the sediment within the downstream river channel is a combination or mixture of the three potential upstream sediment sources. The downstream river channel sediments were obtained using sediment collection traps (Phillips and Gregg 2001; Zhang and Liu 2016). The traps were constructed from stainless steel pipes (14 cm i.d., and 60 cm long) with outward pointing funnels mounted to each end (a 3-cm opening upstream and a 2-cm opening downstream). Three traps were anchored to the riverbed along a cross-section of the downstream river channel (Fig. 1e). The traps were retrieved and emptied of newly collected sediment on four separate occasions (5/Sept./2016, 18/July/2017, 2/Aug./2017, and 20/March/2018) after flood waters receded. The sediment samples were collected after two heavy rains with large floods on 18/July/2017 (runoff 11.18 million m3/event; rainfall 10.6 mm/event) and on 2/Aug./2017 (runoff 5.61 million m3/event; rainfall 7 mm/event), as well as after two small runoff events on 5/Sept./2016 (rainfall 4.3 mm/event) and 20/March/2018 (snow melt). In conjunction with the sediment traps, newly deposited riverbed surface sediment was also collected along the channel cross-section using a small stainless steel spade after flood receded. The surface sediment samples consisted of five random sampling locations, each of which included a composite of 5–10 subsamples within a single sampling area of roughly 4 m2. In total, nine samples were collected.

Limited accessibility to the high mountains presented a particular challenge for collecting distributed samples that represent the sediment properties of the entire upper domain. Therefore, a single channel section, one which controls the drainage of the high mountains, was selected to collect lumped riverbed surface sediment samples (Fig. 1). The high mountain samples were acquired after the two heavy rainfall-runoff events (18/July/2017 and 2/Aug./2017) and consisted of eight samples collected from the river channel following the spade grab sampling procedure described in the preceding paragraph. A total of eight riverbed surface samples represented the high mountain sediment (Table 1).

Table 1 Sample numbers for six particle size groups and the range of maximum organic matter contents (OMC, %) from sediment mixture

The north and south alluvial fans regions, though much smaller than the high mountains, have many tributaries, some of which are difficult to access. Thus, readily reachable tributary channels representative of typical surface rocks/materials and geomorphic units were chosen from both north and south alluvial fans to characterize the sediment from their respective areas (Fig. 1). At the downstream boundary of selected tributaries, which included three in the northern alluvial fan and four in the south alluvial fan, between 5 and 7 riverbed surface sediment grab samples were collected after the two heavy rainfall-runoff events (18/July/2017 and 2/Aug./2017) (Fig. 1). The sampling procedures remained consistent with the protocols described above and in total 19 samples were collected from the north alluvial fan tributaries and 21 samples from the south alluvia fan tributaries (Table 1). Additionally, nine soil samples were collected from the surface (0–2 cm) of the south alluvial fan where erosion was occurring. Jointly, a sum of 30 sediment samples were obtained from the south alluvial fan.

In the lab, the 66 soil or sediment samples were oven-dried at 105 °C for approximately 12 h. Subsequently, the samples were separated into six particle size categories (< 0.063 mm, 0.063–0.1 mm, 0.1–0.25 mm, 0.25–0.5 mm, 0.5–1.0 mm, 1.0–2.0 mm) using an electric sieve shaker (Table 1 and Electronic Supplementary Material).

2.3 Chemical analysis

In order to perform sediment source fingerprinting or element tracing procedures, the geochemical properties of the sediment samples were analyzed. This process involved crushing all sediment samples in Table 1, including each of the six particle size groups, into powders finer than 75 μm using a multipurpose grinder. Subsequently, 4 g of the resulting powders was pressed into 32 mm diameter pellets under 30 tons of pressure using a pressed powder pellet technique. The resulting pellets from each of the sediment samples were then entered into a fully automated sequential wavelength dispersive X-ray fluorescence spectrometer, which provided element concentrations for 30 different compounds. The chemical elements and oxide compounds analyzed included Cl, P, Ti, V, Cr, Mn, Co, Ni, Cu, Zn, Ga, As, Br, Rb, Sr, Y, Zr, Nb, Ba, La, Ce, Nd, Pb, SiO2, Al2O3, Fe2O3, MgO, CaO, Na2O, and K2O. The element concentrations were in the units of μg g−1, and the oxide concentrations were in percentage. The analytical precision for the macroelements is 5%, and 25% for the microelements.

2.4 Linear mixing model and Gaussian uncertainty approximation

Mathematical un-mixing models are available for the quantitative distribution of sediment provenances. For this investigation, a linear un-mixing model based on the principle of mass conservation was used to estimate the contributions of the potential sediment sources using the following equations:

$$ \sum \limits_{s=1}^m{P}_s\times {S}_{s,i}={C}_i $$
(1)
$$ \sum \limits_{s=1}^m{P}_s=1\;\mathrm{and}\;0\le {\mathrm{P}}_s\le 1 $$
(2)

where Ps is the proportional contribution from a certain size group of source s; Ss,i and Ci are the mean concentrations of fingerprint property i in source s and sediments, respectively; and m is the number of sources. Only (m − 1) tracers are needed to analytically estimate the proportions of m sources under the constraint of 0 ≤ Ps ≤ 1 in Eq. (2). For the three sediment sources (north alluvial fan, south alluvial fan, and high mountains), two tracers were needed to analytically solve Eqs. (1) and (2). The average concentrations of each tracer, from each of the potential sediment sources and sediment mixtures, were used in Eq. (1). Both Eqs. (1) and (2) were solved individually for each of the six particle size classes. As a result of low organic contents in the sediment samples, as well as low concentrations of organic matter in most of the chemical analyses (Table 1), no correction was applied to the samples for organic matter. Some studies have shown that the use of weighting factors for organic matter has resulted in overcorrections (Laceby and Olley 2015). Because the element analysis was carried out for each particle size, rather than for an aggregated sediment sample, differences in grain size were reduced and, therefore, no particle size correction factors were used.

If we can assume that the two tracers from the three sediment sources and the sediment mixture are independent of each other, then Gaussian first-order approximation can be used to calculate the variance of fh (the same for fn and fs) (Phillips and Gregg 2001):

$$ {\sigma}_{f_h}^2={\left(\frac{\partial {f}_h}{\partial {C}_{m_1}}\right)}^2{\sigma}_{C_{m_1}}^2+{\left(\frac{\partial {f}_h}{\partial {C}_{h_1}}\right)}^2{\sigma}_{C_{h_1}}^2+{\left(\frac{\partial {f}_h}{\partial {C}_{n_1}}\right)}^2{\sigma}_{C_{n_1}}^2+{\left(\frac{\partial {f}_h}{\partial {C}_{s_1}}\right)}^2{\sigma}_{C_{s_1}}^2+{\left(\frac{\partial {f}_h}{\partial {C}_{m_2}}\right)}^2{\sigma}_{C_{m_2}}^2+{\left(\frac{\partial {f}_h}{\partial {C}_{h_2}}\right)}^2{\sigma}_{C_{h_2}}^2+{\left(\frac{\partial {f}_h}{\partial {C}_{n_2}}\right)}^2{\sigma}_{C_{n_2}}^2+{\left(\frac{\partial {f}_h}{\partial {C}_{s_2}}\right)}^2{\sigma}_{C_{s_2}}^2 $$
(3)

where fh, fn, and fs are the proportions of high mountains, north alluvial fan, and south alluvial fan. ∂fh (∂fn, ∂fs) is the partial derivative of fh (fn, fs) from Eq. (1). Cm, Ch, Cn, and Cs represent the mean concentration for sediment mixture (sink), high mountains, north alluvial fan, and south alluvial fan, respectively. The subscripts of 1 and 2 represent the first and second tracers in three sediment sources and sediment mixture. \( {\sigma}_m^2 \), \( {\sigma}_h^2 \), \( {\sigma}_n^2 \), and \( {\sigma}_s^2 \) are the variances of the mean concentrations for the sediment mixture, high mountains, north alluvial fan, and south alluvial fan. In reality, two tracers from the same source/sink samples may not be independent of each other. Therefore, covariance terms should be used to account for this (Phillips and Gregg 2001):

$$ {\displaystyle \begin{array}{c}{\sigma}_{f_h}^{2^{\prime }}={\sigma}_{f_h}^2+2\frac{\partial {f}_h}{\partial {C}_{m_1}}\frac{\partial {f}_h}{\partial {C}_{m_2}}{\rho}_{C_{m_1}{C}_{m_2}}{\sigma}_{C_{m_1}}{\sigma}_{C_{m_2}}+2\frac{\partial {f}_h}{\partial {C}_{h_1}}\frac{\partial {f}_h}{\partial {C}_{h_2}}{\rho}_{C_{h_1}{C}_{h_2}}{\sigma}_{C_{h_1}}{\sigma}_{C_{h_2}}+\\ {}2\frac{\partial {f}_h}{\partial {C}_{n_1}}\frac{\partial {f}_h}{\partial {C}_{n_2}}{\rho}_{C_{n_1}{C}_{n_2}}{\sigma}_{C_{n_1}}{\sigma}_{C_{n_2}}+2\frac{\partial {f}_h}{\partial {C}_{s_1}}\frac{\partial {f}_h}{\partial {C}_{s_2}}{\rho}_{C_{s_1}{C}_{s_2}}{\sigma}_{C_{s_1}}{\sigma}_{C_{s_2}}\end{array}} $$
(4)

where ρ is the correlation coefficients between the two tracers for the three sediment sources and sediment mixture. If the two tracers are independent of each other, then \( {\sigma}_{f_h}^{2^{\prime }}={\sigma}_{f_h}^2 \), or the variance of fh is\( {\sigma}_h^{2^{\prime }} \). An approximate 95% confidence intervals (CI) for fh (the same for fn and fs) can be obtained by a t distribution as fh ± t0.05,γ\( {\sigma}_{f_h} \), where γ represent the Satterthwaite approximation for the degrees of freedom associated with \( {\sigma}_{f_h}^2 \) (Phillips and Gregg 2001):

$$ \upgamma ={\left(\sum \limits_i{a}_i{V}_i\right)}^2/\sum \limits_i\frac{{\left({a}_i{V}_i\right)}^2}{d_i} $$
(5)

where ai and Vi are the coefficients and variance from the right of Eqs. (3) and (4) estimated by the average value of tracer concentration, and di is the degree of freedom of sample number (ni − 1). An Excel spreadsheet was used to calculated source proportions, standard errors, and 95% CI in this text. The absolute mean relative error (AMRE) of each element pair in any single particle size group was used to assess the results from the mixing model calculations against sampled values, which was calculated using Eq. (5):

$$ \mathrm{AMRE}=\left|{C}_i-\sum \limits_{s=1}^m{S}_{s,i}{P}_{as}\right|/{C}_i $$
(6)

where Pas is the contribution of s source in each particle size group.

2.5 Tracer screening

The tracer screening process began with the removal of non-conservative tracers according to element concentration. A tracer was considered non-conservative and not used for analysis if an individual or mean concentration of a tracer in a sediment mixtures fell outside of the range between the minimum and maximum concentrations of this tracer in all sources (Wilkinson et al. 2013). The remaining conservative tracers were then tested using the Kruskal–Wallis H test to remove the tracers that could not statistically discriminate among the three sources. For additional screening, the tracer variability ratio was used to remove tracers that have greater variability within sources than between sources (Pulley et al. 2015). The tracer variability ratio was calculated based on the percentage difference of the medians of the tracer concentrations between the two sediment source groups divided by the mean within a source group coefficient of variation (%) for the two source groups. Any tracer with variability ratio less than 1 in any source groups was rejected (Pulley et al. 2015). Finally, the estimated proportions for each source were calculated using analytical solutions. Any pair of tracers that yielded a negative proportion indicated an obvious error, which could stem from sampling error, chemical analysis error, a certain degree of non-conservativeness, or contradiction between the two tracers; and thus were removed. The remaining tracer pairs with positive solutions were used for obtaining the mean contributions of the three sediment sources.

3 Results

3.1 Proportional contributions in different particle size groups

Out of the original 30 chemical elements and oxide compounds analyzed in the particle size group of < 0.063 mm, there were 10 tracers that passed the range and conservativeness tests, the Kruskal–Wallis H test, and the tracer variability ratio test (Table 2). The remaining 10 tracers produced 19 non-contradictory tracer pairs. These 19 tracer pairs were used to estimate the value of source proportions, standard deviation (SD), lower limit (L), and upper limit (U) of the 95% CI (Table 3). The frequency distributions of the 19 tracer pairs from the particle size group of < 0.063 mm are shown in Fig. 2. The estimated proportions of multiple composite fingerprints varied widely for the three potential sediment sources (0.008 to 0.478 for north alluvial fan, 0.007 to 0.573 for south alluvial fan, and 0.202 to 0.598 for high mountains). However, if the estimated proportions of the three sources from any single tracer pair under a certain particle size group were considered, then the estimated proportions from this tracer pair were not so meaningful. This is demonstrated by the wide 95% CI for the each of 19 pairs were larger, indicating that any estimates of the three sediment sources for the mixed sediment from any tracer pairs were unreliable. The large differences in proportions estimated by each tracer pair are hypothesized to primarily result from the differences between tracers in their analytical errors, sampling errors due to spatiotemporal variation, and degrees of the conservativeness during transport. To reduce estimated uncertainties and average out errors, all tracer pairs that produced positive solutions were used to estimate the average proportions, as well as the 95% CI. The resulting source proportion estimates showed significantly narrower 95% CI, and as a result, were more meaningful. Through the use of the average values of all 19 tracer pairs for estimating source contributions, the accuracy was increased and uncertainty reduced. The results also showed that the number of utilized composite fingerprints is crucial for producing meaningful source contributions. In this particle size group, the overall average proportions (SE) were 35.1% ± 0.03, 25.8% ± 0.04, and 39.1% ± 0.03 for the north alluvial fan, south alluvial fan, and high mountains, respectively (Table 4). The absolute mean relative error (AMRE ± SD) calculated with Eq. 5 was approximately 1.3% ± 1.0% (Table 5).

Table 2 Mean, standard deviation (SD), tests of normality of tracers, and Kruskal–Wallis H test in each particle size group
Table 3 Estimated proportions (EP, %), standard deviation (SD), lower limit (L), and upper limit (U) of the 95% confidence intervals for three sources from each tracer (only listing the particle size group of < 0.063 mm)
Fig. 2
figure 2

Relative frequency distributions of the mean proportional contributions from the north alluvial fan, south alluvial fan, and high mountains, estimated using multiple composite fingerprints for the six particle size groups

Table 4 Estimated proportion (EP, %) using the means of multiple composite fingerprints, standard error (SE), lower limit (L), upper limit (U) of the 95% confidence intervals, and the number of tracer pairs (NTP) for six particle size groups
Table 5 Mean source contributions estimated using multiple composite fingerprints for three sources (NAF = north alluvial fan, SAF = south alluvial fan, HM = high mountains, and SM = sediment mixture)

For the particle size group of 0.063–0.1 mm, there were 14 tracers that passed the range and conservativeness tests, the Kruskal–Wallis H test, and the tracer variability ratio test (Table 2). The remaining 14 tracers yielded 28 tracer pairs that produced positive solutions or proportions. The frequency distributions of the estimated proportions from the 28 tracer pairs are shown in Fig. 2. The estimated source proportions ranged from 0.055 to 0.424 for the north alluvial fan, from 0.057 to 0.697 for the south alluvial fan, and from 0.248 to 0.684 for the high mountains. In this particle size group (0.063–0.1 mm), the estimated proportion average (± SE) for the three sediment sources were 14.2% ± 0.02, 39.5% ± 0.03, and 46.3% ± 0.02 for the north alluvial fan, south alluvial fan, and high mountains, respectively (Table 4), with AMRE being 2.2% ± 0.7% (Table 5).

There were 13 tracers that passed all the screening tests in the particle size group of 0.10–0.25 mm (Table 2). The remaining 13 tracers yielded 33 tracer pairs that produced positive solutions for the three sediment sources. The frequency distributions of the estimated proportions from the 33 tracer pairs are shown in Fig. 2. The estimated source proportions ranged from 0.014 to 0.647 for the north alluvial fan, from 0.002 to 0.821 for the south alluvial fan, and from 0.127 to 0.820 for the high mountains. In this particle size group (0.10–0.25 mm), the estimated mean proportions (SE) for the three sediment sources were 29.9% ± 0.03, 22.6% ± 0.04, and 47.5% ± 0.03 for the north alluvial fan, south alluvial fan, and high mountains, respectively (Table 4), with AMRE being 5.2% ± 3.4% (Table 5).

There were eight tracers that passed all the screening tests in the particle size group of 0.25–0.5 mm (Table 2), and from those eight tracers, 10 tracer pairs produced positive solutions for the three sediment sources. The frequency distributions of the estimated proportions from the 10 tracer pairs are shown in Fig. 2. The estimated source proportions ranged from 0.007 to 0.300 for the north alluvial fan, from 0.013 to 0.306 for the south alluvial fan, and from 0.395 to 0.813 for the high mountains. In this particle size group (0.25–0.5 mm), the estimated mean proportions (SE) for the three sediment sources were 16.5% ± 0.03, 16.7% ± 0.03, and 66.8% ± 0.05 for the north alluvial fan, south alluvial fan, and high mountains, respectively (Table 4), with AMRE being 3.4% ± 2.4% (Table 5).

Only six tracers passed all the screening tests in the particle size group of 0.5–1 mm (Table 2), and seven tracer pairs produced positive solutions for the three sediment sources. The frequency distributions of the estimated proportions from seven tracer pairs are shown in Fig. 2. The estimated source proportions ranged from 0.052 to 0.432 for the north alluvial fan, from 0.070 to 0.462 for the south alluvial fan, and from 0.329 to 0.878 for the high mountains. In this particle size group (0.5–1 mm), the estimated mean proportions (SE) for the three sediment sources were 19.6% ± 0.05, 24.9% ± 0.05, and 55.5% ± 0.09 for the north alluvial fan, south alluvial fan, and high mountains, respectively (Table 4), with AMRE being 3.5% ± 1.4% (Table 5).

There were 12 tracers that passed all screening tests in the particle size group of 1.0–2.0 mm (Table 2), and from those 12 tracers, 29 tracer pairs produced positive solutions for the three sediment sources. The frequency distributions of the estimated proportions from 29 tracer pairs are shown in Fig. 2. The estimated source proportions ranged from 0.018 to 0.567 for the north alluvial fan, from 0.004 to 0.632 for the south alluvial fan, and from 0.266 to 0.878 for the high mountains. In this particle size group (1.0–2.0 mm), the estimated mean proportions (SE) for the three sediment sources were 26.3% ± 0.03, 17.0% ± 0.03, and 56.7% ± 0.03 for the north alluvial fan, south alluvial fan, and high mountains, respectively (Table 4), with AMRE being 4.0% ± 2.0% (Table 5).

3.2 Contributions of different sediment sources

In order to determine the contributions of the three potential sediment sources to the downstream sediment mixtures, the estimated proportions of each particle size group from each source were multiplied by the mass fraction of the sediment in the corresponding mixed sediment size groups (Table 5). The results showed that the contributions of the three sediment sources to the mixed sediment were 24.5% for the north alluvial fan, 26.5% for the south alluvial fan, and 49.0% for the high mountains during the duration of this investigation. The total estimation error was approximately 3.5%. The contributions from each individual size group were also presented. The 0.10–0.25 mm particle size group produced the largest mass percentage and accounted for 27.1% of the mixed sediment, of which 8.1%, 6.1%, and 12.9% were the contributions from the north alluvial fan, south alluvial fan, and high mountains, respectively. On the contrary, the 1.0–2.0 mm particle size group produced the smallest mass percentage and only accounted for 0.7% of the mixed sediment, of which 0.2%, 0.1%, and 0.4% were the contributions from the north alluvial fan, south alluvial fan, and high mountains, respectively. Consequently, with less than 1% of the largest particle size group being represented in the mixed sediment, this indicates that the 1.0–2.0 mm particles were too coarse to be transported by the runoff events which occurred during the duration of this investigation.

3.3 Uncertainty of mean contributions

The uncertainty contributions of the estimated source proportions for the three sources were calculated using Eqs. (3) and (4). The relative uncertainty contributions of the three sources and sediment mixture for the variation of fh (the same for fn and fs) were estimated by dividing the respective terms on the right hand side of Eq. (4) by \( {\sigma}_{f_h}^2 \) (Table 6). Overall, compared with the uncertainty contributions from each source, the uncertainty contributions from the sediment mixture accounted for the majority of the total uncertainty. The overall averaged uncertainty contributions from the sediment mixture to the estimated proportions of the three sources were 65.8%, 67.6%, and 66.8% for the north alluvial fan, south alluvial fan, and high mountains, respectively. Correspondingly, the overall averaged uncertainty contributions from the north alluvial fan to the proportions of the three sediment sources were 14.4%, 13.0%, and 12.5%, from the high mountains were 11.1%, 10.4%, and 11.6%, and from the south alluvial fan were 8.7%, 9.1%, and 9.1% for the north alluvial fan, south alluvial fan, and high mountains, respectively.

Table 6 Percent uncertainty contributions from north alluvial fan, south alluvial fan, high mountains, and sediment mixture towards the total uncertainty of the north alluvial fan, south alluvial fan, and high mountains proportion estimates for each particle size group

4 Discussion

Compared with a mean, a median is less sensitive to extreme values for a skewed distribution. If the extreme values are outliners, the median is preferred to the mean to minimize error. Otherwise, the mean may be preferred to give more weights to the extreme values. In this study, we tested the contributions of three sources based on the means and medians of multiple composite fingerprints for six particle size groups, and the results indicated that there were little differences between the means and medians of multiple composite fingerprints except for a few individual cases, and but the order of contributions of three sources was the same between the means and medians in six particle size groups. We also found that the sum of the contributions of three sources from the medians of multiple composite fingerprints does not equal to 1 or 100% for each of the six particle size groups, varying from 86 to 113% (Table 7). That is to say that the solutions using medians instead of means are not closed. If the medians are used, we would underestimate contributions by as much as 14% or overestimate contributions by 13%. Such errors would be avoided if the means are used.

Table 7 The estimated contributions of three sources using the medians of multiple composite fingerprints for six particle size groups

The results showed that the high mountains consistently contributed more sediment to the downstream sediment mixture than either the north or south alluvial fan for all six particle size categories. The estimated proportions from the high mountains were significantly higher than the other two sources, as illustrated by the non-overlapping 95% CIs in Table 4. This consistency suggests the multiple particle size tracking method used in this investigation worked well under the boundary conditions and has great potential in minimizing errors in proportion estimation resulting from the non-conservativeness of tracers caused by particle sorting and enrichment of fines during sediment transport. The contributions between the north and south alluvial fans were overall comparable for all six particle size groups, and were not statistically different, likely resulting from their similar topographies and proximities to the sediment sampling location (sink).

The high mountain area accounts for 93.7% of the research area that expectedly had the largest contributing proportion of downstream sediment mixture, which was nearly 49% during the duration of this investigation. The high percentage of sediment is likely a combination of several factors, including the high mountain extreme relief, steeper channel slope, greater orographic effects, and more precipitation compared to the piedmont fans, and subsequent flash flooding. Collectively, a larger amount of high mountain sediment can be conveyed into the lower reaches of the river system during runoff events.

Compared with the high mountains, the alluvial fans on both sides of the river have much smaller drainage areas and less rainfall, but their total contribution of sediment is slightly larger than the high mountains. These results are attributed to (1) alluvial fan surface materials being finer and less consolidated than those in the mountains, which are easier to transport by surface runoff; and (2) transport distances to the mixed sediment sampling location from the fans are significantly shorter than from the high mountains, giving rise to greater sediment transport efficiency for eroded materials from the fans than from the high mountains.

Considering only the piedmont fans, the north alluvial fan has a larger drainage area and more anthropogenic disturbances, yet produced less sediment yield. The estimated contribution of the south alluvial fan is being slightly larger than that of the north alluvial fan. This may be explained by the geology of both alluvial fans. Specifically, in the north alluvial fan, the surface is mainly exposed rocks, and is not easy to erode and weather. In contrast, the south alluvial fan surface materials are mainly a composite of orange-red clastic rocks, and are easily weathered, eroded, and transported, indicating that the type of surface materials played an important role in providing an ample source of sediments for erosional transport.

In Section 3.3, these results revealed that the sediment mixture contributed the largest portion of uncertainty compared to the estimates of the three sediment source proportions. This can be explained as follows. First, each sediment source area is relatively uniform in geology and geomorphology with comparatively smaller spatial variation than the entire study watershed, while the downstream sediment sampling points (sink) incorporate the three sediment source areas and inherit greater spatial variation. Second, the research area is large, and most of the area is not easily accessible. It is possible that the collected samples do not fully represent the sediment properties of the sources, resulting in underestimation of the within source variability. Third, the sediment in each source area is transported to the sediment sink area, and there may be chemical transformations between elements during the transport. These would increase variability in sediment properties measured at the sink area.

The temporal variability should be further stressed and discussed. Because of spatial variations in storm patterns, storm centers, erosion processes, and rates, variation of collected sediment tends to change with seasons and storm sizes, adding additional temporal variation on top of the already large spatial variation. Therefore, there are greater temporal and spatial variations in the entire watershed than any other potential singular sediment source, which generates large uncertainty (Zhang et al. 2016b). This result suggests that a greater number of samples should be taken for sediment mixtures than for each source to combat the augmented spatial and temporal variations in order to increase the accuracy of the source proportion estimation and minimize the uncertainty caused by the above possible reasons.

5 Conclusions

From a statistical point of view, the estimated contributions of the three sediment sources from any single composite fingerprint were unreliable because of sampling errors, chemical analysis errors, and degree of conservativeness varied with each tracer. The proportional contributions from each sediment source estimated with different composite fingerprints varied considerably, and the 95% CIs of the predicted proportions were quite wide.

To improve the accuracy of estimated mean contributions, all multiple composite fingerprints were averaged to produce relative narrow 95% CIs. During the duration of this investigation, the overall averaged sediment contributions were 49.0%, 26.5%, and 24.5% from the high mountains, south alluvial fan, and north alluvial fan, respectively. The total mean absolute relative error was 3.5%, indicating the multiple composite fingerprinting method can greatly improve the accuracy of sediment contribution estimates. The results also showed using multiple particle size groups has significant potential with regard to increasing the accuracy of proportion estimation, especially when sediment sorting and enrichment of fines are of concern. The proportional contributions from the high mountains were largest for each particle size group, showing the consistency among multiple size tracking under the study conditions and the potential of overcoming tracer non-conservativeness due to sediment sorting.

The Gaussian first-order approximation showed that the overall uncertainty contribution was more than 60% from the sediment mixture, while less than 15% from any of the three individual sediment sources. This result indicates that a larger sample size for sediment mixture, compared with that of each source, is generally needed to minimize uncertainty of the proportion estimates. The larger uncertainty for the sediment mixture is a byproduct of the sediment sampling location (sink), which inherits both spatial and temporal variations from all contributing sources. The uncertainty contributions are additionally compounded by temporal variations resulting from spatial variations of soil erosion rates that vary with storm patterns, storm size, and seasons. In addition, the large uncertainty with sediment mixture may also indicate that samples from sources may not adequately capture the range of the sediment properties of the sources, or that sediment may undergo geochemical alternation within channels during transport. This conclusion needs to be further examined under different climates and geographic conditions where different erosion and sediment transport processes prevail.