Introduction

As the key channel connecting terrestrial and marine ecosystems, river systems play a crucial role in carbon transformation from land to the ocean (Regnier et al. 2013; Wehrli 2013; Schefuß et al. 2016). The flux of carbon in rivers can reflect hydrogeology, climate, land use, vegetation cover, and human activities in the basin. Carbon in rivers mainly originates from the atmosphere, fixed organic carbon by the photosynthesis reaction, terrestrial input by surface runoff, and microbial degradation in rivers (Raymond et al. 2008; Stets and Striegl 2012; Grand-Clement et al. 2014). Dissolved organic matter (DOM) is one of the main forms of carbon in water, with a wide range of molecular weights changes and complex molecular structures, which interact with the physical and chemical behavior of water bodies (Hassett 2006; Williams et al. 2016).

Chromophoric dissolved organic matter (CDOM) is the colored and photoactive fraction of DOM. Due to its strong absorption in the ultraviolet (UV) and visible light regions of the spectrum in aquatic ecosystems, CDOM concentration and composition changes can significantly affect the availability and spectral quality of underwater light fields (Kirk 2011). CDOM absorbs most strongly in the ultraviolet region, so it limits the penetration depth of biologically harmful UV-B radiation (280~320 nm), and protects aquatic organisms against harmful radiation (Stedmon et al. 2000; Huovinen et al. 2003). CDOM is decomposed into small molecular organic matter and inorganic nutrients through photochemical and microbial degradation processes to create conditions for the growth of aquatic organisms such as algae, microorganisms, and aquatic vegetation (Veuger et al. 2004; Osburn et al. 2009; Lapierre and Frenette 2009). However, these processes also release greenhouse gases such as CO2 and CH4, which will aggravate global warming (Granéli et al. 1999; Tranvik et al. 2009; Jørgensen et al. 2014). Moreover, the CDOM absorption of visible light competes with algae and aquatic vegetation for photosynthetically available radiation (PAR), which in turn affects the productivity and structure of the aquatic system. (Stedmon et al. 2000; Zhang et al. 2011a). CDOM absorption in the blue region of visible light overlaps with the absorption band of phytoplankton chlorophyll a (Chla) and total suspended matter (TSM), which will lead to the inhibition of phytoplankton photosynthesis, a reduction of primary productivity, and interference in the estimation of phytoplankton biomass and TSM concentration by remote sensing (Rochelle-Newall and Fisher 2002; Doxaran et al. 2002; Zhang et al. 2009). CDOM is complex in composition, mainly including humic acid and fulvic acid, which originate from aquatic biological secretions, algae and aquatic vegetation degradation, river inputs from land and marshes, atmospheric wet deposition through rainfall and release from sediment, especially in inland waters (Rochelle-Newall and Fisher 2002; Zhang et al. 2007a; Zhou et al. 2017; Burdige et al. 2004). In addition to the absorption of light, CDOM also fluoresces when excited by light in the UV and blue regions of the spectrum (Stedmon and Markager 2003; Stedmon et al. 2003). As a result, several optical approaches, including absorbance and fluorescence, have been used to reflect the internal and surrounding environment of the water body and can indicate the pollution status of the water body to a certain extent and help identify the origins of CDOM in aquatic ecosystems (Baker 2002; Vahatalo et al. 2005; Hudson et al. 2007).

The Three River Source Region (TRSR) is the source of the Yangtze, Yellow, and Lancang Rivers catchment area, which lies in the hinterland of the Tibetan Plateau, the highest and most extensive plateau in the world (Xu et al. 2018b; Fang 2013). The TRSR is called the “Chinese Water Tower”, and approximately 40 billion m3 of water is transported downstream of the source regions every year (Shao et al. 2017). As the source regions of the first and second largest rivers in China, respectively, the source region of the Yangtze River (TSYaR) and that of the Yellow River (TSYeR) have important ecological functions for water resource conservation and management, biodiversity protection, and ecological security in the Yangtze and Yellow River basins. These source regions have complex topography and a harsh climate, and the ecological environment is fragile and sensitive to climate change (Liu et al. 2014; Ding et al. 2018; Kensuke et al. 2005). However, in recent decades, due to anthropogenic activities and climate changes, stream flows have reduced, snow-capped glaciers have receded, vegetation coverage has changed, lakes have shrunk, rivers have dried up, and the hydrological environment has changed drastically around the world (Pirnia et al. 2019; Liu et al. 2014; Wang et al. 2001; Shi et al. 2017). In addition, soil and water conservation capacity has declined sharply and grassland productivity has declined, causing serious damage to the ecological environment of the source catchment regions and marking it difficult to restore it to health (Xu et al. 2018a, 2018b; Cai et al. 2015; Sheng et al. 2019). Taking the TRSR as an example, the State Council of China approved and implemented “The Three River Source Region Ecological Project” in 2005 to carry out ecological restoration and treatment for the ecological environment. Although this project has had a positive contribution to the restoration and protection of TRSR ecosystems, the ecological environment is still not completely restored (Liu et al. 2013; Jiang et al. 2017). Further, the optical properties of CDOM in TSYaR and TSYeR are largely unknown (Nima et al. 2016).

In this study, the CDOM of the main rivers and typical river segments of TSYaR and TSYeR was investigated. The main objectives were (1) to identify the CDOM spectral absorption characteristics and the fluorescence components in TSYaR and TSYeR; (2) to discuss the possible sources of CDOM; and (3) to explore the relationship between CDOM and environmental variables (potential environmental indicators and influential factors of CDOM). This study provides useful information for the role of CDOM in ecological and biogeochemical processes in TRSR areas. Furthermore, it provides a theoretical basis for water environmental protection and ecological restoration in the source area of rivers.

Materials and methods

Study area

TSYaR and TSYeR are located in the northeastern part of the Qinghai-Tibet Plateau (Fig. 1). The average elevation of these regions is > 4000 m (Wang et al. 2017), and the annual temperature ranges from − 5.6 °C in January to 7.8 °C in July (Sheng et al. 2019). Annual rainfall is between 262.2 and 772.8 mm mostly falling as snow and heavy rain (Jiang et al. 2017; Fang et al. 2011), and annual evaporation is 1204~1327 mm (Qian et al. 2006). The source regions of the two river systems extend over approximately 15.8 × 104 km2 and 12.2 × 104 km2, respectively (Yu et al. 2013).

Fig. 1
figure 1

Locations of sampling segments in the source regions of the Yangtze and Yellow Rivers, China

Fluvial morphology, drainage pattern, and riparian vegetation in TSYaR and TSYeR differ markedly (Yu et al. 2013). TSYaR is divided by quasi-parallel low mountains or ridges and developed relatively parallel drainage networks along the area between the low mountains and ridges (Yu et al. 2013; Pan et al. 2013; Li et al. 2018a, 2018b). The river network is dense in TSYaR, and the main tributaries are the Dangqu River in the south, the Tuotuo River in the middle, and the Chumaer River in the north (Wu et al. 2013). TSYeR originates from the Yueguzongliequ Basin of the northern Bayan Har Mountains and flows through the two largest lakes in the region, namely Zhaling Lake and Eling Lake (Li et al. 2013; Blue et al. 2013; Wan et al. 2014). TSYeR exhibits greater variability in morphology than TSYaR. The elevation ranges from 2680 to 6248 m and gradually decreases from the southwest to the northeast (Meng et al. 2016; Wu et al. 2018). The environmental gradient of riparian vegetative development (i.e., the type, quantity and diversity) is associated with interactions between fluvial morphology and the drainage pattern (Yu et al. 2014). The environment is more favorable for vegetation growth in TSYeR than in TSYaR (Yu et al. 2013), and the potential season for vegetation growth in TSYeR is much longer than in the vast majority of TSYaR (Yu et al. 2014). The main differences between the two regions are shown in Table S1.

Field sampling and measurements

A sampling campaign was conducted in TSYeR at the end of July and in TSYaR in early October 2018, i.e., the wet season with relatively abundant water in the two regions (Shen et al. 2003; Feng et al. 2018). A total of 36 water samples were collected, including 18 samples for each region. Six sampling river segments were chosen from sampling in TSYaR and TSYeR, respectively (Fig. 1). Three sampling sites were set in each sampling segment, and the spacing between each site was within the range of 100 to 500 m according to the in situ environment. Surface (0~0.5-m depth) water samples for CDOM absorption, fluorescence, dissolved organic carbon (DOC) concentration, and water quality analysis were collected in a 1-L acid-cleaned polyethylene bottle and held on ice in the field. These water samples were transported to the laboratory as soon as possible and stored in the dark at 4 °C. In addition, at each sampling site, environmental variables were measured after water sample collection. Latitude and longitude were registered using a handheld GPS device (G138BD GPS, UniStrong, China). Water temperature (WT), dissolved oxygen (DO), electrical conductivity (EC), and pH were measured using a multiparameter digital analyzer (HQ30d, HACH, USA). Surface velocity (v) was measured by a portable flow meter (Flowatch, Switzerland) and turbidity (Turb) was detected using a portable turbidimeter (2100Q, Hach, USA).

Water quality measurements

In the laboratory, water samples for DOC were filtered through 0.45-μm cellulose filters and then measured with a total organic carbon analyzer (Vario TOC, Elementar, Germany). The water samples were analyzed for total nitrogen (TN), nitrate nitrogen (NO3–N), nitrite nitrogen (NO2–N), ammonium nitrogen (NH4+–N), and total phosphorus (TP) by the Standard Methods of Environment Monitoring in China (National Bureau of Environment Protection Editor 2002). TN was determined using a UV spectrophotometric method with alkaline potassium persulfate digestion; NH4+–N concentration was determined using a salicylic acid spectrophotometry method; NO3–N concentration was determined using sulfamic acid UV spectrophotometry; NO2–N concentration was determined using an N-(1-naphthyl)-ethylenediamine spectrophotometric method; and TP concentration was determined using a spectrophotometric method with persulfate potassium digestion.

CDOM absorption measurement and analysis

Water samples for CDOM analysis were first filtered through precombusted (4 h at 450 °C) 0.7-μm GF/F glass filters (Whatman, UK) and then through 0.22-μm membrane cellulose filters (Millipore). The absorption coefficients of CDOM were measured between 200 and 800 nm (path length 1 cm) using a UV–visible spectrophotometer (DR6000, Hach, USA) with Milli-Q water (18.2 MΩ cm−1 resistivity) as a blank. The absorption coefficient (m−1) was calculated from the measured absorbance using Eq. (1) below (Bricaud et al. 1981; Kirk 2011):

$$ {a}_{\mathrm{CDOM}}\left(\lambda^{\prime}\right)=2.303\times A\left(\lambda \right)/r\kern0.75em $$
(1)

where aCDOM(λ′)(m−1) is the uncorrected CDOM absorption coefficient at wavelength λ, (λ) is the absorbance at wavelength λ, and r (m) is the optic pathlength of the quartz cuvette in meters (0.01 m in this study). A correction equation (Eq. (2)) was applied to correct the scattering by residual particles in the filtered sample absorption at 700 nm (Bricaud et al. 1981; Green and Blough 1994):

$$ {a}_{\mathrm{CDOM}}\left(\lambda \right)={a}_{\mathrm{CDOM}}\left(\lambda^{\prime}\right)-{a}_{\mathrm{CDOM}}(700)\times \lambda /700 $$
(2)

The spectral slope of CDOM can be modeled using a single exponential formula according to the nonlinear regression of Eq. (3) (Bricaud et al. 1981):

$$ {a}_{\mathrm{CDOM}}\left(\lambda \right)={a}_{\mathrm{CDOM}}\left({\lambda}_0\right)\times \exp \left[S\left({\lambda}_0-\lambda \right)\right]\kern0.5em $$
(3)

where aCDOM(λ0) is the absorption coefficient at reference wavelength λ0, which was calculated with 440 nm adopted as the reference wavelength in this study (Kirk 2011; Xu et al. 2018a). S (μm−1), the spectral slope, can reflect small changes in the spectral shape and has been used to explore the biogeochemical process and sources of CDOM. S275~295 (wavelength λ is 275~295 nm) values were correlated with CDOM composition or diagenesis and have been widely applied in DOM studies (Li and Hur 2017). The values were calculated using Eq. (3) and can be used to reflect the molecular weight of the CDOM composition and to characterize the source of the CDOM (Helms et al. 2008; Xia et al. 2018). The CDOM concentration was difficult to measure directly because of the complicated composition, aCDOM(355) is widely used as an indicator of relative CDOM concentration (Kirk 2011; Stedmon et al. 2007b; Zhang et al. 2007b) and was used as a surrogate for CDOM concentration in this study.

Another parameter was derived by absorbance and DOC concentration measurements: SUVA254 is defined as the absorbance at 254 nm divided by the DOC concentration (Seritti et al. 1998; Weishaar et al. 2003).

Three-dimensional fluorescence measurements and PARAFAC modeling

CDOM fluorescence excitation-emission matrices (EEMs) were determined using a fluorescence spectrophotometer (FluoroMax–4, Horiba Jobin Yvon, USA). EEM spectroscopy scanning ranges were 200~450 nm for excitation (Ex) and 250~600 nm for emission (Em). Spectral readings were collected in the ratio mode (Sc/Rc) at 5-nm intervals for Ex wavelength and at 1-nm intervals for Em wavelength. The bandpass width of both the Ex and Em was set at 5 nm. Water Raman scatter peaks were eliminated by subtracting a Milli-Q water blank EEMs from the EEM of the measured sample (Li et al. 2018). The Raman scattering of the sample was automatically deducted by the spectrometer system, and the influence of Rayleigh scattering applying factory supplied correction factors (Singh et al. 2010). Inner-filter effects were corrected for the EEM using the measured CDOM absorbance (McKnight et al. 2001; Murphy et al. 2013).

Parallel factor analysis (PARAFAC) is a statistical decomposition technique used to extract the most representative fluorescence components from the complex DOM mixture EEM dataset (Andersen and Bro 2003; Murphy et al. 2013). In this study, PARAFAC of EEM data was conducted using MATLAB 2018a software with the DOMFluor toolbox 1.7 (Bro 1997; Stedmon and Bro 2008). The model can be written as follows:

$$ {x}_{ijk}=\sum \limits_{f=1}^F{a}_{if}{b}_{jf}{c}_{kf}+{e}_{ijk}\kern0.5em $$
(4)

where i = 1, …, I; j = 1, …, J; k = 1, …, K; xijk is the fluorescence intensity of the ith sample at the kth Ex and jth Em wavelengths; aif is directly proportional to the concentration of the fth fluorophore in the ith sample; bjf and ckf are estimates of Em and Ex spectra (loadings) of fth fluorophore at wavelength j and k, respectively; F is the number of fluorescence components; and eijk is the residual noise, which represents unexplained variability in the model.

Due to the SNR (signal-to-noise ratio), the EEM datasets with Ex wavelength less than 250 nm and Em wavelength less than 300 nm were excluded from the calculation of PARAFAC. Considering that the different sampling times and the fluorescence components may differ in the two regions, the EEM data of the two regions were performed by PARAFAC, respectively. According to the PARAFAC results, 2 TSYaR and 1 TSYeR samples were removed because of possible problems. Finally, 16 EEM datasets for TSYaR and 17 EEM datasets for TSYeR were analyzed using PARAFAC modeling separately and were used in this study. Determination of the number of components was performed by split-half analysis and analysis of residuals and loadings (Stedmon et al. 2003). Several fluorescence parameters were used to quantitatively describe and distinguish the fluorescence characteristics of CDOM. The fluorescence index (FI) is defined as the ratio of the Em intensity at 470 nm to that at 520 nm, obtained at Ex 370 nm (Cory and McKnight 2005); the biological index (BIX) is defined as the ratio of Em intensity at 380 nm to that at 430 nm, obtained at Ex 310 nm (Huguet et al. 2009); the humification index (HIX) is defined as the ratio of two spectral regions (435~480 nm and 300~345 nm) from the Em spectrum, obtained at Ex 254 nm (Huguet et al. 2009) (since the bandpass width of Ex was set at 5 nm, we used the value of Ex at 255 nm).

Statistical analyses

The S275~295 values of CDOM absorption were obtained using a nonlinear least square regression (Loiselle et al. 2009). Linear discriminant analysis (LDA) was performed using CDOM absorption spectral parameters (aCDOM(355), SUVA254, S275~295), fluorescence parameters (FI, BIX, HIX), and the percentage of the fluorescence components (C1(%) and C2(%)) for the sake of classification and dimensionality reduction. Relationships between CDOM absorption spectral variables (aCDOM(355), SUVA254, S275~295), C1(%) and C2(%), the corresponding fluorescence intensity (I1 and I2) of the fluorescence components (C1 and C2), the fluorescence index (FI), biological index (BIX), humification index (HIX), and environmental variables (Elevation, WT, DO, EC, pH, v, Turb, DOC, TN, NO3–N, NO2–N, NH4+–N, TP) were determined by Spearman’s correlation analysis.

PARAFAC and graphs of CDOM fluorescence components constructed with MATLAB 2018a software. Distribution maps of segments and sampling sites were generated with ArcGIS 10.2 software. The mean, standard deviation, t test, LDA, Spearman’s correlation analysis, and other graphs were generated with Rstudio 1.1.463 (R 3.5.1 version) software (http://www.r-project.org/). “⁎”, “⁎⁎,” and “⁎⁎⁎” were considered statistically significant at the levels of 0.05, 0.01, and 0.001 (two-tailed), respectively.

Results

Environmental variables of TSYaR and TSYeR

The WT ranged from 0.3 to 6.2 °C and 4.7 to 21.4 °C in TSYaR and TSYeR, respectively. The WT in TSYeR generally increased with decreasing altitude (Table 1). The average concentrations of DOC were 5.13 ± 1.68 and 6.21 ± 1.78 mg L−1 in TSYaR and TSYeR, respectively, and there was no significant difference between the two regions (p > 0.05). The DO, pH, EC, and Turb showed no significant difference between the two regions (p > 0.05). In terms of the spatial distribution of river segments in each region, Turb and v tended to increase with decreasing elevation, while there were no obvious spatial distribution patterns for DO, pH, and EC. The water quality of the two regions was good, and some water quality indexes reached the type II~III water quality standard in the “Environmental Quality Standards for Surface Water of China (GB3838-2002).” The concentration of TN in TSYeR (1.72 ± 0.66 mg L−1) was significantly higher than that in TSYaR (0.87 ± 0.27 mg L−1) (p < 0.05), while other chemical indicators (NO2–N, NH4+–N, TP) did not differ significantly between the two regions. The concentrations of TN (0.72~2.71 mg L−1), NH4+–N (0.07~0.32 mg L−1), and TP (0.14~0.29 mg L−1) generally increased from the upper mainstream to the lower segments in TSYeR, while they had no obvious spatial distribution regularity in TSYaR.

Table 1 Mean and standard deviation (SD) of environmental variables in TSYaR and TSYeR

CDOM absorption spectral features

The absorption spectra demonstrated that the CDOM absorption coefficient decreased with increasing wavelength in an approximately exponential manner from the ultraviolet to the visible light waveband in TSYaR and TSYeR indicating a range of CDOM concentrations (Fig. 2). The absorption coefficient in the visible light waveband was small, especially after the wavelength reached 600 nm, the absorption coefficient gradually approached zero. In general, the CDOM absorption coefficient of TSYeR was significantly higher than that of TSYaR at wavelengths less than 643 nm (p < 0.05), which indirectly reflected the differences in CDOM concentration between the two regions.

Fig. 2
figure 2

Averages of CDOM absorption spectra (LightCoral (no. F8766D) and DarkTurquoise (no. 00BFC4) solid lines), plus standard deviations (gray-shaded area) measured in the source regions of the Yangtze and Yellow Rivers, China. Asterisk denotes being statistically significant at the levels of 0.05

The larger aCDOM(355) value, the higher the CDOM concentration of the corresponding sample. Figure 3A and B show the spatial variation of CDOM absorption coefficients (aCDOM(355)) in TSYaR and TSYeR, ranging from 0.57 to 4.49 m−1, with a mean of 2.07 ± 1.10 m−1; the coefficient of variation was 53.1%. The aCDOM(355) values of TSYaR and TSYeR had similar spatial variation characteristics: the aCDOM(355) value increased gradually from the upper branch to the mainstream (except in the TTH segment) in TSYaR and from the upper part of the mainstream to the lower segments in TSYeR (except in the HHY and TNH segments) (Fig. 3A). The mean absorption coefficient (aCDOM(355)) of TSYeR (2.97 ± 0.83 m−1) was significantly higher than that of TSYaR (1.18 ± 0.37 m−1) (p < 0.001) (Fig. 3B).

Fig. 3
figure 3

Bar plots and vioplot box plots showed the spatial distribution of CDOM absorption spectral parameters in all sections and two regions of TSYaR and TSYeR, respectively. (A, B) spatial variation of absorption coefficients at 355 nm (aCDOM(355)); (C, D) spatial variation of carbon-specific CDOM absorption at 254 nm (SUVA254); (E, F) spatial variation of the spectral slope for 275~295 nm (S275~295). Red box boundaries in vioplot box plots indicate the 25th and 75th percentiles; the whiskers represent the maximum/minimum values except for outliers; the inner horizontal blacked line is the median; the outside violin plot showed the data distribution and its probability density. Different lowercase letters above the bar plots mean significant difference at the 0.05 level. Single and double asterisks denote being statistically significant at the levels of 0.05 and 0.001, respectively

The values of SUVA254, which indicated the aromaticity and reactivity of CDOM, were different across all study segments (Fig. 3C, D), ranging from 0.66 to 4.45 L−1 m−1 mg, with a mean of 2.20 ± 0.96 L−1 m−1 mg. The SUVA254 value and aCDOM(355) had similar spatial variation characteristics, except that the values in the Maqu River segment (MQ) were lower than those in the Mentang (MT) and Tangnaihai (TNH) segments in TSYeR. Similarly, the spatial mean SUVA254 exhibited a marked difference between the two regions, and the mean SUVA254 value of TSYeR (2.85 ± 0.8 m−1) was significantly higher than that of TSYaR (1.46 ± 0.26 m−1) (p < 0.001) (Fig. 3D).

Additional optical characteristics and the distribution of spectral slope S275~295 values (wavelength at 275~295 nm) also showed differences in organic matter quality between all study segments. The S275~295 mean value was 17.19 ± 1.86 μm−1 and values ranged from 12.37 (JM) to 24.27 μm−1 (GEQ). The spatial distribution of S275~295 values did not exhibit an obvious trend in segments of TSYaR and TSYeR (Fig. 3E). From the regional perspective, however, the mean S275~295 value of TSYaR (18.25 ± 3.08 μm−1) was significantly higher than that of TSYeR (16.13 ± 1.89 μm−1) (p < 0.05) (Fig. 3F).

Overall, when all twelve river segments were considered together, it was found that the variation in the aCDOM(355) and SUVA254 values generally showed a consistent tendency (Fig. 3A, C), while the S275~295 values showed no clear regularity (Fig. 3E). The distributions of aCDOM(355) and SUVA254 values were relatively concentrated in TSYaR and scattered in TSYeR (Fig. 3B, D), and the distributions of S275~295 values in the two regions were opposite (Fig. 3F).

CDOM EEM–PARAFAC components and qualitative indexes

The EEM data of all samples measured in the two regions were analyzed by the PARAFAC method, respectively. The two components (C1 and C2) of the two regions were identified and subjected to a split-analysis validation procedure and a random assignment test, and the simulation degree exceeded 99.5%. These two components could be grouped into one category: humic-like component, which had highly similar (Ex/Em)max positions between each pair of components in the two regions (Table S2; Fig. 4). The characteristics of the two components were similar to those of CDOM previously reported in other aquatic environments (Table S2). C1 showed two excitations (Ex) maxima at 260 and 365 nm, with one emission (Em) maximum at 538 nm (TSYaR) or 539 nm (TSYeR), and C2 displayed two Ex maxima at ≤ 250 and 310 nm, with one Em maximum at 412 nm (TSYaR) or 416 nm (TSYeR). The fluorescence components were similar; however, the average proportions of the components were different in the two regions. In TSYaR, C1 and C2 accounted for 51.86% and 48.14%, respectively, while the corresponding proportions in TSYeR were 47.71% and 52.28%. Of course, what needs to be pointed out in this study was that although the two components were identified using the PARAFAC model, this did not mean that only two types of fluorophores were present in these samples.

Fig. 4
figure 4

The two fluorescence components were found from the results of PARAFAC modeling. Contour plots (a, b, e, f ) present spectral shapes of excitation and emission, positions of their maxima are given in Table S2. The loadings (c, d, g, h) derived from the two components PARAFAC model using split-half validation technique. Solid green lines represent Ex loadings for the whole dataset and solid blue lines represent Em loadings for the whole dataset

The fluorescence index (FI) in our study varied from 1.50 to 1.79, with a mean of 1.63 ± 0.07. The average FI in GEQ (1.73 ± 0.06) was significantly higher than that in the TTH (1.65 ± 0.02) (p < 0.05), but the values were not significantly different from those of other river segments in TSYaR (p > 0.05) (Fig. 5A). In TSYeR, the average FI in the HHY (1.67 ± 0.01) was the highest and was significantly different from other river segments (p < 0.05), and the lowest FI was found in the MQ segment (1.51 ± 0.02) (Fig. 5A). Regionally, the FI in TSYaR was significantly higher than that in TSYeR (p < 0.001) (Fig. 5B). The biological index (BIX) within all sampling segments ranged from 0.64 to 0.84, with a mean of 0.75 ± 0.06. Relatively high BIX values were found in the CMEH (0.81 ± 0.02) and ZMD (0.82 ± 0.02) river segments, which were significantly higher than that in the TTH (lowest) river segment (p < 0.05), and other river segments in TSYaR were not significantly different from each other (Fig. 5C). The BIX values of the HHY and MT river segments were significantly higher than those of other river segments (p < 0.05) but were not significantly different from each other (p > 0.05) (Fig. 5C). From a regional perspective, a significantly higher mean BIX was recorded in TSYaR (0.80 ± 0.01) than in TSYeR (0.71 ± 0.03) (p < 0.001) (Fig. 5D). The spatial distribution of the humification index (HIX) was different from that of FI and BIX in this study area. The highest HIX was found in the TTH (4.77 ± 0.41) and the lowest in the CMEH (2.72 ± 0.01). The HIX values of the JM and TNH river segments were relatively high, and the lowest value was recorded in HHY (2.19 ± 0.28) (Fig. 5E). Overall, the mean HIX value of TSYeR (4.69 ± 1.64) was significantly higher than that of TSYaR (3.76 ± 0.66) (p < 0.05) (Fig. 5F).

Fig. 5
figure 5

Bar plots and vioplot box plots showed the spatial distribution fluorescence parameters in all sections and two regions of TSYaR and TSYeR, respectively. (A, B) spatial variation of fluorescence index (FI); (C, D) spatial variation of the biological index (BIX); (E, F) spatial variation of the humification index (HIX). Red box boundaries in vioplot box plots indicate the 25th and 75th percentiles; the whiskers represent the maximum/minimum values except for outliers; the inner horizontal blacked line is the median; the outside violin plot showed the data distribution and its probability density. Different lowercase letters above the bar plots mean significant difference at the 0.05 level. Single and double asterisks denote being statistically significant at the levels of 0.05 and 0.001, respectively

The LDA results were a good representation of the differences in CDOM between TSYaR and TSYeR (Fig. 6). The first two LDA axes (LD1, 91.45%; LD2, 3.51%) explained 94.96% of the total variability in the CDOM spectral parameters. The first LD1 axis showed relatively high loadings for aCDOM(355) and FI. The BIX, C1(%), and C2(%) had loadings on the second LD2 axis, which only accounted for 3.51% of the variance. Most of the segments of TSYaR were clustered on the negative side of the LD1 axes, whereas the segments of TSYeR were clustered on the positive side of the LD1 axes. To some extent, the LDA results further showed the differences in CDOM between TSYaR and TSYeR.

Fig. 6
figure 6

Biplot of linear discriminant analysis (LDA) for CDOM spectral parameters (aCDOM(355), SUVA254, S275~295, FI, BIX, HIX) and the fluorescence components percentage (C1(%) and C2(%)) in the twelve segments of the two regions

Correlations between CDOM and environmental variables

Multivariate analyses were performed on CDOM parameters and environmental variables to demonstrate the influential factors of the water physicochemical properties for CDOM in this study area.

Spearman’s correlation analysis provided additional insight into the relationship between the CDOM parameters and environmental variables. There were different scale effects for the two regions in this study. In TSYaR, Spearman’s correlation analysis showed that DOC had a significant positive correlation with fluorescence intensity I1 (r = 0.832, p < 0.001) and I2 (r = 0.862, p < 0.001), and had a significant negative correlation with FI (r = − 0.785, p < 0.001); EC, NO3–N, and DOC have similar effects on CDOM, but the significance was relatively lower (Fig. 7a); In addition, there were significant positive correlations between v, WT, TP, Turb, and the CDOM absorption coefficients aCDOM(355) and SUVA254; and TN and DO had a significant negative correlation with S275~295 (r = − 0.564, p < 0.01) and aCDOM(355) (r = − 0.512, p < 0.01), respectively (Fig. 7a).

Fig. 7
figure 7

Spearman’s correlation analysis of CDOM absorption spectral parameters (aCDOM(355), SUVA254, S275~295), the fluorescence components percentage (C1(%) and C2(%)), the corresponding fluorescent intensity (I1 and I2) of the fluorescence component (C1 and C2), the fluorescence parameters (FI, BIX, HIX), and environmental variables (Elevation, WT, DO, EC, pH, v, Turb, DOC, TN, NO3–N, NO2–N, NH4+–N, TP). Single asterisk indicates coefficients at 0.05 significance level, p < 0.05; Double asterisks indicate coefficients at 0.01 significance level, p < 0.01; Triple asterisks indicate coefficients at 0.001 significance level, p < 0.001

Spearman’s correlation analysis of the CDOM parameters and environmental variables in TSYeR is shown in Fig. 7b. DOC was significantly positively correlated with fluorescence intensity I1 (r = 0.539, p < 0.05) and I2 (r = 0.657, p < 0.01), and significantly negatively correlated with SUVA254 (r = 0.613, p < 0.01); WT, Turb, and NH4+–N were positively correlated with aCDOM(355) and SUVA254 and negatively correlated with FI, while EC and Elevation were opposite to these water quality parameters. NO3–N had significant negative correlation with HIX (r = − 0.540, p < 0.05) and a significant positive correlation with BIX (r = 0.618, p < 0.01) (Fig. 7b).

Discussion

CDOM optical properties in TSYaR and TSYeR

Due to the complex chemical composition of CDOM, its concentration is generally expressed by the absorption coefficient at a certain wavelength (Kirk 2011; Stedmon et al. 2007b), such as aCDOM(355) in this paper. Studies have shown that absorption is near the limit or is negative for wavelengths longer than 350 nm for some very clear water bodies, and absorption was significantly positively correlated with TN, TP, Chla, and the trophic state index (TSI) (Zhang et al. 2018). The disturbance caused by Human activity was relatively low, and some water quality indexes reached to the type II~III water quality standard in the two regions. This was one of the reasons for the low CDOM concentration (with a mean of 2.07 ± 1.10 m−1), and values were even negligible (Shang et al. 2018). SUVA254 reflects the proportion of aromatization in the CDOM molecular structure (Weishaar et al. 2003) and indirectly characterizes the absorption capacity of CDOM for light (Zhang and Qin 2007; Morris and Hargreaves 1997). In this study, the average SUVA254 values in the two source regions were significantly different (Fig. 3C, D), indicating that the concentration of aromatic substances in CDOM was different, and the ability to absorb light was also different. It should be pointed out that the sampling times of the two regions were different. Although the water was abundant in the respective sampling periods, environmental variables differed with respect to the temporal distribution (Table 1). From the perspective of the CDOM composition, combined with the CDOM absorption parameters values (S275~295), it was found that greater SUVA254 resulted in lower S275~295 values, which corresponded to the large molecular weight of CDOM, consistent with other studies (Zhang and Qin 2007; Yacobi et al. 2003).

In this study, two components were grouped into the humic-like component by comparison with previously identified components (Table S2). This result does not suggest that only two types of fluorophores were present in these samples and does not suggest that both components were present in each sample of the two regions; it only means that humus-like substances were the main component of CDOM in this study area. C1 was likely derived from agricultural catchments, soil, or terrestrial/autochthonous sources, as reported in other widespread freshwater environments (Table S2, Stedmon et al. 2003; Stedmon and Markager 2005; Murphy et al. 2006; Murphy et al. 2008; Singh et al. 2010; Zhu et al. 2018). C2 resembled a microbial humic-like component and existed in forest, stream, rainwater, and wetland environments (Table S2, Stedmon et al. 2003; Cory and McKnight 2005; Williams et al. 2010; Cussa and Guéguenb 2012). Although the fluorescence components were similar, the average proportions of these fluorescence components (C1(%) and C2(%)) were different in the two regions, which might be due to the comprehensive effect in the two regions. In previous studies, strong relationships (p < 0.01) were observed between monthly mean net inflow runoff (Qnet) and the proportions of the fluorescence components (i.e., C2(%)), which suggested that Qnet influenced the CDOM n the fluvial plain Lake Taihu watershed (Zhou et al. 2018) and highly significant positive relationships were observed between the percentage of anthropogenic land use and C1 (r2 = 0.71, p < 0.001) in drinking water reservoirs (Shi et al. 2020). Changes in the relative proportions of fluorescence components C1 and C2 are indicative of impacts of biogeochemical processes on CDOM dynamics (Pitta et al. 2016).

Possible sources of CDOM in TSYaR and TSYeR

According to a previous study (Coble 2007), the sources of CDOM components in aquatic ecosystems are mainly allochthonous (i.e., terrestrial) and autochthonous sources (i.e., microbial/algae). The allochthonous sources mainly result from higher animal and plant residues in the soils, which are degraded by bacteria and fungi, and most of them predominately show humic-like peaks; the autochthonous sources mainly result from biological activities such as plankton, aquatic bacteria, and algae in aquatic ecosystems, and those sources are mostly expressed as a protein-like peak (Kirk 2011; Zhang et al. 2011b).

Several related parameters of CDOM spectral absorption can be used to reflect the sources and components of CDOM, such as SUVA254 and S275~295. In addition to characterizing the absorption capacity of light, SUVA254 can also be used to distinguish CDOM sources and types, which act as indicators of humus aromaticity and reactivity (Weishaar et al. 2003). This study showed the average SUVA254 value of the two regions was 2.20 ± 0.96 L−1 m−1 mg, which is relatively high compared with some studies (Hu et al. 2017; Yang et al. 2019). Allochthonous sources are believed to be more aromatic than autochthonous sources (McKnight et al. 2001). Thus, the relatively high SUVA254 value indicated a greater abundance of allochthonous sources in this study. The mean CDOM S275~295 value was 17.19 ± 1.86 μm−1, which was relatively low compared with that measured in some oceans and lakes (Pitta et al. 2016; Zhou et al. 2018). This result indicated that the allochthonous sources of humic compounds accounted for a large proportion in present study regions. In this study, especially in TSYeR, the aCDOM(355) and SUVA254 values from the upper reaches of the Yellow River to the downstream roughly showed a decreasing trend (Fig. 3A, C), which revealed the input of terrestrial humus.

The fluorescence index (FI), biological index (BIX), and humification index (HIX) can also be used to analyze the structures and sources of CDOM. FI Reflects the relative contribution of aromatic and non-aromatic amino acids to fluorescence intensity as an indicator of CDOM source and degradation (Cory and McKnight 2005). Studies have shown that an FI value > 1.9 indicates that CDOM is derived from the extracellular release of bacteria and algae and has significant endogenous production characteristics; an FI value < 1.4 reflects the origin of allochthonous sources such as terrestrial plants and soil organic matter (McKnight et al. 2001; Huguet et al. 2009). The FI in our study varied from 1.50 to 1.79, with a mean of 1.63 ± 0.07, and the values were closer to 1.4 based on the distribution of all FI values. Compared with TSYaR, CDOM in TSYeR had stronger allochthonous characteristics. The biological index (BIX) is used to measure and indicate the proportion of self-generated contributions in CDOM. BIX ranged between 0.6 and 0.8, indicating that the self-generated source contribution was small; when the value exceeds 1.0, the degree of CDOM degradation is high, and the characteristics of the self-generated source components are obvious (Birdwell and Engel 2010). BIX was recorded in TSYaR (0.80 ± 0.01) in TSYeR (0.71 ± 0.03), which indicated that the endogenous contribution to the CDOM of the two source regions was small and reflected that the allochthonous sources of TSYeR were more abundant than those of TSYaR, consistent with the FI. The humification index (HIX) is directly proportional to the degree of organic matter humification; HIX < 4 is associated with autochthonous sources, and HIX > 10 indicates strongly humified organic material, mainly of terrestrial origin (Huguet et al. 2009; Vacher 2004). The HIX values in our study varied from 1.94 to 6.95, and the values were greater than 4 in most river segments. In particular, TSYeR showed terrestrial input except for individual sites (Fig. 5E, F). In summary, based on the comprehensive CDOM spectral absorption and fluorescence parameters, CDOM was mainly derived from externally input humus, and TSYeR showed stronger allochthonous sources in this study region.

Relationship between CDOM and environmental variables in TSYaR and TSYeR

The concentration and composition of CDOM in different aquatic environments are mainly determined by factors such as soil type, vegetation type, runoff, rainfall, and human activities, which also indirectly affect biological sources such as aquatic plants, algae, bacteria, and microorganism degradation and secretion (Zhang et al. 2011b). Changes in land-use types in a river basin, such as agricultural farming, deforestation, and grassland degradation, which promote soil erosion, greatly increase the organic carbon output flux and the input of river humus (Shi et al. 2020). In our study, the soil type in TSYaR is mainly alpine steppe/frozen soils, and TSYeR is mainly alpine meadow soils; the vegetation coverage of TSYeR was significantly higher than that of TSYaR (Table S1). It is important to study the relationship between CDOM and water environment variables that play a crucial role in the carbon cycle of river ecosystems, the source and distribution of CDOM, and the impact of environmental factors on CDOM.

Some studies pointed out that CDOM responds well to climate change, trophic gradients, and anthropogenic disturbances, which are reflected in specific indexes such as storm events and increased precipitation, the trophic state index (TSI), and anthropogenic humic-like inputs (Ritson et al. 2014; Zhang et al. 2018; Zhou et al. 2019b). High turbidity limits the transmission of light in water and affects the photodegradation process of CDOM, which also indirectly affects the light absorption capacity of CDOM (Stedmon et al. 2007a; Zhang et al. 2011a). There were significant positive correlations between Turb and aCDOM(355) and SUVA254 in the two regions. In addition, it was found that the WT, TN, NH4+–N, aCDOM(355), and SUVA254 gradually increased with decreasing altitude, and there was a strong linear relationship between these variables (p < 0.05) in TSYeR (Fig. S1). This result is consistent with that between CDOM and nutrients in lake and river water (Hur and Cho 2012; Zhang et al. 2014). Water temperature affects the activity of microorganisms and the degradation of dissolved organic matter in water, further affecting the change in the concentration of CDOM (Zhou et al. 2019a; Boyd and Osburn 2004; Cole et al. 2011). The water source in TSYeR is mainly snow and melted ice from the north of Bayan Hara Mountain, rainfall, tributary inflow, and underground frozen soil water. Due to global warming, the rising temperature will inevitably lead to an increase in meltwater inflow in the source area and will also bring more external inputs through runoff. There were significant positive correlations between TP, nitrogen forms (TN, NH4+–N), and the CDOM absorption coefficients aCDOM(355) and SUVA254 in TSYaR and TSYeR, respectively. In view of this, CDOM may be an important source of nutrients for the growth of phytoplankton in water (Kowalczuk et al. 2009). NO3–N had significant correlation with HIX (r = − 0.540, p < 0.05) and BIX (r = 0.618, p < 0.01) in TSYeR (Fig. 7b). The activities of bacteria and microorganisms play an important role when they dominate the nitrification and degradation of organic matter.

The two regions were comprehensively studied, and the positive correlations between DOC concentrations and the corresponding fluorescence intensity indicated that humic-like fluorescence can adequately reflect the DOC concentration in rivers, which also indicated that the humic-like component was an important component of DOC (Fig. 7a, b; Fig. S2 [A, B]). There was a significant positive correlation between FI and EC in this study (Fig. 7a, b; Fig. S2 [C]). The reason may be that the structure and morphology of humic-like substances are affected by water ions, further affecting their fluorescence features (Chen et al. 2007; Zhao et al. 2017). Temperature plays an important role in the composition and spectral characteristics of CDOM by affecting some biochemical processes, such as nitrogen cycling and microbially mediated tundra soil carbon decomposition (Zhou et al. 2019a; Feng et al. 2020). In this study, WT and aCDOM(355) (r2 = 0.6416, p < 0.001) and SUVA254 (r2 = 0.5335, p < 0.001) showed significant linear correlations. The quality of the water environment was relatively good in this study area. For water bodies with heavy pollution and high trophic states or CDOM is mainly generated by internal sources, our results may produce some errors or uncertainties if applied. In addition, the potential weakness is that the number and frequency of samples in the study regions have certain influences and limitations on the research results. In the future, we will collect data at a higher frequency and a larger watershed scale and will also conduct in-depth research on the relationship and mechanism between CDOM and environmental variables under extreme events such as heavy rainfall and floods.

Conclusions

The CDOM absorption spectra of the source regions of the Yangtze and Yellow Rivers had a high degree of consistency, and the average absorption coefficient of TSYeR was significantly higher than that of TSYaR at wavelengths less than 643 nm (p < 0.05). Two fluorescence components (C1 and C2) were identified and grouped into one category: humic-like components using EEMs and the PARAFAC model; these components had highly similar (excitations/emission) max positions in the two regions. CDOM spectral absorption (SUVA254, S275~295) and fluorescence parameters (FI, BIX, HIX) indicated that the contribution of the allochthonous source to the CDOM of the two source regions was large and that the terrestrial humus input of the TSYeR was higher than that of the TSYaR. Multivariate analyses showed the relationship between CDOM spectral parameters and environmental variables. The positive correlations between DOC concentrations and fluorescence intensity indicated that humic-like fluorescence can accurately reflect the DOC concentration in rivers, and that the humic-like component was an important component of DOC in this study. There were significant positive correlations between TP, nitrogen forms (TN, NH4+–N), and aCDOM(355)  and SUVA254 in TSYaR and TSYeR, respectively. In particular, the WT gradually increased with the decrease in altitude in TSYeR, and due to global warming, the rising temperature may lead to an increase in meltwater inflow in the source area and will also result in more external organic inputs.