Introduction

Constructed wetlands are designed to remove pollutants from contaminated water within a more controlled environment by utilizing natural processes, including sedimentation, filtration, precipitation, volatilization, adsorption, plant uptake, and various microbial processes (Faulwetter et al. 2009). Constructed wetlands have several advantages over classical activated sludge systems, including low external energy requirements and operation costs, and lower—sometimes even the absence of—sludge production (Vymazal 2009; Ayaz et al. 2015). These advantages make constructed wetlands suitable for the treatment of various wastewaters (e.g., agricultural effluents, industrial wastewater, and aquaculture waters). Many studies have reported the removal of nitrogen, phosphorus, and chemical oxygen demand (COD) (Zou et al. 2009; Li et al. 2011; Zhang et al. 2015), but rarely have they studied changes in dissolved organic matter (DOM) in constructed wetlands.

DOM consists of a complex mixture of organic compounds and can be used to assess water quality (Coble 2007; Yamashita and Jaffé 2008). The presence and characteristics of DOM in domestic wastewater are of great interest with respect to constructed wetlands (Du et al. 2014), and its structure, composition, and relative abundance can influence its degradation capability in wastewater treatment processes. Characterization of DOM therefore plays an important role in understanding and managing constructed wetlands.

Spectroscopic techniques, especially fluorescence spectroscopy, provide information about the source and composition of DOM (Coble 1996; Stedmon et al. 2011; Guo et al. 2014). Synchronous fluorescence spectroscopy provides improved peak resolution and increased selectivity or structural signatures among the various DOM components, compared to the conventional fluorescence methods (Chen et al. 2003a, b). Fluorescence excitation–emission matrices (EEMs) are considered a useful tool for distinguishing among different types and sources of DOM in marine ecosystems (Lønborg et al. 2010; Kowalczuk et al. 2013), freshwater sources (Fellman et al. 2010), and water treatment systems (Baghoth et al. 2011).

Because of the heterogeneous nature of DOM and its complex chemical structure, however, significant spectral overlapping and peak shifting and broadening often occur, making the identification and interpretation of spectral signatures difficult. Therefore, various methods have been applied to the analysis of fluorescence spectra. The fluorescence index (FI) is calculated as the ratio of emission intensity at 450 and 500 nm, at 370-nm excitation, and has been used to distinguish microbially derived DOM sources from terrestrially derived sources (McKnight et al. 2001). The peak intensity ratios could provide additional information about the origin and structure of the biological macromolecules (Sheng and Yu 2006). More recently, fluorescence regional integration (FRI) has also been used to characterize DOM and to quantitatively analyze all the wavelength-dependent fluorescence intensity data from EEM spectra (Chen et al. 2003a, 2003b; Yu et al. 2011; He et al. 2011). In addition, parallel factor (PARAFAC) analysis identifies fluorescing components in the DOM of aquatic environments (Kowalczuk et al. 2009). The PARAFAC model may be an ideal technique for resolving the spectral overlapping of DOM. EEM-PARAFAC, combined with principal component analysis (PCA) scores, has shown that the differences in geology and associated land use control the dynamics of chromophoric DOM (Yao et al. 2011). Furthermore, two-dimensional correlation spectroscopy (2D-COS) has been applied to simplify complex spectra consisting of many overlapped peaks, and enhances the spectral resolution by extending peaks along a two-dimensional space (Noda and Ozaki 2004). 2D-COS allows one to easily identify the sequential order of any subtle spectral change arising from external perturbations (e.g., pressure, temperature, or concentration variations) over a two-dimensional correlation map. Hur and Lee (2011) utilized 2D-COS and synchronous fluorescence spectra to present the molecular heterogeneity of DOM fractions.

In this study, domestic wastewater was treated in five constructed wetland beds in series. The EEM spectra, combined with PARAFAC and FRI, were used to identify fluorescent (e.g., protein-like, fulvic-like, and humic-like) components, and to observe the resulting changes that occur in the DOM of constructed wetlands. The 2D-COS and synchronous fluorescence spectra can be used to analyze the sequential order of any subtle spectral change arising from external perturbations. Our objective was to evaluate the capability of these methods in detecting changes in fluorescent DOM fractions in constructed wetlands.

Materials and methods

Sample collection

Water samples for this study were collected from five constructed wetland beds in a series in the Ergou constructed wetland of the Fenghuang River. This constructed wetland system consists of a primary biological pond, primary plant gravel beds, a secondary biological pond, secondary plant gravel beds, and a sand plant filter. A constructed rapid infiltration system was used for the preliminary treatment of the constructed wetland. Wastewater passed through five wetland beds. Influent and effluent water samples were collected from a water intake (I1) and from five water outlets (E1–E5). Immediately after collection, the DOM samples were filtered through a pre-combusted (5 h at 450 °C) 0.45-μm PTFE membrane filter. All the samples were stored in the dark at 4 °C in pre-combusted amber glass vials until analysis. DOC concentrations of all the pre-filtered samples were determined using the combustion and non-dispersed infrared methods, utilizing a total organic carbon (TOC) analyzer (multi N/C 2100, Analytikjena, Germany).

Spectral measurements

Fluorescence EEM spectra were measured with a Hitachi F-7000 spectrofluorometer (Hitachi, Japan) equipped with a 150-W Xenon arc lamp as the light source. EEM spectra were obtained by scanning the emission spectra from 280 to 550 nm by incrementing the excitation wavelength at 5-nm intervals, from 200 to 450 nm. The spectra were recorded at a scan rate of 1200 nm min−1 using excitation and emission slit bandwidths of 5 nm. The fluorescence response to a neutral solution (Milli-Q water) was subtracted from the spectra of each sample, to eliminate Raman scattering effects (McKnight et al. 2001; Lawaetz and Stedmon 2009). Synchronous fluorescence spectra were recorded for samples with a constant offset (Δλ = 55 nm) between excitation and emission wavelengths, at 5-nm slit widths. The scan speed, spectral range, and response time were set to 240 nm min−1, 250–550 nm, and auto, respectively. Several different offset values were initially investigated. We chose to use Δλ = 55 nm because it obtained the most significant differences in the spectra DOM samples.

PARAFAC modeling

The PARAFAC method of analyzing EEMs has been described in the literature (Bro 1997). This analysis was carried out in Matlab 7.0 software using the DOMFluor toolbox. PARAFAC was applied to our EEM dataset (96 samples × 55 emissions × 47 excitations) with some analytical and statistical assumptions as reported by Stedmon and Bro (2008). PARAFAC models with three to seven components were computed for the EEMs. Determination of the correct number of components was based on split-half analysis, residual analysis, and visual inspection (Stedmon and Bro 2008). The sum of squared residuals in the excitation and emission scans is plotted in Supplementary Fig. S1. It seems clear that the step from six to seven component models offered little improvement in fit, suggesting that six or fewer components were adequate for this data (Supplementary Fig. S1). Split-half analysis compares the excitation and emission loadings of the models run on separate splits of the data using Tucker congruence coefficients (Lorenzo-Seva and Berge 2006) and indicates whether the model is validated. We found that six components can be split-half-validated and that they accounted for 99.0393 % of the variance. Therefore, six components could be identified using the PARAFAC models. F max then gave estimates of the relative abundance of each component.

Regional integration analysis

In this study, the EEM was divided into five regions: region I, tyrosine-like materials, excitation/emission (Ex/Em) <250 nm/<330 nm; region II, tryptophan-like materials, Ex/Em <250 nm/<380 nm; region III, fulvic-like materials, Ex/Em <250 nm/>380 nm; region IV, soluble microbial by-product–like materials, Ex/Em >250 nm/<380 nm; and region V, humic-like materials, Ex/Em >250 nm/>380 nm) (Chen et al. 2003a, b). By normalizing the cumulative excitation–emission area volumes to relative regional areas (nm2), the normalized area volumes were calculated according to the literature (Chen et al. 2003a, b). The P i, n value (I = I–V) was used to represent the area volume of each region.

Determination and analysis of 2D-COS

2D-COS has been widely applied to resolve peak overlapping problems and to enhance the spectral resolution by extending peaks along a second dimension (Noda and Ozaki 2004; Hur and Lee 2011; Xu et al. 2013; Li et al. 2014). The theoretical basis of 2D-COS has also been illuminated by Ozaki et al. (2001). Hence, the reconstructed spectral data sets (synchronous and asynchronous) were transformed into a new spectral matrix suitable for 2D-COS mapping, using the “2D Shige” software released by Kwansei-Gakuin University (Nakashima et al. 2008).

Statistical analyses

The correlation and ANOVA analyses were performed using SPSS 16.0 software. Correlation analyses were used to examine the relationships among variables. Pearson’s correlation coefficient (r) was used to evaluate the linear correlation between two parameters. Significance levels were reported as non-significant (NS; p > 0.05), significant (a, 0.05 > p > 0.01), or highly significant (b, p < 0.01). The linear regression model was used for all correlation analyses and was validated using analysis of variance (ANOVA).

Results and discussion

Fluorescence EEM spectra with regional integration analysis

The distribution of P i,n in the DOM samples from the regional integration analysis is listed in Table 1. The P i,n values suggested a dramatic decrease of fluorescent organic matter in the effluents compared to those in the influents, indicating that fluorescent organic matter was removed by the constructed wetland. However, Table 1 demonstrated that the P i,n values from regions I–V in the E2 sample were lower than those in the other effluents. The P i,n values from these regions in the E3 sample showed a slight increase over those in the E2. The removal of organic matter in constructed wetlands is complex and depends on a variety of removal mechanisms, including sedimentation, filtration, precipitation, volatilization, adsorption, plant uptake, and various microbial processes (Vymazal 2009). These processes generally directly and/or indirectly influence the production and degradation of DOM.

Table 1 Distribution of P i,n (×106, aunm2 [mg/L]−1) of DOM sample from regional integrity analysis

The P p values in the influent were markedly lower compared to the P h values (Table 1). Regions III and V suggested the presence of aromatic organic compounds, which have a higher degree of aromatic poly-condensation and chemical stability (Santos et al. 2010). Senesi et al. (2003) suggested that the broader excitation and emission bands, and the longer emission wavelength, in region V, were associated with humic substances composed of high-molecular-weight aromatic organic compounds. The ratio P h/P p can indicate changes in the DOM structure. P h/P p suggested a marked increase in the effluent DOM compared to that in the influent. The P V, n/P III, n value was used to assess the humification degree of the DOM. Similarly, the P V, n/P III, n values increased markedly in the effluent DOM compared to those in the influents, suggesting an increase in the humification degree.

Table 2 shows the changes in the removal ratios in the five constructed wetland beds. Fluorescent DOM indicated a marked decrease in the first constructed wetland bed. The removal ratios for regions I, II, and IV varied in the order P I, n > P IV, n > P II, n, suggesting that labile protein-like material (region I) was degraded easily by the constructed wetland beds. The removal ratios of P V, n with a high degree of aromatic poly-condensation and chemical stability were relatively lower than those of P III, n. Although P i, n values demonstrate an obvious increase in the E3 compared to those in the E2, they also can be removed sequentially in the subsequent wetland beds. The removal ratios of P i, n for all regions varied in the order P I, n > P IV, n > P II, n > P III, n > P V, n in the I1/E5, indicating that protein-like materials displayed higher removal ratios compared to the humic-like substances.

Table 2 The removal ratio (%) of P i, n in the constructed wetland beds

Changes in the EEM-PARAFAC components of DOM

Although regional integration analysis can be used to quantitatively analyze EEM spectra and to determine the configuration and heterogeneity of DOM, the traditional “peak picking” technique is problematic for interpreting the multi-dimensional nature of EEM datasets due to the overlapping of the various fluorescence peaks (Maie et al. 2007; Yamashita et al. 2010). The EEM spectra, combined with PARAFAC, however, provide higher resolution on EEM fluorescence components in DOM than the traditional peak picking technique. This approach was therefore used to detect variations in DOM composition in the constructed wetland. In this study, six individual components (C1–C6) were validated for this dataset using the PARAFAC model (Fig. 1), i.e., two protein-like substances (components 1 and 6), three humic-like substances (components 2, 3, and 5), and one non-humic component (component 4). Supplementary Figs. S2 and S3 show the excitation and emission loadings and the split-half analysis identified by PARAFAC models.

Fig. 1
figure 1

Six different fluorescent components identified by the PARAFAC model form the influent and effluent DOM in constructed wetland

As shown in Table 3, component C1 was composed of two peaks with the excitation maxima at 234 and 280 nm and the emission wavelength at 335 nm, which was related to an autochthonous tryptophan-like fluorescence. This component is a typical protein-like substance that can be observed in both marine (Coble 1996; Murphy et al. 2008; Kowalczuk et al. 2009) and terrestrial waters (Yao et al. 2011; Guo et al. 2014)

Table 3 Descriptions of the six components identified by PARAFAC analysis in this study

Component C2 exhibited primary (and secondary) fluorescence peaks at an excitation/emission wavelength pair of 250 (360) nm/440 nm. This component was also similar to the traditional terrestrial humic-like peak C (Coble 1996), whereas component C1 was expected to consist of compounds that are more hydrophobic and larger in molecular weight units (Ishii and Boyer 2012).

Component C3 had a primary fluorescence peak at an Ex/Em wavelength pair of 235 nm/410 nm and a secondary peak at 335 nm/410 nm, which were similar to the traditionally terrestrial fulvic-like fluorescence peak A (Coble 1996; Kowalczuk et al. 2009), and to the PARAFAC components of terrestrial humic-like materials (Murphy et al. 2008; Yuan et al. 2015).

A blue shift can be observed for C4, with respect to C2 and C3, fluorescing at a primary (secondary) excitation/emission wavelength pair of 245 (315) nm/375 nm. This component is similar to a non-humic substance and lies between tryptophan and an N-peak (Coble et al. 1998), which could be a combination of a tryptophan-like substance (peak T) (Coble 1996) and biologically labile matter produced in aquatic environments (Singh et al. 2010). C4 was reported as a non-humic component of freshly produced biologically labile material (Yamashita and Jaffé 2008) and a protein- or amino-acid-like component (Kowalczuk et al. 2009).

Compared with the humic-like components C2 and C3, component C5 suggested a marked red shift, composed of two excitation maxima, at 280 and 345 nm, both with the emission maximum centered at 500 nm. The spectral features were also similar to soil-derived humic acid originating from wetlands and marshes (Singh et al. 2010; Yuan et al. 2015). Stedmon and Markager (2005) have reported that this component could also be attributed to the leaching processes from agricultural catchments, given the long residence time of water in these catchments.

Component C6 showed two excitation maxima at 230 and 275 nm at 305-nm emission loading, which was categorized as the previously defined autochthonous tyrosine-like fluorescence peak B. This component was also similar to autochthonous protein-like PARAFAC components (Murphy et al. 2008; Yamashita and Jaffé 2008; Yao et al. 2011).

Figure 2 describes the changes in the PARAFAC-derived DOM components for the influent and effluents in the five constructed wetland beds. Levels of tryptophan-like C1 were highest in the influent DOM, compared to other components. However, protein-like C6 also obtained a high F max value. C3 was a dominant humic-like component and accounted for 19.29 %. Terrestrial humic-like C5 showed a markedly low F max value and only accounted for 8.45 %.

Fig. 2
figure 2

Changes of PARAFAC-derived DOM components for influent and effluents in the constructed wetland

As shown in Fig. 2, fluorescent components indicated a significant change in the influent DOM compared to the effluent DOM. However, the results were similar to those from regional integration analysis that fluorescent components showed an increase and/or a decrease in the effluents. A variety of removal mechanisms resulted in changes in the F max value. Table 4 shows the removal ratios of six components in the five constructed wetland beds. Through constructed wetland treatment, the removal ratios of fluorescent components were 56.21, 32.05, 49.19, 39.90, 29.60, and 45.87 % for components 1–6, respectively. In addition, C3 also indicated a relatively higher removal ratio than other humic-like substances, suggesting that humic-like substances at short wavelengths were degraded more easily than those at long wavelengths. Due to the presence of recalcitrant characteristics, humic-like components 2 and 5 showed low removal efficiencies in the I1 and E5 samples, of 32.05 and 29.60 %, respectively. The presence of recalcitrant humic-like substances and the production of fluorescent organic matters led to a total removal ratio of fluorescent components (∑) of only 43.92 %. The results suggest that most of fluorescent components were removed in the first constructed wetland bed.

Table 4 The removal ratio (%) of fluorescent component in the constructed wetland beds

2D-COS

A synchronous map is symmetrical around the main diagonal, and the correlation peaks will appear in the main diagonal (autopeak) and off-diagonal (cross-peak) areas. Figure 3 indicates that the synchronous map showed two autopeaks (280 and 338 nm). An autopeak suggests the overall susceptibility of the corresponding spectral region to the change in spectral intensity, as an external perturbation is applied to the system (Noda and Ozaki 2004). The intensities of the autopeaks decreased in the order 280 nm → 338 nm, suggesting that they were more susceptible to the changes in protein-like fluorescence intensity at 280 nm. Off-diagonal peaks in the synchronous map exhibited a positive correlation at 338 nm. The results demonstrated that fulvic-like substances at short wavelengths were more susceptible to such changes than humic-like substances at long wavelengths.

Fig. 3
figure 3

Synchronous and asynchronous 2COS maps generated from synchronous fluorescence spectra of DOM in the constructed wetland. Red represents positive correlation, and blue represents negative correlation; a higher color intensity indicates a stronger positive or negative correlation

The asynchronous map is anti-symmetrical around the diagonal line, showing no autopeaks and revealing the degree of the sequential changes at two different wavelengths (Hur and Lee 2011). Two positive and two negative peak areas were observed below the diagonal line (x 1) of the asynchronous map. A negative peak area was placed at the x 1 wavelength of 283 nm; this is an aromatic amino acid peak, indicative of a protein-like moiety in the DOM, specifically tyrosine (Barker et al. 2009). A positive peak at 298 nm is associated with tryptophan fluorescence (Yamashita and Tanoue 2003). Fulvic-like materials showed negative peak areas (310–360 nm) for x 1. A large positive peak area appeared at the x 1 wavelength of 403 nm and was related to humic-like substances. According to Noda’s rule (Noda and Ozaki 2004), the degree of the sequential changes for fluorescent DOM components followed the order 298 nm → 403 nm → 283 nm (310–360 nm) in the constructed wetland.

Spectroscopic indication of DOM evolution in the constructed wetland

The fluorescence indexes, FI, which are defined as the ratios of emission intensity at 450 and 500 nm at 370-nm excitation (McKnight et al. 2001), can be used as indicators of the relative contribution of DOM derived from terrestrial or microbial substances. FI values of 1 or greater (∼1.9 or higher) correspond to freshly produced DOM of biological or microbial origin, whereas values of 1.4 and lower reflect organic matter of an allochthonous (terrestrial) origin. The FI values showed a marked decrease in the E5 outlet-area sample compared to those in the I1 intake-area sample (Table 5), suggesting a decrease in microbially derived organic matter. This result also demonstrated the removal of fluorescent organic pollutants, especially for protein-like materials.

Table 5 Variation of fluorescence index and the ratio of fluorescence components

The untreated wastewater (I1) had high levels of protein-like components (C1 and C6) compared with those of the effluents (Fig. 2). The ratio of protein-like components to visible humic-like components can be used as an indicator for input wastewater organic pollution in various aquatic environments (Guo et al. 2011). The ratios of C1/C2 and C6/C5 in the E5 sample were obviously lower than the corresponding ratios in the I1 sample, indicating a reduction in organic pollution. The peak shifts in the excitation wavelengths from shorter to longer indicated an increase in molecule size and aromatic content.

The intensity ratios could provide additional information about the chemical nature of the biological macromolecules (Sheng and Yu 2006). The ratios of C2/C3 and C5/C3 were markedly lower in the I1 sample than in the other samples, suggesting that the humification of DOM increased in the effluent DOM. Table 6 shows that FI was negatively correlated with C2/C3, C5/C3, and P V, n/P III, n, indicating that microbially derived DOM (e.g., protein-like material) gradually decreased with the increasing humification degree. C1/C2 was correlated with C1/C5 (r = 0.968, p < 0.01). C6/C2 was correlated with C6/C5 (r = 0.962, p < 0.01). C2/C3 was correlated with C5/C3 and P V, n/P III, n (r = 0.963, p < 0.01, and r = 0.889, p < 0.05, respectively). The above results suggested that organic pollution decreased with increasing humification and chemical stability of the DOM.

Table 6 Pearson correlation between the intensity ratios and the FRI parameters

Conclusions

Fluorescence EEM combined with regional integration analysis was used to assess the changes in fluorescent DOM. This method could be used as a valuable research tool for detecting the qualitative removal potentials of different fluorescent DOM fractions. Protein-like materials show a higher removal ratio compared to humic-like substances. The total removal ratio of the fluorescent DOM fractions was 46.84 %. The EEM spectra combined with PARAFAC provides higher resolution on EEM fluorescence components in DOM than the traditional peak picking technique. The EEM-PARAFAC revealed high concentrations of degradable protein-like components (C1 and C6). The results also showed that humic-like substances at short wavelengths were degraded more easily than those at long wavelengths. 2D-COS was used to analyze the sequential order of subtle spectral changes arising from external perturbations and to identify the sequence of the removals for fluorescent DOM. The degree of the sequential changes in fluorescent DOM components followed the order 298 nm → 403 nm → 283 nm (310–360 nm) in the constructed wetland. Use of the combined intensity ratios may provide a quantification tracer for the changes in DOM fractions and chemical stability. The humification of DOM increased in the effluent DOM, and microbially derived DOM gradually decreased with increasing humification degree in the constructed wetland.