Introduction

Groundwater represents the largest store of unfrozen freshwater on earth (Aeschbach and Gleeson 2012). Influenced by increasingly intensive anthropogenic activities, such as inadequate fertilizer application or inefficient sewage disposal, many groundwater basins show increased levels of nitrate (Chaudhuri et al. 2012; Kaushal et al. 2011; Wang et al. 2017), resulting in water quality degradation (Elisante and Muzuka 2015; Rahmati and Melesse 2016). In addition, water eutrophication may occur, as well as negative impacts on human health (Camacho-Cruz et al. 2020; Wu et al. 2018), such as the “blue baby syndrome” and stomach cancer (Dan-Hassan et al. 2012; Knobeloch et al. 2000). Generally, NO3 concentrations above 13.39 mg L−1 indicate groundwater contamination as a result of human activities (Burkart and Kolpin 1993). The World Health Organization has established a nitrate value of 50 mg L−1 as the maximum allowed concentration (MAC) for drinking water (WHO 2003).

Nitrate, as the end product of nitrification process, is less bioavailable in soils (Lunau et al. 2012), and hence, nitrate contamination usually starts as a non-point source (Kaushal et al. 2011). Potential sources of nitrate are atmospheric deposition, soil N, manure, synthetic fertilizers, and wastewater from leaking sewerage networks or septic systems (Wang et al. 2017). Due to the complex transformation processes of nitrogen in aquifers, the most suitable approach to control nitrate contamination is identifying the potential sources and reducing the inputs (Ma et al. 2016; Wick et al. 2011). One method, the dual-isotope method (DIM, using δ15N–NO3 and δ18O–NO3), can be used to assess which sources are predominantly responsible for nitrate contamination in an aquatic environment (Deutsch et al. 2006) and has been successfully applied in recent years (Kim et al. 2015; Wang et al. 2017; Zhang et al. 2018a, 2018b). However, in complex environments with multiple nitrogen (N) sources, this method is limited (Clagnan et al. 2017; Minet et al. 2017). Although different N sources involved in terrestrial N cycling have characteristic and recognizable isotope compositions in water samples (Kendall et al. 2008), the origin of nitrate must be linked to the entire N cycle, since isotope composition of nitrate can be biased due to different nitrate sources and kinetic isotopic fractionation (Li et al. 2019; Soto et al. 2019). These processes include assimilation during photosynthesis, mineralization, nitrification processes, and microbial denitrification in anoxic environments. For example, in the early stage of nitrification in non-N-limited systems (e.g., a fertilized field), the newly formed NO3 can be 15N-depleted compared with the ammonia substrate due to the 15N enrichment factor εNO3-εNH4+, which ranges between −38 and −5‰ (Kendall et al. 2007). Also, nitrate assimilation during photosynthesis can cause large changes in the 1:1 relationship between the δ15N and δ18O of nitrate (Arce et al. 2014). As opposed to nitrification, denitrification mainly occurs in anaerobic environments, such as the hyporheic zone in rivers, with sufficient organic carbon and low oxygen concentrations (Huizenga et al. 2017). Isotopic enrichment caused by denitrification can cause an elevation of both δ15N–NO3 and δ18O–NO3, resulting in an δ15N–NO318O–NO3ratio of the residual nitrate along a slope of approximately 1:2. In addition, volatilization results in the enrichment of δ15N–NO3 when nitrate is derived from manure and sewage (Spalding et al. 2018a, 2018b

Different sources may have an overlapping range of δ15N–NO3 and δ18O–NO3, thus complicating the investigation of the causes of nitrate contamination. In such cases, calculation models such as IsoSource and SIAR have been used to quantify the contribution of different sources (Deutsch et al. 2006; Xue et al. 2009; Xue et al. 2012). The IsoSource program is based on the linear mixing model, which does not take the great uncertainty of N transformation process into consideration, such as the isotope fraction caused by nitrification, denitrification, and volatilization. The model uses the average isotope values from different sources, thereby ignoring the broad range of possible values of different sources (Liu et al. 2014). In this context, Xue (Xue et al. 2012) implemented a Bayesian isotope mixing model (SIAR) to determine the proportional contribution of different nitrate sources to six different surface waters. This program was developed based on the R language and can take more than three potential sources into consideration, regardless of the restrictions set by linear mixing model; it can calculate uncertainties at the same time and has therefore widely been used in recent years (Ding et al. 2015; Gautam and Iqbal 2010; Matiatos 2016; Meghdadi and Javar 2018a; Torres-Martínez et al. 2021). According to previous studies, there is no significant relationship between δ15N–NO3 or δ18O–NO3 vs. the NO3 concentration, suggesting that more than one biological process is responsible for the shifting of isotopic values (Archana et al. 2018). Therefore, additional indicators, such as major ions (e.g., SO42−, Cl, NO3, Br) and ionic ratios (e.g., Cl/Br, SO42−/Cl, and Cl/NO3), must be considered to better determine nitrate sources and transport processes (Murgulet and Tick 2013; Wang et al. 2013; Pasten-Zapata et al. 2014). As an addition factor, land use patterns play an important role in determining groundwater contamination (Taufiq et al. 2019), and the alteration of such patterns may have a direct impact on soil degradation (Sheikhy et al. 2017).

Geo-environmental surveys in the framework of the China Geology Survey (CGS) have revealed a deterioration trend in the environment of the shallow aquifer in the Nanyang Basin. In this area, groundwater quality of local aquifers directly affects water supply security of the South to North Water Diversion Project (SNWDP). However, studies investigating the N sources in this area are scarce (ChenMin-hua et al. 2011; Shi et al. 2012).

In this context, we focus on nitrate contamination of the Nanyang Basin, taking into account land use pattern analysis, the dual-isotope method (DIM), major ion analysis, and the Bayesian mixing model. The objectives of this study are as follows: (1) to determine the characteristics and distribution of nitrate contamination in the Nanyang Basin, (2) to combine DIM and hydrogeochemical characteristics of the shallow groundwater in the Nanyang Basin to identify the main processes controlling nitrate evolution, and (3) to perform source apportionment of nitrate contamination using SIAR. We provide a scientific basis for water quality management, with the aim to ensure the long-term water supply of the SNWDP.

Material and methods

Study area

The study area is located in the Nanyang Basin, Henan Province, in the most densely populated area in China. It covers an area of 18,242 km2 (latitude between 31°50′ 33″ N and 33°20′ 25″ N, longitude between 111° 36′ E and 113° 34′ 45″) and is an important granary area in China. The origin of the middle route of the SNWDP, Danjiangkou Reservoir, is located on its middle western side (Fig. 1). Apart from the southern outlet, Nanyang Basin is enclosed by mountains on three sides. Dozens of medium-small creeks flow out of the valleys, constituting the river net in the basin and finally converging to local main rivers in the southern central plain of the Nanyang Basin. The area has a subhumid climate with an annual precipitation of 826.7 mm and an annual mean evaporation of 1449.2 mm (ChenMin-hua et al. 2011), and the annual mean temperature of study area is 16.8 °C (Du et al. 2017). Anthropogenic activities, such as agriculture and livestock breeding, have intensified in recent years.

Fig. 1
figure 1

Location of the Nanyang Basin in China (upper left), relation between population and GDP since 1952 (lower left), DEM, nitrate sample location, and nitrate concentration levels (center). Statistical data were obtained from the Nanyang Statistical Yearbook; the concentrations of nitrate are based on the groundwater quality standard of China

Neotectonic movement since the early Pleistocene has largely been determining the landform and sedimental environment of the Nanyang Basin, thus controlling the characteristics of the main aquifers (Deng et al. 1996). The main aquifers of the basin can be divided into two groups: the upper aquifer (shallow aquifer) is unconfined or semi-confined, composed of Quaternary strata from the middle Pleistocene to the Holocene, lies at a depth of 10 to 50 m, and mainly consists of silty sand to coarser gravel. The second group is a confined aquifer at a depth of less than 200 m, composed of Neogene and early Pleistocene sediments. There is an aquitard between the two groups, mainly consisting of clay and silt, which impedes contaminant leaching to some extent. The shallow aquifer of the Nanyang Basin is mainly recharged by precipitation and the lateral flow from piedmont areas surrounding the basin and is the primary water source in Nanyang Basin. In recent years, due to the increasing water demand, both surface water and groundwater level continue to decline, and the interaction between surface and groundwater becomes more complicated especially during dry season. The southern outlet of the basin and the main channel of local water, going north–south through the center of the study area, are sinks for both groundwater and surface water (Fig. 1). Most of the groundwater supply is pumped from the alluvial deposit, which is rich in gravel and coarse sands, in the first and second terraces along both sides of the plain.

China is the world’s greatest producer and user of fertilizers, applying about 31% of the total amount of fertilizers used worldwide (FAO 2020; Su et al. 2013). Agricultural fertilization is common in the study area, which may result in severe nitrate contamination of the local rivers and aquifers (Ju et al. 2006; Wang et al. 2015; Piske and Peterson 2020). In the last 10 years, chemical fertilizers, such as urea and a NPK compound fertilizer, have typically been applied at rates of approximately 750–850 kg ha−1 year−1 for winter wheat and summer maize, whereas in the 1980s, rates were only about 200 kg ha−1 year−1. Although the Nanyang Basin is densely populated, its infrastructure is relatively undeveloped, and a part of the sewage is directly discharged into sinkholes or rivers, aggravating nitrate contamination of the groundwater.

Since land use patterns can have an impact on non-point source pollution, we analyze land use types via ENVI 5.3, using remote sensing image data for 2015 (Landsat OLI_TIRS8) (Fig. 2). The areas of different land use types in the study area are summarized in Supplemental Material (Table S2). Based on remote sensing analysis, the land use types in the study area were farmland (79.84% of total study area), residential area (12.82%), forest (4.1%), and surface water (2.77%). The area of other types such as grass land and bare land is negligible.

Fig. 2
figure 2

Land use pattern of the study area in 2015, the arrangement of the four sections used for hydrogeochemical analysis, and sampling locations along these sections

Field survey, sampling, and analysis

Filed survey and sampling

Field survey and sample collection were conducted in November 2016 for the entire basin and in June 2017 for a typical nitrate-contaminated area in the Nanyang Basin (see “Stable isotope composition of nitrate” for details). At each sampling point, depth to groundwater, electrical conductivity (EC), pH, dissolved oxygen (DO), and oxidation–reduction potential (ORP) were measured in situ (HQD 30, Hach, USA); groundwater depth was measured using a water level meter (model 102, Solinst, USA). In total, 160 water samples were collected, including surface water, shallow aquifer water, and deep aquifer water. Figure 1 shows the location of each sample and its nitrate concentration. Except for NO3, the major ions in water (Na+, K+, Ca2+, Mg2+, NO2, NH4+, SO42−, HCO3, CO32−, Fe, and Mn) were analyzed for each sample. Water samples were collected in 500-mL acid-washed polyethylene bottles, using a high-density polyethylene sampler, filtered through 0.45 μm membranes, transported to the laboratory, and stored at 4 °C prior to analysis (maximum storage time: 1 week).

A typical nitrate-contaminated area was selected based on data from the analysis at the whole-basin scale (n = 28). For these 28 samples, according to judgement getting from the inquiries to residents during field survey that whether the sampling sites are potentially threatened by nitrate contamination, 12 of them were selected to analysis the ratios of stable isotopic N and O of NO3, as well as the composition of δD-H2O and δ18O-H2O (5 samples).

Sample analysis

In the laboratory, the concentrations of anions (Cl, SO42−, NO3, NO2) were measured by spectrophotometry (Perkin-Elmer Lambda 35, USA) with a precision of 5%. The analysis of cations (Na+, K+, Ca2+, Mg2+, Fe, Mn) in the water samples was performed via inductively coupled plasma-mass spectrometry (Agilent 7500ce ICP-MS, Tokyo, Japan). NH4+ was analyzed by Nessler’s reagent spectrophotometric methods with a precision of ± 2%. Total alkalinity (as HCO3) was determined by titration with standard 0.1 N hydrochloric acid, using methyl orange and phenolphthalein as indicators, with a precision of ± 5%. All major ion results were accepted only when the charge balance error was within ± 6.5%, and many were better than ± 3% (Table S1 in Supplemental Material). These measurements were performed at the Groundwater Mineral Water and Environmental Monitoring Center at the Institute of Hydrogeology and Environmental Geology, Chinese Academy of Geological Sciences.

The values of δ2H-H2O and δ18O-H2O were analyzed by the Key Laboratory of Groundwater Sciences and Engineering, Ministry of Natural Resources, China. The δ18O value was determined using CO2–H2O equilibration mass spectrometry, and the δD value was determined using H2–H2O equilibration-mass spectrometry under the catalysis of platinum, followed by analysis with a MAT253 gas isotope mass spectrometer. The isotope results were reported as relative (per mil) deviation from the isotopic ratio of the sample, using a laser absorption water-vapor isotope analyzer (Picarro-i2120, CA, USA).

The measurements of δ15N-NO3 and δ18O-NO3 were carried out at the Chinese Academy of Agricultural Sciences, using the denitrified method based on the isotopic analysis of the nitrous oxide (N2O) produced by the denitrifying bacterium Pseudomonas aureofaciens (Sigman et al. 2001). The produced N2O was concentrated and purified on a trace gas system, and the isotopic composition was determined using an isotope ratio mass spectrometer (Thermo-Finnigan, USA). During the measurement of δ15N, USGS34 potassium nitrate (KNO3, δ15N = − 1.8 ± 0.2‰) and USGS32 KNO315N = + 180 ± 0.1‰), standards were used to correct the values obtained. For δ18O measurement, the samples were referenced using USGS34 (δ18O = + 27.9 ± 0.6‰) and USGS35 sodium nitrate (NaNO3, δ18O = +57.5 ± 0.6‰) standards.

The stable isotope ratios are expressed in delta (δ) units and a per mil (‰) notation relative to an international standard. The δ15N values are reported relative to N2 in atmospheric air (AIR), whereas the δ18O and δD values are reported relative to Vienna Standard Mean Ocean Water (VSMOW). The uncertainty (2σ) values for δ18O-H2O and δD-H2O were ± 0.1 and ± 1‰, respectively. The precision for δ15N-NO3 is 0.4‰, whereas that for δ18O-NO3 is 1.6‰ (VSMOW). All statistical analyses were carried out using the software package SPSS 26 (IBM Corp. Released 2019). The isotopic values (δ notation) were calculated via Eq. (1):

$$ \delta\ sample\ \left({\mbox{\fontencoding{U}\fontfamily{wasy}\selectfont\char104}} \right)=\left(R\ sample-R\ standard\right)/R\ standard\times 1000, $$
(1)

where R = D/H, 15N/14N, or 18O/16O. The 15N/14N values were reported relative to atmospheric air, and the D/H and 18O/16O values were reported relative to Vienna Standard Mean Ocean Water.

Bayesian isotopic mixing model

Based on qualitative deduction using the values of δ15N-NO3 and δ18O-NO3, the contributions of groundwater nitrate sources were quantified using the Bayesian isotopic mixing model (Parnell et al. 2010; Yang et al. 2013). Four potential sources were taken into consideration, based on the results of field survey and land use analyses: sewage and manure (SAM), atmospheric deposition (AD), synthetic fertilizer (SF), and soil nitrogen (SN). The basic equations of SIAR are as follows:

$$ {X}_{ij}={\sum}_{k=1}^k{P}_k{q}_{jk}\left({S}_{jk}+{C}_{jk}\right)+{\upvarepsilon}_{jk} $$
(2)
$$ {S}_{jk}\sim N\left({\mu}_{jk},{\omega}_{jk}^2\right) $$
(3)
$$ {C}_{jk}\sim N\left({\lambda}_{jk},{\tau}_{jk}^2\right) $$
(4)
$$ {\upvarepsilon}_{jk}\sim N\left(0,{\sigma}_j^2\right) $$
(5)

where Xij is the isotope value j of the mixture i, in which i = 1,2,3,…, N, and j = 1, 2, 3,…, J; Sjk is the source value k on isotope j (k = 1, 2, 3,., K) and is normally distributed with mean μ and standard deviation ω; Pk is the proportion of source k which needs to be estimated by the SIAR model; Cjk is the fractionation factor for isotope j on source k and is normally distributed with mean λ and standard deviation τ; and εij is the residual error representing the additional unquantified variation between individual mixtures and is normally distributed with mean = 0 and standard deviation σ.

Multivariate data analysis

Due to the distribution of most chemical data failed to meet the assumption of normality for parametric statistical analyses, the non-parametric Kruskal–Wallis test was used to analyze the differences of major ions concentration and isotopic compositions among different land use type (Zhang et al. 2018a; Torres-Martínez et al. 2021). The linear regression analyses and Spearman correlation coefficient were also used to determine the relation between major ions (Menció et al. 2016). All statistical analyses were supported by SPSS version 26 and Microsoft Excel.

Results

Hydrochemical characteristics of groundwater

Table 1 shows the descriptive statistics of major ions and in situ measurements. The concentration of NO3 ranged from below the detection limit to 1001 mg L−1, with an average of 102.95 mg L−1. The standard deviation of nitrate was 159.89 mg L−1, which was the second highest among all ions. The average levels of Ca2+ and HCO3 were 129.08 and 325.88 mg L−1, respectively; they were the main ions. Evapotranspiration is not the main factor determining water chemistry (Gibbs diagrams as Fig. S1.a and Fig. S1.b in Supplemental Material). Therefore, the values of EC, TDS, Na+, and Cl are likely to be influenced by water–rock interactions and anthropogenic activities. The standard deviations of EC and TDS were 816 μS/cm and 554.25 mg L−1, respectively, suggesting point source contamination, such as sewage outfalls or septic tanks, may greatly influence the groundwater compositions near the corresponding sampling sites. This assumption is supported by the statistics of Cl and Na+, with average values of 87.20 and 55.15 mg L−1. However, due to farmland being the main land use type, the application of fertilizer is ubiquitous in the study area. Whether the point source and non-point sources coexist and the relationship between them remain for further discussions.

Table 1 Descriptive statistics of major indices for different land use types in the Nanyang Basin in China

According to the analysis of all 160 samples, Ca-HCO3 was the most representative chemical compound of groundwater in the Nanyang Basin (38%), followed by Ca-HCO3-Cl and Ca-Na(K)-HCO3 (11.85% and 11.25%, respectively) and by Ca-Mg-HCO3 (8.12%). Other types of compounds were combinations of the above ions (Table S3). The quantity of each type was less than 7, probably as a result of strong mixing effects and anthropogenic activities. We generated a Piper diagram to highlight the different water facies in the study area (Fig. 3). The proportion of Mg2+ was generally stable for both shallow and deep aquifers. In most of the samples from the shallow aquifer, Ca2+ was the major cation and the Na+ was mostly below 40%. Generally, HCO3 was the major anion, although in some samples from the residential area, Cl was abundant, suggesting point source pollution. The sample distribution of the deep aquifer was more dense than that of the shallow aquifer; Ca2+ and HCO3 were also the major ions.

Fig. 3
figure 3

Piper diagrams of different land uses in shallow and deep aquifers. (a) Piper diagram for samples of the shallow aquifer and surface water, (b) Piper diagram for samples of the deep aquifer

According to Table 2, TDS and EC highly correlate with almost all ions except NO2. Ca2+ showed the highest correlation with TDS and EC. Although HCO3 had higher mean concentrations compared to SO42− and Cl, its correlations with the main cations, such as Ca2+, Mg2+, and Na+, were all weaker than those with SO42− and Cl. Nitrate was strongly correlated with TDS, Ca2+, Mg2+, and Na+.

Table 2 Correlation matrix of major ions in groundwater of the Nanyang Basin in China

Nitrate concentrations in different land use types

The average concentration of NO2 was 0.05 mg L−1, far below the average of NO3. Although we determined NH4+ for each sample, the values are not included in Table 1 as they were mostly below the detection limit. Hence, nitrate was the major form of dissolved inorganic nitrogen in the study area.

Based on our results, Cl was significantly correlated with NO3. Samples in different land use had significantly different NO3 and Cl concentrations (p = 0.008 and 0.049, respectively). Samples of surface water had the lowest concentrations and the narrowest distribution. Samples from the residential area had the highest concentrations of Cl and NO3, with average levels of 114.43 mg L−1 and 162.83 mg L−1, respectively (Fig. 4), and the highest number of outliers. Farmland had the second highest NO3 concentrations, ranging from below the detection limit to 784 mg L−1, with an average of 75.71 mg L−1 (Table 1). NO3 concentration between farmland and residential area was significantly different (p = 0.017); however, difference of Cl concentration between these two groups was insignificant (p = 0.072). Forest samples showed concentration range similar to that of farmland samples, with NO3 and Cl concentrations ranging from 10.7 to 99.9 mg L−1 and 33.0 to 171.9 mg L−1, respectively. Further details and discussions between Cl and NO3 were included in “Relationships between hydrochemistry and nitrate sources.”

Fig. 4
figure 4

NO3 and Cl concentrations of different land use types in the Nanyang Basin in China

Different parts of the Nanyang Basin show different hydrogeological conditions, population densities, and agricultural and industrial structures. Because of this, we selected four sections (Fig. 2). Sections A and B ran through the entire study area from north to south, converging with each other at A-10 (B-13). These two sections represented the main channel of the Tuan River and the Bai River, respectively, and are both tributaries of the Tangbai River. Sections C and D were east–west sections; the starting point of section C (e.g., C-1) was near the Danjiangkou reservoir, the origin of the SNWDP.

According to Table 3 and Fig. 5, for two north-to-south sections (sections A and B), the average NO3 levels did not exceed the average of the entire study area. Section A had the lowest average values for both NO3 and Cl. The relatively low concentrations of ions alongside major river indicate surface water mixing and, consequently, the dilution of the ion concentrations in the groundwater. Hyporheic zones, which are located on both sides of the river channel, may have also significantly contributed toward the reduction of nitrate through denitrification (Kim et al. 2018; Meghdadi and Javar 2018b). For sections C and D, which run through the study area from east to west, with a small amount of water bodies, the concentrations of NO3 and Cl were higher, most likely because of the absence of dilution caused by hyporheic zones and surface water (Huizenga et al. 2017). We collected nine samples from section D, and their mean concentrations were the highest among all four sections. However, according to Fig. 5d, the concentrations of NO3 and Cl for D-2 were abnormally high, suggesting the existence of strong point source contamination sample. For the other eight samples, D-1 had the highest concentrations, with 125.03 and 86.85 mg L−1 for NO3 and Cl, respectively. The values for D-3 to D-9 were below the overall mean values.

Table 3 NO3 and Cl average concentrations of different land use types along the four sections
Fig. 5
figure 5

Ion concentrations along the four hydrochemical sections. The red and black lines indicate changes in NO3 and Cl, respectively, along each section

Stable isotope composition of nitrate

As shown in Fig. 6, we collected 28 water samples from the selected nitrate-contaminated area to measure the concentrations of the main ions and to analyze isotopic composition (Table S4). Similar to the data obtained for the whole basin, Ca2+ −and HCO3 were the major cations and anions, and there were no significant differences for these two ions between two sampling campaign (p = 0.092 and 0.596, respectively). Cl and NO3 showed similar ranges for these two sampling campaign (p = 0.967 and 0.211 for Cl and NO3, respectively). The average Cl and NO3 levels are 74.60 ± 78.73 mg L−1 and 102.97 ± 101.59 mg L−1, respectively (mean ± SD, detailed data are shown in Table S4).

Fig. 6
figure 6

Location of the selected typical nitrate-contaminated area in in the Nanyang Basin

According to Fig. 9, the stable isotope composition of nitrate ranges from 3.88 to 18.64‰ (average 10.92 ± 3.63‰) for δ15N-NO3 and from − 0.43 to 7.09‰ (average 3.42 ± 1.97‰) for δ18O-NO3.No simple correlations were observed between δ15N–NO3 or δ18O–NO3 versus concentrations of NO3 (Fig. S2 and Fig. S3 in Supplemental Material).

Discussion

Relationships between hydrochemistry and nitrate sources

The Gibbs plot identifies the influences of water–rock interactions, evaporation, and precipitation on water chemistry (Feth and Gibbs 1971). Most samples were taken from the water–rock interaction zone (Fig. S1.a and Fig. S1.b). This finding was consistent with the local meteorological and hydrogeological features. However, some samples were outside the typical ranges, indicating the influence of anthropogenic activities.

The contributions of water–rock interaction and anthropogenic activities to groundwater could be further differentiated by the [Na++K+]/[HCO3] versus [Ca2++Mg2+]/[HCO3] diagram (Fig. 7). Water samples close to the [Na++K+]/[HCO3] = 0 and [Ca2++Mg2+]/[HCO3] = 1 were affected by the weathering of carbonate minerals, whereas samples along the [Ca2++Mg2++Na++K+]/[HCO3] = 1 line suggest contribution of the dissolution of silicate minerals. According to Fig. 7, a large part of the samples was located in quadrant B, related to groundwaters with an excess of cations (Ca2++Mg2++Na++K+) over HCO3, which cannot be ascribed to any of the abovementioned water–rock interactions. Therefore, anthropogenic activities can explain such cation enrichment (Buccianti et al. 2014; Gibrilla et al. 2020). This leads us to infer that NO3, K+, Na+, and Cl are mostly derived from agricultural fertilizers, animal waste, and municipal and industrial sewage, altering the chemical composition of the water.

Fig. 7
figure 7

Binary plot as [Na+ + K+]/[HCO3] vs. [Ca2++Mg2+]/[HCO3] as equivalent ratios

Elevated concentrations of Na+ and Cl are usually a result of the sewage and manure (SAM) inputs, although some synthetic fertilizers may also contain trace amounts of Cl. The addition of potassium chloride or calcium chloride to synthetic fertilizer has also been reported (Bakeer 2016; Oberhelman and Peterson 2020). The dissolution of halite could be another potential reason for the increasing of Na+ and Cl, which would cause Na+/Cl molar ratio to be 1:1.

The application of deicing agents in the study area could be eliminated as a potential source of Na+ and Cl in the study area since the annual mean temperature is 16.8 °C and the main land use type in our study area is farmland (Fig. 2 and Table S2). According to Fig. 8b, for samples in residential area, with the increasing of Na+ concentration, more samples were located above the 1:1 line (R2 = 0.78, p < 0.01), suggesting there were additional Cl compared with Na+. The potential source for excessive Cl should be sewage and manure since Na+ was rarely used as synthetic fertilizer (Roy et al. 1999). Samples in forest had an almost 1:1 Na+/Cl molar ratio (R2 = 0.79, p < 0.01), indicating that the dissolution of halite was the main cause of Cl in the relatively natural conditions. The slope of linear fitting line for farmland was also close to 1 (R2 = 0.78, p < 0.01), suggesting the applied of chloride contained fertilizer was limited and did not influence the concentration of Cl in the study area. According to the distribution of all samples, it was noteworthy that Na+ was more than Cl when concentration was less than 4 mmol/L. It was reported that Na+ is a major component of N containing organic effluents (Minet et al. 2017; Guo et al. 2020). Furthermore, the incongruent dissolution of Na-K silicate minerals can also cause the excess of Na+ relative to Cl (Buccianti et al. 2014).

Fig. 8
figure 8

(a) Scatter plot of NO3/Cl molar rate vs. Cl mmol concentration; (b) scatter plot of [Cl] vs. [Na+]. Note that the linear regression lines with linear fitting equations in same color were shown for samples in different land use types. And the black line denoted the dissolution of halite

Chloride in the water can indicate sewage and manure inputs. Due to the impacts of anthropogenic sources of nitrogen (Table 2), a significant positive correlation (p < 0.01) was observed between Cl and NO3 (r = 0.756). Therefore, the NO3/Cl method can provide more information to help distinguish the effect of N removal processes by dilution from denitrification (Chen et al. 2014; Widory et al. 2005). Domestic effluent is characterized by a low NO3/Cl ratio and a high Cl concentration (Lu et al. 2015). On the contrary, samples influenced by synthetic fertilizer would have high NO3/Cl ratios with relatively low Cl concentrations. With little impacts from water–rock interactions, the NO3/Cl method was suitable to distinguish the influences of SAM from those of SF. The average NO3/Cl ratio levels for residential area and farmland are 0.83 and 0.71, respectively (Table S5). Although samples in residential area and farmland had similar NO3/Cl levels, both ion concentrations were larger in residential area than in farmland (Table 1), suggesting the mixing of the shallow aquifer. Compared with samples of surface water, the average NO3/Cl ratio was 0.37, indicating that SAM was the main nitrate source for surface water. Furthermore, many samples were concentrated in the lower left part of Fig. 8a. The low NO3/Cl ratio with low Cl concentration indicates that these samples were probably influenced by soil nitrogen (SN) (Guo et al. 2020; Torres-Martínez et al. 2021). The mixing of different nitrate sources and high nitrification rate may also cause this situation.

According to the results of our field investigation, synthetic fertilizers are widely used in the study area. In contrast, the use of organic fertilizers is limited (Nanyang Municipal Government 2019). Nitrate was the main form of DIN, and high levels of NO3 in groundwater are often attributed to NO3 transport via leaching (Li et al. 2019; Meybeck 1982). Nitrate does not form any insoluble minerals that could precipitate or are easily adsorbed via aquifer pores. For most of the samples, the pH ranged between 7 and 8, whereas the DO was in the range of 4.3 to 12.4 (Table 1). Most surface water samples had a pH above 8 and high DO values (10.11 mg L−1). During the leaching process, with decreasing DO and pH levels, the concentration of NO3 increased, indicating that the shallow aquifer had acidic conditions.

Nitrate dynamics and source identification using isotope data

Most of the samples were within the range of SAM (typically from 5.2 to 25‰), and some were in the overlapping zone between SN and SAM, demonstrating the mixing effects of different sources. Only one sample was located in the overlapping zone of synthetic fertilizer and soil N (Fig. 9).

Fig. 9
figure 9

Relationship between δ18O- NO3 and δ15N- NO3 in a selected part of the study area. The range of isotopic composition derived from (Ding et al. 2015), (Xue et al. 2009) and (Yang et al. 2013)

The δ18O-NO3 level of nitrate derived from microbial ammonium nitrification can be calculated by Eq. (6), where the expected δ18O-NO3 is only the expected value when nitrification occurs. Here, δ18O-H2O is the δ18O value in the water and δ18O-O2 is the δ18O value in the air. Although the calculation using Eq. (6) can lead to unreliable results, it is generally acknowledged that the NO3 produced by nitrification has distinct δ18O–NO3, determined by the δ18O of H2O and by atmospheric O2 (Hinshaw et al. 2020). In theory, two oxygen atoms of the newly formed NO3 are derived from H2O oxygen, and the third oxygen atom is incorporated from atmospheric O2 (Mayer et al. 2001). Five samples of δ18O-H2O were measured in our study, resulting in an average value of − 7.22‰. According to a previous study near the Nanyang Basin (Du et al. 2017), the δ18O-O2 resulting from atmospheric O2 is 23.5‰. Based on the above assumptions, the theoretical δ18O-NO3 line was plotted (Fig. 9). Equation (6) is as follows:

Expected δ18Onitrate = 1/3 δ18O − O2 + 2/3 δ18O − H2O (6)

Except for the two samples with the highest and lowest δ15N-NO3 values, other samples were located near the line of estimated nitrification. The range of δ18O-NO3 also suggests high bioactivity in the unsaturated zone. Most of the nitrate in the study area was a result of microbial activity, with nitrification being the dominant leaching process (Minet et al. 2012). Although synthetic fertilizers are widely used, no samples fell into the “NO3- fertilizer” category.

Due to the mineralization-immobilization turnover (MIT) process, there is a complex exchange mechanism between δ18O-H2O and δ18O-O2. Large gross rates of NO3 immobilization by bacteria are followed by rapid remineralization (Mengis et al. 2001), and the hypothesis that crops preferentially take up nitrate while ammonia remains for nitrification would mask the original high δ18O-NO3 values (Roadcap et al. 2002). For the above reasons, the composition of δ18O-NO3 is not identical to that concluded based on laboratory conditions (Kendall et al. 2000). These findings suggest a possible limitation of DIM in identifying the contribution of nitrate fertilizers (Minet et al. 2012; Spalding et al. 2018a, 2018b).

According to Fig. 3, Ca2+ was the main cation in groundwater. By the analysis of the relationship between [Ca2+ + Mg2+] – [HCO3 + SO42−] and [Na+ − Cl] (Fig. S4 in Supplemental Material), there were potential additional sources of Ca2+ and Mg2+ besides the dissolution of carbonate mineral (e.g., calcite/dolomite) and evaporite (e.g., gypsum and anhydrite). Therefore, besides the strong water–rock interactions identified by Gibbs diagram (Fig. S1 and Fig. S2 in Supplemental Material), Eqs. (7) and (8) could represent the influences caused by nitrification to hydrochemical evolution due to anthropogenic activities (Kim et al. 2009). Equations (7) and (8) are as follows:

$$ \mathrm{N}{\mathrm{H}}_4^{+}\left(\mathrm{reducedN}-\mathrm{fertilizers}\ \mathrm{and}\ \mathrm{manure}\right)+2{\mathrm{O}}_2\to \mathrm{N}{\mathrm{O}}_3^{-}+2{\mathrm{H}}^{+}+{\mathrm{H}}_2\mathrm{O} $$
(7)
$$ \mathrm{C}\mathrm{aO}\left(\mathrm{lime}\right)+2{\mathrm{H}}^{+}\to \mathrm{C}{\mathrm{a}}^{2+}+{\mathrm{H}}_2\mathrm{O} $$
(8)

Denitrification is a nitrate removal process which causes δ15N-NO3 and δ18O-NO3 to increase linearly over a wide range of values along a slope between 0.5 and 0.8 (Granger and Wankel 2016). However, the flat slope of δ18O-NO3 vs. δ15N-NO3 in Fig. 9 was not indicative of isotope enrichment during denitrification. According to Table 1, the average DO was 8.23 mg L−1. At DO levels above 2 mg L−1, the aerobic environment will suppress denitrification, and oxygen will be the first electron donor instead of nitrate. In addition, the practical absence of Fe and Mn (mean concentrations of 0.50 and 0.16 mg L−1, respectively) also indicates that denitrification was not significant throughout the study area. For samples from the residential area, the availability of organic carbon in domestic effluent and the leaching of manure or slurries may provide sufficient organic carbon, acting as electron donor to facilitate denitrification in the shallow aquifer (Atekwana and Geyer 2018). Based on our results from the residential area, denitrification may be a secondary process, causing a slight enrichment in δ15N-NO3 and δ18O-NO3.

Based on the distribution of all 12 samples, with relatively stable compositions of δ18O-NO3, δ15N-NO3 levels were clearly elevated. The stable composition of δ18O-NO3 suggests that nitrate produced by nitrification was the main nitrate source in the study area. The mixing of different sources between SAM and SF caused the wide distribution of δ15N-NO3, and the influences of domestic effluent were greater than those of agricultural activities. In this sense, the leachate from SAM may cause δ15N-NO3 enrichment of the underlying water due to the lateral flow of groundwater (Pasten-Zapata et al. 2014). Organic nitrogen and ammonium are the major nitrogen forms in SAM (Ma et al. 2019; Minet et al. 2012), and both are easily mineralized into nitrate during the leaching process. At the same time, volatilization is the main mechanism that enriches δ15N-NO3 in SAM to values larger than 10‰. As a result of the mixing between different sources, δ15N-NO3 levels of samples in farmland were also increased.

Three samples fell into the category “soil organic N”. However, considering the high average concentrations of nitrate and chloride in groundwater, it was unrealistic to hypothesize that soil nitrogen alone could cause current level nitrate contamination (Pasten-Zapata et al. 2014). Furthermore, mineralization-immobilization turnover in the unsaturated zone will also increase the contents of organic N. Samples enriched with δ15N-NO3 also had higher chloride concentrations (Table S4), proving the contribution of SAM, which was larger than that of SAM. Once discharged into the natural environment, ammonia is volatilized, causing the enrichment of δ15N-NO3 (Lasagna and De Luca 2019).

Nitrate source apportionment using SIAR

Calculation of the source contributions in the general isotope mixing model is also a significant issue, because the isotopic ranges and values of different end-member sources are uncertain and overlap (Kim et al. 2015). To overcome this problem, this study applied a stable isotope mixing model with the Bayesian framework (SIAR). The results of SIAR are the probability distributions for each source. With the elimination of denitrification (Fig S2 and S3) and relatively limited measurements of nitrate isotopes, the fractionation factor in Eq. (4) is set to zero.

According to field survey and Fig. 9, AD, SAM, SF (ammonium fertilizer), and SN are regarded as four potential nitrate sources. The reasons why nitrate fertilizer was not regarded as potential nitrate sources were as follows: first, the most common types of nitrogen fertilizer used in China including urea, ammonium carbonate, and urea (Ding et al. 2015; Zhang et al. 2018b) and nitrate fertilizer are limited used. Second, nitrate is easier to move with infiltrating water than other nitrogen form; if nitrate fertilizer was widely applied in the study area, δ18O-NO3 of groundwater samples collected from farmland should be larger than 20‰. According to Fig. 9, our data did not verify this situation; the δ18O-NO3 for most samples was close to the theriacal nitrification values calculated by Eq. (6).

With the identification of nitrification as the main nitrogen transformation process in the study area, the δ18O-NO3 of SF and SN were calculated by Eq. (6) (Table S4). Due to the limited number of nitrate isotopes sample (n = 12), the range values of δ15N-NO3 and δ18O-NO3 could not derive from isotopic composition of samples in our study directly. In order to get reliable results of contribution from different nitrate sources, the most convenient and realistic way under our circumstance was to find these values in relevant literatures (Jin et al. 2018; Zhang et al. 2018b; Li et al. 2019). The reason why these literatures were selected was that their study areas had a similar fertilizer application rates, and so did the climate characteristics. The isotopes range for four potential nitrate sources is listed in Table 4.

Table 4 Summary of the δ15N-NO3 and δ18O-NO3 of different nitrate sources in the study area

The proportional contribution of four potential nitrate sources is illustrated in Fig. 10. The mean probability estimate (MPE) of different sources was ranked as: SAM > SN > SF >AD (Table S6). SAM was the main nitrate source for Nanyang Basin (MPE 45.41 ± 9.97%); SN (MPE 35.81 ± 15.47%) was the second nitrate source in the study area. The contributions of SF (MPE 17.87 ± 11.95%) were significantly lower than those of SAM and SN. As discussed in “Nitrate dynamics and source identification using isotope data,” nitrification and ammonium volatilization were the main nitrogen transformation process in the study area; the contribution of SN was in accordance with the high rate of nitrification in the study area. The high contribution of SN was also rationalized by the distribution of samples in Fig. 8a; for samples with low NO3/Cl and Cl, concentration was influenced by soil nitrogen.

Fig. 10
figure 10

Contributions of the four potential NO3- sources in the Nanyang Basin. Boxplots illustrate the 5, 25, 50, 75, and 95% credibility intervals of different sources for each sample

The mixing effects between different sources and mineralization-immobilization turnover (MIT) partially masked the real sources and initial isotope compositions. Hence, the high uncertainties in the apportionment gained by SIAR were inevitable and resulted from the broadly distributed proportion and the large standard deviation of potential sources. Despite the high uncertainties, SAM was the principal nitrate source for study area, with the lowest standard deviation among four potential sources. Therefore, reducing the output of SAM and the improvement of infrastructure in rural area are the key factors of nitrate contamination control in Nanyang Basin.

The high uncertainties were also a result of the collection of the isotope composition of main sources from the literature, which does not reflect the features in the study area; in addition, the sample numbers were too low. These limitations can be attributed to limited funds, although the results are still in agreement with those obtained via hydrochemical and stable isotope analyses.

Uncertainty analysis

With the combined use of hydrochemical data, nitrate isotopes, and Bayesian mixing model, we thought our conclusion is reliable. Due to the following reasons, the uncertainties contained in our study were inevitable. The main reasons were as follows: (1) SIAR contains certain inherent uncertainties, and small variations of isotopic ranges for each potential nitrate sources may result in significant changes in sources apportionment; (2) the relatively limited number of nitrate isotopes measurements; (3) nitrogen cycle is a sophisticated biogeochemical process, and the study of nitrate pollution caused by the destruction of the nitrogen cycle balance requires a combination of multiple tracers.

Based on the above main reasons, the following points should be paid attention to in the future nitrate contamination studies: (1) accurately determining the isotopic ranges for potential nitrate sources. In a specific study area, the soil in uncultivated land, manure of breeding farms, wastewater for local sewage outfalls, and fertilizer from local market should be collected deliberately in order to reduce the uncertainties in the contribution calculated by SIAR; (2) if research funding is sufficient, samples considering different media (i.e., shallow aquifer, deep aquifer, and surface water), different land use types, and different time periods should be collected to determine the spatial and temporal variation of ions concentration and isotopes compositions; (3) due to nitrogen dynamic process is controlled by many environmental factors and multi-tracers should be combined use to study this issue. Such as groundwater travel time estimated by 3H and chlorofluorocarbons (CFCs). Besides, measurements of δ15N and δ18O of NO2 and δ15N-NH4+ could further help identify the nitrogen isotope fractionation process. Measurements of δ11B, δ87Sr, and coprostanol could help solve the problem of overlapping ranges of nitrate isotopes. (4) Analyzing the diversity and community composition of microorganisms and measure specific functional gene abundance can also help to determine the nitrogen transformation process, for this process is driven by microorganisms.

Conclusions

The shallow aquifer at the origin of the South to North Water Diversion Project in central China is subjected to considerable nitrate contamination. The average of nitrate concentration in this area was 102.95 ± 159.89 mg L−1, indicating the strong mixing of potential sources to nitrate in our study area. With the aim to better identify sources and transformation processes of nitrate in groundwater, land use analysis, hydrochemistry, DIM (δ15N-NO3 and δ18O-NO3), and the Bayesian mixing model were used in combination. According to the molar ratio analysis and the significant correlation between the concentrations of nitrate and chloride, natural hydrochemical evolution was disturbed by anthropogenic activities.

Nitrification was the main process in subsoil and shallow aquifers, and the broadly distributed δ15N-NO3 indicated that ammonium volatilization of SAM and the mixing effects between SAM and SF caused the enrichment of δ15N-NO3. Denitrification was restricted by the oxidation environment in shallow aquifers, except for samples from some residential areas.

The Bayesian model (stable isotope analysis in R, SIAR) was used to apportion the contribution of potential sources, and sewage and manure were the dominant sources of nitrate, followed by soil nitrogen. The results of SIAR reflected the limitation of applying DIM in nitrate source determination, as the ranges of δ15N-NO3 partially overlapped for SF and SN. As the result of recurrent mineralization-immobilization turnover during the leaching of irrigation and precipitation, the nitrate from SF may have contained δ15N-NO3 from SN.

We could prove that despite the use of synthetic fertilizers, infrastructure and sewage disposal were the main factors contributing to nitrate contamination in the Nanyang Basin. Improved management strategies for untreated sewage are therefore critical to ensure the long-term water supply of the South to North Water Diversion Project.