Introduction

Shimabara City, Nagasaki, Japan, has experienced serious nitrate pollution of groundwater. Groundwater is commonly used for public water supply in the area. For this reason, Nakagawa et al. (2016) investigated water chemistry and nitrate pollution in the groundwater. They found that unaffected groundwater could be classified as Ca–HCO3, while nitrate-polluted groundwater was classified as Ca–(SO4 + NO3). Nitrate concentration was found to exceed Japanese drinking standard (10 mg L− 1) at 15 of 40 locations. The results indicated that NO3 pollution in the area is related to livestock waste, chemical fertilizer, and calcareous material to neutralize acid soil. Principal component analysis showed that the water chemistry is characterized by ion dissolution during groundwater flow and nitrate pollution sources. Other nearby areas, such as the Nishi River, has as well been confirmed to be polluted by nitrate. A typical denominator for the polluted areas is that they are drained by small rivers flowing from the mountainside to the seashore. For a better understanding of the nature of the nitrate pollutant source, geochemistry, and surface and groundwater interactions need to be better understood (Vrzel et al. 2018).

Surface water chemistry is affected by various biogeochemical processes (especially atmospheric precipitation, chemical weathering, and evapo-crystallization) and anthropogenic factors (Pant et al. 2018). Many researchers have investigated river water chemistry and controlling factors by different approaches, such as water quality index (WQI) (Kannel et al. 2007; Şener et al. 2017), principal component analysis (PCA) (Ouyang 2005; Le et al. 2017), factor analysis (Mir et al. 2016), geographic information system (GIS) (Şener et al. 2017), and temporal sampling (Sun et al. 2010; Mir et al. 2016). In recent years, pollution and degradation of surface water have been shown to cause environmental and potential social problems and risks to public health (Le et al. 2017). Human factors such as agriculture (fertilizers and agrochemical methods, irrigation, and livestock operation), industry, and wastewater have caused pollution by bacteria and virus, fecal sterols, and chemicals like inorganic ions, and trace and toxic metals for rivers in many countries (Skórczewski and Mudryk 2009; Bulut et al. 2010; Obiri-Danso et al. 2011; Chigor et al. 2012; Furtula et al. 2012; Li et al. 2014; Olkowska et al. 2014; Wilbers et al. 2014). These pollutants increasingly result in water chemical levels exceeding World Health Organization (WHO) guidelines.

In this study, we present a methodology that can be used to investigate nitrate pollution sources for the groundwater. The methodology builds on geochemical sampling together with advanced statistical treatment. The results can be used to estimate the relative contribution of different pollution sources to groundwater, impact on water quality, and management to improve the polluted sites. A first step for assessing influence from surface water to groundwater is the spatial distribution of water chemistry, nitrate pollution, and surface water forming factors using graphical methods (Stiff and Piper diagrams). Second, multivariate analyses (principal component analysis and hierarchical cluster analysis) are used in combination with the sampled data.

Materials and methods

Study area

Figure 1 shows the location of Shimabara City and sampling sites. Shimabara City is one of the districts forming Shimabara Peninsula. The city covers 82.8 km2, which constitute about 18% of the peninsula. The geology of Shimabara is formed by volcanic activities, resulting in volcanic rocks over-laying marine and non-marine deposits (Fig. 2; Geological Survey of Japan 2017). The city area is constituted by three main types of land use; forest, upland fields, and urban areas (Fig. 2). The upland fields are important for supplying vegetables (mainly Chinese radish and cabbage and carrot). On the mountain side of the upland fields, livestock facilities are located. Livestock numbers correspond to about 1000 milk cattle; 22,900 pigs; and 1,028,200 hens (2015). Most of the livestock production is located in the upstream areas of the Nishi and Yue Rivers. Average annual temperature was 17.1 °C in 2017. The average monthly temperature ranges from 7.0 to 18.9 °C. Annual rainfall was 1989 mm in 2017.

Fig. 1
figure 1

Location map of study area and sampling sites

Fig. 2
figure 2

Geology and land-use map

Water sampling and chemical analyses

The suggested methodology to investigate groundwater pollution sources in Shimabara involved water sampling with chemical analyses and advanced statistical analyses according to the below. Water samples were collected at 42 sampling sites from 15 rivers in Shimabara including a region of Unzen City on January 17 and 24, as well as on February 6 in 2017. In Yue and Nishi River, more sampling points were chosen to understand the transition of water chemistry from upstream to downstream. Water samples were collected directly from the center sections of the rivers in pre-washed bottles except for sampling sites 6 and 27. Physicochemical parameters such as pH, electrical conductivity (EC), oxidation redox potential (ORP), dissolved oxygen (DO), temperature, and HCO3 were determined in-situ. Portable meters (HORIBA D-51, and 54) were used for pH, EC, and ORP. DO and temperature were measured using a luminescence-based sensor (HACH HQ30d). HCO3 was quantified using titration method with 0.1 N HCl. Major dissolved anions (Cl, NO2, NO3, and SO42−) and cations (Na+, NH4+, K+, Mg2+, and Ca2+) were analyzed with ion chromatography (Metrohm 861 Advanced Compact IC).

Multivariate analyses

Multivariate analyses based on water chemistry data can be used to improve the understanding of relationships, source of ions, and factors regulating water chemistry. In this study, correlation analysis, principal component analysis (PCA), and hierarchal cluster analysis (HCA) were performed using all ion concentrations except for site 15, where river water is mixed with sea water. Multivariate analyses were performed using the statistical software JMP Pro 13 (SAS Institute Inc.). NO2 and NH4+ were not detected in a number of samples. The LOD (limit of detection 0.135 and 0.21 mg L− 1, respectively) values were used in the multivariate analysis for these samples.

Results and discussion

General surface water chemistry

Measured hydrochemical parameters for each site are presented in Table 1. Temperature of the surface water ranged from 4.9 to 17.2 °C. It reflects the atmospheric temperature during the sampling campaign (1.2–10.9 °C). The pH showed a small range with weak acidic conditions. The ORP was positive, ranging from 147 to 280 mV except for sites 19 and 27. Significantly small values (26.5 and 28.1 mV, respectively) were observed at these sites. The DO ranged from 7.8 to 11.6 mg L− 1 except for sites 30, 31, and 32 with notably small values from 0.70 to 2.48. These sites are located in the upstream areas of Nishi River, where ammonium (NH4+), nitrite (NO2), and nitrate (NO3) were detected. Due to small DO, this may result in small nitrification.

Table 1 Summary of measured hydrochemical parameters

Water chemistry patterns were assessed using Stiff and Piper diagrams based on major ion concentration data. Figure 3 shows the Piper diagram for the 42 surface water samples. Most of the samples are located in Class I or Class III. Class I is dominated by calcium and bicarbonate, corresponding to the common major element composition of the surface water. The water samples from the Unzen River (site 17–19), the southern parts of Shimabara (site 1–8, 28, and 29), the midstream of Nishi River (site 27 and 30), and the upstream of Yue River (site 33), are included in this class. Water samples dominated by calcium, nitrate, and sulfate are located in Class III. In case of groundwater, nitrate-polluted samples are found in Class III (Nakagawa et al. 2016). Thus, this class indicates nitrate-polluted surface water. Most of the samples from the northern parts of Shimabara (site 9–14, 16, 21–26, and 34–42) fall within this class. As shown in Table 1, all these sampling sited displayed high nitrate concentrations (> 37.5 mg L− 1). Two water samples are located in Class IV, commonly indicating sea water mixing with respect to sodium and chloride. One of the samples (site 15) is from the far downstream of Yue River close to the coastline. According to Chester and Jickells (2012), sodium and chloride are principal dissolved components of sea water. The other sample was collected from the upstream of Nishi River (site 31). This site is located on the mountainside (Fig. 1), not being exposed to sea water. Anthropogenic pollutants are probably important sources for Na+, K+, and NO3 at this site. Only one sample (site 32) was located in Class II related to sodium, potassium, and bicarbonate. As mentioned above, NH4+ and NO2 were found at site 31 and 32. These pollutants likely come from livestock waste from farms located close to the river.

Fig. 3
figure 3

Piper diagram for surface water samples

Figure 4 shows the spatial distribution of water chemistry using Stiff diagrams for the 42 sampling sites. Most of the samples display Ca–HCO3 or Ca–(SO4 + NO3) water types. Several samples are characterized as Na–Cl, (Na + K)–HCO3, (Na + K)–(SO4 + NO3), and Ca–Cl. This indicates mixing with seawater together with anthropogenic pollution. The spatial characteristics of surface water represented by the Stiff diagrams are similar to those of the groundwater (Nakagawa et al. 2016), indicating surface water–groundwater mixing. With river flow from upstream to downstream in the Yue River, water types transit from Ca–HCO3 to Ca–(SO4 + NO3), and further to Na–Cl due to sea water mixing, and general ion concentrations in the mainstream tend to increase at the lower site 22 and 23. Tributaries, with high ion concentrations, join the main river upstream these sites, resulting in significant increase in ion concentration. (Na + K)–HCO3 type water is transformed to Ca–HCO3 and eventually to Ca–(SO4 + NO3) along the river flow in Nishi River. The chemical composition drastically changed and decreased between site 27 and site 10 because of significant contribution of groundwater inflow. The water chemistry in the lower reaches of the river at site 9 and 10 is essentially the same as for groundwater collected close to these sites. Although Ca-HCO3 types were confirmed at the upstream of the Nishi River, the concentration of ions is higher than in the groundwater in this region.

Fig. 4
figure 4

Spatial distribution of water chemistry with Stiff diagram

For disinfection purposes, livestock farms use slaked lime (calcium hydroxide). After use, the lime is usually washed out to the rivers by rain. This means that the Ca2+ concentration will increase in the downstream. NO3 originating from livestock waste will likely be exposed to denitrification. This means that NO3 will decrease and HCO3 increase according to:

$${\text{5C }}+{\text{ 4N}}{{\text{O}}_{\text{3}}}^{ - }+{\text{ }}{{\text{H}}_{\text{2}}}{\text{O}} \to {\text{2}}{{\text{N}}_{\text{2}}}+{\text{ 4HC}}{{\text{O}}_{\text{3}}}^{ - }+{\text{ C}}{{\text{O}}_{\text{2}}}.$$

According to this relationship, livestock waste together with slaked lime will lead to high concentration of Ca–HCO3 as can be noticed at sites 27 and 30.

Nitrate pollution

Spatial distribution of nitrate (NO3 + NO2–N) concentrations is shown in Fig. 5. NO2–N was detected at two sampling sites 30 and 32 upstream of Nishi River. The concentration exceeded the drinking standard of 0.9 mg L− 1 for bottle-fed infants (WHO 2011). The nitrate concentration ranged from 1.0 to 27.5 mg L− 1 with an average of 9.7 mg L− 1. The highest concentration (27.5 mg L− 1) was found at site 31. The nitrate concentrations exceeded the Japanese drinking water standards of 10 mg L− 1 at 19 sampling sites. Relatively high concentrations of nitrate were observed in the rivers coming from the northern parts of the study area containing the upland fields.

Fig. 5
figure 5

Spatial distribution of nitrate (NO3 + NO2–N) concentration

Multivariate analysis

Correlation analysis

Correlation analysis was performed for all measured ions concentrations (Cl, NO2, NO3, SO42−, HCO3, Na+, NH4+, K+, Mg2+, and Ca2+) except for site 15, because ion concentrations here were much larger due to sea water mixing. Table 2 shows the results of the correlation analysis using 41 sampling sites. Unzen volcanic rocks distributed in Shimabara are composed of hornblende andesite to dacite, which has a phenocryst of plagioclase, hornblende, and biotite (Sugimoto 2006). The correlation in Table 2 follows naturally occurring geochemical processes: (1) the strong positive correlation between Na+–HCO3 and K+–HCO3 (r = 0.74 and 0.76, respectively) implies weathering of plagioclase (albite and K-feldspar). The chemical reaction can be expressed as (Sun et al. 2017):

Table 2 Correlation matrix between dissolved ions
$${\text{2NaAlS}}{{\text{i}}_{\text{3}}}{{\text{O}}_{\text{8}}}+{\text{ 2C}}{{\text{O}}_{\text{2}}}+{\text{ 11}}{{\text{H}}_{\text{2}}}{\text{O}} \to {\text{A}}{{\text{l}}_{\text{2}}}{\text{S}}{{\text{i}}_{\text{2}}}{{\text{O}}_{\text{5}}}{\left( {{\text{OH}}} \right)_{\text{4}}}+{\text{ 2HC}}{{\text{O}}_{\text{3}}}^{ - }+{\text{ 2N}}{{\text{a}}^+}+{\text{ }}{{\text{H}}_{\text{4}}}{\text{Si}}{{\text{O}}_{\text{4}}},$$
$${\text{2KAlS}}{{\text{i}}_{\text{3}}}{{\text{O}}_{\text{8}}}+{\text{ 2C}}{{\text{O}}_{\text{2}}}+{\text{ 6 }}{{\text{H}}_{\text{2}}}{\text{O}} \to {\text{A}}{{\text{l}}_{\text{2}}}{\text{S}}{{\text{i}}_{\text{4}}}{{\text{O}}_{{\text{1}}0}}{\left( {{\text{OH}}} \right)_{\text{2}}}+{\text{ 2HC}}{{\text{O}}_{\text{3}}}^{ - }+{\text{ 2}}{{\text{K}}^+}+{\text{ 2}}{{\text{H}}_{\text{4}}}{\text{Si}}{{\text{O}}_{\text{4}}}.$$

(2) Ca2+ is moderately correlated with HCO3 (r = 0.40), indicating weathering of plagioclase (anorthite) according to:

$${\text{CaA}}{{\text{l}}_{\text{2}}}{\text{S}}{{\text{i}}_{\text{2}}}{{\text{O}}_{\text{8}}}+{\text{ 2C}}{{\text{O}}_{\text{2}}}+{\text{ 3}}{{\text{H}}_{\text{2}}}{\text{O}} \to {\text{A}}{{\text{l}}_{\text{2}}}{\text{S}}{{\text{i}}_{\text{2}}}{{\text{O}}_{\text{5}}}{\left( {{\text{OH}}} \right)_{\text{4}}}+{\text{ 2HC}}{{\text{O}}_{\text{3}}}^{ - }+{\text{ C}}{{\text{a}}^{{\text{2}}+}}.$$

(3) The strong positive correlation between Na+, Mg2+, and Ca2+ (r = 0.59–0.80) implies weathering of hornblende that is expressed in the stoichiometric dissolution (Velbel 1989):

$${\text{N}}{{\text{a}}_{0.{\text{5}}}}{\text{C}}{{\text{a}}_{\text{2}}}\left( {{\text{F}}{{\text{e}}_{{\text{1}}.{\text{3}}}}{\text{M}}{{\text{g}}_{{\text{2}}.{\text{6}}}}{\text{A}}{{\text{l}}_{{\text{1}}.{\text{1}}}}} \right)\left( {{\text{A}}{{\text{l}}_{{\text{1}}.{\text{6}}}}{\text{S}}{{\text{i}}_{{\text{6}}.{\text{4}}}}} \right){{\text{O}}_{{\text{22}}}}{\left( {{\text{OH}}} \right)_{\text{2}}}+{\text{ 15}}{{\text{H}}^+}+{\text{ }}{{\text{H}}_{\text{2}}}{\text{O}} \to 0.{\text{5N}}{{\text{a}}^+}+{\text{ 2C}}{{\text{a}}^{{\text{2}}+}}+{\text{ 2M}}{{\text{g}}^{{\text{2}}+}}+{\text{ 1}}.{\text{3F}}{{\text{e}}^{{\text{2}}+}}+{\text{ 2}}.{\text{7Al}}{\left( {{\text{OH}}} \right)_{\text{2}}}^{+}+{\text{ 6}}.{\text{4}}{{\text{H}}_{\text{4}}}{\text{Si}}{{\text{O}}_{\text{4}}}.$$

Some of the observed correlation can be interpreted as effects of anthropogenic influence: (1) positive correlation between SO42−, NO3, Mg2+, and Ca2+ indicates influence from a common origin that likely is chemical fertilizers (Babiker et al. 2004). The correlation among these ions was r = 0.37–0.80. Ammonium sulfate [(NH4)2SO4] is commonly applied together with slaked lime and magnesia lime [(Ca, Mg)CO3] as fertilizer in this area (Nakagawa et al. 2016). No correlation was found between NH4+ and SO42+, which is consequence of nitrification changing NH4+ to NO3. (2) Strong and moderate positive correlation between Na+, K+, Cl, and SO42− indicates their common origin, and can be explained in terms of influence of pig farm wastewater (Fridrich et al. 2014). Cl is a predominant anion in manure from livestock waste (Oyanagi et al. 2004). NH4+ is often positively correlated with K+ and Cl in groundwater under pig farming (Fridrich et al. 2014). Similar results were found in this study. There was a strong and positive correlation between Na+, K+, and Cl (r = 0.74–0.81). NH4+ was positively correlated with Cl and K+. There are pig farms along the upstream of Nishi River close to site 32. Although the nitrogen supply from households is only one-tenth of nitrogen supply from the farm animal sector, (Fujii et al. 2016) considering that the domestic wastewater treatment coverage is only 52.4% (2014), (Committee on Nitrate Reduction in Shimabara Peninsula 2016) domestic wastewater is a possible source for Cl.

In view of the above, Cl, NO3, SO42−, and NH4+ can be seen as originating from anthropogenic activities. Although, Na+, K+, Mg2+, and Ca2+ generally can be regarded as having natural origin in non-polluted water, these cations are likely to be enhanced by anthropogenic activities as observed polluted water. As shown in Fig. 4, polluted water samples tend to have higher concentrations of these ions as compared to non-polluted samples.

Principal component analysis (PCA)

The ten hydrochemical variables (Cl, NO2, NO3, SO42−, HCO3, Na+, NH4+, K+, Mg2+, and Ca2+) were used as input to the PCA analysis. The input data were standardized before analysis. PCA was applied using the correlation matrix between chemical elements. Figure 6 shows the relationship between obtained PCs and investigated ions. In accordance with the Kiser criteria, only principal components (PC) with eigenvalue greater than 1.0 were kept for further analyses. Thus, three PCs were extracted. The eigenvalues for these PCs ranged from 1.09 to 5.38, explaining 88.3% of the total variance.

Fig. 6
figure 6

Relationship between extracted three PCs and ions

PC1, accounting for 53.8% of the total variance, had positive loadings for all ions, which indicate that this component is related to the mean ion concentration. PC2 was characterized by positive loadings for SO42−, NO3, Mg2+, and Ca2+, and negative loadings for HCO3, NH4+, NO2, and K+. It explained 23.6% of the total variance. The positive relationship among SO42−, NO3, Mg2+, and Ca2+ indicates a similar origin (chemical fertilizer) as mentioned above. The relationship between NO3, NH4+, and NO2 represents the nitrification process from NH4+ (NH4+ → NO2 → NO3) originating from the chemical fertilizer (NH4)2SO4. Negative HCO3 can be explained by the fact that nitrification decreases the HCO3 concentration. In contrast to the use of chemical fertilizers, the association between NH4+, NO2, and K+ implies the influence of livestock effluents. Thus, PC2 helps to identify the pollution source. PC3 is, however, also interpreted as a result from livestock effluents due to joint positive loading of NO3, NH4+, and K+. It accounts for 10.9% of the total variance.

Hierarchical cluster analysis (HCA)

HCA was performed based on the three PC scores outlined above (see also Amano et al. 2016). The classification result of each sample is represented in the dendrogram shown in Fig. 7. In total, 41 water samples were classified into four distinct groups. Samples of Groups 3 and 4 were combined with the other groups in the final linkage step, indicating that the water chemistry of these samples is least similar to that of other groups. A significant dissimilarity between Group 1 and 2 is expected, because the linkage distance between these is at a maximum. Group 3, as well, is dissimilar to Group 4 due to large linkage distance. Table 3 shows the average ion concentration of each group as representing the water characteristics. Group 1 is related to HCO3 for anions and Ca2+ for cations, which is a normal major element composition for surface waters. Group 2 is recognized as a nitrate-polluted group due to high nitrate concentrations. NO2 and NH4+ are classified in Group 3. The NO2 concentration is above the drinking water standard of 0.9 mg L− 1 as regards NO2-N. If all ions are converted to NO3, a total of 82.7 mg L− 1 of NO3 would result. Thus, although this group still satisfy the drinking water standard for NO3 (WHO 2011), it could potentially exceed the drinking water standard. Group 4, in which only one water sample was classified, generally showed large ion concentrations. In particular, Cl, NO3, K+, and NH4+ concentrations were larger than for any other group, indicating pollution by livestock wastewater.

Fig. 7
figure 7

Dendrogram for grouped surface water samples with site identification

Table 3 Averaged dissolved ion concentrations for each groups

Figure 8 shows the relationship between the three PCs for the 41 water samples classified into four groups. PC1, namely, mean ion concentration, apparently separates Group 3 and 4 from Group 1 and 2. Smaller PC1 scores explain that Group 1 has less ion concentration compared to Group 3 and 4. It can, thus, be confirmed that Group 1 has low ion concentrations and represents a non-polluted group. Group 2 displays higher PC1 scores as compared to Group 1. In addition, Group 2 is easily distinguishable due to positive PC2 scores as compared to the other groups. Positive PC2 is interpreted as influenced by chemical fertilizer, indicating that it has higher ion concentration than Group 1. Although Group 3 and 4 show similar PC scores for PC1 and 2, they are distinguished by PC3 (Fig. 8). Both groups have negative PC2 scores, which show the effect of livestock effluents. Positive PC1 and negative PC2 scores explain that Group 3 and 4 are polluted by livestock wastes. Positive PC3 scores reveal that Group 4 is significantly more influenced by livestock effluents as compared to Group 3 as shown in Table 3.

Fig. 8
figure 8

Scatter diagram showing relationship between three PCs

As shown in Fig. 9, distinct spatial distributions for each group can be observed. Samples in Group 1 were generally observed in the southern part in urban areas. A part of these are found in the upstream of the Yue River (33, 34, and 36) and Rivers of Unzen City (17, 18, 19, and 20). These sampling locations are situated in the forest and urban areas (Fig. 2). Sample 33 has small ion concentration, because the site is located in the most upstream part of the river and influence of agricultural activities is small. As well, samples 34 and 36 had less ion concentration compared to the downstream and tributaries. Although nitrate concentration was below Japanese drinking water standard, total concentration may be elevated due to agricultural activities. It is noted that Group 2, which is distinguishable with positive PC2 scores (chemical fertilizer), is distributed in the upland fields. There results concur with the previous studies (Nakagawa et al. 2016). A few samples that were classified into Group 3 and 4 are located only in the upstream of the Yue River. Consequently, samples collected in the northern and southern rivers flowing a short distance and urban areas close to the mountain did not present nitrate pollution. On the other hand, Yue and Nishi River, passing through the dense upland area with livestock farming, were seriously polluted by nitrate except for the three specific samples from the upstream of Yue River.

Fig. 9
figure 9

Distribution of each group in the study area

Conclusions

This study presented a methodology to investigate hydrogeochemical characteristics and nitrate pollution of surface water and influencing factors. Besides water sampling and chemical analyses, the methodology involved the use of Piper and Stiff diagrams and multivariate statistical analyses. The study revealed mineral dissolution as well as effects of anthropogenic activities on surface water chemistry. The surface water was, in general, weakly acidic and oxidizing conditions were shown by pH, DO, and ORP. Piper and Stiff diagrams indicated that the dominant water types are Ca–HCO3 and Ca–(SO4 + NO3), which is consistent with groundwater chemistry in the area. Local sea water and anthropogenic activities resulted in some minor water types such as Na–Cl, (Na + K)–HCO3, (Na + K)–(SO4 + NO3), and Ca–Cl. Correlation analyses indicated that weathering of gypsum, anhydrite, and silicate minerals is the dominant sources of ions. Chemical fertilizers and livestock waste influence surface water chemistry and cause severe nitrate pollution especially in the northern parts of the city. The nitrate contaminated samples above Japanese standard drinking water levels occupied 45% of all samples. PCA extracted three significant PCs explaining 88.3% of total variance. This reflects mean ion concentrations, and effects of chemical fertilizers and livestock effluents. Based on the PCA results, four distinct water groups were obtained by HCA. The chemical composition of these groups was different in terms of land use, and importance of the three PC scores. In general, the spatial distribution of surface water chemistry was similar to that of the groundwater, implying strong interaction between surface and groundwater. Although Shimabara City only uses groundwater for public water use, quantifying the influence of surface water on groundwater is required for resolving the nitrate pollution problem.