Groundwater resources have become an important resource for human survival, industrial development, and ecosystems (IAH 2015), especially in arid and semiarid regions. The high evaporation rate and limited precipitation in arid and semiarid regions make groundwater a valuable commodity in these regions (Li et al. 2015a). However, due to industrial and agricultural activities and unreasonable utilization of groundwater, some critical water sources have been seriously polluted with contaminants, such as fluoride, nitrate, arsenic, heavy metals, and synthetic organic compounds contaminants (Li et al. 2019a; Postigo and Barcelo 2015; Rodriguez-Lado et al. 2013; Soujanya et al. 2020). The inorganic cations and anions in groundwater could come from both natural and anthropogenic sources. Fendorf et al. (2010) showed that human activities can affect the release of arsenic to groundwater, influencing the natural distribution of arsenic. Currently, groundwater quality deterioration has become a global problem impeding sustainable development (Li et al. 2017) and affecting human health (He and Wu 2019; He et al. 2019; Zhou et al. 2020a).

The Guanzhong Basin is located in the arid and semiarid area of Shaanxi Province, China. Limited surface water in this area makes groundwater the main source of drinking water in rural and isolated urban areas. With the implementation of the West Development Strategy of China and the Belt and Road Initiative, the economy of Guanzhong Basin has experienced rapid development. However, economic development has brought several challenges to groundwater resources, such as pollution, land desertification, and groundwater overexploitation (Han 2003; Wu et al. 2017). At present, the Guanzhong Basin plays a significant role in the Belt and Road Initiative, and the problem of water will become more severe as its implementation proceeds. Howard and Howard (2016) emphasized that the economic success of the Belt and Road Initiative in Central Asian countries hinges on a sound and sustainable management of water resources. A considerable amount of literature on groundwater pollution in the Guanzhong Basin has been published. Arsenic, nitrate, and fluoride are among the contaminants found in the groundwater in various areas across the Guanzhong Basin, all of which have an adverse influence on human health (Li et al. 2014; Luo et al. 2014; Zhang et al. 2018). With the increased economic development, the safety of the water supply for human consumption is currently facing a great challenge in the Guanzhong Basin.

Because groundwater plays a crucial role in affecting economic development and anthropic activities in the Guanzhong Basin, rational development and utilization of groundwater are particularly important. Groundwater needs for human consumption, food production, energy, and environment have increased, which drives the development of water management globally (Gorelick and Zheng 2015). Several researchers have provided insights into water management. For example, an advanced agricultural irrigation method provided by Hu et al. (2010) can help to increase groundwater recovery and conserve agricultural water usage. Li et al. (2018a) proposed a rational and balanced groundwater and surface water utilization plan in the Yinchuan Plain, which provides new idea for water management in arid and semiarid areas. These studies describe novel ideas to decrease unreasonable and excessive water resource utilization and relieve the pressure of water management. However, efficient and effective groundwater management requires concerted efforts by the local authorities, scientists, and the public (Chen et al. 2018).

Groundwater contaminants affecting human health include fluoride, nitrate, organic matter, and microbial pathogens (Schwarzenbach et al. 2010; Ward et al. 2018). High fluoride levels in drinking water can cause dental and skeletal fluorosis (Kabir et al. 2019). Skin, lung, and bladder cancers, as well as hepatotoxicity and intellectual function impairment, are attributed to arsenic (Sun et al. 2011). Considering the consequences of water-quality deterioration, researchers are creating a database of known groundwater quality issues. Li et al. (2013a, b) conducted a comprehensive sensitivity analysis of the Technique for Order of Preference by Similarity to Ideal Solution (TOPSIS) method on water quality assessment, showing that the TOPSIS method is a feasible and reliable method for water-quality assessment. Adimalla et al. (2018) performed an assessment of groundwater quality focused on fluoride enrichment in a local area in India. Li et al. (2018b) performed groundwater quality assessment in the Loess Plateau, which determined the groundwater hydrochemical type and provided measures for sustainable groundwater quality management. Panneerselvam et al. (2020) applied GIS technology to evaluate the groundwater quality for domestic and irrigation purposes in hard rock region of India. Zhou et al. (2020b, c) assessed the groundwater quality for potable and irrigation usages and associated human health risks of two cities in North China Plain, providing scientific basis for sustainable groundwater utilization and protection. These studies promoted the development of groundwater quality assessments and provided a foundation for household, agricultural, and industrial groundwater utilization.

To promote sustainable use of groundwater, groundwater quality and the hydrogeochemical processes controlling the groundwater quality should be studied to obtain a full understanding of fate of groundwater contaminants. Various groundwater pollution studies have been performed in Guanzhong Basin (Zhang et al. 2019; Luo et al. 2014). Most of these studies attribute the pollution to anthropogenic activities and local geological background. In addition, previous studies (Bu et al. 2010; Liu et al. 2003; Wu et al. 2020) have used multivariate statistical techniques, such as principal component analysis (PCA) or factor analysis (FA), to trace the affecting factor of groundwater pollution. In this study, PCA, FA, and hierarchical cluster analysis (HCA) were performed on groundwater geochemical measurements from the Guanzhong Basin and determine the hydrogeochemical processes affecting the major ion groundwater chemistry. This work aims to facilitate groundwater quality protection and management in the Guanzhong Basin area.

Study Area

The Guanzhong Basin is located between east longitude 107° 30′–110° 30′ and north latitude 34° 00′–35° 40′. It sits in an arid and semiarid climate zone with a cold and dry winter and a sweltering summer. The average annual rainfall is approximately 580 mm, with 60% concentrated in July through September. The unconsolidated deposits of the Guanhzong Basin consist of alluvial sediments and loess accumulation. With flat terrain, fertile soil, abundant water resource, and good irrigation condition, the Guanzhong Basin is one of the most productive areas in Shaanxi Province, China. Weihe River, the largest tributary of the Yellow River, runs through the Guanzhong Basin. However, this river is seriously pollution by industrial wastewater and domestic effluents, posing great risk on the groundwater quality in this area. The shallow groundwater level depth was approximately 2–20 m below the ground surface before the 1990s in these second terraces, and now it is 10–40 m in water level depth (Luo et al. 2014). The swift decline of groundwater level should be attributed to the heavy groundwater abstraction in this basin for domestic and industrial purposes, resulting in ground surface subsidence, and ground fissures.

The study area is located in the central part of the Guanzhong Basin, and it is the most important area for human population growth and economic development in the basin. In the study area, land use types can be classified into grassland, forest, surface water, agricultural land, urban land, unused land, and other construction land (covering 2.89%, 2.78%, 2.85%, 65.62%, 23.16%, 0.02%, and 2.68%, respectively, of the total study area) (Fig. 1).

Fig. 1
figure 1

Location of the study area, land use/land cover, and groundwater sampling sites

Geologically, the study area is covered mainly by Quaternary sediments, which can be subdivided into four sublayers. From top down, they are the Holocene alluvial layer (Q alh ), the Upper Pleistocene alluvial layer (Q alP3 ), the Middle Pleistocene layer (Q alP2 ), and the Lower Pleistocene lacustrine layer (Q lP1 ). The Holocene alluvial layer (Q alh ) is composed of course sand, medium-coarse sand, gravel sand, and silty clay and is mainly distributed in the Weihe River bed, floodplain, and first terrace areas. The Upper Pleistocene alluvial layer (Q alP3 ) is distributed beneath the floodplain and the first terrace. The upper part of this layer is composed of gray-yellow silt and silty clay with a thickness of 15–25 m. The lower part is comprised by gray–white and gray–yellow medium-fine sand imbedded with silt and silty clay, ranging within 65–85 m in thickness. The Middle Pleistocene layer (Q alP2 ) is mainly distributed in the floodplain and first terrace. The lithology of this layer is mainly medium-coarse sand and medium-fine sand with a thin layer of silty clay imbedded. The Lower Pleistocene lacustrine layer (Q lP1 ) is composed of brown-yellow and gray-green silt, silty clay, and clay imbedded with 5–10 sublayers of unstable gray–yellow and rusty yellow fine sand, medium-coarse sand.

In the study area, the confined aquifer and the phreatic aquifer are the main water supply aquifers (Zhou et al. 2018). The phreatic aquifer is composed of Middle Pleistocene to Holocene alluvial sand, gravel, and is separated from the confined aquifer by several layers of silty clay. The confined aquifer consists of Quaternary alluvial and lacustrine sand and gravel. The thickness of the phreatic aquifer is in general in the range of 20–90 m, and that of the confined aquifer is usually 60–180 m (Zhou et al. 2018). Groundwater levels continue to fall due to the exploitation of the phreatic aquifer and confined aquifer. Compared with the groundwater level before 2005, the groundwater levels of the confined aquifer have declined on average 20–40 m in the recent 10 years, with a maximum drop of more than 120 m (Tao et al. 2013). Therefore, a large cone of depression in the groundwater has been formed. In addition, due to the influence of climate change and human activities on groundwater, groundwater quality is degrading, and ecological environment problems are constraining the socioeconomic development in the Guanzhong Basin (Wang et al. 2018). Atmospheric precipitation, river leakage, and irrigation infiltration are the main sources of groundwater recharge. In addition, lateral flow from the mountainous area also is a recharge source for groundwater. Regarding groundwater discharge, artificial abstraction, shallow groundwater evaporation, and lateral outflow are the main patterns of groundwater discharge.

Methods

Sample Collection and Analysis

Forty-four groundwater samples, including 22 samples (c1–c22) from the confined aquifer and 22 samples (q1–q22) from the phreatic aquifer, were collected in the study area of the Guanzhong Basin using polyethylene bottles (Fig. 1). Phreatic groundwater samples were collected from shallow wells with well depth ranging from 28 to 90 m. Confined groundwater samples were collected from deep wells with well depth ranging from 120 to 300 m. All of the wells belong to the regular monitoring wells monitored by local governmental institute. All bottles were washed three to four times with the groundwater to be collected before sampling. Water temperature and pH were directly measured in situ using portable, multiparameter devices (Hanna HI9811-5). After collection, the samples were sealed and stored at 4 °C until analysis.

Water samples were transported to the laboratory for measurement of physicochemical parameters, including major ions (Na+, K+, Ca2+, Mg2+, HCO3, SO42−, Cl, and CO32−), total dissolved solids (TDS), total hardness (TH), nitrate (NO3), fluoride (F), and chemical oxygen demand (COD). Na+ and K+ were determined using flame atomic absorption spectrophotometry. Traditional titrimetric measured method used to determine HCO3 and CO32−. EDTA titrimetric methods were adopted for analyzing Ca2+, Mg2+, and TH. NO3 and SO42− were analyzed by using spectrophotometer (ICS-1500) and F was tested by the ion-selective electrode method (ICS-90A). A standard solution of AgNO3 was used to determine the concentration of Cl in titration method. TDS was measured using drying and weighting approach. Dichromate method was used to analyze COD.

Duplicates were introduced during the testing to ensure the analytical accuracy in the experiment. Each sample was checked for accuracy using the ion charge balance error percentage (%CBE), as shown in Eq. (1). The results show that all samples have a %CBE less than ± 5% except sample c22 whose %CBE is − 8.578%. Therefore, sample c22 was not used in the study.

$${\text{\% CBE = }}\frac{{{\text{TC}} - {\text{TA}}}}{\text{TC + TA}} \times 100$$
(1)

where, TC and TA indicate the total concentrations of cations and anions, respectively. Both cations and anions are expressed in milliequivalent per liter (meq/L).

Multivariate Statistical Techniques

Multivariate statistical techniques are widely used over the world in groundwater-quality studies to determine the sources and factors affecting the groundwater chemistry (Wu et al. 2014, 2020). Before using PCA, FA, and HCA, the original data were standardized by calculating the Z score for each measurement, which was performed using MATLAB 2016. The standardization procedure eliminated the influence of different units of measurement and created a dimensionless dataset. PCA and HCA were conducted using MATLAB 2016, and SPSS 24 was used to perform the correlation analysis and FA.

Correlation Analysis

The physicochemical indices of the water were correlated by calculating the Pearson correlation coefficient (r). The formula is expressed as:

$$r = \frac{{\sum\limits_{i}^{n} {\left( {x_{i} - \overline{x} } \right)\left( {y_{i} - \overline{y} } \right)} }}{{\sqrt {\sum\limits_{i}^{n} {\left( {x_{i} - \overline{x} } \right)^{2} } } \sqrt {\sum\limits_{i}^{n} {\left( {y_{i} - \overline{y} } \right)^{2} } } }}$$
(2)

where r is the correlation coefficient between two variables (x and y), \(\bar{x}\) and \(\bar{y}\) are the means of the variables. Before determining the correlation of x and y, the value of r should pass the significance test.

Principal Component Analysis

PCA is a mathematical method of dimensionality reduction, which combines multiple indicators into a few indicators. PCA transforms the original variables into some integrated variables, called principal components (PCs) (Bu et al. 2010), and the PCs with the eigenvalues higher than one can be extracted (Cloutier et al. 2008). The PCs can be expressed by linear combinations (Wu et al. 2014):

$$y_{i} = c_{i1} x_{1} + c_{i2} x_{2} + \cdots + c_{ip} x_{p}$$
(3)

where y is the PCs, c is the component loading, x is the origin variable, i is the component number and ranges from 1 to p, and p is the total number of variables.

Factor Analysis

FA is generally conducted after PCA, and PCs are subjected to varimax rotation, generating varifactors (VFs). The FA can be expressed as Eq. (4).

$$x_{i} = a_{i1} f_{1} + a_{i2} f_{2} + \cdots + a_{im} f_{m} + \varepsilon_{i}$$
(4)

where x is the original variables, a is the factor loading, f is the common factor, ε is the special factor, i is the variables number, and m is the total number of common factors. The common factors can be represented as linear combinations of the original variable, which is called factor score function, as shown in Eq. (5).

$$f_{j} = b_{j1} x_{1} + b_{j2} x_{2} + \cdots + b_{jp} x_{p}$$
(5)

where f is the common factor, b is the variable loading, x is the origin variables, j is the common factor number, and p is the origin variables number.

Hierarchical Cluster Analysis

HCA is a multivariate statistical technique that can classify samples (Q mode HCA) or variables (R mode HCA) based on their similarity (Vega et al. 1998). In this study, both Q mode and R mode HCA were used with the Ward linkage rule and Euclidean distance to classify different clusters. The results of HCA are shown as a dendrogram, which can clearly explain the classification of the data.

Results and Discussion

Hydrochemical Characteristics

Groundwater hydrochemical characteristics are very complex and depend on multiple factors, such as weathering of rocks, cation exchange, recharge water quality, river water leakage, evaporation, and anthropogenic factors (Li et al. 2018b). To interpret the hydrochemistry of the groundwater in the study area, the statistical properties of the groundwater hydrochemical data are listed in Table 1.

Table 1 Major physicochemical indices of the groundwater in the study area

The pH value of the groundwater ranges from 7.56 to 8.74, indicating alkaline groundwater in nature. The concentrations of TDS and TH range from 96.00 to 3400.00 mg/L and 73.84 to 1214.00 mg/L, respectively. Groundwater TH in the study area varied greatly (confined water 73.84–572.00 mg/L, phreatic water 95.13–1214.00 mg/L). In the hydrogeological environment, confined water and phreatic water have different impact factors, resulting in the differences in the groundwater hydrochemistry between the two sources. As shown in Table 1, the concentration of Na+ ranges from 2.09 to 339.00 mg/L with a mean of 117.74 mg/L in the confined water, whereas that in the phreatic water varies from 17.30 to 706.00 mg/L with a mean of 149.35 mg/L. The concentration of Mg2+ ranges from 2.02 to 82.32 mg/L in the confined water, but in the phreatic water it varies from 6.06 to 254.30 mg/L. The other ions (K+, Ca2+, F, Cl, SO42−, HCO3, and NO3) have similar trends that the concentrations in the confined water are lower than those in the phreatic water. The cause of this phenomenon is that the phreatic water is more susceptible to pollution compared with the confined water.

Particularly, the mean concentration of SO42− in the phreatic water is 479.05 mg/L in the agricultural land (q8, q9, q18, q19) and the mean concentration of NO3 is 57.21 mg/L, both of which are much higher than the average concentrations of SO42− and NO3 in phreatic water (191.87 mg/L and 23.33 mg/L, respectively). However, there is an opposite finding for the concentration of COD. The high concentration of SO42− and NO3 in the phreatic may be attributed to the utilization of pesticides and fertilizers in the agricultural land and high groundwater level contributing to high groundwater evaporation. Groundwater with high COD level is concentrate in the urban land and other construction land. It can be inferred that the domestic and industrial sewage lead to the high concentration of COD.

A Piper diagram (Piper 1944) was used to study the hydrochemical types of groundwater. As shown in Fig. 2, most of the confined water and phreatic water samples are classified as HCO3–Na or HCO3–Ca·Mg types, and several water samplings can be classified as SO4·Cl–Ca·Mg or SO4·Cl–Na types. The anions were plotted mainly in zones B (No dominant type) and E (Bicarbonate type). The cations, however, fall into the zones B (No dominant type) and D (Sodium type). The triangle in lower left corner suggest that alkali mental has the advantage over alkali earth mental. The high sodium concentration may come from cation exchange and water percolation which introduces sodium from vadose zone into groundwater. The high concentration of HCO 3 , as indicated by the lower right triangle, shows that carbonate dissolution is an important process regulating the major anions. The observation of Piper diagram shows that groundwater chemistry may be regulated by the water–rock interaction and rock weathering.

Fig. 2
figure 2

Piper diagram showing the hydrochemical characteristics of phreatic water and confined water

Gibbs diagrams are a widely used tool for understanding the natural formation mechanism of groundwater. There are three mechanisms according to the Gibbs diagram: evaporation dominance, rock dominance, and precipitation dominance. Figure 3 shows that most of the water samples are plotted in the middle section of the diagrams. This observation suggests that rock weathering is the most significant natural mechanism governing the groundwater evolution in the study area. The dissolution of minerals and salts, such as calcite, dolomite, and halite, enter the groundwater through water–rock interactions. Figure 3 also shows that the phreatic water has a greater tendency for evaporation compared to the confined water. This is indicative of phreatic groundwater in an arid and semiarid area being typically subjected to greater evaporation compared with deeper subsurface water sources. Therefore, it is evident that phreatic water geochemistry in the study area is conjunctively influenced by major rock weathering and slight evaporation. However, the Piper diagram and Gibbs diagrams are only capable of analyzing the natural process. The anthropogenic processes should be evaluated by other techniques for further study.

Fig. 3
figure 3

Gibbs diagrams of the confined water and phreatic water

Correlation Analysis

Forty-three samples with 14 physicochemical indices were used in the correlation analysis, resulting in a correlation coefficient matrix shown in Table 2. Cl significantly correlates to Na+ (r = 0.944, p < 0.01), which implies that the dissolution of halite is a possible source of both Na+ and Cl. The relationship between Na+ and Cl also were plotted in Fig. 4a, which suggests that there are additional sources for Na+, because it is not a 1:1 relationship between the two ions. The dissolution of dolomite and calcite are the primary sources of Ca2+, Mg2+, and HCO3, so these ions should have a high correlation and HCO3 versus (Ca2+ + Mg2+) should be in a theoretical range (Li et al. 2016a). Table 2, however, shows that the correlation coefficient between Ca2+ with HCO3 is insignificant. Figure 4c shows that (Ca2+ + Mg2+) versus HCO3 does not plot within the theoretical range. This discrepancy could be attributed to the cation exchange between Ca2+ and Na+ in the groundwater. Schoeller (1965) proposed two chloro-alkaline indices (CAI-1 and CAI-2) for the study of cation exchange. They are defined as (Li et al. 2014):

$${\text{CAI}} - 1 = \frac{{{\text{Cl}}^{ - } - ({\text{Na}}^{ + } + {\text{K}}^{ + } )}}{{{\text{Cl}}^{ - } }}$$
(6)
$${\text{CAI}} - 2 = \frac{{{\text{Cl}}^{ - } - ({\text{Na}}^{ + } + {\text{K}}^{ + } )}}{{{\text{HCO}}_{3}^{ - } + {\text{SO}}_{4}^{2 - } + {\text{CO}}_{3}^{2 - } + {\text{NO}}_{3}^{ - } }}$$
(7)
Table 2 Matrix of correlation coefficients for the 14 water quality variables from the Guanzhong Basin
Fig. 4
figure 4

Plots of a Na+ versus Cl, b Ca2+ versus HCO3, c Ca2++Mg2+ versus HCO3 and d Ca2+ versus SO42−

As expressed in Eq. (8), positive values for the two indices indicate cation exchange, whereas negative values indicate reverse ion exchange as expressed as Eq. (9) (Li et al. 2016b). There are 97.67% samples with negative CAI-1 and CAI-2, and only one sample with positive values. Thus, cation exchange expressed in Eq. (9) is the important process controlling the groundwater chemistry. To confirm the occurrence of cation exchange reaction, the bivariate diagram of (Ca2+ + Mg2+)–(HCO3 + SO42−) as a function of (Na+ + K+ − Cl) in Fig. 5b shows a slop of − 1, indicating, again, the existence of a cation–anion exchange phenomenon.

$$2{\text{Na}}^{ + } + {\text{CaX}}_{2} = {\text{Ca}}^{2 + } + 2{\text{NaX}}$$
(8)
$${\text{Ca}}^{2 + } + 2{\text{NaX}} = 2{\text{Na}}^{ + } + {\text{CaX}}_{2}$$
(9)
Fig. 5
figure 5

Plots of a CAI-1 versus CAI-2 and b (Ca2++Mg2+)–(SO42−+HCO3) against Na++K+−Cl

The concentration of CO2 in water controls the dissolution of calcite and dolomite in the groundwater (Li et al. 2016a). As shown in Fig. 4b and c, molarity ratio of Ca2+ + Mg2+ versus HCO3 is close to 1:2, indicating that there are other processes influencing Ca2+ concentration. The dissolution of dolomite can be expressed as Eq. (10):

$${\text{CaMg}}({\text{CO}}_{3} )_{2} + 2{\text{H}}_{2} {\text{O}} + 2{\text{CO}}_{2} \to {\text{Ca}}^{2 + } + {\text{Mg}}^{2 + } + 4{\text{HCO}}_{3}^{ - }$$
(10)

Li et al. (2015b, 2019a, b) and Dar et al. (2011) have comprehensively explored fluoride and nitrate concentrations in groundwater, because these two contaminants have great impacts on groundwater quality and human health. As shown in Table 2, F has a significant correlation with pH and HCO3 (r = 0.392 and r = 0.548 at a significance level of p < 0.01, respectively), which indicates that the alkaline nature of water is more favorable for fluoride enrichment. Similarly, Li et al. (2014) and Adimalla et al. (2018) adopted some chemical reactions to explain the sources of fluoride in the groundwater and obtained the same conclusion about fluoride enrichment. There is a positive correlation between F and Na+ (r = 0.725 at a significance level of p < 0.01) in Table 2, which implies that the high concentration of Na+ will favor the enrichment of F in groundwater. Wu et al. (2020), Panneer et al. (2017), and Hossain et al. (2016) gained a similar finding about fluoride enrichment. The molar concentration of (Ca2+ + Mg2+) and HCO3 are plotted in Fig. 4c, which indicates that with the increased concentration of NO3, the molarity ratio of (Ca2+ + Mg2+) versus HCO3 tends to increase. Human activities contribute heavily to the concentration of nitrate in the groundwater (Bouchard et al. 1992). Hence anthropogenic processes may accelerate the release of Ca2+ and Mg+ from the surrounding minerals.

Gypsum is a potential source of Ca2+ and SO42− in the area, but the correlation coefficient of them is insignificant (r = 0.288). Figure 4d displays the concentration relationship between Ca2+ and SO42−, indicating that Ca2+ and SO42− are from other sources in addition to gypsum. Shown by Table 2, SO42− is positively correlated with NO3 (r = 0.424 at a significance level of p < 0.01), indicating that anthropogenic factors may be an important source of SO42−.

Generally, TDS and TH should be significantly correlated with major ions. TDS is significantly correlated with K+, Na+, Mg2+, F, Cl, SO42−, HCO3, and NO3 (r = 0.609, 0.901, 0.882, 0.567, 0.903, 0.925, 0.793, and 0.505, respectively, at significance level of p < 0.01). Table 2 shows that TH is significantly correlated to Ca2+, Mg2+, Cl, SO42−, HCO3, and NO3 (r = 0.568, 0.950, 0.717, 0.822, 0.838, and 0.656, respectively, at significance level of p < 0.01).

Saturation indices (SI) are usually used to describe the dissolution/precipitation of minerals in the groundwater (Li et al. 2010). The saturation indices of calcite, dolomite, fluorite, and gypsum at 25 °C were calculated via PHREEQC (Parkhurst and Appelo 1999) and plotted in Fig. 6 as a function of TDS. SI of fluorite and gypsum are both negative, which indicates that fluorite and gypsum are undersaturated in the groundwater and more fluorite and gypsum can dissolve into the groundwater under suitable conditions. The saturation indices of calcite and dolomite, however, are positive, except one sample, which indicates that calcite and dolomite tend to precipitate in the groundwater. With the increasing trend of TDS, the saturation indices of fluorite and gypsum slightly increase, which may cause the precipitation of calcite and dolomite.

Fig. 6
figure 6

Relationships of saturation indices of calcite, dolomite, fluorite, and gypsum with TDS

Principal Component Analysis and Factor Analysis

The number of components in PCA is determined by the Kaiser criterion, for which the components with eigenvalues greater than 1 are retained. As shown in Table 3a, the first three principal components (PCs) have eigenvalues greater than 1, and they represent 79.281% of the total variance in the hydrochemistry dataset. Table 3a presents the principal component loadings for these components, which shows the importance of the variables for each component. The component loadings can be classified as strong, moderate, and weak based on the absolute loading values of > 0.75, 0.75–0.5, and 0.50–0.30, respectively (Bu et al. 2010; Liu et al. 2003).

Table 3 Total variance explained by (a) PC and the loading matrix of PCs, (b) VF and the loading matrix of VFs

PC1 explains 50.423% of the total variance, and it has strong loadings on Na+, Mg2+, Cl, SO42−, HCO3, TDS, and TH (0.884, 0.951, 0.908, 0.942, 0.853, 0.961, and 0.913, respectively), and moderate loadings on K+, F, and NO3 (0.591, 0.580, and 0.603, respectively). PC1 can be regarded as the salinity component of the water. TDS and TH have high loadings in PC1, because TDS and TH are significantly correlated with most ions (Table 2). Na+, K+, Mg2+, Cl, HCO3, and F are the important indices contributing to salinity and mainly come from weathering and dissolution of halite, dolomite, and fluorite. As discussed previously, NO3 and SO42− are closely related to the anthropogenic processes. Therefore, PC1 represents a mixture of natural and anthropogenic processes.

PC2 and PC3 explain 20.836% and 8.562% of the total variance, respectively, which indicates that PC2 and PC3 are secondary factors. PC2 is characterized by strong loadings on pH and Ca2+ (0.864 and − 0.779, respectively) and moderate loadings on F and CO32− (0.634 and 0.725, respectively). The positive loadings on pH, F, and CO32− and negative loadings on Ca2+ precisely correspond to the correlation between these ions (Table 2). Some researchers have found that an alkaline environment is conducive to the enrichment of fluoride and obstructs the dissolution of calcite and dolomite. PC2 represents fluorite dissolution and can be indicative of natural process. PC3 has a moderate loading on COD (0.635) and is deemed as the organic matter component. COD is mainly from human activity. Therefore, PC3 represents anthropogenic processes.

The difference between PCA and FA is that the FA must undergo a rotation process on the basis of PCA. The first five varifactors (VFs) in the factor analysis explain 89.366% of the total variance. As shown in Table 3a, b, the VF1 is consistent with PC1, representing salinity component of water. VF2 has a strong positive loading on Ca2+ and strong negative loading on pH, which represent the release of calcium into the groundwater. VF3 is the consistent with PC3, representing anthropogenic processes. VF4 has a strong positive loading on NO3 (0.919), which represents the nitrate process in the water. VF5 has a strong loading on CO32− (0.890), indicating that carbonate process is significant in the Guanzhong Basin groundwater. As describe above, both PCA and FA could be used to analyze the main factors affecting groundwater hydrochemical characteristics. Compared with PCA, FA is more accurate in the interpretation of results.

Hierarchical Cluster Analysis

R mode HCA was used to the 14 water quality variables, and the results are shown in Fig. 7. Cluster 1 includes F, COD, K+, pH, CO32−, NO3, Ca2+, Mg2+, Cl, Na+, and SO42−. Cluster 2 includes HCO3, TH, and TDS. In cluster 1, COD, pH, and NO3 are mainly influenced by human activities. The other physicochemical indices in cluster 1 are mainly controlled by a natural process, such as the dissolution of minerals and soil leaching. Therefore, cluster 1 represents the groundwater chemical characteristics regulated by both natural and anthropogenic processes. The indices in cluster 2 are related to major ions, which are mainly controlled by natural processes, such as dissolution of calcite, dolomite, gypsum, and fluorite.

Fig. 7
figure 7

Dendrogram of R mode HCA of groundwater physicochemical indices

Figure 8 shows the result of the Q mode HCA for the analysis of the 43 groundwater samples. In this study, a phenon line was drawn across the dendrogram at a linkage distance of approximately 30. This position of the phenon line allows a division of the dendrogram into four clusters of water samples (C1–C4). The Stiff diagrams of the four clusters based on median concentrations are also included in Fig. 8. To describe the characteristics of each cluster of samples, the median values for each parameter are presented in Table 4.

Fig. 8
figure 8

Dendrogram of Q mode HCA of groundwater samples and stiff diagram of each clusters

Table 4 Concentration of parameters of different clusters

The dendrogram shows some indications of the level of similarity between the four clusters. C1 contains 23 samples, and C2 includes 12 samples. C1 and C2 are linked with a lower linkage distance compared to the other clusters, implying that they have a greater similarity. It can be inferred that C1 and C2 samples have the same or similar geochemical characteristics. C3 consists of seven groundwater samples, and C4 includes only one sample (q9). Weak similarities between the geochemistry of C4 and other cluster samples also are expected, because there is a large linkage distance between C4 and others. The Stiff diagrams (Fig. 8) and the data (Table 4) show that the four clusters are classified as disparate hydrochemical types. Samples from C1, C2, and C3 are classified as HCO3–Na·K type and the concentrations of all ions except CO32− are lower than the sample from C4. The sample in C4 is classified as SO4–Na·K type and has a much higher concentration in SO42− compared with samples in other clusters. Most parameters except CO32−, pH, COD increase from C1 to C4.

Samples in C1 have the lowest ion concentrations, which may indicate that they did not have a long residence time within the geosphere, and their interactions with the surrounding minerals are not sufficient. This is confirmed by the box plot of the saturation index in Fig. 9, which shows that there is a slight increase in the saturation indices from C1 to C4. This might be due to the increasing water–rock interactions or by human activities. The trend in the saturation index implies that calcite and dolomite gradually become supersaturated, and fluorite and gypsum are close to saturation. The saturation indices of the minerals in C4 are higher than those in the other clusters, which may be explained by a longer residence time and sufficient interaction with the surrounding minerals. The concentration of nitrate in C4 is 107.2 mg/L, indicating that the sample from C4 is significantly influenced by anthropogenic processes.

Fig. 9
figure 9

Box plot of a calcite, b dolomite, c fluorite, and d gypsum saturation indices

Groundwater Management in the Guanzhong Basin

As presented earlier, the natural processes and human activities are regarded as the main influencing factors. On one hand, the anthropogenic processes can introduce more sulfate, nitrate, chloride, and organic matter into the groundwater, which influence the groundwater quality and human health. On the other hand, human activities can change the hydrogeological conditions, such as groundwater level, which may further change the hydrochemical characteristics. Considering the importance of Guanzhong Basin in the Belt and Road Initiative, it is urgent to adopt a sustainable groundwater quality management. Groundwater quality improvement programs should be conducted.

Weihe River, the biggest river running through the Guanzhong Basin, has close contact with local groundwater. However, wastewater produced by upstream industry and downstream domestic sewage have polluted the Weihe River. This will further affect the groundwater quality by through river water percolation. The local government should prevent the river water from further pollution and enhance the treatment of industrial wastewater and domestic sewage. In addition, the rural toilet should be reformed to prevent the groundwater from pollution of bacteria, nitrate, and sulfate. Fertilizers and pesticides should be rationally used in the agricultural area, which will reduce the non-point source pollution. For areas with high fluoride and total hardness caused by natural factors, the government should organize tap water defluorination removal projects or install water purifiers to reduce potential human health risks. Meanwhile, the regular groundwater monitoring in the Guanzhong Basin is an indispensable part of groundwater management.

The groundwater quality is affected by multiple factors, including climate change, land use change, human activities, surface water percolation, and social development. The necessary step to ensure effective protection of groundwater is enhancing cooperation among different departments. Thus, groundwater research should be combined with natural sciences and social sciences, so that groundwater management can be more sustainable.

Conclusions

The sources and influencing factors of groundwater quality in the central parts of the Guanzhong Basin were determined using hydrogeochemical, statistical and multivariate statistical techniques. We found that both natural and anthropogenic processes influence the groundwater chemistry. The findings can be summarized as follows:

  • Most of the major ions in the groundwater are predominantly regulated by natural processes, as indicated by the correlation analysis. The chloralkaline indices demonstrate that cation exchange is a significant process influencing Na+ and Ca2+. Sulfate and nitrate, however, are mainly influenced by anthropogenic processes, such as industrial and domestic effluents, elevated groundwater level due to irrigation.

  • The extracted five VFs from factor analysis explain 89.366% of the total variance and are related to salinity, calcium ions, organics, nitrates, and carbonates, respectively. Overall, the salinity, calcium ion, and carbonate components are mainly controlled by natural processes. VF3 and VF4, however, represent anthropogenic processes. It should be noted that the anthropogenic and natural processes cannot completely be separated. Factor analysis is more effective in explaining the characteristics of groundwater hydrochemistry than principal component analysis in this study.

  • R mode and Q mode HCA applied in this study. In the R mode HCA, cluster 1 represents the combination of natural and anthropogenic processes. Cluster 2 includes HCO3, TH, and TDS, which are mainly governed by natural processes. Four geochemically distinct clusters can be classified by Q mode HCA. Samples from C1, C2, and C3 consist of HCO3–Na·K or HCO3–Na type samples, and the sample from C4 is of the SO4–Na·K type. Sample from C4 comes from a phreatic aquifer, with high concentrations of major ions.

  • Most of groundwater in the study area is generally suitable for multiple uses, except groundwater in a few locations. The usage of fertilizers and pesticides, and domestic and industrial wastewater could lead to negative influence on human health. Thus, continuous groundwater quality monitoring is necessary for sustainable groundwater quality management and protection. In further research, trace metals should be considered, and this will result in a more comprehensive understanding of groundwater quality.