Introduction

In recent decades, industrial and population growth have led to increased demand worldwide for freshwater resources, especially groundwater. The over-exploitation of groundwater along with contamination caused by human activities has put freshwater supplies at risk (Li et al. 2017). Trace elements are among the most hazardous contaminants of water due to their distinct characteristics including chemical stability, weak degradation, wide range of sources, serious toxicity, bioaccumulation and difficulties in remediation (Li et al. 2015; Sun et al. 2016), and they can pose significant threats to humans and the environment. Some trace metals are essential nutrients at low levels but can lead to severe poisoning at higher concentrations. Trace element contamination mainly originates from natural (weathering and dissolution of rocks) and anthropogenic (mining, industrial and agricultural) processes and activities that can deteriorate the quality of water for drinking, agricultural and industrial purposes (Barzegar et al. 2017).

Knowledge of potential sources of trace elements and their content in groundwater is important for sustainable and effective management of groundwater resources. The potential origin of trace elements in a groundwater system can be determined by graphical and multivariate statistical methods and modeling approaches (e.g., diffusion and receptor models, Sun et al. (2016)). Multivariate methods are increasingly being used because of their easy application and ability to compare large amounts of information, which is very difficult to do using number tables or univariate statistics (Bingöl et al. 2013). Among the multivariate approaches, correlation analysis, cluster analysis (CA) and factor analysis (FA) are the most widely used statistical methods to determine the source of groundwater quality degradation (Yidana et al. 2010; Purushothaman et al. 2014; Wu et al. 2014; Barzegar et al. 2017, 2018 and Soltani et al. 2017). Correlation analysis deciphers the relationships between hydrochemical variables to identify their potential origins. CA reveals similarities between samples and groups of samples. FA does not require prior knowledge of the number of sources and source characteristics of the contaminants, and is used to determine the potential hydrogeochemical processes and effective factors (e.g., natural and anthropogenic) in groundwater quality. Many studies to identify the origins of trace elements in groundwater by multivariate statistics have been carried out in different parts of the world (Li et al. 2014; Purushothaman et al. 2014; Esmaeili et al. 2018; Barzegar et al. 2017, 2018). These studies showed that multivariate approaches were able to discover the potential origins of contaminants in water as well as hydrogeochemical processes. For instance, Barzegar et al. (2018) investigated the potential source of some selected trace elements in groundwater of the Shabestar area, northwestern Iran, using multivariate statistical analysis. The outcomes indicated that weathering and dissolution of rocks, particularly silicates and evaporites, ion exchange processes, water salinity, and alkalinity were the main factors resulting in the release of trace elements into the groundwater of the area. The impacts of nitrate along with zinc in fertilizers and agrochemicals were responsible for 10% of the total variance in groundwater quality. Esmaeili et al. (2018) evaluated the potential origins of trace elements including iron (Fe), manganese (Mn), zinc (Zn), chromium (Cr), lead (Pb), cadmium (Cd), aluminum (Al) and arsenic (As) utilizing correlation coefficient, CA and FA analyses in groundwater of the Qareh-Ziaeddin area, in northwestern Iran. They inferred that weathering and dissolution of evaporitic formations, igneous rocks with fluoride-bearing minerals and serpentinite rocks, denitrification and cation exchange, were the prevailing processes in the groundwater of the study area. Moreover, FA confirmed the geogenic impacts with a total variance of 83% on the groundwater quality.

Exposure to potentially toxic chemicals such as trace elements in water can cause serious adverse human health effects, including a variety of cancers, intellectual disabilities, and neurological, cardiovascular, kidney and bone diseases (Adriano 2001; Järup 2003; Li and Qian 2011; Prüss-Ustün et al. 2011). Potential exposure to a mixture of trace elements can be evaluated by a health risk assessment, which is used to understand the potential of a risk source to introduce risk agents into the environment, to estimate the quantity of risk agents that come into contact with human–environment boundaries, and to quantify the health consequences of exposure (Ma et al. 2007; Giri and Singh 2015). Many researchers (e.g., Nguyen et al. 2009; Rahman et al. 2009; Wongsasuluk et al. 2014; Celebi et al. 2014; Li et al. 2014; Liang et al. 2016; Barzegar et al. 2018) have reported on the human health risks of trace elements in groundwater. For example, Liang et al. (2016) evaluated the health risk of As pollution in drinking groundwater in southern Taiwan based on the hazard quotient (HQ) and target risk (TR). Results showed that the HQ values for the 95th percentile were above the value of 1, and TR levels exceeded the threshold value of 10E−6. They concluded that 0.01–7.50% of the population’s HQ levels were higher than 1 and 77.7–93.3% of the population was in the high cancer risk category and had a TR value of 10E−6. The TR calculation outcomes inferred that the use of groundwater for drinking purposes put people at risk of As exposure. Wongsasuluk et al. (2014) investigated the human health risk of heavy metals in drinking water from shallow groundwater wells in an agricultural area in Ubon Ratchathani province, Thailand. According to their results, unacceptable non-carcinogenic health risk levels were detected at some water wells for As, Cu, Zn and Pb with HQs ranging from 0.004 to 2.901, 0.053 to 54.818, 0.003 to 6.399 and 0.007 to 26.80, respectively. The hazard index (HI) values (ranging from 0.10 to 88.21) exceeded allowable limits in 58% of the wells. Although the concentrations of these elements were within acceptable ranges for drinking, As had an unacceptable cancer risk at some water wells, indicating the significance of actual consumption rates in estimating health risk factors.

The Khoy area in Iran’s West Azerbaijan province is a fertile and industrialized area. The drinking water in the villages and in the city of Khoy, as well as irrigation of agricultural lands, is very dependent on groundwater. The trace element status of the groundwater and associated health risks in the study area have not previously been reported. Therefore, this study was carried out on the groundwater system of the Khoy area to: (1) identify and measure trace elements including iron (Fe), manganese (Mn), arsenic (As), lead (Pb), cadmium (Cd), cobalt (Co), zinc (Zn), chromium (Cr), aluminum (Al) and nickel (Ni) and determine their potential origins; and (2) evaluate the human health risk of these elements to children and adults through drinking water intake.

Materials and methods

Study area description

The Khoy plain is located northwest of Urmia Lake, in the northern part of West Azerbaijan province, Iran, and is part of the Aras River drainage basin. The Khoy plain has an area of about 600 km2 and is bordered by mountains of which the highest is Avrin Mountain (1300 m) in the west of the plain. Surface water resources of the area are mostly two rivers, the Aland and Qotur-Chay, which originate in the western highlands and discharge into the Aras River in the northeast of the plain. Figure 1 shows the location of the study area including the digital elevation model (DEM) of the Khoy basin and the main rivers. The study area is characterized by a cold arid climate and the annual average rainfall is 292 mm. The 20-year (1997–2017) temperature and humidity averages are 11.6 °C and 60%, respectively. Khoy city is the most important city in the area with more than 348,000 inhabitants. Groundwater is extensively utilized for drinking, agriculture and industrial purposes in the area. The main crops cultivated in the region are sugar cane, sunflower, beets, wheat, barley, corn, lentils, beans, tomatoes, potatoes, chickpeas, alfalfa, melons and watermelons.

Fig. 1
figure 1

Digital elevation model (DEM) of the Khoy area and location of the rivers

Geological and hydrogeological setting

The Khoy area is structurally part of the Khoy-Mahabad sub-zone (the intersection of the Alborz and Sanandaj-Sirjan zones) (Nabavi 1976), which has experienced frequent occurrences of magmatic activity. Magmatic and metamorphic facies, and also recent alluvium, are abundant facies in the area. Figure 2 shows the geological units of the Khoy area. Except for metamorphic rocks with a lithological composition of gneiss, migmatite and schist that are probably related to the Precambrian era, rock units of the Late Precambrian make up the Kahar Formation composed of slate, shale, coal with flakes of graphite, north of Khoy city and are considered the oldest units (Yeganeh et al. 2015). Mesozoic rocks consisting of ophiolitic complex, mafic and ultramafic rocks (e.g., diorite, gabbro, basalt, serpentinite, dunite and harzburgite) and metamorphic rocks (e.g., gneiss, marble, schist related to the Cretaceous period) are found in the western, northern and southwestern parts of the Khoy plain. The Miocene units are made up of conglomerate and represent a disruptive environment. The Upper Miocene to Pliocene deposits, composed of sandstone and conglomerate with marl, shale, layers of gypsum and salt, are known as the Upper Red Formation and can be observed in the west, southeast and northeast of the plain. Units of the Pliocene period are a composition of conglomerate, marl and sandstone, which are distributed in the south and southeast of the plain (Radfar et al. 2005).

Fig. 2
figure 2

Geological map of the study area (after Radfar et al. 2005) and location of the sampling sites with water types

Generally, alluvial deposits in the plain resulted from deposition of the Aland and Qotur-Chay Rivers and are mainly composed of gravel, sand and clay. The old terraces are bonded with clay loam, and therefore, their permeability is low; recent terraces, located along the rivers, are made of separated sandy grains with high permeability. According to information from geophysical studies, log drillings and geological investigations, the main aquifer was formed in recent alluvial deposits (Quaternary). Aquifers of the Khoy plain consist of two main types: unconfined and confined. Figure 3 shows the schematic position of the aquifers in the plain. The unconfined aquifer covers the entire plain, whereas the confined aquifers are situated in the west and central parts of the plain. Figure 4 illustrates a geological cross section of the unconfined aquifer in the Khoy plain. Generally, the confined aquifer in the Qotur-Chay River watershed consists of alluvium, which is not uniform in direction of flow and depth. The fine-grained clay and silty layers cause a separation among the deposits, which result in a confined aquifer. In the Aland River watershed, tectonic movement has an important role in the formation of the confined aquifer in the western part of the plain. Displacement of the strata and layers with low permeability on top of coarse-grained layers by the Firooraq fault resulted in the confined aquifer in the Aland River watershed. Nowadays, due to over-exploitation of groundwater (especially in the north), the confined aquifer has been modified to an unconfined aquifer hydraulically. Total thickness of the aquifers ranges between 20 and 200 m in the northwest and central part of the plain. The transmissivity values for the unconfined aquifer range between 50 and 1500 m2/d and pumping tests in the area resulted in an average storage coefficient of 0.05.

Fig. 3
figure 3

Schematic position of the aquifers and unconfined groundwater flow direction in the Khoy plain

Fig. 4
figure 4

Geological cross section of the unconfined aquifer in the Khoy plain

The groundwater flow direction is mainly from southwest to northeast. The bedrock of the plain includes marl and conglomerate, which are predominant in the northern and southern parts of the plain, respectively.

Water sampling and analysis

Groundwater water samples were collected from 54 water sources including wells, springs and qanats in July 2017 in the Khoy plain. Figure 2 shows the locations of the sampling points. There are 15 overflow artesian wells in the confined aquifers from which four water samples were collected. Measurements of the pH and electrical conductivity (EC) were recorded in the field using portable instruments, viz. Metrohm 826 and HANNA HI 9033, respectively. Samples were collected in polyethylene bottles, and included one filtered and unacidified sample for major and minor elements, and one filtered and acidified sample for trace element analysis. Acidification and filtration processes were performed using nitric acid and 0.45-μm membrane filters, respectively. Major and minor ions and species were analyzed by titration (for analysis of Ca, Mg, HCO3 and Cl), flame photometer (for analysis of Na and K) and spectrophotometry (for analysis of SO4, NO3, F and SiO2) according to AHPA (2005) at the Water Laboratory of the University of Tabriz. Accuracy of the chemical analysis of the major ions was checked by computing the cation–anion charge balance (Eq. 1):

$${\text{Charge}}\;{\text{balance}} = \frac{{\left( {C - A} \right)}}{{\left( {C + A} \right)}} \times 100,$$
(1)

where C and A are the concentrations of total cations (Ca + Mg + Na + K) and total anions (HCO3 + Cl + SO4 + NO3), respectively, in meq/L, and charge balance is the percentage of ionic balance error. The charge balances for all the samples were within the limit of ± 5% suggested by Domenico and Schwartz (1998).

Trace elements (Fe, Mn, Al, Zn, Cr, Pb, Cd, Co, Ni and As) were measured using a Varian Spectra 220 Atomic Absorption Spectrometer with GTA 110 at the Water Quality Control Lab of East Azerbaijan Province. Field blank and duplicate procedures were used for quality control assurance. Calibration of equipment was performed linearly with Merck Standard Solutions for each element. The quality control sample was used to check the linearity after every ten samples. Accuracy of the trace element analyses was determined by analyzing certified reference material, NIST SRM. Also, background amendments for all analyses and the nitrous oxide–acetylene flame were used to minimize the risk of interference.

Multivariate statistics

To perform the multivariate statistics, preprocessing (i.e., log transformation and standardization) was employed on the dataset to normalize the measured water quality variables and eliminate the effect of different units of parameters, respectively. For the correlation analysis, a Pearson correlation was applied to decipher the relationships between water quality variables. The strength (r) and significance (p value) of the relationships are two important factors. A larger r indicates a stronger relationship (Barzegar et al. 2016), and in this study, r > 0.7, 0.7 > r > 0.5 and r < 0.5 were considered as strong, moderate and weak relationships, respectively. The smaller the p value, the more significant the relationship (Barzegar et al. 2016).

All measured water quality variables were considered during the hierarchical clustering analysis (HCA) using Ward’s method (Ward 1963). This method uses an intra-cluster variation to form clusters, and the clusters are formed by maximizing the homogeneity within each cluster (Hardle and Simar 2007). The squared Euclidean distance is commonly adopted to measure distance or dissimilarity between clusters. The squared Euclidean distance usually expresses the similarity between two samples, and the distance can be shown by the difference between analytical values of the samples (Otto 1998; Bhuiyan et al. 2011; Barzegar et al. 2017).

Factor analysis (FA) is generally employed to recognize the hidden dimension, which may not be explained by direct analysis. The principal component analysis (PCA) is a method to reduce the volume of a large dataset with minimum loss of information (Singh et al. 2017). In total, 19 water quality variables including pH, EC, Ca, Mg, Na, K, HCO3, SO4, Cl, NO3, F, SiO2, Fe, Mn, Al, Zn, Cr, Pb and As were considered to carry out the FA. The varimax rotation method and Kaiser’s criterion (Kaiser 1960) were applied to optimize factor loadings to bring about a simple structure and to extract factors with eigenvalues greater than 1, respectively. Accordingly, factor loadings above 0.75 were considered as high and 0.5–0.75 as medium loadings (Tziritis et al. 2016, 2017). The FA revealed a 0.595 value for the Kaiser–Mayer–Olkin (KMO) and 171 (p < 0.0001) for Bartlett’s sphericity, indicating that FA was appropriate in providing significant reductions in the dimensionality of the data. The statistical analysis was carried out using SPSS v.21 (IBM Corp 2012).

Human health risk

The non-carcinogenic (chronic) human health risk of trace elements in the groundwater of the Khoy plain was investigated using the method suggested by the U.S. Environmental Protection Agency (USEPA 1999). Chronic daily intake (CDI) of a single trace element through direct ingestion for children and adults was calculated by Eq. 2 (USEPA 1999; Li et al. 2016; Wu and Sun 2016; Zhang et al. 2018):

$${\text{CDI}} = \frac{{C_{{\rm W}} \times {\text{IRW}} \times {\text{EF}} \times {\text{ED}}}}{{{\text{BW}} \times {\text{AT}}}},$$
(2)

where CDI is exposure dose through intake of groundwater (mg/kg per day), Cw is contaminant concentration in groundwater (mg/L), IRW is the water ingestion rate (L/day, IRW = 2 L/day for adults and 1 L/day for children), EF is exposure frequency (days/year, EF = 365 days/year), ED is exposure duration (years, ED = 70 years for adults and 6 years for children), BW is body weight (kg, BW = 70 kg for adults and 15 kg for children) and AT is average exposure time for non-carcinogens effects (days, AT = 25,550 days for adults and 2190 days for children) (Bortey-Sam et al. 2015; Duggal et al. 2017; Barzegar et al. 2018).

The hazard quotient (HQ) was then calculated by Eq. 3 (Li et al. 2016; Zhang et al. 2018):

$${\text{HQ}} = \frac{{\text{CDI}}}{{\text{RfD}}},$$
(3)

where RfD is the reference dose of a specific element (mg/kg per day) and considered as Fe (0.7), Al (1), Cr (1.5), Mn (0.046), As (0.0003), Zn (0.3), Pb (0.0035), Cu (0.04) and Ni (0.02) (mg/kg/day) (Duggal et al. 2017; Barzegar et al. 2018).

The summation of the HQ values of the trace elements gave the hazard index (HI) of each groundwater sample:

$${\text{HI}} = \sum {\text{HQ}} .$$
(4)

The classification of non-carcinogenic human health risk based on USEPA (1999) is presented in Table 1 (Bortey-Sam et al. 2015; Su et al. 2017). HI > 1 indicates that the non-carcinogenic risk of the contaminant exceeds the acceptable limit; HI < 1 demonstrates that the non-carcinogenic risk is within the acceptable limits (USEPA 1999; Li et al. 2016; Su et al. 2017; Zhang et al. 2018).

Table 1 Classification of non-carcinogenic risk (USEPA 1999)

Results and discussion

Hydrochemistry

Descriptive statistics of the measured water quality parameters are given in Table 2. The pH, with a median value of 7.63, was neutral. EC values ranged between 460 and 7100 µS/cm and were above the permissible level of 1500 µS/cm (WHO 2011) for 40% of the samples. The relative abundance of major and minor cations and anions, based on median values, was found to be Ca > Mg > Na > K and HCO3 > SO4 > Cl > NO3. The order of trace elements content was: Fe > Zn > Al > As > Cr > Pb > Ni > Mn > Co > Cd. The statistical parameters, e.g., skewness and kurtosis, revealed that the majority of the measured parameters, which were outside the range of − 2 to + 2 (Wu et al. 2010; Barzegar et al. 2016), significantly deviated from normality. The levels of EC, F, Pb, Cd, Zn, As and all the major ions except K exceeded the permissible levels for drinking water as recommended by WHO (2011).

Table 2 Descriptive statistics of the measured and analyzed parameters for groundwater samples of the Khoy plain

A Piper diagram (Piper 1944) was employed to identify water types in the study area (Fig. 5). The majority of samples from the unconfined aquifer and all the samples from the confined aquifers belonged to the Ca(Mg)–HCO3 water type, characterized by a temporary hardness feature. The spatial distribution of the water types is shown in Fig. 2. Six samples, mainly from the southern part of the plain, showed a saline Na–Cl water type. Samples 1, 7, 8, 31, 33 and 45, mainly from the west of the plain (discharge zone), lay in the E domain of the Piper diagram where none of the cations and anions were predominant. This indicated a mixing zone.

Fig. 5
figure 5

Piper diagram of the groundwater samples in the Khoy plain

Correlation analysis

The results of the correlation analysis for the groundwater quality variables in the Khoy plain are given in Table 3. There are moderate to strong correlations between EC and major ions including Na (r = 0.860, p < 0.01), K (r = 0.838, p < 0.01), HCO3 (r = 0.623, p < 0.01), Cl (r = 0.858, p < 0.01) and SO4 (r = 0.820, p < 0.01), which showed the contribution of these ions to the salinity of the groundwater. The negative moderate correlation between pH and EC (r = − 0.493, p < 0.01) indicated the medium effect of acidity on the dissolved content of the water. The pH was also negatively correlated with Ca (r = − 0.741, p < 0.01), HCO3 (r = − 0.759, p < 0.01) and SiO2 (r = − 0.382, p < 0.01), which indicated the effect of HCO3 content on pH and the role of acidity on the dissolution of the silicate minerals. The correlations between pH and Ni (r = − 0. 628, p < 0.01), and pH and Co (r = − 0.622, p < 0.01), possibly indicated the role of pH (acidic environment) in releasing these metals into groundwater (Belzile et al. 2004; Balkhair and Ashraf 2016; Barzegar et al. 2016). A strong correlation between Ca and HCO3 (r = 0. 782, p < 0.01) suggested that some of these ions may have been derived from dissolution of calcite. The correlation of Cl with Na (r = 0.951, p < 0.01), SO4 (r = 0.640, p < 0.01) and Cl and K (r = 0.818, p < 0.01) may be attributed to dissolution of halite (NaCl) and sylvite (KCl) minerals and irrigation return flow (Srivastava and Ramanathan 2008). Moderate to high correlations between Ca and Ni (r = 0.783, p < 0.01), Cd (r = 0.591, p < 0.01), Co (r = 0.823, p < 0.01) and Na and Cd (r = 0.594, p < 0.01) indicated the possible co-enrichment of these metals with the mentioned ions and evaporitic formations as potential sources as suggested by Salminen et al. (2005). The correlation of Na and K (r = 0.778, p < 0.01) demonstrated the cation exchange process in the groundwater system (El Alfy et al. 2017). A moderate correlation between F and Na (r = 0.500, p < 0.01) and F and K (r = 0.545, p < 0.01) might be attributed to the weathering and dissolution of igneous rocks with F-bearing minerals in the area (e.g., cryolite, biotite and amphiboles). Moderate to high correlations between Al and Fe (r = 0.561, p < 0.01), and Cd and Co (r = 0.803, p < 0.01), were attributed to similar geochemical processes and conditions for the release of these metals into the groundwater (Barzegar et al. 2015, 2016). Moderate correlation between Cl and Cd (r = 0.630, p < 0.01), and Cl and Co (r = 0.500, p < 0.01), suggested that these metals were released into groundwater by forming complexes in solution with evaporites (e.g., CdCl or CoCl) (Salminen et al. 2005; Soltani et al. 2017). Moderate correlations of As with Na (r = 0.500, p < 0.01), and As with K (r = 0.552, p < 0.01), possibly suggested a common geogenic origin of As and conditions that enhanced its mobility.

Table 3 Correlation matrix for the water quality parameters in the Khoy plain

Cluster analysis (CA)

Based on the dendrogram produced by the HCA, groundwater samples were classified into two distinct clusters (Fig. 6a). Clusters 1 and 2 contained 80% and 20% of the total samples, respectively. Each cluster was divided into two sub-clusters (i.e., sub-clusters 1-1 and 1-2 for cluster 1 and sub-clusters 2-1 and 2-2 for cluster 2). Figure 6b shows the spatial distribution of the sub-clusters in the plain. To distinguish the important physicochemical parameters affecting the differences between sub-clusters, a one-way ANOVA was employed (Table 4). In this analysis, a significance value equal to zero showed a significant difference. There was a significant (sig. < 0.05) difference for all physicochemical parameters except Cr, Fe and Zn.

Fig. 6
figure 6

a The dendrogram of groundwater samples for HCA; b spatial distribution of samples based on sub-clusters

Table 4 Results of the one-way ANOVA analysis between sub-clusters

Box and whisker plots for the water quality variables were used to compare the sub-clusters (Fig. 7). According to the median values, pH and NO3 concentrations were elevated and EC, Mg, Na, K, HCO3, Cl, SO4, SiO2, Pb, Fe and As were lowest in sub-cluster 1-1 compared to the other sub-clusters. This sub-cluster was mainly located in the western and middle of the Khoy plain (Fig. 6b). The groundwater in these areas had a relatively low EC and high pH, which were mainly attributed to the recharge area in the west of the plain, to intensive agricultural activities and to the high coarse materials content of this part of the aquifer. Sub-cluster 1-2 had the lowest concentrations of NO3 (attributed to the high fine materials content of this part of the aquifer) and Cr (attributed to the low impact of ultramafic and serpentinite rock), and was mainly situated in the east of the plain (Fig. 6b). Sub-cluster 2-1 had the highest As concentration, EC and dependent variables (i.e., Na, Cl, Mg), and was mainly located in the south of the plain, except for sample 5 in the northeast. These samples might be affected by the volcanic rocks, marl, sandstone, conglomerate and limestone in the southwest of the plain. Some parameters such as Ca, HCO3, SO4, F, SiO2, Ni, Co, Pb, Cr, Al, Mn and Zn were elevated in sub-cluster 2-2. The samples of this sub-cluster were situated in the northern parts of the plain, and its characteristics could be attributed to the rocks in these areas (e.g., limestone, amphibolite, metamorphic and ultramafic rocks, serpentinite).

Fig. 7
figure 7

Box and whisker plots of the water quality parameters in the cluster analysis

Factor analysis (FA)

The results of FA for physicochemical variables in the groundwater of the Khoy area are given in Table 5. The first factor accounted for 36.9% of the total variability (69.1%) and included EC, Na, K, Cl, SO4, F and As parameters. This factor explained the effect of salinity on groundwater quality, which was due to weathering and dissolution of evaporitic minerals and also leaching of fertilizers from agricultural areas. Rowland et al. (2006) and Anawar et al. (2011) reported that As mainly occurs in the groundwater system in three phases including Fe and Mn oxides, organic matter and silicate–sulfate. Although there was no strong correlation between As and other measured water quality parameters (there was a moderate correlation with Na and K), the presence of As and SO4 indicated a silicate–sulfate phase as the possible source of As in the groundwater of the study area. As shown in Fig. 8a, FA1 contributed to the groundwater of the northeast and south of the plain where elevated values of EC were observed due to dissolution of the Miocene and Pliocene formations and evaporation occurring in shallow water tables.

Table 5 Factor loadings for the FA of the samples in the Khoy plain
Fig. 8
figure 8

Spatial distribution of factor scores for the FA

The second factor with 13.2% of the total variance included pH, Ca, HCO3, Pb and Mn. The presence of pH with negative strong loading indicated the effect of acidity on weathering and dissolution of the carbonate minerals and also in releasing Pb and Mn. This might indicate the release of Mn from minerals including rhodochrosite (MnCO3). Mn is partitioned into ferromagnesian silicates and enriched in mafic and ultramafic rocks relative to felsic lithologies. Carbonate rocks, particularly dolostone, may also contain high concentrations of Mn (Wedephol 1978), which can be deduced from the association between Mn and HCO3. The spatial distribution of FA2 (Fig. 8b) confirmed that the northern parts of the aquifer were influenced by the weathering and dissolution of the formations.

FA3 had 11.5% of the total variance and included NO3 with negative loading and SiO2, Al and Fe with positive loadings, which possibly indicated the effect of both anthropogenic and natural processes on groundwater quality. The association between Mn and Fe could be attributed to the ready substitution of Mn with Fe in minerals. The occurrence of NO3 with antithetic (negative) loading indicated anoxic conditions in the groundwater system (Anawar et al. 2003; Guo et al. 2011; Esmaeili et al. 2018) and also denitrification and nitrate reduction combined with other geochemical processes (Levins and Gosk, 2008; Usman et al. 2014). Mn and Fe are greatly influenced by redox conditions and easily mobilized as Mn2+ and Fe2+ under anoxic conditions (Hylander et al. 2000). In addition, agricultural activities and traffic might be a potential source for Mn in the groundwater. Geogenic sources of Fe and Mn are generally considered to be much more important than anthropogenic ones in the environment (Yang and Sanudo-Wilhelmy 1998), and the presence of SiO2 may confirm the source as silicate minerals. Note that a single FA may interpret multiple processes, as reported previously (e.g., Arslan 2013; Kumaresan 2008; Yidana et al. 2010). The spatial distribution of FA3 is illustrated in Fig. 8c.

FA4 accounted for 7.4% of the total variance and included Mg, Cr and Zn with positive loadings. This factor was explained by weathering and dissolution of the mafic rocks including pyroxene, amphibole, serpentine and mica. Also, Zn is readily partitioned into silicate minerals by substitution with Mg, which has a similar ionic radius (Salminen et al. 2005). FA4 mainly covered the northern part of the plain in which igneous rocks are predominant (Fig. 8d).

Health risk assessment

The calculated HQ of trace elements to evaluate non-carcinogenic risk via ingestion for children and adults is given in Table 6. The order of HQ, according to the median values, was the same for both population groups: As > Pb > Zn > Fe > Mn > Ni > Cu > Al > Cr.

Table 6 Non-carcinogenic risk of trace elements in terms of hazard quotient (HQ) for children and adults in the Khoy area

Several similar studies have been carried out regarding the health risk assessment of multiple trace elements in the groundwater of Iran (e.g., Sakizadeh and Mirzaei 2016; Barzegar et al. 2018). In the study conducted by Barzegar et al. (2018), mean values of HQ indicated that As, with about a 95% contribution to the total non-carcinogenic risk and an HQ content greater than 1, was the most likely element in the groundwater of the Shabestar area, northwestern Iran, to induce a health risk in both population groups. HI values ranged between 0.49 and 11.33 (mean of 4.37) for children and between 0.21 and 4.85 (mean of 1.87) for adults. It was concluded that the dominant chronic risk level was high for children and low for adults. In Shush and Andimeshk, Khuzestan Province, southern Iran, Sakizadeh and Mirzaei (2016) reported the mean non-carcinogenic health risks for Cr, Cu and Mn were 0.62, 0.38 and 0.05, respectively. These results illustrated that as a whole the mean non-carcinogenic risk was below 1; however, Cu at ten stations (11% of total samples) and Cr at one station (1% of total samples) had HI values higher than 1. The lowest health risk was associated with Mn with a maximum HQ of 0.7, and the highest risk was from Cu with a HQ greater than 2.5 at some stations. In the current study, As and Cr were the elements with the maximum and minimum values of HQ, respectively, for children and adults, and had the highest and lowest impacts on the non-carcinogenic risk of groundwater in the area. The HQ for As exceeded the safe level (HQ = 1) for children (HQmedian = 4.33) and adults (HQmedian = 1.85), and HI ranged between 1.09 and 15.08 (mean of 4.37) for children, and from 0.46 to 6.46 (mean of 1.87), for adults. The HI for children for all samples was higher than one, which indicated that all samples were outside the acceptable range of non-carcinogenic risk. The minimum and maximum values of HI were 0.46 and 6.46 for samples 53 and 19, respectively, for the adult group, and were 1.09 and 15.08 for the same samples, respectively, for the children’s group.

Comparison of the two aquifer types in the Khoy area (Fig. 9) showed that HQs of the unconfined aquifer samples were greater than those of the confined aquifer. The differences between the median HQs of aquifer samples for As (for adults: HQunconfined = 2.23 and HQconfined = 0.23; for children: HQunconfined = 5.22 and HQconfined = 0.55) and Pb (for adults: HQunconfined = 0.07 and HQconfined = 0.02; for children: HQunconfined = 0.16 and HQconfined = 0.04) were significant. The differences between aquifer HQs for other elements (e.g., Ni, Cd, Co, Cr, Al, Fe, Mn and Zn) were not significant for either population group. The median HI values for the unconfined and confined aquifer samples were 2.89 and 0.62 for adults, and 6.74 and 1.46 for children, respectively. Therefore, the health risk associated with water consumption from the unconfined aquifer was higher than that from the confined aquifer in the study area.

Fig. 9
figure 9

HQ medians for unconfined and confined aquifers of the Khoy plain

The spatial distribution of HI values for children (Fig. 10a), according to the USEPA (1999) classification, showed that the samples with medium risk (30% of total samples) were mainly located in the west of the plain where the EC values of the groundwater were low. The samples with high risk (70% of total samples) for children were located in the northern and southern parts of the plain. The HI of samples for adults was classified into three levels of chronic risk, including low (34% of samples), medium (16% of samples) and high (45% of samples) according to the USEPA (1999) classification. The spatial distribution of the HI for adults (Fig. 10b) illustrated that high-risk samples were mainly situated in the north and south of the plain.

Fig. 10
figure 10

Spatial distribution of hazard index (HI) values and the classification of samples based on non-carcinogenic risk a for children and b for adults

The health risk assessment outcomes showed that people living in the Khoy area who were exposed to trace metals through drinking water had a non-carcinogenic risk. Therefore, a permanent monitoring and remediation of trace elements (particularly As) in groundwater of the area is crucial.

Conclusions

Water quality parameters measured in 54 water samples from the Khoy plain in northwestern Iran showed that EC values and its major dependent ions (except K) were above the permissible levels for drinking according to the USEPA (1999). Moreover, the concentrations of some trace elements (i.e., Pb, Zn, Cd and As) exceeded the permissible levels for drinking and could affect the health of the local people.

Multivariate analysis, including correlation, cluster and factor analyses, showed the contribution of both natural and anthropogenic factors to groundwater quality in the area. Based on the cluster analysis (CA), water samples were categorized into two distinct clusters and each cluster included two sub-clusters. The ANOVA statistical procedure revealed that all of the physicochemical parameters except Cr, Fe and Zn affected the difference between sub-clusters. Sub-cluster 1-1 samples in the western and middle of the plain had elevated concentrations of pH and NO3 and the lowest levels of EC, Mg, Na, K, HCO3, Cl, SO4, SiO2, Pb, Fe and As compared to other sub-clusters. Sub-cluster 2-1 samples in the southern part of the plain contained elevated values of As, EC and its dependent variables (i.e., Na, Cl, Mg). The quality of samples in the sub-clusters was mainly affected by the geological units around sample locations.

Factor analysis (FA) indicated that the quality of groundwater in the Khoy area was mainly controlled by geogenic factors. Salinity was the major cause of deterioration of water quality in the area, which may be attributed to the weathering and dissolution of evaporitic minerals from Miocene and Pliocene units and leaching of fertilizers from agricultural areas. It can be concluded that an acidic environment may have resulted in the release of some trace elements (e.g., Pb, Mn, Ni and Co) into groundwater in the area. Redox conditions were another possible factor resulting in the release of Fe and Mn into the groundwater from silicate minerals, which was justified by negative loadings of NO3 in FA3. The presence of Cr and Zn was attributed to mafic rocks including pyroxene, amphibole, serpentine and mica in which Zn is readily partitioned into silicate minerals by substitution of Mg.

Based on chronic health risk assessment results, Cr and As had the lowest and highest impacts on the non-carcinogenic risk of groundwater in the area with hazard quotient values of 0.0001 and 11.55, respectively. The health risk associated with consumption of unconfined aquifer water was higher than that of confined aquifer water in the area. The high-risk samples for both children and adults were mainly situated in the northeast and southwest of the plain where the groundwater was saline. Decision makers should pay attention to groundwater management in these areas where over-exploitation of groundwater may cause elevated salinity and redox conditions, which could generate aquifer conditions that would further release trace elements into the groundwater.