Introduction

The groundwater quality deterioration gives an adverse effect to growing human populations and economic development, particularly elevating nutrients leading to eutrophication and metal pollution in the aquatic environment (Nriagu and Pacyna 1988; Peierls et al. 1998; Holloway et al. 1998; Li et al. 2008; Pekey et al. 2004). The natural sources of the metals include volcanism, bedrock erosion (Krishna et al. 2009; Pekey et al. 2004) and anthropogenic activities; particularly, mining and mineral processing have dominant influences on the biogeochemical cycles of trace metals (Krishna et al. 2009; Nriagu and Pacyna 1988; Nriagu 1989, 1996). Metal pollution leads to serious human health hazards through the food chain and the loss of biodiversity, and harms the environmental quality. Recent researches of trace elements and metals show highly interesting records (Zhang et al. 2009). Of which, their spatial variability reflects geological parent materials and anthropogenic sources in geographic heterogeneity (Imperato et al. 2003; Prasanna et al. 2012).

The methods of integrating numerous variables associated with water quality in a specific index are increasingly desired in national and international scenarios. Therefore, several researchers have developed various indices, technically referred to as water quality indices (WQIs) (Lermontov et al. 2009). Usually, water pollution indices are practical and comparatively simple approaches for evaluating the influence and source of overall pollution. The pollution indices were proposed to provide useful and comprehensible guiding tools for water quality executives, environmental managers, decision makers, and potential users of a given water system. The WQI was initially developed in the USA by Horton (1965) and has been widely used in Africa and Asia (Shoji et al. 1966; Handa 1981; Erondu and Nduka 1993; Palupi et al. 1995; Mohan et al. 1996; Li et al. 2009; Prasanna et al. 2012). Correlation and cluster analyses (CA) have been considered as more trustworthy approaches for data mining of matrices from environmental quality assessment (Astel et al. 2007, 2008; Simeonova and Simeonov 2007), and are widely used in water quality assessment.

This study evaluates the metal concentration in groundwater at a coastal area of Busan Metropolitan city, Korea. The main objective of this case study is to assess the groundwater quality using integrated approaches of water quality pollution indices and statistical analyses (Correlation and Cluster analyses). This study also identifies the pollution status and probable sources of metals in the groundwater.

Study Area

The proposed study area, Busan Metropolitan City (Fig. 1) is the second largest city in Korea with a population of 3.7 million and is located at the coast in the southeastern part of the Korean Peninsula. Groundwater utilization has been restricted by seawater intrusion at the coastal area of Busan City (Chung et al. 2014). The city has very long tunnels for three underground subway lines. The total groundwater discharged from the subway lines is approximately 15,000 \(\hbox {m}^{3}/\hbox {day}\) (5.5 million \(\hbox {m}^{3}/\hbox {year}\)) through subway tunnels of 96 km in length. There is serious groundwater drawdown, and groundwater quality is degraded around the subway tunnels. Many groundwater wells around subway tunnels have been abandoned due to the decrease of groundwater quantity or the deterioration of groundwater quality. Groundwater flow directions are indicated in Fig. 2. Geumnyeon Mountain is a large groundwater recharge area, and Baek Mountain is a small recharge area. Groundwater flows mainly from Geumnyeon Mountain to Gwanganri Beach and Suyeong River, but the downstream of groundwater returns to the subway lines. The large drawdown of groundwater level around the subway causes this problem. The seawater intrusion around the beach also disturbs the downstream flow of groundwater. Groundwater flow to the beach is not smooth because of seawater intrusion and return flow to the subway. The geology of the study area is composed of andesitic volcanic breccias, tuffaceous sedimentary rocks, rhyolitic rocks, and intrusive granodiorites and granite porphyries. The geologic time period of the study area is the Cretaceous in Mesozoic Era (Chang et al. 1983; Son et al. 1978).

Fig. 1
figure 1

Location map of the study area

Fig. 2
figure 2

Groundwater flow direction map of the study area

Materials and Methods

Groundwater Sampling

Groundwater in Busan city is available throughout the year, and utilized mainly for homes, offices, restaurants, hotels, trees, and grasses. Forty groundwater samples were collected during the year of 2010. The drilled bore wells were relatively uniformly distributed throughout the study area. Groundwater was sampled directly from water faucets after pumping for 10 minutes to remove stagnant water (APHA 1995). Groundwater samples were collected from high-quality polyethylene bottle through \(0.45~\upmu \hbox {m}\) cellulose nitric membrane filter to eliminate suspended materials. Most of the borehole wells were developed in bedrock, and the depth of the boreholes ranged from 30 to 350 m. The diameters of the borehole wells were 6–8 inches. Groundwater samples for the analysis of metal components were acidified to \(\hbox {pH}<2.0\) in the field. All samples were stored in ice chests at \(4~^{\circ }\)C, and transported directly to the laboratory, where they were analyzed within 2 weeks. pH, EC, and DO were measured in the field using portable instruments: Therm Orion \(250\hbox {A}^{+}\), U.S.A. for pH; TOA CM-14P, Japan for electrical conductivity (EC); and Istek 25D, Korea for dissolved oxygen (DO). Metals were analyzed in the laboratory using Inductively Coupled Plasma–Atomic Emission Spectrometer (ICP–AES, Shimadzu ICPS-1000 III). Accuracy of Quality Assurance (QA) was measured using spiked samples with known concentrations of solutes, and precision was checked by blind duplicate samples from the same site. This is to verify that decontamination procedures and laboratory protocols were adequate (Koterba et al. 1995).

Heavy Metal Evaluation Index

Metal evaluation index provides an overall quality of the water with respect to metals (Edet and Offiong 2002) and is expressed as

$$\begin{aligned} \mathrm{HEI}=\sum _{i=1}^n {\frac{H_c }{H_{mac} }}, \end{aligned}$$

where \({ {H}}_\mathrm{c}\) and \({ {H}}_\mathrm{mac}\) are the monitored value and maximum admissible concentration (MAC) of the \(i\mathrm{th}\) parameters, respectively.

Heavy Metal Pollution Index

The index of heavy metal pollution was developed by assigning a rating or weighting (Wi) for each chosen parameter. The rating is an arbitrary value between 0 and 1, and its selection reflects the relative importance of individual quality considerations. It can be defined as inversely proportional to the standard permissible value \((S_i)\) for each parameter (Horton 1965; Mohan et al. 1996; Reddy 1995). In this present study, the concentration limits (i.e., the standard permissible value \((S_i)\) and highest desirable value \((I_i)\) for each parameter) were taken from the WHO standard. The uppermost permissible value for drinking water \((S_i)\) refers to the maximum allowable concentration in drinking water in the absence of any alternate water source. The desirable maximum value \((I_i)\) indicates the standard limits for the same parameters in drinking water (Bhuiyan et al. 2010). The HPI, assigning a rating or weighting \((W_i)\) for each selected parameter, is determined using the expression below (Mohan et al. 1996):

$$\begin{aligned} \mathrm{HPI}=\frac{\sum _{i=1}^n {W_i Q_i } }{\sum _{i=1}^n {W_i } } \end{aligned}$$

where \(\hbox {{ {Q}}}_{i}\) and \(\hbox {{ {W}}}_\mathrm{i}\) are the sub-index and unit weight of the \(i\mathrm{th}\) parameter, respectively, and n is the number of parameters considered. The sub-index \((\hbox {{ {Q}}}_{ {i}})\) is calculated by

$$\begin{aligned} Q_i =\sum _{i=1}^n {\frac{\left\{ {M_i (-)l_i } \right\} }{S_i -l_i }} \times 100, \end{aligned}$$

where \({ {M}}_{ {i}}\), \({ {I}}_{ {i}}\) and \({ {S}}_{ {i}}\) are the monitored heavy metal, ideal, and standard values of the \(i\mathrm{th}\) parameter, respectively. The sign (\(-\)) indicates numerical difference of the two values, ignoring the algebraic sign. Generally, pollution indices are, estimated for any specific use of the water. The proposed index is intended for the purpose of water quality. The critical pollution index value of water quality is 100.

Degree of Contamination (DOC)

The contamination index (DOC) summarizes the combined effects of several quality parameters considered harmful to domestic water (Backman et al. 1997) and is calculated as follows:

$$\begin{aligned} Cd=\sum _{i=1}^n C_\mathrm{{fi}} \end{aligned}$$

with

$$\begin{aligned} C_\mathrm{fi} =\frac{C_\mathrm{Ai}}{C_\mathrm{Ni} }-1 \end{aligned}$$

where \({ {C}}_\mathrm{fi},\, { {C}}_\mathrm{Ai}\) and \(C_\mathrm{Ni}\) represent contamination factor, analytical value and upper permissible concentration of the \(\hbox {i}\mathrm{th}\) component, respectively. N denotes the ‘normative value’ and \(\hbox {C}_\mathrm{Ni}\) is taken as MAC. Finally, GIS based maps were developed for the pollution indices extracted from DOC, HEI and HPI of the data set using Arc GIS (ver.10.2).

Statistical Analyses

The analytical data were subjected to statistical analyses using Statistica software ver.8 (StatSoft 1999). Cluster analysis was applied to identify groups of samples with similar heavy metal contents. CA was formulated according to the Ward-algorithmic method, and the rescaled linkage distance was employed for measuring the distance between clusters of similar metal contents. Q-mode CA was used to determine the association of different groundwater quality parameters and pollutant sources. Pearson’s correlation matrix was also used to identify the elements’ relationship.

Results and Discussion

Metal Distributions

The descriptive statistical values of physical parameters and metal concentrations are presented in Table 1. The analytical results indicate that pH of groundwater samples varied between 6.19 and 7.94, which is a little acdic to alkaline. EC varied from 227 to 21,400 \(\upmu \hbox {S/cm}\) due to the influences of seawater and sewage leaked from sewers. The level of DO (3.9–9.9 mg/L) in groundwater samples is more or less low, and it could infer the presence of pollutants that consume the oxygen in water (Akinbile and Yusoff 2011). The mean concentrations of metals in groundwater are in the descending order as \(\hbox {Zn}> \hbox {Mn} > \hbox {Cu} > \hbox {Fe} > \hbox {Cd}\). The relationship between metal loads (Fe + Mn + Cu + Zn + Cd) and pH for the groundwater samples is shown in Fig. 3, and all samples were plotted in the field of near-neutral and low metal sector (Ficklin et al. 1992; Caboi et al. 1999).

Fig. 3
figure 3

Classification of groundwater samples based on the plot of metal load and pH

Table 1 Descriptive statistical data of physicochemical parameters (all the values in mg/L except pH and EC \((\upmu \hbox {S/cm})\)

Pollution Indices

Evaluations of water quality pollution were carried out using heavy metals of groundwater samples (Edet and Offiong 2002). The study area was classified into three categories according to HEI values, that is, low (\(<\)10), medium (10–20), and high (\(>\)20) categories. HEI values ranged from 0.34 to 10.63. According to HEI distribution, 97  % of the samples fall in a low pollution zone, and only 3 % of the groundwater samples are included in medium and high pollution zones. The medium and high pollution zones are located at the south and east parts (Fig. 4a). HPI values of all groundwater samples were calculated using MAC (maximum acceptable concentration), (Siegel 2002). The values are useful to assess the groundwater quality in each sample point. The study area was classified into three zones according to HPI values, that is, low (\(<\)90), medium (90–180), and high (\(>\)180) categories. HPI value ranged between -3 and 135. According to HPI distribution (Fig. 4b), 57 and 42  % of the samples fall in low and medium pollution zones, respectively, and only 1 % of samples are above the critical limit of 100 proposed by Prasad and Bose (2001). The sampling stations (1–4, 7–10, 12, 13, 18, 20, 23, 28, 31–34, 36) in a medium category are considered as polluted groundwater. The above mentioned sampling points are located near the sea and the subway. They were contaminated by seawater intrusion, salinized river water, and sewage leaked from sewers. The study area was classified into three zones according to DOC values, that is, low (\(<\)1), medium (1–3), high (\(>\)3) categories. According to DOC distribution (Fig. 4c), 85 % of the samples fall in a medium zone, 10 % in a low zone, and 5 % in a high pollution zone. The sampling stations (3, 9, 18, 24, 25) in high pollution zones are located near residential area, the subway or the Suyeong River effluents. Groundwater in the zones were affected by salinized river water and leaked sewage.

Fig. 4
figure 4

Distribution of pollution indices of groundwater samples (a) HEI (b) HPI (c) DOC

Correlation and Cluster Analyses

Pearson’s correlation coefficient matrices for the analyzed parameters are presented in Table 2. The statistically significant level is \(p \quad <\) 0.05. pH correlated with Fe (r = 0.36) and Mn (r = 0.34). The solubility of Fe and Mn minerals was strongly redox controlled, particularly at near neutral pH (Prasanna et al. 2012). EC also shows significant correlated with Mn (r = 0.65). DO correlated with Fe and Zn. Metal pairs Fe–Zn, Mn–Cu, and Cu–Zn show good correlations with respective correlation coefficient (r) values of 0.32, 0.34, and 0.32, respectively, due to the interaction between rock and groundwater.

Table 2 Correlation coefficients for parameters of groundwater samples

Cluster analysis (CA) was also performed to understand the pH, EC, DO and metals groupings in the data set and the results are presented in Fig. 5. The Q-mode CA performed on the samples produced four clusters. Cluster 1 consists of 13 stations (1, 19, 38, 15, 4, 11, 3, 9, 7, 8, 2, 6, 5). It reflects the influence of sewage leaked from sewers. Cluster 2 includes of stations of 12, 39, 10, 13, 24, 26, 25, 36, 29, 17, 35, 20, 32, 18, 14, 16, 37, 21, 22, 34. This is due to the accumulation of metals from bedrocks in groundwater. Cluster 3 contains stations of 27 and 33, and it shows the influence of groundwater discharged from the subway. Cluster 4 includes stations of 28, 40, 30, 23, 31. It indicates contamination by seawater intrusion.

Fig. 5
figure 5

Dendrogram analyses of the groundwater samples

Conclusion

Groundwater quality in the study area was influenced by various kinds of contamination sources such as seawater intrusion, sewage leaked sewers, salinized river water, and groundwater discharged from the subway. By metal load and pH, groundwater quality was characterized in the near-neutral and low metal sector. The calculated values of HPI showed that 43 % of samples were above the critical index value (100), and 57 % of samples were within the low HPI zone. Based on the HEI, 3 % of samples fell in medium and high pollution zones, and 97 % of samples existed in a low pollution zone. DOC suggested that 95 % of the groundwater samples fell in low and medium zones, and 5 % of the samples located in highly polluted zones. Correlation and cluster analyses clearly elucidated that seawater intrusion, sewage leaked from sewers and salinized river acted as the definite donors of this metals. This present case study underscored the importance of an integrated approach of pollution indices and statistical analyses for groundwater quality.