Index Terms

1 Introduction

Groundwater constitutes the largest readily available fresh water reserve on earth. It is the most vital natural resource, which forms the core of the ecological system. It has become the major source of water supply for drinking, domestic, household, agricultural, industrial, recreational, and environmental activities, etc. The usefulness of water for particular purpose is determined by its quality. Groundwater quality is mainly controlled by the range and type of human influence as well as geochemical, physical, and biological processes occurring in the ground.

Groundwater supply schemes consist typically of a large number of hand-pumped boreholes, providing untreated, unmonitored, and often unconnected supplies (Hiscock, 2009). A poor quality of water threatens human health and plant growth (Olajire & Imeokparia, 2001), thereby adversely affecting economic development and social prosperity (Milovanovic, 2007). It is therefore crucial to establish the current status of groundwater quality in the study area and thus its appropriateness for use in drinking and other purposes.

The determination of groundwater quality for human consumption is important for the well-being of the ever-increasing population. Groundwater quality depends, to some extent, on its chemical composition (Wadie & Abduljalil, 2010) which may be affected by natural and anthropogenic factors. Variation in groundwater quality in an area is a function of physical and chemical parameters that are greatly influenced by natural processes such as geological formations and anthropogenic activities.

Multivariate techniques such as factor analysis, principal component analysis, and cluster analysis are powerful techniques to identify the underlying processes that control groundwater chemistry, grouping of samples of similar composition and origin, and to predict the variables that differentiate the sampling stations temporally and spatially. These techniques have widely been used as unbiased methods in the analysis of groundwater quality data to characterize groundwater composition influenced by natural and anthropogenic factors (Lambrakis et al., 2004; Dragon, 2006).

2 Study Area

This study is carried out in Hadejia Local Government Area, and it lies in the northeastern corner of Jigawa State between latitude 12° 13′–13° 60′ N and longitude 90° 22′–11′ 00E (Orisakwe, 2013). The topography of the area is dominated by the river Hadejia which it lies on its Northern bank which drained into Lake Chad, which at some points is 1–2 m above the surrounding plains including the town. The climate of the area is semi-arid. It is characterized by a long dry season and a short wet season from June to September. The total annual rainfall ranges from 600 mm in the northern part to 762 mm in the southern areas (Orisakwe, 2013). The regional vegetation falls within the Sudan Savannah type.

3 Sample Collection and Analysis

Groundwater samples for the analysis were collected from 20 different sites monthly in February, March, and April, 2019 during the dry season. All the samples were collected and treated in accordance with the (American Society for Testing and Material, 2004; American public health association [APHA], 2005). The boreholes were allowed to flow for about 3 min before the water is collected. Temperature, pH, electrical conductivity, and total dissolved solids were measured in situ using Hanna pH 210 model microprocessor, EQ-660 digital conductivity meter, and Hatch 446,600 model TDS meter, respectively. T/H, Ca, and Mg were determined using titrimetric method (American Society for Testing and Material, 2004). NO3−, Cl−, Fe, Cu, Zn, Mn, Cr, and Pb were determined using CHEMetrics Vacu-Visual self-filling ampoules water testing kits with CHEMetrics Photometric Multi-Element Analyzer model V-2000. The samples were chemically analyzed at the Center for Energy, Research and Training, Ahmadu Bello Univeristy Zaria, Nigeria.

4 Results and Discussion

The values for physicochemical variables were presented in Table 1. Parameters values such as EC, pH, Temp., TDS, TH, Ca, Mg, Cl−, and NO3− were found to be below or within the permissible limit set by the (NSDWQ, 2007) in majority of the boreholes. The results similarly showed that concentrations of Mn, Cr, and Pb are slightly higher above the maximum permissible limit of Nigerian standard for drinking water quality (NSDWQ, 2007) and hence the need for the replacement of the affected wells (Table 2).

Table 1 Value of physicochemical parameter in groundwater of Hadejia
Table 2 PCA rotated component matrix with Kaiser normalization

The rotated factor loading, eigenvalues, percentages of variance, and cumulative percentages of variance associated with each factor for PCA are summarized in Table 3. The results show that five (5) principal components were extracted and rotated using the varimax normalization (Liu et al., 2003) which accounts for more than 78% of the total variance, which is quite good and can be relied upon to identify the main source of variation in the hydrochemistry. The factor loading was sorted according to the criteria of (Bartram & Balance, 1996) by which strong, moderate, and weak loadings corresponds to absolute loading values of >0.75, 0.75–0.50, and 0.50–0.30, respectively.

Table 3 Mean values of geochemistry data

In PCA, eigenvalues are normally used to determine the number of components (PCs) that are retained for further analysis. An eigenvalue gives a measure of the significance of the factor. The factors with highest eigenvalue are the most significant. Eigenvalues of 1.0 or greater are considered significant (Shrestha & Kazama, 2007)). Number of components is equal to the number of variables in PCA.

The first component loading accounted for 30.27% of the total variance showed higher loading for TH, TDS, NO3−, Cu, and Cl−, with significant contribution from Ca and Mn. It is reasonable to observe a strong positive loading of TH, TDS, NO3−, with Cl−, which may be probably due to leachate from domestic waste discharge in some part of the area. The higher Cl− and TDS is an indication that there is a possibility that the groundwater is polluted by sewage or waste from refuge dumping site (Amadi, 2010). This factor can be labeled as the hardness and anthropogenic.

Factor loading (VR2) accounted for 17.43% of total variance represents higher loadings for Fe and higher negative loading for Cr and Electrical conductivity, with significant contribution from Mg. These could be conceived to mainly originate from the ionic dissolution in the course of groundwater migration. This factor can be labeled as a natural process leading to heavy metal pollution. VR3 explains 13.16% of total variance and has a higher loading on Mn, Mg, Cl, and moderate negative loading on Ca with significance contribution from TDS and Cu. The major variables constituting VR3 (Ca, Mg, Cl, and Mn) are related to the hydrochemical variables originating from mineralization of groundwater.

5 Cluster Analysis

Cluster analysis is applied to detect the similarity among the different sampling sites. The clustering procedure generates either cluster or groups on the basis of similarity or dissimilarity of variables. Cluster analysis classifies a set of observations into two or more unknown groups based on combination of internal variables. Therefore, the cluster analysis is to discover a system of organized observations where a number of groups/variables share properties in common, and it is cognitively easier to predict mutual properties based on an overall group membership. This helps define source profiles of variables and their interpretation in terms of possible sources. The clustering procedure generates either cluster or groups on the basis of similarity or dissimilarity of variables. From Figs. 1 and 2, the cluster analysis resulted in the grouping of 20 monitoring boreholes 15 parameters into four (4) and two (2) statistically significant clusters, respectively, based on their similarities.

Fig. 1
figure 1

Dendrogram of water sampling sites

Fig. 2
figure 2

Dendrogram of hydrochemical parameters

Cluster 1: This includes nine (9) boreholes no. 10, 12, 16, 3, 1, 4, 18, 19, and 2(45%)

Cluster 2: It consists of three (3) boreholes which include no. 7, 17, and 20(15%)

Cluster 3: It consists of four (4) boreholes which include no. 11, 13, 14, and 15(20%)

Cluster 4: It consists of four (4) boreholes which include no. 5, 9, 6, and 8(20%).

Samples from cluster 1 are composed of nine (9) boreholes 1, 2, 3, 4, 10, 12, 16, 18, and 19, constituted 45% of the water samples, and are characterized by high concentration of Ca, and low concentration of Mn and EC in all clusters. Sample from cluster 2 is composed of three (3) boreholes 7, 17, and 20, constitutes 15% of the water samples, and is characterized by high concentration of Fe, Zn and Pb and low concentration of NO3− in all clusters. Samples from cluster 3 are composed of four (4) boreholes 11, 13, 14, and 15, constituted 20% of the water samples, and are characterized by high concentration of T/Hardness, TDS, Mg, Cl−, NO3−, Cu, Mn, and low concentration of Cr in all clusters. Samples from cluster 4 are composed of four (4) boreholes 5, 6, 8, and 9, constituted of 20% of the water samples, and are characterized by high concentration of EC and Cr and low concentration of T/Hardness, TDS, Ca, Mg, Cl−, Fe, Cu, Zn, and Pb in all clusters.

6 Conclusion

The quality evaluation of groundwater sources in Hadejia Local Government Area of Jigawa State revealed that the results showed that concentrations of Mn, Cr, and Pb are slightly higher above the maximum permissible limit of Nigerian standard for drinking water quality (NSDWQ, 2007). The values for physicochemical variables EC, pH, Temp., TDS, TH, Ca, Mg, Cl−, and NO3− were found to be below or within the permissible limit set by the (NSDWQ, 2007) in majority of the boreholes. PCA was used to investigate the origin of each water quality parameters and yielded five varimax factors/components with 78.69% total variance, indicating that the major variations are related to anthropogenic activities and natural processes. Cluster analysis results grouped the 20 sampling points and 15 parameters into four (4) and two (2) statistically significant cluster based on their similarities.