1 Introduction

Groundwater quality appraisal using water quality indices (WQIs) is essential for managing water quality [1,2,3,4,5,6]. The use of WQIs for appraisal of groundwater aptness for drinking and industrial uses in developing countries is required. There is an unprecedented increase in anthropogenic activities that are harmful to water quality. These include urbanization, industrialization, and irrigation farming. The WQIs were established for a rating of sources of water supply in a user-friendly format and easily understandable design. It allows for defining the acceptable limits for water usage. By design, WQIs reduce information on hydrochemical data and provide a summary of hydrochemical data that failed to comply with a certain index. Thus, water quality indices are typically beneficial for comparative analysis and overall inquiries relating to water quality and population vulnerability [7]. Concerns about different hydrochemical characteristics consequent of variation in geology and geography are possible and require the application of WQIs [8]. Water quality comprises the esthetic, radiological, biological, chemical, and physical qualities of the water [9, 10]. The evaluation of groundwater quality is beneficial owing to increasing demand and degradation of groundwater aquifers in emerging economies and pollutant generation from multiple sources such as industry, agriculture, and urban discharges. The water quality of aquifers can be weighed using individual water quality parameters. It is currently declining since it involves computations of a lot of concentrations of hydrochemical parameters. Many countries prefer the application of a WQI. This enables the appraisal of water quality status. It is easy to comprehend as a single rated parameter [10].

Individual countries and agencies have incorporated various parameters of water quality to create local WQI. Most of the indices created are built on the American National Sanitation Foundation index [10]. Hydrochemical analysis of groundwater using WQI is widely conducted in many parts of the world. The evaluation of nitrate pollution in groundwater and its impending health hazards indicate that 61% of groundwater sources have NO3 concentrations exceeding WHO reference guidelines in the Nagpur region of southern India [11]. Sources of drinking water were slightly alkaline. The values of WQI vary between 92 and 295. Also, 86% of groundwater sources fall in poor water quality class [11]. Weathering of bedrock and anthropogenic activities have rendered shallow aquifers near Kaduna Refinery unsuitable for drinking, due to the migration of effluents from petrochemicals and landfills [12]. Leaching of HCO3 was the major mechanism aiding arsenic mobilization in HCO3 assertive aquifers in Bangladesh [13]. Hydrochemical analysis of shallow groundwater in Kakuri and its Environs by Anudu, Obrike [14] discovered water of good quality for drinking.

Seasonal assessment of WQI from the River Kolong, India, by Bora and Goswami [15], revealed more deterioration of water quality during monsoon. Mean WQI varied between 85.73 and 80.75 in pre- and post-monsoon, respectively. Significant temporal variability with high concentrations of NO3, SO4, and Ca was reported from the Pocheon spa region of South Korea [16]. It suggested the impact of surface processes. The evaluation of groundwater quality during post-monsoon indicates that 90% of water sources are appropriate for drinking. In contrast, the percentage declined to 60% during monsoon [17]. Water quality appraisal from Kaduna, Kafancan, and Zaria indicate no radium and thorium could be trace from the study area. However, Cd, Ni, COD, and pH concentrations were above the WHO reference guidelines [18]. Groundwater quality assessment from some villages in northern India indicates the unsuitability of groundwater due to high levels of hardness, F, Ca, and Mg [19]. The content of calcium carbonate renders aquifers unsuitable for industrial use since CaCO3 precipitates easily. In Kanavi Halla basin India, two-third of groundwater sources fall in poor to very poor class based on WQI [20]. The heavy metal pollution index (HPI) revealed severe pollution from a gold mine in east Cameroon [21]. A good water based on WQI was revealed from Karacaoren Dam. Poor and very poor water quality occurred in the northern and southern portions of the basin. Water quality in the study area was impacted by the diffusion of in situ pollutants in the Aksu River Basin SW Turkey [22]. Groundwater contaminants were derived primarily from anthropogenic activities in the Lower Yangtze Delta in China. Groundwater suitability for industrial and domestic uses can be guaranteed after the removal of high ions and toxic metals [23]. The water of better quality occurred during post-monsoon compared to pre-monsoon as a result of groundwater recharge in the Bay of Bengal, India [24].

Despite the tremendous research works on groundwater quality in Kaduna Basin [25,26,27,28,29,30], these studies are characterized by reports on individual physicochemical elements of water quality, instead of the application of WQI for the appraisal of the quality status of aquifers. Studies around the world are increasingly employing water quality indices [10, 11, 15, 20, 22, 31,32,33,34,35,36], for the evaluation of groundwater suitability for drinking. Appraisal of the quality status of groundwater using WQI helps to classify groundwater sources and became an essential topic in third world nations; hence, continuous monitoring of sources of drinking water is important. This is due to anthropological activities (mainly effluents from industry, agriculture, and municipal sources) that are harmful to groundwater quality. This study seeks to evaluate the hydrochemistry of groundwater using WQIs in the Kaduna Basin.

2 The study area

2.1 2.1. Location and climate

Kaduna Basin is situated between latitude 9°30N–11° 45N and longitude 7°03E–8°30E (Fig. 1). It covers a total area of 21,065 sq km. It is drained by Rivers Galma, Kubanni, and Tubo which formed the major tributaries to River Kaduna [37]. Kaduna Basin rests on the ‘High Plains’ of northern Nigeria reaching up to 670 m above sea level at some locations. The basin is in Guinea Savannah Zone, with both wet and dry seasons. Rainfall season prevails from May to October. The average annual precipitation is above 2000 mm. During the dryer years, it can be as low as 300 mm (Fig. 2a). The difference between the wettest and driest month in terms of precipitation is 279 mm. The long-term average is 1000 mm [37]. The annual variation of temperature is 3.8 °C (Fig. 2b). The dry spells last from November to April. It is characterized by low temperatures during Harmattan (December–February). Very dry and hot weather is prevalent from March to April. Mean diurnal temperature can attain 27 °C. The relative humidity is high throughout the rainy season. It decreases during the dry spells.

Fig. 1
figure 1

Map of the Kaduna Basin. After Okafor and Ogbu [38]

Fig. 2
figure 2

a Temperature and b rainfall (Climate-Data.Org: https://en.climate-data.org/africa/nigeria/taraba/kaduna-lissam-385152/. Retrieved on 12/02/2020)

2.2 Geology and hydrogeology

Kaduna Basin consists of crystalline basement complex (CBC) rocks. These are mainly of granitic gneisses, migmatites, and biotite types. Rebooted metasediments formed the magnetite gneiss complex. It is characterized by various structures and textures. Batholiths and younger granites are noticeable southwestwards. Severe chemical weathering and fluvial erosion shaped by the environmental bioclimatic scenery of the region have formed a conspicuous high-pitched undulating flatland as well as soothing interfluves. Weathered or metamorphic rocks are dominant. The normal rock type is a gneiss–migmatite complex that outcropped along the Kaduna–Zaria axis [37]. Metasedimentary rock series consisting of interchangeable rocks including gneiss, pegmatites, schist, and quartz are noticeable (Fig. 3), mainly comprised of decayed sedimentary and metavolcanics rocks. Marvelous boulders of well-outlined migmatites in the northeast–southwest bloc and west of the Kaduna area are noticeable. The ubiquitous batholiths in the southern parts of Kaduna are characterized by exposed plutonic rocks [37]. These batholiths mainly comprise of granodiorites, charnockites, granites, monthonites, diorites, and leucocratic-porphyritic granites. Some sections are coated by laterites that are sporadically fused especially the battered exteriors into lateritic knobs pied with silty and sandy clays [37].

Fig. 3
figure 3

Theorized geological cross section of the groundwater flow in the Kaduna Basin

Over 80% of the study area is enclosed by the CBC. The fresh alluvium of the River Niger and Nupe Sandstone make up 20% [39]. The hydrogeological specifications of the basin are characteristic of Nigeria’s Basement Complex areas [40,41,42]. Previous assessment of hydrogeological conditions in the basin showed that at least 30% of boreholes were not productive. Borehole yields varied between 0.2 and 1lit/sec. Although a 30% borehole failure was documented, even the productive wells were not promising, thus illustrating the gratuitous hydrogeological condition of Nigeria’s Basement Complex [39]. Ten percent of the Kaduna Basin is covered by sedimentary formations of the Nupe Sandstone. The ‘Newer basalt’ is evident throughout Manchok and Kafanchan areas. It adjoined the western peripheries of the north–central plateau. The basalt was formed when the plateau attained its current topography. It is marginally influenced by erosion, consequently overlaying alluvial sediments [43]. The areas of severe erosion and sandy riverbeds also rise. These are squeezed in between specific basalt deluges.

The prospect of the fluvio-volcanic aquifer shows a great water yield (370–500 m3/day). At Tum Village (Borehole No. GWR/21/1) in the ‘Newer Basalt’ a great quantity of water was recorded (12.6 m3/h). An extremely productive spring appears in the ‘Newer Basalt’ producing about 11,000 m3/day at Manchok, throughout the dry season. It established the headwater of an offshoot of the Kaduna River [43]. Potential sources of pollutants like landfills ought to be located far away from probable recharge regions, consequent to the superficial depth of shallow groundwater [25]. Seasonal assessment of spring water, shallow and deep aquifers by Obada and Olaniyan [43], discovered that the superficial aquifers were polluted from anthropologic activities. Sources of pollutants include inappropriate waste disposal, seepage by septic reservoirs, and urban effluents. Although the hydrochemistry and the hydrogeology of the Kaduna Basin are detailed in the literature [12, 18, 44,45,46,47], there is a need for further analysis of groundwater quality based on WQI.

3 Materials and methods

3.1 Review procedure and data sources

Hydrochemical data were derived from the literature following the procedure summarized in Fig. 4 and Table 1. A total of 1754 potentially important articles were identified, from Google Scholar, reducing to 3 based on extractable data on pH, temperature, TDS, EC, Na, Fe, K, Cl, NO3, HCO3, and SO4. This was based on the defined criteria summarized in Fig. 2. The search was limited to studies published from the year 2000 (Table 2). Data on physicochemical parameters were adopted from 52 sites [14, 26, 43]. The electrical conductivity (EC) values were not measured by Obada and Olaniyan [43]. So, the EC values were calculated using the LENNTECH converter for EC values (https://www.lenntech.com/calculators/conductivity/tdsengels.htm).

Fig. 4
figure 4

Review methodology adopted for this study

Table 1 Search terms used in exploring the database and the related parameters
Table 2 Sources of data employed for review

3.2 Internal consistency test

The internal consistency of data was tested using the chemical balance error (CBE) equation [48]. The CBE is defined thus:

$${\text{CBE}} = \frac{{\sum {\text{Cations}} - \sum {\text{Anions}}}}{{\sum {\text{Cations}} + \sum {\text{Anions}}}}{ } \times 100{ }$$
(1)

where the concentration of individual elements is stated in meq/l. The sums of cations (6893.09) and anions (7829.92) were substituted, Eq. 2 and 3:

$${\text{CBE}} = \frac{{\sum {\text{cations}}\left( {6893.09} \right) - \sum {\text{Anions}}\left( {7829.92} \right)}}{{\sum {\text{Cations}}\left( {6893.09} \right) + \sum {\text{Anions}}\left( {7829.92} \right)}}{\text{ x }}100{ }$$
(2)
$${\text{CBE}} = \frac{6893.09 - 7829.92}{{6893.09 + 7829.92}} = \frac{ - 936.83}{{14723.00}} = - 0.064 \times { }100 = \pm - 6.40$$
(3)

where the cationic and anionic concentration is fully measured, and the variance should not be more than 5%. However, there are slight but more negative values greater than 5% (\(\pm - 6.40\)), as a result of the lack of NO3 measurements from Kudenda–Nassarawa area. Despite the slight variance, the hydrochemical data were employed for further analysis due to the irregularity of sampling periods and locations, which may interfere with the results.

3.3 Computation of WQI

The WQI is a remarkable technique that offers a broader outlining of water quality conditions [10, 20, 21, 49, 50]. It represents the magnitude that reflects the collective consequence of various physicochemical parameters. It is computed by assigning discrete weights (wi) over a scale of 1 which represents the lowest effect. The greatest impact on water quality is presented by 5. It is built on their expected impacts on human health. Elements having serious health consequences and whose concentrations exceeding the essential limits can hamper the usability of groundwater are ranked high. High intensities of elements such as NO3, Cl, and TDS were given a high-ranking weight of 5 (Table 3). Elevated NO3 concentration in drinking water is related to the blue baby syndrome. High chloride can affect the palatable taste and is injurious to plants. Variation in TDS can be an indicator of anthropogenic inputs. The intermediate elements such as Ca, Mg, SO4, sodium, bicarbonate, and potassium were assigned weights varying from 2 to 4. Those that have negligible impacts like potassium were assigned the lowest weight of 1 (Table 3). The relative weight of physicochemical parameters, as summarized in Table 3, is estimated using Eq. 4:

$$W_{i} = \frac{{w_{i} }}{{\mathop \sum \nolimits_{i = 1}^{n} w_{i} }}$$
(4)

where \({\varvec{W}}_{1}\) represents the relative weight, \(w_{i}\) is the weight of individual elements and n is the sum of the analyzed elements. The estimated \(W_{i}\) values of respective elements are summarized in Table 3.

Table 3 Relative weight of chemical parameters

The quality ranking (qi) for discrete elements is given by dividing its absorption value(s) by its standard value specified by the WHO (2011). The outcome is transformed into a fraction (mainly percentage) through multiplying by 100. This is computed using Eq. 5:

$$q_{i} = \left( {\frac{{C_{i} }}{{S_{i} }}} \right) \times 100$$
(5)

where the quality rating is \(q_{i}\), Ci is the intensity of different elements in groundwater samples (mg/l) and the reference value for the individual parameter (mg/l) is \(S_{i}\) built on the WHO (2011) reference value. It represents SIi value determined initially using Eq. 5, before computation of water quality index:

$${\text{SI}}_{i} = W_{i} \times q_{i}$$
(6)
$${\text{WQI}} = \mathop \sum \limits_{i = 1}^{n} {\text{SI}}$$
(7)

where the subindex is \({\text{SI}}_{i}\) of the ith element and the quality rating built on the intensities of ith elements is \(q_{i}\). The computed values of WQI are categorized into five classes: WQI =  ≤ 50 (excellent), WQI = 50–100 (good), WQ1 = 100–200 (poor), WQI = 200–300 (very poor), and WQI =  > 300 (unsuitable) [11, 15, 20, 31,32,33, 35, 51]. However, groundwater sources that are suitable for drinking may not necessarily be suitable for industrial use(s).

3.4 Suitability for industrial use

Water quality requirements vary with types of industry. Industries, such as thermal plants, boiler feed water, and beverages, require a substantial quantity of water. Good water quality capable of averting scale formation and corrosion is required by certain industries. Beverages, dairy, and brewing industries require guidelines for drinking water quality. In heavy industries, scale formation constitutes a major problem. Consequently, Langelier (1936) proposed a saturation index to determine the extent of scale formation by water. The Langelier Saturation Index (LSI) predicts the stability of CaCO3 in water and shows whether water can dissolve, precipitate, or will be stable with CaCO3 [10, 52]. The LSI is computed (Eq. 8), as the variance between calcium bicarbonate-saturated pH (pHs) and the actual pH (Eq. 9).

$${\text{LSI}} = {\text{pH }}\left( {{\text{measured}}} \right) - {\text{pHs }}$$
(8)

where

$${\text{pHs}} = A + B - C - D{ }$$
(9)

The pHs, A, B, C, and D are computed following the Langelier technique. The LSI tends to be negative if the pH level of water is below the saturation pH. Thus, water is expected to have low scale formation potential. The LSI tends to be positive if the initial pH of water is above the pHs, indicating CaCO3-supersaturated water that has a high potential for scale formation. The greater the LSI values, the more its potential for scale formation [10, 52]. Also, a different index for quantifying CaCO3 scale formation was later developed by Ryznar [53]. The Ryznar Stability Index (RSI) helps to avoid confusion of positive saturation indices that are typically classified as non-scale forming or non-corrosive. The RSI is calculated:

$${\text{RSI}} = 2{\text{pH}}_{{\text{s}}} - {\text{pH }}$$
(10)

where pH represents the estimated pH value of water and the pHs represents the pH at saturation point, computed using Langelier’s technique. The RSI for all water types is constantly positive. The character of both treated and natural water having RSI values of 5.5 or below can have high potentials for scale formation. Water having RSI values of 9.5 tend to have a low scale formation potential, though might have acute corrosivity under high temperature [10, 52,53,54]. The LSI was calculated using the LSI Calculator (https://www.lenntech.com/calculators/langelier/inex/langelier.htm).

3.5 Statistical analysis

3.5.1 Factor analysis

Factor analysis (FA) is applied to classify data for an easy explanation [55, 56]. It is used to obtained proper evidence concerning the link between hydrogeochemical parameters and sampling spots [57]. The central objective of FA in the hydrogeochemical analysis is the classification of the hydrochemical data [48]. It involved two steps: (a) data standardization and (b) removal of factors [56, 58]. Even though some related hydrogeochemical data are lost during the analysis, the depiction of the method is significantly reduced [56]. It is used to obtain proper evidence concerning the link between hydrogeochemical parameters and sampling spots [57]. The FA bilinear model is typically reorganized using a matrix decomposition equation:

$$X = TP^{T} + E$$
(11)

where the matrix of data is represented by X which is reduced into T (matrix score) and PT (loadings matrix), plus (residual matrix(E)).

3.5.2 Hierarchical clustering analysis

Hierarchical clustering analysis (HCA) is used to categorize groundwater sources based on a raw hydrochemical data matrix in individual clusters devoid of any prior supposition. This study used the Ward’s algorithmic clustering process subsequent to the Euclidean distance method. It is deemed a potent clustering procedure [58, 113]. Before the clustering, the hydrogeochemical data xji were harmonized by Z-scale translation as:

$$Z = \frac{{X_{ji} - \dot{x}_{j} }}{{S_{j} }}$$
(12)

where xji = value of the jth hydrochemical factor calculated at the ith location, j = mean (spatial) value of the jth parameter and Sj = standard deviation (spatial) of the jth parameter.

This verification allowed for the formation of a dendrogram as a function of the sampling locations and hydrogeochemical parameters. So, employing the hydrogeochemical data into HCA is an incomparable approach. HCA simplifies data grouping assembled on similarities of the studied physicochemical parameters [59, 60]. The FA and HCA were performed on 13 subsets of hydrogeochemical parameters. The entire analyses were performed using PAST3 (version 3.14), SPSS (version 16) and Minitab (mbt 16) statistical software packages.

3.5.3 Regression analysis

Regression analysis enables an understanding of the relationship between hydrochemical elements by fitting a linear equation to the hydrochemical data. Values of the separate elements x are connected to a value of y (i.e., dependent variable). The regression model of hydrochemical data for p experimental elements x1, x2, x3,….xp is expressed:

$$\mu_{\gamma } = \beta_{o} + \beta_{1} + x_{1} + \beta_{2} x_{2} + \cdots + \beta_{p} x_{p}$$
(13)

How the mean reaction \(\mu_{\gamma }\) varies with experimental elements is defined by the regression model. The detected y values tend to vary with their means \(\mu_{\gamma }\) and are believed to have a similar standard deviation \(\sigma\). The fitted values \(b_{0} b_{1} , \ldots , b_{p}\) approximate the elements \(\beta_{0 } , \beta_{1} , \ldots , \beta_{p}\) of the regression model. Meanwhile, the detected y value(s) differ with their mean, and the regression model contains an expression for this difference. It is defined as Data = fit + residual. The term ‘fit’ is expressed:

$$\mu_{\gamma } = \beta_{o} + \beta_{1} + x_{1} + \beta_{2} x_{2} + \cdots + \beta_{p} x_{p}$$
(14)

The term ‘RESIDUAL’ is the deviation of the detected y value(s) from the means \(\mu_{\gamma }\). It is typically distributed with mean 0 and the variance \(\sigma\). The symbol of the model is \(\varepsilon\). The regression model for formally given n observations is:

$$y_{i} = \beta_{o} + \beta_{1} x_{i1} + \beta_{2} x_{i2} + \cdots + \beta_{p} x_{ip} + \varepsilon_{i} \quad {\text{for}}\; i = 1,2, \ldots n$$
(15)

The best-fit line and least-squares models for the hydrochemical data are computed by lessening the sum of squares of the perpendicular deviations from individual data point to the line. If a point lies exactly on the fitted line, then its perpendicular deviations = 0. Since the aberrations are initially squared, then calculated, there are no annulments between negative and positive values. The least-squares values \(b_{0} b_{1} , \ldots ,b_{p}\) are typically calculated using the statistical software package(s). The fitted values using the equation \(b_{0} b_{1} x_{i1} + \cdots + b_{p} x_{ip}\) expressed as \(\hat{y}_{i} ,\) and the residuals \(e_{i}\) are equal to \(y_{i} - \hat{y}_{i}\), the variance between the fitted and observed value(s). The summation of the residuals = 0. The variance \(\sigma_{2}\) maybe computed:

$$S^{2} = \frac{{\sum e_{i}^{2} }}{n - p - 1}$$
(16)

It is identified as MSE, i.e., mean squared error. The computed standard error is given as \(s = \sqrt {{\text{MSE}}}\). The statistical analysis was conducted using MINITAB (mbt 16) statistical package.

3.6 Groundwater evolution

The basis of dissolved ionic substances and the processes for the evolution of groundwater are revealed by the graphical procedure as an interface of TDS versus Cl/(Cl + HCO3) and Na/(Na + Ca) ratios [21, 61, 62]. Precipitation is the preliminary process since the chemical compositions of aquifers are affected by the volume of suspended salts/ions generated by precipitation [21, 61,62,63,64]. Naturally, the volume of precipitation is consistently high surpassing the low quantity of dissolved elements resulting from the rock mineral. Aquifers in this group occur naturally in hot and dry regions. The contrary end member to precipitation sequence consists of freshwaters obtaining their main source of suspended elements from the soils and rock mineral of their basins. This assembly specifies the successive mechanism which affects groundwater chemistry as rock dominance. The last mechanism is consisting of evaporation–fractional definition. This process produces an order of series extending from the calcium-rich water type resultant from rock weathering, to the sodium-rich high-salinity end member which is the opposing process. The mechanism shaping world water chemistry can be measured by plotting the TDS mass ratio against the [Na + K]/Na + K + Ca] and [Cl]/[Cl + HCO3] ratios for anions and cations [61, 64]. Further, the Chadha diagram was used to classify groundwater [65,66,67].

4 Results and discussion

Table 3 and Fig. 5 present a statistical summary of hydrochemical data. Groundwater rating built on WHO (2011) reference guidelines indicated that TDS, Ca, Mg, and K concentrations are above the defined reference values for drinking water in the Kudenda–Nassarawa area. Though TDS is not commonly viewed as an essential pollutant; it is used as a mark of palatable attributes of drinking water and as an overall indicator of the chemical contaminants. Essential sources of TDS in aquifers include overflow from irrigated fields and private (or urban) spillover, effluent-rich mountain waters, filtering of soil sullying, and point source water contamination release from modern or polluted water treatment facilities [58, 68,69,70,71,72,73]. The most recognized chemical constituents are Ca, PO4, NO3, Na, K, and Cl, which are found in overland flow, general stormwater spillover, and overflow from street deicing salts [74,75,76,77,78]. Vibrant and destructive components of TDS are pesticides and other contaminants emerging from overland flow [79,80,81]. Elevated concentrations of Ca and Mg in aquifers are normally associated with hardness [82]. However, significant positive correlations between hardness and sodium were reported from Texas aquifers [83]. Calcium and Mg are particularly derived from dolomite, gypsum, and limestone. Calcium and Mg occur in vast amounts in saltwater [84, 85].

Fig. 5
figure 5

Box and jitter plot of physicochemical elements of water quality in the Kaduna Basin. Note: All absorptions are in mg/l, except pH (unit), EC (µS/cm) and temperature (°C)

Magnesium is the primary cause of the hardness and scale-forming properties of water. It is imperative to understand that other factors including pH, temperature, supersaturation, and flow velocity can influence scale formation [86]. Groundwater sources that have low Ca and Mg are required in manufacturing, coloring, tanning, and electroplating [86,87,88]. Calcium functions as a stabilizer for pH, owing to its buffering properties. Calcium can react with water even at a room temperature based on the mechanism defined in Eq. 17.

$${\text{Ca}}\left( {\text{s}} \right) + { }2{\text{H}}_{2} {\text{O}}\left( {\text{g}} \right) \to {\text{Ca}}\left( {{\text{OH}}} \right)_{2} \left( {{\text{aq}}} \right) + {\text{H}}_{2} \left( {\text{g}} \right){ }$$
(17)

Consequently, dissolved calcium hydroxide in the form of hydrogen gas and soda is formed. Erosion reaction is another critical reaction mechanism. It is triggered by the presence of CO2, resulting in the formation of carbonic acid. This can alter Ca compounds. Carbon weathering reaction mechanism is:

$${\text{H}}_{2} {\text{O}} + {\text{CO}}_{2} \to {\text{ H}}_{2} {\text{CO}}_{3} \;{\text{and}}\;{\text{ CaCO}}_{3} + {\text{H}}_{2} {\text{CO}}_{3} \to {\text{CaH}}({\text{CO}}_{3} )_{2}$$
(18)

Calcium and Mg are the major cause of scale formation in pipes, water radiators, and boilers and to the horrible curd within the sight of cleanser. The total reaction mechanism is:

$${\text{CaCO}}_{3} \left( {\text{s}} \right) + {\text{CO}}_{2} \left( {\text{g}} \right) + 2{\text{H}}_{2} {\text{O}}\left( {\text{I}} \right) \to {\text{Ca}}^{2 + } \left( {{\text{aq}}} \right) + 2{\text{HCO}}_{3}^{ - } \left( {{\text{aq}}} \right)$$
(19)

Sulfate concentrations are below the WHO guidelines as indicated in Table 4. Sulfide mineral oxidation is the primary source of SO4 in groundwater. Equally, sulfate can originate from calcium sulfate, sodium sulfate, and magnesium sulfate minerals [89, 90]. Conversely, sulfate ions originate from industrial effluents and shales [89, 90]. Sulfate is found in all types of aquifers because SO4 is one of the major dissolved components of precipitation. High-level SO4 in aquifers tends to be correlated with the emetic effect, chiefly if combined with Mg or Na [91, 92].

Table 4 Statistical summary of hydrochemical data derived from the literature

4.1 Groundwater evolution

4.1.1 Chadha diagram

Groundwater classification based on the Chadha diagram (Fig. 6) showed that 11.54% (n = 6) fall in field 4 representing Na–HCO3 water type. Groundwater samples in field 4 are derived from the Kudenda–Nassarawa area. Na–Cl water type occurred at one location representing 1.92%. The location is in Kaduna South. The remaining locations (n = 45; 86.53%) fall in field 2, indicating a Ca–Mg–Cl water type. Water samples in this field are comprised of the locations from Kakuri and its Environs (n = 15) and Kaduna South (n = 30). Results concur with the previous classifications of groundwater in the Kaduna Basin [12, 43, 93] and elsewhere in Nigeria’s Basement Complex areas [94,95,96,97].

Fig. 6
figure 6

Chadha plot showing groundwater classification in Kaduna Basin

4.1.2 Gibbs diagram

The processes influencing the hydrogeochemistry of aquifers can also be revealed by Gibbs plot. It helps explains visibly, the pattern of groundwater sources producing a rebound-shaped cloud on a graph. Groundwater is not in a state of equilibrium with aquifer rock mineral, consequent of their mingling with recharged waters with distinct transport periods and often having higher Ca and HCO3 concentrations when compared with Cl and Na [64]. The hydrochemistry of aquifers in the study area is controlled by rock weathering, as depicted in Fig. 7. This symbolizes the inverse end member to precipitation sequence. It consisted of freshwaters drawing their main source of liquefied components from the rock minerals and soils from their basins. The compendium is classified as a rock weathering process as portrayed in Fig. 7. Results are concurrent with multivariate analysis.

Fig. 7
figure 7

A plot of ratio weight of TDS vs. Na + K/(Na + K + Ca) and Cl/(Cl + HCO3) for anions and cations. KN Kudenda–Nassarawa, KE Kakuri and Environs, KS Kaduna South

4.2 Suitability for drinking

Table 5 presents groundwater classification based on nitrate (NO3), chloride (Cl), total hardness (TH), electrical conductivity (EC), and total dissolved solids (TDS). Kudenda–Nassarawa area had TH above 300 mg/l, indicating very hard water. The TH values were less than 75 mg/l in Kakuri and its Environs, indicating soft water. Likewise, 93.55% of sampling locations in Kaduna South had TH values less than 75 mg/l (soft water), 3.23% (moderately hard water), and 3.23% (hard water). Hardness is influenced by several factors including soil/rock composition, the evolution of groundwater chemistry, and outflow from adjoining aquifers. Possible human controls on TH comprised of flow from irrigation return and intrusion of saltwater induced by pumping [83].

Table 5 Water classification based on physicochemical parameters

The TDS level is generally low in the study area. TDS less than 500 mg/l denote excellent water. Kaduna South and Kakuri and its Environs had TDS less than 500 mg/l [6, 24, 62, 98, 99]. In the Kudenda–Nassarawa area, 33.33% fall in excellent class, 16.67% are suitable (500–1000 mg/l TDS) and 33.33% are acceptable (1000–3000 mg/l TDS). The significance of TDS is that high ingestion may be connected to the strength of the joints, gallstones, kidney stones, inurement, or blockage of veins [100]. Based on chloride, 100% of water sources in the Kudenda–Nassarawa area are in the brackish salt class. Similarly, 80% are brackish and 20% are brackish salt in Kakuri and its Environs. In Kaduna South, the composition of groundwater varied from very fresh to brackish salt (Table 5). Chloride (Cl) is important to humans as it plays a significant role in the structure of the cell. Littoral aquifers can have high Cl level consequent of seawater invasion [101, 102]. At concentrations above 250 mg/l, water taste can be influenced [62, 103].

Nitrate pollution is generally low. In Kakuri and its Environs, 80% of groundwater had NO3 concentrations less than 5 mg/l. Water in this category is acceptable for drinking. Twenty percent (20%) had NO3 concentration ranging from 5–30 mg/l. In Kaduna South, 93.55% of groundwater had NO3 level of less than 5 mg/l. The implication of high NO3 in drinking water is its harmful effects on children (blue baby syndrome). Nitrate in aquifers is derived from many sources such as soil organic matter, urban runoff, landfill, septic tanks, municipal sewage, and animal wastes [103]. Thus, it is expected that current contaminating activities will continuously impact the nitrate levels for many decades to come. If the groundwater abstraction is high, NO3 transport can be accelerated within the zone of saturation (Fig. 8). Therefore, a higher NO3 level in groundwater is a sign of previous anthropogenic pollution.

Fig. 8
figure 8

Conceptual model of NO3 occurrence in groundwater. After DVGW [104]

4.3 Water quality index

The water quality index (WQI) of Kaduna Basin (mean ± standard error) was 8.387 ± 1.088 in Kudenda–Nassarawa area, 2.062 ± 0.376 in Kakuri and its Environs, and 1.204 ± 0.109 in Kaduna South, respectively (Fig. 9). The computed WQI revealed that the overall WQI was 13.46 in the Kudenda–Nassarawa area, 7.64 in Kakuri and its Environs, and 10.26 in Kaduna South. The study area holds groundwater of excellent quality based on WQI (Table 6). Calcium, potassium, and TDS are the primary elements controlling water quality in the Kudenda–Nassarawa area. Iron is the major element influencing water quality in Kakuri and its Environs. In contrast, pH was the major element controlling water quality in Kaduna South. The pH values indicated an alkaline condition. High pH in aquifers means that the water can buffer acidic solution having elevated levels of hydrogen ions. The high dissolution of carbon-based minerals is the primary source of high pH in aquifers. Alkaline water tends to be hard. The major mineral compound triggering high pH (or alkalinity) in aquifers is calcium carbonate. It is chiefly derived from mineral rocks including limestone, dolomite, and calcite [105,106,107,108,109].

Fig. 9
figure 9

Box and jitter plot showing calculated water quality index

Table 6 Groundwater classification based on WQI

4.4 Groundwater classification based on LSI and RSI

The Langelier Saturation Index (LSI) (mean ± standard error) was − 2.71 ± 0.18 and ranged from − 5.30 to − 1.00 in Kaduna South, − 2.03 ± 0.27 and ranged from − 5.30 to 0.18, in the Kudenda–Nassarawa. The LSI was − 3.00 ± 0.39 and ranged from − 5.30 to 0.27 in Kakuri and its environs (Table 7; Fig. 10a). Table 7 presents the groundwater classification based on LSI. In Kaduna South, 74.19% of groundwater samples fall in very aggressive class, and 25.81% fall in moderately aggressive class. In the Kudenda–Nassarawa area, 16.67% are very aggressive, 66.67% are moderately aggressive and 16.67% are non-aggressive. Very aggressive water occurred in the Kakuri and its Environs. Non-aggressive water is ideal for industrial uses [10, 52].

Table 7 Groundwater classification based on Langelier Saturation Index
Fig. 10
figure 10

Box and jitter plot: a Langelier Saturation Index and b Ryznar Stability Index

Table 8 and Fig. 10b summarize the groundwater classification based on the Ryznar Stability Index (RSI). The RSI values were above 8.5 in Kaduna South and Kakuri and its Environs, indicative of extremely aggressive water. Similarly, 83.33% of RSI values in Kudenda–Nassarawa are between 6.2 and 8.5, indicative of aggressive water. However, 16.67% is acceptable for industrial use. Saturation index is essential for rating the extent of precipitation by water running through pipes or the extent of the dissolution of calcium carbonate. Groundwater in Kaduna Basin is unsuitable for industrial use.

Table 8 Groundwater classification based on the Ryznar Stability Index

4.5 Statistical application

4.5.1 Factor analysis

Factor analysis (FA) is a widely applied statistical method in hydrogeochemical analysis. It is used to classify hydrogeochemical data and relates it to the origin of ions in aquifers. Factor analysis was conducted on 13 subsets of parameters (Table 9). Factor 1 had high positive correlations on pH, EC, TDS, temperature HCO3, SO4, and K in Kudenda–Nassarawa (KN), HCO3, NO3, Na, Ca, and K in Kakuri and its Environs (KE), TDS, EC, and Cl in Kaduna South (KS). Factor 1 correlated with the rock weathering process. However, a high positive correlation on NO3 is suggestive of anthropogenic contributions because NO3 is continuously added from the N-rich fertilizers, oxidation of ammonia, and macrobiotic waste. Chloride is gradually increased in the environment by human activities [110,111,112]. Factor 2 had positive correlations on Na, Mg Fe, and Ca in KN; EC, TDS, and SO4, in KE; and HCO3, Mg, and Ca in KS. Factor 2 is exclusively correlated with rock weathering.

Table 9 Rotated (varimax) factor scores of hydrochemical data

Factor 3 had high positive loading on Cl and Mg in KN; Mg in KE, SO4, and K in KS. Factor 3 can be correlated with rock weathering. Insignificant correlation on Ca and Mg in Factor 1 from KN is pleasing since pH regularly attains an opposite relationship with ions of carbonate source [62, 113]. Figure 11 demonstrates the biplot of the extracted factors. The three factors accounted for 83.539% of the total variance in KN, 86.653% in KE, and 59.866% in KS, respectively. These factors have the highest eigenvalues.

Fig. 11
figure 11

Factor analysis (biplot) showing the variability of hydrogeochemical elements a Kaduna South, b Kudenda–Nassarawa, and c Kakuri and its Environs

4.5.2 Hierarchical clustering analysis

The use of hierarchical clustering analysis (HCA) in hydrogeochemical analysis aids the classification of aquifers by separating hydrogeochemical data that acts contrarywise. Groundwater sources with comparable hydrogeochemical characteristics tend to produce a discrete cluster. The graphical image of the grouping procedure is given as a dendrogram (Fig. 12). Cluster 1 is consisting of wells in the Kudenda–Nassarawa area and is having analogous concentrations of temperature and pH. So, it can be correlated with physical/external control (i.e., extreme temperatures), with its consequential effect on chemical reactions within aquifers. Inconsistency of temperature (5–10 °C) in groundwater influences TDS concentration, which eventually disturbs complexation, ion exchange, the solubility of gasses, redox reaction, sorption process, speciation process, and pH level [62, 114,115,116,117,118]. Cluster 2 comprises of wells in Kakuri and Environs and some wells (KS10, KS26, KS24, KS25 and KS01). It is characterized by analogous concentrations of SO4, Cl, Mg, Ca, K, Na, and TDS. Cluster 3 is comprised of wells in Kaduna South and are having parallel concentrations of NO3, HCO3, Fe, and EC. Cluster 2 and 3 can be correlated with the rock weathering. However, NO3 and SO4 concentrations in groundwater are increasing consequent of agriculture and household chemicals [62, 116,117,118,119,120].

Fig. 12
figure 12

Hydrogeochemical characteristics of groundwater based on hierarchical cluster analysis

4.5.3 Generalized regression model

A general regression model was generated using MINITAB (mbt 16) statistical software, to identify the major hydrochemical parameter(s) influencing the hydrochemistry of aquifers in the Kaduna Basin. Electrical conductivity (EC) was choosing to be a response variable, while the remaining 12 parameters were predictors. Although EC cannot be associated with any specific chemical parameter; it is an excellent indicator of the overall ionic concentrations of water. It informs the range into which ionic concentrations are likely to fall. Consequently, it enables a water quality analyst to take suitable decisions relating to water usage. A significant relationship between EC and certain ions is an indicator of major ionic effect on the hydrochemistry of aquifers. The regression equation is thus: EC = 123.553 − 11.2206 pH − 1.71577 Ca − 4.41477 Mg + 5.99857 Na + 0.929561 K − 0.178372 HCO3 + 0.903213 Cl − 0.16986 SO4 − 3.40888 Fe + 0.162469 TDS + 0.582821 NO3.

The model as summarized in Table 10 showed that Ca, Mg, Na, and TDS are the most important hydrochemical variables influencing EC levels. The residual plots are summarized in Fig. 13. It was used to verify the hypothesis that residuals are normally distributed. The normal probability plot of the residuals approximately follows a straight line (Fig. 13a). The overall P value is < 0.001. The observed relative normality in this analysis, confirmed the accuracy of the model used. The hydrochemical parameters with P value < 0.005 are considered significant. Calcium, Mg, Na, and TDS have P value < 0.001. These elements represented the most important parameters affecting the hydrochemical composition of aquifers in the study area. Further, the model output had clearly shown no significant anthropogenic inputs (P value > 0.005, NO3; Cl) in the study area.

Table 10 Summary of the general regression model
Fig. 13
figure 13

Summary of regression analysis

5 Conclusion

With increasing industrial activities and urbanization, more groundwater is harnessed. Kaduna Basin has a semiarid tropical climate. This coupled with changes in land use posed a threat to water quality. While human activities can alter the hydrochemistry of aquifers, understanding the natural processes influencing the hydrochemistry of aquifers is also important. Kaduna–Kano zone represents the most urbanized and industrialized section of northern Nigeria. Thus, the appraisal of subsurface water by rating its quality for domestic and industrial uses is important. The use of WQIs to classify groundwater has demonstrated the effectiveness of these techniques for rating the quality of groundwater aquifers. Results obtained from this review lead to the following remarks:

  1. 1.

    Mean concentration of TDS, Ca, Mg, and K is above WHO (2011) reference guidelines for drinking water in Kudenda–Nassarawa area;

  2. 2.

    Groundwater classification based on the Chadha diagram showed that the Kudenda–Nassarawa area had a Na–HCO3 water type. Na–Mg–Cl water type occurred in Kaduna South and Kakuri and its Environs;

  3. 3.

    Groundwater sources in the Kudenda–Nassarawa area had TH above 300 mg/l, indicating very hard water. The TH values are less than 75 mg/l in Kakuri and its environs, indicating soft water;

  4. 4.

    Likewise, 93.55% of groundwater in Kaduna South had TH values less than 75 mg/l, indicating soft water;

  5. 5.

    The TDS level was less than 500 mg/l in Kaduna South and Kakuri and its Environs, indicative of excellent water for drinking;

  6. 6.

    Based on chloride, 100% of water sources in the Kudenda–Nassarawa area fell in brackish salt class;

  7. 7.

    Similarly, 80% are brackish and 20% are brackish salt in Kakuri and its Environs. Kaduna South had very fresh to brackish saltwater type;

  8. 8.

    Nitrate pollution is generally low. In Kakuri and its Environs, 80% of water samples have NO3 concentrations less than 5 mg/l and 20% had NO3 levels ranging from 5–30 mg/l;

  9. 9.

    In Kaduna South, 93.55% of groundwater sources have NO3 level below 5 mg/l;

  10. 10.

    The computed WQI revealed that the overall WQI was 13.46 in the Kudenda–Nassarawa area, 7.64 in Kakuri and its Environs, and 10.26 in Kaduna South;

  11. 11.

    Based on LSI and RSI, 16.67% was very aggressive, 66.67% was moderately aggressive and 16.67% was non-aggressive in the Kudenda–Nassarawa area. Very aggressive water occurred under Kakuri and its Environs;

  12. 12.

    Groundwater is primarily controlled by rock weathering as indicated by multivariate analysis and Gibb’s model; and

  13. 13.

    Regression analysis revealed the hydrochemical parameters with P-value < 0.005 are Ca, Mg, Na, and NO3. These elements represented the most significant elements controlling the hydrochemistry of aquifers in the Kaduna Basin.

Based on these revelations, it can be inferred that groundwater in Kaduna Basin is unsuitable for industrial use. It is undersaturated with CaCO3. Undersaturated water can take off existing CaCO3 protecting shells within equipment and pipelines. Classification of water using WQIs and saturation indices presented a user-friendly tool for rating sources of water supply. It helps to figure out the suitability of water for domestic and industrial uses. We hope that this review will stimulate other researchers to a similar method in an upcoming investigation on water quality.