1 Introduction

As much as three-quarters (about 71%) of the earth surface is covered by water, only a smaller amount is marked as “freshwater” and a more significant percentage as “saline water”. Factually, about 68% of the freshwaters are trapped in ice caps and glaciers (Li & Qian, 2018). The only accessible ones are the underground and surface waters (lakes, streams and springs). However, the quality of these available freshwaters has been threatened seasonally by several factors. The increased rate of urbanization and industrialization has remained a present-day challenge in demand and search for a quality water supply because the quality of available water keeps deteriorating by the day (Li et al., 2017). In many parts of the world, surface and groundwaters are inarguably the significant water sources for drinking, domestic, agricultural and industrial purposes. However, in recent times, their quality and suitability for specific purposes have remained questionable due to contaminations from both natural and anthropogenic activities (Gorgij et al. 2017; Barzegar et al. 2019; Egbueri 2018, 2019a, b, c, 2020a, b; Egbueri and Unigwe 2019; Mgbenu and Egbueri 2019; Sale et al. 2019). With the massive population explosion, increased socioeconomic activities, and unplanned development in many parts of the world, the demand for freshwater supply has risen. However, there is currently a high scarcity of fresh water in many regions across the globe. The latest report by the World Health Organization stated that by the year 2025, half of the world’s population would be living in water-stressed areas (Singh et al., 2019).

Nitrates and heavy metals are common water pollutants that have drawn the attention of many water quality researchers across the globe (Adimalla & Li, 2019; Adimalla et al., 2019; Li et al., 2019; Ma et al., 2018). Although they can occur in the water environment through some natural processes, anthropogenic activities due to agriculture, industrialization, poor waste disposal/management, and urbanization have significantly influenced their accumulation in recent times (Barzegar et al., 2019; Egbueri et al., 2019; Egbueri, 2018, 2020a; Ezugwu et al., 2019; Ukah et al., 2020). Some heavy metals, in small quantities, are essentially important in the human body for growth and development (Chowdhury & Chandra, 1987). However, the pollution of water systems has posed a significant threat to human and environmental health. In such a scenario, water reserved for drinking is thereby made unfit for human consumption. Some heavy metals are carcinogenic (cancer-inducing) in nature; however, some tend to cause other severe, chronic illnesses (Egbueri, 2020a; Mgbenu & Egbueri, 2019). In any attempts to protect and sustain human health and the environment, it is important to encourage and implement continuous monitoring and assessment of water quality for drinking purposes. Monitoring and evaluating water quality enable us to understand better how geogenic and anthropogenic activities could adversely affect the water systems and equally give insights on how to avoid or control the effects.

Over time, several numerical models have been proposed and developed by different researchers based on local and international standard limits, such as those of the Nigerian Industrial Standard (NIS, 2007), Bureau of Indian Standards (BIS, 2012), and World Health Organization (WHO, 2017). Such numerical models have been developed and used for summarizing the quality of drinking water. Water Quality Index (WQI), Pollution Index of Groundwater (PIG), Synthetic Pollution Index (SPI), Overall Index of Pollution (OIP), Modified Water Quality Index (MWQI), Integrated Water Quality Index (IWQI) and Entropy-weighted Water Quality Index (EWQI) are some of the index used for drinking water quality assessments. Several parameters are usually analyzed for water quality monitoring and projects assessment. Formulating numerical models for water quality evaluations has become very important and useful for streamlining the wide-range parametric data into whole numbers. This ensures that water quality reports are easily understood by experts, policymakers, and the public. These models are developed to serve as tools for (1) simple interpretation of water quality; and (2) planning, management and mitigation of water pollution. Integration of two or more numerical models in water quality assessment has been found useful as it tends to minimize the subjectivity related to using a single model (Egbueri & Unigwe, 2019). In recent times, several water quality assessments have been conducted using either MWQI, IWQI or EWQI. The EWQI was first proposed by Li et al. (2010). Nevertheless, the MWQI was proposed by Shankar and Sreevidya (2019), whereas the IWQI was developed by Mukate et al. (2019). Previous research has proven that these models have greatly improved water quality monitoring and assessment projects. However, there is no known study that has simultaneously compared their performances in water quality research.

The present work was conducted in Umunya area, southeastern Nigeria. Previous studies (Egbunike, 2018; Mgbenu & Egbueri, 2019) on the water quality in this suburban district revealed that the drinking suitability of some of the available water resources utilized by the public (inhabitants) is very questionable. Therefore, with respect to nitrate and selected heavy metals (Fe, Zn, Mn, Pb, Cr, and Ni) concentrations, this study aims to assess the quality of the public water resources using a joint statistical and numerical modeling approach. The specific objectives are to (1) examine the suitability of the public water supplies for human consumption using MWQI, IWQI and EWQI; (2) identify essential pollutants, their parametric associations and possible sources using Hierarchical Cluster Analysis (HCA), and (3) highlight on the performance and sensitivity relationships between the three indexical models. To the best of the authors’ knowledge, this paper is the first to simultaneously use the MWQI, IWQI and EWQI in water quality assessment. It is also the first of its kind to compare the performances of these models. Although these models utilize different classification schemes and there may be some uncertainties associated with variances in their methods of calculation, the reasons behind the testing and analysis of their efficiencies (sensitivities) include: (1) There is a similarity in the presumptions and assumptions that founded their development. (2) The models are all used for drinking water quality assessment. (3) They are not region-specific. (4) The EWQI and IWQI do not require parameter weight assignment, which may introduce bias. (5) The same standard limits could be used in the calculation of all the models, as this would ensure fairness and uniform calculation conditions. (6) All analyzed water parameters could be considered in the quality evaluations, in order to further ensure that the same calculation conditions are maintained. And, (7) The limitations of these models are not well-known. Although the comparison of the sensitivities of these models in detecting water quality may not provide exact information, it can provide us with general insight regarding the agreements and trends between them. It is hoped that this study will provide insights to local and international water quality researchers regarding the water quality of the fast-growing suburb and the validation and sensitivity of the selected models’ performance in water quality assessment.

2 Background information of the study area

The area under study lies within latitudes 6° 10′ N to 6° 15′ N and longitudes 6° 54′ E to 7° 00′ E in Anambra State, southeastern Nigeria (Fig. 1). Umunya is situated at the heart of the two major cities (Awka and Onitsha). A polyvinyl chloride (PVC) industry, an abattoir servicing the surrounding communities, a National Youth Service Commission (NYSC) camp, and Tansian University are located in Umunya suburban district. Generally, these influence the population of this suburb. Moreover, despite increasing commercial and industrial activities in this area, small to large scale farming activities are a means of livelihood for many of its inhabitants. However, due to the unplanned population growth and increasing human activities, waste management programs in this area is still poorly executed. Wastes are indiscriminately disposed into roadway drainage channels, surface water bodies, and open dumpsites. Usually, such activities are known to release potentially toxic metals into water bodies. The study region experiences two distinct seasons, rainy and dry seasons. The rainy season lasts from April to October with an average annual rainfall of 1400–2500 mm, whereas the dry season starts from November and extends up to March.

Fig. 1
figure 1

Map showing the location, accessibility and geology of the study area

From the geologic point of view (Fig. 1), the study area is underlain by the Eocene Nanka Formation of the Ameki Group (Mgbenu & Egbueri, 2019; Nwajide, 1979). The lithology is composed of very friable, flaser-bedded units of fine-medium-grained sands, with intervals of light gray mudrocks and ironstones (Nwajide, 2006, 2013). The main aquifer in the area is the Nanka Formation (comprising over 60 m sandstone interval), while the underlying older Imo Formation, mostly composed of mudrocks, acts as the aquitard (Mgbenu & Egbueri, 2019). The Nanka Formation, characterized by high porosity and permeability (Egbueri & Igwe, 2021), forms the aquifer in this area (with over 60 m sandstone interval). Nfor et al. (2007) reported that a major ridge system in the area acts as a water divider. This ridge system creates groundwater flow patterns running southwards and eastwards away from the divide (Mgbenu & Egbueri, 2019; Nfor et al., 2007). Groundwater depth in the Nanka Formation has been reported to be at varying depths, while Nfor et al. (2007) reported a depth of ≥ 20 m, Okoro et al. (2010) reported a depth of ≥ 7 m. However, the study by Egbueri and Igwe (2021) reported the water table in some parts of the Nanka Formation to have a depth of ≥ 9.6 m. It is believed that several streams and river systems in and around the study area serve as recharge systems. However, there is limited literature on groundwater properties such as flow direction, which should be of significant focus for future research.

3 Materials and methods

3.1 Sampling and analysis

Both surface and groundwater samples were collected from water supplies within the study area. In total, fifteen samples were collected from available water sources in prewashed and sterilized 1 L polyethene containers and labeled accordingly from WS01–WS15 (Fig. 1). Furthermore, samples were preserved in an ice-crested cooler before laboratory analysis. The pH readings of the fifteen water samples were measured in the field. In the laboratory, the nitrate (NO3) was determined using the titrimetric method. In contrast, the metals such as iron (Fe), zinc (Zn), manganese (Mn), lead (Pb), chromium (Cr), and nickel (Ni) were analyzed using atomic absorption spectrophotometer (AAS). It is pertinent to state that all the performed analyses followed the American Public Health Association (APHA, 2005) recommendation. Figure 2 is a pictorial summary of the steps taken in this water quality assessment.

Fig. 2
figure 2

Quality rating of the eight parameters for the six water samples collected within the zone of high waste accumulation

3.2 Modified water quality index (MWQI)

Proposed and developed by Shankar and Sreevidya (2019), the MWQI considers an overall quality of water with respective background values. The IWQI was developed as a modification of the commonly used Canadian WQI (CWQI). This model allows its users to assign certain weight factors to input parameters. This is done by either considering the parameter with a higher threat to human health or done by considering parameters with higher concentration, depending on the selected area of study. Also, the MWQI considers parameters excursing benchmarks, measurements excursing benchmarks and the amount of excursion from standards in failed measures. According to Shankar and Sreevidya (2019), the MWQI explains “the complete picture of water quality in a simple and extremely reliable manner and hence shows enormous promise for its application and large suitability across the globe.” The MWQI evaluation involves three basic steps:

3.2.1 Selection of water quality parameters and corresponding benchmarks

This study identified eight water quality parameters with their sub-indices, as shown in Table 1. The WHO (2017) and the NIS (2007) standard limits were selected as the corresponding benchmarks.

Table 1 Standard values and weight factors of water quality parameters used in evaluating MWQI (Shankar & Sreevidya, 2019)

3.2.2 Appropriate weight factors for key input parameters

At this stage, significance were placed on the parameters with higher health threat to humans (Shankar & Sreevidya, 2019). Pb was given the highest weight factor of four (4) dues to its health threat and concentration in the study area. Next, Ni and Cr with a weight factor of three (3). Mn was apportioned two (2), then pH, Fe, Zn and NO3 were given one (1) (Table 1).

3.2.3 Calculation of the MWQI

This was calculated by considering three key factors, scope factor (FS), frequency factor (FF), and amplitude factor (FA) (Shankar & Sreevidya, 2019). The FS signifies the number of parameters that excurse benchmarks, FF signifies the number of measurements that excurse benchmarks, and the FA represents the amount of excursion from benchmarks in the failed measurements (Shankar & Sreevidya, 2019). The FS and FF were calculated using Eqs. (1 and 2), respectively.

$$F_{{\text{S}}} = \frac{{\mathop \sum \nolimits_{j = 1}^{n} W_{qj} }}{{\mathop \sum \nolimits_{i = 1}^{m} W_{qi} }},\quad i\, = \,1,\,2 \ldots \,m;\quad j\, = \,1,\,2 \ldots , \, n$$
(1)
$$F_{F} = \frac{{\mathop \sum \nolimits_{j = 1}^{n} \left( {W_{qj} \times W_{qi} } \right)}}{{\mathop \sum \nolimits_{i = 1}^{m} \left( {W_{qj} \times N_{qj} } \right)}}$$
(2)

where Wqi represents the weight factor of an input parameter; Wqj represents the weight factor of a violator parameter; Nqi represents the number of input parameter measurements; Nqj represents the number of violator parameter measurements; m is the number of input parameters and n is the number of violator parameters.

For FA determination, the excursion amount of violator parameters (Eqj) was first calculated (Eq. 3), and then the Normalized Sum of Excursions (NSE) (Eq. 4).

$$E_{qi} = \frac{{C_{qi} }}{{{\text{SV}}_{qi} }} - 1$$
(3)
$${\text{NSE }} = \frac{{\mathop \sum \nolimits_{j = 1}^{n} \left( {W_{qj} \times E_{qj} } \right)}}{{\mathop \sum \nolimits_{i = 1}^{m} \left( {W_{qi} \times W_{qi} } \right)}}$$
(4)

where Cqi represents the violator parameter's concentration of the violator parameter; SVqi represents standard value; and Eqj, excursion amount of violator parameter.

The FA, and the MWQI are calculated using equations Eqs. (5 and 6), respectively.

$$F_{{\text{A}}} = \frac{{{\text{NSE}}}}{{0.01 {\text{NSE}} + 0.01}}$$
(5)
$${\text{MWQI}} = 100 - \frac{{\sqrt {F_{{\text{S}}}^{2} + F_{{\text{F}}}^{2} + F_{{\text{A}}}^{2} } }}{1.734}$$
(6)

3.3 Integrated water quality index (IWQI)

The IWQI was developed by Mukate et al. (2019) and have been found helpful for water quality assessment (Egbueri, 2019b). With the IWQI, water quality parameters below their desirable limits and above permissible limits are described as deficient and excessive, respectively. Both scenarios are considered to have associated health problems. Just as the names imply, “deficient” indicates that something is lacking, and “excessive” suggests that something is beyond its standard limit. By considering both the deficiencies and excessiveness of parameters, a better understanding of water quality could be established. The primary focus of the IWQI is assessing the mineral or ion balance of water resources to avoid health implications due to excesses or deficiencies (Mukate et al., 2019). Targeted at improving the conventional WQI model, the IWQI was designed to exclude the problems of ambiguity, eclipsing, and aggregation observed when using the WQI model. According to Mukate et al. (2019), “the IWQI is flexible, unbiased, easy to calculate and time-saving, and provides useful information to prioritize and maintain the water quality of potable sources and reduce human health impacts from using poor-quality water resources.” Summarily put, the IWQI is described as a comprehensive and intensive water quality index that considers both the desirable and permissible parameters. The IWQI has a different step mechanism from MWQI which has been shown as follows:

3.3.1 Calculation of range

The NIS (2007) standard limits were utilized in the range calculation as shown in Table 2. The range was calculated using Eq. (7).

$${\text{Range }} = {\text{ Permissible limit }}\left( {{\text{PL}}} \right) - {\text{Desirable limit }}\left( {{\text{DL}}} \right)$$
(7)
Table 2 NIS (2007) drinking water standards and calculated range and MPL values for IWQI (Egbueri, 2020b; Mukate et al., 2019)

3.3.2 Calculation of modified permissible limits (MPL)

The NIS (2007) was also considered in determining the MPL (Eq. 8) and is presented in Table 2.

$${\text{MPL }} = {\text{ PL}} - \left( {20\% {\text{ of Range}}} \right)$$
(8)

In IWQI calculation, it is believed that the concentration of any parameter below the Desirable limit (DL) or above Permissible Limit (PL) is not suitable for drinking. Also, the concentrations that are less than the minimum require more concentration and above MPL, which will affect the water quality.

3.3.3 Determination of the subindex (SI)

Three different parameters are considered while calculating for the subindex. They are as follows.

$${\text{SI}}_{1} = \, 0$$
(9)

If the ith parameter (Pi) ’s observed value is > DL but < MPL, then SI1 will be expressed as zero.

$${\text{SI}}_{2} = \frac{{\left( {{\text{DL }} - {\text{Pi}}} \right)}}{{{\text{DL}}}}$$
(10)

Equation (10) is considered when the value of ith parameter is less than or equal to the DL; i.e., Pi ≤ DL.

$${\text{SI}}_{{3}} = \frac{{\left( {{\text{Pi }} - {\text{ MPL}}} \right)}}{{{\text{MPL}}}}$$
(11)

Equation (11) is considered when Pi is greater than or equal to the MPL; i.e., Pi ≥ MPL.

The final IWQI is then evaluated using Eq. (12);

$${\text{IWQI}}\, = \,\sum\nolimits_{i = 1}^{n} {{\text{SI}}_{ij} }$$
(12)

where SIij is the sub-index value of ith sample and jth water quality parameter (Egbueri, 2021; Mukate et al., 2019).

3.4 Entropy-weighted water quality index (EWQI)

Among the three models utilized in this work, the EWQI was first developed by Li et al. (2010). This model was developed to improve the conventional WQI model, which requires an assignment of specific weights to water quality parameters regarding their relative importance to human health. In EWQI, information entropy (ej) is used instead to get the uncertainty of the stochastic event, rather than assigning relative weights to parameters, as done with the conventional WQI model. The EWQI uses a discrete entropy equation to determine the weights of parameters. Also, EWQI helps identify the parameter that has the most significant impact on water quality without bias. This is obtained by determining the parameter with the lowest information entropy (ej) and the highest entropy weight (wj). Over time, the EWQI has been used by several researchers from different parts of the world and has been described as a model that provides unbiased, justifiable, accurate and reliable water quality analysis (Adimalla et al., 2020; Amiri et al., 2014; Feng et al., 2019; Li et al., 2010; Singh et al., 2019; Ukah et al., 2020; Wu et al., 2011). It also provides an overall understanding of different sets of water samples (Adimalla et al., 2020; Ukah et al., 2020). In this study, EWQI was employed to help assess the water quality and confirm other water quality tools used in the study area. The EWQI computation steps are as follows (Li et al., 2018; Ukah et al., 2020; Wang et al., 2020):

The evaluation matrix “X” is computed as shown in Eq. (13)

$$X = \left[ {\begin{array}{*{20}c} {x_{11} } & {x_{12} } & \ldots & {x_{1n} } \\ {x_{21} } & {x_{22} } & \ldots & {x_{2n} } \\ \ldots & \ldots & \ldots & \vdots \\ {x_{m1} } & {x_{m2} } & \ldots & {x_{mn} } \\ \end{array} } \right]$$
(13)

where “m” signifies numbers of samples; “n” represents the number of analyzed parameters.

The standardization process for “yij” and “Y” is computed using Eqs. (14 and 15), respectively.

$$y_{ij} = \frac{{x_{ij} - \left( {x_{ij} } \right)\min }}{{\left( {x_{ij} } \right)_{\max } - \left( {x_{ij} } \right)_{\max } }}$$
(14)
$$Y \, = \left[ {\begin{array}{*{20}c} {y_{11} } & {y_{12} } & \ldots & {y_{1n} } \\ {y_{21} } & {y_{22} } & \ldots & {y_{2n} } \\ \ldots & \ldots & \ldots & \vdots \\ {y_{m1} } & {y_{m2} } & \ldots & {y_{mn} } \\ \end{array} } \right]$$
(15)

where xij represents the initial matrix; (xij)min and (xij)max signify the minimum and maximum values of the analyzed parameters of the samples (Adimalla et al., 2020).

The first step in EWQI, is to determine the information entropy (ej) for each chemical parameter as shown in Eq. (16).

$$e_{j} = - \frac{1}{{{\text{In}}\,m }}\mathop \sum \limits_{i = 1}^{m} P_{ij} {\text{In}}\,P_{ij}$$
(16)

where Pij denotes the probability of occurrence of the normalized value of the parameter j expressed in Eq. (17).

$$P_{ij} = \frac{{P_{ij} }}{{\sum P_{ij} }}$$
(17)

Next is the entropy weight (wj) calculation for each parameter as expressed in Eq. (18).

$$w_{j} = \frac{{1 - e_{j} }}{{\mathop \sum \nolimits_{j = 1}^{n} 1 - e_{j} }}$$
(18)

Third step involves the calculation of the quality rating scale (qj) for each parameter.

$$q_{j} = \frac{{C_{j} }}{{S_{j} }} \times 100$$
(19)

where Cj is the concentration of parameters (mg/L); Sj is the standard permissible limit (mg/L) of the NIS (2007).

The final stage is the calculation of the EWQI as shown in Eq. (20).

$${\text{EWQI}} = \sum\nolimits_{j = 1}^{n} {w_{j} \cdot q_{j} }$$
(20)

3.5 Statistical analysis

In this study, Stepwise Regression Analysis (SRA) and Hierarchical Cluster Analysis (HCA) were performed using SPSS (v. 22). These analyses show and understand the spatio-temporal distribution of water quality and the relationship between the models. The SRA and the HCA were both utilized for the analysis of the sensitivities and agreement between the EWQI and IWQI models. The possible sources of water contaminants and drinking water quality demarcations (classification) of the samples were analyzed using HCA (Egbueri, 2020b; Wu et al., 2020). For the water quality demarcation, the HCA was performed using the IWQI and EWQI scores. Moreover, the Ward’s linkage method (with squared Euclidean distance) was used for the HCA. The data were normalized using z-score standardization to remove any bias due to variances in the obtained scores.

4 Results and discussion

4.1 General characteristics of the water resources

Table 3 presents the analyzed results for the physicochemical parameters, while the univariate statistical summary of all parameters is also presented in Table 3. Results were compared to the WHO (2017) and NIS (2007) drinking water standards. In this study, the water samples are acidic, with pH values ranging from 4.61 to 6.53 and an average of 5.561 (Table 3). The chemical parameters follow the trend of NO3 > Pb > Zn > Fe > Ni > Mn > Cr. Nitrate (NO3) can contribute to lowering pH in water supplies through oxidation and hydration processes (Collin et al., 2018). NO3 concentration in the study swerves from 0 to 21.1 mg/L with a mean of 6.417 mg/L (Table 3). NO3 were below the permissible limit in all water samples. However, 20% recorded no NO3 enrichment (Table 3). The source of NO3 in the study area is attributable to the use of fertilizers, domestic waste, agricultural waste, leachates from landfills and sewage leakages (Egbueri et al., 2019; Ezugwu et al., 2019; Mgbenu & Egbueri, 2019). About 26.6% of the water samples did not contain iron (Fe). However, 20% of the water samples exceeded the permissible limit (Table 3). Fe concentration in the water samples swerves from 0 to 0.54 mg/L, the mean of 0.127 mg/L (Table 3). Although Fe is an essential element in the human body, “red hot disease” is associated with drinking Fe-laden water. The possible source of Fe in the water supplies may include domestic sewages and coating of pipes (Wagh et al., 2018). Manganese (Mn) in this study is not widely distributed. About 60% of the water samples have no Mn enrichment, while 40% have Mn scores below the permissible limit (Table 3). Excess Mn's associated health effects include retarded mental development and damage of the central and peripheral nervous systems (SON, 2015; Ukah et al., 2019). Zinc (Zn) is essential in human, animal and plant growth (Hotz and Brown, 2004). Zn was present in all water samples with concentrations ranging from 0.001 to 0.54 mg/L and a mean of 0.127 mg/L (Table 3).

Table 3 Result of the physicochemical analysis of water samples

A report had shown that Lead (Pb) is a very toxic heavy metal that enters both the tissues of plants and the skeletal framework of man through ingestion by water, inhalation through soil and consumption of crops (WHO, 2017). Its accumulation gives rise to different health-related problems like mental disorders, cancer development and failure of the hematologic and renal system (Adimalla & Wang, 2018; Egbueri, 2020a). In this study, about 33% of the samples had no Pb enrichment, 46.6% had scores above the acceptable limit for drinking water, and however, 13.3% had scores below the permissible limit (Table 3). Overall, Pb concentration ranges from 0 to 3.087 mg/L with an average of 0.825 mg/L (Table 3). The sources of Pb in the water supplies are attributed to the use of chemical anti-killing agents (insecticides and pesticides), onsite sewages, faucets, leaded plumbing fittings, urban wastes and dumpsites (Egbueri, 2018, 2020a, 2020b; Wagh et al., 2018). Chromium (Cr) had a very low concentration which swerves from 0 and 0.01 mg/L with an average of 0.002 mg/L (Table 3). In this study, about 46.6% of the water samples had no Cr enrichment; 53% were enriched but below the permissible drinking water limit. Nickel (Ni) is associated with different health-related problems, including dermatitis, lung fibrous, cardiovascular and kidney disease (Egbueri, 2020a; SON, 2015; WHO, 2017). About 40% of the samples had no Ni enrichment, while 13.3% had low Ni enrichment. On the contrary, 33.3% were above the permissible limit for drinking (Table 3). Ni concentration in this study ranges from 0 to 0.34 mg/L with a mean of 0.084 mg/L (Table 3). Ni contamination in this study is attributed to poor waste disposal, sewage sludge and application of chemical fertilizers (Adimalla & Wang, 2018).

4.2 Modified water quality index (MWQI)

The MWQI classification scheme is given in Table 4. In this study, the violator parameters affecting the quality of the water supplies include Pb, Cr, and Ni. The MWQI was calculated for the 15 water samples and eight physicochemical parameters. The results for MWQI are presented in Table 5. The results showed that the number of parameters that excurse benchmarks (FS) is 0.781. However, the FF been the number of measurements that excurse benchmarks is 0.11, while 9.79 is the amount of excursion from benchmarks in the failed measurements (FA). The final MWQI was observed as 44.95. Based on the classification criteria for MWQI, the water supplies fall within the “Marginal” rank, which signifies that the water quality is frequently threatened at a desirable level. The study area is a fast-developing area with increasing developmental, agricultural and improper waste management (open waste dumping) activities. It is believed that the high threat to these water supplies is due to the waste generation and management method practised in the study area, as reported by Egbunike (2018), Mgbenu and Egbueri (2019), Egbueri and Unigwe (2020).

Table 4 Classification criteria for MWQI (Shankar & Sreevidya, 2019)
Table 5 Summarized result of the MWQI

4.3 Integrated water quality index (IWQI)

The classification scheme for IWQI subdivides water quality into five distinct groups: IWQI < 1 (Excellent water for drinking); IWQI 1–2 (Good water for drinking); IWQI 2–3 (Acceptable water for domestic); IWQI 3–4 (Poor water for drinking); IWQI > 5 (Unacceptable water) (Egbueri, 2021; Mukate et al., 2019). After due computation, formulation and evaluation of the IWQI, the summary results were presented in Table 6. The observed values ranged from − 9.11 to 401.8 with a mean of 106.8, which shows that most of the water samples are polluted. Following the classification criteria by Mukate et al., 2019, any IWQI value above 3 is classified as poor, whereas any above 5 is unfit for human consumption. Based on the information provided in Table 6, about 20% of the water samples are excellent for drinking, 6.67% is of good quality, 13.33% are acceptable for domestic purposes, and 60% are unsuitable for human consumption. However, water samples 2, 5, 6, 9, 11 and 14, which are believed to have the strongest anthropogenic influence, show very high pollution levels, far more than other water samples. Similar research findings have been reported (Egbunike, 2018; Mgbenu & Egbueri, 2019), whereby some water samples were found to have higher heavy metal pollution than others.

Table 6 Summary results for the IWQI and their quality rating

4.4 Entropy-weighted water quality index (EWQI)

The EWQI classification scheme for water quality is as follows: “Rank 1” < 50 (excellent water quality); “Rank 2” 50-100 (good water quality); “Rank 3” 100–150 (average water quality); “Rank 4” 150–200 (poor water quality); “Rank 5” > 200 (Extremely poor water quality) (Adimalla et al., 2020; Amiri et al., 2014; Feng et al., 2019; Li et al., 2010; Singh et al., 2019; Ukah et al., 2020; Wu et al., 2011). The information entropy (ej) and entropy weight (wj) results of all analyzed water quality parameters are presented in Table 7. However, Table 8 contains the summarized EWQI results. In this study, Pb was observed to have the highest effect on the water quality. This assertion was made following Gorgij et al. (2017) report that the parameter with the highest wj and lowest ej will have the highest effect on water quality (Table 7). The summary results for the EWQI shown in Table 8, swerves from 84.02 to 21,430 with an average of 5866. Furthermore, any EWQI value that exceeds Rank 3 (i.e., above 150) is believed to be unsuitable for drinking purpose.

Table 7 Results of information entropy (ej) and entropy weight (wj)
Table 8 Summary results for the EWQI and their quality rating

Based on the information presented in Table 8, it was realized that about 26.6% of the total water samples are of good quality, 13.3% are of average quality, and however, 60% of the water samples are extremely poor in quality. Meanwhile, 13.3% of the water categorized as average water quality could only be used for domestic purposes. However, 60% of the water, categorized as extremely poor water quality, are unfit for human consumption. Furthermore, water samples 2, 5, 6, 9, 11 and 14 show extreme high values of EWQI ranging from 7527 to 21,430 and an average of 14,116. It is further confirmed that the high pollution in these samples is due to the high impact from anthropogenic activities, which is often common in fast-developing suburbs. Thus, it is suggested that any pollution remediation measures proposed for the study area should consider these six extremely polluted water stations first before the others marked to have better water quality.

Figure 2 presents the quality rating (qj) for the six water samples collected within the zone of high waste accumulation. From this figure, Pb (in green) has the highest values in the six water samples, followed by Mn (in yellow). NO3 (in orange) shows a high presence in WS6 and WS14. Ni (in purple) was relatively low, except for WS2 and WS11. Fe (in black) showed a very high presence in WS11. However, Mn (in red) and Cr (in gray) were relatively low in all water samples. The pH in all waters were relatively low, indicating their acidic nature. Figure 2 was created in order to establish a better understanding of the significant elements contributing to the quality deterioration of the water supplies within the points of interest and also to prove that the parameter with the highest wj and lowest ej has the highest impact on the water supplies (Gorgij et al., 2017). Summarily, with the quality rating (qj) of the six water samples shown in Fig. 2, Pb was observed as the major heavy metal deteriorating the water supplies. This may be attributed to the nature of the anthropogenic activities taking place in the study area. However, inhbaitants who make use of these water supplies (WS2, WS5, WS6, WS9, WS11 and WS14) may be exposed to cancerous and chronic illnesses due to the high concentration of Pb.

4.5 Water quality classification/demarcation using HCA

As can be observed from Tables 7 and 9, the differences in classification schemes of the IWQI and EWQI introduced some kind of bias in deciding which sample is suitable or not. The samples identified as excellent by the IWQI were placed in the ‘good’ category by the EWQI. Hence, there was need to utilize HCA to show the quality groupings of the analyzed water samples. Some water quality studies (Egbueri & Unigwe, 2019; Egbueri et al., 2019; Egbueri, 2018, 2019a, 2020a, 2020b) have shown that HCA is a suitable tool for classifying water samples into various classes based on their quality and chemical genetics.

Table 9 The obtained correlation coefficients showing the sensitivity of the models on the parameters

The IWQI and EWQI scores were integrated to produce a dendrogram (Fig. 3) for the water quality classes. Cluster 1 comprises of those samples generally classified as lowly-marginally contaminated. However, Cluster 2 is dominantly comprised of those samples that are highly polluted with heavy metals. Compared with those of Cluster 1, it was observed that the samples in Cluster 2 have extremely high IWQI and EWQI scores. By this demarcation, it can be generally stated that the samples in Cluster 1 (making about 60% of the total) have low to high pollution, while those in Cluster 2 (making about 40%) have very high to extreme heavy metals pollutions. Following the quality classes given by the IWQI and EWQI, the result of the MWQI (Table 5), which highlighted that the overall water quality of the study area is frequently threatened, is confirmed.

Fig. 3
figure 3

A hierarchical dendrogram classifying the water quality of the samples based on integrated IWQI and EWQI values

In this study, it was also noticed that the surface waters, which because of their clean appearances, are often judged by the local consumers as suitable drinking water supplies, have very questionable drinking quality. The spring samples were identified by both IWQI and EWQI as extremely polluted and unsuitable drinking waters. On the other hand, the only stream water supply was marked by both indices to be of marginal or average drinking quality. The pollution of these surface waters could be attributed to anthropogenic inputs related to indiscriminate waste disposal. However, some boreholes were highly polluted. This could be indicating that they are shallow aquifers prone to easy surface contaminations. Additionally, because the Nanka Formation is mainly composed of loose, porous and permeable materials (Egbueri and Igwe 2020), the contamination of its shallow groundwater would be much easier.

4.6 Water contaminant source allotment using HCA

The discussion so far has indicated that the quality of some water supplies is questionable due to pollution. Therefore, it was thought to analyze the possible source of pollution further using the HCA. The application of HCA for water pollutant source apportionment is well demonstrated in the current study. A dendrogram (Fig. 4) was produced using the Ward’s linkage method and z-score standardization. From Fig. 4, it can be seen that the analyzed parameters formed two major cluster groups, both of which have their separate sub-clusters.

Fig. 4
figure 4

source apportionment

A hierarchical dendrogram of the water quality parameters for pollution

The first cluster comprises parameters (Zn, Pb, Ni, pH, and Cr) that are more typical of origins due to anthropogenic influences. It was observed that these parameters in this cluster class greatly influenced the water quality of the study area more than other parameters. Cluster 1 has two sub-clusters. In the first sub-cluster Zn, Pb, and Ni were grouped. This points that similar processes, such as leaching from batteries, tires and heavy chemical wastes in dumpsites, could be influencing their enrichment in the water supplies (Egbueri, 2018, 2020a; Mgbenu & Egbueri, 2019). Because the waters are generally acidic, they have the potential to leach Pb from water distribution pipes (Egbueri, 2021). The association between pH and Cr could suggest that the acidity of the water may be influencing the enrichment of Cr. However, the variation in their Euclidean distances indicates that this association is not so strong. Other factors that may add Cr to water include metallurgical and heavy chemical wastes (Egbueri, 2018, 2020a; Mgbenu & Egbueri, 2019).

However, Cluster 2 is comprised of such parameters (Fe, Mn, and NO3) that are typical of both natural and anthropogenic origins. Mgbenu and Egbueri (2019) had attributed the presence of nitrate in the waters from the Umunya district to anthropogenic activities. For this area, such anthropogenic activities that could influence nitrate enrichment in the water include poor disposal mechanics for organic wastes such as vegetables and sewage (Egbueri et al., 2019). However, Barzegar et al. (2019) reported that NO3 in water could also be due to some geogenic processes like denitrification, redox reactions, and anoxic conditions of the aquifer. The use of metallic pipes for water distribution and metallic wastes may be responsible for the Fe contents of the waters. Nevertheless, geogenic processes like weathering ironstones and ferromagnesian minerals could also be responsible for the Fe in the waters. Mn is commonly associated with NO3. However, the higher concentrations of NO3 more than Mn (and Fe) could explain their varied Euclidean distances in Fig. 4. Moreover, their separation into different sub-clusters could be suggesting a higher impact of anthropogenic activities in the addition of NO3 more than the other duo. For the study area, the presence of Mn could be due to human inputs such as sewage, fertilizers, and fossil fuel combustion (Egbueri, 2018, 2020a). Natural factors like redox reactions and anoxic and flow conditions may influence the concentration of Mn in water (Liao et al., 2018).

4.7 Establishing the relationships between the indexical models

The observations made so far have shown that there is some kind of similarity between the results of these models in this study. The MWQI did not give the quality description for individual samples; however, it did present the general overview of the quality of the public water supplies. The result it showed was in line with those given by the IWQI and EWQI. However, much of our comparative analysis of the models’ sensitivities will focus on IWQI and EWQI.

An SRA was performed on the IWQI and EWQI scores. It was observed that a strong relationship exists between the two models (Fig. 5), indicating that they followed a similar trend in identifying the water quality of the study area. In other words, the values of IWQI and EWQI increased accordingly in the same direction across the water samples. Additionally, Pearson’s correlation analysis was performed between the parameters’ concentrations and the IWQI and EWQI scores. Usually, correlation coefficients < 0.5, 0.5 < r < 0.7, and > 0.7 are, respectively, marked as weak, moderate, and strong. The correlation results obtained (Table 9) showed that both models sensitively identified Pb, Zn, and Ni as the most important pollutants that influenced the IWQI and EWQI scores. But, the identification of Zn as one of the important pollutants does not make it a significant pollutant in the study area, as all the Zn concentrations in all the samples were well below the maximum allowable limit of 3 mg/L.

Fig. 5
figure 5

Regression plots for assessing the relationship (agreement) between the IWQI and EWQI

5 Conclusion

The three indexical models (MWQI, IWQI and EWQI) utilized in this current study have proven to be sensitive, efficient and effective in water quality assessment. This study has indicated that the integration of the three models gives a better understanding of the water quality in the area. Moreover, the HCA proved very efficient in the allotment of the possible sources of pollution in the study area. Overall, the results of this study confirmed that the water supplies are threatened by heavy metals pollution. Based on the indexical approach employed to investigate the quality of the water supplies, it has been proven that the water supplies are relatively lowly to very highly polluted. In this study area, about 40% of the analyzed water samples are suitable for both drinking and domestic purposes, whereas the majority (60%) are unfit for human consumption. Investigation into the performance and sensitivity of the MWQI, IWQI and EWQI models in water quality assessment was analyzed and the results showed that they are all sensitive, efficient and effective tools. This study has indicated that the integration of the three models gives a better understanding of water quality. Therefore, adequate measures should be taken to prevent further deterioration of the water quality. Moreover, the contaminated water should be adequately treated, either by boiling activated carbon filtration, or advanced technology, before human consumption and used for domestic purposes such as cooking.