Introduction

Groundwater is a vital and sustainable natural resource for socio-economic development, for healthy ecosystems, and for improving the productivity and welfare of populations. Leaching action of minerals, extreme utilization of manures, and anthropogenic activities (Rango et al. 2012) make groundwater non-potable because these activities enhance contamination of mainly heavy metals (arsenic (As), Fe, cadmium (Cd), chromium, and fluoride (F)) in groundwater. The concentration of these harmful toxic pollutants is rising continuously due to rapid population growth, economic activity, infrastructural development, and industrialization (Ram et al. 2021). The long-term viability of groundwater quality is among the most significant challenges facing the world today (Mittal et al. 2021). In dry period, the situation becomes more critical because of low groundwater level.

Water contamination deteriorates water quality resulting in an adverse impact on aquatic life, human health, economic development, and social thriving (Milovanovic 2007). Therefore, we need to stop water pollution (Simeonov et al. 2003) sourced from anthropogenic activities by imposing rules and regulations designed by the policymakers of every country regularly (Vasanthavigar et al. 2010). Before applying several treatment options like physicochemical methods and membrane technology for groundwater purification, its quality should be monitored and maintained. Water quality index (WQI), used by the scientific community and decision-makers (Abbasnia et al. 2019; Adimalla 2021; Chaurasia et al. 2018; El Baba et al. 2020; El Osta et al. 2020; Panghal and Bhateria 2020; Ram et al. 2021), is a technique which combines multiple parameters into a single numeric dimensionless value determined by integrating water quality variables according to their relevance (Mallick et al. 2021). This simple, outstanding and proficient tool is applied to classify groundwater quality for drinking into any one of five categorizes—excellent, good, medium, poor, and extremely poor. Many countries like India (Chaurasia et al. 2018; Panghal and Bhateria 2020), Iran (Abbasnia et al. 2019), Brazil (Sabino et al. 2020), and the UK (El Baba et al. 2020) applied WQI technique to confirm current scenario of groundwater.

Multivariate statistical analysis such as correlation analysis (r), principal component analysis (PCA), and cluster analysis (CA) elucidates the relationships between water quality variables and possible factors, as well as their influences on water quality responses (Kumar and Krishna 2021). Correlation coefficient (r), also known as Pearson correlation, reflects the relationship between at least two dependent and independent variables (Sadat-Noori et al. 2014). This concept is applied by various researchers (Awomeso et al. 2020; El Osta et al. 2020; Nath et al. 2021; Panghal and Bhateria 2020).

PCA is effectively used to elucidate large datasets in complex forms and to make less prejudiced processes (Kazi et al. 2009). It additionally encourages us to recognize the possible pollution sources or factors impacting water quality. Besides, PCA strategies have already been established to be a beneficial tool for water resource management and explanation of contamination issues (Awomeso et al. 2020; Kumar and Krishna 2021; Nyam et al. 2020). By classifying samples as per their distinct chemical properties, CA proves to be an effective tool for establishing chemical interactions between them (Sheikhi et al. 2021).

Published journals on groundwater quality in Tripura, northeastern state of India, are scanty. Singh and Kumar (2015) focused on the status of irrigation and drinking underground water quality in rural and urban areas of Agartala (state capital, Tripura) after analyzing 21 samples. Coexistence of As and F in 59% samples of groundwater in the Dharmanagar region, North Tripura, was established by Bhattacharya et al. (2020) through correlation coefficient data. As and F concentrations exceeded WHO permissible limits in 30% samples. Paul et al. (2019a) reported dominance of carbonate dissolution, rock–waterinteraction, and silicate weathering processes in groundwater hydrochemistry. Paul et al. (2019b) observed Fe and Mn in majority of samples exceeded BIS standard limit and all samples satisfy the irrigation suitability criterion. Brindha et al. (2020) found 66.2% of groundwater samples had good quality water by applying the Canadian Council of Ministers of the Environment Water Quality Index (CCMEWQI) method. They also observed a high risk to human health from groundwater ingestion containing unsafe levels of As and Fe.

Groundwater demands in Tripura meet 50% of irrigation needs, 50% of urban needs, and 80% of rural needs (Debbarman et al. 2013). The necessary actions to be made to ensure the present status of groundwater on regular basis due to changes in various parameters like (i) rapid population growth, (ii) anthropogenic activities, (iii) liquid effluents from industries and engineering workshops, (iv) contaminated water by agricultural waste and effluents, and (v) climatic situation of the environment. In this study, an effort has been initiated to achieve the following objectives: (i) to assess groundwater quality and environmental impact using numerical tools like WQI, correlation coefficient, PCA technique, spatial mapping, and CA; (ii) to generate experimental data for these tools through analysis of 38 groundwater samples collected in January, 2018, from both urban and rural areas of West Tripura District; and (iii) to obtain outcomes that help decision-makers in India, especially for the Tripura Government, for developing a suitable mechanism for treatment as well as making strategies for avoiding future contamination. Besides, an extensive literature survey revealed that a special attention on Fe contamination with its impact should be elaborately discussed.

Study area

The study area covers seven rural development blocks, two Nagar Panchayats of West Tripura District including the Agartala Municipal Corporation (Table 1; Fig. 1). Tripura is one of the smallest states which is located in the southern portion of Northeast India. Sub-tropical and high-humid climate with strong existence of southwest monsoon is characterized with high rainfall in this region. The area receives about 2000 mm (average) annual rainfall and experiences three seasons (winter, summer, and rainy). Groundwater is mainly recharged by rainwater. The topography of the study area is characterized by hilly, undulating plains and alluvial valleys. Dendritic with sub-parallel to parallel drainage system is noticed (Fig. 1). A total 38 number of groundwater sampling locations are randomly selected in West Tripura District, mainly from in and around the Agartala City as well as from Gram Panchayats and Tripura Tribal Areas (ADC villages). The geographical extension of study area is bounded by longitudes between 91° 09′ E to 91° 47′ E and latitudes between 23° 16′ N to 24° 95′ N. All the samples are collected in HDPE bottles from tube wells as per standard protocols. The yield of tube wells varies from 50 to 200 LPM. About 18% samples have been collected from < 20 m depth while 53% samples have 20–40 m depth. Only about 29% of samples are collected from > 40 m depth which influences certain degree of geochemical zoning. The average maximum and minimum temperatures of the study area are experienced at 35 °C and 10 °C, respectively. The purpose behind selecting the study area expecting the groundwater might be exceptionally contaminated due to the following: (i) it is the highest populated (988,192 as per 2011 census of India) district covering 984 km2; (ii) Agartala, district headquarters, as well as the state capital, is the primate city situated in this area; (iii) rubber and allied processing industries (ABM rubber industries and Malaya rub tech industries) are well-established; and (iv) geographically, it is bounded by Bangladesh (groundwater affected country) on the west.

Table 1 Block/MunicipalCorporation/Nagar Panchayat wise sampling locations
Fig. 1
figure 1

Sampling locations of the study area

Geological and hydro-geological setting in study area

Geologically, the study area is young. The massive fluvial sediments are periodically deposited and get uplifted due to the upliftment of whole landmass. The geology of the District has been sub-divided into four major groups: Surma, Tipam, Post Tipam, and Recent. Generally, sandstone, shale, and loose unconsolidated sand, silt, and clay are predominant in its structure (Dasgupta 1979). The area particularly represents the second-order landform system with an average altitude of 15 m. The erosion remnant mounds separated by linear in-filled valleys characterized this highly dissected terrain. These are locally referred to as tilla and lunga respectively.

The Quaternary arrangement, the most recent topographical timeframe in Earth’s history, is made out of Alluvium sand/silt and clay alternating beds under rock group of “unconsolidated sediments.” The DupiTila structure consists of unbending to fine sandstone with yellow and light brown silt clay bands under rock group of “semi-consolidated sediments.” Tipam formation composed of sandstone, pebble bed, and conglomerates is under the rock category of “semi-consolidated sediments.” The Bhuban and Boka Bil formations under the Surma group (“semi-consolidated sediments” rock type) are made of gray to brownish-gray massive shale sandstone-siltstone with limestone beds.

Geomorphologically, the territory of Tripura is juvenile and stands for the first-order terrain. The N-S organized hillsides are anticlinal and interceding basins are synclinal. Five hill ranges and valleys are present in Tripura. The drainage arrangements are of dendritic and rectangular styles.

Materials and methods

Sampling and analytical methods

About 38 samples were stored in HDPE bottles (1-L capacity each) after 10 min of flushing out of each tube well water. Physical parameters including electrical conductivity (EC), dissolved oxygen (DO), turbidity, pH, and total dissolved solid (TDS)(Table 2) were analyzed in sampling locations immediately after collecting the sample. Samples were acidified with 7.9 N nitric acid solution (1:1) in each sample to make pH < 2. The chemical parameters (alkalinity, calcium (Ca), TH, magnesium (Mg), chloride (Cl), and Fe) (Table 2) were tested in the laboratory using standard protocol, American Public Health Association (APHA)(2012). pH was tested using a THOSCON pH meter of TMP 3 model. Turbidity measurement was done by turbidity meter (Labtronics, Model LT-33). DO value was measured with a digital DO meter (Labtronics, Model LT-18). EC and TDS values were checked by EI (Electronics India) equipment. Before conducting experiments, all instruments were pre-calibrated using standard operating procedures. The alkalinity analysis was performed by an acid titration method utilizing phenolphthalein and mixed indicator (bromocresol green and methyl red). Mg, Ca, and TH were analyzed using an EDTA titration method. Cl was determined using silver nitrate titration method (potassium chromate indicator). Fe concentration was measured by phenanthroline method using UV spectrophotometer (PerkinElmer Lambda 35) at 510 nm. All measurements were performed three times and an average value was taken for further interpretation. The spatial distribution map was prepared using the inverse distance weighted (IDW) interpolation technique (Natesan et al. 2021) in the ArcGIS v.10.7.1 software. IDW estimates cell values by averaging the values of sample data points in the neighborhood of each processing cell specifying a lower power that will give more influence to the points that are farther away, resulting in a smoother surface.

Table 2 Basic statistical summary of physicochemical parameters

Calculation of WQI

WQI values of 38 samples were estimated taking 11 variables to obtain their fitness towards drinking. To calculate WQI, two methods (EWWQI and WAWQI) have been employed.

Calculation of EWWQI

The thought, information entropy was first created by Shannon (1948) to quantify the level of confusion and vulnerability of a framework. EWWQI is a widely accepted technique over other WQIs, as EWWQI method does not require any personnel judgments on assignments of weight to each parameter (Amiri et al. 2014). The formulation of EWWQI requires the following steps (Adimalla 2021):

  • Development of a matrix taking analyzed values of parameters of each sample.

  • Construction of normalization matrix considering normalized values of each analyzed parameter to eradicate errors caused by different units and dimensions.

  • Estimation of information entropy based on these normalized values which is formulated by Shannon (1948):

$$ {e}_j=-\frac{1}{\ln m}\sum \limits_{i=1}^m{P}_{ij}\ln {P}_{ij} $$
(1)

where m represents the no. of sampling areas and Pij is the possibility of occurrence of normalized value (yij) of jth evaluated parameter in ith sample which is given by the following equation:

$$ {P}_{ij}=\raisebox{1ex}{${y}_{ij}$}\!\left/ \!\raisebox{-1ex}{$\sum \limits_{i=1}^m{y}_{ij}$}\right. $$
(2)
  • Then, entropy weight can be resolved so that lower entropy parameters have been given more weightage.

$$ {w}_j=\frac{1-{e}_j}{\sum \limits_{j=1}^n1-{e}_j} $$
(3)
  • Finally, EWWQI can be formulated by accumulation of entropy weight and quality rating scale.

$$ EWWQI=\sum \limits_{j=1}^n{w}_j\times {q}_j $$
(4)

where qj is the ratio of analyzed value of jth parameter (Cj) to its World Health Organization (WHO)(2011) recommended standard value (Sj).

$$ {q}_j=\frac{C_j}{S_j}\times 100 $$
(5)

Several researchers (Adimalla 2021; Amiri et al. 2021; Fagbote et al. 2014) have applied this technique to determine WQI. Based on estimated WQIs, they utilized Table 3, where groundwater can be categorized into 5 different categories for drinking, from “excellent to extremely poor” (Amiri et al. 2021).

Table 3 Classification of groundwater quality based on EWWQI value

Calculation of WAWQI

WAWQI technique was primarily proposed by Horton (1965) and successively modified by Brown et al. (1970) and Cude (2001). Three steps are to be pursued to achieve WQI: The initial step is the computation of weightage unit of jth parameter and it is inversely related to Sj for that corresponding parameter. Therefore, the mathematical equation for the weightage unit is

$$ \kern1.25em {w}_j^{\prime }=\frac{K}{S_j} $$
(6)

where K is the proportionality constant which can be determined as follows:

$$ K=\frac{1}{\sum \limits_{j=1}^n\frac{1}{S_j}} $$
(7)

The second step is the formulation of quality rating (qj) for jth parameter by using the same Eq. 5 as used in the EWWQI method. The last step is the computation of WQI by aggregation of quality rating with the unit weight which is given by following equation:

$$ WAWQI=\frac{\sum \limits_{j=1}^n{w}_j^{\prime}\times {q}_j}{\sum_{j=1}^n{w}_j^{\prime }} $$
(8)

Several researchers (Khatri et al. 2020; Ram et al. 2021) used this technique to estimate WQI. Depending upon the estimated WQI, the status of water for human consumption can be categorized as per Table 4 (Khatri et al. 2020).

Table 4 Category of groundwater quality according to WAWQI value

Multivariate statistical analysis

Correlation coefficient (r) is calculated statistically and its value may be positive or negative and may be ranged from − 1 to + 1. r values of + 1 and − 1 indicate strong and inverse linear relationships, respectively, between variables.

PCA is a tool to decide an autonomous variable by taking out exceptionally related parameters. It evaluates the variance of interconnected variables by taking complex datasets and transformed them into less number of pseudo-variables. The steps for PCA are as follows (Nath et al. 2021):

  • Correlation coefficient matrix calculation: the datasets are appropriate to do PCA if r values between parameters are appeared to be high

  • Identifying the number of factors to be kept relying upon % of variance and eigenvalues

  • Factors rotation to enhance their clarification by maximizing loading of parameters

Kaiser–Meyer–Olkin (KMO) and Bartlett’s sphericity tests have been executed to confirm whether PCA is reasonable or not for the present experimental dataset. KMO index measures the satisfactoriness of samples. The entire analyzed datasets will not be appropriate for PCA if this index shows a ˂ 0.5 value. In the present study, PCA is applicable as this value is 0.722. Bartlett’s sphericity test has been performed to acquire correlation matrix’s nature whether this is a uniqueness matrix or not. Null hypothesis is not considered for PCA through Bartlett’s sphericity test because this hypothesis already considers the uniqueness of correlation matrix. In the present investigation, the significance level (0.00) is ˂ 0.05 which discard null hypothesis and it affirms the presence of a strong relationship among parameters. Finally, PCA technique has been implemented on normalized dataset using varimax rotation (to maximize factor loadings) and Kaiser’s principle (Kaiser 1960) to achieve principal factors.

CA was interpreted to group monitoring sites and to determine whether each sample had similar physicochemical (inherent) characteristics. In CA, every sample forms an individual cluster and a pair of clusters is combined depending on similarity (measured by squared Euclidean distance) and applied linkage technique (Ward’s method). This method utilizes an ANOVA technique to determine differences between each cluster. The outcomes are demonstrated by a dendrogram plot, providing a basic image of the cluster (groups)(McKenna 2003). IBM Statistical Package of Social Studies (SPSS) version 21 was employed to make PCA and CA assessments.

Results and discussion

Table 2 represents the basic descriptive statistics of environmental parameters of groundwater samples. The mean value of all parameters was within WHO contaminant level for drinking except few cases suggesting local contamination sources.

Physicochemical parameter analysis

pH represents the presence of H+ ion and acidity or basic nature in water. In water body, all types of reactions (chemical, biological, physical, etc.) depend on pH (Rao 2006). The average value of pH was 5.88 ± 0.492 (Table 2) which was below WHO contaminant level (World Health Organization (WHO)2011). However, 79% of samples showed acidic nature because of dissolved CO2 and discharge of effluents from industrial waste. pH confirmed strong correlation with alkalinity (r = 0.751) (Table 6) as higher pH raises alkalinity level resulting in the conversion of all dissolved CO2 to CO3 and HCO3 ions.

EC refers to a substance ability to conduct electrical current through water. The mean EC was 169.30 μmho/cm(Table 2), while only 13% of samples exceeded WHO standard limit of 250 μmho/cm. The highest EC can damage crop yield and soil structure as water transparency is directly proportional to crop yield. Geochemical processes including reverse and direct ion exchange, high evaporation, silicate weathering, and rock-water association are various causes of high EC values (Alfaifi et al. 2020). EC showed strong correlation with Ca (r = 0.909), Mg (r = 0.948), and TH (r = 0.949) and moderate correlation with Cl (r = 0.632) (Table 6)(Panghal and Bhateria 2020), suggesting all ions (cations and anions) have a common source and geochemical process. Figure 2f depicts the highest EC value in the central part of Agartala City, located in the western part of the study area. Northern and southern parts have observed low to very low (< 114.50 μmho/cm) EC.

Fig. 2
figure 2

Spatial distribution maps of physicochemical parameters showing concentrations of a iron (Fe), b turbidity, c dissolved oxygen (DO), d calcium (Ca), e magnesium (Mg), and f electrical conductivity (EC)

TDS represents inorganic salts of Mg, Ca, Na, K, and Cl dissolved in groundwater (Adimalla et al. 2018). In this study, TDS ranged from 19.23 to 750.5 mg/L with an average value of 89.53 mg/L showing all samples within WHO limit of 1000 mg/L (World Health Organization (WHO)2011). A higher level of TDS prompts drinking water to be of bitter taste, brackish, and salty which in turn can harm aquatic species present in water. Higher dissolved salts pointing to high EC show perfect correlation with TDS (r = 0.946) (Table 6).

DO is an essential organic parameter to assess the fitness of water for keeping up human well-being and oceanic life. The average DO of groundwater was 4.38 ± 1.61 (Table 2). About 37% of samples were much below WHO prescribed limit (World Health Organization (WHO) 2011) contributing to decomposition and die-off of submerged plants. Turbidity (high concentration) absorbs sunlight and produces warm water, thereby reducing DO level, confirming weak negative relationship with turbidity (r = − 0.315). The maximum concentration of DO has been reported in the northwestern to the western portions of the study area. Some pockets of the southern part also have high concentration of DO. However, northeastern and southeastern parts of the district observed poor concentration of DO (Fig. 2c).

Turbidity is an indication of optical quality that reflects the suspended particles present in water via dispersion of light (Kumar and Krishna 2021). The analyzed mean turbidity was 16.14 NTU with 45% samples exceeded WHO standard limit of 5 NTU. Only one monitoring station (GW17) showed a turbidity value (148.1) about 30 times higher than the WHO limit. With the rising of water levels due to rapid and copious raining, suspended and dissolved solids in water also raises which in turn increases turbidity. Turbidity derives from insoluble Fe particles (colloidal form) and excessive dissolved metal produces turbid water, showing moderate correlation with Fe (r = 0.636) (Table 6). From Fig. 2b, the western and southeastern parts have the highest turbidity than northeast of the central part.

Alkalinity represents the presence of carbonate, bicarbonate, and hydroxide compounds of sodium, potassium, and Ca in water. Alkalinity varied from 204 to 358 mg/L(Table 2) with an average of 128.05 mg/L showing 26% of samples were alkaline in nature. Too much alkalinity can change the normal pH of body prompting numerous human health issues like nausea, vomiting, and bone problems. Excessive alkalinity leads to the formation of corrosive effect in boiler known as “embrittlement.”

Mg and Ca are essential elements for biological growth. The source of Mg and Ca in groundwater mainly emerges from calcite, dolomite, and magnetite rock. Excessive concentrations of Ca lead to stone formation in human body adversely destroying human health. Again lack of Ca can cause rickets in human bodies. In the present study, the average value of Ca and Mg was 11.75 mg/L and 37.85 mg/L(Table 2), while Mg in 21% samples overstepped the WHO standard limit of 50 mg/L. The western part near Agartala City and southeastern fringe reported maximum Mg concentration than the northern part (< 24.66 mg/L) of the study area (Fig. 2e). From Fig. 2d, the western edge and south-central part observed maximum Ca than the northern part. A gradual decrease of Ca in groundwater has been taken place from the northwestern to southern portions of the district.

TH is obtained from the summation of Mg and Ca hardness. All samples categorized as soft water with hardness values of 7.921–265.346 mg/L with an average value of 49.609 mg/L were reported. Soft water can increase heavy metal (copper, zinc, lead, etc.) solubility in water. This also increases scale deposition in pipes (pipe corrosion) which prevents heat transfer through the pipe causing a serious environmental problem. TH exceeding 150–300mg/L can spoil the kidney by developing stone in the kidney and it reduces soap’s ability to produce lather. Mg confirmed more good correlation with TH (r = 0.99) (Table 6) than Ca (r = 0.96) does. This statement confirmed that TH was primarily dominated by Mg. The correlation of TH, Ca, and Mg specifies contamination due to minerals or geologic sources.

Cl in groundwater at higher dose can cause laxative effects. In the present investigated area, the mean Cl concentration was 31.88 mg/L(Table 2) showing all samples within WHO contaminant level. Cl ion above 350 mg/L increases EC which causes metal corrosivity, thus increasing metal levels in potable water. Excess Cl ion is an indication of contamination from various sources and it also creates salty water (Marghade et al. 2012).

Fe exists in groundwater either in + (II) or + (III) oxidation state depending on anaerobic or aerobic conditions. The average Fe value was 4.68 ± 4.03 mg/L(Table 2) with 89% of samples exceeded WHO prescribed limit of 0.3–1 mg/L. This demonstrates that the monitored location is exceptionally influenced by Fe contamination. The higher Fe value might be credited to the leaching of Fe-bearing minerals into groundwater and low groundwater level. Excessive Fe concentration transforms water color to reddish-brownish and is capable to enhance bacterial growth (Awomeso et al. 2020). The intake of too much Fe has been shown to harm the blood vessels, liver, pancreas, and kidneys and can sometimes results in death. Bloody stool and vomiting are common side effects of excessive Fe consumption (Panghal and Bhateria 2020). Overexposure to Fe has been linked to diabetes, fatigue, abdominal pain, and in some cases impotence among other health consequences. The southeastern and southwestern parts of the study area reported more Fe contamination comparing to the northern and central parts (Fig. 2a). It has also been observed that especially southern bank of river Haora has more Fe contamination than the northern part. Other studies reported Fe concentration as 0.01–19.28 (Singh and Kumar 2015), 0.22–5.07 (Paul et al. 2019a), 0.12–5.14 (Paul et al. 2019b), and 0.05–5.39 (Brindha et al. 2020) mg/L.

The Fe concentration is varying in different sub-surface layers. The average value of Fe content is high in near to land surface and relatively poor concentrations have been found in higher depth of land surface. About 18% samples have high Fe contamination (> 7.79 mg/L). Maximum sample stations of this group are located in the western part of the research area where groundwater has been collected mostly from < 20 m bottom of the earth surface specially Kalyani, Women’s College, Bodhjungnagar area. About 53% of sample stations have been found from 20 to 40 m depth of the surface. It is also found that Fe contamination is significantly lower in these areas. The rest of 29% of sampling station reflects that groundwater has been collected from more than 40 m depth from the earth’s surface. The study reveals that Fatikchara, Shantipur, Salbagan, Narsinghgarh Bazar, and Hapania areas have a relatively very low level of Fe. It signifies that, with an increase of depth of the soil layer, geochemical properties of groundwater are reduced proportionately. Geological and morphological structures of the region have strong relation among the depth of the earth surface and ground water quality (Fig. 2a).

WQI analysis

WQI is estimated utilizing two strategies in the present reported area. To know the gravity of concern regarding Fe contamination, EWWQI and WAWQI values are also estimated considering 10 parameters where iron concentration is not included. The research additionally features percent decrease of two WQI considering with and without Fe as per following equation

$$ \%\mathrm{decrease}\ \mathrm{of}\ \mathrm{WQI}=\frac{\mathrm{WQI}\ \mathrm{with}\ \mathrm{iron}-\mathrm{WQI}\ \mathrm{with}\mathrm{out}\ \mathrm{iron}}{\mathrm{WQI}\ \mathrm{with}\ \mathrm{iron}}\times 100 $$
(9)

From estimated results of EWWQI as reflected in Fig. 3a and Table 5, almost 58% of samples are of “excellent” to “good” category, five samples are of “medium” category, and 29% samples are under “poor” to “extremely poor” category. Out of 29% poor category samples, 18% sample belongs to extremely populated capital city, Agartala. Depending on EWWQI values (Table 5; Fig. 3a) excluding iron concentration, 71% of samples cover “excellent” to “good” category, three samples are of “medium” standard, and “poor” to “extremely poor” category are shown by 21% samples. According to Eq. 9, percentage decrease of EWWQI between 50–80, 40–50, 30–40, 20–30, and 0–20 are detected in five, nine, seven, seven, and ten groundwater samples, respectively. Figure 4 depicts spatial distribution of EWWQI with and without Fe. From Fig. 4a, groundwater quality in the southern and northeastern parts is very poor. Although, it has been depicted that about 10 areas especially the western and central parts have good to excellent quality of groundwater. But from Fig. 4b, EWWQI without Fe portrays extremely poor water quality in the southwestern and southeastern parts, while about six micro-regions are found in the central and western parts of the study area, where groundwater quality is excellent. As per EWWQI without Fe, overall, good quality groundwater is observed in the central part of the area.

Fig. 3
figure 3

WQI values according to a entropy weight method and b arithmetic weight method

Table 5 EWWQI and WAWQI values of groundwater samples
Fig. 4
figure 4

Spatial distribution maps of EWWQI a with iron and b without iron

From WAWQI values (Fig. 3b; Table 5), only five samples cover “very poor” standard and the remaining all other samples are “unsuitable” for drinking, whereas without considering Fe concentrations, 60% (23) of samples are of “poor” to “very poor” category and the remaining samples fall under “unsuitable” category. Percent decreases of WAWQI values estimated as per Eq. 9 were noted as 70–90, 50–70, 40–50, 30–40, and 0–30 for eight, nine, nine, five, and seven samples respectively. Figure 5 depicts the spatial distribution of WAWQI with and without Fe. According to Fig. 5a, groundwater quality of the entire study area is unsuitable for drinking. On the other hand, Fig. 5b depicts that the entire study area has very poor groundwater quality except 14 small pockets located in the central and western parts of the study area.

Fig. 5
figure 5

Spatial distribution maps of WAWQI a with iron and b without iron

GW17 shows the highest WQI value in both methods as EC, Mg, TDS, Fe, and turbidity concentration in this sample exceeded the WHO permissible limit. Again, from the r value, a strong positive relationship exists between EC, Mg, turbidity, and TDS. Also, Fe and turbidity show a strong relationship. Geological strata and underlying groundwater containing excessive minerals led to increased levels of ions (EC, Mg) and TDS. High concentrations of ions suggest processes of leaching and rock-water interaction deteriorating water quality. Excessive Fe and turbidity concentration may be due to improper waste disposal, inappropriate maintenance, and functioning of a septic system. Excessive utilization of groundwater enhances Fe concentration in this location. Moreover, GW17 (Battala) is located in Agartala City where WQI value is much higher than other samples because thick population and anthropogenic sources degrade groundwater quality.

Principal component analysis

PCA technique is applied to recognize individual loadings of 11 variables in water quality. Generally, eigenvalues are utilized to achieve principal components (PCs). An eigenvalue determines major variables with maximum value. Eigenvalues ≥ 1 are considered as the most significant (Muangthong and Shrestha 2015). PCs with eigenvalues ˂ 1 were discarded on account of their low essentialness.

A scree plot diagram (Fig. 6a) has been established where the first three factors show their eigenvalues > 1. This figure shows a minor fall in slope after the third eigenvalue. Therefore, only the first three components with a total variance of 84.5% have been decided. The percentage variance, eigenvalues, and loadings of 3 PCs are given in Table 7. The PC loadings were sorted as strong (> 0.75), moderate (0.75–0.50), and weak (0.50–0.30) (Liu et al. 2003).

Fig. 6
figure 6

Principal component analysis by a scree plot and b 3D loading plot

The first component (PC1) accounting the largest proportion in total variance (56%) (Table 7) had positive loadings for EC, TDS, Mg, TH, Ca, and Cl. This may be credited to the source of natural water (Simeonov et al. 2001) and this factor is termed as “water hardness salinity” factor. However, strong loading of Ca, TH, and Mg in PC1 specifies the origin of groundwater from rock-water association with weathering process of dolomite and calcite. In PC1, dissolved salts of Ca, Cl, and Mg have the greatest influence on water quality (Kumar and Krishna 2021). PC2 clarifies about 17% (Table 7) of the total variance, where pH and alkalinity had a strong influence. This strong relationship is likewise found from r value (r = 0.751) (Table 6). PC2 represented here the physical aspects of water by which water acidic or basic nature can be communicated. PC3, contributing just 11% (Table 7) of the total variance, had moderate positive coefficients (Fe and turbidity) but was negatively loaded with DO. This statement is in agreement with the acquired results of r (Table 6). This factor (PC3) may be credited to the dissolution of solids making turbid water and pollution because of harmful toxic metal contamination.

Table 6 Correlation coefficient (r) matrix of groundwater samples
Table 7 Loading values of principal component of groundwater samples

Figure 6b shows a 3D plot of PCA loading values. This figure affirms that a solid association exists among pH and alkalinity (represents acidic or basic water source). Fe, DO, and turbidity in one group signify some metallic ions suspended in water and make water turbid. Again EC, TDS, Mg, TH, Ca, and Cl are derived from a single group (corresponds to water hardness and dissolution of solids).

Cluster analysis

CA derives three clusters out of 38 monitoring sites. Cluster 1 includes 20 stations, divided into two sub-groups (22, 24, 37, 20, 32, 7, 21, 9, 38, 10, 12, 1, 25, 19, 14, and 18) and (5, 6, 11, 30) (Fig. 7). This cluster is termed as a less polluted zone as 18 samples except two (GW18 and GW30) that have good water quality. Cluster 3 comprises of only one sampling station (GW17), termed as a highly contaminated region as in the “WQI analysis” section where it has been already discussed that GW17 has the highest WQI value due to the thickly populated region and anthropogenic activities. Cluster 2 covers the remaining 17 sites (Fig. 7) considered as moderately contaminated regions, having 8 samples of poor quality water. Comparing to cluster 1, cluster 2 has more number of poor quality water samples, that is why it is considered a moderately polluted region. Linkage distance of cluster 1 and cluster 2 are less than cluster 3 indicating that samples in cluster 1 and cluster 2 have almost similar characteristics to cluster 3.

Fig. 7
figure 7

Cluster analysis of sampling locations

Another dendrogram view of parameter cluster analysis is shown in Fig. 8. Here, cluster 1 includes five environmental parameters: pH, DO, Fe, Ca, and turbidity. This cluster may be from the natural and metallic sources along with the rock-water interaction process. Cluster 2 covers three parameters: TH, Mg, and Cl, indicating hardness source is mainly dominated by Mg and the dissolution of Cl in groundwater. High Mg content than Ca suggests weathering of minerals like dolomite and magnesite. Cluster 3 comprises of EC, TDS, and alkalinity, which could be attributed to ionic influence and salt concentration in groundwater.

Fig. 8
figure 8

Cluster analysis of physicochemical parameters

Conclusions

To obtain the present status of fitness of groundwater for drinking, a few investigation techniques like WQI, correlation analysis, CA, and PCA have been employed effectively based on analyzed data of 38 groundwater samples. According to the outcomes of WQI, the higher values of Fe and turbidity play a significant commitment to make non-potability of groundwater. Comparing two WQI methods, the EWWQI method gives more precise and reliable outcomes than the WAWQI method because the EWWQI method consistently avoids personnel judgments (that assume weightage factor to each water quality parameter), which are mulled over for the WAWQI method. The WAWQI technique does not provide detailed information about the actual condition of groundwater due to over-emphasizing phenomena. In this study, 11 parameters are reduced into 3 underlying factors (PC1, PC2, and PC3) by the PCA method to distinguish water quality degradation. This technique reflects water quality deterioration because of the contribution of metallic ion concentration, turbidity, and DO for PC3; pH and alkalinity for PC2; and the remaining parameters responsible for PC1. CA results agreeably validate the outcomes of WQI. In this manner, from the above investigation, groundwater quality is ought to be observed routinely, and accordingly, treatment has to be adopted only when the sample will be not fit for drinking.