Introduction

The demand for groundwater has increased substantially during the last few decades for drinking, agriculture, irrigation, and industrial activities (Gleeson et al. 2012). In urban areas, the availability of potable water supply is becoming more problematic due to unplanned development, population explosion, etc. Climate change and fast urbanization are the main cause of water shortages in many parts of the world (Sophocleous 2004). Around 90% of the rural and more than 50% of the urban population get their drinking water supplies from groundwater resources. Recent studies show that groundwater storage has declined with time (Nair and Indu 2021), which is a matter of concern for long-term groundwater management plans for sustainable development. The unavailability of suitable water may affect the economic growth and development of society. Groundwater aquifers in coastal areas are also prone to seawater intrusion through hydraulically connected faults and lineaments extending toward the sea (Ketabchi and Ataie-Ashtiani 2015). Saline water intrusion caused by excessive groundwater pumping may considerably affect the potable water supply in coastal areas (Werner et al. 2013), up to at least 100km from the coastline where more than 40% population resides. The groundwater quality is also affected by mining, industrial uses, and untreated sewage discharge in urban areas (Gupta 2002; Robson et al. 2006). Furthermore, the population residing in urban areas is more likely to be exposed to various chemical pollutants during their lifetime (Huang et al. 2020). Groundwater pollution has varying impacts and consequences for different land-use land-covers patterns (Gómez et al. 2017). It has been observed that poor quality of drinking water may be the primary cause of about 80% of diseases (Mukherjee et al. 2021). Therefore, considering the health hazards and other issues associated with polluted subsurface water, evaluation of groundwater quality is crucial (He and Wu 2019).

The hydrogeochemistry of groundwater has been studied from time to time, but little attempt has been made to understand spatio-temporal variation of groundwater quality (2012 to 2022) in the coastal state of Odisha, India. In this paper, we used Shannon’s entropy theory, GIS technique, and statistical analysis to examine various geochemical characteristics of groundwater in Odisha, and impacts of anthropogenic activities and coastal vulnerability. The first objective is to evaluate major controlling factors of groundwater quality by using Shannon’s entropy method. Statistical analysis has been used to analyze and understand the relationship between hydrogeochemical parameters that influence groundwater quality. Variations of LULC between 2012 and 2022 have been used to understand the impacts of anthropogenic, and industrial activities on groundwater quality. The analysis and results may provide valuable inputs to the planners and policymakers involved in water resource management for sustainable development.

The water quality index (WQI) is a widely used method to estimate the quality of water for drinking and irrigation purposes (Goswami and Rai 2022). The aggregation techniques of various sub-indices that are used in WQI studies are discussed in detail by Banda and Kumarasamy, 2020. In brief, there are numerous aggregation methods, but the most common are the multiplicative (geometric), and additive (arithmetic) (Sutadian et al. 2016). The additive method is widely used for aggregating sub-indices in a water quality study (Walski and Parker 1974; Stoner 1978). In some aggregations techniques used for WQI, all parameters are assigned a unit weight (Mohebbi et al. 2013), which appears impractical given that different parameters have varying degrees of influence on water quality. As a result, appropriate weights for water quality parameters must be determined and considered in the aggregation of parameters for WQI estimation (Singh et al. 2008). Among the various WQI evaluation standards such as the Bureau of Indian Standards (BIS), World Health Organization (WHO), and Indian Council of Medical Research (ICMR), the BIS standard has been found to be best suited for WQI analysis (Seifi et al. 2020).

For WQI analysis, calculating the weights of parameters is an important step. Several methods are adopted for calculating weights, e.g. based on the relative importance of variables relative weights on a scale of 1 to 5 are assigned (Brown et al. 1972; Tiwari and Manzoor 1988). Similarly, weights are also estimated using Analytical Hierarchy Process (AHP) (Saaty 1980). However, these are subjective methods, and the assigned weights are likely to be biased by expert opinion (Sutadian et al. 2017). However, the weights estimated using Shannon’s entropy technique (Shannon 1948; Hassan and Rai, 2020), are relatively free from such subjective biases. The term entropy refers to the element’s unpredictability as well as the assessment of fuzziness in a system. Shannon entropy is based on probability theory and uses relative weights of attributes to calculate the average information provided to the decision-maker (Shannon and Weaver 1947). Therefore, the water quality index estimated using entropy weights (EWQI) is considered better (Li et al. 2010; Chang and Lin 2014; Hasan and Rai 2020).

Salinization, or seawater intrusion, in coastal aquifers, has become a major concern across the world, particularly in areas where freshwater aquifers are hydraulically linked to seawater (Park et al. 2005). The Na+ vs Cl-, Na+/Cl- molar ratio and Revelle co-efficient has been used preliminary to understand the impact of seawater intrusion in the study area. On the other hand, the mineral saturation index (SI) has been used to understand the degree of equilibrium of different mineral phases that tend to dissolve and precipitate in the groundwater (Li et al. 2010), which helps to assess the degree of coastal aquifer contamination by seawater intrusion. The details of the methodology and results are discussed in the following sections.

Integrating hydrogeochemical data for understanding groundwater quality with the Geographic Information System (GIS) provides valuable insights into the spatial variability of water quality (Nayak et al. 2017). Several researchers have conducted the hydro-geochemical and groundwater quality study for the region (Das and Anandha Kumar 2005; Shukla et al. 2010; Madhav et al. 2018; Sahu et al. 2021; Goswami and Rai 2022). In this paper, we have attempted to study the spatiotemporal variation of the groundwater quality index, determined using the entropy method, Geospatial, and statistical techniques. We also study the impacts of LULC change, seawater intrusion impacts, and other anthropogenic factors on the water quality in the region. Statistical methods have been used to understand the relationship between various hydrogeochemical parameters influencing groundwater quality. The principal component analysis (PCA) has been used to understand important variables influencing the water quality in the region (Hu et al. 2013; Xiao et al. 2019), whereas attribute-based clustering has been used to identify similar types of water samples. The data, methodology, and results are discussed in the following sections.

Data and Methodology

Study area

The study area is located in the central-eastern part of India (Fig. 1), between latitudes of 17°48′–22°34′N and longitudes of 81°24′– 87°29′E. The region can be broadly classified into four zones i.e., the Central River Basin, the Northern Plateau, the Coastal Plains, and the Eastern Ghats Hill Ranges. The granites, granite-gneisses, khondalites, schistose, charnockites, calcsilicates, phyllite, quartzites, shale, marble, sandstone, and limestone rocks are the most common subsurface basement rocks. The Precambrian basement covers 80% of the study region, with ages ranging from Precambrian to Cenozoic. The Mahanadi River Basin occupies around 45%, and the coastal plains account for around 10% of the overall geographical area. The region falls under a tropical climate and is characterized by humid, high temperature, and high to medium rainfall. Annual rainfall varies from 1211-1697mm with an average of 1481mm, where maximum rainfall is observed during the months of July and August, with 85% of rainfall due to the southwest monsoon. The annual temperature varies from ~3 and 6°C in winter, and 45°C during the summer.

Fig. 1
figure 1

Geographical location of the study area. Sampling locations clustered in 5 groups based on similarity in hydrogeochemical characteristics, are shown by circles of different colours. Major lineaments are shown by black-lines

The water samples from around 725 sampling locations have been obtained from published reports of various years of the Central Groundwater Board, Govt. of India (CGWB 2012-13, 2015-16, 2018-19, 2021-22). A total of ten groundwater physicochemical parameters have been studied e.g., pH, fluoride (F-), calcium (Ca2+), magnesium (Mg2+), Chloride (Cl-), total dissolved solids (TDS), HCO3- (bicarbonate), sulfate (SO42-), TH (total hardness), iron (Fe2+). The outlier in the data sets was removed using statistical methods, where data values below the Q1–1.5 IQR or above Q3+1.5 IQR, were identified as the outlier. The Q1 and Q3 are the 25th and 75th percentiles, and IQR is the interquartile range (Q3–Q1). The point data were transformed to raster data by the Inverse distance weighting (IDW) interpolation technique (Watson 1992), and understand the spatial-temporal variation of the several parameters. IDW is a data interpolation method, which is used to evaluate values of data at unsampled locations, using a linear combination of known data from observed locations, using weights that are inversely proportional to the distance of original sampling locations (Watson 1992; Manson et al. 1999). Furthermore, Gibbs's diagram has been used to understand the dominant mechanism that influences the overall groundwater quality in the study region.

Furthermore, to identify broader similarities and differences in the hydrogeochemical composition of the water samples in the study area, the data samples were categorized into clusters based on their composition or attributes. A cluster is a collection of objects aggregated together because of certain similarities. The main aim of spatial clustering is to partition spatial data into a meaningful subclass called clusters, such that objects in the same cluster have similarities to a great extent, and are dissimilar to those in other clusters. For geo-referenced 2-D spatial data, several spatial clustering algorithms based on attributes have been developed, e.g. partitioning algorithms such as k-means (MacQueen 1967) have been applied to solve spatial clustering problems (Miller and Han 2009). K-means clustering is one of the simplest and most popular unsupervised machine learning algorithms. We use k-means algorithms for clustering the hydrogeochemical compositions based on a number of attributes such as pH, TH, and TDS, and categorized the data into five broad categories (Fig. 1).

Correlation Matrix and Principal Component Analysis (PCA)

A correlation matrix study was performed to determine the degree of similarity between different physicochemical parameters. Component matrix analysis has the advantage of classifying the variable data into some organized components, which can be used to draw meaningful inferences. The correlation between the hydrogeochemical groundwater parameters is determined using Pearson's correlation coefficient (r) (Ustaoğlu et al. 2020). The correlation coefficient is a useful statistic for illustrating the relationship between dependent and independent variables (Fleury 2005).

Principal Component Analysis (PCA) is another statistical technique used to reduce the dimension of the bulk amount of data and organize it into well-classifiable groups without losing information. PCA was used to identify probable sources of groundwater facies and variables regulating hydrogeochemistry in the study area. Several studies have employed PCA to calculate and identify the primary factors influencing groundwater quality by reducing the dimension of various physicochemical parameters (Williams and Abdi 2010). Principal Components are estimated from the eigenvalue analysis of the correlation coefficient matrix derived from various parameters and were computed using the IBM SPSS Statistics (IBM Corp, 2020) software. The rotated component matrix is the foundation of PCA analyses. Variables with >0.5 rotated loading are considered significant (Sikdar and Chakraborty 2008). PCA illustrates the relationship between primary components influencing the data, and factor loading (Xiao et al. 2019).

Entropy-based Water Quality Index (EWQI)

The following equation has been applied to analyse the entropy, where wj represents the weights which are functions of entropy ("ej"). The relative weights are function of entropy, where smaller weights are associated with higher entropy and vice versa (Shannon 1948).

The entropy of the system is defined as

$${e}_J=\frac{1}{{\mathit{\ln}}_{(m)}}\sum {r}_{ij}\ln \left({r}_{ij}\right)$$
(1)

Where,

\({r}_{ij}=\frac{x_{ij}}{\sum_{i=1}^m{x}_{ij}}\) is the normalized decision matrix (rij).

Finally, the entropy weights (wj) are given by

$${w}_j=\frac{1-{e}_j}{\sum_{j=1}^n\left(1-{e}_j\right)\ }$$
(2)

Then, the entropy-based water quality index (EWQI) ca be represented by (Li et al. 2010)

$$EWQI=\sum \nolimits_{j=1}^n\left({w}_j\times {q}_j\right)=\sum \nolimits_{j=1}^n\left[\left(\frac{1-{e}_j}{\sum_{j=1}^n\left(1-{e}_j\right)}\right)\times \left(\frac{C_j}{S_j}\times 100\right)\right]$$
(3)

Where "Cj" represents the different physicochemical parameters, and "Sj" indicates the water quality ranking standard as per BIS (2012) values for drinking water. EWQI is considered a better technique than the traditional WQI techniques which are based mostly on subjective weights of different parameters.

Saturation Index (SI)

The mineral saturation index (SI) is used to understand the degree of equilibrium of different mineral phases that tend to dissolve and precipitate in the groundwater (Larson et al. 1942). SI values are calculated using the following equation,

$$SI=\mathit{\log}\left(\frac{IAP}{K}\right)$$
(4)

where "IAP" is an ion activity product, and K represents the equilibrium constant. A positive SI value (SI>1) suggests the oversaturated condition where minerals will tend to precipitate, whereas a negative value of SI (SI<0) indicates undersaturated conditions i.e., an ongoing mineral dissolution stage. SI value of approximately zero (SI∼0) indicates an equilibrium condition between water and the mineral (Goswami et al. 2022).

Results and Discussion

Physicochemical Variations

The highest and lowest pH values were observed from 6.5 and 8.59 with a mean value 8.08 and 6.5 to 8.9 with a mean value 7.83 in 2012-13 and 2018-19, respectively (Fig. 2a, b, c), and vary from slightly acidic to alkaline in nature. While pH in 2021-22 varies from <6.5 to 8.71 and with a mean of 7.83. The highest pH concentration has observed mostly in the western and northeastern parts of Odisha for the year 2012-13 (Fig. 2a). While the sspatiotemporal variation between the years 2018-19 pH concentration varies (Fig. 2b). The low pH is caused by acid mine drainage when a significant amount of Fe2+ becomes stable and dissolved in groundwater under reducing conditions (Fig. 2b).

Fig. 2
figure 2

Spatio-temporal variation of physicochemical parameters i.e. pH (a, b, c), TDS (d, e, f), and TH (g, h, i) during 2012-13, 2018-19 and 2021-22

The minimum and maximum concentrations of TDS fall within a range of 37.17 and 1628.85 mg/L with mean value 414.01, 40 and 2766 mg/L with mean value 356.89, 45 and 2766 with mean value 354.85 mg/L during the period between 2012-13, 2018-19 and 2021-22 respectively (Fig. 2d, e, f). Total Dissolved Particles were calculated using the formula TDS=K*Ec (Walton 1989). Because of the mixing of fresh and saline water, and comparatively high EC owing to dissolved ions, the spatial-temporal variation map of TDS values shows a large concentration near the eastern and western parts of the region. Furthermore, because TDS represents the sum of all dissolved ions, the level of groundwater contamination due to anthropogenic activities can be estimated by comparing TDS levels with different ionic types (Rao et al. 2021).

Furthermore, Total Hardness (TH) of groundwater is defined as the amount of multivalent cations in the sample in the presence of alkaline earth (Nag and Das 2017). The higher TH concentration was observed mostly in the western parts of Odisha (Fig. 2g, h, i). The spatio-temporal variation (2018-19) suggests the overall TH concentration has increased (Fig. 2h). The TH values in 2012-13, 2018-19, and 2021-22 vary from 1.0 to 847.79 with mean value of 205.55 mg/L; 25.39 to 1440mg/L with mean value of 210.53 mg/L and 26.39 to 1540 with mean value of 213.59 mg/L in the study area, respectively, indicating that some areas have exceptionally hard water, which can't be used for domestic purpose. According to BIS (2012) standards, the permissible limits are 200mg/l and 600mg/l. Increasing TH in the central to NW parts suggests excessive fertilizer use in the agricultural land, mines, seepage, and runoff from the soil, which percolates and contaminates groundwater quality and increases the hardness in the study region.

The spatial-temporal variations of Ca2+ and Mg2+ are shown in Fig. 3(a, b, c, d, e, f), and are seen to be increasing between 2012-13, 2018-19, and 2021-22. For the year 2012-13 the Ca2+ mostly varies in the eastern and central parts of the study region. The spatio-temporal variations (2018-19) of the Ca2+ concentration increase towards the western, Central, and north-western parts of the study area (Fig. 3 a, b). While in 2021-22 the Ca2+ ions are seen to be increased towards the western, central, and few eastern parts of Odisha (Fig. 3c).

Fig. 3
figure 3

Spatio-temporal variation of the ions i.e. Ca2+ (a, b, c), Mg2+ (d, e, f), and HCO3- (g, h, i) during 2012-13, 2018-19 and 2021-22

The highest concentration of Mg2+ was observed in the western parts of the study region for the year 2012-13 (Fig. 3d). The spatio-temporal variation (2018-19) of Mg2+ concentration is increased along the eastern and western parts of Odisha (Fig. 3e). During 2021-22 the increased of the Mg2+ concentration along the western, central parts of the study area (Fig. 3f). The highest HCO3- concentration was observed in the central parts for 2012-13 (Fig. 3g). The spatial-temporal variation of HCO3- (2018-19) and (2021-22) concentration increased along the eastern, central, and western parts as well (Fig. 3h, i). The highest SO42- was observed in the north-western parts for the year 2012-13 (Fig. 4a). The spatio-temporal distribution of SO42- concentration is increased towards the eastern and central and western parts of Odisha (Fig. 4b, c). The highest concentration of Cl- was observed along the western and northeastern parts for the year 2012-13 (Fig. 4d), whereas the spatial distribution of Cl- concentration is slightly increased along the eastern and western parts of the study area for the year 2018-19 and 2021-22 respectively (Fig. 4e, f).

Fig. 4
figure 4

Spatio-temporal variations of physicochemical parameters: SO42- (a, b, c); Cl- (d, e, f) during 2012-13, 2018-19 and 2021-2

The mean of Cl- concentrations are 97.81, 97.20 and 98.79 mg/L for the year of 2012-13, 2018-19 and 2020-21 respectively. The mean SO42- concentration values are 21.52, 25.80, and 25.51 mg/L for the year of 2012-13, 2018-19 and 2020-21 respectively. The mean concentration value of HCO3- 154.44 (2012-13), 178.62 (2018-19), and 210.89 (2020-21) mg/L respectively. The mean concentration value of Mg2+ 22.75, 25.08, and 25.71 mg/L for the year of 2012-13, 2018-19 and 2020-21 respectively. The mean concentration value of Ca2+ 44.84 (2012-13), 42.96 (2018-19), and 44.29 (2020-21) mg/L respectively. The mean concentration of F- value is 0.45, 0.47, and 0.49 mg/L for the 2012-13, 2018-19 and 2020-21 respectively.

A correlation matrix analysis (Table 1) shows that physicochemical parameters such as Ca2+ (r=0.85 and r=0.79); Mg2+(r=0.91 and r=0.90); Cl- (r= 0.80 and r=0.82); and TDS (r=0.0.84 and r=0.84) are strongly and positively correlate with TH for the years 2012-13 and 2021-22. TH is weekly correlated with F- (r=0.19 and r=-0.03). TDS is positively correlated with Cl- (r= 0.94 and r=0.90) for 2012-13 and 2021-22, which indicates that groundwater may be influenced by seawater. The TDS and Cl- values confirmed intermixing of fresh/saltwater in the coastal area through faults and fractures. Whereas Ca2+ and SO42- correlation (r=0.56 and r=0.42) for the years 2012-13 and 2021-22 indicates the possible impact of the dissolution of evaporites (gypsum) e.g., CaSO4.2H2O = Ca2+ + SO42- + 2H20.

Table 1 Pearson’s correlation matrix of the hydrogeochemical parameters (n=725)

For 2012-13, only two Principal Components (PC1, PC2) were sufficient to explain most of the variance (73.755%) of the data (Fig. 7a). The first principal component (PC1) consists of TDS, TH, Ca2+, Mg2+, Cl-, and SO42- which are responsible for 58.517% of overall variances data. Similarly, PC2 which is responsible for 15.238% of the overall variance is dominated by pH, F-, and HCO3-. On the other hand, for 2018-19 three principal components (PC1, PC2, PC3) were required to explain most of the variance (74.860%) of the data (Fig. 7b). The first principal component (PC1) comprised TDS, TH, Ca2+, Mg2+, Cl-, and SO42- and explains the overall 52.950% variance. PC2 explains the 12.782 % (2018) total variance consisting of pH, HCO3-, and Fe2+ PC3 explains the 9.128% (2018) overall variance in the study area and is strongly loaded with F-. While in 2021-22 the three PCA accounts for 77.783% of the variance in the data set. The first principal component (PC1) comprised of TDS, TH, Ca2+, Mg2+, Cl-, SO42- and explains 52.543% variance in the data. The Second principal component i.e., PC2 comprised of pH and HCO3, Fe2+ suggests the 14.77% total overall variance of the groundwater quality. The third principal component extracted mainly F- i.e., PC3 explains the overall 10.470% overall variance of the study area.

Ionic variations

The concentration of Ca2+ (Fig. 3a, b, c) in groundwater is influenced by the rock types and sedimentary rock of marine origin, which have a mixture of anhydrite, gypsum, aragonite, calcite, and dolomite. Calcium in groundwater may be caused by the chemical breakdown of calcic-plagioclase feldspars and pyroxenes (Ganyaglo et al. 2010) or may agricultural fertilizers originate Ca2+ from lime (Saha et al. 2019). Ca2+ and Mg2+ ions are released into groundwater via weathering of dolomite, limestone, anhydrite, and gypsum as well as cation exchange activities (Goswami and Rai 2022). Ca2+ and Mg2+ concentrations may have been influenced by long-term natural processes in eastern and western parts of Odisha specifically where concentration shows higher (Fig. 3a, b, c, d, e, f). Anthropogenic activities such as the use of fertilizer, sewage discharge, and animal waste disposal may have contributed to the ionic concentration variability as per LULC variations in the study area. Presumably, an increase in the concentration of Ca2+ and Mg2+ in the eastern to western parts results from the dissolution of evaporite minerals (anhydrite, gypsum), and sedimentary rocks (dolomites), whereas Mg2+ is possibly contributed by anorthosite and albite rock. The excess magnesium is found in groundwater as a result of the disintegration of ferromagnesian rocks such as pyroxene, amphiboles, olivine, and dark-colored micas and domestic wastes or precipitation of Ca2+ (Tiwari and Singh 2014).

The correlation between Ca2+, Mg2+, and Th suggests that untreated industrial wastewater may be penetrating and mixing with subsurface water, enhancing the total hardness. Magnesium degrades soil composition by increasing salinity and sodium in water, resulting in alkaline soil and crop failure (Paliwal 1972). Thapa et al. (2017) also found that high Mg2+ concentrations in irrigation water can harm plants due to decreased K+ in soils. Magnesium hazard (Ragunath, 1987) is given as

$$\textrm{Magnesium}\ \textrm{hazard}\ \left(\textrm{MH}\right)=\frac{Mg^{2+}\times 100}{\left({Ca}^{2+}+{Mg}^{2+}\right)}$$

Here, all parameters are in mg/L. Magnesium Hazard was estimated to determine the consistency of groundwater for irrigation purposes. The calculated MH values ranged from 3.25 to 100, averaging 45.09. Approximately 61% of the groundwater samples appear to be suitable for irrigation, while the other 39% seem hazardous and unsuitable for irrigation (MH >50%).

The presence of HCO3- indicates the dissolution of carbonate/sedimentary rocks weathering and carbonic acid dissolving in aquifers (Stumm and Morgan 2012; Singh et al. 2012) or maybe silicates-bearing minerals in the study area (Fig. 3g, h, i). Depending on the presence of dissolved CO2 and carbonic acid, the carbonate minerals present along the path will be dissolved during infiltration of rainwater and irrigated water (CO2 + H2O = H2CO3). HCO3- and Ca2+ are released into the groundwater during recharge. The Cl/HCO3 ratio varies between 0.1 and more than 10, suggesting that the Cl- presence in groundwater is caused by irrigation's return flow or natural salt dissolution (Lusczynski and Swarzenski 1996). Chloride is the major seawater ion but occurs in much lower concentrations in groundwater. While HCO3- is the most common anion in groundwater but occurs in low concentrations in marine water. Therefore, the Cl/HCO3 anions ratio is used to measure the degree of seawater contamination of freshwater aquifers. Our study indicates that 55.17% of groundwater samples were of good quality, whereas 37.14% were relatively less contaminated, and 7.58% of samples were moderate to highly contaminated (Table 5).

The presence of SO42- in subsurface water suggests exposure to soluble sulfate-bearing minerals/evaporate rocks and anthropogenic activities in the region. High SO42- may also result from chemical fertilizers used in irrigation and industrial wastewater in the study region. It is observed that SO42- and Cl- (Fig. 4) show a mean value (>126 mg/L) above the permissible limit (BIS, 2012), which may be due to sulphur gases released by industry and utilities are oxidized and contaminating the groundwater. The major anions SO42- are rapidly depleted in the coastal aquifer due to seawater intrusion (De Montety et al. 2008). For halite (Na+Cl-), and gypsum (Ca2+SO42−.2H2O) indicates that the continued mineral dissolving process is leading to salinization.

The presence of more Cl- in the groundwater near coastal areas suggests infiltration of seawater, which raises the salinity. The chloride concentration in drinking water should not exceed 250mg/l (0.25 g/l) according to WHO guidelines (Manh et al. 2014). Chlorine can cause dry, itchy skin, stomach aches, vomiting, and diarrhea. The western parts show higher Cl- concentrations probably due to contaminations of industrial wastewater or domestic sewage, inorganic fertilizer, decomposition and dissolution of decomposition of organic matter, effluent from septic tanks, discharge of domestic trash, and industrial wastewater near the industrial area (Appelo and Postma 1996; Kelly 2008). While eastern parts (Fig. 4d) show higher concentration due to excessive penetration of seawater in the study regions via faults and fracture (Goswami et al. 2022).

The excess F- (>1.0mg/L) was also observed in the study area, and are shown by a red circle (Fig. 5a). The Excess of fluoride in groundwater is a major threat, and can cause dental, skeletal, and non-skeletal fluorosis and other health problems. The higher F- concentrations are mostly due to the weathered of fluoride-bearing minerals/rocks (fluorite, apatite, topaz, muscovite, cryolite) (Hem 1989; Goswami and Rai 2022). Apart from geogenic factors, F- may also be contributed due to anthropogenic activities such as the burning of coal, extraction of aluminium and steel, and fertilizer industry particularly phosphate in areas such as Sambalpur and the Bargarh districts which are in the western parts. High concentrations particularly in some geothermal regions such as Khurda, Anugul, and Nayagarh districts may be due to the presence of Hot-springs and their interactions with the subsurface water (Kundu et al. 2001).

Fig. 5
figure 5

(a) Spatio-temporal variation of F- concentration, the red cross circle represents the hot spring; (b) Spatial distribution of average Fe2+ Iron concentration in the region

The Fe2+ concentration has varied from 0 to 65.67 mg/L between the 2015–2018 years, which exceeds the permissible limit (Fig. 5b). Ferric iron (Fe3+) rapidly precipitates as iron oxide (Fe2O3), iron hydroxide (FeHO2), or poorly crystalline to amorphous material at normal groundwater pH values. Ferrous iron is stable under reduced conditions and will remain dissolved in groundwater. Naicker et al. (2003) discovered that oxidation of pyrite (FeS2) present in mine tailing dumps polluted and acidified groundwater inside the mining district. Acidic groundwater is corrosive; thus, it dissolves iron and increases its concentration when it flows through pipes, pumps, and other equipment. The maximum chromite and iron mines are situated in Odisha's northwest and western parts. The Fe2+ concentration increased between 2015-16; which signifies that the rapidly increasing mining industry relies heavily on iron and chromite ore. However, if too much Fe2+ is present in water, it creates red oxyhydroxide precipitates that stain clothing and plumbing fixtures, making it an unwanted pollutant in home and industrial wastewater sources. Some of the important Fe2+ belts in the study area include Jamda-Koira Belt (Fe), Bonai-Badampahar Belt (Fe), Noamundi-Jamda Belt (Mn, Fe), East Coast Province (Al), Garumahisani Province (Fe), etc. As per the BIS standard limit of Fe2+ concentration in groundwater is 0.3mg/L, the excess amount of Fe2+ shown in the map (Fig. 5b) in the northern and south-western parts of Odisha due to excess Fe2+ mining industries located and water starts degrading. From 2012 onwards, the iron concentration increased and deteriorated the groundwater quality. Furthermore, a recent study has been done by Ranjan et al., 2020, which suggested that the mining activity become a serious threat to forest resources of the Jharsuguda district of Odisha between 2006-2016.

The saturation index analyses (Fig. 6) show that precipitation and dissolution of aragonite, calcite, Halite, anhydrite, dolomite, and gypsum mineral assemblages play a significant role in contributing HCO3, Ca2+, SO42-, Mg2+, and Cl. The samples appear to be oversaturated (SI>0) with respect to aragonite, calcite, and dolomite minerals assemblages and undersaturated with respect to Halite, anhydrite, and gypsum (Fig. 6a). The bi-variate SI plots for Halite, gypsum, and anhydrite shows a positive correlation with SO42-, whereas calcite, dolomite, and aragonite show a very weak to marginal negative correlation with SO42- (Fig. 6b, c, d).

Fig. 6
figure 6

a Mineral saturation index indicating precipitation and dissolution of mineral assemblages, (b, c, d) variations of SI of Halite, Gypsum and Anhydrite w.r.t. SO42-

The analysis of the first principal component (PC1) for the year 2012-13 (Table 2a; Fig. 7a) represents that the region is influenced by a natural process such as seawater intrusion, dissolutions of evaporate minerals, water-rock interaction, cation exchange, household wastewater, etc. The second component i.e., PC2 suggests an association of bicarbonates with F- in groundwater (Table 2a), which may be related to groundwater recharge. This also indicates that groundwater may have been deteriorating due to weathering of geogenic rocks, interaction with fluoride-bearing minerals/rocks or anthropogenic activities or interaction with Hot-springs water.

Table 2 Principal component analysis for groundwater sample
Fig. 7
figure 7

Principal Component Analysis (PCA) of the groundwater samples

PC1 for the year 2018-19 and 2021-22 indicates a strong influence of natural processes such as saline water intrusion, dissolutions of evaporate minerals, water-rock interaction, cation exchange (Table 2b, Fig. 7b). PC2 explains the overall variance may be due to anthropogenic activities, fertilizers, irrigations, pesticides, mining development, and mineral explorations (mining industrial swages, untreated mining wastages, wastewater, etc). The HCO3- is derived from the geogenic sources and detailed explanations given in reports (Goswami and Rai 2022). In recent years, groundwater quality in the Odisha districts of Jajpur, Keonjhar, Sundargarh, Dhenkanal, and Nuapada has worsened due to heavy contamination from iron (Fe2+) and chromite mining wastes. The third principal component (PC3) suggests the groundwater quality deteriorates due to geogenic factors of weathering of fluoride-bearing minerals/rocks or anthropogenic activity.

The Gibbs diagram (Gibbs 1970) is divided into three distinct fields, i.e. water-rock interaction, evaporation, and precipitation dominance field on the basis of cations [(Na+K)/(Na+K+Ca)] and anions [Cl/(Cl+HCO3)] concentration against the TDS (Fig. 8). Chemical weathering, evaporation-crystallizations, and ionic exchange are the major dominant the geochemical processes that affect groundwater chemistry and regional geology of the study area. The groundwater samples broadly fall within the water-rock interaction and evaporation dominance field. The evaporation zone also illustrates that marine saline water or the dissolution of evaporation-bearing minerals can have an effect on the regions (Halite). As a result, PC1 which contributes to the dominant ions in the study area, may be due to the natural hydrogeochemical environment in all the years under consideration.

Fig. 8
figure 8

Gibs Plot showing major processes controlling groundwater geochemistry for the year 2021-2022

Entropy-Based Water Quality Index (EWQI)

The entropy weights estimated for the anions such as F- (wj=0.235), TDS (wj=0.157), Cl- (wj=0.156), and SO42-(wj =0.128) indicate that these ions have more influence out of the ten parameters that were considered in this study, as they have the highest entropy weights (Table 3). The weights for Ca2+ and Mg2+ are 0.055 and 0.059 respectively, and are less than 0.10, indicating that these cations have a minor influence on the overall groundwater quality. The weight for pH was found to be 0, indicating that pH may not contribute much to the groundwater quality variations. The hydrogeochemical parameters sorted based on their relative weights and impact on groundwater quality can be arranges in sequence F->TDS>Cl->SO42->Fe2+>Mg2+>TH>Ca2+>HCO3->pH in a decreasing order for the year 2018-19 and 2021-22 respectively. Thus, as per the entropy calculation, five physicochemical parameters F-, Cl-, TDS, SO42-, and Fe2+ have most impact the groundwater quality.

Table 3 The estimates of the entropy (ej), and the corresponding weights (wj) for the year (a) 2012-13; (b) 2018-19, by Shannon’s entropy method

Water quality in the region can be categorized into five sets i.e. excellent to good water quality based on the EWQI values for the year 2012-13 (Fig. 9a). EWQI of less than 50 suggests excellent (rank 1) water type, for 50<EWQI<100 the water quality can be considered good quality (rank 2), whereas 100<EWQI<150 indicates average water quality (rank 3). A water quality index between 150 and 200 indicates poor water quality (rank 4), whereas a WQI of more than 200 suggests inferior water quality (rank 5) (Goswami and Rai 2022). In some areas, the EWQI is less than 100, which is suitable for drinking and cooking (Gao et al. 2020). An EWQI of 100 or more poses risk to human health and may be inappropriate for household use.

Fig. 9
figure 9

a, b, c The Spatio-temporal variation of the groundwater quality during 2012-13; 2015-16; 2018-19; 2021-22 (c) Location of urban, Industrial, and major mining and mineral deposits areas are shown in by pink triangle (major town), yellow Circles and stars; (d) The water quality index (WQI) plotted rank with respect to the area (%); (e) Bivariate plot of Na+ vs Cl-

For 2012-13, the overall water quality was excellent to good, while in 2015-16 to 2021-22 water quality may have been degraded due to the infiltration of contaminants derived from industrial, mining, household, and fertilizers wastewater discharges. Most of the groundwater samples fall within ranks 1 and 2, i.e., excellent to good quality. On the other hand, relatively average, poor, and very poor groundwater quality shows in small patches in the study area (shown in yellow, orange, and red colour in Fig. 9c; i.e., ranks 3, 4, and 5 appear to be concentrated in the west, central, northern and eastern parts of Odisha (Fig. 9c). Approx. 98% study area shows excellent water quality (rank 1), whereas ~2% shows good water quality (rank 2) in 2012-13 (Fig. 9a). While in 2015-16, ~87.94% study area shows excellent water quality followed by good (11.77% area), average (0.26% area), poor (0.021% area), and very poor (0.005% area) water quality (Fig. 9b). For 2018-19, 71.42% of the study areas show excellent water quality followed by 25.33% (good), 2.54% (average), 0.58% (poor), and 0.12% (very poor) respectively (Fig. 9c). In the year of 2021-22, 42% study area shows the excellent water quality, good (24.33% area), average (3.54% area), poor (2.58% area) and very poor (1.12% area) respectively (Fig. 9d).

The majority of urban and rural settlements are concentrated in fairly small areas (0.31 percent) around major cities. Growing population density along the coast may also be a significant factor in the deterioration of groundwater quality in north-eastern and eastern parts (Fig. 10c). Some of the important mining areas are located in the northern parts of Odisha, such as South Kaliapani, Daitari, Sukrangi, Kaliapani-Block in Jajpur district; Banspani, Khandbandh, B.P.J., Mahaparbat, Sakradihi-Dubna, Bangur, Seremda-Bhadrasahi, Dalki, Baniapank Gandhamardan block in Keonjhar district and Gorumahisani, Badampahar and Suleipat in Mayurbhanj district. In the western parts of Odisha major mines are located Budhapara block in the Nuapada district. While the north-western parts of Odisha, the major mines are Koira-Bhanjpali, Koira-Kasira, Rantha, and Kumritar located in the Sundargarh district. Most of the mines in the eastern parts are located in the Rayagada district and Dhenkanal district as per the Odisha Mines Corporation (OMC). Other major mining belts are located in the western, north-western, and western parts of Odisha (Fig. 9c), such as the Baula-Nausahi Belt (Cr), Sukinda- Nausahi Belt (Cr), Adash Raimal Belt (Cu), Sukinda Belt (Cr), etc. All these factors may have influenced the subsurface water quality in the study region.

Fig. 10
figure 10

Land use and Land cover variations during the years (a) 2012, (b) 2021, and (c) Population distribution map of Odisha, and Spatial distribution point of the estimated

Land Use Land Cover Change

The LULC map (Fig. 10a, b) indicates that the main LULC types are agriculture and vegetation, Forest, wetland, settlement area, bare land, and water. Further analysis indicates that the area under agricultural and sparse vegetation has decreased from ~68.42% to 66.62 % and from 1.41 to 1.31 between the years 2012 and 2021 respectively. The settlement area has increased from 0.29% to 1.20%, which signifies that urbanization may have increased between 2012 to 2021 respectively (Table 4; Fig. 10a, b). The forest area has changed slightly from 26.20% to 26.96% i.e., 0.76% in the year 2021 (Fig. 10b). Minor increase in forest area, enable holding water, percolation and recharge processes which could slightly improve groundwater water quality in the region. The bare land and wetland area remained unchanged during this period (2012 and 2021). The area of water bodies has increased from 2.68% to 2.78% in the year 2021. These changes will likely impact water quality, general environmental conditions, agricultural productivity, industries, and socioeconomic conditions of society. The settlement area has grown slightly, indicating that anthropogenic activities may affect the overall groundwater quality between 2012 to 2021 (Fig.10a, b). The saline intrusion has increased with time from 0.04% to 0.10% between 2012 to 2021. The poor water quality in the eastern parts of may be due to the intrusion of saline water shown in Fig 9(e). The population density map (Fig. 10c) also suggests that the groundwater quality may deteriorate due to the excess withdrawal and enhanced seawater intrusions during the last few decades. A recent study has been done by Ranjan et al., 2020, which suggested that the mining activity become a serious threat to forest resources of the Jharsuguda district of Odisha between 2006-2016.

Table 4 Land Use Land Cover variation from 2012-2018 (ESA 2017)

Seawater Intrusion

The molar ratio of Na+/Cl- is significant in terms of onshore encroachment of seawater; the ratio of 0.86 represents the seawater intrusion, while a ratio of >1.0 freshwater source (Ekhmaj et al. 2014). The Na+/Cl- molar ratio of groundwater samples varied from 0.058 to 6.96, with an average of 1.09. Based on a theoretical mixing line, seawater intrusion or evaporite dissolution in the coastal aquifer is indicated by a few samples lying on or near the mixing line in the Na+ and Cl- plots (Fig. 9d). The Cl- concentration appears to be slightly higher than the standard limit (WHO 2017). The correlation coefficient of the Na+ vs Cl- plot shows 0.689, suggesting that the study regions may be influenced by seawater intrusion (Fig. 9d).

The Revelle coefficient is the ratio of Chloride and bicarbonate (Cl/(CO3+HCO3)) and is used to assess the saltwater intrusion in groundwater aquifers (Lobo-Ferreira et al. 2005). Chloride is the major seawater ion but occurs in much lower groundwater concentrations. While HCO3- is the most common anion in groundwater but occurs in low concentrations in marine water. The Cl/HCO3 anions ratio is used to measure the degree of seawater contamination of freshwater. In the study area, 55.17% of groundwater samples were of good quality, whereas 37.14 percent were relatively less contaminated, and 7.58 percent were moderate to highly contaminated (Table 5).

Table 5 Classification of the Groundwater Quality based on Revelle Coefficient (Cl/(CO3+HCO3))

Conclusions

In this paper, we investigated the groundwater physicochemical parameters to understand the spatio-temporal variations of various parameters that control the water quality in the region. The uneven distribution of cations in groundwater suggests that the groundwater aquifers have heterogeneous geochemical characteristics. PCA analysis indicates that the region is mainly affected by water-rock or geogenic seawater intrusion/cation exchange process or maybe dissolutions of evaporate minerals, fertilizers, and mining activity. The Principal Component Analysis (PCA) indicates two major principal components PC1 (TDS, TH, Ca2+, Mg2+, Cl-, and SO42 with 45.87% variance), and PC2 (HCO3-, and Fe2+ with 18.14% variance). This suggests that water may have been influenced by both natural and anthropogenic sources in the study area. For 2021-22, PC1 accounts for 52.543% of data variance and indicates that most ions are derived from natural sources. From the year 2015-16 onward, the Fe2+ concentration shows an increasing trend, suggesting that mining activity may have some influence in the region. Furthermore, the saturation index of the dissolution of gypsum, Halite, dolomite, and calcite signifies that the dissolution of these minerals also affected groundwater salinization. Approximately 61% of the groundwater samples were found suitable for irrigation, while the remaining 39% may be unsuitable for irrigation (MH >50%). Shannon's entropy method was used to compute the Entropy-based Water Quality Index (EWQI). The highest entropy weights (0.235, 0.157, 0.156) were found for F-, TDS, and Cl-, whereas pH shows relatively lower entropy weight. The EWQI suggests that quality of groundwater in the study region varies from excellent to medium quality. The analysis also indicates that water quality may have been slightly degraded since 2015 due to anthropogenic activities such as industries, mining, and/or agricultural activities, etc. An increase in the settlement area, as seen in the region may also influence the overall quality of subsurface water. The analysis and results may be used for planning and policy decisions by the state pollution control, and water resource management agencies for sustainable development in the region.