Introduction

Groundwater is considered a substantial source for drinking, agricultural and industrial activities in the semi-arid parts of India. Urban, semi-urban and rural areas of Indian states are mainly reliant on groundwater as approximately 85% rural domestic water demands and 50% urban potable water requirements are fulfilled by groundwater only (Ahada and Suthar 2018a; Jain and Vaid 2018). India is the first largest user of groundwater in the world with 251 km3/year abstraction rate followed by China (112 km3/year) and USA (112 km3/year). Out of the total amount of groundwater extracted, largest share ~ 89% is used by the agricultural sector particularly for irrigation purpose and remaining 11% for public water (domestic purpose) and industrial supply (World bank 2012; Margat and Gun 2013; Raju and Singh 2017; Singh et al. 2019b; Ahada and Suthar 2018b). Groundwater plays a crucial role in the socio-economic, agricultural and environmental development of a region (Kaur and Rishi 2018). Anthropogenic activities like unplanned industrialization, unsustainable economic development, rapid population growth and overexploitation of groundwater in the past few decades had posed a severe threat on this precious resource in terms of both quantity and quality (Sharma et al. 2017; Pant et al. 2017b; Kaur et al. 2019). During the green revolution, farmers of Punjab have extensively followed the monocropping culture (mainly wheat–rice cycle), instead of multiple cropping by abandoning other crops like pulses, mustard and vegetables due to fertile alluvial plain and their surplus availability of water resource. Intensive use of fertilizers and pesticides act as diffuse pollutants originating from irrigated agricultural field, so the fertilizers and pesticides are the main responsible factors affecting the quality of the aquifers. Thus, nutrients in the form of NO3, PO43− and K+ are common contaminants in soils and groundwater. The chemical contamination of groundwater from both natural and anthropogenic factors is almost an irreversible process (Causapé et al. 2006; Aydi 2018). Further, the long-term consumption of contaminated water is of great concern due to possible health risks. Besides heavy metals, nitrate and fluoride are known to be harmful to children and adults because of their low tolerance towards the pollutants. The human exposure to these metals is mainly through ingestion and dermal contacts. The non-carcinogenic human health risks from NO3 and F in groundwater have been enlisted as an emerging issue in our country. The health hazard associated with the consumption of NO3-rich water is methemoglobinemia in newly born children, and thyroid dysfunctions and hypertension in adults and pregnant women (Ahada and Suthar 2018b). Furthermore, ~ 200 million people around the globe are regularly consuming water contaminated with F content above the limits of WHO 2011 (Emenike et al. 2018). Also, the ingestion of high-F-content water may cause dental fluorosis, osteoporosis, arthritis and thyroid. Therefore, the toxicity effects of NO3 and F are needed to assess their non-carcinogenic health risks on human health expose at any dose.

Jalandhar is a predominantly agricultural district in the central part of Punjab (commonly known as the Bist Doab tract) with cropping intensity of 178%. Around 90% of the total irrigated area is fulfilled by groundwater. The region also witnessed a considerable increase in the density of the tube wells, i.e. 38% from 1998 to 2013 (Dhillon et al. 2019). The high annual rate of groundwater extraction (8600 m3/ha/a), chiefly for agricultural purposes, has turned most of the blocks into the ‘overexploited’ category in the region (Rao et al. 2017). The district witnessed expansion of agricultural land by illegally converting forest area to increase paddy cultivation area has led to tremendous stress on groundwater. The area received very low seasonal rainfall (70.3 cm/year), which is unable to support the high-water demanding agricultural crops. Thus, groundwater mining is the only available option to sustain the agricultural productivity in the fertile alluvial plains of the Indo-Gangetic region. Farmers are extensively using agrochemical fertilizers on highest scale of consumption rate, i.e. 251 kg/ha as compared to the national average of 165.8 kg/ha to maintain the productivity (World Bank; Government of Punjab 2016). As per the recent report of the Punjab Department of Agriculture, the demand of N, P and K fertilizers in the study area has increased significantly from 37.50 kg/ha/year in 1970 to 257 kg/ha/year in 2016. The conversion of conventional farming practices (organic method) to synthetic agrochemical techniques has largely disturbed the various segments of environmental dynamics and affects the non-target species due to release of toxic persistent chemicals. The water-soluble chemicals are dissolved in irrigated water and percolated into subsurface strata, thus affecting the natural chemical equilibrium of aquifer system (Singh et al. 2015; Pant et al. 2017, 2019). Thus, the prolonged consumption of chemical-contaminated groundwater is one of the threatening factors for human health risk, cattle and beneficial soil microbes. Jalandhar district has nearly 21 large- and medium-scale and 20,295 small-scale industries and is also famous for manufacturing various sports good in the country (Government of Punjab 2016). Industrial clusters such as tanneries, paper and textile industries are mainly located in the vicinity of the Jalandhar city. Industrial and urban sectors not only utilize the available water, but also generate a large amount of wastewater of complex nature (Aulakh et al. 2009). Moreover, the study area lacks proper sewerage or drainage system where wastewater is directly discharged into nearby streams and river. It results in the increasing vertical and lateral spread of pollutants in aquifer system, rendering serious threats to environmental health (Mukate et al. 2018).

Some researchers have already carried out studies focusing on the basic quality appraisal of groundwater for drinking and irrigation purposes in this region (Purushothaman et al. 2012, 2014; Singh et al. 2019a; Rao et al. 2015). Research on groundwater contamination and associated human health risks is rather limited. To fulfil the loopholes of the previous study, the integrated approach of multivariate statistical techniques along with GIS, hydrogeochemical modelling and human health risk assessment was employed as robust methods to understand the geochemical signatures and water quality appraisal in the alluvial aquifer of north-east Punjab (Herojeet et al. 2015, 2016; Li et al. 2016, 2018a; Rao et al. 2017; Singh et al. 2019b; Keesari et al. 2019; Kaur et al. 2019). The emergence of hydrogeochemical characterization techniques and chemometric modelling enables to overcome the limitation of conventional study of using various water quality indices and graphical representation to define pollution source of groundwater in Jalandhar district. Chemometric techniques such as principal component analysis (PCA) and cluster analysis (CA) along with geographical information system (GIS) techniques aid in the interpretation of the large dataset for a better understanding of the geochemical evolution and sources of the pollution in the alluvial aquifers (Singh et al. 2019b; Herojeet et al. 2017; Rajkumar et al. 2018). The major objectives of the present research were to assess the quality appraisal of groundwater for drinking and agricultural purposes, to understand hydrogeochemical evolution through graphical modelling, to elucidate the source apportionment related to anthropogenic and natural sources with the help of chemometric and GIS techniques and to assess the non-carcinogenic human health risk associated with direct ingestion of groundwater in the agricultural dominated region of the study area. This study will help to maintain the ecological balance with the water resource management in such semi-arid agrarian state of India.

Study area and groundwater scenario

Jalandhar is an agriculturally prosperous district of Punjab situated in the central part of the state and located between 30°59′ to 31°37′N latitudes and 75°04′ to 75°57′E longitudes (Fig. 1). The district is divided into five taluks, namely Jalandhar I and II, Nakodar, Shahkot and Phillaur. Three distinct seasons are experienced, viz. winter, summer and monsoon. The study area represents semi-arid conditions marked by hot summer and mild winter. The area received an average annual rainfall of 703 mm where 70% of the annual normal rainfall is received during the south-west monsoon from July to September (CGWB 2016, 2018). Temperature in the district varies between a minimum of 5 °C (in January) and a maximum of 45 °C (in May–June). Agriculture sector share is the main economic domestic product and dominant land-use–land-cover (LULC) class in the district (Fig. 2). Wheat and rice are the principal crops grown in the region during Rabi and Kharif seasons, respectively. Jalandhar district is a part of Satluj sub-basin of Indus Basin representing vast Indo-Gangetic alluvial plains and is drained by perennial River Satluj and its tributary East Bein. The study area is part of Bist Doab (the area bounded between the River Satluj and Beas) tract comprised of Quaternary to Sub-Recent alluvium type consisting of pebbles, gravel, fine- to medium-grained sand, clay and kanker. Two types of alluvium, namely older alluvium (Bhangar) and newer alluvium (Khadar), are available in the region. The older alluvium of Middle to Late Pleistocene age comprises of fine to medium sandy soil, silty clay, loam, sandy loam and sandy clay which is rich in calcareous material occurring all over the district (Supplementary Fig. 3 and Supplementary Table 1). The newer alluvium belongs to Upper Pleistocene to Recent age consists of coarse- to medium-grained sand with a lesser amount of silt and clay and poor in the calcareous matter (Rao et al. 2017; CGWB 2012, 2014). Kankar beds are generally more in older alluvium. Sand sheets of aeolian origin of Recent age from Rajasthan also exist in the western parts of the district (CGWB 2018). Physiographically, the district is prominent by two different features, namely vast alluvial plains and Satluj floodplain (G et al. 2015). Groundwater exists under both unconfined and confined conditions in these alluvial aquifers. Multiple aquifer groups consisting of fine- to coarse-grained sand are present in the study area to the depth of 350 metres below ground level (m bgl). The first aquifer group (up to 115 m bgl) is under unconfined conditions, while the second and the third aquifer groups under semi-confined to confined conditions exist up to 130 to 195 m bgl and 215–333 m bgl, respectively (CGWB 2012, 2016). An average 81 m coarse sand bed which is gravel in nature comprises the aquifer group I, and extensive 24-m-thick clay layer separates the aquifer I from underlying aquifer group II (Rao et al. 2017). Aquifer group II consists of alternating sequences of thin layers of sand and clay beds and gravel and occasional kankars are also present in this aquifer group. Topographically, it is more or less a flat terrain, having a gentle slope from NE to SW. Long-term water level trend analysis by Central Ground Water Board (CGWB) indicates general decline (negative) up to 8.18 m in the district. The water table elevation varies from 186 to 240 m above mean sea level (a msl) in the study area (Fig. 3). Annual water level analysis indicates that groundwater level decline at the rate of 0.0290 to 1.1398 metres per year. The groundwater flow in the district is towards south-west direction (Fig. 3). All the ten blocks of the district are categorized as ‘overexploited’ with 229% of groundwater development stage (CGWB 2016, 2018). The unconfined aquifer in the study area is under tremendous stress due to the high withdrawal of groundwater for agricultural and domestic purposes.

Fig. 1
figure 1

Map of the study area with groundwater sampling locations

Fig. 2
figure 2

LULC map of the Jalandhar district

Fig. 3
figure 3

Groundwater flow direction in the study area

Materials and methods

Sample collection and analytical procedures

A well-tested systematic random sampling technique was adopted to collect the groundwater samples from the study area. Forty-one samples were collected using Global Positioning System (GPS) during the post-monsoon season (October 2017) from tube wells (TW) and hand pump (HP) in the area under investigation. The geographical locations of all sampling stations are presented in Fig. 1. The standard methods of the American Public Health Association (APHA 2005) were followed for sample collection, transportation, conservation and analytical procedures (Table 2). All the groundwater samples were collected in 1-L high-density polyethylene (HDPE) bottles pre-washed with 10% nitric acid (HNO3) and rinsed with double-deionized water. Before sampling, the water stored in the well pipes was pumped out for 10–15 min to ensure the original composition of water. All HDPE bottles were thoroughly rinsed three times with the water to be sampled to avoid unpredictable changes in characteristics as per standard protocols. Physical parameters like pH, electrical conductivity (EC) and total dissolved solids (TDS) were measured on site with a portable multiparameter water quality meter (Hanna HI98194). Two sets of groundwater samples were collected from each sampling stations. The groundwater samples were filtered through 0.45 μm Whatman filter paper using Millipore glass assembly unit and kept at a temperature of 4 °C until analysis. Another set of collected samples was acidified to pH ~ 2 by adding ultra-pure concentrated HNO3 for measurements of the major cations. To avoid any oxidation process, containers were completely filled with groundwater to ensure that no air bubble be trapped within the sample. All the groundwater samples were analysed for major anions, viz. HCO3, CO32−, SO42−, PO43−, F, Cl and NO3, and cations, viz. Ca2+, Mg2+, Na+ and K+, according to the standard methods enlisted in APHA (2005). ArcGIS 10.2.1 software developed by Environmental Systems Research Institute (ESRI) was used to prepare various maps. To know the spatial extent of various water quality parameters, inverse distance-weighted (IDW) algorithm interpolation method was used. In this technique of interpolation, unknown values of the pixels are estimated by averaging the known value. It is an effective algorithm extensively used in the mapping of spatial interpolation of groundwater quality parameters (Shakerkhatibi et al. 2019). The accuracy of the chemical ion data was examined using charge balance error (CBE) equation, and values were within the acceptable limit of ± 5% (Hounslow 2018).

$${\text{CBE}}{\% } = \frac{{\left( {\Sigma {\text{Cations}}} \right) {\text{meq}}/{\text{L }} - \left( {\Sigma {\text{Anions}}} \right) {\text{meq}}/{\text{L }}}}{{\left( {\Sigma {\text{Cations}}} \right) {\text{meq}}/{\text{L }} + \left( {\Sigma {\text{Anions}}} \right) {\text{meq}}/{\text{L}}}}*100$$
(1)

Procedure to evaluate drinking water quality index (DWQI)

DWQI estimation helps in determining the water quality suitability for drinking purpose by reducing a large water quality dataset into a single value (Yadav et al. 2018). DWQI is a very efficient method and an effective mathematical technique which provides information on the overall quality of water by examining every individual water parameters (Horton 1965; Herojeet et al. 2016). Weighted arithmetical index method developed by Horton (1965) was used to calculate DWQI. Ten water quality variables (pH, TDS, TH, Ca2+, Mg2+, HCO3, F, Cl, SO42− and NO3) and Indian drinking water standards (BIS 2012; IS: 10500) were used for the estimation of DWQI. It was computed by adopting the following equation developed by Tiwari and Mishra (1985). Table 1 describes the relative weights of physicochemical parameters for DWQI calculation.

$${\text{DWQI}} = \mathop \sum \limits_{{q_{n} }} W_{n} /W_{n}$$
(2)

where \(W_{n}\) = unit weight of nth parameters is computed by using the following equation.

Table 1 Unit weight of water quality parameters for DWQI
$$W_{n} = K/S_{n}$$
(3)

and K is the proportionality constant derived from

$$K = \left[ {1/\left( {\mathop \sum \limits_{i = 0}^{n} 1/S_{i} } \right)} \right]$$
(4)

where \(S_{n}\) and \(S_{i}\) are the BIS standard values of the water quality parameter

$$q_{\text{ni}} = 100 \times \left[ {\left( {V_{a} - V_{i} } \right)/\left( {V_{s} - V_{i} } \right)} \right]$$
(5)

\(q_{\text{ni}}\) is quality rating of ith parameter for a total of n water quality parameters, where \(V_{a}\) = actual value present in the water sample, \(V_{i}\) = ideal value (0 for all parameters except pH which is 7.0) and \(V_{s}\) = BIS standard value of water quality parameters.

Health risk assessment

Health risk assessment studies provide significant information about the chemical exposure from the groundwater quality, especially for drinking purpose (Wu and Sun 2016; Chen et al. 2017; Adimalla 2018; Adimalla and Li 2018; He and Wu 2018; He et al. 2018; Ahmed et al. 2019). Most of the groundwater samples have elevated concentration of NO3 and F, which may pose a serious health risk to humans. The conventional model established by the United States Environmental Protection Agency (USEPA 2009) was employed for the health risk assessment in the study area. Only non-carcinogenic pollutants (NO3 and F) were selected to perform the health risk assessment through oral pathway of exposure for the present study because other exposure pathways such as dermal and inhalation were negligible risk. Non-carcinogenic risk through ingestion is calculated as follows:

$${\text{Chronic Daily Intake }}\left( {\text{CDI}} \right) = \left( {{\text{C}} \times {\text{IR}} \times {\text{ED}} \times {\text{EF}}} \right)/\left( {{\text{BW}} \times {\text{AT}}} \right)$$
(6)
$${{\text{HQ}}_{\text{ingestion}}} = {\text{CDI}}/{R_{f}} {D_{\text{ingestion}}}$$
(7)

C = Concentration of contaminant in groundwater (mg/L); IR = ingestion rate of drinking water (L/d) (2 L/d for adults and 1 L/d for children); ED = exposure duration (years) (30 years and 12 years for adults and children, respectively); EF = exposure frequency (days/year) (365 days/year for both adults and children); BW = body weight of the receptor (kg) (60 kg for adults and 20 kg for children); and AT = average time (years) (10,950 days for adults and 4380 days for children);

Total non-carcinogenic health risks (HIingestion) due to ingestion exposure pathway (for NO3 and F) can be calculated as follows:

$${\text{HI}}_{{ {\text{ingestion}}}} = {\text{HQ}}_{{ {\text{ingestion}}, NO_{3}^{ - } }} + {\text{HQ}}_{{ {\text{ingestion}}, F^{ - } }}$$
(8)

HQingestion = hazard quotient of non-carcinogenic risk by ingestion pathway and \({R_{f}}{D_{\text{ingestion}}} =\) reference dosage for non-carcinogenic pollutant through drinking water exposure pathway. RfDingestion value for NO3 is 1.6 mg/(kg d) and for F is 0.06 mg/(kg d) is obtained from the Integrated Risk Information System (IRIS) database of the US EPA.

Chemometric techniques and graphical modelling

Chemometric analyses such as PCA and CA were employed to understand the sources of major ions and geochemical processes influencing the groundwater quality. These techniques have been widely used as unbiased methods for attaining important information from the hydrochemical dataset related to the groundwater system (Singh et al. 2004; Nagaraju et al. 2014; Mohamed et al. 2015; Herojeet et al. 2016, 2017; Li et al. 2018b; Rahman et al. 2018; Song et al. 2018). Varimax rotation with Kaiser normalization method was applied for the extraction of principal components (PCs). CA is a powerful data mining method, which classifies variables into clusters on the basis of similarities within a group and dissimilarities between different groups (Rajkumar et al. 2018). For cluster analysis, Ward’s method of linkage with squared Euclidean distance as a similarity measure approach was applied on water quality dataset (Otto 1998). Hydrogeochemical characterization of the groundwater was evaluated by Piper trilinear diagram using Geochemist’s Workbench Student Edition 12.0 software. Furthermore, saturation index (SI) of the groundwater was carried out using geochemical modelling technique PHREEQC. All the calculation, scatter plots and statistical analysis were performed by using the Microsoft Excel 2010, SPSS 16.0 and Minitab 17 softwares.

Results and discussion

General parameters of groundwater

The groundwater quality parameters of the study area such as maximum, minimum, mean and standard deviation and the concentration of each physicochemical parameters are compared with the standard values of the World Health Organization (WHO 2011) and the Bureau of Indian Standards (BIS 2012), respectively, for drinking water and are presented in Table 2. The analytical results of the physicochemical parameters are given in Supplementary Table 2. pH values range from 7.1 to 8.8 with mean ± standard deviation (SD) of 8.1 ± 0.49 and indicate that the groundwater of the study area is slightly alkaline in nature. 24.39% samples have pH concentration above the permissible limit (6.5–8.5) of BIS (2012) and WHO (2011) (Table 2). The range and mean ± SD of EC in the groundwater of the study area are 570 μS/cm to 1058 μS/cm and 762.8 ± 122.9, respectively. The spatial distribution map of EC is shown in Supplementary Fig. 2(a). The entire groundwater samples are well within the permissible limit (1500 μS/cm) of WHO (2011) prescribed for drinking water. TDS and total hardness (TH) as CaCO3 varied between 365 and 677 mg/L and 152.5 mg/L and 322.5 mg/L, and their spatial distribution map is presented in Supplementary Fig. 2(b and c), respectively. As per Davis and DeWiest (1967) classification of groundwater based on TDS, 39% samples of the study area are permissible for drinking (Table 3). Also, TDS value is found below 1000 mg/l in all the samples and indicates that the groundwater is fresh in nature as per Freeze and Cherry (1979) classification (Table 3). According to Sawyer and McCarty (1967) based on TH, 95% of the groundwater samples fall under hard water category and the remaining 5% samples fall under very hard water category (Table 3). 44% of total alkalinity (TA) of samples is beyond the desirable limits (200 mg/L) of BIS (2012) and WHO (2011) (Table 3) and its spatial distribution map is shown in Supplementary Fig. 2(d). Weathering of silicate such as feldspar and dissolution of carbonate minerals in the presence of soil CO2 (g) is responsible for the high alkalinity by releasing HCO3 and Na+ ions in the groundwater as expressed in Eqs. 9, 10 and 14 (Keesari et al. 2016a; Roy et al. 2018). In the top soil, decaying of humus (CH2O) along with root respiration significantly increases the amount of soil CO2 (g). Thus, the infiltrating H2O through the top soil react with soil CO2 and leads to the formation of carbonic acid (H2CO3). The breakdown of H2CO3 in the presence of excess H2O releases HCO3 ion in the aquifer (Raju and Singh 2017) as expressed in Eq. 9.

Table 2 Statistical summary of physicochemical parameters of groundwater of the study area
Table 3 Classification of groundwater based upon TDS (Davis and DeWiest 1967), TDS (Freeze and Cherry 1979) and TH (Sawyer and McCarty 1967)
$$\left. {\begin{array}{*{20}l} {{\text{O}}_{2} + {\text{CH}}_{2} {\text{O}} \to {\text{H}}_{2} {\text{O}} } \hfill & {\left( {{\text{Organic}}\;{\text{decay}}\;{\text{and}}\;{\text{root}}\;{\text{respiration}}} \right)} \hfill \\ {{\text{CO}}_{2} + {\text{H}}_{2} {\text{O}} \to {\text{H}}_{2} {\text{CO}}_{3} } \hfill & {\left( {{\text{Formation}}\;{\text{of}}\;{\text{carbonic}}\;{\text{acid}}} \right)} \hfill \\ {{\text{H}}_{2} {\text{CO}}_{3} \to {\text{HCO}}_{3}^{ - } + {\text{H}}^{ + } } \hfill & {} \hfill \\ \end{array} } \right\}$$
(9)
$${\text{CaCO}}_{3} + {\text{H}}_{2} {\text{CO}}_{3} \to {\text{Ca}}^{2 + } + 2{\text{HCO}}_{3}^{ - } \;\;\;\;\left( {{\text{Calcite}}\;{\text{dissolution}}} \right)$$
(10)

Major cations and anions in the groundwater

Abundance order of the major ions in the groundwater samples is Na+ > Mg2+ > Ca2+ > K+ and HCO3 > Cl > SO42− >  NO3 > CO32− > F > PO43−, respectively, for both cations and anions (Fig. 4a, b). The concentration of Ca2+ and Mg2+ in study area ranged from 15.0 to 59.3 mg/L and 21.0 mg/L to 62.3 mg/L with mean ± SD of 31.1 ± 10.10 and 33.1 ± 9.3, respectively (Table 2). Dissolved magnesium exceeds calcium in aquifers once calcium precipitates after reaching supersaturation and accounts for higher magnesium concentrations than calcium (Hem 1985). Therefore, 61% of groundwater samples have Mg2+ contents above the desirable limit (30 mg/L) of BIS (2012), which may contribute to total hardness in groundwater (Table 2). The higher content of Mg2+ in the groundwater of the study area may be due to weathering of dolomite minerals, sandstone and applications of chemical fertilizers and pesticides as most of the sampling locations are from the cultivated land (Keesari et al. 2014). The spatial distribution map of Ca2+ and Mg2+ is given in Supplementary Fig. 2(e) and (f), respectively. Na2+ in the study area ranged from 22.4 to 86.1 mg/L (mean ± SD of 47.4 ± 15.07) and most dominant cation in groundwater (Fig. 4a). The concentration of K+ in the study area ranged from 4.1 to 32.8 mg/L with 51% samples above the permissible limit (12 mg/L) of WHO (2011) (Table 2). The source of Na+ and K+ in the study area may be due to weathering of potash and silicate minerals, application of potash fertilizers and sewage wastewater (Srinivasamoorthy et al. 2014). The spatial distribution map of Na+ and K+ is presented in Supplementary Fig. 2(g and h).

Fig. 4
figure 4

a and b Box plots of the major cations and anions of groundwater of the study area (asterisk: outlier, cross dot: mean, whisker horizontal line: median, whisker vertical line: maximum and minimum and box: first quartile and third quartile

Among the anions, HCO3 is the dominant anion (Fig. 4b), ranging from 137.0 to 318.0 mg/L (mean ± SD of 214.7 ± 41.3), and their spatial distribution map of HCO3 is shown in Supplementary Fig. 2(i). The concentration of CO32− ranges from 0.0 to 28.0 mg/L with mean ± SD of 7.5 ± 8.3. The presence of HCO3 and CO32− ions is mostly responsible for the alkalinity of groundwater. Major source of HCO3 ion in the groundwater is silicate weathering and dissolution of carbonate rocks such as calcite [CaCO3] and dolomite [CaMg(CO3)2] in the aquifers (Keesari et al. 2014) and discussed in Eqs. 10 and 18. NO3 concentration varied from 2.4 to 80.6 mg/L (mean ± SD of 26.4 ± 19.57), and about 20% and 12% groundwater samples have exceeded the permissible limits for NO3 in drinking water of BIS (2012) and WHO (2011), respectively [Table 2 and Supplementary Fig. 26(j)]. The consumption of water with high NO3 content causes blue baby or methemoglobinemia in infants, and gastric cancer, central nervous system birth defects and hypertension in adults (WHO 1997; Yadav et al. 2018). Anthropogenic activities such as application of fertilizers, domestic and municipal sewage and seepage from the septic tanks are the main sources of nitrate in the study area (Keesari et al. 2016a; Raju and Singh 2017). According to the Adimalla et al. 2018 classification, around 20% samples show high health risk due to excess content of NO3 Supplementary Table 3. F content in the study area varied from 0.03 mg/L to 2.0 mg/L with mean ± SD value of 0.7 ± 0.40. Nearly 20% and 5% samples have F concentration above the desirable and permissible limits of BIS (2012) [Table 2 and Supplementary Fig. 2(k)]. As per Adimalla et al. 2018 classification, around 5% samples show dental fluorosis risk and 63.4% samples have F content between 0.6 and 1.5 mg/L, which indicates human health risk (Supplementary Table 3). Several conditions such as alkaline pH, higher content of HCO3 and lower concentrations of Ca2+ in the groundwater of the study area provide favourable conditions for F enrichment in the aquifer. Apart from natural sources, some anthropogenic sources such as industrial activities like brick kiln and seepage of untreated sewage water may be the reason for the elevated concentration of fluoride in the study area (Roy et al. 2018; Yadav et al. 2018). The concentration of SO42− and Cl in study area varied from 12.0 to 123.7 mg/L and 21.3 mg/L to 179.0 mg/L, respectively, and within the BIS (2011) and WHO (2011) (Table 2). Sodalite, micas and apatite are the chief minerals that contribute chloride to groundwater (Karanth 1987). Oxidation of sulphide minerals, weathering of rocks containing gypsum (CaSO4·2H2O) mineral and anthropogenic input like industrial effluent, fertilizers may contribute to SO42− in the groundwater (Roy et al. 2018). The spatial distribution map of Cl is presented in Supplementary Fig. 2(l). The concentration of PO43− in the study area is quite low, ranged from 0.00 to 1.0 mg/L (Table 3). However, BIS (2012) and WHO (2011) have not suggested any guideline limits for PO43− ion in the groundwater.

Drinking water quality index evaluation

The computed DWQI values of groundwater of Jalandhar district varied from 6 to 117 with mean value 50.3, as shown in Supplementary Table 4. Spatial distribution map of the DWQI is shown in Fig. 5. According to the classification of DWQI (Tiwari and Mishra 1985), majority of the samples fall under ‘excellent to good’, followed by ‘poor to very poor’ and ‘unfit for drinking’ classes for drinking purposes, with values of 51%, 46% and 3%, respectively (Table 4). Extensive agricultural, indiscriminate use of chemical fertilizers, industrial activities and discharge of untreated sewage water may be responsible for groundwater quality degradation (Keesari et al. 2014). The south-western and central parts of the study area show more deteriorated water quality in comparison with the northern part (Fig. 5). One groundwater sample location (J24) that requires urgent attention is falling under ‘unfit for drinking category’. Continuous consumption of groundwater for drinking purpose from this pocket may pose a great threat to human health, and thus, immediate measure is must in such areas. As per the groundwater elevation map (Fig. 3), the aquifer flow direction is towards the south-west which is another key factor for accumulating the contaminants dispersing from northern and eastern region besides local or nearby pollution sources. The result of DWQI is well corroborated with groundwater movement, which indicates that regular and strict monitoring of groundwater is a necessity in the south-western part and certain pockets to prevent further contamination. Effective pollution control measures would significantly reduce overall pollution, which further improves the quality of water, and people depend on such water source.

Fig. 5
figure 5

Spatial distribution map of the DWQI

Table 4 Water quality index classification

Health risk assessment

The results of the non-carcinogenic risk assessment for HQingestion and HIingestion are summarized in Table 5. The HQingestion (NO3) ranges from 0.08 to 2.52 and 0.05 to 1.68 with a mean value of 0.83 and 0.55 for children and adults (Table 5). About 36.58% and 14.6% groundwater samples show HQ > 1 for children and adults, respectively, and suggest that drinking groundwater polluted with NO3 could pose severe health hazard to children than adults. Similarly, HQingestion (F) for children and adults varied between 0.03 and 1.63 and 0.02 and 1.09 with 7.32% and 2.4% samples showing HQ > 1. The results of HQ ingestion (Table 5 and Fig. 6) indicate that children and adults are more vulnerable to NO3 than F contaminants in drinking water. Further, compared with adults, the non-carcinogenic risk of NO3 is much more serious for children. HI more than 1 indicates a potential non-carcinogenic effect on human health, while HI < 1 indicates negligible risk on human health. For adults, the calculated HIingestion (NO3 and F) is varied from 0.08 to 2.23 with mean value of 0.96, and 19 (46.34%) groundwater samples have shown HI > 1. The calculated HIingestion (NO3 and F) for children is varied from 0.12 to 3.35 with approximately 68% (28 groundwater samples) samples showing HI > 1. The mean value of HIingestion for children (1.44) is higher than 1, inferring that the children are more prone to the non-carcinogenic health risk from NO3 and F. Thus, the health risk assessment for HQ and HI confirmed that children are more vulnerable and threatening as compared to the adults and cannot be ignored. Overall the population of the area under investigation are at high non-carcinogenic health risk and more people may become prone to disease in the future.

Table 5 Calculated non-carcinogenic health risk for adults and children
Fig. 6
figure 6

Health risk assessment for adults and children

Evaluation of groundwater salinization

Revelle index (1941)

Revelle index (1941) is used for the evaluation of the degree of groundwater salinization in the study area and calculated by using Eq. 11. It has been considered to be an effective method for groundwater salinization evaluation in arid and semi-arid regions (Revelle 1941).

$${\text{Revelle}}\;{\text{index }}\left( {\text{RI}} \right) = {\text{Cl}}^{ - } / \left( {{\text{HCO}}_{3}^{ - } + {\text{CO}}_{3}^{2 - } } \right)$$
(11)

where all the values are in meq/L

The RI value varies from 0.1 to 1.5 meq/L in the study area (Supplementary Table 7). As per the classification based on RI values, 78% of the groundwater samples in the study area are not affected by salinization (RI value < 0.5 meq/L) and 22% samples are slightly influenced by salinity (Supplementary Table 5). Figure 7 shows that groundwater salinization is more prominent in the south-western part and few pockets in the south and central parts of the study area. It may be due to anthropogenic activities such as uncontrolled agricultural practices, seepage from municipal sewage and septic tanks and aquifer movement in the study area. Therefore, groundwater movement is also towards the south-western direction (Fig. 3) and this area is more affected by salinization problem and poor water quality as per the Revelle index and DWQI map (Fig. 5). As we discussed earlier that aquifer movement is also a key responsible factor complying the accumulation of ions along the flow path, thereby affecting the water quality (Fig. 3). The results of RI and DWQI draw the clear picture that poor water quality in the south-western and some parts of the study area is influenced by groundwater movement leading to salinization besides local pollution sources. Hence, it is recommended that the water quality study of an area shall be assessed together with RI and groundwater movement. This integrative study is a noble approach to understand groundwater quality and salinity related to flow direction and help to demarcate the aquifer region for immediate restoration and management formulation and policy as well.

Fig. 7
figure 7

Spatial distribution map of the groundwater salinization

Mechanisms controlling groundwater chemistry

Gibbs diagram and water–rock interaction

To understand the mechanisms controlling groundwater chemistry in the alluvial aquifer of Jalandhar district, water quality data were plotted in the Gibbs diagrams (Gibbs 1970). Figure 8 depicts that majority of the groundwater samples clustered on the rock dominance zone, suggesting that geochemical processes such as percolating rock–water interactions and chemical solute exchange processes of rock-forming minerals influence the hydrochemistry. Further, Schoeller indices or chloro-alkaline indices (Schoeller 1967) were employed to assess the ion exchange reactions among the host rocks and aquifer during their residence time and evaluated by Eqs. 12 and 13.

Fig. 8
figure 8

Gibbs diagram representing controlling factors of groundwater quality

$${\text{CAI}} - {\text{I}} = {\text{Cl}}^{ - } - \left( {{\text{Na}}^{ + } + {\text{K}}^{ + } } \right)/{\text{Cl}}^{ - }$$
(12)
$${\text{CAI}} - {\text{II}} = {\text{Cl}}^{ - } - \left( {{\text{Na}}^{ + } + {\text{K}}^{ + } } \right)/\left( {{\text{SO}}_{4}^{2 - } + {\text{HCO}}_{3}^{ - } + {\text{CO}}_{3}^{2 - } + {\text{NO}}_{3}^{ - } } \right)$$
(13)

The positive and negative values of CAI-I and CAI-II explain reverse ion exchange reaction and chloro-alkaline disequilibrium (cation exchange), respectively. 90% groundwater samples exhibit negative ratio for both CAI-I and CAI-II indices, indicating the chloro-alkaline disequilibrium or dominance of cation exchange process in the aquifer system (Supplementary Table 7 and Supplementary Fig. 3). In this process, Ca2+ and Mg2+ ions in the groundwater are exchanged with Na+ and K+ of the host rock as shown in Eq. 14 (Rajmohan and Elango 2003), whereas only 10% of the samples showed positive values illustrating the base exchange reaction (Supplementary Table 7 and Supplementary Fig. 3).

$${\text{Na}}^{ + } \left( {{\text{K}}^{ + } } \right) - {\text{Clay}} + {\text{Ca}}^{2 + } \left( {{\text{Mg}}^{2 + } } \right){\text{aq}} = 2{\text{Na}}^{ + } \left( {{\text{K}}^{ + } } \right){\text{aq}} + {\text{Ca}}\left( {\text{Mg}} \right) - {\text{Clay}}$$
(14)

Weathering processes

The concentration of dissolved minerals or major ions in the groundwater is mainly due to various direct and indirect interactions between aquifer and host rock. Various bivariate plots were prepared to understand the chemical weathering profile and sources of solutes in the alluvial aquifer. The cation exchange processes involved in the aquifer is further substantiated by the scatter plot between [(Ca2+ + Mg2+) – (HCO3+ SO42−)] and [(Na+ + K+) – Cl]. The majority of the groundwater samples fall on a slope of − 1 [slope y = − 1.0662x, Fig. 9a] and clearly indicate the ion exchange process between Ca2+, Mg2+ and Na+ ions in the alluvial aquifer of the study area (Keesari et al. 2016a; Batabyal and Gupta 2017). The scatter plot between Ca2+ + Mg2+ and HCO3 + SO42− (Fig. 9b) revealed that both carbonate and silicate mineral dissolutions are affecting the aquifer chemistry in the study area. The groundwater samples (36%) were above the equiline (1:1), reflecting reverse ion process, and about 61% samples were below the equiline (1:1), suggesting ion exchange is occurring in the region (G et al. 2015; Liao et al. 2018). One groundwater sample (3%) falls on equiline (1:1) and is influenced by the dissolution of calcite and dolomite in the alluvial aquifer (Singh et al. 2017). The general silicate weathering reaction with carbonic acid (Das and Kaur 2001) is given in Eq. 15.

Fig. 9
figure 9figure 9

Major ion relationship: a (Ca2+ + Mg2+) − (HCO3 + SO42−) versus (Na+ + K+ −  Cl), b Ca2+ + Mg2+ versus HCO3 + SO42−c Na+ versus Ca2+d Na+ versus Cle Na+/Cl versus EC, f Ca2+/Na+ versus HCO3/Na+, g Ca2+/Na+ versus Mg2+/Na+, h SO42− versus Ca2+, i Ca2+/Mg2+ versus Samples, j Ca2+ + Mg2+ versus TZ+, k Na+ + K+ versus TZ+ for groundwater in the study area

$$\left( {{\text{Na}},{\text{K}},{\text{Ca}},{\text{Mg}}} \right){\text{silicate }} + {\text{H}}_{2} {\text{CO}}_{3} \to {\text{H}}_{4} {\text{SiO}}_{4} + {\text{HCO}}_{3} + {\text{Na}}^{ + } + {\text{K}}^{ + } + {\text{Ca}}^{2 + } + {\text{Mg}}^{2 + } + {\text{Clay}}$$
(15)

The ion exchange process is further validated by Na+ versus Ca2+ plot (Fig. 9c), as the majority of the groundwater samples fall above and close to the equiline (1:1), suggesting the prevalence of ion exchange process in the study area (Singh et al. 2018). The scatter plot between Na+ vs Cl is used to identify the sources of salinity and saline incursions in arid and semi-arid regions (Gaofeng et al. 2010). Figure 9d depicts that majority of the groundwater samples fall above the equiline (1:1) and signify that Na+ may be attributed to silicate weathering (weathering of albite as expressed in Eq. 16) and cation exchange (Rajmohan and Elango 2003). On the other hand, some samples fell below the equiline, indicating the sources of Cl in the groundwater may be derived from the dissolution of chloride salts (Patel et al. 2016). Some groundwater samples were found on the y = x line, indicating the source of ions is the dissolution of halite as expressed in Eq. 17 (He et al. 2018; Li et al. 2018a). Further, Na+/Cl ratio (Supplementary Table 7) ranged from 0.6 to 4.4 with a mean value of 1.6. Around 78% groundwater samples showed Na+/Cl versus EC (Fig. 9e) ratio more than 1, indicating that silicate weathering is responsible for Na+ in the aquifer and 22% groundwater samples showed Na+/Cl versus EC (Fig. 9e) ratio below or close to 1, reflecting that halite dissolution is contributing Na+ to the aquifer (Kumar et al. 2015).

$$\begin{aligned} & 2{\text{NaAlSi}}_{3} {\text{O}}_{8} + 2{\text{H}}_{2} {\text{CO}}_{3} + 9{\text{H}}_{2} {\text{O}} \to {\text{Al}}_{2} {\text{Si}}_{2} {\text{O}}_{8} ({\text{OH}})_{4} + 2{\text{Na}}^{2 + } + 4{\text{H}}_{4} {\text{SiO}}_{4} + 2{\text{HCO}}_{3}^{ - } \\ & \left( {\text{albite}} \right)\;\;\;\;\;\;\;\left( {{\text{silicate}}\;{\text{weathering}}} \right)\;\;\;\left( {\text{kaolinite}} \right) \\ \end{aligned}$$
(16)
$${\text{NaCl}} \to {\text{Na}}^{ + } + {\text{Cl}}^{ - } \left( {{\text{Halite }}\;{\text{dissolution}}} \right)$$
(17)

The mixing plots of Ca2 + normalized Na+ versus HCO3 normalized Na+ and Ca2 + normalized Na+ versus Mg2+ normalized Na+ reveal silicate weathering is the main factor influencing the aquifer hydrochemistry (Fig. 9f, g). In the scatter plot between SO42− and Ca2+ (Fig. 9h), most of the samples were below the equiline (1:1), indicating the insignificant amount of gypsum present in the study region (Purushothaman et al. 2014). Few samples were found above the equiline (1:1), indicating the precipitation of calcite, and samples fall along the equiline (1:1), representing the CaSO4 dissolution (anhydrite) in groundwater (Kumari et al. 2018). The Ca2+/Mg2+ ratio was also studied to understand the dissolution of dolomite and calcite in the area. Figure 9i depicts that majority of the samples showed the value of Ca2+/Mg2+ ratio below 1 which indicates dissolution of dolomite is the main process in the alluvial aquifer of the study area (Mayo and Loucks 1995) and as expressed in Eq. 18.

$${\text{CaMg}}({\text{CO}}_{3} )_{2} + 2{\text{H}}_{2} {\text{CO}}_{3} \to {\text{Ca}}^{2 + } + {\text{Mg}}^{2 + } + 4{\text{HCO}}_{3}^{ - } \left( {{\text{magnesium }}\;{\text{calcite}}\;{\text{dissolution}}} \right)$$
(18)

The high content of magnesium and calcium is largely due to the occurrence of kankar (calcareous encrustations) in the soil matrix of the study region. The scatter plot of Ca2+ + Mg2+ versus total cations (TZ+) (Fig. 9j) shows that all the groundwater samples were plotted below the equiline (1:1), which indicates that weathering of calcium carbonate (dolomite and calcite) and calc-silicate minerals (plagioclase, amphiboles, pyroxenes) may contribute to Ca2+ and Mg2+ ions in the alluvial aquifer of the study region (Neogi et al. 2017). Therefore, the reaction of H2CO3 with plagioclase feldspar and clinopyroxene acts as an additional source to contribute these ions, namely calcium, magnesium, sodium, bicarbonate and H4SiO4 (Eq. 16). In the scatter plot between Na++K+ and TZ+ (Fig. 9k), all the samples were plotted below the equiline (Na++K+ = 0.5TZ+), which depicts that the weathering of silicate-containing minerals along with some anthropogenic activities is responsible for sodium and potassium ions in the groundwater (Mahaqi et al. 2018).

Saturation index (SI) and Langelier saturation index (LSI)

The Gibbs diagram (Fig. 8) clearly shows that hydrochemistry of the study area is mainly influenced by interactions of aquifer materials. Under such interactions, rock minerals may contaminate the groundwater (Sridharan and Nathan 2018). The dissolution/precipitation of minerals is the major governing natural process responsible for the major solute in natural water which further depends upon the saturation state of the minerals (Li et al. 2010, 2018a). The chemical equilibrium for a particular mineral is evaluated by the SI using Eq. 19. SI of seven minerals such as calcite, dolomite, aragonite, gypsum, anhydrite, fluorite and halite was estimated by using geochemical modelling software PHREEQC 3.1.2 and is presented in Supplementary Table 7.

$${\text{SI}} = \log {\text{IAP}}/K_{\text{sp}}$$
(19)

where IAP is the ion activity product and Ksp is the mineral solubility product.

SI value equal to zero indicates saturation, whereas SI value > 0 and SI value < 0 indicate supersaturation (precipitation and saturation from the groundwater) and undersaturation (dissolution of the minerals into groundwater.) with respect to a particular mineral, respectively (Li et al. 2010; Ndoye et al. 2018). The calculated SI values for anhydrite (CaSO4), fluorite (CaF2), gypsum (CaSO4:2H2O) and halite (NaCl) ranged from − 3.29 to − 2.06, − 4.68 to − 0.7, − 2.99 to − 1.76 and − 7.76 to − 6.47, respectively (Supplementary Table 7), which showed negative values (SI < 0) in all the groundwater samples (Fig. 10). It indicates that these minerals are in undersaturation state and weathering of these minerals (rock–water) may continue to release Na+, SO42−, Cl, Ca2+ and F ions into the groundwater. On the other hand, all the groundwater samples showed positive values (SI > 0) for carbonate rock minerals such as calcite (CaCO3), dolomite [CaMg(CO3)2] and aragonite (CaCO3), suggesting that the groundwater of the study area is oversaturated with these minerals. The calculated SI values range from 0.1 to 1.35, 0.35 to 2.79 and − 0.4 to 1.2 for calcite, dolomite and aragonite, respectively (Supplementary Table 7). In semi-arid regions, some factors like high temperature, evaporation rate and low rainfall are also responsible for the precipitation of calcite minerals in the groundwater (Rashid et al. 2018). The presence of kankars in the study area is an indicator of calcite precipitation. Thus, the oversaturation of magnesium and calcium bearing minerals leads to the depletion of Ca2+ and Mg2+ concentration in the aquifer system due to their precipitation. Moreover, gypsum dissolution also releases calcium ions, leading to the oversaturation of the groundwater along with calcite mineral due to the common ion effect (Keesari et al. 2016b; Mushtaq et al. 2018).

Fig. 10
figure 10

Saturation index of the groundwater of the study area

LSI (Langelier 1946) is also calculated to understand the dissolution, precipitation and neutral state of the groundwater of the study area. The corrosive nature of water chiefly depends upon the interaction of carbonates with CO2. The corresponding concentration of Ca2+, Mg2+ and CO2 in the groundwater deters the conversion of HCO3 to CO32−. LSI is computed by Eq. 20 and their results are given in Supplementary Table 7.

$${\text{LSI}} = {\text{pH}} - {\text{pH}}_{s}$$
(20)
$${\text{where}}\;{\text{pH}} = - \log \left[ {{\text{H}}^{ + } } \right]$$
$${\text{pH}}_{s} = \left( {9.3 + a + b} \right) - \left( {c - d} \right)$$
$$a = \log_{10} \left( {\text{TDS}} \right) - \frac{1}{10}$$
$$b = - 13.12\log_{10} \left( {C + 273} \right) + 34.55$$
$$c = \log_{10} \left( {{\text{Ca}}^{2 + } \;{\text{as}}\; {\text{CaCO}}_{3} \; {\text{mg/L}}} \right) - 0.4$$
$$d = \log_{10} \left( {{\text{alkalinity}} \;{\text{as}}\; {\text{CaCO}}_{3} \; {\text{mg/L}}} \right)$$

The calculated LSI values ranged from − 3.19 to − 1.24, which indicates that all the groundwater samples may readily dissolved the carbonate minerals from the aquifer due to free CO2 and thus mild to moderate corrosive nature of water (Supplementary Table 6).

Hydrochemical facies of the groundwater

To know the hydrochemical facies and associated processes in the groundwater of the study area, water quality chemical data are plotted on the Piper trilinear diagram (Piper 1944). Table 6 and Fig. 11 show that the majority of the groundwater samples belong to Ca2+–Mg2+–HCO3 (87.80%) hydrochemical facies followed by Ca2+–Mg2+–Cl–SO42− (7.31%) hydrochemical facies in certain pockets. It indicates that apart from the natural process (weathering of silicates and carbonates, ion exchange process), partially by some anthropogenic activities such as irrigation return flow, septic tank effluents and municipal wastewater, the aquifer salinity is influenced (Selvam et al. 2018). On the other hand, alkaline earth (Ca2+–Mg2+) significantly exceeds alkalies (Na+–K+) and weak acids (HCO3–CO32−) exceed over the strong acids (Cl–SO42−–F) in the study area. About 87.80% samples fall in the 5th field where carbonate hardness (secondary alkalinity) exceeds 50%; it may be due to the silicate weathering in the study area (Herojeet et al. 2017). In cation triangle, 87.80% and 12.19% of groundwater samples fall into the no dominant and Mg2+ water type, respectively. Among anions, the majority of the samples (92.62) fall in HCO3 water type (Table 6). Thus, the Piper diagram suggests that weathering of silicates and carbonates and cation exchange process are the main factors governing the hydrochemistry in this study area.

Table 6 Hydrochemical facies of the groundwater of the study area derived from piper diagram
Fig. 11
figure 11

Piper diagram representing hydrogeochemical facies of groundwater of the study area

Chemometric analysis

Principal component analysis

To elucidate the potential pollution sources and geochemical signatures influencing the groundwater chemistry, PCA was applied to sixteen variables of the water quality dataset of the study area. Only those principal components having eigenvalue greater than one have been taken into account for the interpretation of PCA (Herojeet et al. 2016; Rajkumar et al. 2018). In total, five principal components were obtained by varimax rotation with Kaiser normalization method (Herojeet et al. 2017; Sahu et al. 2018), altogether accounted for 81.7% of the cumulative variance (Table 7). The variances of principal components PC1, PC2, PC3, PC4 and PC5 are 33.1%, 17%, 14.1%, 10.3% and 7.2%, respectively (Table 7).

Table 7 Principal component matrix of the groundwater after varimax rotation with Kaiser normalization

The principal component 1(PC1) corresponds to over 33.1% of the total variance with an eigenvalue 5.29, having positive loading with TH, Mg2+, Cl and moderate positive loading with EC, TDS and NO3. Strong positive loading of Mg2+ and TH indicates that dissolution of dolomite [CaMg(CO3)2] mineral significantly controlling the groundwater chemistry of the study area is also revealed in Fig. 9i. The strong positive loading for Cl and moderate positive loading for NO3 are mainly attributed to anthropogenic activities chiefly industrial effluents, irrigation return flow and applications of fertilizers as agriculture is the dominant LULC class in the study area (Srinivasamoorthy et al. 2014; Keesari et al. 2016a; Kaur et al. 2019). Moreover, the study area lacks proper sewerage and drainage network; thus, seepages from the septic tanks coupled with domestic and municipal sewage in the aquifer also contribute to these ions (NO3 and Cl). Overall PC1 describes that both natural (cation exchange, dolomite dissolution) and anthropogenic sources influence the hydrochemistry of the aquifer. PC2 is contributing 17% of the overall variance with an eigenvalue 2.72 in the hydrochemical data of the groundwater. TA and HCO3 show strong positive score in PC2 with the loading value of 0.88 and 0.90, respectively. The main source of HCO3 is due to silicate weathering and dissolution of carbonate rocks such as calcite [CaCO3] and dolomite [CaMg(CO3)2] as shown in Eqs. 9, 10, 18 and Fig. 9b (Keesari et al. 2014; Li et al. 2018a). Based on the Piper and box plot diagram, HCO3 is the dominant anion in the groundwater which influences the TA in the groundwater (Keesari et al. 2014; G et al. 2015; Roy et al. 2018). A moderate positive loading of Na+ with TDS and EC indicates that dissolution of silicate and albite minerals are responsible for the Na+ in the groundwater of the study area as discussed in Eq. 16 and Fig. 9e. Hence, PC2 is mainly influenced by the natural processes due to weathering and dissolution of calcite, dolomite, silicate and albite minerals. PC3 accounts for 14.1% variance with eigenvalue 2.24 dominated by strong negative loading of pH (− 0.85) and CO32− (− 0.75) and moderate positive loading of K+ (0.68) and PO43− (0.52). The strong correlation between pH and carbonate indicates that CO32− is mainly controlled by the pH as alkaline condition favours the formation of the CO32− in the groundwater (Raju and Singh 2017). The evapotranspiration process also increases the pH of groundwater (Batabyal and Gupta 2017). The moderate positive loading between K+ and PO43− indicates that application of potash fertilizers and agrochemicals reflects the anthropogenic origin (Keesari et al. 2016a; Roy et al. 2018). The PC4 contributes 10.3% of the total variance with eigenvalue 1.65 and the strong loading on F (0.79) and moderate negative weight SO42− (− 0.64), suggesting the geogenic and anthropogenic origin. The SI revealed that fluorite and anhydrite mineral is the indicator of F and SO42− content in the groundwater. Moreover, the study area is represented by semi-arid condition; thus, high evaporation rate and longer residence time of groundwater (water–rock interaction under alkaline condition) are the significant factors for the fluoride in the region (Purushothaman et al. 2014; Adimalla and Li 2018; Khanoranga 2019). The negative loading of Ca2+ with respect to F is due to the process of oversaturation of calcite mineral, which leads to the precipitation of CaCO3 and promotes the dissolution of CaF2 mineral. After oversaturation, the process of common ion effect suppresses the CaF2 dissolution and this condition gave negative loading of Ca2+ with F (Batabyal and Gupta 2017). Insignificant loading of F with other variables neglected the anthropogenic influence in groundwater chemistry. However, the inverse relation of SO42− and the absence of correlation with other parameters indicate the different sources of origin related to human activities (Brindha et al. 2017). A strong positive loading observed in Ca2+ in PC5 represents 7.2% of the entire variance with eigenvalue 1.1. It is mainly influenced by the natural processes such as the dissolution of calcite and limestone (kankars) along with the cation exchange process, significantly contributing to the calcium concentration in the study area.

Cluster analysis

CA is performed to categorize the analyse dataset of groundwater samples. Four clusters were identified by dendrogram and their significance was further studied by the Sneath’s test techniques (Dlink/Dmax)*100 < 90 (Supplementary Fig. 4). To elucidate the heterogeneity among the clustering pattern, the mean value of each physiochemical parameters is computed for individual cluster group (Table 8). The indicator parameters influencing each clustering pattern are marked bold and italics with bold.

Table 8 Clustering pattern of groundwater samples in the study area

Cluster 1 (C1) represents the largest number of sampling group accounting 56.09% of the total samples. SO42− is the only special tracer influencing C1 which indicates that this ion might have different sources rather than geogenic factor. Saturation index revealed that dissolution of anhydrite mineral releases SO42− ion which is in undersaturation. However, the absence of any significant relation with other parameter depicts the contribution from human induce sources such as the by-products of fertilizer, breakdown of organic materials from weathered soil releasing sulphate into groundwater as the study area being intensively cultivated region. C2 forms the highest levels of pH and CO32− and higher value of TA comprises 21.95% of the samples. It signifies that the presence of CO3 ion is controlled by pH (alkaline conditions) which further influenced TA in groundwater (Roy et al. 2018). Therefore, the samples locations belong to C2 which indicates alkaline nature of water. The third cluster (C3) is marked by the special tracers of Mg2+, HCO3 and TA and higher levels of Ca2+ and TH which constitutes 12.21% of the entire samples. The significance level of Ca2+, Mg2+ and HCO3 depicts temporary hardness. However, the presence of dominant HCO3 ion is the main reason for TA and fits well with PC2. Piper diagram revealed that temporary hardness (Ca2+-Mg2+-HCO3 water type) in groundwater is found in C3. The last cluster (C4) is formed by the single sampling location (32) with multiple parameter tracers, namely EC, TDS, TH, Ca2+, Na+, K+, NO3, F, Cl and PO43−, respectively. The excess content of these parameters makes the water highly mineralized, resulting in high EC and TDS. The main source of Ca2+, Na+, K+ and F is the geogenic source, whereas NO3, Cl and PO43− belong to anthropogenic inputs as discussed above. The groundwater of this sampling location falls under salinity class as per the Revelle index. Therefore, C4 is influenced by salinity problem. Chemometric statistical techniques (PCA and CA) confirmed that the groundwater samples are largely controlled by parameters Ca2+, Mg2+, Na+, K+, HCO3, CO3 and F originated from geogenic factors and NO3, SO42−, Cl and PO43− of anthropogenic origin with minor influenced on the aquifer.

Conclusion

The groundwater is the main source for drinking and other purposes in the study area. Extensive agricultural activities and overextraction of groundwater draws the main attention for suitability appraisal using integrated application of GIS and chemometric statistical techniques and human health risk assessment through direct ingestion of groundwater. The analytical result of the physicochemical parameters revealed that the majority of the groundwater samples are within the permissible limits of BIS (2012) and WHO (2011), except few samples of pH, NO3 and F, respectively. The calculated DWQI exhibited that 49% of the groundwater samples fall under poor to unfit for drinking class. The corroboration of DWQI and RI results depicted that poor water quality zones are the south-western and central parts of the study area that may be influenced by groundwater movement, leading to salinization besides local pollution sources. The groundwater chemistry is mainly governed by rock–water interaction, indicating Ca2+–Mg2+–HCO3 is the dominant hydrochemical facies with Mg2+–HCO3 water type in the study area. The non-carcinogenic human health risk assessment shows that ~ 68% groundwater samples have shown HI > 1 and their mean value of HIingestion indicates that children are more vulnerable to NO3 and F contaminants than adults through direct ingestion of groundwater. Chemometric statistical analysis and bivariate plots between different ions revealed that in addition to the natural processes such as weathering of silicate and carbonate minerals, halite dissolution and cation exchange process, some anthropogenic factors also influence the groundwater chemistry of the region. It is recommended that the study of groundwater quality must be assessed together with RI and groundwater movement. Further, the poor water quality zones of south-western and central part must be properly treated before subjected to drinking purpose. Due to the agriculture-based economy and lack of proper education and awareness related to water quality, the potential public health risks are often ignored in the study area as well as other parts of the country. This integrative study is a noble approach to understand groundwater quality, salinity related to flow direction and pollution source extraction to demarcate the aquifer region for immediate restoration and management formulation and policy as well. The future dimensions of the study shall be focused on human health risks associated with heavy metals contamination in groundwater and soil. Regular monitoring is vital for sustainable groundwater management and long-term protection of water quality from further deterioration.