1 Introduction

Groundwater is a vital source of freshwater supply in arid and semi-arid regions to sustain agricultural, industrial and human consumption needs (El-Alfy et al., 2018). Among arid countries, Egypt reached the lowest per capita water shares in the world (approximately 550 m3 per capita per year) due to the high growth population (113 million in 2023) and the stable water budget from the Nile River (55.5 billion cubic meters annually), which represent about 97% of the total water demand (Abd Ellah, 2020). Furthermore, Egypt imports annually about 40 billion cubic meters of "virtual water" embedded in food imports to meet population needs (Ayyad & Khalifa, 2021). The most notable groundwater resources in Egypt include the Nile Valley and Delta aquifer, the Nubian Sandstone aquifer, Moghra, and the fractured limestone aquifer in the Eastern and western Deserts and Sinai Peninsula (El Alfy et al., 2019; Gemail et al., 2021; Ibrahim et al., 2023).

The Eastern Desert covers about 22% of the Egyptian territories, where groundwater represents an important resource and requires an effective assessment (Embaby et al., 2016). Groundwater of the Eastern Desert of Egypt is affected by several structural elements, which correlated with the Pan African Orogeny (e.g. fractures, folds, faults and shear zones) and subsequent tectonic reactivations, especially during Oligocene and Cretaceous times (Hussien, 2023). Groundwater resources in the Eastern Desert are scarce, and mainly occur in shallow alluvial and fracture zone aquifers, as well as sandstone and fractured carbonate aquifers (Abdel Moneim, 2005; Sultan et al., 2011). Direct recharge of these shallow alluvial and fracture zone aquifers occurs through seasonal rainfall and infrequent flash floods (Porsani et al., 2005; Ruelleu et al., 2010). The shallow alluvial and fracture zone aquifers can offer significant sources of groundwater at a local level, their utilization is restricted due to their relatively small frame and extensions and reliance on low precipitation recharge rates that occur over decades (Amer et al., 2012; Gemail et al., 2024).

Aquifer is not only a location for storing and transporting of groundwater, but also a domain for water rock interaction. In this regard, excessive release of fluoride in groundwater from different geogenic sources has been considered as a great challenge for groundwater utilization. In view of this, it is reported that 300 million people across Egypt and other arid countries 29 countries are suffering from endemic fluorosis, mainly because of the daily basis of F intake with groundwater for several human purposes (Sahoo et al., 2022). Besides, several anthropogenic activities might cause degradation and/or contamination of groundwater resource including mining, industrial sewage discharges (Gemail et al., 2017; Youssef et al., 2024), landfill sites and agricultural activities (Gemail et al., 2024; Zhai et al., 2022). For instance, a high degradation of water quality in groundwater of Gabes region, Tunisia was recorded due to the overuse of mineral fertilizers (Wederni et al., 2023). Another report showed elevated levels of arsenic, fluoride, and nitrate in the groundwater of southwestern part of New Mexico, USA (Islam, 2023). Consequently, there is an urgent need to monitor the quality of groundwater resources in arid regions for sustainable management of such limited water resources.

Overexploitation is also considered as one of the most important threats for the sustainable utilization of the groundwater aquifers in arid regions. As such, a tremendous decline by about 60m was recorded in the water level of Nubian aquifer in Egypt (El Alfy, 2013; Machiwal et al., 2018). The depletion of groundwater level is also linked with vulnerability of chemical contamination of the aquifer. For example, overexploitation of groundwater in Malwa region Punjab, India led to significant alterations in the aquifer chemistry (Kumar et al., 2023). In addition, the overuse of groundwater in Bastam region, Iran following alterations in precipitation regime (2002–2016) caused intrusion of saline water toward the aquifer due to the decline in groundwater level by about of 90 cm y−1 and water storage deficit by about 17.32 Mm3 y−1 (Bagheri et al., 2019).

The arid water storage basins have various hydrochemical environments and a zonal hydrogeochemical characters, this is due to various geological and metrological conditions (Li et al., 2019). Using of multivariate statistical methods is crucial in determining the natural processes that affect groundwater and resolving hydrochemistry problems(Gulgundi & Shetty, 2018). Several studies had been carried out to assess groundwater potentialities (Ammar & Kamal, 2018; El-Ammawy et al., 2020; Gad & Saad, 2017; Khalil et al., 2021; H. Musaed et al., 2022). The current study is carried out in the area between south-east of Beni Suef and northeast El-Minia, Egypt. Which represents the eastern desert fringes of the Nile Valley. One hundred and four groundwater samples were collected and analyzed for major, minor and trace elements to evaluate the main hydrochemical processes that control the geochemistry of groundwater. The study area is divided into three main zones, depending on geological setting, lineament map and the water geochemistry. The main goal of the present study not only relies on the hydrochemical evaluation of groundwater, but also incorporates the utilization of multivariate geostatistical technique to identify the major processes account for maximum variability in the analyzed data set such as; evaporation, mineral dissolution, precipitation, and anthropogenic factors.

2 Materials and Methods

2.1 Study Area

The study area located between south-east of Beni Suef and northeast El-Minia governorates, Egypt (longitudes of 30°48' – 31°5' E and latitudes of 28°18' – 28°52' N (Fig. 1). The studied area classified into three main zones. The elevation of first zones ranges from 9 m (asl) at the low-lying plain near the Nile River to 147 m (asl) at the southeastern part with an area of 289 km2. The second zone (188 km2) is characterized by numerous low hells in its center and its elevation ranges from 15 to 259 m (asl). The third zone (100 km2) has an elevation ranged from 51 to 240 m (asl) (Fig. 1).

Fig. 1
figure 1

Location map of the study area and water points plotted on SRTM DEM

The climate of Egypt is characterized with hot/dry summer and mildly-wet/cold winter with average values of about 14°C and 30°C during winter and summer months, respectively (Elshaeb et al., 2014). These temperatures in the inland desert areas, however, fluctuate considerably, particularly during summer, from 7°C during night up to 43°C during day (Elshaeb et al., 2014). As the study are located in arid zone and prevailing of dry wind, high evaporation rates are recorded, and they range from 4.8 mm/day in winters to 12.4 mm/day in summers (Harmsen, 2016; Khalil et al., 2021). Egypt receives extremely limited rainfall and the average precipitation rate fluctuates from few millimeters in desert areas up to 260mm year−1 in Sinai Peninsula (El Alfy & Merkel, 2006; Mahmoud, 2014). The annual precipitation rate of the studied area, however, ranges from 3 to > 50 mm year−1 (Egyptian Metrological Authority 1996). However, flash floods are the foremost natural risk since several moderate flash floods were recorded in the studied area during the last 50 years, especially in years of 1969, 1980, 1984, 1985, 1994 and 2020 (Abdel Moneim 2005; El-Saadawy et al., 2020).

2.2 Geology and Hydrogeology

The study area is characterized by a succession of Tertiary to Quaternary sedimentary rocks (Fig. 2). The middle Eocene Samalut Formation exposed in the southern part of the study area with thickness of 58m, it overlays the Minia Formation and undelaying Maghagha Formation. Its lower unit has thin-bedded, grayish white, soft-burrowed nummulitic limestone with calcareous shale at the base. Whereas, the upper unit involves pinkish white and massive to faintly graded bedding nummulitic limestone (Bishay, 1961; Said, 1960). The chalk-marl complex of middle Eocene Maghagha Formation conformably overlies the Samalut Formation and underlies the Qarara Formation with a succession of 100 to 160m thick east of Beni-Mazar and Maghagha cities (Bishay, 1966).

Fig. 2
figure 2

Geologic map of the study area

The limestone of middle Eocene Qarara Formation is unconformably underlain by the Middle Eocene Maghagha Formation. The lower part is composed of variably colored mudstone with gypsum veinlets and siltstone lenses, while the upper part is a yellowish sandy, fossiliferous limestone and marl intercalations ((Bishay, 1966; Said, 1960). The middle Eocene Fashn Formation unconformably overlies the Qarara Formation. The lower part is bedded chalky limestone with chert nodules and bands that are overlain by shale with gypsum veinlets, while the upper part formed of yellowish white limestone intercalated with marl (Bishay, 1966). The Quaternary alluvium deposits are related to Wadi courses. These deposits consist of gravel, sand and silt; however, the Nile silt and clay are deposited in Nile Valley in the western part of the study area. The Fractured limestone Eocene aquifer is tested in the study area since it is influenced by a network of faults and might play an important role in groundwater recharge (Said, 1981). It is characterized by unconfined to semi-confined conditions, which overlain by permeable alluvium sediments. This aquifer has alternative layers of shale, marl, chert bands and nodules; therefore, it represents low hydraulic potential with effective porosity of 13% (Habibah et al., 2022; Tawab, 1994). The Eocene aquifer is recharged by infiltration of water during intense rainfall occasional flash flood events (Sultan et al., 2000) (Fig. 3). In addition, seepage from River Nile and irrigation/drainage canals can recharge this aquifer (Habibah et al. 2022) (Fig. 4).

Fig. 3
figure 3

Map of the water Level for the study area

Fig. 4
figure 4

Total Dissolved solids (TDS) in the sampled groundwater, expressed in (mg/L)

2.3 Water Sampling and Analysis

Groundwater (104 samples) was collected in July 2022 (Fig. 1) with the finest spatial distribution using the techniques outlined by APHA 1995 and USGS 2007. Physicochemical characteristics of groundwater samples including temperature, dissolved oxygen (O2), redox potential (Eh), hydrogen ion concentration (pH), total dissolved solid (TDS) and electrical conductivity (EC) were measured directly in the field using HQ40d Portable Multi-Parameters Meter. This was calibrated for each specific measurement using the proper standard solution. Other chemical analyses of water samples were carried out at the Central Laboratories of National Water Research Center (NERC). Major cations calcium (Ca2+), magnesium (Mg+2), sodium (Na+) and potassium (K+) and anions chloride (Cl) and sulfate (SO42−) were determined using ion chromatography method; however, bicarbonate (HCO3) and fluoride (F) were measured by acid–base titration and ion selective electrode, respectively (Fig. 5). Trace elements concentration aluminum (Al3+), barium (Ba2+) chromium (Cr6+), copper (Cu+), iron (Fe2+), manganese (Mn2+), strontium (Sr2+) and zinc (Zn2+) were determined using ICP-MS (Inductively Coupled Plasma-Mass Spectrometer).

Fig. 5
figure 5

Concentration levels of nitrates in the sampled groundwater in (mg/L)

2.4 Hydrochemical Classification

To identify the key factors that impact hydrochemistry, it is crucial to identify the principal ions in groundwater (El Alfy et al., 2019; Karroum et al., 2017; Musaed et al., 2020; Narany et al., 2018). Gibbs (1970) put forth the idea of using TDS versus Cl/(Cl + HCO3) for anions and TDS versus Na+/(Na+ + Ca2+) for cations as a mean of illustrating the natural processes that govern groundwater chemistry. Other hydrochemical processes that could regulate groundwater quality including evaporation, precipitation, rock weathering, and rainfall recharge were also recorded (Honarbakhsh et al., 2019; Loni et al., 2015) (Fig. 6). A Piper diagram can be used as helpful technique to illustrate the hydrochemical data to categorize various types of groundwater and understand the potential sources of the dissolved constituent salts in groundwater (Domenico & Schwartz, 1997; El Alfy & Merkel, 2006; Freeze & Cherry, 1979; Sargazi et al., 2021) (Fig. 7). Besides, Durov diagram was utilized to provide a comprehensive explanation of the hydrochemical processes that impact groundwater and identifying original genesis of groundwater resources (Al-Bassam & Khalil, 2012). Durov diagram can also investigate various water types in water mixtures as well as ion and reverse ion exchange processes (Lloyd & Heathcote, 1985).

Fig. 6
figure 6

Piper diagram for three locations a, b and c

Fig. 7
figure 7

Drouve diagram for three locations a, b and c

2.5 Multivariate and Geostatistical Analyses

The use of multivariate statistical approaches in SPSSv25 is critical in detecting natural processes that impact groundwater and addressing hydrochemistry difficulties (Gulgundi & Shetty, 2018) (Fig. 8). The original data set was normalized prior to statistical analysis using the standards specified by (Davis, 2002). The geographical distributions of groundwater variables were modelled in a GIS system, while universal kriging was employed to assess parameter values in unmeasured sites. Pearson correlation coefficient was utilized to evaluate the degree of similarity between two parameters in order to examine the hydrochemical evolution of groundwater (Nair et al., 2005; Unnisa & Zainab Bi, 2017; El Alfy et al., 2019).

Fig. 8
figure 8

Gibbs diagram for three locations a, b and c

2.5.1 Factor Analysis

Factor analysis was conducted to identify the significant factors contributing to groundwater hydrochemical processes and classify the various groundwater areas (Davis, 2002; Nan et al., 2016; El Alfy et al., 2017) A total of twenty-three hydrochemical variables were utilized to establish associations between the physicochemical characteristics of the water and identify correlations (Fig. 9). The amount of variance explained by each variable was calculated by squaring the corresponding factor loading for each factor. The residual variance for each variable was computed by subtracting the sum of the variances explained by each factor from 100%. The maximized variance associated with factors was assessed using their eigenvalues, which serve as indicators of the significance of the factors. The factors with the highest eigenvalues are deemed the most significant, and any eigenvalues equal to or greater than 1.0 are considered being significant (Davis, 2002; Kim & Mueller, 1978). The extraction of the primary factors was accomplished using the varimax criterion (Kaiser, 1958), which enhanced the loadings of closely related variables within each factor. This approach facilitated the identification of easily interpretable components (Furi et al., 2012).

Fig. 9
figure 9

a: 3D representation of loadings of Factor 1, factor 2 and factor 3 after varimax rotation. B: 3D representation of loadings of Factor 4, factor 5 and factor 6 after varimax rotation. (zone 1)

2.5.2 Claustral Analysis

Cluster analysis is performed to categorize the hydrochemical system into multiple minor hydrochemical regimes (Suk & Lee, 1999). Cluster analysis is an effective tool that can be utilized to examine and comprehend the groundwater hydrochemical system by discretizing different groundwater zones that hold geological significance (Fig. 10). Cluster analysis is employed to classify water chemistry data in distinct areas and geologic aquifers (Alther, 1979; El Alfy et al., 2018). The hydrochemical data were standardized using z-standardization to ensure equal weighting, and then the samples were grouped based on their similarity to one another. The data were normalized using the following equation to ensure that all variables have the same mean and standard deviation values (Davis, 2002).

$$Zi j=(Xi-\mu i)/\sigma i$$
(1)

where Xi represents the data for the parameters i, µi and si are the mean and the standard. Cluster analysis was conducted using 23 variables. The results provided logical values for data interpretations and hydrochemical patterns that accurately represent field conditions (Fig. 11).

Fig. 10
figure 10

a: 3D representation of loadings of Factor 1, factor 2 and factor 3 after varimax rotation. B: 3D representation of loadings of Factor 4, factor 5 and factor 6 after varimax rotation. (zone 2)

Fig. 11
figure 11

a: 3D representation of loadings of Factor 1, factor 2 and factor 3 after varimax rotation. b: 3D representation of loadings of Factor 4, factor 5 and factor 6 after varimax rotation. (zone 3)

2.6 Saturation Index

The saturation indices were utilized to estimate the reactivity of minerals in the groundwater. It is possible to predict the presence of reactive minerals in aquifer media by analyzing groundwater without the necessity of collecting solid phase samples for direct mineralogical analysis (Deutsch, 2020) (Table 1). The saturation index (SI) is a measurement of the interaction between groundwater and rock. The PHREEQC software was employed to calculate the SI values. The saturation index (SI) for a specific mineral can be defined as follows:

$$\mathrm{SI}=\mathrm{ log}\frac{\mathrm{KIAP}}{\mathrm{Ksp}}$$
(2)

where KIAP is the ion activity product, and Ksp is the solubility product with respect to the temperature of the water sample (Table 2). When the saturation index (SI) is positive, it indicates that there is supersaturation or the possibility of precipitation of secondary minerals. A negative saturation index (SI) indicates undersaturation, meaning that the dissolution of minerals is necessary to attain equilibrium. A saturation index of 0.5 signifies equilibrium conditions (Parkhurst & Appelo, 1999).

Table 1 Salinity value of the different clusters (Zone 1)
Table 2 Salinity value of the different clusters (Zone 2)

3 Results and Discussion

3.1 Spatial Data Distribution

One hundred and four samples were collected from the study area (Table 3). Based on the lineament map, cluster analysis, water types, and geologic map, the study area was divided into three zones. The first zone has a high lineament density in its central part, the second zone exhibits a similar high density in its eastern part, while the third zone has high lineament density in the western and northern parts (Appendix Fig. 13). The first zone is situated in the southern part of the study area and comprises sixty-eight groundwater samples (Fig. 3). The depth to water in this zone ranges from 4 to 75m, while the total depth of the wells ranges from 20 to 227m. Additionally, the water level in wells varies from 6 to 58m (Fig. 3). The second zone is located between the two other zones and consists of twenty-five groundwater samples (Fig. 3). The depth to water in this zone varies from 22 to 90m, and the total well depth ranges from 120 to 370m. Additionally, the water level in wells ranges from 10 to 56 m (Fig. 3). The third zone is positioned in the northern part of the study area and comprises eleven groundwater samples (Fig. 3). The depth to water in this zone ranges from 50 to 100m, while the total well depth varies from 96 to 400m. Additionally, the water level in the wells ranges from 4 to 44 m (Fig. 3). It is important to note that the depth to the water table increases from the southern to the northern part of the large study area. It is noticed that groundwater salinity increases northeastward, where the groundwater becomes deeper. Meanwhile, shallow groundwater with low salinity is recorded in the south and west areas, where the aquifer is recharged with fresh Nile water.

Table 3 Salinity value of the different clusters (Zone 3)

3.2 Salinity Distribution

Groundwater quality was characterized by low to moderate salinity levels with TDS values ranging from 263 to 3370 mg L−1 and average of 924 ± 603.39 mg L−1. Forty-five samples (66%) were above the desirable limit of WHO 2017(600 mg L−1). Those areas with high TDS values were primarily observed in the north-east, south-west, and central parts of the study area (Fig. 4 and Fig. 12). The second zone is characterized by moderate to high salinity, with TDS concentrations ranging from 645 to 1462 mg L−1 and averaged as 773 ± 160.92 mg L−1. All the sample values (100%) exceeded the permissible limit prescribed by WHO 2017 (600 mg L−1), indicating that the water in this zone is unsuitable for drinking. The highest values of TDS were observed in the northern part, with smaller concentrations also present in the western and eastern regions of the study area (Fig. 4). The third zone exhibited the highest salinity levels. The TDS values in this zone ranged from 1025 to 1982 mg L−1, with a mean of 1629 mg L−1. Values of all sample exceeded the desirable limit set by WHO 2017 (600 mg L−1), indicating that the water in this zone is unsuitable for drinking. The high concentrations of TDS extended from the northern to southern parts, indicating a discharge area, while the low values were observed in the western and eastern parts of the study area, suggesting recharge sources from the Nile River and rainfall (Fig. 4). Based on the salinity results, it was observed that the salinity increased towards the north with increasing groundwater depth. The elevated TDS values were associated with geogenic processes, specifically water–rock interactions, where groundwater deeply percolated and dissolute aquifer minerals when it flows from the Red Sea hills to reach the Nile valley. This leads to higher ion concentrations in groundwater, such as the dissolution of evaporite and carbonate rocks, as well as ion exchange. Additionally, anthropogenic processes, such as irrigation return flow, contributed to the increased salinity levels in the groundwater, spatially in the irrigated area with groundwater (Appendix Fig. 14).

3.3 Groundwater Contamination with Nitrate

Values of NO3 in the first zone ranged from 0.10 to 61.8 mg L−1 with a mean of 10.01 mg/l (Fig. 13). Only two samples (3%) exceeded the permissible limit prescribed by WHO 2017 (50 mg/l). The eastern part of this zone exhibited the highest concentrations of NO3 (Fig. 5). Meanwhile, all samples of NO3 in the second zone were within the desirable limit of WHO and their values ranged from 0.28 to 4.16 NO3, with an average of 0.99 mg L−1. The highest concentrations of NO3 were observed in the north and north-east parts of this zone (Fig. 5). Values of NO3 in the third zone were further within the permissible limit prescribed by WHO with a range of 4.17–19.65 mg L−1 and average value of 10.54 mg L−1. The highest NO3 concentrations, however, were observed in the north-western part of this zone (Fig. 5). It could be observed that agricultural activities are the most important source influencing groundwater quality in the first zone since it is close to the Nile River and characterized by shallow depth and flood irrigation. The higher concentration of nitrate is recorded in the south with shallow low TDS groundwater (Fig. 4 and Fig. 5). However, NO3 concentrations decreased towards the north, where the groundwater depth increased and the nitrate is oxidized during its flow through the vadose zone (excess of oxygen).

3.4 Groundwater Origin and Hydrochemical Facies

3.4.1 Piper Diagram

Piper diagram was used in order to determine the main hydrochemical facies. In the first zone, three distinct hydrochemical facies, comprising six diverse groundwater types, were identified (Fig. 6a and Fig. 14). The majority of groundwater samples (72%) belonged to the Na+-Cl type (facies); however, only one sample (1.4%) was related to the Na+-SO42− type, and another sample (1.4%) was related to the Na+-Ca2+-SO42− type. Fifteen groundwater samples (22%) were categorized as Na+-HCO3 type. However, only one sample (1.4%) was related to the Na+-Cl-HCO3 type and another one (1.4%) was classified as Ca2+-HCO3 type. Meanwhile, five different groundwater types were identified in the second zone (Fig. 6b). Only one groundwater sample (4%) belonged to the Na+-Cl type. Water types are categorized as transitional or mixed types, starting from Na+-Mg2+-Cl-HCO3 types (seven samples, 28%), followed by Na+-Ca2+-Mg2+-Cl-HCO3 types (five samples, 20%), Na+-Mg2+-Ca2+-Cl-HCO3(eleven samples, 44%), and Na+-Ca2+-Cl-HCO3 (one sample, 4%). Five water types were identified in the third zone, in which two samples (18%) were related to Na+-Cl type and the other three samples (27%) belonged to the Na+-Mg2+-Cl type (Fig. 6c). Additionally, the Na+-Ca2+-Cl type was represented by three samples (27%), with Ca2+ being present but Mg2+ absent. Furthermore, there were two samples (18%) of the Na+-Ca2+-Cl-SO42− type, or mixed types, in which SO42− was present. Finally, one groundwater sample belonged to the Na+-Ca2+-Cl-HCO3 type. These types indicate certain hydrochemical processes, such as the dissolution of evaporate and carbonate rocks (including anhydrite, gypsum, halite, calcite, and dolomite) and ion exchange.

3.4.2 Drouve Diagram

The groundwater samples of the first zone were categorized into four fields (NO. 2, 5, 6, and 9) (Fig. 7a). Seven samples (10%) fell into the Mg2+-HCO3 facies, indicating influence from meteoric groundwater recharge. Eight samples (12%) were plotted along the dissolution or mixing line, suggesting the mixing of groundwater from two or more different facies. This observation suggested the presence of two different sources of groundwater, possibly from the Nile River and rainfall precipitation. Seventeen samples (25%) were classified as Na+-SO42− facies, indicating gypsum and halite dissolution. These facies may form when the excess Na+, after its connection with Cl, pairs with the excess SO42−, left after its connection with Ca2+ and Mg2+. Thirty-six samples (52%) fell into the Na+-Cl facies, which points to ion exchange and high halite dissolution, a conclusion supported by the saturation index. In the second and third zones, all groundwater samples fell into field NO. 9 and were plotted along the dissolution or mixing line, indicating the mixing of groundwater from two or more different facies (Fig. 7b, c). This suggests the presence of two distinct sources of groundwater or mixed water, likely originating from the Nile River and rainfall. Furthermore, the dissolution of evaporate and carbonate rocks, as well as ion exchange, played a significant role in shaping the chemical composition of the groundwater samples.

3.4.3 Saturation Indices and Geochemical Processes

The average saturation index values of the existing minerals, such as anhydrite, halite, and gypsum mineral phases, were negative. This finding pointed to the dominance of under-saturation conditions for these minerals (Table 4). These minerals tend to be in the dissolved phases, leading to the expansion of open spaces and fractures in the aquifer. As a result, the porosity and permeability of the aquifer increase, facilitating better groundwater flow and storage. While the Saturation Index (SI) of dolomite and calcite was above saturation conditions, indicating the precipitation of secondary minerals. This observation provides valuable information about the flow of water, which was from the south to the north.

Table 4 Average cluster saturation indices (SI) of the different mineral phases

Data presented on the Gibbs diagram (Fig. 8a, b, and c) demonstrate the relationship between (Na+ + K+)/(Na+ + K+ + Ca2+) versus TDS and (Cl)/(Cl + HCO3) versus TDS. This diagram reveals the mechanism governing groundwater chemistry. In the first zone, the groundwater samples fall along a transition zone between rock dominance and evaporation dominance (Fig. 8a). This indicates that both of these processes play an important role in governing the groundwater chemistry. However, since the depth of groundwater in this zone is above 20m, the influence of evaporation on groundwater chemistry must be minimal. Therefore, rock-water interaction is the main dominating factor influencing groundwater chemistry in this area. In the second zone, the majority of samples (96%) fall within the region dominated by rock-water interaction (Fig. 8b). Only one sample was in the region of evaporation precipitation. This indicates that the chemical compositions of groundwater in this zone are primarily influenced by rock-water interaction. In the third zone, the majority of samples fall within the region of evaporation precipitation (Fig. 8c). However, since the depth to groundwater in this zone is above 90m, the influence of evaporation on groundwater chemistry must be minimal. Consequently, rock-water interaction remains the main dominating factor influencing groundwater chemistry in this area.

The relations between Ca2+/Na+ vs HCO3/Na+ and Ca2+/Na+ vs Mg2+/Na+ were established. It was found that in all three zones, groundwater chemistry is primarily influenced by silicate weathering and evaporation dissolution. (Appendix Fig. 15).

3.4.4 Multivariate Statistics

Pearson Correlation coefficient matrix was computed for the hydrochemical parameters. In the first zone, the hydrochemical parameters O2, Eh, temperature and pH exhibited negative and poor correlation with all other parameters (Appendix Table 5). However, strong correlations were observed between TDS and EC with Ca2+, Na+, Cl, and SO42−, as well as moderate correlations with K+, Mg2+, and NO3. Moreover, there was a strong correlation between Ca2+ and Na+, NO₃, and SO₄2−, and a moderate correlation with K+, Mg2+, and Cl. Additionally, K+ displayed a strong correlation with Mg2+ and Cl, and a moderate correlation with Na+ and SO₄2−. Likewise, Mg2+ showed a strong correlation with Na+ and Cl, while Na+ exhibited strong and moderate correlations with Cl and SO₄2−, respectively. Furthermore, Cl demonstrated a moderate correlation with SO₄2− and NO₃ displayed a strong correlation with SO₄2−. On the other hand, pH showed negative correlations with all other parameters.

In the second zone, the hydrochemical parameters (O2, Eh, and pH) exhibited negative and poor correlation with all other parameters (Appendix Table 6). However, temperature showed a strong correlation with F and NO3, and a moderate correlation with TDS and EC. Furthermore, TDS and EC demonstrated strong correlations with Ca2+, Mg2+, Na+, F, Cl, and SO42−, as well as moderate correlations with NO3 . Additionally, a strong correlation was observed between Ca2+ and Na+, and Ca2+ with Cl, while a moderate correlation existed between Ca2+ and K+ and F. Fluoride F showed strong correlations with Cl, NO3, and SO42−, and a moderate correlation with Ca2+ and Cl. Moreover, Mg2+ displayed strong correlations with Na+, F, Cl, and SO42−, and a moderate correlation with NO3. Similarly, Na+ had a strong correlation with F, Cl, and SO42−. Additionally, Cl exhibited a strong correlation with SO42−. Furthermore, a strong correlation was observed between Cr6+ and Sr2+, and a moderate correlation with Fe2+.

In the third zone, O2 displays a moderate correlation with TDS, Mg2+, Na+, and Cl (Appendix Table 7). However, Eh shows poor correlation with all other parameters. The temperature exhibits a moderate correlation with Ca2+ and Al3+, while the pH shows a strong correlation with K+, Fe, and Zn2+, and a moderate correlation with Mg2+, Cl, and Al3+. Additionally, EC and TDS demonstrate strong correlations with Na+, Cl, and SO42−, and a moderate correlation with Ca2+, K+, Mg2+, Cr6+, and Cu+. Ca2+ shows strong correlations with SO42− and Al3+, and a moderate correlation with Na+, Cl, Cr6+, Cu+, and Mn2+. K+ exhibits strong correlations with Cl, Fe, and Zn2+, and a moderate correlation with Mg2+, Na+, SO42−, and Al3+. Moreover, Mg2+ also has a moderate correlation with Na+ and Cl, while Na+ shows a strong correlation with Cl and SO42−. While F and NO3 display negative correlations with most of the parameters.

Based on the previous correlation analysis, it is evident that TDS, being the summation of all dissolved components in groundwater, shows a strong correlation with the majority of the major ions. Moreover, the increasing values of EC are associated with higher TDS values, indicating a very strong correlation between these two parameters. The strong correlation of Ca2+ with Na+, Cl, and SO42−, and the moderate correlation with Mg2+, highlights the significant contribution of gypsum, halite, and dolomite dissolution in the chemical composition of groundwater. Additionally, the positive correlations between K+ with Mg2+, Na+, Cl, and SO42+ in the first and third zones indicate that these minerals are popular sources contributing to the groundwater composition. Notably, the major source of K+ in groundwater primarily comes from surface discharges, which underscores the significant contribution of surface sources to the presence of this mineral in groundwater. Furthermore, the strong correlation between Mg2+ with Na+ and Cl indicates ion exchange of minerals between rocks and water. Moreover, the strong correlation of Na+ with Cl and SO₄2− may be associated with a long history of evaporation, suggesting the dissolution of gypsum and halite. Although the major ions also show relatively strong correlations with each other, it may not necessarily be related to the same geochemical processes. For instance, the increase in the concentration of SO42− with Na+, Cl with F, and NO3 with SO42− may not originate from the same sources. On the other hand, the trace elements displayed poor correlation with all other parameters, except for a moderate correlation between Fe and Al3+. The main sources of these trace elements in the aquifer are likely attributed to the weathering of rocks bearing these minerals.

3.4.5 Factor Analysis

Twenty-three variables (O2, Eh, Tem, pH, EC, TDS, Ca2+, K+, Mg2+, Na+, F, Cl, NO3, SO42−, HCO3, Al3+, Ba2+, Cr6+, Cu+, Fe2+, Mn2+, Sr2+, and Zn2+) were analyzed, and the factors, eigenvalues, proportion of variance, and rotated factor loadings were explained.

In the first zone, the factor analysis shows six important factors with eigenvalues > 1 (Appendix Table 8). The factors represent a total variance of 73.50%, with the smallest eigenvalue being 1.045 (Appendix Table 8). Factor 1 accounted for about 28.57% of the total variance and had an eigenvalue of 7.48 (Appendix Table 8). It exhibited strong loadings with EC, TDS, Ca2+, K+, Mg2+, Na+, Cl, and SO₄2− (Appendix Table 8 and Fig. 9a), ranging from 0.715 to 0.933. This factor indicates the influence of geogenic processes on the chemical composition of groundwater, such as the dissolution of gypsum, halite, dolomite, calcite, and ion exchange. Factor 2 accounted for 12.60% of the total variance and had an eigenvalue of 3.02 (Appendix Table 8). It displayed high to moderate loadings with pH, Al3+, Fe2+, and Zn2+, with values of 0.579, 0.793, 0.809, and 0.760, respectively (Appendix Table 8 and Fig. 9a). This factor indicates an association with the alkaline nature and the processes of iron ion dissolution and precipitation. Factor 3 accounted for 11.97% of the total variance and had an eigenvalue of 2.09 (Appendix Table 8). It showed moderate loadings with O2, F, NO3, and Cu+, with values of 0.508, 0.753, 0.641, and 0.616, respectively (Appendix Table 8 and Fig. 9a). This factor confirms that an oxygenated environment contributes to higher nitrate concentrations. The lithology of the study region alone does not fully explain the raised concentration of fluoride. Instead, factors such as the leakage of fluoride-bearing minerals, concentrated irrigation practices, and the presence of clay minerals likely contributed to the high fluoride loadings. Similarly, the lithologic sources in the study area do not significantly contribute to the increased concentration of nitrate. Instead, the main source of nitrate was identified as the irrigation return flow. Factor 4 accounts for 8.67% of the total variance and has an eigenvalue of 1.91 (Appendix Table 8). It exhibits moderate loadings with Eh, HCO3, and Mn2+, having values of 0.574, 0.699, and 0.664, respectively (Appendix Table 8 and Fig. 9b). This factor is associated with the interaction of water and its surrounding environments, involving chemical and biological processes corresponding to the aquifer redox sequences. The moderate loading of HCO3 suggests meteoric groundwater recharge. The inverse relationship between Eh and pH reflects redox reactions, including nitrogen extractions, as higher pH values can promote these processes. This can be clearly observed from the high positive loading on Eh and the negative loading on pH. Factors 5 and 6 accounted for 6.26% and 5.43% of the total variance, with eigenvalues of 1.35 and 1.04, respectively (Appendix Table 8). Factor 5 exhibited moderate and high loadings with Ba2+ and Cr6+, having values of 0.53 and 0.76, respectively (Appendix Table 8 and Fig. 9b). These significant loadings of Ba2+ and Cr6+ indicate weathering of mineral-bearing layers. On the other hand, Factor 6 showed only a negative loading with temperature, with a value of -0.839. This factor suggests that the temperature was less influenced by human activities and may be indicative of cool recharge water mixing with the aquifer water.

Fig. 12
figure 12

Clustral analysis for three location a, b and c

In the second zone, six factors were determined, all having eigenvalues > 1 (Appendix Table 9). These factors account for a total variance of 83.31%, with the smallest eigenvalue being 1.09 (Appendix Table 9). Factor 1 explains the largest portion of the variance, accounting for 32.33% of the total variance and an eigenvalue of 8.60 (Appendix Table 9). It exhibits strong loadings with EC, TDS, Mg2+, Na+, F, Cl, and SO42−, and moderate loadings with Ca2+ and NO3 (Appendix Table 9 and Fig. 10a). This factor suggests significant hydrochemical processes or geogenic influences, including ion exchange and the dissolution of gypsum, halite, anhydrite, calcite, and dolomite. The presence of nitrates in groundwater is mainly ascribed to agricultural fertilizer usage. The moderate loading of NO3 indicates the impact of return irrigation flow. Moreover, this factor implies that human activities play a significant role in governing the chemical composition of groundwater in this zone. Factor 2 emphasizes the significant influence of Nile and rainfall water on groundwater recharge, as evidenced by its strong loading with pH and moderate loading with HCO3, which typically occur under alkaline conditions (Appendix Table 9 and Fig. 10a). This factor explains 15.59% of the total variance and has an eigenvalue of 3.43 (Appendix Table 9). Furthermore, it indicates the dissolution of HCO3 in the aqueous phase and its subsequent precipitation as CO32− in the solid phase. Factor 3 accounts for 12.92% of the total variance and has an eigenvalue of 2.82 (Appendix Table 9). It exhibits high loadings with Cr6+, Fe2+, and Sr2+ (Appendix Table 9 and Fig. 10a), indicating the weathering of layers containing these minerals. On the other hand, Factor 4 accounts for 8.87% of the total variance and has an eigenvalue of 1.77 (Appendix Table 9). It shows high loadings with Mn2+ and Zn2+, along with a moderate loading with K+ (Appendix Table 9 and Fig. 10b). This factor is associated with the weathering of clay minerals from surface recharge and the bearing layers of Mn2+ and Zn2+. Additionally, it may also be influenced by agricultural activities involving the use of potassium and agricultural pollution. Factor 5 accounts for 6.96% of the total variance and has an eigenvalue of 1.46 (Appendix Table 9). It exhibits a strong loading with Eh and a moderate loading with Al3+ (Appendix Table 9 and Fig. 8b). On the other hand, Factor 6 explains 6.65% of the total variance and has an eigenvalue of 1.09 (Appendix Table 9). It shows high loading with Cu+ and a moderate loading with Ba2+ (Appendix Table 9 and Fig. 10b). This factor suggests the weathering of layers containing these minerals, which could be attributed to geogenic and anthropogenic sources.

In the third zone, five factors were identified, all having eigenvalues > 1 (Appendix Table 10). These factors account for a total variance of 89.88%, with the smallest eigenvalue being 1.64 (Appendix Table 10). Factor 1 explains a significant portion of the variance, accounting for 29.14% of the total variance and having an eigenvalue of 8.69 (Appendix Table 10). It exhibits strong loadings with O2, EC, TDS, Na+, Cl, and SO42−, and moderate loadings with Mg2+ and Cr6+ (Appendix Table 10 and Fig. 11a). This factor is indicative of geogenic processes, such as gypsum, halite, anhydrite, and dolomite dissolutions, as well as ion exchange and weathering of minerals from the bearing layers. Factor 2 accounts for 20.71% of the total variance and has an eigenvalue of 4.24 (Appendix Table 10). It exhibits strong loadings with pH, K+, Fe2+, and Zn2+ (Appendix Table 10 and Fig. 11a). Alkaline water dominated the study area, and the weathering of clay minerals serves as the primary source of K+ in groundwater. On the other hand, Factor 3 represents 16.25% of the total variance and has an eigenvalue of 3.34 (Appendix Table 10). It shows strong loadings with Tem, Ca2+, and Al3+ (Appendix Table 10 and Fig. 11a). This factor is associated with gypsum and calcite dissolution. Factor 4 explains 14.71% of the total variance and has an eigenvalue of 2.77 (Appendix Table 10). It shows strong loadings with Ba2+, Cu+, and Mn2+ (Appendix Table 10 and Fig. 11b), indicating weathering of the bearing layers containing these minerals, likely due to geogenic processes. Anthropogenic influences are absent in this factor. On the other hand, Factor 5 accounts for 9.07% of the total variance and has an eigenvalue of 1.64 (Appendix Table 10). It exhibits moderate loadings with NO3 and HCO3 (Appendix Table 10 and Fig. 11b). This factor suggests a recharge area from the Nile River and Rainfall.

3.4.6 Cluster Analysis

In the first zone, we applied the ward linkage method using z-standardized input data from 68 samples and 23 parameters (Fig. 12a). This analysis resulted in the identification of five major clusters. Cluster 1 comprises 12 groundwater samples, with TDS values ranging from 980 to 3370 mg L−1, an average of 2019 mg L−1, and a standard deviation of 579 (Table 1). The water types within this cluster are characterized by Na+-Cl and Na+-SO42−. These patterns suggest the influence of geogenic processes, particularly halite and gypsum dissolution. Various mineral saturation indices were analyzed for all clusters. For all five clusters, the water was oversaturated for calcite and dolomite, and undersaturated for anhydrite, gypsum, and halite (Appendix Table 11). In contrast, Cluster 2 consists of 7 groundwater samples (Fig. 9a), with TDS values ranging from 786 to 1260 mg L−1, a mean of 987 mg L−1, and a standard deviation of 190 (Table 1). The water types in this cluster are solely associated with Na+-Cl. This indicates a discharge area with a notable presence of halite dissolution. Cluster 3 comprises 9 groundwater samples (Fig. 12a), with TDS values ranging from 431 to 1210 mg L−1, an average of 742 mg L−1, and a standard deviation of 247 (Table 1). The water types within this cluster include Na+-Cl, Na+-Ca2+-SO4,2− and Na+-HCO3, indicating a connection to meteoric recharge and halite/gypsum dissolution processes. On the other hand, Cluster 4 includes 12 groundwater samples (Fig. 12a), with TDS values ranging from 263 to 679 mg L−1, an average of 475 mg L−1, and a standard deviation of 128 (Table 1). This cluster is characterized by low TDS values. The water types associated with this cluster are Na+-HCO3 and Na+-Cl-HCO3, with only one sample showing a relation to Na+-Cl type. These findings suggest a direct recharge area from the Nile River and rainfall. The five clusters encompass 27 groundwater samples (Fig. 12a). TDS values in these samples range from 270 to 1166 mg L−1, with a mean of 710 mg L−1and a standard deviation of 260 (Table 1). The predominant water types are Na+-Cl, with three samples showing Na+-HCO3characteristics. Halite dissolution appears to be the dominant process influencing the water composition.

In the second zone, we analyzed twenty-five samples with 23 parameters. The Furthest Neighbor method was applied, resulting in the identification of three major clusters. The first cluster comprises 4 groundwater samples (Fig. 12b) with TDS values ranging from 688 to 855 mg L−1, an average of 737 mg L−1, and a standard deviation of 79.52 (Table 2). This cluster displays two water types, specifically mixed types with Na+-Mg2+-Ca2+-Cl-HCO3and Na+-Ca2+-Mg2+-Cl-HCO3 compositions. These patterns indicate ion exchange between Na+, Ca2+, and Mg2+, along with dolomite, calcite, and halite dissolutions. The saturation values indicated that calcite and dolomite were in the upper saturation state, while gypsum and halite were in the undersaturation state (Appendix Table 12). The second cluster comprises 6 groundwater samples (Fig. 12b). TDS values in this cluster range from 709 to 912 mg L−1, with an average of 816.5 mg L−1and a standard deviation of 88.41 (Table 2). This cluster exhibits two water types: Na+-Mg2+-Ca2+-Cl-HCO3 and Na+-Mg2+-Cl-HCO3. The dominant elements in these samples are Na+, Mg2+, Cl, and HCO3. This cluster is indicative of a mixture of saline water and fresh recharge water from the Nile River and rainfall. The saturation index (SI) values also indicated upper saturation for calcite and dolomite, and undersaturation for gypsum and halite (Appendix Table 12). The third cluster comprises 14 groundwater samples (Fig. 12b). TDS values in this cluster range from 645 to 810 mg L−1, with a mean of 715.9 mg L−1and a standard deviation of 45.11 (Table 2). The groundwater types observed in this cluster include Na+-Ca2+-Mg2+-Cl-HCO3, Na+-Mg2+-Ca2+-Cl-HCO3, Na+-Mg2+-Cl-HCO3, and Na+-Ca2+-Cl-HCO3. These patterns are indicative of gypsum, halite, calcite, and dolomite dissolutions, as well as rock-water interactions involving ion exchange processes. Moreover, the cluster suggests a mixing of recharge sources, including water from the Nile River and rainfall. The saturation index values showed undersaturation for gypsum and halite, and upper saturation for calcite and dolomite (Appendix Table 12).

In the third zone, we utilized eleven samples with 23 parameters and the ward method to identify three major clusters. The first cluster, containing 3 samples (Fig. 12c), showed TDS values ranging from 1025 to 1242 mg L−1, with a mean of 1136 mg L−1 and a standard deviation of 108.6 (Table 3). The saturation values indicated oversaturation for calcite and dolomite and undersaturation for gypsum and halite (Appendix Table 13). This cluster displayed two water types, Na+-Ca2+-Cl-SO42− and Na+-Ca2+-Cl-HCO3, suggesting a mixing of fresh and saline water from different sources, such as the Nile River and rainfall. Meanwhile, the second cluster comprised 2 samples (Fig. 12c), with TDS values ranging from 1893 to 1982 mg L−1, an average of 1937, and a standard deviation of 62.93 (Table 3). The dominant water type in this cluster was Na+-Ca2+-Cl, indicating halite and calcite dissolution. Saturation values indicated undersaturation for gypsum and halite, and upper saturation for calcite and dolomite (Appendix Table 13). On the other hand, the third cluster included 5 samples (Fig. 12c), showing TDS values ranging from 1663 to 1938 mg L−1, with an average of 1780 mg L−1and a standard deviation of 100 (Table 3). This cluster exhibited three water types: Na+-Ca2+-Cl, Na+-Cl, and Na+-Mg2+-Cl. Saturation values indicated upper saturation for calcite and dolomite, and undersaturation for gypsum and halite (Appendix Table 13). These patterns suggested high dissolution of halite, dolomite, and calcite, as well as ion exchange processes.

4 Conclusion

For this research, hydrochemical investigations, incorporating analyses of geochemistry, multivariate statistics, and geostatistics, were conducted to gain insights into the elements affecting groundwater geochemistry and the potential for pollution. A comprehensive analysis was performed on 104 groundwater samples, examining twenty-three physical, chemical, and trace variables for each sample. Based on the lineament map, cluster analysis, water types, and geologic map, the study area was divided into three main zones. The first zone has a high lineament density in its central part, the second zone exhibits a similar high density in its eastern part, while the third zone has high lineament density in the western and northern parts. Spatial variations in total dissolved solids (TDS) and nitrate concentration, influencing groundwater quality, were mapped. A clear spatial pattern in water table depth emerged, increasing from south to north in the study area. It is noticed that groundwater salinity increases northeastward, where the groundwater becomes deeper. Meanwhile, shallow groundwater with low salinity is recorded in the south and west areas, where the aquifer is recharged with fresh Nile water. The elevated TDS values were associated with geogenic processes, specifically water–rock interactions, where groundwater deeply percolated and dissolute aquifer minerals when it flows from the Red Sea hills to reach the Nile valley. This leads to higher ion concentrations in groundwater, such as the dissolution of evaporite and carbonate rocks, as well as ion exchange. Additionally, anthropogenic processes, such as irrigation return flow, contributed to the increased salinity levels in the groundwater, spatially in the irrigated area with groundwater. A significant correlation was observed between NO3 levels and agricultural activity. The first zone, near the Nile with small depths and flood irrigation, had higher NO3 levels. Moving north with increased depth and drip irrigation, NO3 levels decreased, indicating agriculture as the primary NO3 source in the groundwater in the first zone. The Piper diagram classification revealed three zones and six distinct groundwater types (facies) in the first zone, five types in the second zone, and five types in the third zone. These results imply the existence of particular hydrochemical processes, involving the dissolution of evaporate and carbonate rocks such as anhydrite, gypsum, halite, calcite, and dolomite, along with ion exchange. The predominant processes included meteoric water recharge, where saline water from the Nile River and rainfall played a substantial role. In the drouve diagram classification, first-zone groundwater samples aligning with fields No. 2, 5, 6, and 9 signify meteoric groundwater recharge and the presence of dual sources marked by chemical imprints from the Nile river and rainfall precipitation. This conclusion finds reinforcement in ion exchange and substantial halite dissolution, corroborated by the saturation indices. Conversely, second and third zone samples uniformly falling into field No. 9 and tracing along the dissolution or mixing line indicate the amalgamation of groundwater from two or more distinct facies. This implies the existence of two diverse groundwater sources or mixed water, deriving from the Nile river and rainfall. Moreover, the impactful role of evaporate and carbonate rock dissolution, coupled with ion exchange, is evident in shaping the chemical composition of groundwater samples. The average saturation index values for minerals, including anhydrite, halite, and gypsum, were consistently negative, indicating prevalent under-saturation conditions. In contrast, the saturation index for dolomite and calcite indicated upper saturation conditions, implying the precipitation of secondary minerals. These findings offer valuable insights into the current chemical conditions of the groundwater. Additionally, through Gibbs diagram analysis and the relations between Ca2+/Na+ vs HCO3/Na+ and Ca2+/Na+ vs Mg2+/Na+, it became evident that rock-water interaction significantly dominates as the primary factor influencing groundwater chemistry. The correlation coefficients align with the saturation indices of aquifer minerals, serving as a tool to discern various processes influencing groundwater hydrochemistry, including the dissolution of aquifer minerals and ion exchange. Six factors were identified, describing more than 73.50%, 83.31%, and 89.88% of the total data variance for Zones 1, 2, and 3, respectively. These factors elucidate the chemical attributes of the groundwater, arising from interactions between rock and water, the blending of waters from diverse sources, and, to some extent, anthropogenic influences. Cluster analysis shows that there are five, three, and three significant clusters of samples for Zones 1, 2, and 3, respectively. This indicates that the original groundwater was uniquely influenced by different processes and recharge sources. These results could contribute to more effective water resource management at a regional level and may be applicable to numerous areas with comparable conditions.