Introduction

It is generally agreed that geological, environmental, and geochemical factors play an important role in the occurrence, mobilization, transport, and fate of potentially toxic elements (PTEs) in rivers, groundwater, and other streams and waterways (Guo et al. 2014; Papazotos et al. 2020). The presence of PTEs in the environment, including water supplies, may have been introduced either naturally or by human activities. Various rocks, and consequently the soils/sediments composed of weathering products of source rocks, may be particularly rich in PTEs, as is the case with ultramafic rocks enriched in Cr (Oze et al. 2004), Ni (Kelepertzis et al. 2013), Co (Vithanage et al. 2014), and As (Ryan et al. 2011). However, geochemical conditions have a significant influence on the mobility of PTE in water resources (Kotaś and Stasicka 2000). The Eh/pH system, precipitation/dissolution reactions, sorption, vegetation, and complexation are critical geochemical parameters that need further study in a dynamic and ever-changing system such as groundwater. One question that has preoccupied geoscientists and engineers in recent years is the following: What are the possible natural and anthropogenic sources of PTEs in the groundwater of a study area? (Papazotos 2021). Several integrated and coupled methods have been developed, in particular multivariate statistics (i.e., factor analysis using principal component analysis-PCA and hierarchical cluster analysis-HCA methods -Vasileiou et al. 2019; Papazotos et al. 2020), stable multi-isotopic signatures (e.g.., δ53Cr, 87Sr/86Sr, 206Pb/204Pb, 207Pb/204Pb, 208Pb/204Pb, δ11B, δ15NΝΟ3, and δ18OΝΟ3 - Puig et al. 2017) and machine learning algorithms (e.g., random forest, logistic regression, support vector machine, artificial neural network, bagging, etc.. - Ruidas et al. 2021; Ruidas et al. 2022; Egbueri and Agbasi 2022) are widely used to determine the origin of PTEs in groundwater. However, there is currently no methodological framework that can definitively determine the origin of each PTE based on the specific geochemical characteristics that determine its concentration in natural waters.

In addition to the approaches described above, the calculation of ionic ratios is an important technique for evaluating geochemical processes in the aquifer and provides useful information about hydrogeochemistry and the origin of solutes. Among the different ion ratios, Na/Cl, Cl/HCO3, Mg/Ca, Na/Ca, Cl/SO4, (Ca+Mg)/(HCO3+SO4), Cl/NO3 are the most commonly used and are mainly used for a deeper understanding of the main geochemical processes such as salinization (Abdalla 2016; Alcala and Custodio 2008), ion exchange (Abdalla 2016), water-rock interaction (Papazotos et al. 2020), and N-cycle processes (Chen et al. 2009). More specifically, higher or lower Cl/HCO3 values indicate groundwater salinization due to saltwater intrusion or freshwater recharge, respectively (Revelle 1941). Alternative ion ratios such as Cl/NO3 are used to distinguish the source of N in water resources; higher Cl/NO3 values could indicate input from manure or/and sewage input, while lower Cl/NO3 values could indicate N-containing or mixed fertilizers as the main source of NO3- in an aquifer (Chen et al. 2009). However, the use of ionic ratios is limited to studying the extent of saltwater intrusion or freshwater recharge in an aquifer, although it would be very interesting to use them to evaluate the presence of PTEs in groundwater. Another limitation is that the studies using ionic ratios refer only to the chemistry of major ions, while trace elements have been used in only a few studies. For example, Alcala and Custodio (2008) found that Cl/Br molar ratio is a reliable tracer for determining the source of groundwater salinity. Therefore, trace element dynamics in an ionic ratio approach involving more than one ionic ratio could reveal hidden trends indicative of the origin of PTEs in groundwater, a method that is not yet widely used but whose application could be an important additional tool for delineating important contaminants.

Since 2010, elevated groundwater concentrations of PTEs, particularly Cr, have been reported in the Psachna basin, central Euboea, Greece (Megremi 2010). These concentrations have been attributed to both natural sources, such as dissolution of ultramafic rocks (Papazotos et al. 2019), and anthropogenic influences, such as intensive overfertilization (Remoundaki et al. 2016; Papazotos et al. 2019) and saltwater intrusion (Remoundaki et al. 2016; Eliopoulos et al. 2021). The source of PTEs in Psachna Basin groundwater has been determined using multivariate statistical analyses (i.e., factor analysis and HCA) (Megremi et al. 2019; Papazotos et al. 2019) and stable isotope signatures of 53Cr (Economou-Eliopoulos et al. 2014; 2017). However, these widely used techniques have their limitations, highlighting the need to develop a holistic methodology that uses chemical analyses to separate anthropogenic and geogenic factors for each PTE. Although numerous geochemical studies have been carried out, holistic geochemical modelling, including calculation of ionic ratios a, has yet to be conducted.

The aim of this research is to: a) develop a comprehensive methodology that can be used for any PTE to determine whether it is of natural or anthropogenic origin, and b) analyze the occurrence and mobility of PTEs using advanced geochemical modeling tools. For the first time, a) ion ratios are linked to generate innovative hydrogeochemical diagrams to determine the origin of PTEs in groundwater, and b) real mineralogical data are evaluated for their potential contribution PTE using PHREEQC geochemical modeling software. The main objective of this study is to develop a complete approach that can be used for most PTEs and in any geologic setting, providing the scientific community with important insight into the complicated - and to date not fully understood - issue of the occurrence and mobilization of PTEs in the environment. This approach is being applied in the widely studied ultramafic area of the Psachna Basin, where both geogenic and anthropogenic influences are believed to contribute to the elevated concentrations of PTE in groundwater. The results of this research will be important for environmental scientists and policy makers to determine the exact source of each PTE in the environment and improve environmental and societal resilience as it will impact several important aspects such as science, environment, and human health.

Case study

The Psachna Basin is located on the island of Euboea in Greece at coordinates 38°32'00”-38°36'28” N (latitude); 23°32'00”-23°43'00”E (longitude). (Figure 1); The town of Psachna is located about 90 km from Athens, Greece. Forests and farmland make up most of the basin's area, closely followed by olive trees. Cereals, horticultural crops, olive groves, legumes, carrots, potatoes, cabbage, lettuce and broccoli are the main products of the current intensive agricultural activity. According to Köppen's climate classification, the Psachna Basin has a typical semi-arid Mediterranean climate (Csa) characterized by cold, rainy winters and hot, dry summers (Köppen 1931).

Fig. 1
figure 1

A simplified geological map of the study area (after Katsikatsos et al. 1981) and sampling sites.

This area includes a wide range of rocks, from ultramafic (i.e., serpentinites and peridotites) to mafic (i.e., diabase) rocks and sediments of Quaternary and Neogene age to Cretaceous carbonate formations (limestones and dolostones), and Fe-Ni ore deposits (Katsikatsos et al. 1981). The studied alluvial aquifer is located in the lowlands of the basin and developed in Quaternary sediments; it consists of ultramafic rocks and limestones eroded over a total area of 21 km2. Precipitation, infiltration of surface runoff (torrents), and lateral inflows from groundwater systems formed in carbonate and fractured ultramafic rocks in the northern part of the basin contribute to natural groundwater recharge (Dandolos and Zorapas 2010). The groundwater flow direction is from the northeast to the southwest and the alluvial aquifer system has a hydraulic conductivity of about 2 x 10-4 m/s.

Materials and Methods

The methodology of this work includes the following steps: i) groundwater sampling, ii) analytical determination of physical and chemical parameters using different techniques, iii) data set preparation, iv) mineralogical identification, and v) geochemical modeling using PHREEQC software. Further information on each phase is provided in the following subsections.

Sampling

A total of 47 groundwater samples were collected from irrigation wells to assess groundwater quality in the Psachna Basin (Fig. 1); this work includes 33 groundwater samples from Papazotos et al. (2019) and 14 new groundwater samples from a sampling campaign conducted in October 2019 (Vasilakis et al. 2022). Finally, the final dataset includes four seasonal sampling campaigns: the first in December 2016, the second in June 2017, the third in November 2017, and the fourth in October 2019. All samples from irrigation wells were collected by pumping for a sufficient period of time (at least 15 minutes) to flush out all residual water volume. Water samples were collected in 1000-mL polyethylene bottles that had been pre-cleaned, acid washed, and thoroughly rinsed first with distilled water and then with deionized Milli-Q water. The polyethylene bottles were rinsed three times with the water to be sampled before collection. During sampling, samples were stored in a portable cooler with cold packs and refrigerated at 4° C until chemical determination.

Determinations

Physical parameters

Immediately after sampling, the physical parameters of electrical conductivity (EC), dissolved oxygen (DO), pH, and Eh were measured in situ using the YSI Professional Digital Sampling System (ProDSS) (Xylem, USA). The total dissolved solids (TDS) parameter was also calculated by summing the major ions.

Chemical parameters

Each sample of 1000 mL was separated into three different subsamples for a different chemical determination; one of 500 mL to determine major ions (i.e., Ca2+, Mg2+, Na+, K+, Cl-, SO42-, HCO3-, NO3-, NO2-, and NH4+), one of 100 mL to determine PTEs and other trace elements (i.e., As, B, Ba, Br, Cd, Co, Cr, Cu, Mn, Mo, Ni, P, Pb, Rb, Sb, Se, Si, Sr, U, V, and Zn) after filtering through a 0.45 μm filter and acidifying with 1M HNO3, and one of 25 mL to determine Cr(VI) by ion exchange using IC-All tech cartridges (Ball and McCleskey 2003), after filtering through 0.45-μm and acidifying with 1M HNO3. Table 1 provides an overview of the analytical methods for calculating physical and chemical parameters. Within 8 hours of the sampling, the 500-mL subsamples were transported right away to the lab where, before any analysis or additional processing, the subsamples were passed through 0.45μm filters. None of the 500-mL samples exhibited any significant turbidity. All the samples were kept at 3–7°C, but each parameter received a different treatment: a) Ca2+, Mg2+, Na+, and K+ were acidified to a pH of 2 with 1 M HNO3, and their analyses were completed within 10 days; b) NO3 and HCO3 were completed within 24 hours; and c) Cl and SO42– were completed within 10 days. The above procedures were in accordance with the American Water Works Association (AWWA)/American Public Health Association (APHA)/Water Environment Federation Standard Methods for the Examination of Water and Wastewater (WEF) recommendations. The analyses used the international quality control (QC) and quality assurance (QA) protocols (including blanks, QC standards, and QC samples).

Table 1 The analytical techniques of each determined parameter.

Inductively coupled plasma mass spectrometry (ICP-MS) was used to measure PTEs and other trace elements at the analytical laboratories of Bureau Veritas Commodities Canada Ltd. in Vancouver, British Columbia, while a ICP-MS Agilent Technologies 7700 series was used to determine Cr(VI) at the National Technical University of Athens, Greece. Analyses were performed within seven days following APHA standard procedures (APHA 2012). The Bureau Veritas Commodities Canada Ltd. laboratory is accredited to the ISO /IEC 17025:2005 standard, which includes both QC and QA protocols. The field and laboratory composite samples and duplicate sample analyses were part of the QC processes.

Data processing

ESRI's ArcGIS v.10.3 program was used to create a geologic map to visualize the study sampling locations. The software IBM SPSS v.22 was used to perform the statistics. Data processing included: i) data validation, ii) descriptive statistical analysis, iii) Spearman rank correlation coefficients, and iv) HCA in R mode. The descriptive statistical analysis includes the mean, minimum (min), maximum (max), and median values, standard deviation (SD), first quartile (Q1), and third quartile (Q3). For statistical analysis, the values below the detection limit (BDL) of the analysis method were set to the value of the detection limit (DL). The mean, median and SD values of a parameter that occurred in more than 50% of the samples BDL were not calculated (NC).

Due to the small sample size, data outliers were included in the correlation analysis to determine the relationship between the two parameters; however, the use of the nonparametric robust Spearman rank correlation coefficient (r) reduced the weighting of the potential influence of the outliers (Filzmoser et al. 2018). Equation (1) (Spearman 1904) is used to calculate the Spearman rank correlation coefficient:

$${r}_s=1-\frac{6\sum_{i=1}^n{d_i}^2}{n\left({n}^2-1\right)}$$
(1)

Where di = difference in paired ranks, n = number of cases, xi and yi =data pair.

The formula to use when there are tied ranks is (2):

$$\rho =\frac{\sum_i\left({x}_i-\overline{x}\right)\left({y}_i-\overline{y}\right)}{\sqrt{\sum_i{\left({x}_i-\overline{x}\right)}^2{\sum}_i{\left({y}_i-\overline{y}\right)}^2}}$$
(2)

The magnitude of the absolute value (from better to worse association) was used to divide the correlation coefficient values, which ranged from 1 to 1, into five equally sized groups: very strong (0.8-1), strong (0.6-0.79), moderate (0.4-0.59), weak (0.2-0.39), and very weak (0-0.19). (Evans 1996). The correlation coefficient is considered highly statistically significant, marginally statistically significant, and not statistically significant if the p value is p < 0.01, p < 0.05, and p > 0.10, respectively.

Hierarchical cluster analysis (HCA) is an algorithmic multivariate statistical analysis method for grouping individuals with similar characteristics. In hydrogeochemical studies, HCA classification is used to group samples (Q-mode) or parameters (R-mode) into clusters based on their (dis)similarity, resulting in a tree diagram, also known as a dendrogram. The focus of this study was on the relationship between the studied parameters, so HCA was performed in R-mode using Ward's (1963) method as the linkage and squared Euclidean distances as the similarity measure.

Mineralogy

A detailed geochemical, petrological, and mineralogical study of the rocks and soils was performed using optical microscopy, X-ray diffraction analysis (XRD; Bruker D-8 Focus), and scanning electron microscopy (SEM; JEOL6380LV). About 10 g of the sample was ground in a mechanical mill to determine the qualitative mineralogical composition by XRD. XRD analysis was performed at 2θ from 2° to 72° with CuKa radiation (λ=1.5405 Å) and Ni filter at 40 kV and 30 mA. Mineral phases were identified based on characteristic hkl reflectance distances using EVA version 12 software.

Geochemical modeling

The saturation index (SI) with respect to mineral phases was calculated using the PHREEQC geochemical program (Parkhurst and Appelo 1999), with MINTEQv.4 serving as the primary thermodynamic database to evaluate chemical reactions in groundwater (Allison et al. 1991). Hydrogeochemistry is controlled by the interaction of water with rocks and soil, and geochemical software was used to provide the SI of specific mineral phases found in the study area. The SI of the water samples can be defined by the following equation (3):

$$SI=\log \left(\frac{IAP}{K_{sp}}\right)$$
(3)

where IAP is the Ion Activity product and Ksp is the equilibrium constant.

When the SI value is equal to 0, the solution is in equilibrium with the mineral phase; when the SI value is > 0, the solution is oversaturated, resulting in mineral precipitation, and when the SI value is < 0, the solution is undersaturated indicating that dissolution is required to reach equilibrium.

Results

Chemometrics

The analytical accuracy of the ion measurement was determined to be less than ±10% by calculating the charge balance error in each sample. The descriptive statistics for a total of 39 physical and chemical parameters of the 47 groundwater samples from the Psachna Basin are presented in Table 2. The most interesting physical parameters for the field of environmental geochemistry are pH and Eh, which ranged from 7.21 to 8.86 and from 136 mV to 331 mV, respectively. On the other hand, remarkable NO3 (up to 540 mg/L) and PTE groundwater concentrations were determined (up to 162 μg/L Cr; up to 5.12 μg/L U; up to 21.1 μg/L Ni; up to 610 μg/L Zn; up to 3.3 μg/L As; up to 6.9 μg/L V; up to 5.2 μg/L Se). Furthermore, an - additional - parameter was calculated to further investigate the total PTEs (TPTEs) load in groundwater by summing selected PTEs; therefore, this parameter quantifies the total amount of potentially toxic elements in groundwater and is referred to as TPTEs in the text. The selected PTEs were As, Cd, Co, Cr, Cu, Mn, Ni, Pb, Se, U, and Zn and were chosen based on their importance in terms of their reference abundance in relevant studies (Papazotos 2021) and their toxicity to living organisms. TPTE values range from 23.6 μg/L to 690 μg/L, with a mean value of 157 μg/L (Table 2).

Table 2 Descriptive statistics (mean, min, max, median, standard deviation, 1st Quartile, and 3rd Quartile) of the physical and chemical parameters for the 47 analyzed groundwater samples from the Psachna Basin.

Spearman rank correlation coefficients

The Spearman rank correlation matrix of 47 groundwater samples of the Psachna Basin is presented in Table 3. The autocorrelation is not highlighted in red in Table 3, because it is always equal to 1.

Table 3 Spearman rank correlation matrix and correlation significance (p values) of selected physical and chemical parameters of the groundwater samples of the Psachna Basin (N = 47). Color represents the classification of the absolute correlation value between two parameters i.e., Red:0.80-1.00, Orange: 0.60-0.79, Yellow: 0.40-0.59, Blue: 0.20-0.39, and Green: 0.00-0.19.

R-mode HCA

R-mode HCA was conducted to classify group-related hydrochemical parameters, reveal hidden relationships among chemical parameters, and further evaluate processes associated with elevated groundwater PTEs and other trace element concentrations in the Psachna Basin. In this approach, major ions that provide clear evidence of the origin of PTEs (i.e., Cl-, Si, NO3-, HCO3-) were included; those with multifactorial origins (i.e., Ca2+, Mg2+, SO42-, etc.; Papazotos et al. 2019) were excluded. Multivariate statistical analyses such as factor analysis (Voutsis et al. 2015; Papazotos et al. 2019; Megremi et al. 2019) and Q-mode HCA (Voutsis et al. 2015; Papazotos et al. 2019) were performed in the study area. However, R-mode HCA focusing on a large dataset of PTEs has yet to be performed. Parameters with a linkage distance of less than ten (red dashed line; Fig. 2) and about 15 (green dashed line; Fig. 2) were grouped in the same cluster to form different parameter groups with similar hydrogeochemical patterns. As shown in the dendrogram (Fig. 2), the parameters were divided into five main clusters at the lower coupling distance and two main clusters at the higher coupling distance. The relationship between the two approaches is that cluster 2 of the green dashed line contains clusters 2-5 of the red dashed classification, indicating their hydrogeochemical proximity.

Fig. 2
figure 2

Dendrogram in R-mode HCA for 20 parameters determined in groundwater samples from a total of 47 water samples from the Psachna Basin. The red and green dashed line represents the linkage distance used to create different distinct clusters.

Mineralogy

The predominant lithologic types in the study area are ultramafic rocks, mainly serpentinites and partially serpentinized peridotites, in which the original mineralogical and histological features are still preserved (Fig. 3). Other ultramafic rocks such as harzburgites, dunites, and lherzolites also occur in the wider area. The major mineralogical phase of the dunites is forsterite, while enstatite occurs only in minor amounts (< 5%). Harzburgites consist of forsterite and enstatite (MgO = 34.6-35.75% w/w and FeO = 4.8-5.9% w/w) in proportions of 80-90% and 10-20%, respectively. Diopside (up to 1.5 % v/v) grows mainly in the cleavage of enstatite, forms lamellae and occurs less frequently as small single crystals. Chromium-rich spinel (Cr2O3 =37 to 55% w/w) usually forms euhedral crystals. The total Fe content of the chromites of Euboea varies between 14 and 20% w/w. Mineralogical phases such as quartz, calcite, illite, kaolinite, montmorillonite, nontronite, chlorite, serpentine, goethite, hematite, and Mn oxides were detected in the soils of the wider study area (Megremi et al. 2019; Economou-Eliopoulos et al. 2014). The Cr2O3 was detected in mineralogical phases of ultramafic rocks such as spinel (30-55 wt%), pyroxene (enstatite from 0.2 to 0.5 wt% and diopside from 0.3 to 0.97 wt%), amphibole (0.2-0.7 wt%), serpentine (0.2-0.6 wt% with decrease in Cr2O3 content from the core to the edge of the crystal), rarely talc (0.3-0.4 wt.%) and chlorite (1.1-2.4 wt.%), indicating its geogenic occurrence in the rocks of the area (Megremi et al. 2019; Vasileiou et al. 2014). In the laterites of the area, the main Cr2O3 mineral is chromite, which has an inhomogeneous composition with a variation of Cr/(Cr + Al) from 0.4 to 0.79. Another important Cr-bearing mineral is the pisolitic goethite of the ore with an average Cr2O3 content of 0.89% (w/w), while a small percentage of Cr2O3 (0.7% w/w) is present in Ni-chlorite (Megremi et al. 2019). Significant Cr-bearing mineral phases in soils are spinels (chromite, magnetite) (29-61% w/w), Ni-chlorite (up to 0.83% w/w), and goethite (up to 1.34% w/w) (Megremi et al. 2019).

Fig. 3
figure 3

Photomicrographs of serpentinised harzburgites. Residual forsterite and enstatite crystals are distinguished in a matrix of serpentine, Fe-oxides, and chromite granules. In addition, diopside was developed in the cleavage of enstatite (Srp: serpentine, Ol: olivine/forsterite, Py: pyroxene, Chr: chromite, Fe-oxides: iron oxides).

Geochemical modeling

Table 4 summarizes the mineralogical phases identified in the Psachna Basin and provides information for the calculated SIs of the 47 groundwater samples. The major Cr-bearing mineral phases found in the unsaturated state were minerals of the serpentine subgroup (such as lizardite and chrysotile), pyroxene (enstatite and diopside), chlorite, illite, and forsterite, while oversaturated mineral phases were chromite, magnetite, talc, Fe-(hydro)oxides, and so on.

Table 4 Descriptive statistics (mean, min, max, median, standard deviation, 1st Quartile, and 3rd Quartile) of the SIs of the examined mineral phases for the 47 analyzed groundwater samples from the Psachna Basin.

Discussion

PTEs load in the aquifer

According to the plot of TPTEs versus (vs.) pH values in the modified Ficklin diagram, the total load of selected PTEs in groundwater of the Psachna Basin is generally low (Fig. 4; Ficklin 1992). The prevailing near-neutral to slightly alkaline pH (from 7.21 to 8.86) is the key factor preventing mobilization of cationic PTEs in groundwater (i.e., Pb, Cd, Cu, Ni, Co) (Hermann and Neumann-Mahlkau 1985), which account for the majority of PTEs studied here.

Fig. 4
figure 4

Ficklin diagram (Ficklin 1992) of the 47 groundwater samples of the Psachna Basin.

As for the hydrogeochemistry of the study area, Cr is the most important PTE, since it has the highest concentration of all PTEs (Fig. 5). In general, the geochemistry of Cr is very interesting because it is able to change its valence state in the environment both geogenically (i.e., in the presence of Mn oxides, Fendorf et al. 1992) and anthropogenically (i.e., as a NO2/NO3 oxidant, Mills and Goldhaber 2012; Papazotos et al. 2019). Trivalent Cr mainly forms trivalent cations or hydrolysates, which usually form metal hydroxide precipitates with low solubility and low mobility, while Cr(VI) is usually in the form of chromate anions (i.e., CrO42-) with high mobility (Richard and Bourg 1991). Cr(VI) is the predominant form of Cr in groundwater of the Psachna Basin; (Papazotos et al. 2019). Finally, the pH-Eh diagrams for Mn (Pourbaix 1966) showed that Mn(II) dominates in the groundwater samples of the Psachna Basin (Papazotos et al. 2019b), confirming that the natural oxidation reaction (Rai et al. 1987; Richard and Bourg 1991) proceeds according to equation (4):

$$\textrm{Cr}\left(\textrm{II}\textrm{I}\right)+1.5\ \updelta \textrm{Mn}{\textrm{O}}_2\left(\textrm{s}\right)+{\textrm{H}}_2\textrm{O}\to \textrm{HCr}{\textrm{O}}_{4-}+1.5\textrm{Mn}\left(\textrm{II}\right)+\textrm{H}$$
(4)
Fig. 5
figure 5

The variation of the percentage of each PTE of the Psachna Basin.

The dominance of Cr(VI)species and their elevated groundwater concentrations make Cr a PTE of concern, especially in Cr-rich environments such as ultramafic-dominated areas.

It is worth noting that Zn concentrations are not negligible, but have a wide range of variation. In general, Zn is considered one of the most mobile PTEs in natural waters, because it forms soluble compounds at neutral and acidic pH, while - at higher pH - Zn can form carbonate and hydroxide complexes that control Zn solubility (Małecki et al. 2017). The remaining PTEs have a small contribution to the total PTE concentration, but some of them (Cd, Cu, Mn, Ni, and Pb) have upper outliers, suggesting that the prevalence of certain geochemical conditions is responsible for elevated groundwater concentrations in some samples. Τhis work, compared to the study of papazotos et al. (2019), utilizes the whole dataset of PTEs and emphasizes creating a methodological framework for delineating their origin in groundwater.

The importance of analyzing dissolved Si in groundwater

It is assumed that the dissolved Si in groundwater originates solely from the interaction between water and rock (i.e., anthropogenic activities do not contribute Si into the environment). Groundwater samples from the Psachna Basin present a wide range of Si, from 4069 μg/L to 41954 μg/L, with a median value of 15616 μg/L. Circulating groundwater dissolves Si derived from chemical weathering of silicate minerals in rocks and sediments (Hem 1985), and its content in groundwater is directly proportional to its residence time, because prolonged contact with silicate rocks increases Si concentration (Khan et al. 2015). Therefore, high concentrations of dissolved Si may indicate extensive dissolution of silicate minerals in groundwater (Lee and Gilkes 2005). Most modern hydrogeochemical studies neglect to determine the concentration of dissolved Si in water samples for analysis. However, as shown in this work, the determination of dissolved Si could provide important information. Furthermore, Si is an element derived from silicate minerals, which are important hosts for PTEs. Calculation of an ionic ratio containing Si could provide interesting information about the geochemistry of PTEs. In addition, many studies misuse geochemical modeling without considering the silicate mineral phases, because there is no Si analysis in aqueous solutions. Such an approach neglects the largest and most important part of the geological and geochemical environment related to water chemistry, leading to misleading results.

Delignating the origin of PTEs in groundwater based on a four-step novel approach

Various approaches have been used to evaluate the origin of chemical substances in the environment. Multivariate statistical analyses (e.g., Papazotos et al. 2020; Vasileiou et al. 2019), multi-isotopic approaches (e.g., Puig et al. 2017 ), and machine learning algorithms (Ruidas et al. 2021; Ruidas et al. 2022) have been widely used in modern hydrogeochemical studies; the main objective has been to further investigate the complex isotopic signatures and their fractionation or to reveal the hidden geochemical relationships between chemical variables to assign natural and anthropogenic influences. Here, for the first time a four-step method is presented (Fig. 6) that combines statistics and molar ratios of specific chemical components (i.e., NO3-, Si, Cl-, and HCO3-) to distinguish geogenic and anthropogenic contributions of PTEs in groundwater and provide answers to the hot research topic of their occurrence, transport, and fate in the environment (Papazotos 2021). The approach is applied in the ultramafic-dominated area of the Psachna Basin, a coastal area containing rocks enriched in PTEs and characterised by intense anthropogenic pressures.

Fig. 6
figure 6

A four-step methodology to identify the origin of a selected chemical parameter in groundwater.

Step 1: Data collection and target a parameter

This phase includes the data collection that will be used in the next steps of the methodology. The data could come from a new sampling campaign that includes the determination of chemical parameters or a detailed literature search. For the proposed approach, only the specific chemical parameters NO3-, Cl-, HCO3- and Si (or SiO2) need to be determined. Here, six PTEs (i.e., As, Cr, Ni, Se, U, and Zn) were selected as target parameters to determine their origin in the Psachna Basin.

Step 2: Classification

In this step, information is categorized according to the target parameter's concentration. There are various methods of categorization that can be used. The use of guideline values for safe drinking water by the World Health Organization (WHO) or the limited values mandated by individual countries' laws can lead to inequitable groups that do not adequately represent the full range of PTEs concentrations in water resources. Since most PTEs are not mobilized in the typically encountered pH range of groundwater, it is critical to investigate concentrations below the guideline values for drinking water defined by WHO. Consequently, it is highly recommended to classify the analyzed parameters using descriptive statistics. In particular, the quartiles allow to divide the entire range of values into four approximately equal groups separated by concentration. In this work, the data shown in Table 2 were used to divide the PTEs into different categories.

Step 3: Creating hydrogeochemical diagrams

This phase involves the calculation and plotting of challenging molar ratios to investigate the hydrogeochemical footprint of specific geogenic or anthropogenic influences. Hydrochemical salinization was studied using the molar ratio of Cl-/HCO3-; values from 1.3 to 2.8 indicate a moderate salinization effect; values < 0.5 indicate no salinization effect; values between 0.5 and 1.3 indicate a slight effect, between 2.8 and 6.6 indicate a strong salinization effect, and values > 6.6 can be interpreted as a strong salinization effect (Revelle 1941). The dominance of the anion HCO3- over Cl- is indicative of recharge water and dissolution of carbonate. Figure 7 shows that most samples from the Psachna Basin are plotted in the "carbonate dissolution" field, indicating that groundwater recharge is a dominant hydrogeologic process. However, some samples have a Cl/HCO3 molar ratio > 1, indicating the influence of saltwater intrusion occurring in part of the Psachna Basin, as indicated by many researchers (Papazotos et al. 2019; Remoundaki et al. 2016). In general, seawater has very high Cl concentrations; a typical range is 17000-23000 mg/L and generally low HCO3 concentrations (usually between 130 mg/L and 180 mg/L) with extremely high Cl/HCO3 ratio values (> > 200).

Fig. 7
figure 7

The Cl/HCO3 (molar ratio) vs. Cl (in mmol/L) and the Cl/HCO3 (molar ratio) vs. HCO3 (in mmol/L) diagrams contain carbonate dissolution and salinity fields of the Psachna Basin.

For the first time, an alternative molar ratio was used to determine whether water-rock interaction or fertilization was a major contributor to groundwater contamination. The Si/NO3 molar ratio was evaluated, and it is crucial to investigate the range of values of ordinary aqueous solutions (such as rainwater and seawater), which are used as reference values and provide valuable hidden features of this work. In general, the concentrations of Si and NO3- in seawater and rainwater are very low. However, typical Si concentrations in seawater vary from 1 mg/L to 5 mg/L, and NO3- ranges from 0.02 mg/L to 2 mg/L, suggesting a very high Si/NO3 molar ratio values that in most cases exceed the value of 100 (i.e., when anthropogenic activities that create high NO3- concentrations are absent) and indicating the natural origin of these aqueous solutions. The < 1 molar ratio values indicate that there is evidence of significant NO3 loading from various N-rich sources (such as N-containing or mixed fertilizers, sewage effluents, and septic tanks); the > 2.2 values indicate a significant contribution of siliceous mineral phases to the water-rock interaction. The value of 2.2 was obtained from a common sample unaffected by anthropogenic activities (NO3-=5 mg/L; Kim et al. 2015) and a relatively low dissolved Si concentration (Si=5 mg/L; the median value for dissolved Si in this dataset from the Psachna Basin is 15.6 mg/L). In any case, the molar ratio should be evaluated concerning the geochemical conditions, the examination of the principles of Si- and N- geochemistry, and the assessment of the existing dataset. Figure 8 shows that most of the Psachna Basin samples are plotted in the “nitrate pollution” field, presenting a Si/NO3 molar ratio < 1. Nevertheless, some samples have a Si/NO3 molar ratio > 1, shown in the "Silicate dissolution" box, illustrating the influence of geology on groundwater chemistry. In any case, the coexistence of geogenic (i.e., water-rock/soil interaction) and anthropogenic (i.e., agricultural activities, sewage effluents, etc.) factors cannot be ruled out, but this should be investigated by evaluating the entire data.

Fig. 8
figure 8

The Si/NO3 (molar ratio) vs. NO3 (in mmol/L) and the Si/NO3 (molar ratio) vs. Si (in mmol/L) diagrams contain silicate dissolution and nitrate pollution fields.

It should be noted that element’s concentration is always an important indicator for environmental research and should be taken into account, since negligible data could lead to misleading results and the one-sided study of molar ratios does not give the scientist the desired result. For example, areas relatively unaffected by human activities usually have NO3- concentrations of less than 10 mg/L (Panno et al. 2006); therefore, concentrations below 0.16 mmol/L NO3- should not be attributed to anthropogenic activities, regardless of the Si/NO3 molar ratio. On the other hand, a high to very high Si/NO3 molar ratio followed by low Si concentrations should not be attributed to dissolution of silicates, but to other natural factors (i.e., dissolution of another predominant rock type such as carbonates, seawater, rainwater, etc.).

Finally, the combination of the two molar ratios (Cl/HCO3 vs. Si/NO3) provides interesting information about the groundwater chemistry of a complex aquifer system considering saltwater intrusion, agricultural activities, and water-rock interaction (including both carbonate and silicate mineral phases) (Fig. 9). Step 3 of the present approach also contains the exhaustive co-evaluation of statistical and geochemical data and a thorough literature review, mainly focusing on previous studies from the study area and recording both geogenic and anthropogenic contamination sources in the environment.

Fig. 9
figure 9

Evaluation of major hydrogeochemical processes affecting water chemistry using Cl/HCO3 vs. Si/NO3 diagram.

Step 4: Determine the origin of PTE

After completing steps 1 to 3, the researcher is able to generate complex diagrams showing where the PTEs in the groundwater come from; in this case, the methodology is applied to As, Cr, Ni, Se, U, and Zn, and the results are shown in Fig. 10. The high salinity caused by saltwater intrusion in some areas of the Psachna Basin is responsible for the elevated As concentration in groundwater, as shown by the correlation between As concentration and Cl/HCO3 ratio (Fig. 10a). The R-mode HCA and Spearman's rank correlation coefficients, which highlight the statistical correlation with parameters that have elevated concentrations in seawater, such as Cl-, Br-, and Li, further support this hypothesis. Megremi et al. (2019) mentioned that As and Li are chemical parameters influenced by saltwater intrusion in the Psachna Basin, and modern literature confirms the role of saltwater intrusion in As mobilization (Papazotos et al. 2020). The oxidizing geochemical environment, which favors the stable presence of NO3- in groundwater and inhibits As desorption from Fe/Mn oxyhydroxides from rocks and soils into the aquifer, is responsible for the low groundwater concentrations of As (up to 3.3 g/L) (Guo et al. 2014). Normally, oxidative environments prevent reductive dissolution of Fe/Mn oxyhydroxides, resulting in high groundwater concentrations of NO3- (Xie et al. 2008). Compared to As, Cr shows a distinct pattern in the Cl/HCO3 vs. Si/NO3 plot (Fig. 10b). The class with the highest concentrations (> Q3), indicating agricultural activities as the main contributor to Cr, is entirely dominated by low values of the Si/NO3 ratio. Although Cr concentrations of up to 130 μg/L have been recorded and are due to the geogenic background of an ultramafic area (Vasileiou et al. 2021), the synergistic role of agricultural activities has been shown to increase groundwater concentrations of Cr in an ultramafic environment (Papazotos et al. 2019; Vasileiou et al. 2019). This was particularly evident in the agricultural-dominant Psachna Basin, where nitrogen (N) and phosphorus (P)-containing fertilizers were widely applied (Papazotos et al. 2019). The strong positive, statistically significant correlation coefficients between Cr and NO3- (r=0.66) consistently observed over time in the study area (Vasileiou et al. 2014; Megremi et al. 2019, Papazotos et al. 2019, Remoundaki et al. 2016), and the R-mode HCA provide additional evidence of the role of fertilization (Fig. 2). Finally, Table 5 shows four different wells sampled at different times. As shown in Table 5, there is a simultaneous decrease or increase in the chemical parameters of NO3- and Cr. Both the chemical parameters of NO3- and Cr decrease or increase simultaneously, as shown in Table 5. Therefore, one chemical parameter (i.e., NO3-) must be the cause and the other chemical parameter must be the causal effect, implying a spatiotemporal dependence (i.e., Cr).

Fig. 10
figure 10

Bivariate Cl/HCO3 vs. Si/NO3 diagrams of a) As, b) Cr, c) Ni, d) Se, e) U, and f) Zn.

Table 5 NO3- and Cr groundwater concentrations and their percentage variability from selected wells sampled at different periods.

The proximity of Cr to other chemical parameters such as U, Si, HCO3-, and Ni was also revealed by a rescaled approach for R-mode HCA. Especially, Ni shows a scattering of elevated concentrations in the Cl/HCO3 vs. Si/NO3 plot (Fig. 10c). However, the samples with the lowest Cl/HCO3 ratio appear to have the highest Ni concentrations, suggesting that water-rock interaction is the main factor for elevated Ni concentrations. It is common for Ni to be incorporated into the lattice structure of the silicate mineral phases that make up ultramafic rocks (e.g., Ni-bearing forsterite and chlorite). However, Ni is strongly absorbed under the prevailing alkaline pH conditions, which affects its mobilization in groundwater. The trend of increasing concentrations of Se (Fig. 10d) and U (Fig. 10e) shows similar patterns to As and Cr, respectively. Selenium is an oxyanion-forming trace element with similar hydrogeochemical properties to As, which is fully confirmed by other studies conducted in Greece (Papazotos et al. 2020). In addition, the recent literature mentions the groundwater concentration of Se in seawater (Papazotos et al. 2020) and especially in the case of the Psachna Basin (Eliopoulos et al. 2020); a hypothesis that also emerges from the analysis of the present study through the Cl/HCO3 vs. Si/NO3 diagram, the correlation analysis of the hydrochemical parameters and the R-mode HCA. The highest U concentrations are associated with the lowest Si/NO3 ratio (Fig. 10e), indicating the role of fertilization on U concentrations in groundwater. In general, P-containing fertilizers cause a significant increase in U in groundwater at low concentrations (Liesch et al. 2015). As mentioned earlier, the Psachna basin is an area of intensive agricultural activities where extensive application of P-containing fertilizers has been detected (Papazotos et al. 2019). In addition, strong to very strong statistically significant correlation coefficients of U-SO42- (r=0.796), U-NO3- (0.625), and U-Ca2+ (r=0.863) demonstrate that the main source of U in groundwater is the fertilizers used in this area. The high concentrations of SO42- and NO3- were attributed to the fertilizers in the Basin from previous researchers (e.g., Papazotos et al. 2019; Remoundaki et al. 2016). Fertilizers rich in phosphorus (P) are extracted from phosphate rocks that typically contain apatite-group minerals [Ca5(PO4)3 (F, OH) or Cl] and have a high Ca content; phosphate rocks are enriched in PTEs (Kratz et al. 2016; Molina et al. 2009; Papazotos et al. 2019). Psachna Basin local fertilizer market sales of P-containing fertilizers were found to have a very high PTE content, as demonstrated by Papazotos et al. (2019). Very high U content (30-200 mg/kg) can be observed in rock phosphate, in which the U substitutes for Ca2+ in the apatite crystal structure (Liesch et al. 2015). Strong correlations between Ca2+ and PTEs have been observed in other areas of Greece (e.g., Schinos area, Papazotos et al. 2020) and attributed to P-bearing fertilizers. Moreover, Zn does not show an obvious trend of the influence of the primary factors shown in the Cl/HCO3 vs. Si/NO3 plot (Fig. 10f), and therefore the origin of Zn should be sought in other causes such as the prevailing geochemical conditions (i.e., pH, Eh, D.O., etc.) and other anthropogenic factors. Under oxidizing conditions and especially at acidic to neutral pH, Zn is one of the most mobile water migrants (Alloway 2013). On the other hand, Zn precipitates under reducing conditions. In addition, it can co-precipitate with Fe/Mn oxyhydroxides in systems with high pollution if it is present in very high groundwater concentrations (Smith 1995). As a result, Fe/Mn oxyhydroxides, clay minerals, and organic matter in waters readily bind to Zn and form organometallic complexes that increase the ionic strength of the solution (Diatta and Kociałkowski, 1998). Sorption of Zn increases with increasing pH and decreasing salinity (Huang et al. 2014). Zinc cations can be easily exchanged when competing with other PTE cations (Fonseca et al. 2011). The increase in Zn sorption is followed by a decrease in salinity, as confirmed in Fig. 10f, suggesting that this is a key parameter for Zn mobility in the groundwater. The schematic distribution of TPTEs (Fig. 11) is similar to that of Cr and Zn (Figs. 10a, 10f), as expected from their high contribution to the final calculation of TPTEs.

Fig. 11
figure 11

Bivariate Cl/HCO3 vs. Si/NO3 diagram of TPTEs.

Nevertheless, it is worth noting that geochemical conditions are the most important parameters for understanding the occurrence, mobilization, transport, and fate of PTEs in groundwater, and the proposed successful approach was applied in an ultramafic geologic setting that produces a specific geochemical footprint in groundwater (i.e., neutral to alkaline pH conditions, elevated Cr concentrations, etc.). In a different geologic setting (e.g., dominance of carbonate rocks, etc.) or under different geochemical conditions (e.g., acidic pH or reducing Eh conditions), it is possible that the technique would need to be modified or could not be applied with the same success. Future work focusing on the application of the methodology in different areas - on a global scale - under alternative geochemical approaches is encouraged to evaluate the above framework to investigate its in-depth effectiveness, or could provide useful modifications.

Geochemical modeling

Geochemical modeling was performed to further evaluate the thermodynamic constraints of the water-rock interaction. Figure 12 shows the mineralogical phases that contribute to the chemical elements in the aquifer via chemical dissolution. It is important to note that the theoretically oversaturated minerals do not contribute elements to the water due to the prevailing geochemical conditions.

Fig. 12
figure 12

Saturation indices of examined mineral phases from the Psachna Basin.

The conceptual model of the potential dissolution/precipitation mechanism is shown in Fig. 12. However, the identified mineralogical phases do not include some PTEs contained in their lattice structure. Therefore, it is very interesting to examine the Spearman correlation coefficients with selected PTEs (Table 6) to draw meaningful conclusions about the contribution of chemical thermodynamics to environmental geochemistry. Most of the results show negative correlations, proving that PTEs in groundwater are caused by mineral dissolution. Because these are theoretical calculations based largely on the chemistry of major ions and selected PTEs, strong to very strong correlation coefficients are not expected to be found. If this hypothesis were true, it would be reflected primarily in the table of correlation coefficients for all parameters (see Table 3). The saturation index for chromite was calculated using Cr, so there is a positive correlation between the two variables (r=0.522; p=0.01). Nonetheless, statistically significant correlations are evident and need to be further investigated in future work. Targeted future research on the thermodynamic characteristics of the PTE-containing mineralogical phases and a thorough geochemical modeling study (including forward and inverse modeling techniques) on their dissolution reactions and environmental footprint could provide very interesting insights to the geoscience community. This would undoubtedly be a significant step forward in research related to PTE and would help researchers better understand the many unanswered questions.

Table 6 Spearman rank correlation matrix and correlation significance (p values) of selected PTEs and the SIs of examined mineral phases of the groundwater samples of the Psachna Basin (N = 47). Color represents the classification of the absolute correlation value between two parameters i.e., Yellow: 0.40-0.59, Blue: 0.20-0.39, and Green: 0.00-0.19.

Conclusions

For more than ten years, the quality of groundwater in the Psachna Basin in central Euboea, Greece, has been studied in detail. A methodology to distinguish geogenic and anthropogenic influences controlling the geochemistry of PTE using ionic ratios (i.e. Si/NO3 and Cl/HCO3). For the first time, sophisticated ion ratios have been combined to produce hydrogeochemical diagrams showing the origin of each PTE in groundwater resources. The novel methodology was successfully applied in the Psachna Basin, and the analysis revealed that significant groundwater contamination originated mainly from water-rock interaction, over-fertilization, and saltwater intrusion. The main advantages of the proposed methodology are that it is easy to apply, as in the case of the successful application in the ultramafic study area mentioned above, and that it could be implemented in any geological setting with minor modifications due to its flexible methodological framework (e.g., different data classification methods), providing insight to the (geo)scientific community. Primarily, high groundwater concentrations of Cr that systematically exceed the guideline values of WHO (i.e., 50 μg/L) are attributed to intensive agricultural activities, a hypothesis consistent with many other researchers. Lower, but by no means negligible, concentrations of U are also attributed to P-containing fertilizers. On the other hand, the methodology described in this study revealed that the occurrence and mobility of As, Se, and Zn in groundwater are related to saltwater intrusion. Finally, Ni seems to be mainly related to geogenic factors, although its occurrence is significantly controlled by the prevailing geochemistry, a key factor whose influence cannot be ignored in the analysis of PTE. In addition, the use of real mineralogical data in geochemical modeling studies underscores the need for detailed interdisciplinary studies of hydrogeochemistry, which contribute significantly to a better understanding of the potential contribution of water-rock interactions to the groundwater system. This analysis shows that many silicate mineral phases are undersaturated in an aqueous solution (i.e., groundwater), suggesting a potential contribution from PTEs (and other major/minor elements). This initial approach to the important issue of PTEs in water resources through Si/NO3 vs. Cl/HCO3 diagrams can be a useful methodological framework to describe the origin of PTEs in groundwater, especially if this approach is combined with the use of advanced statistical methods (i.e., PCA, HCA, etc.), geospatial data analysis, multi-isotope approaches, and machine learning algorithms allows for a holistic assessment of a complex and challenging topic such as the origin, occurrence, mobilization, transport, and fate of PTEs in the environment and, in particular, in a sensitive receptor such as water resources.

There are two major limitations to the above proposed methodology:

(a) the determination of certain chemical parameters, such as NO3-, Cl-, HCO3-, and Si, is required because these data are mandatory for the proposed method; and (b) the limited thermodynamic data, since the thermodynamic geochemical databases (e.g., PHREEQC, MINTEQ, etc.) do not include the formulas of the PTE -bearing mineralogical phases, neglecting that many PTEs are included in their lattice structure; this fact must be taken into account, otherwise researchers will not be able to fully understand the geogenic system and will therefore obtain misleading results regarding complex water-rock-soil interaction processes.

The implications of this research are expected to be significant to the scientific community still seeking answers to the complex question of the origin of PTEs in groundwater, as it allows researchers to apply a specific methodology to further investigate their data set in the context of advanced environmental geochemistry. Specifically, this work aims to develop a holistic methodology that can be (a) applied globally and in any geological setting, and (b) applied by researchers, making a critical contribution to the scientific community's understanding of the occurrence, mobilization, and geochemistry of PTEs and achieving breakthroughs in environmental science and pollution research. The results of this study will be applied by environmental scientists to create a powerful tool for understanding the origin of PTEs and contribute to a new regulatory framework that enhances both environmental awareness and human health.