Introduction

Since the beginning of the Industrial Revolution, and above all after the so-called Great Acceleration, the distribution and quality of freshwater have been changing due to human decisions that have even reshaped the global hydrological cycle (Vörösmarty et al. 2013). Among the consequences of this mismanagement of the global commons, we have that groundwater systems, a crucial resource for both socioeconomic development and healthy ecosystems, are experiencing a world crisis (Foster et al. 2013). This crisis is in the first place evidenced by the depletion of aquifers, which is in itself a major issue if it is considered that almost half of all drinking water worldwide comes from underground sources (Smith et al. 2016). However, for some authors, the largest threat to these resources is pollution, as global groundwater quality is declining due to the presence of chemicals of geogenic or anthropogenic origin (Custodio 2013; Dermatas 2017). In the first case, the geochemical composition of rock and soil explains the occurrence of metalloids (i.e., arsenic in Bangladesh and India), heavy metals (as hexavelant chrome in California, Italy, and Greece), or fluorides (in India, Ethiopia, and Sri Lanka) in groundwater (Singhal and Gupta 2010; Dermatas 2017). The people affected by geogenic groundwater background, only coming from arsenic and fluorides, have been estimated at 200 million (Amini 2011). In the case of anthropogenic pollution, it concerns the human activities that contaminate groundwater reserves with heavy metals, persistent organic compounds, petroleum hydrocarbons, pesticides, nutrients, or microorganisms. The main anthropogenic sources are industrial activities, agricultural practices, mining, and waste disposal (Singhal and Gupta 2010; Dermatas 2017). Other related aspects of the global groundwater crisis include land subsidence, saline intrusion, loss of springs, and wetland deterioration (Famiglietti 2014).

In spite of its importance and the ongoing crisis, groundwater is a barely visible resource that is monitored and managed poorly, in particular in developing countries. Mexico relies heavily on groundwater, as it accounts for almost 60% of the water used for municipal supply (CNA 2016). Further, in nearly 50% of the country, where arid or semiarid climates prevail, groundwater is the only permanent source of water. More than a hundred of aquifers are overexploited, including those supplying the population of Mexico City and its conurbation (i.e., more than 21 million of inhabitants in 2016) (CNA 2016). The quality decline is another factor that jeopardizes the sustainability of this critical resource. The geogenic background of several Mexican aquifers (mainly by arsenic and fluoride) is a well-known problem that is favored by the complex geological context of the country, as well as by its active tectonic setting (Armienta and Segovia 2008). The saline intrusion and the unintentional infiltration of wastewater are among the most reported anthropogenic impacts on aquifers (Del Campo et al. 2014; CNA 2016). However, in general, the information about the quality of groundwater in Mexico is scarce and undermines its management and governance.

Consequently, it is necessary to carry out a wide range of studies to assess the quality of the groundwater resources of Mexico and to communicate the results in an easy way to policy-makers, managers, and water users. To this end, water quality indices constitute useful tools, which express the adequacy of water for an intended use in a single value that integrates different relevant variables. Several quality indices have been proposed for groundwater, most of which are based on the variables conventionally measured in aquifers to elucidate the geochemical type of groundwater (i.e., through Piper diagrams) (Vasanthavigar et al. 2010; Sethy et al. 2017). The purpose of this work was to propose an index considering the impact of both anthropogenic pollution and geogenic background and to use it to identify quality water issues of a specific well in the Santa Cruz locality, in the Apan aquifer (Hidalgo, Central Mexico). This well was selected as the study site because it is recognized by the local population as a problem due to the geogenic presence of manganese, and it has not been included in the scarce assessment studies done in the aquifer (Huizar-Álvarez et al. 1997; Huizar et al. 1998). The proposed index was compared against another index based on the evaluation of variables related to mineralization processes that are measured commonly in geochemical studies of groundwater.

Materials and methods

Study area

The Apan aquifer is located in the southeastern part of Hidalgo State, between north latitudes 19°35′ and 19°50′ and west longitudes 98°11′ and 98°40′, with a total drainage area of 733.2 km2. The evidence suggests that it is a heterogeneous, anisotropic, and semi-confined aquifer from porous to fractured (CNA 2015). The aquifer is mostly intergranular and its bedrock is volcanic (i.e., mainly constituted by basalts and andesites) (Huizar-Álvarez et al. 1997). The mean annual temperature varies between 10 and 16 °C, and the average annual rainfall is about 610 mm (CNA 2015).

Figure 1 shows the study site (the well situated in the locality of Santa Cruz) and the four wells known as Espejel, Acopinalco, Voladores, and Loma Bonita. These four wells supply water mostly for domestic use, and they were considered as reference points. The Santa Cruz well is locally recognized as a problem due to the presence of manganese, and therefore, the water supplied is almost not used by the inhabitants of the locality. The agriculture practiced around the study site is only of seasonal type; the agricultural use is not allowed for this well (CNA 2015).

Fig. 1
figure 1

Left: Location map showing the study site (Santa Cruz) and the reference sites (Espejel, Acopinalco, Voladores, and Loma Bonita) at the micro-watershed scale. Upright: Sampling points and some near municipalities. Downright: Location of the sampling points at the scale of the Hidalgo State

Sampling and analytical procedures

Five-monthly samplings were carried out between October 2015 and February 2016 in the Santa Cruz well. The reference sites were sampled once during the same period. Samples were collected from the five wells into HNO3-washed plastic containers. The samples intended for metal analyses were acidified with 10% HNO3 to prevent the formation of oxides. For the microbiological analysis, the samples of the Santa Cruz well were collected in sterile Whirl-Pak® bags. All the samples were maintained in ice-packed boxes, while they were transported to the laboratory and kept at 4 °C until analysis. The microbiological analyses were carried out the same day of sampling.

pH, electrical conductivity (EC), and total dissolved solids (TDS) were measured in situ in all samples by using a multiparametric equipment (Hanna Instruments model HI 9828). Reference samples (from Espejel, Acopinalco, Voladores, and Loma Bonita wells) were analyzed ex situ only concerning the contents of the mineralization-related variables, i.e., HCO3, Cl, Na+, K+, Ca2+, and Mg2+. In the Santa Cruz samples, we determined ex situ, besides the variables mentioned above, the chemical oxygen demand (COD), and the contents of N–NH4+, N–NO2, N–NO3, PO43−, Al, As, Cd, Cr, Cu, Fe, Mn, Ni, Pb, and Zn. The densities of several microbial indicators—heterotrophic plate count (HPC), Escherichia coli, Salmonella sp., Shigella sp., Enterobacter sp., Pseudomonas aeruginosa, Enterococcus faecalis, Staphylococcus sp., Clostridium perfringens, and molds and yeasts (M&Y)—were also determined in the Santa Cruz samples. All the physicochemical parameters were measured in triplicate and according to the standard methods (APHA 2012), excepting the concentrations of N–NO3, which were assessed by the technique of Mubarak et al. (1977). The concentrations of metals were measured by flame atomic absorption spectroscopy using a SpectrAA spectrophotometer (Varian 880, USA). For HCO3 and Cl, titrimetric methods were used, while for COD, N–NH4+, N–NO2, N–NO3, and PO43−, the methods were spectrophotometric.

For microbiological analyses, 100-mL aliquots of each sample were filtered through 0.45-μm nitrocellulose filters that were then cultured on non-specific solid media (Standard Plate Count Agar, Bioxon™) for the determination of the HPC or on specific solid media for the assessment of the other microbial indicators. E. coli was determined by culture on m-FC agar (Fluka Analytical™); Salmonella sp. and Shigella sp., on HiChrome™ ECC selective agar (Sigma-Aldrich); Enterobacter sp., on violet red bile agar with MUG (BD); P. aeruginosa, on Pseudomonas agar (DIFCO™); E. faecalis, on m-Enterococcus agar (Difco™); Staphylococcus sp., on Staphylococcus medium 110 (BD); C. perfringens, on SPS medium (Merck); and finally, M&Y, on potato dextrose agar (Bioxon™). All the plates were made in triplicate and cultivated at 37 °C, excepting E. coli (45 °C) and M&Y (25 °C), and colonies counted after 48 h. The plates of C. perfringens were incubated in an anaerobic atmosphere with the BBL Gas Pack System™ (BD).

Statistical analyses

All statistical analyses were conducted using the SPSS version 21 statistical package (SPSS Inc., Chicago, USA). The results of the measurements of the mineralization-related variables were analyzed by descriptive statistics, and statistical differences were determined based on the Bonferroni correction test at the 0.025 significance level [for the samples of the study and the reference sites, and for the reference sites samples and the data reported by Huizar et al. (1998) for the same sites].

After a descriptive statistical analysis, the data collected for the study site (the Santa Cruz well) were also analyzed by multivariate techniques. Pearson correlations were calculated (0.01 ≤ p ≤ 0.05) to reveal the dependency between variables and to identify the minimal significance of the variables, which were further used to assess the loadings of the components to be considered according to their statistical significance (Lucho-Constantino et al. 2005). Principal component analysis (PCA) and principal factorial analysis (PFA), followed by varimax rotation, were finally carried out according to the method reported by Lucho-Constantino et al. (2005).

Calculation of the water quality indices

The calculation of an index allows the reduction in different water quality data to a single number reflecting the suitability of the source for a particular use. Concerning groundwater, the common targeted use is for drinking purposes, and consequently, the water adequacy is established against several relevant quality indicators proposed in international or national drinking water guidelines. In this paper, we used a simple methodology commonly found in the bibliography to calculate weighted arithmetic groundwater quality indices (Vasanthavigar et al. 2010; Oişte 2014; Bodrud-Doza et al. 2016). Lower values of the indices indicate less deviation from the values recommended by the considered guideline and thus higher drinking water qualities.

The calculation of the water quality indices involved three steps (Sethy et al. 2017). In the first step, the variables to be taken into account were selected, and a weight (wi) was assigned to each of them to reflect its relative importance in controlling the overall groundwater quality. Both the selection of variables and the assignation of their weights integrate expert knowledge and then are highly heuristic. Normally, practitioners choose freely the variables to be included accommodating site and treatment considerations, as well as published quality guidelines (Hurley et al. 2012). The highest weight of 5 was assigned to the variables having the major impacts on water quality; on the contrary, the minimum weight of 1 was assigned to the variables considered as having the least impacts on water quality. Intermediate weights (between 2 and 4) were allocated to the other variables depending on their perceived importance on the quality of water for human consumption.

In the second step, the relative weight (Wi) of each variable was calculated using Eq. (1):

$$W_{i } = \frac{{w_{i} }}{{\mathop \sum \nolimits_{i = 0}^{n} w_{i} }},$$
(1)

where wi is the assigned weight of each variable and n is the total number of considered variables.

In the third step, the quality rating of each variable (qi) was calculated by dividing its value in water samples (Vi) by its respective standard value (Si), according to Eq. (2):

$$q_{i} = \frac{{V_{i} }}{{S_{i} }} \cdot 100.$$
(2)

For calculating the overall water quality index, Eq. (3) was used:

$${\text{MWQI}}\,{\text{or}}\,{\text{GAWQI}} = \mathop \sum \limits_{i = 0}^{n} W_{i} \cdot q_{i} .$$
(3)

Two water quality indices were calculated. One index is based only on mineralization-related variables (MWQI), and it was used to assess the quality of the reference and the study sites with a few of parameters conventionally measured in groundwater. The other index (GAWQI) was developed to include variables related to both geogenic background and anthropogenic pollution in the water quality assessment. We selected the variables and their weight after reviewing the water index bibliography and prioritizing the groundwater pollution issues of the state of Hidalgo (Gallegos et al. 1999; Armienta and Segovia 2008; Lesser-Carrillo et al. 2011). We also made sure that the Mexican guidelines for drinking water provided maximum permissible limits for the chosen variables (NOM 2000). The GAWQI was only calculated for the more comprehensive data obtained for the Santa Cruz study site. To facilitate the computation of the indices, when the variables were undetected in the samples, they were assumed to be equal to zero. Table 1 shows the assigned weights, the standard values, and the relative weights used to calculate MWQI and GAWQI. The computed values of MWQI and GAWQI were classified into five categories, as given in Table 2 (Vasanthavigar et al. 2010; Oişte 2014; Sethy et al. 2017). The concentration of E. coli gives no weight, as it implies to divide by zero. However, if this indicator was detected in one sample, the water was directly classified as “unsuitable for drinking” (Al-Omran et al. 2015).

Table 1 Variables, weight factors (wi), relative weights (Wi), and standard values (Si) considered for the calculation of the groundwater quality indices based on mineralization-related variables (MWQI) and on indicators of geogenic background and of anthropogenic pollution (GAWQI)
Table 2 Legend of the groundwater quality indices

Results and discussion

Water quality evaluation based on mineralization-related variables

General characteristics of groundwater from the reference and the study sites are summarized in Table 3 with mean, standard deviation (SD), and coefficient of variation (CV) values. For comparison purposes, the values of the Mexican drinking standard were also included. The CV values were lower than those reported in another studies (Jain et al. 2010; Del Campo et al. 2014), maybe because we analyzed a low number of samples (n = 4 for the reference sites and n = 5 for the study site).

Table 3 Mean, standard deviation (SD), and coefficient of variation (CV) of the mineralization-related variables measured at the reference and the study sites

The mean pH values of the reference and the study sites were 7.2 and 6.9, respectively, signaling neutral groundwater in both cases. Most of the groundwater has a pH ranging from 5.0 to 8.0 (IDNR 2002). The EC values were comprised between 316 and 455 μS/cm for the reference sites and between 319 and 516 μS/cm for the study well, while the TDS values ranged from 158 to 226 mg/L and from 139 to 258 mg/L for the reference and the study sites, respectively. Values for both EC and TDS indicated low-mineralized groundwater in this area, probably due to a low residence time (IDNR 2002; Jain et al. 2010). Groundwater with a content of TDS lower than 1000 mg/L is assumed to be fresh and desirable for drinking water if it is lower than 500 ppm (Patra et al. 2016). The order of relative abundance of major cations, expressed as percentage of meq/L, was Ca2+ (33%) > Na+ (32%) > Mg2+ (29%) > K+ (6%) in the reference sites. In the Santa Cruz well, the relative abundance of the major cations (in meq/L) was Mg2+ (35.8%) > Ca2+ (32.5%) > Na+ (23.5%) > K+ (8.2%). The lack of data for other anions (particularly sulfates) rules out the full identification of the type of groundwater sampled; however, the predominance of HCO3 and Ca2+ in the reference sites suggested that the groundwater composition might be influenced by carbonate mineral dissolution (Bodrud-Doza et al. 2016). In a previous study covering the Apan–Tochac sub-basin, the Ca + Mg + HCO3 hydrochemical facies was found to be dominant (Huizar et al. 1998). The authors of this study concluded that such chemical composition was in good agreement with a volcanic environment where calcium and magnesium were derived from plagioclase and ferromagnesian minerals, respectively.

According to the mineralization-related variables presented above, the groundwater from both the reference and the study sites complies with the Mexican water quality standards. We also evaluated the adequacy of the groundwater for irrigation purposes; for this, it is common to use the sodium adsorption ratio (SAR) and EC values. SAR is commonly calculated using Eq. (4), provided by the U. S. Department of Agriculture (Regional Salinity Laboratory 1954):

$${\text{SAR}} = \frac{{[{\text{Na}}^{ + } ]}}{{\sqrt {\frac{{\left[ {{\text{Ca}}^{2 + } } \right] + [{\text{Mg}}^{2 + } ]}}{2}} }}.$$
(4)

High SAR values (> 18 meq/L) indicate a sodium hazard, because the irrigation with saline water (with high Na+ and low Ca2+ contents) favors the saturation of Na+ and the increase in the exchangeability of this cation, thereby leading to the destruction of soil structure and a diminution of its permeability (Vasanthavigar et al. 2010). The mean calculated SAR values were 1.15 and 0.77 meq/L for the reference and the study sites, respectively. Consequently, in both cases, the low Na content of the groundwater allows its use for irrigation on almost all soils without this involving the development of harmful levels of exchangeable Na (Regional Salinity Laboratory 1954). If the mean CE values are also considered (388 and 454 μS/cm for the reference and the study sites, respectively), the irrigation with these groundwaters represents a medium salinity hazard, as in most cases plants with moderate salt tolerance could be grown (Regional Salinity Laboratory 1954).

Statistical analysis of the mineralization-related variables

Paired difference tests were performed between the data measured in the reference and the study sites (presented in Table 3), but also between each of them against the characteristics reported for the reference sites in a previous study (Huizar et al. 1998). On the one hand, this analysis revealed significant differences (p ≤ 0.025) between the reference sites and the study sites concerning the mean values of pH, EC, HCO3, and Cl. In spite of the correlation existing between EC and TDS values, no significant differences were found in the TDS measured in the reference and the study sites, nor in the contents of the major cations. On the other hand, when the characteristics measured at the reference sites were compared against those measured at the same sites but more than 19 years ago, significant differences were found only for one parameter after Bonferroni correction. In this way, the mean value of Cl (33.5 ± 2.7 mg/L) was significantly higher in the above-mentioned previous study (Huizar et al. 1998) than in our samples (8.0 ± 3.9 mg/L).

Assessment of the mineralization-based water quality index (MWQI)

The quality index based on mineralization variables (MWQI) was calculated from the values presented in Table 3. For the study site, we used the mean values of these parameters and then we could only obtain one value for the MWQI (i.e., 17.9), while for the reference sites we calculated the index for each site (i.e., Espejel, Acopinalco, Voladores, and Loma Bonita) separately. The MWQI ranged between 15.4 and 23.0 for the reference sites. The results are presented in Fig. 2. We used the values of the quality parameters reported in Huizar et al. (1998) to compute the MWQI of the groundwater from these reference wells (which were comprised between 24.8 and 29.7), and the resulting values are presented in Fig. 2 too. All the calculated MWQI values allow these groundwaters to be classified as excellent for drinking, according to Table 2.

Fig. 2
figure 2

Values of MWQI calculated for the mean quality values measured at the study and the reference sites. The values measured in the reference sites by Huizar et al. (1998) are also shown, as well as the upper limit value of the “excellent water” category (discontinuous line)

It should be noted that the groundwater from the reference sites presented higher MWQI values in 1998, due mainly to its higher mineralization compared with the same sites nowadays. In Fig. 2, the contribution of each parameter (Wi· qi) to the final MWQI value is also presented. Evidently, the biggest contributions come from the most weighted parameters (i.e., from the parameters rated with the highest wi values, as Na+ and TDS). However, for the reference samples analyzed in 1998, the influence of the chloride concentrations (weighted with a mean value, wi = 3) in the final value of the MWQI is noteworthy. In fact, the mean Cl content reported for the reference sites in 1998 (33.5 ± 1.3 mg/L) is more than ten times higher than the Cl concentration that we measured in the Santa Cruz samples recently (2.6 ± 1.0 mg/L).

Water quality evaluation including indicators of geogenic background and anthropogenic pollution

The groundwater from the study site (the Santa Cruz well) was analyzed in a more comprehensive way than the reference sites, and several indicators of geogenic background and anthropogenic pollution were also determined for this particular well (Table 4). Higher CV values were obtained for these parameters (ranging from 4.2 to 200%) than for the mineralization-related variables presented above (that fluctuated between 1.4 and 52.5%; Table 3).

Table 4 Mean, standard deviation (SD), and coefficient of variation (CV) of the indicators of geogenic background and anthropogenic pollution measured at the study site

Concerning the geogenic indicators, their relative abundance was Mn > Fe > Al > Ni > Cr > Pb > Cd > Cu. Neither As nor Zn was detected in any sample, despite the fact that arsenic is a frequently reported contaminant of groundwater from several zones of Mexico, such as the region known as “Comarca Lagunera,” the Río Verde basin, the Salamanca aquifer, and Zimapán, also in Hidalgo State, among others (Armienta and Segovia 2008).

As expected, the groundwater of the Santa Cruz site presented high contents of Mn (ranging from 3718 to 5759 μg/L), which were significantly different from the Mn concentrations found in the reference sites (p < 0.05, comprised between 10 and 43 μg/L) and quite higher than the permissible limit of 150 μg/L established by the Mexican guidelines (NOM 2000). High Mn concentrations restrain the water uses, because they lead to organoleptic problems, staining of surfaces and reduced water pressure and flow in distribution pipelines derived from the accumulation of Mn oxides (Homoncik et al. 2010). Besides, recent epidemiological studies report the association between Mn exposure from drinking water supplies with cognition and behavioral deficits in children and adults (Frisbie et al. 2012; Oulhote et al. 2014).

To the best of our knowledge, the Mn concentration measured at the Santa Cruz site is the highest value reported for groundwater from Mexican sources. Lower Mn contents were reported for the Apan–Tochac sub-basin (0–188 μg/L; Huizar et al. 1998) and for other near aquifers of Hidalgo State, as the Tecocomulco sub-basin (0.1–133 μg/L; Huizar-Álvarez et al. 2001), the Río de las Avenidas sub-basin (1–1068 μg/L; Huizar-Álvarez 1997), and the Mezquital Valley (0.3–150 μg/L; Lesser-Carrillo et al. 2011). For groundwater from the Mexico basin, the measured Mn concentrations ranged between 3 and 960 mg/L (Domínguez-Mariani et al. 2015). In this same basin, the groundwater from individual wells of the Peñón-Texcoco site has peaked at 4610 μg/L, but the overall mixture of the water from 15 wells yields a mean concentration of 1520 μg/L (Piña and Ramírez 2003). The values reported for groundwater from other Mexican sources are also lower than those obtained in our study, such as wells supplying the cities of Guaymas, Son. (100–1500 μg/L), Navojoa, Son. (1300 μg/L), and Veracruz, Ver. (390–540 μg/L) (Piña and Ramírez 2003).

The presence of Mn in groundwater is controlled by such factors as rock geochemistry, water composition, and microbial activity. The weathering of Mn-bearing rocks (i.e., shale, graywacke, limestone, or ultramafic rocks) can result in elevated concentrations of this metal in soils and sediments. In the aquifer, groundwater dissolves these materials, which release Mn to the water. The groundwater characteristics, such as pH, redox potential, dissolved oxygen, and dissolved organic matter, control the resulting concentration and speciation of Mn. The most likely species is Mn2+ at pH < 7 and at a redox potential as high as 800 mV (Homoncik et al. 2010), as it occurs in the Santa Cruz site, where redox potentials of around − 4 and 133 mV were measured (data not shown). Finally, the microbial activity has a significant role enhancing the Mn oxidation as well as its reduction.

As all the mechanisms mentioned above act similarly on Mn and Fe, these metals are often found concurrently in groundwater, with Fe being present at higher levels than Mn (Homoncik et al. 2010). However, this was not the case for the Santa Cruz site. The mean concentration of Fe found in the Santa Cruz groundwater (395 ± 213 μg/L) was not significantly different from the average content of this metal measured in the reference sites (318 ± 126 μg/L; p < 0.05; data not shown). However, the content of this metal in the Santa Cruz well exceeded the Mexican guideline value (300 μg/L) in 83% of the samples. The mean contents of Al (253 ± 34 μg/L), Cd (12.5 ± 1 μg/L), and Pb (28 ± 26 μg/L) also surpassed the permissible limits of the Mexican guidelines (200, 5, and 10 μg/L, respectively). In the cases of the concentrations of Al and Cd, the permissible limits were surpassed in 100% of the samples, while for the content of Pb, it exceeded the limit in 60% of the samples. The concentrations measured for Cu (ranging from 5 to 17 μg/L) and Cr (varying from 26 and 36 μg/L) did not exceed the guideline values (of 2000 and 50 μg/L, respectively).

Concerning the eventual anthropogenic pollution of the study site, we assessed the contents of organic matter, nutrients, and the densities of several microbial indicators in the groundwater. The results are summarized in Table 4. Although COD is rarely determined in groundwater, it is used as an indicator of aquifer pollution by wastewater. The mean concentration of COD measured in our samples was 35.2 ± 14.2 mgO2/L, which is quite similar to the value determined (31.3 ± 18.7 mgO2/L) for an aquifer affected by excessive withdrawal and maybe by sewage intrusion (Domínguez-Mariani et al. 2015). Ammonium is another indicator of sewage pollution, even though it is also associated with the intrusion of leachates from landfill sites or agricultural activities into aquifers (Böhlke et al. 2006). For these cases, concentrations of N–NH4+ of the order of 10–100 mg/L have been reported (Böhlke et al. 2006), which are quite higher than the contents determined in our samples (0.13 ± 0.16 mg/L). Our values were always lower than the Mexican permissible limit (0.5 mg N-NH4+/L).

By far, nitrate is the most ubiquitous anthropogenic pollutant in groundwater (Spalding and Exner 1993; Montiel-Palma et al. 2014). Its primary source is the excessive use of nitrogenous fertilizers in agricultural zones, followed by farming activities and sewage intrusions. Nitrate is linked to harmful health effects, mainly in children, and consequently, its presence in drinking water is a matter of public concern (Spalding and Exner 1993). Unexpectedly, taking into consideration that the study site is entirely dedicated to agricultural activities, the measured concentrations of this pollutant in our samples were not significant (0.48 ± 0.93 mg N-NO3/L) and always lower than the Mexican guideline value (10 mg N-NO3/L). Our results contrast with several studies made in Mexican agricultural zones, where nitrate is present at unsafe concentrations. For instance, in wells of the Comarca Lagunera region and of the Mezquital Valley (Hidalgo State), mean concentrations of 23.4 mg N–NO3/L (Medina and Cano 2001) and 13.46 ± 4.25 mg N–NO3/L (Gallegos et al. 1999) have been measured, respectively. In the case of the Mezquital Valley study, the difference in the contents of nitrates can be attributed to the irrigation with raw wastewater made in the zone, while at the Santa Cruz site the agriculture is of rainfed type. Likewise, the nitrite concentrations determined in our samples (0.02 ± 0.03 mg N-NO2/L) were always lower than the Mexican guideline value (1 mg N-NO2/L). The measured content of orthophosphates in the Santa Cruz well ranged between 0.93 and 2.74 mg/L. These contents can be considered as high, because they are in the same order of magnitude as the concentrations assessed in the groundwater from the Mezquital Valley (< 0.01–3.08 mg/L; Gallegos et al. 1999). Phosphates are not considered by the Mexican drinking water guidelines (NOM 2000).

Groundwater is a recognized source of potentially hazardous pathogens. The primary cause of the microbial groundwater pollution is the intrusion of sewage, municipal wastes, leachates from farming sites, and even surface water, into aquifers (Murphy et al. 2017). Usually, the degree of microbial pollution is assessed by the concentration of fecal coliforms, which is the main indicator of the microbial quality of drinking water. However, in a recent review, a total of 17 pathogens belonging to several groups and families were found responsible for the latest groundwater-related outbreaks. Among these pathogens, E. coli was the seventh most common outbreak cause, and coming after norovirus, Campylobacter, Shigella, the Hepatitis A virus, Giardia, and Salmonella (Murphy et al. 2017).

The densities of the microbial indicators analyzed are presented in Table 4. The most abundant indicator was the heterotrophic bacteria group (HPC), followed by P. aeruginosa, C. perfringens, and M&Y. These indicators were found with the highest frequency too (in 100% of the samples). Staphylococcus sp. (detected in 75% of the samples), E. coli (50%), Salmonella sp. (20%), and Shigella sp. (20%) were also found but in lower quantities (they are listed in descending order of abundance). Neither Enterobacter sp. nor E. faecalis was detected in any sample. Consequently, the Santa Cruz groundwater did not comply with the Mexican regulations for drinking water, which establish that E. coli, as well as fecal coliforms and thermotolerant microorganisms (term including Enterobacter sp. and Enterococcus sp.), must be absent in drinking water samples (NOM 2000). The mean concentration of E. coli found in our samples (2.5 ± 4.4 CFU/100 mL) is of the same order of magnitude as the majority of the values reported for wells of the Mezquital Valley (< 3 CFU/100 mL). However, very high concentrations of this indicator (> 2.4·103 CFU/100 mL) were measured in some of these wells (Gallegos et al. 1999). For groundwater from the Mexico basin, mean concentrations of E. coli of 0.27 ± 0.97 CFU/100 mL have been reported (Mazari-Hiriart et al. 2005). These authors state that E. coli, Enterococcus sp., and P. aeruginosa were the most prevalent microorganisms throughout a year. In our samples, the most ubiquitous and most abundant microbial indicator was P. aeruginosa.

Multivariate statistical analysis of the geogenic and anthropogenic pollution indicators

The correlation matrix indicates how well the variance of each variable can be explained by relationships with each other of the remaining variables (Bodrud-Doza et al. 2016). The Pearson correlation coefficients of the water quality variables measured in the Santa Cruz site are presented in Table 5. Strong (p < 0.01) and significant (p < 0.05) correlations were found, which reveal the influence of several processes on the groundwater composition.

Table 5 Pearson correlation matrix for 18 variables measured at the study site

On the one hand, the geochemistry showed an impact on the indicators of both the mineralization and the geogenic pollution of water. Accordingly, a strong and positive correlation (r = 0.966) was found between the Mn content and the EC. As signaled before, the most likely manganous species at the Santa Cruz site is Mn2+, which certainly contributes to the water conductivity. The pH presented a significant and positive correlation with Cl (r = 0.851), while it showed a significant but negative correlation with K+ (r = − 0.881). Bodrud-Doza et al. (2016) also found a negative correlation between pH and K+, which was attributed to geogenic processes in rocks altered by the acidification of the groundwater.

On the other hand, certain correlations highlight the anthropic entries of pollutants, mainly concerning nutrients. N–NO2 was found to be strong and positively correlated with PO43− (r = 0.996), as well as N–NO3 with Cr (r = 0.994) and Ni (r = 0.994). This correlation suggests a common origin of these macro- and micronutrients, maybe the application of fertilizers. Significant and positive correlations were also detected between N–NH4+ and Cr (r = 0.902), Ni (r = 0.902), and N–NO3 (r = 0.894). Although the microbial indicators are often correlated positively with nutrients, in our study, we only found significant and positive correlations of HPC with N–NO2 (r = 0.815) and with PO43− (r = 0.812).

Both PCA and PFA are multivariate techniques used to derive a small number of relevant factors from a broad data set. Hereafter, we only present the results of the PFA, because this analysis confirmed the relationships between the study variables found preliminarily by the PCA. Table 6 shows the orthogonally rotated matrix with the loadings of 18 quality variables measured in the groundwater of the Santa Cruz site. Two factors were extracted (F1 and F2) (Fig. 3), which together explained 100% of the total variance in the original dataset. Each factor loading designates the correlation between the variable and the new factor.

Table 6 Loadings of 18 variables measured at the study site on two significant principal factors (F1 and F2)
Fig. 3
figure 3

Loading plot of 18 variables measured at the study site on two significant factors, F1 (in discontinuous lines) and F2 (in continuous lines)

In Table 6, the loading factors higher than 0.812 are highlighted in bold characters. This threshold value corresponded to the minimum Pearson correlation value calculated with p < 0.05 (Table 5) and allowed the strong correlations between the original variables and the new factors (F1 or F2) to be identified. F1 explained 65.72% of the total variance of the original data. F1 was positively loaded by organic matter (COD), macronutrients (N–NH4+ and N–NO3), and some micronutrients (Cr and Ni). At the same time, F1 was negatively contributed to by metals (Cu, Cd, and K+), as well as the contents of P. aeruginosa and M&Y (Fig. 3). In this way, F1 seems to be related to the anthropogenic pollution and geogenic background of the Santa Cruz site, probably due to the convergence of several sources, i.e., the intrusion of wastewater or the infiltration of agricultural wastes or inputs (such as fertilizers). F2 accounted for 34.28% of the total variance in the dataset and showed a positive correlation between the pH, the EC and the contents of Mn, Fe, and Cl. F2 might be thus associated with mineralization and geogenic processes modifying the composition of the groundwater. It is worth to note that F2 revealed a strong correlation of both Mn and Fe, as it is often signaled in the bibliography (Homoncik et al. 2010). F2 also consisted of a negative loading of nutrients (N–NO3 and PO43−) and HPC (Fig. 3).

Assessment of the groundwater quality index including geogenic and anthropogenic pollution indicators (GAWQI)

We proposed an index that considers mineralization processes, as well as geogenic background and anthropogenic pollution, in the groundwater quality for drinking purposes. The GAWQI values calculated for the individual samples taken from the study site are presented in Fig. 4, as well as the contribution of each indicator considered. The breakdown of the GAWQI calculated for the samples 3 and 4 is also presented even though both samples were classed as unsuitable for drinking only due to the detection of E. coli.

Fig. 4
figure 4

Values of GAWQI calculated for the study site. The discontinuous line indicates the upper limit of GAWQI values for drinking water

The GAWQI values ranged from 451.1 to 560.3, which indicate that the groundwater from the Santa Cruz site is unsuitable for drinking (Table 2). The contributions of some of the indicators (i.e., Na+ and the nitrogenous pollutants) to the total GAWQI value are not discernible in Fig. 4 because they are too small. In the case of As, a metalloid undetected in our samples, its contribution is null. However, due to its historical importance in the groundwater resources in Mexico and to its high toxicity, we think that it is important to include As in the GAWQI calculation for further uses of the index. Figure 4 shows that the poor quality of the groundwater is due, by far, to its excessive Mn concentration (more than 35-fold higher than the maximum permissible value established by the Mexican guideline). In some samples, the second largest contribution to the GAWQI value comes from Pb, because it was the most weighted parameter for the GAWQI calculation due to its high toxicity. Finally, Fe represents the third contribution to the GAWQI value, even though this is not considered a toxic metal as Pb but rather a micronutrient. Altogether with Mn, Fe oxidizes when the groundwater enters in contact with atmospheric air, and the resulting deposits are considered a nuisance in water supply systems (Piña and Ramírez 2003).

As the GAWQI values indicate, the groundwater quality of the Santa Cruz site is unsuitable for drinking due to combined geogenic and anthropogenic sources. Therefore, the monitoring of a range of parameters wider than the usual is necessary for a proper assessment of the groundwater quality. Such assessment can provide options for water treatment systems able to deliver safe water to the inhabitants of the study zone and to enhance the governance of their water resources.

Conclusion

We analyzed the quality of groundwater of the Apan region (Hidalgo State, Mexico), coming from one well in the study site (Santa Cruz) and four reference wells. When only conventional variables related to mineralization processes were assessed (i.e., pH, TDS, Na+, Ca2+, Mg2+, and Cl), as is often done in groundwater studies, all the samples complied with the Mexican standard for drinking water. Besides, the SAR values calculated for all the samples indicated that the use of these groundwaters for irrigation is allowed. The values of the index calculated from these mineralization-related variables (MWQI) showed that the groundwater qualities of these wells were well suited for drinking; actually, they were classified as “excellent.”

The groundwater from the Santa Cruz site was further analyzed regarding several indicators of geogenic background and anthropogenic pollution. We confirmed that the groundwater of the study site presents a significant problem due to the concentration of Mn, which is probably the highest reported value for Mexican groundwater sources. The groundwater of the study comply with the Mexican guidelines established for nitrogenous pollutants, arsenic, and some heavy metals (such as Cu, Cr, and Zn), but all the samples exceeded the limits concerning Al, Cd, and obviously Mn. Due to seasonal differences in the groundwater quality, some of the samples also exceeded the permissible limits of Pb and E. coli. According to the GAWQI values, the groundwater of the study site was found largely unsuitable for drinking. This should be a major concern for the local water managers and users, leading to the search for adequate treatment options.

Our results support the need for the continuous monitoring of the groundwater quality, and the usefulness of a comprehensive water quality index. The main advantage of the proposed index is that it embraces a range of indicators of mineralization and anthropogenic pollution, as well as of a geogenic background common in Mexican aquifers. Although undoubtedly the index proposed herein is perfectible, it is our view that it could represent a valuable communication tool to inform the population and the authorities about the quality of the local water bodies. This accessible information is necessary for the scientifically sound management of groundwater resources, which is a crucial aspect of the development of a semiarid country as Mexico.