Introduction

Meeting the increasing need for water with sufficient physicochemical and biological quality to meet human, agricultural, livestock, and leisure demands is hampered by the decreasing availability of water sources of this quality. This decrease is partially due to anthropogenic changes that affect the infiltration and storage rates but mainly due to the increasing number of contaminants associated with the different types of land uses and the effects on environmental components. Such conditions are directly controlled by the management and technologies applied in the disposal, storage, use, and control of the amount of contaminants. Among the affected environmental components are surface water and groundwater; in this aspect, when groundwater is affected by contaminants, there is contamination of the geological materials (rocks and unconsolidated materials), and contamination of surface water through discharge zones (springs) can also occur. Contamination of the entire biological and food chain may occur, which will have a broad effect on society.

In the case of subsuperficial waters (saturated and vadose zones), some studies that highlight management and consequent planning to maintain groundwater’s availability and quality stand out. The first is the assessment of the degree of alteration of these waters, the second is the estimated availability, and the third is their vulnerability to the possible installed and/or planned sources of contaminants, considering the foreseen uses for a given region. Moreover, vulnerability studies are essential for the analysis and estimation of possible contamination and also provide fundamental information for territorial planning, aiming at maintaining the quality of groundwater storage. Since the 1970s, studies involving vulnerability estimates have been performed in both technically and economically developed and developing countries, with dozens of methods developed for scales ranging from very small for large areas (large basins and countries) to large scales for small areas (first-order basins). However, the same procedures have been applied to conditions of very different environmental components, with some results not revealing the actual vulnerability conditions. Others are based on a reduced set of information about the environmental components involved in the contaminant flow, which can include very distinct areas in the same category.

The predictability of groundwater contamination is a complex process, because it depends on natural and anthropogenic information and contains a significant degree of uncertainty, such as contaminant sources, potential infiltration capacity, distribution of infiltrated waters, and different physical and chemical conditions of the water and geological materials. One method to assess the potential for contamination is to consider the contaminant sources and waters and to assess the vulnerability aspect by considering the characteristics of the geological environment. Groundwater vulnerability cannot be measured directly in the field, although it is possible to identify units that present different contamination conditions and, therefore, different degrees of vulnerability. This zoning can be developed via mathematical models or several types of analyses based on information that represents the system and a conceptual model.

The importance of vulnerability studies has been discussed in several publications. Recently, there have been noteworthy suggestions from several professionals regarding the importance of the topic and the need to consider data that provide a more representative index in terms of vulnerability, such as Witkowski (2016), Iván and Mádl-Szőnyi (2017) and Baalousha (2017).

A proposal and application of a vulnerability index method based on information that reflects the different natural and anthropogenic environmental components that interfere in the process of groundwater contamination was developed. The process considers the following basic assumptions: the use of logical tree resources to structure the information in a hierarchy; sequential discretization of the problem; the variability of geological materials, vadose and saturated zones, diffusion, and point contaminant sources; and terrain units delimited based on engineering geological zoning associated with land-use types.

Literature review

The estimation of groundwater contamination is a complex problem requiring data from different areas of study, scales, and magnitudes of the involved mechanisms and processes. These studies can be classified into three main categories: analyses of vulnerability before the installation of possible sources of contamination, analyses of the vulnerability of groundwater to installed contaminant sources, and analyses aimed at generating guidelines and standards for territorial and environmental planning.

The studies are complex, because predicting the extent of contamination varies over small distances, often less than a 1000 m because of the natural attenuation processes that occur in the space from the source of contamination to the considered groundwater depth associated with the flow conditions. These studies have multiplied in the last 50 years, with pioneering works by Le Grant (1964) and Albinet and Margat (1970), who coined the term groundwater vulnerability. However, the term has undergone conceptual variants, depending on the researchers, scale of analysis, country, and methods and procedures, as can be observed in the works of Rao and Alley (1993), Vrba and Zaporozec (1994), Zaporozec (2002), Gogu et al. (2003), Worrall and Kolpin (2004), and Witkowski (2016).

The most well-known concept used by the professionals involved in this subject is that of Rao and Alley (1993), which conceptualizes groundwater vulnerability as the possibility of a contaminant reaching a certain position in the saturated system (hazardous event) when a contaminant is disposed at a point topographically above the analysed point. In general terms, the vulnerability of a groundwater system, according to Gogu and Dassargues (2000), can be developed in two directions: the intrinsic condition, which is only based on geological, hydrogeological and hydrological factors without considering the possible sources of contaminants, and the specific condition, which addresses the assessment of the contaminants and their interrelationship with the aspects considered in the intrinsic condition.

These different methods and procedures can be grouped into seven categories: geological complexity and clustering methods (MODEL BASED ON REGIONAL GEOLOGIC FRAMEWORK, DIVERSITY, and GALDIT vulnerability index); methods focused on indices (DRARCH, MLPI, LPI, AVI, PI, COP, and RI); analogic methods (AlbinetandMargat, USEPA, and BRGM); parametric methods (DRASTIC,GOD, EPIK, SINTACS, FLEMISH METHOD, VULPEST, VLDA model, DRASTICA model, NEURO-FUZZY TECHNIQUES, and DRAV model); mathematical methods (MODFLOW, GLEANS, PRZN, AEM/DRASTIC, SEEPAGE, SEEP/W, and SEEPAGE); statistical methods (PCASD, GLA, AGRIFLUX–MODFLOW, GERMAN METHOD, MULTIVARIATE STATISTICAL ANALYSIS, DASTI/IDRISI, and PESTANS); combined methods (CNR-GNDCI, USGS, EVARISK, and AQUIPRO); and methodologies coupling travel time estimation and rating methods (APLIE method).

Because these methods produce different results for the same area, some authors, such as Neukum and Hötzl (2006), have attempted to propose a relationship between the different results, although the variability of attributes considered in the different methods is limited. It should be noted that many of these methods have been modified for applications in specific regions with changes in attribute classes and simplifying or increasing complexity, such as the recent examples of Mishima et al. (2011), Duarte et al. (2015), Dickson-Anderson et al. (2015), Sahoo et al. (2016), Bonfanti et al. (2016), and Boufekane and Saighi (2018). Others have developed procedures to specific natural conditions, e.g., Zhou et al. (2012), Huneau et al. (2013), and Jia et al. (2014).

The methods and procedures cited above present some points in common that should be highlighted:

  1. 1.

    The attributes/information used in the different methods varies widely, which leads to very different results for the same area.

  2. 2.

    The methods do not usually separate intrinsic vulnerability from specific vulnerability.

  3. 3.

    The concepts of vulnerability considered in the different methods vary, even when the same method is applied in different areas and by different professionals.

  4. 4.

    When data about contaminant sources are used, the data do not often reflect the intensity or magnitude.

  5. 5.

    Most of the methods used do not consider data regarding contaminant sources and anthropogenic aspects.

Proposal and development of a vulnerability index

The main point of this work is the proposal of a set of procedures that consider the following aspects:

  1. 1.

    Use of the conceptually most appropriate context, which is the assessment of specific vulnerability.

  2. 2.

    The water of the saturated and unsaturated zones as the environmental component responsible for the maintenance and continuity of the surface channels and subsurface storage.

  3. 3.

    The data associated with specific data acquired in posterior studies should allow for risk analysis in the future.

  4. 4.

    The result should generate the classification of the land units in terms of the vulnerability grading based on the final vulnerability index values obtained using all attributes for units relative to the reference value (maximum or minimum values of the total vulnerability index).

The proposed index is based on the following aspects of geological materials, contaminant sources, and methods to acquire the data:

  1. 1.

    The spatial variability of the geological materials, mainly unconsolidated materials.

  2. 2.

    Large extensions consisting of zones with infiltration conditions that respond on large scales.

  3. 3.

    Attributes that are obtained by engineering geological mapping procedures, e.g., those proposed by Zuquette, Pejon and Collares (2004), without the requirement of sophisticated equipment, although they must represent the characteristics of the environment.

  4. 4.

    Costs and uncertainties related to obtaining the information. The set of procedures and methods must have a reasonable cost, and the uncertainties should fit within the work scale and the data type.

  5. 5.

    The diversity and specific characteristics of contaminant sources.

  6. 6.

    The relationships between the mechanisms related to the introduction of the contaminant, transport of pollutants in porous media and natural attenuation.

  7. 7.

    Application at scales preferably larger than 1:50,000. The purpose of an analysis and vulnerability map is the possibility of application for different purposes, i.e., spatial or specific planning, which is possible only at scales larger than the considered magnitude.

To satisfy the previous conditions, the following aspects were considered:

  1. 1.

    The attributes and their classes are hierarchically ordered from the lowest to the highest influence in the context of vulnerability.

  2. 2.

    The degree is related to the potential magnitude of the contaminant load that would reach the depth level under analysis.

  3. 3.

    The contaminant load and the degree of vulnerability are functions of four major aspects of the analysis:

  • The analysis of the contaminant sources according to the magnitude and contaminant types.

  • Contaminant transport to the depth level under analysis.

  • The natural attenuation conditions of geological materials.

  • The conditions of redistribution of contaminants and biogeochemical reactions.

The main components that control the contaminant flow process in geological porous media are the following:

  1. 1.

    The introduction of contaminants from a source in contact with the geological materials.

This aspect depends on the availability of contaminants in a liquid solution and the liquid volume to transport them from the contaminant source to the geological environment as well as the propagation and distribution to varying depths.

  1. 2.

    The propagation of contaminants in the geological environment to the analysed depth (contaminant transport).

The propagation depends on the amount and types of liquids, the porosity and the characteristics of the voids, and the spatial distribution of the geological materials that control the variability of the hydraulic properties and flow conditions.

  1. 3.

    The interactions of the contaminants with the water and geological materials as well as decay reactions, degradation aspects, sorption and other processes that constitute natural attenuation.

Figure 1 shows a diagram of the relationships among the fundamental components in the groundwater vulnerability assessment with the separation of different aspects that are part of the three basic components (source of contaminants, geological materials, water, anthropogenic conditions, and water conditions—vadose and saturated) and the interactions that should be assessed in vulnerability studies.

Fig. 1
figure 1

Hierarchical levels and respective constituents considered to obtain the Vulnerability Index

Based on Fig. 1, it is found that the following basic conditions must be considered in the proposal of a vulnerability index:

  1. 1.

    Type of contaminant sources in spatial (point or diffuse) and temporal (continuous or pulse) terms.

  2. 2.

    Conditions for introducing water or other contaminated liquids from the surface.

  3. 3.

    Flow conditions in saturated and unsaturated zones.

  4. 4.

    Processes involved in the transport of contaminants in the geological environment (advection and hydrodynamic dispersion).

  5. 5.

    Aspects of sorption, desorption, fugacity, and retardation that interfere with and control natural attenuation conditions.

  6. 6.

    Biogeochemical reactions that result in precipitations, chemical compound releases, decays, and degradations.

  7. 7.

    The (two-dimensional)equation that governs the transport of contaminants, which incorporates the following factors: advection, contaminant sources, hydrodynamic dispersion, sorption and retardation aspects, irreversible reactions, and other reactions among the chemical components:

$$\overbrace {{\left( {1+\frac{{{\rho _{\text{d}}}}}{\theta }\frac{{\partial \bar {C}}}{{\partial C}}} \right)}}^{\begin{subarray}{l} {\text{Retardation}} \\ {\text{aspects}} \end{subarray} }=\overbrace {{\frac{\partial }{{\partial xi}}\left( {{D_{{\text{ij}}}}\frac{{\partial C}}{{\partial xj}}} \right)}}^{\begin{subarray}{l} {\text{Hydrodynamic}} \\ {\text{disperson}} \end{subarray} } - \overbrace {{\frac{\partial }{{\partial x}}(ViC)}}^{\begin{subarray}{l} {\text{Retardation}} \\ {\text{aspects}} \end{subarray} }+\overbrace {{\frac{{qs}}{\theta }Cs}}^{{{\text{Advection}}}} - \overbrace {{\lambda \left( {C+\frac{{{\rho _{\text{d}}}}}{\theta }\bar {C}} \right)}}^{\begin{subarray}{l} {\text{Contaminant}} \\ {\text{sources}} \end{subarray} }+\overbrace {{\left( {\frac{{\partial C}}{{\partial t}}} \right){\text{reac}}}}^{{{\text{Reactions}}}},$$

where C—dissolved concentration; \(\bar {C}\)—sorbed concentration, which is a function of the dissolved concentration, C, as defined by the sorption isotherm; Vi—average linear water velocity; Dij—dispersion coefficient tensor; qs—flow rate of a fluid source per unit aquifer volume; Cs—contaminant concentration of the fluid source; Θ—porosity; λ—reaction constant; ρd—bulk density of the porous medium; t—time; x—longitudinal direction; and React—the biological or chemical reaction of the solute (other than sorption).

  1. 8.

    The position of the target depth of the saturated zone in relation with the geological materials generates two conditions: the depth of the saturated zone is positioned only within the unconsolidated materials (above the top of the rock substrate) or in the rock substrate (below the top of the rock substrate), which both demand different information to obtain the vulnerability index. In the first condition, the rock materials do not interfere with the contaminant flow or the final vulnerability index. However, in the second condition, the rock and unconsolidated materials should be considered

Considering the spatial limits and the possibilities of both conditions, it is observed that the following basic data are essential:

  • Saturated zone depth.

  • Depth of the rock substrate.

  • Type of contaminant sources and its characteristics.

  • Position (depth) of release of the contaminant to the geological environment.

  • Source of water (rain, introduction with the contaminant or anthropogenic not related to the contaminant).

  • Characteristics of the geological materials between the base of the contaminant source and the position of the groundwater level in question.

  • The need to address the understanding of the flow in the direction of interest between the contaminant source base and the point (depth in question to obtain vulnerability).

Selection of attributes

One of the major challenges in proposing an index for vulnerability estimation is the set of criteria to select the attributes that actually control the process. In this case, three basic components are needed: introduction of the contaminant from a source in contact with the geological environment, propagation of the contaminant in the geological environment to the analysed depth, and the interactions of the contaminants with the water and the geological materials between the base of the contaminant sources and the analysed position.

In the process, the relationships between the introduction of contaminants, contaminant transport, natural attenuation, characteristics of the contaminant sources, waters and geological materials (considering the mechanisms and control aspects of the mechanisms), and environmental and anthropogenic factors that affect the cited aspects and consequently the mechanisms are needed. These relationships were used to define the hierarchical levels and attribute groups in this study.

The achievement of the final vulnerability index is focused on a hierarchical chain of five levels, which were defined by the initial subdivision and the logical trees developed from hierarchical level 1 to level 5, which is related to the most detailed level. The levels are defined as follows:

Hierarchical Level 1—final vulnerability index (FVI), which is the purpose of the proposal.

Hierarchical Level 2—this is the main subdivision, i.e., the basic components composed of the mechanisms, processes, and functions that affect the migration of the contaminants in the natural porous environment (introduction of the contaminant, transport and redistribution, and natural attenuation).

Hierarchical level 3—this level represents the grouping of the aspects at a detailed level than just the basic components alone, representing a set of interrelated information (attribute groups).

Hierarchical level 4—this level includes the attributes that characterize the different groups of hierarchical level 3. The attributes should be identified and characterized based on field and laboratory studies.

Hierarchical level 5—this level brings together all classes related to the attributes that allow the characterization of an area and its classification.

The hierarchical arrangement of the information allows for different methods of data processing, such as weight methods, rating indexes, fuzzy theory, overlapping, matrices, statistics, scoring, and methods that use combinations of the above.

The attributes and definition of the respective classes were selected based on the following aspects as well as the mechanisms and processes involved in contaminant transport in natural porous media:

  1. 1.

    The groups of hierarchical level 3 were defined considering that they gathered attributes related to the different mechanisms, processes, and functions involved in the movement of a contaminant between the sources and the analysed depth.

  2. 2.

    The attributes were selected based on their importance in the process as well as their interrelationships with more than one mechanism, either directly or indirectly.

  3. 3.

    The classes were adopted based on the following points:

    • The amplitude of the data variations based on both the natural occurrence and anthropogenic characteristics.

    • The interference of the extreme classes for the different mechanisms, processes, and functions.

    • When adopting nominal or ordinal classes, limits or descriptions should be mapped to a scale greater than 1:50,000.

    • The classes of each attribute were defined considering the following points:

  4. 1.

    The maximum and minimum classes are directly related to extreme movement of contaminants in natural porous media.

  5. 2.

    The maximum number of classes for each attribute cannot be greater than 7 to avoid a large number of mapping units, which would make it difficult to apply the classes in regions with high spatial variability.

  6. 3.

    The maximum and minimum classes must be valid for the most regions in terms of their geological conditions and contaminant source types.

  7. 4.

    The classes should reflect the degree of sensitivity in terms of the contaminant movement and geological variability.

  8. 5.

    The data of the maximum and minimum classes must be easy to obtain in the laboratory or in the field.

  9. 6.

    The maximum and minimum class results must generate conditions for the proposal of intermediate classes with real significance.

  10. 7.

    The use of the maximum class values should represent a potential for groundwater contamination at the determined depth level and over short time periods, generally less than 10 years.

For the selection of the attributes and classes, logical trees were used; 46 logical trees were elaborated, one for each attribute. All aspects related to vulnerability attributes, and respective classes could be analysed. One example of logical tree is shown in Fig. 2.

Fig. 2
figure 2

Example of Logic tree developed to analyze the vulnerability conditions associated with contaminant transport considering the five hierarchic levels

From the developed relationships, we defined ten groups of attributes that aggregate different attributes and respective classes, namely, rainfall, point and diffuse sources, groundwater, unconsolidated material 1 and rock substrate 1 (advection), unconsolidated material 2 and rock substrate 2 (hydrodynamic dispersion and redistribution), and unconsolidated material 3 and rock substrate 3 (sorption, reactions, and chemical precipitations). The global procedure is organized into five hierarchical levels from the top (hierarchy level 1) to attribute classes (hierarchy level 5) with respective elements used to define the weights and obtain the eigenvector for each element of each hierarchical level, as shown in Table 1.

Table 1 Partial index for basic components

Definition and assignment of weights

After the development of all hierarchical levels (Table 1—cols. 1, 2, 4, and 6), the weight assignment phase was conducted for all items of each component of the hierarchical level based on the following fundamental conditions:

  1. 1.

    Each hierarchical level assigns weights that denote the possibility of a higher contaminant load reaching the analysed depth, i.e., toward the highest degree of contamination.

  2. 2.

    The weights can be assigned based on different aspects, conditions, and criteria using the following procedures, some of which yield more effective results: simple assignment, linear evaluation, relative position, direct weights, least squares, entropy, eigenvector, minimal information trade-off assessment (MITA), and minimal pair comparison (MIPAC).

  3. 3.

    Considering that a system consists of “n” data points, a set of n-1 comparison pairs is possible, which can be assessed in three different manners: direct comparison, basic comparison group, and complex comparison group.

Based on these conditions, to obtain the numerical vulnerability index, weights ranging from 1 to 9 were adopted, as proposed by Saaty (1980), for the hierarchical analysis method; these weights are associated with the following precautions:

  • The weights were defined by comparisons of direct importance based on the mechanisms, phenomena and processes.

  • The attribution of the weights followed the complex comparison group system.

  • For all information associated with each of the hierarchical levels, an eigenvector was obtained from the initial weights assigned in the previous phase.

Obtaining the final vulnerability index

Based on the matrices, with the eigenvector (partial index) values normalized and respecting the appropriate maximum eigenvalue, the consistency index and the consistency ratio, the following operations were developed:

  1. 1.

    A total partial index (TPI), from Table 1 (col. 8), was adopted for the different aspects relevant to the hierarchical levels, including the mapped attributes and classes. The TPI values were obtained as follows:

$${\text{TPI }}={\text{ }}\left( {{\text{NH2}} \times {\text{NH3}} \times {\text{NH4}} \times {\text{NH5}}} \right),$$

where NH2—partial index for hierarchical level 2; NH3—partial index for hierarchical level 3; NH4—partial index for hierarchical level 4; and NH5—partial index for hierarchical level 5.

Table 1 lists the partial indices for the different hierarchical levels and the total partial index for each class of all attributes. From this table, it is possible to apply the vulnerability index for the two conditions (ground water level above and below the top of the rock substrate) as well as to for different regions at scales greater than 1:50,000.

  1. 2.

    Calculation of the total vulnerability index (TVI) for each unit, according to the following expression:

$$\begin{aligned} {\text{TVI}} & =\left( {{\text{TPIc}} - {\text{introduction of contaminants}}+{\text{TPIc}} - {\text{transport of contaminants}}} \right) \\ & \quad - \left( {{\text{TPIc}} - {\text{natural attenuation of contaminants}}} \right), \\ \end{aligned}$$

where TVI—total vulnerability index for the unit and TPIc—sum of total partial indices for each basic component of hierarchical level 2:

$${\text{TPIc}}=~\mathop \sum \limits_{1}^{n} \left( {{\text{TPI}}} \right).$$

Here, n—number of attributes and respective classes considered part of the component; and

  1. 3.

    Obtaining the final vulnerability index (FVI) for each unit.

To verify the amplitude and the possibility of categorization of the units in terms of the total vulnerability index (TVI), some aspects were considered:

  • The maximum and minimum values that a unit can were calculated considering the maximum and minimum partial index values.

  • For both the top of the saturated zone being above and below the rock substrate level, we have the following.

  • Top of the saturated zone above the top level of the rock substrate:

  • maximum value = 0.106216463100

  • minimum value = 0.020618785200.

  • Top of the saturated zone below the top level of the rock substrate:

  • maximum value = 0.134887500100

  • minimum value = 0.024351523200.

The possible difference between the maximum and minimum values for both conditions shows that it is possible to fit a significant range of natural and anthropogenic conditions between these limits.

The final vulnerability index (FVI) is obtained for each terrain unit and is represented by the units resulting from the combination of geological and geotechnical characteristics and land-use types. The FVI is a ratio between the total vulnerability index (TVI) and the maximum possible value for a unit considering the classes with a maximum total partial index, which result in the maximum TVI value. Then, the FVI is obtained from the following ratio:

$${\text{FVI }}={\text{ TVI of the unit}}/{\text{maximum value of TVI }}\left( {{\text{possible for specific condition}}} \right).$$

From the FVI values, all units can be compared, because the classification is based on a reference value.

  1. 4.

    Based on the FVI values, the vulnerability classes were defined considering the mean and standard deviation; however, other criteria can be used for class definition, depending on the amplitudes of the obtained values and the number of terrain units under analysis. Moreover, the same expression and procedures can be used to obtain the final partial indices for the three basic components (contaminant sources, contaminant transport, and natural attenuation) based on the relevant values for each situation as well as for specific group of attributes if necessary

Application

The proposed vulnerability index method was applied in the Ribeirão do Feijão Basin (RFB) located in the municipality of São Carlos, State of São Paulo, Brazil. The following aspects related to the basin supported the selection of this basin:

  • The natural environmental conditions of the basin are representative of most of Brazil, mainly in the south, southeast, and west regions.

  • There are large groundwater storage areas.

  • The Ribeirão do Feijão Basin supplies 50% of the total drinking water in the municipality of São Carlos.

  • The exploitation of surface and groundwater has increased significantly over the last 50 years due to rapid changes in land uses in the basin.

  • An unconfined aquifer occupies more than 50% of the study area.

  • In the study area, rainfall is seasonal, with well-defined rainy and dry periods. The surface water channels are maintained by water infiltrated during the rainy period.

  • There are several types of diffuse and point sources of contaminants.

The methodology was applied following the steps shown in the flowchart (Fig. 3); some aspects are fundamental to its application, such as the selection of a topographical map compatible with the chosen scale; interpretation of aerial photography and satellite images to identify and delimit water conditions, land uses, and contaminant sources; field and laboratorial works to characterize the geological materials; elaboration of maps and charts; adoption of the partial index and the calculation of the total vulnerability index for individual units; the data processing; and obtaining the final vulnerability index and classes for individual units.

Fig. 3
figure 3

Flowchart of the steps developed during the application in the study area

The proposed procedures associated with the predefined hierarchical levels permit the elaboration of intermediate cartographical documents and analyses to assess the results of different attribute groups. In this study, two intermediate analyses were developed: contaminant sources and natural attenuation conditions according to procedures cited in the respective topic and considering the specific attribute and class groups.

Study area

The Ribeirão do Feijão Basin (RFB) is located in the centre of São Paulo State, southeast Brazil, with an area of 243 km2, a perimeter of 48 km, and a drainage density of 0.60 km/km2. Figure 4a shows the study area location, main drainage channels, and geological formations.

Fig. 4
figure 4figure 4

a Ribeirão do Feijão Basin location with the drainage channels and map with geological formations. b Land-use map of Ribeirão do Feijão Basin. c Unconsolidated material unit map of Ribeirão do Feijão Basin. RUM is Residual Unconsolidated Material. d Map of the unit distribution in terms of position of groundwater level related to upper rock substrate surface

The climate type according to the updated Köppen-Geiger classification (Peel et al. 2007) is Cwa, with rainy summers and dry winters. The average annual rainfall is 1410 mm, with minimum and maximum values of 960 mm and 1660 mm. The average annual temperature is 21.2 °C, with minimum and maximum values of 5 °C and 37 °C.

Land uses are predominantly rural types, with grazing areas (semi-intensive and extensive cattle breeding) and agriculture with different types of crops (sugar cane, orange, and fruit plantations are most common). The other land uses are forestation (Pinus sp. and Eucalyptus sp.) and natural vegetation (Cerrado shrubs, semi-deciduous forests, and riparian vegetation).

The slope varies from less than 2 to 20% in most of the region; in some zones, it can exceed 100%. The altitude and relief amplitudes range from 650 m to over 1000 m and 20 to over 80 m, respectively.

Rock substrate map

Geologically, the basin is supported by different lithologies; the main characteristics are shown in Table 2. The unconsolidated materials found in the study area are mainly sandy residual and transported.

Table 2 Geological formations, main lithologies, and basic characteristics of the study area

Land-use map

The land-use map is shown in Fig. 4b. The main land use is pasture (77 km2; 32%), followed by forest (56 km2; 23%), orange and fruit plantations (43 km2; 17.6%), sugar cane (34 km2; 14%), and forestation (32 km2; 13.4%). However, the area associated with sugar cane plantations is increasing and may occupy 50% of the basin within a few years. The surface layer is currently much modified due to the ploughing process and management practices of land uses. Frequently, the surface layers are compacted because of the weight of the machines and intense use of fertilizers that function as a deflocculant, affecting the volumetric characteristics of the geological materials.

Unconsolidated materials map

The unconsolidated materials were classified according to their genesis as residuals or transported, with three residual unconsolidated materials (RUMs) and six transported materials. Figure 4c shows the spatial distribution of the units. In general, the study area is primarily composed of sandy texture materials (residual from the Botucatu Formation, sandy transported 1 and sandy colluvium), representing 65.75% (160 km2) of the RFB. The other unconsolidated materials (28%; 68 km2) present textures varying from sandy clayey to clayey sandy (residual from Itaqueri Formation, sandy transported 2, sandy transported 3, and alluvium and lateritic concretion) except for the residual from Serra Geral Formation (6%; 15 km2), which is a clayey texture unit.

Groundwater level position map

To classify the different terrain units, especially with respect to the position of the top of the saturated zone being below or above the top of the rock substrate, a map (Fig. 4d) was prepared with the classification of 94 units, where 39 units have the top of the saturated zone above the rock substrate, and therefore, the transport of the contaminants is not influenced by the components of the rock substrate; 55 units have the top of the saturated zone below the top of the rock substrate. Considering the distribution of the units for the two conditions, it is found that approximately 40% have the top of the saturated zone above the top of the rock substrate, whereby the rock substrate materials do not interfere with the contaminant flow between the base of the contaminant sources and the top of the saturated zone.

Engineering geological units

The combination of the unconsolidated material units with lithological types generates the geotechnical and geological units that were used as a natural division of the basin. Each unit is characterized with the attributes that constituted the groups of hierarchical level 3 (unconsolidated materials 1, 2, and 3 and rock substrates 1, 2, and 3), as shown in Table 1.

Association of geotechnical and geological with land-use type units

Based on the distribution of the engineering geological aspects and land-use types, the units were defined and delimited. Figure 5 shows the unit map with 29 different units, and Table 3 lists their main geological and geotechnical characteristics. Considering these conditions, the basin was divided into 94 territorial units, which provided better control of the possible contaminant sources and geological and geotechnical units. This analysis facilitated the contaminant source map reflecting an intermediate vulnerability index, which permitted better analysis of the final results.

Fig. 5
figure 5

Map of land unit and engineering geological conditions associated land-use units of the Ribeirão do Feijão Basin

Table 3 Summary of the basic geotechnical characteristics of the Engineering Geological units of the Ribeirão do Feijão Basin

Contaminant source map

The contaminant sources were mapped based on the different land-use types as well as management procedures such that the different land-use types have a temporal continuity. Aspects resulting from land uses characterized as sources of contaminants were also considered, including deposits of agricultural industry and animal wastes. There are also urbanized areas in the region with sewage systems containing septic tanks, such as hotels, fuel stations, restaurants, and poultry, cattle, pig, and horse farming. Considering all possible types of sources and respective attributes, the first step was to calculate the maximum and the minimum values that a unit could reach (0.0471762 and 0.0063378, respectively). The second step was to obtain the total intermediate index value for each unit considering that the total partial index from Table 1 (col. 8) is related to the contaminant sources. The third step was to obtain the final partial indices as percentages between the total intermediate index values of the contaminant sources and the maximum value that a unit could reach (0.0471762). In Fig. 6a, the final partial indices are displayed for each unit. The results show a maximum value of 0.6779608, a minimum value of 0.172163, a mean of 0.401426, and a standard deviation of 0.13902856.

Fig. 6
figure 6

a Partial index values for 94 units considering contaminant source attributes. b Partial index values for units considering natural attenuation for Condition 1. c For Condition 2

From the mean and standard deviation, four classes were defined (Table 4) to fit each unit. Figure 7a shows a map with the classification of the units in terms of the final partial indices; the map refers to the existing types of contaminant sources, which was considered in the analysis of the final vulnerability index.

Table 4 Adopted classes for contaminant sources, natural attenuation, groundwater level depth higher than 30 m, groundwater level above and below the rock substrate surface unit classification
Fig. 7
figure 7

a Map of the land unit classification according to final partial index related to contaminant sources, b Map of the land unit distribution related to natural attenuation for conditions 1 and 2

In terms of the magnitude of the contaminant sources in each unit, 25, 15, and 36 units are of Classes 1, 2, and 3, respectively; only 18 units are of Class 4, smaller magnitude. Thus, approximately 25% of the units have indices of the highest magnitude class in terms of contamination sources, which implies that these units should be better assessed in terms of the land uses that are under development in addition to management.

Natural attenuation map

The natural attenuation conditions were evaluated considering the groundwater level below and above the rock top to evaluate the natural attenuation differences for the different groups of attributes and classes.

Considering the position of the groundwater level below the top of the rock substrate (Condition 1) or above (Condition 2), the first step was to obtain the maximum and the minimum values that a unit could reach for Condition 1 (0.18659532 and 0.0264970, respectively) and Condition 2 (0.12292319 and 0.01774036, respectively). The second step was to calculate the total intermediate index value for each unit considering the total partial index from Table 1 (col. 8), which is related to the natural attenuation attributes for Conditions 1 and 2. The third step was to obtain the final partial indices as percentages between the total index values of the natural attenuation and the maximum value that a unit could reach (Condition 1 of 0.18659532 and Condition 2 of 0.12292319).

Figure 6b, c is related to the condition of the saturated zone below the top of the rock substrate (Condition 1) and the saturated zone above the rock substrate (Condition 2), respectively. The results obtained for Condition 1 show a mean of 0.38054675 and standard deviation of 0.07798768, and Condition 2 has a mean and is 0.51185113 and standard deviation of 0.09360793. These values allowed for the definition of the classes, as listed in Table 4.

Figure 7b shows a distribution map of the classification of the units for the two conditions, and the 25 units in Class 1, under both conditions, are highlighted; 15, 36, and 18 units are in Classes 2, 3, and 4, respectively. It is found that 40 units, or approximately 40% of the units, are in the classes with the highest contaminant load, which places the basin and natural spring on alert.

Final vulnerability index

The procedures to obtain the final vulnerability index for the individual units were developed considering three conditions: the first is a general condition for all 94 units, considering the depth target of the assessment greater than 30 m; the second one considers the top of the saturated zone above the rock substrate; and the third condition considers the position below the rock substrate. The first condition aims to evaluate all units for a deep groundwater level and reflects the conditions of intrinsic vulnerability of each unit, because it involves all considered attributes. The other two conditions are the real conditions in terms of the depth of the saturated zone, as shown in Fig. 4d. The final vulnerability index for the three conditions was acquired according to the procedure outlined in “Materials and methods.”

First condition

The 94 units have the final vulnerability indices (FVIs) ranging between 0.00773 and 0.683, according to Fig. 8a, with a mean value of 0.452888069 and standard deviation of 0.15847067. From these data, four classes were defined for vulnerability grading, according to Table 4. In this case, the considerable range between the minimum and maximum values is verified, reflecting the variability of the characteristics of the units.

Fig. 8
figure 8

Final vulnerability index values: a for units considering the groundwater level depth higher than 30 m, b for units with groundwater level above the upper rock substrate surface (39 units), c for units for groundwater level below the upper rock substrate surface (55 units)

Second condition

There are 39 units with the top of the saturated zone above the top of the rock substrate. In this situation, the following attributes were considered: point and non-point sources, subsurface water, rainfall, and unconsolidated materials 1, 2, and 3. The final vulnerability index (FVI) values ranged from 0.438 to 0.753 according to the data in Fig. 8b, with a mean value of 0.60612652 and a standard deviation of 0.08867895, and 4 classes were defined for vulnerability grading according to Table 4. The final vulnerability index values do not exhibit significant amplitudes; however, in this condition, the highest final vulnerability index (FVI) values were obtained.

Third condition

The third condition includes the units with the top of the saturated zone below the top of the rock substrate, resulting in 55 units. All attribute groups (hierarchical level 3) were considered. The final vulnerability index (FVI) values ranged from 0.4228 to 0.809, as shown in Fig. 8c, with a mean value of 0.5462516 and standard deviation of 0.1719165. Four classes were defined for vulnerability grading according to Table 4.

Figure 9 shows the classification map for the different units in terms of the magnitudes of the final vulnerability index values considering a target depth of more than 30 m and both the groundwater level above and below the top of the rock substrate.

Fig. 9
figure 9

Vulnerability gradation map with unit classification based on final vulnerability index classes for three conditions (for units considering the groundwater level depth higher than 30 m, for units with groundwater level above the upper rock substrate surface—39 units, for units for groundwater level below the upper rock substrate surface—55 units)

The amplitude of the resulting values for the three considered conditions is compatible with the general difference between the maximum and minimum values. For the first condition (target depth > 30 m), no unit is in Class 1, i.e., the highest degree of vulnerability. Approximately 60% of the units are in Classes 3 and 4, which have smaller magnitudes. These data, associated with the classification of contaminant sources in Fig. 6a, reflect the natural attenuation capacity of the geological materials as well as the transport behaviour, because none of the 25 units classified in the highest contaminant source class exhibit vulnerability in the highest class.

Furthermore, in units under Condition 2 (top of the saturated zone above the top of the rock substrate), there are no units classified at the lowest vulnerability level (Class 4), and only 6 units (17%) are in Class 1 (high), all of which are in Class 1 in terms of contaminant sources. Regarding Condition 3 (top of the saturated zone below the top of the rocky substrate), there are 11 units (25%) in the high vulnerability class (Class 1), and 18, 13, and 13 units are in Classes 2, 3, and 4, respectively.

According to the results of Conditions 2 and 3, 17 units (5, 18, 21, 24, 27, 31, 33, 40, 59, 70, 73, 84, 85, 86, 89, 90, and 91) fit the highest vulnerability class, whereas only units 18, 21, 24, 31, 40, 59, 73, and 85 are in Class 1 in terms of contaminant sources. Thus, the index reveals that there is a group of units (5, 27, 33, 70, 84, 86, 89, 90, and 91) that are in the highest class of vulnerability even with low contaminant sources. These units should receive special care, because if the magnitude of the contaminant sources increases, the contamination of the saturated zone waters will be unavoidable due to the low attenuation capacity. For natural attenuation, both saturated zone conditions were analysed, and 18 units (10, 38, 49, 51, 52, 53, 54, 55, 56, 57, 58, 63, 64, 67, 69, 81, 83, and 94) were classified in the highest natural attenuation category for both conditions (Class 1).

Conclusion

This proposal considers a group of information that could be considered large, although the necessary information can be obtained by fieldwork and the use of simple and low-cost laboratory equipment, especially if procedures such as engineering geological mapping are followed. The set of information can be processed using different resources, including not only geographic information systems but also other, more robust systems, such as MATLAB (MATrix LABoratory), Mathcad and other data processing software, such as Excel.

From the analysed group of attributes and classes, one can select attributes that permit specific analyses for the different contaminants. It is also possible to select attributes or groups for intermediate analyses, which generate interesting results, as done in this study for natural attenuation and contaminant sources.

The use of terrain units with practical significance and easy delimitation is fundamental to assess, verify, and validate the results of the final vulnerability indices as well as to adopt territorial and environmental planning guidelines. In this case, the combination of engineering geological units with the types of land uses permit an assessment of the influence of the land-use management practices and the contaminant source magnitudes.

The set of information also allows the terrain units to be analysed and classified in terms of the intrinsic characteristics and sources, thus allowing the adoption of specific guidelines for each unit in terms of territorial and environmental planning. Moreover, the adoption of terrain units permits the processing of information in spreadsheets, which facilitates the verification of variability for each unit. In addition, the dynamic analysis can be performed with changes in only attributes and classes; therefore, results can be obtained for each unit without having to prepare new maps and characterizations.

The difference between the maximum and minimum values allows the proposal to be applied in areas with a great diversity of natural and anthropogenic characteristics; framing terrain units in different categories of final vulnerability index values, there are wide possibilities of application for different areas with very broad natural and anthropogenic characteristics.

The proposed vulnerability index permits the classification of each unit of a basin, reflecting the different natural and anthropogenic conditions, and consequently with spatial detail compatible with the scale. In addition, when comparing the results for the water-level condition greater than 30 m with the other two conditions, the results vary, although this is compatible with the actual conditions of each unit, predominantly increasing the degree of vulnerability.

The proposal allows considering the final vulnerability index, i.e., the gradation, dynamically with the analysis of the temporal variation of the sources as a function of changes in land use and both territorial and environmental planning standards.