Keywords

1 Introduction

Groundwater is one of the world’s most essential and renewable natural resources. In many places globally, groundwater is extensively used for household, commercial, and agricultural purposes [1]. The reliance on groundwater is increasing all the time. These rising demands frequently lead to overexploitation, putting a considerable strain on this finite freshwater supply [2, 3]. The water table has been steadily declining as a result of groundwater exploitation [4].

Furthermore, owing to inappropriate irrigation patterns, dense population, and climatic change, groundwater problems have intensified, primarily in many parts of the world [5]. Ethiopia has had tremendous economic growth, including an average yearly growth rate of 8%; from 1998 to 2016, the pace of urbanization expanded by 10%, resulting in fast urbanization rates ranging from 6 to 20%. This expansion has resulted in uncontrolled migration from rural areas to major urban centers searching for work, significantly impacting current infrastructures, especially water supply [6]. Ethiopia is known as East Africa’s “water tower”. However, water is frequently unavailable when it is needed due to considerable temporal and geographical fluctuations in rainfall and complicated aquifers. Ethiopia’s groundwater contributes little to its agricultural growth [7].

Groundwater sources like bore wells, springs, and shallow wells provide domestic water to most Ethiopian towns and villages. Groundwater accounts for more than 70% of Ethiopia's water supply [8]. Despite this, just 34% of the population has access to safe drinking water. It implies that groundwater development and use must be in great demand in the country. But there is an absence of comprehensive knowledge about Ethiopia's groundwater potential. And there is a significant gap in estimating groundwater reserves in Ethiopia [8]. In Ethiopia, several complex variables hinder the efficient and effective use of groundwater. The most critical is the absence of accurate hydrological data, a lack of comprehension of aquifer architecture and features, and technical constraints [9].

As a result, it is vital to comprehend the structure of aquifers and explore cost-effective, comprehensive, and user-friendly methodologies and processes for groundwater detection, extraction, and monitoring [9]. Ethiopia’s aquifers are very intricate, low-storage aquifers that are segregated. The complex geological nature is to account for this. As a result, conventional groundwater investigation methods in Ethiopia will be time-consuming and costly [10]. Groundwater is underutilized in Ethiopia due to increased development and operating costs and a failure to understand groundwater dynamics [11]. In Ethiopia, groundwater is mainly used for domestic purposes. There have been a few instances of groundwater being utilized for farming and other non-domestic purposes. There has been minimal research on groundwater potentiality for agriculture and other non-domestic purposes [12]. There are very few regional studies on geospatial technologies in Ethiopia to delineate groundwater potential [10].

Various elements, such as geology, soil, geological structures, land use/land cover (LULC), topography, and rainfall, influence groundwater occurrence. It is easier to predict a region’s groundwater potential using various techniques considering all of these elements [13]. Before a groundwater borehole can be dug or drilled, several approaches such as geophysical, hydrogeological, geological, and spatial methods have been employed to analyze a suitable location for groundwater potential. Furthermore, several of these approaches are too costly and challenging to implement. However, spatial technology has aided in developing a method for groundwater investigations [14, 15]. Satellite images are utilized to create the following layers: lithology, landscapes, LULC, stream, slope, Normalized Difference Vegetation Index (NDVI), Topographic Wetness Index (TWI), geological structures, and so on, which serve as the basis for determining the potential groundwater zone [16].

The following researchers from across the world, including Ethiopia, employed remote sensing (RS) and Geographic Information System (GIS) technology to delineate prospective groundwater zones, including in Karur district, Tamil Nadu, India, by using AHP methods [17]; in the Leylia–Keynow watershed, southwest of Iran [18]; in Tirunelveli Taluk, South Tamil Nadu, India [19]; in Guna tana landscape, upper Blue Nile Basin, Ethiopia [20]; and Jemma River basin, Ethiopia [10]. Various scholars have used different models in potential groundwater mapping worldwide, such as the logistic regression model [21], multi-criteria decision evaluation [22, 23], random forest model [24, 25], evidential belief function [24], frequency ratio [26,27,28], weights of evidence model [29, 30], and drainage morphometric analysis [31, 32].

With this background, using remote sensing data, GIS technologies, and a probability-based frequency ratio technique, the current research aims to map the possible groundwater zones in the Iyenda river catchment, Konso area, Southern Ethiopia. The frequency ratio (FR) method is a data-driven framework and bivariate statistical technique for assigning rating (r) values to each class through the computation of spatial links among dependent variables (springs and wells) and the independent variable (thematic layers). In this present study, nine significant factors were taken into account, including lithology, lineament density, drainage density, Slope, NDVI, TWI, LULC, and rainfall. So far, no study has been conducted using this method and data sets in the present study area. The current study results may be helpful for researchers to carry out further research, planners, and administrators for locating suitable sites for drilling water wells.

2 Materials and Methods

2.1 Study Area

The present study area, the Iyenda river catchment, is situated in Southern Nations, Nationalities, and People (SNNPR) regional state, Rift Valley, Ethiopia. It is located geographically between 37° 15′ 28″ and 37° 35′ 15″ E longitude and 5° 18′ 28″ to 5° 39′ 29″ N latitude. The present study area covers a total area of 576 km2. Gidole, Gato, and Konso are the significant settlements. Figure 1 shows the location map of the present study area. The study area has rugged topography and elevation ranging between 1055 and 2576 m, and the terrain slope is between 0° and 51° (Fig. 2). The present study area’s northwest and southwest rift margins have a higher elevation, rugged topography, steep slope in the current location, and vice versa in the rift floor, which divided rift margins.

Fig. 1
figure 1

Study area

Fig. 2
figure 2

Topography and slope map

2.2 Data Collection and Preparation of Thematic Layers

The present study aims to use remote sensing data, GIS tools, and the frequency ratio approach to identify possible groundwater zones in Ethiopia’s Iyenda river watershed. The methodology flow chart for the present study is shown in Fig. 3. The multiple thematic layers were produced by utilizing several data sets, including remote sensing data, secondary information, and field observations, to carry out the present study. The ArcGIS 10.8 software’s many features were used to prepare thematic layers and conduct data analysis. The WGS-84, UTM-37 N projection system was used to project all of the data sets. The data and sources utilized in this investigation are listed in Table 1.

Fig. 3
figure 3

Methodology flow chart

Table 1 Input data and its sources

2.3 Thematic Layers Preparation

The following thematic layers were used, such as lithology, lineament density, slope, TWI, soil, rainfall, drainage density, LULC, and NDVI in the present study. Lithology is recognized as one of the most significant markers of hydrogeological characteristics since it affects aquifer materials’ permeability and porosity [33]. Initially, the lithological contact of the present study area was demarcated using a Geological map prepared by the Ethiopian Geological Survey. Hornblende gneiss, basalt, and alluvial deposits are the central rock units present in the study area. The above-mentioned rock units were further classified based on their weathering nature. And the same was identified using false-color composite of the Landsat-8 OLI satellite image. The rift margins comprise the differently weathered basalts and hornblende gneiss, and alluvial deposits cover the rift floor (Fig. 4b).

Fig. 4
figure 4

Lineament density and lithology map

In hard rock fractured aquifer areas, lineaments regulate groundwater ingress and serve as a transmission medium [34]. The present research area lineaments were identified and extracted from a Landsat-8 OLI satellite image through a visual interpretation approach based on the image's tonal contrast. Further, the lineaments were extracted in the satellite data, where the linear drainages with moisture content appeared as black tonal contrast. The lineament density (km/km2) map was created with ArcGIS software’s line density tool, and the same were classified into five classes natural break method viz (0–0.25, 0.26–0.51, 0.52–0.77, 0.78–1.03, and 1.04–1.29). The present study area's lineament density map is shown in Fig. 4a.

In general, water moves slowly over moderate slopes, allowing for more significant infiltration into the substrate. The present study area’s slope map research region was created using ArcGIS 9.3’s Spatial Analysis tools and a DEM. On the other hand, the steep slope increased run-off and decreased infiltration into the surface [35]. SRTM-DEM with 30 m spatial resolution data was used to prepare the slope layer of the present study area. The minimum and maximum slope of the current study area are 00 and 540, respectively. Using ArcGIS software’s natural break technique, the slope layer has been further categorized into five groups (0–4.66, 4.67–9.74, 9.75–16.03, 16.04–23.94, and 23.95–51.74). Figure 2b shows the slope map of the present study area, and it has been observed that the rift margins have a steep slope compared to the rift floor.

Drainage density, which negatively correlates to soil and rock permeability, is an important indicator in evaluating the potential groundwater zones. High drainage density levels cause run-off and thus indicate a limited groundwater occurrence. In contrast, if lineaments or fractures control the drainage system of any area, then high drainage density areas are potential for groundwater occurrences [28]. The present study area's drainage system is controlled by structural features [32]. SRTM-DEM data and Archydro toolset of ArcGIS software were used to automatically extract the present study area’s drainage, as procedures suggested by Maidment [36]. The drainage density (km/km2) map was created with ArcGIS software's line density tool, and the same was classified into five classes natural break method viz (0–0.42, 0.42–0.84, 0.85–1.26, 1.27–1.68, and 1.69–2.10). The drainage density map for the current study area is given in Fig. 5a.

Fig. 5
figure 5

Drainage density and LULC map

The LULC of a particular region has a significant impact on groundwater occurrence [37]. Surface run-off and flooding are more in the built-up areas, primarily made up of impervious surfaces. On the other hand, agricultural areas are less prone to flood due to the positive relationship between vegetation density and infiltration capacity [38]. The LULC map in this study was created using a false-color composite of the Landsat-8 (OLI) image. Based on the fieldwork, the LULC of the current study area was classified through visual interpretation.

The agricultural field was identified based on the FCC satellite image's rectangle shape, color, and texture [39]. The built-up land has pale bluish-white, fine texture, and uniform shape and size [40]. Forest areas were dark reddish with a fine to medium texture and irregular and varied in size. Barren land was generally identified by its light to dark blue tone and gritty texture [41]. The primary LULC types in the study area are farm and bushland, forest, shrubland, and barren land. The forest is covered in the higher elevation in the rift margins where the rainfall is high, and shrubland is also covered significantly in the rift margins. The present study area's LULC map is shown in Fig. 5b.

The amount of rain that occurs significantly affects groundwater recharge in any region [28]. In the present study, rainfall data in the grid format for the year 2000–2020 was downloaded from the PERSIANN (Precipitation Estimation from Remotely Sensed Information using Artificial Neural Networks) developed by the Center for Hydrometeorology and Remote Sensing (CHRS) at the University of California, Irvine (UCI). The weblink for downloading the rainfall data is https://chrsdata.eng.uci.edu/. For twenty years (2001–2020), average rainfall data was downloaded as a grid format with a spatial resolution of 4 km × 4 km. The minimum and maximum rainfall of the present study area are 521 mm and 623 mm, respectively. The same was classified into five categories, as shown in Fig. 6a.

Fig. 6
figure 6

Rainfall and soil map

The type of soil has a vital role in determining possible groundwater zones. Soil mapping often entails identifying soil types with distinguishable features. The soil data for the current research area was retrieved from the Food and Agriculture Organization [42]. The present study area covered clay (76%) and loam (24%) majorly. A soil map is shown in Fig. 6b.

The examination of the normalized difference vegetation index (NDVI) is considered as an approximate estimate of the quantity of vegetation present and the potential for groundwater across a given area [43]. The NDVI of the present location was calculated using the following Eq. (1).

$$ {\text{NDVI}} = \left( {{\text{Near infrared}} - {\text{Red}}} \right)/\left( {{\text{Red}} + {\text{Near-infrared}}} \right) $$
(1)

In this study, the NDVI was calculated using ArcGIS software's raster calculator. −0.15 and 0.44 are the minimum and maximum NDVI values further, classified into three categories. The maximum values were observed where forest and bushland have occurred. In contrast, negative or minimum values were found around the settlement and barren land. Figure 7a shows the NDVI map.

Fig. 7
figure 7

NDVI and TWI map

The topographic wetness index (TWI) evaluates the hydrologic process’ topographic control. TWI is a metric that assesses the potential of wetlands. Wetland potential is shown in areas with a high positive TWI value. The zones show the erosion potential with a low TWI value [44]. The SRTM-DEM and TWI tool in QGIS 3.18 software were used to prepare the TWI for the present study. The minimum and maximum TWI values for the current study are 3.59 and 21.84, respectively. The same was classified into three categories Fig. 7b. The rift floor and drainages have medium and high TWI values, respectively.

Thirty-four well locations were located using GPS during fieldwork. Among these, 15 are springs and the remaining 12 and 7 are hand pumps and deep boreholes, respectively. Figure 8 shows the spatial distribution of springs, deep boreholes, and springs in the present study area. Seventy per cent (24 wells) of the 34 sample locations were chosen as training data sets for utilizing the FR model to map possible groundwater zoning. The remaining 30% of the samples (10 wells) were utilized for model validation.

Fig. 8
figure 8

Well locations map

2.4 Frequency Ratio Model and Identification of Potential Groundwater Zone

The frequency ratio (FR) method is a bivariate statistical technique widely used to establish the probability relationships among the dependent and independent variables in geospatial assessments [45]. The method examines the statistical connection between borehole locations and the factors that influence the occurrence of groundwater. In practice, the FR was computed using the following Eq. (2).

$$ {\text{FR}} = \left( {A/B} \right)/\left( {C/D} \right) $$
(2)

where A—the number of training wells/springs in the particular sub-thematic feature, B—the total number of wells/springs in the study area, C—the number of pixels in the sub-thematic class, and D—the total number of sub-thematic class’s pixels in the study area.

Table 2 shows the entire weight determination computation for individual parameters. The probability of groundwater in a specific pixel may be calculated by adding the pixel values in the ArcGIS software as per Eq. (3):

$$ {\text{GWPI}} = \sum {{\text{Li}}_{{{\text{fr}}}} + {\text{Ld}}_{{{\text{fr}}}} + {\text{Sl}}_{{{\text{fr}}}} + {\text{Dd}}_{{{\text{fr}}}} + {\text{Lulc}}_{{{\text{fr}}}} + {\text{NDVI}}_{{{\text{fr}}}} + {\text{Twi}}_{{{\text{fr}}}} + {\text{Rf}}_{{{\text{fr}}}} + {\text{So}}_{{{\text{fr}}}} } $$
(3)

where GWPI—groundwater potential index, fr—frequency ratio, Li—lithology, Ld—lineament density, Sl—slope, Dd—drainage density, Lulc—land use/land cover, NDVI—normalized difference vegetation index, Twi—topographic wetness index, Rf—rainfall, and So—soil.

Table 2 Frequency ratio calculation for potential groundwater mapping

3 Results and Discussions

3.1 Analysis of the Relationship Between Frequency Ratio and Thematic Layers

Frequency ratio values were computed for each selected thematic layer based on the association with observed well and spring locations. The frequency ratio between lithology and observed well and spring locations shows that highly weathered Hornblende gneiss and weathered basalt have a high-frequency value of 1.88 and 1.80 respectively, compared to other lithological features. The high-frequency ratio value of 1.24 was found where the lineament density value is 0.52–0.77 km/km2. The frequency value is decreasing when the slope angle is increasing. The association between high-frequency value and drainage density indicates that high-frequency value is concentrated where medium and high drainage density occurs in the present study area. The frequency value is high where agricultural land is present and zero in the barren land LULC classes. The high-frequency ratio value of 2.44 is observed in the high NDVI and vice versa in the low NDVI. The high-frequency ratio value of 2.28 is found where the high topographic wetness index (10.54–21.84) is observed. The inverse relationship was observed between rainfall and the frequency ratio values. Table 2 displays the frequency ratio value of various thematic layers and categories.

3.2 Groundwater Potential Assessment

The frequency ratio values for the present study ranged between 5.34 and 14.77, and the same was classified into three potential groundwater categories such as high (5.34–8.37), medium (8.38–10.11), and high (10.12–14.77). 39.23% of the present study covers the low groundwater potential zones, and 38.33% and 22.44% are the medium and high groundwater potential zones, respectively. Figure 9 shows the groundwater potential map of the present study area. It is observed from the groundwater potential map that the majority of the groundwater high potential zones have occurred in the central part, and that is the rift floor and Gidole area where the plateau is present with high rainfall. Generally, the high and medium groundwater potential zones occurred where lithologically the study area covers highly weathered hornblende gneiss, weathered basalt, and alluvial deposits. And further, the results show that high and medium groundwater zones were found where medium lineament density, medium to high drainage density, less slope, high NDVI, rainfall and TWI, and loam soil occur in the present study area. In contrast, the low groundwater potential zones occurred where barren rock, high slope, less rainfall, unweathered rock, less TWI, and NDVI were found in the present study area.

Fig. 9
figure 9

Groundwater potential map

3.3 Model Validation

Validation is recognized as a crucial phase in the modeling process because of its scientific significance. The receiver operating characteristics (ROC) curve technique is a widely accepted method to validate the groundwater potential mapping’s results [46]. The area under curve (AUC) values range from 0.5 to 1.0, with a number around 1.0 suggesting the highest level of precision and a value around 0.5 showing the model’s unreliability [24]. It has been categorized into the following categories based on the link between AUC value and prediction accuracy, excellent (0.9–1.0), very good (8–0.9), good (0.7–0.8), average (0.6–0.7), and poor (0.5–0.6) [24].

The success and prediction rate curves in this investigation were calculated using the AUC technique. The success rate was assessed at 0.735, utilizing the AUC values obtained using a training set of 24 springs, hand pumps, and bore wells to measure the success rate curve (Fig. 10). However, because the success rate is estimated using the same training data set used to create the groundwater potential zone model, it is not an acceptable approach for assessing a model's prediction capacity [25]. Consequently, the prediction rate curve was calculated using ten well locations not included in the frequency ratio model used to produce groundwater potential zone mapping. The AUC of the prediction rate curve is 0.732. (Fig. 11). The present AUC analysis has proven that the present model had excellent results.

Fig. 10
figure 10

Success rate curve

Fig. 11
figure 11

Predication rate curve

4 Conclusions

Groundwater potential zone mapping is a prominent field of study, particularly in dry and semi-arid environments. Globally, many approaches for assessing regional groundwater potential have been used. Groundwater potential index maps were created in the present research utilizing the frequency ratio technique with remote sensing data, primary and secondary data, and GIS tools. Nine significant thematic layers regulating the groundwater recharge were mapped. The frequency ratio method was adopted for allocating rating (r) values to each class by computing spatial connections between dependent variables (springs and wells) and the independent variable (thematic layers). The weighted linear combination approach was used to combine the nine theme layers to determine possible groundwater zones. According to the study results, 39.23% of the current study includes low groundwater potential zones, 38.33% covers medium groundwater potential zones, and 22.44% covers high groundwater potential zones. The output of the present study was validated through the AUC method. The success and prediction rate AUC values suggest that the technique employed in this investigation was highly successful. Finally, the findings of this study demonstrated that the FR model might indeed be utilized successfully in the groundwater potentiality mapping.