Introduction

Groundwater is one of the most important and valuable parts of the natural water cycle. It is stored in the pores of soil and rock below the water table (Fitts, 2013). One third of all freshwater withdrawals around the world come from groundwater (Taylor et al., 2013). Groundwater is the only source of water for about 2.5 billion people around the world (Unesco, 2020). It is used more and more in irrigation, factories, cities, and homes in rural areas (Todd, 2005). In India, China, and some African countries, where there are a lot of people and a lot of factories, there is not enough water to go around. This is because the demand for water in homes, farms, and factories has grown (Das et al., 2017; Manap et al., 2013). In India as the population is growing, they need to the farmland for agriculture purposes so deforestation and overexploitation of groundwater has occurred across the country, resulting in a subsequent decline in groundwater levels (Saranya & Saravanan, 2020). India’s agricultural and drinking water security have become more reliant on groundwater. According to Ministry of (Jal-Shakti, 2020) the country’s total annual groundwater recharge was calculated to be 436.15 billion cubic metres (BCM). It is estimated that 393 BCM of ground water may be extracted each year, while 432 BCM is replenished each year. The total annual groundwater extraction is 249 billion cubic metres, with 221 billion cubic metres (89%) used for irrigation and 25 billion cubic metres (11%) used for domestic and industrial purposes. According to the most current survey, 61.6% of the country’s groundwater reserves have been used up (CGWB, 2021). Ground water provides the source for approximately 62% of all irrigation, 85% of all water supply in rural areas, and 45% of all water supply in urban areas. As of the year 2001, India had a per capita water availability of 1816 m3, but experts predict that this number will fall to 1140 m3 by the year 2050 (Narendra Modi, n.d.; Sikdar, 2018).

Aquifers holding subsurface groundwater are extremely confined and spatially variable (Satpathy & Kanungo, 1976). Knowing the present reputation of groundwater potential in any place requires knowing the prospective zone of groundwater where artificial recharge techniques can be employed to increase the amount of recharge and ensure a sufficient amount of recharge (Razandi et al., 2015). As a result, it is very important to learn how to approach groundwater management and how to predict groundwater capability on the national, regional, and local level (le Page et al., 2012; Vaux, 2011). Therefore, there is a significant amount of interest among scientists in the process of generating a map of all of the potential groundwater locations (Arabameri et al., 2018; Das & Pardeshi, 2018).

Although there are various reliable geophysical and geological methods for detecting aquifer locations and groundwater levels, employing these techniques to assess groundwater resource availability is both time-consuming and financially prohibitive (Fetter, 2001). Recently, there has been a lot of focus placed on integrating remote sensing (RS) and the geographical information system (GIS) because it is a method of exploration that is not only efficient but also very inexpensive and rapid (Arefin, 2020a, 2020b; Chowdhury et al., 2008; Gassama Jallow et al., 2020). Because satellite data is widely available and is used in mapping and topographical processes, it is easy to put together information that can be used to evaluate places with potential groundwater capability (Magesh et al., 2011; Mallick et al., 2015).

The number of parameters that determine groundwater potentiality varies with the different research studies. Many studies focused on parameters such as Geology, Geomorphology, Slope, Soil texture, Drainage density, Lineament density, Rainfall, Land use & Land cover, Topographic Wetness Index etc. (Ganapuram et al., 2009; Israil et al., 2006; Magesh et al., 2012; Mukherjee et al., 2012; Owolabi et al., 2020) using the number of methods utilizing the thematic maps, to identify possible regions for groundwater recharge such as frequency ratio, (Manap et al., 2014; Pourtaghi & Pourghasemi, 2014; Moghaddam et al., 2015; Naghibi et al., 2016; Al-Abadi et al., 2016; Guru et al., 2017), Logistic regression model techniques (Ozdemir, 2011), evidential belief function (EBF) (Park et al., 2014; Pourghasemi & Beheshtirad, 2015), certainty factor (CF) (Razandi et al., 2015), analytical hierarchy process (AHP) (Arefin, 2020a, 2020b; Kumar & Krishna, 2018; Maity et al., 2022; Moharir et al., 2023; Muralitharan & Palanivel, 2015; Murmu et al., 2019; Saranya & Saravanan, 2020) and many other mathematical and geostatistical models. The Analytical Hierarchical Process (AHP) was shown to be a method that is effective, dependable, and practicable for identifying groundwater potential zones.

The multicollinear test is very important test that is applied to the thematic layers to improve the model’s ability to make accurate predictions. Other RS/ GIS-based models, have also used this method such as landslide susceptibility, assessment of land subsidence, soil erosion susceptibility (Abdulkadir et al., 2019; Blachowski, 2016; Mallick et al., 2021; Rossi et al., 2021), and other models. Map removal sensitivity analysis is a good method to know the most significant parameters in the model output (Mukherjee & Singh, 2020; Thapa et al., 2018). Many researches have applied this technique for groundwater and aquifer vulnerability assessments, distinct from the AHP technique (Kumar et al., 2022; Oke, 2020; Sadat-Noori & Ebrahimi, 2016). Receiver operating characteristic (ROC) curve, a cross-validation method, is used by comparing the model output with the depth of water table of the observation wells. Many researchers have used this cross-validation method to identify the groundwater potential zones (Dar et al., 2021; Hasanuzzaman et al., 2022), Landslide susceptibility (Chen et al., 2018; Gorsevski et al., 2006) and also for other analysis. For this study, the ROC curve was used for validation of groundwater potential map.

The Prayagraj district is currently facing a significant issue of water scarcity, particularly in the Trans Yamuna region. This problem has arisen due to the overexploitation of groundwater resources and the geological characteristics of the dissected plateau, which hinder the infiltration of water. (Ministry of Environment Forest and Climate Change Notification, 2018). So, it is necessary to investigate the groundwater condition in the Trans Yamuna region for sustainable management. Numerous studies have identified the groundwater potential zones in the entire Prayagraj district as well as in various parts of the district (Shukla, 2014; Syed Wamiq Ali et al., 2015; Mukesh Kumar et al., 2018; Tiwari, 2018) using different methods and multiple parameters. To delineate the groundwater potential zones in this study region, remote sensing and GIS techniques were combined with the AHP method using multiple parameters, including some parameters, multicollinearity between parameters, and map removal sensitivity analysis that had never been performed in this study region.

Hence, AHP, GIS, and RS approaches were applied to combine hydrogeological, geomorphological, and climatological data in order to identify distinct groundwater potential zones in the Trans Yamuna region within the Prayagraj district. Ten groundwater recharge affected criteria such as geology, geomorphology, soil texture, lineament density, drainage density, regional slope, rainfall distribution and land-use pattern, Topographic Wetness Index and NDVI have been investigated to make groundwater potential maps for future planning, governance, use, and conservation of groundwater resources in the study area.

Materials and methods

Study area

Prayagraj is one of the oldest and most important cities in India from a religious point of view. It is located at the confluence of the Ganga, the Yamuna, and the unseen Sarasvati. All of which are considered sacred rivers. The district is made up of three primary physical divisions: the Daob, the Trans Yamuna (or Yamunapar) tract, and the Trans Ganga (or Gangapar) plain. The Daob is the smallest of these three divisions. Our focused study area is the Trans-Yamuna tract and its geographical extent is from 24°48′10″N to 25°26′20″N latitude and 81°30′10″E to 82°20′26″E longitude. The study region consists tropical climate, which is typified by lengthy, hot summers, moderately pleasant monsoons, and chilly winters. The average amount of rain in the study region is 976 mm and mean annual temperature is 25.8 °C of different seasons of a year but the amount varies from year to year. It covers a total area of 2990.957 km2 and has a total population of 1931247 people (Census, 2011; https://www.census2011.co.in/census/district/546-allahabad.html). The study area contains four tahsils named, Bara, Karchana, Koraon and Meja. Meja is the largest Tahsil in terms of land area, but Karchana is the largest in terms of population. In the study area, the Denudational hills (exposed rock surfaces) are quite prominent, particularly in Shankargarh, Koraon, and Meja. In the Trans Yamuna region, the thickness of alluvium is up to 75 m. Data from exploratory drilling in the district shows that fractures are found in the Trans Yamuna region. In Trans-Yamuna region mainly the rivers Yamuna, Tons and Belan River flows. The Tons River tributary of the Yamuna and the Belan River flows from Kaimur Hill and joins the Tons River at Chakghat in MP. (Pandey, 2009; District Survey Report-Allahabad (In-Situ Rock), 2018; Saha & Agrawal, 2020). The study area map is shown in the (Fig. 1).

Fig. 1
figure 1

Map of the study area (Trans Yamuna Region, Prayagraj, Uttar Pradesh, India)

Steps for identification of groundwater potential zones

There are five main steps for groundwater potential mapping that are applied for this research, shown in (Fig. 2) as a flow chart: (1) Choosing the controlling parameters for Groundwater potential zone (2) Creation of Geospatial database systems (conditioning factors related to groundwater potential zones) (3) Perform the multicollinearity test to check the collinearity among the all-selected parameters (4) Apply the AHP method, and (5) Validation of the ground water potential map.

Fig. 2
figure 2

Flow chart of methodology used in the present study

Selection of controlling Parameters for Groundwater potential zone

The variety of conditioning parameters is determined by the availability of data in the study area that are likely to influence the groundwater potential zones. As conditioning elements, several layers, including geology, geomorphology, soil, slope angle, NDVI, LULC, topographic wetness index (TWI), precipitation, drainage density, and lineament density were selected because the presence and movement of groundwater are directly or indirectly controlled by these factors, as explained below.

Where groundwater is located and how it is distributed are heavily influenced by the geology and hydrology of a place. The amount of water, a geological formation and aquifer layer can store, give out, or take in is mostly determined by the porosity, permeability, and transmissivity of the aquifer, its interconnectivity, the types of exposed rock and subsurface rock such as sedimentary, igneous and metamorphic rocks in which sedimentary rocks are good for groundwater recharge because it has high porosity and permeability (Arefin, 2020a, 2020b), and the proximity to surface water sources (Jaiswal et al., 2003). Geomorphological features of a region help us understand how different landforms and topographic information are spread out in space, such as how groundwater moves below the surface and it help to understand lithological and structural control on groundwater potential zones (Gnanachandrasamy et al., 2018). Rainfall feeds surface water bodies and recharges groundwater through percolation. Rainfall also affects groundwater levels; however, percolation relies on strength and duration (Lakshmi et al., 2018). LULC decides the water infiltration to Groundwater recharge such as in built-up environments, where the infiltration rate is frequently lower since there are fewer permeable surfaces present. Woods and agricultural regions, on the other hand, allow more water to infiltrate because of the vegetative cover’s capacity to hold onto water and encourage infiltration (Scanlon et al., 2005). Slope regulates the vertical penetration of water, particularly rainwater, and surface runoff (Dhinsa et al., 2022). Drainage density is the number of channels in a landscape and is the opposite of permeability (Mageshkumar et al., 2019). The soil is the topmost layer of the earth. It is where water seeps into the ground. The soil’s ability to hold water and how well it lets water pass through it determine the rate of infiltration (Allafta et al., 2021). The Lineaments density indicates the presence of more permeable zones which is good for groundwater recharge (Sonwane & Ullah Usmani, 2021). The topographic wetness index (TWI) is used to measure how topography impacts hydrological processes and illustrates how topography may cause groundwater seepage and it is also regarded as a secondary topographical characteristic when assessing the wetness conditions of an area (Arulbalaji et al., 2019). NDVI is a widely used as an indicator for monitoring the dynamic of vegetation at regional and global scales which signifies the groundwater potential zone (Senapati & Das, 2022). The integration of these factors with their potential weights is calculated through weighted overlay analysis in ArcGIS software.

Geospatial database preparation

The base map of the Trans Yamuna Region was created using Survey of India toposheets (No. 63G/11, 12, 15, 16; 63H/13; 63 K/3, 4, 7, 8; 63L/1, 5) at a scale of 1:50,000. Thereafter, all toposheets is merged and study area boundary was extracted by digitizing in Arc GIS software. The satellite images of study area, SRTM DEM 30 m Resolution and LANDSAT 8 OLI/ TIRS C2 L2 Image, freely downloaded by https://earthexplorer.usgs.gov/ (Table 1). By using the SRTM DEM data the Slope, Drainage density, Topographic wetness index and Lineament density, Land use & Land cover, Normalized Differential Vegetation Index map were prepared using the LANDSAT 8 multiband data with the help of ArcGIS software of the study area. The Geology and Geomorphology map was prepared using the geological and geomorphological maps obtained from the Geological Survey of India on a scale of 1:250000 and 1:100000, respectively. The previous soil map that was created by the Indian Council of Agriculture Research’s National Bureau of Soil Survey and Land Use Planning Office (NBSS and LUP) served as the basis for the generation of a new soil map at a scale of 1:50000 (ICAR). By georeferencing the maps with Georeferencing tools and digitising them with the Arc Scan tool in the ArcGIS software, these thematic layers were produced. The Indian Meteorological Department (IMD) Pune provided the rainfall data for the years from 1990 to 2020 that was used in this study. The rainfall data was analyzed to produce a thematic layer using a geostatistical kriging interpolation method in a geographic information system. The false colour composite (FCC) image of LANDSAT data was visually interpreted with image interpretation keys along with shape, size, tone, texture, pattern, and affiliation to determine lineaments for the preparation of lineaments density maps and land use & land cover maps using the line density analysis tool and the Supervised classification tool respectively in ArcGIS software. The slope map is produced by the surface function from spatial analyst tool in the Arc GIS software using the projected Digital Elevation Model. To produce the Drainage density map, the drainage network was generated using the function of Hydrology module of the Spatial Analyst tool such as Fill, Flow direction, Flow accumulation, Stream Order, and Stream to Feature. These functions were used to generate a drainage network map, which was then used to generate the drainage density map. The raster calculator tool was used for numeric calculations to generate the drainage density map. The topographic wetness index was developed using the TOPMODEL index (Beven, 1997; Guisan et al., 1999). These all-thematic layers are geo-coded with the Universal Transverse Mercator (UTM) projection, a spherical coordinate system, and a World Geodetic System (WGS) 84, Zone 44 North. After that, each thematic layer has been changed to 30 m resolution raster format. In a GIS environment, all thematic layers are integrated using the weighted overlay method to generate one map of the groundwater potential zones of the study area. For validation the water level depth of well data is taken from CGWB department.

Table 1 Information about the parameters, datatype and dataset used in the present study

Multicollinear test

When one independent parameter has a strong association with another parameter, this is known as multicollinearity and presents a challenge in a multi regression model. This makes it difficult to evaluate the quality of the model (Mukherjee & Singh, 2020). Prior to the selection of parameters for delineating the groundwater potential zone, the multicollinearity test is an absolutely necessary step. If a given parameter exhibits a high correlation to any other parameter, this denotes that any change in the value of one parameter will cause a corresponding change in the value of the other parameter, which in turn will cause variations in the model’s results. If a given parameter does not exhibit a high correlation to any other parameter, the model’s results will remain consistent regardless of the value of any given parameter. This implies that non-trivially accurate predictions can be made for any input parameter given a set of other parameters (such as slope and slope aspects) (Saha, 2017).

To determine the accuracy of this validation, a linear regression analysis is performed, with one parameter serving as the dependent variable and the remaining parameters serving as the independent variables to calculate the tolerance and variance inflation factor (VIF). Tolerance and VIF values can be calculated for each parameter by repeating the preceding steps. The following equations is used to evaluate the tolerance and variance inflation factor (VIF).

$$\mathrm{Tolerance}=1-{R}^{2}$$
(1)
$$\mathrm{VIF}=\frac{1}{Tolerance}$$
(2)

Equations (1, 2) are going to be used, one at a time, for the many other parameters. VIF values between 1 and 5 are generally considered acceptable, suggesting low to moderate multicollinearity and a tolerance value close to 1 suggests no or little multicollinearity. Tolerance values less than 0.2 or 0.1 indicate high multicollinearity (Al-Fugara et al., 2020). 300 random points were selected from 10 groundwater-determining theme layers for the multicollinearity test. According to this conclusion, there is no collinearity between any of the ten factors that determine the potential of groundwater (Table 2).

Table 2 Statistics on the collinearity of the factors

Derivation of normalized weights of thematic layers and evaluation of the groundwater potential zones by AHP method

The Analytical Hierarchy Process (AHP) is the most effective for modelling the resolution of multiple criteria (Saaty & Vargas, 1980). The relative significance of each individual layer can be determined by utilising Saaty’s scale, which ranges from 1 to 9 (Table 3), wherein a rating of 1 denotes identical significance among the two parameters, and a rating of 9 suggests the outrageous importance of 1 parameter contrasted with the other one (Saaty, 1990).

Table 3 Scale preferred by Saaty for comparing two AHP parameters

According to (Alesheikh & Hosseinali, 2008) following steps comprise for the Analytical Hierarchy process method.

  1. 1.

    Making pairwise matrix on the basis of mutual importance and did normalization.

    $$\mathrm{Z}=\begin{array}{ccc}\begin{array}{c}{ x}_{11}\\ { x}_{21}\\ \begin{array}{c} \vdots \\ { x}_{n1}\end{array}\end{array}& \begin{array}{c},\\ ,\\ \begin{array}{c}\vdots \\ ,\end{array}\end{array}& \begin{array}{cc}\begin{array}{c}{x}_{{12 }^{,} \cdots }\\ {x}_{{22 }^{,} \cdots }\\ \begin{array}{c}\ddots \\ {x}_{{n2 }^{,} \cdots }\end{array}\end{array}& \begin{array}{c}{x}_{\begin{array}{c}n1 \\ \end{array}},\\ { x}_{n2 ,}\\ \begin{array}{c}\vdots \\ {x}_{nn} ,\end{array}\end{array}\end{array}\end{array}$$
    $$\mathrm{Normalization }\left(\mathrm{W}\right)=\frac{1}{n}\Sigma \frac{{x}_{ab}}{{\sum }_{1}^{n}{x}_{ab}}$$
    (3)

    Here a, b = 1 ,2 ……., n

    For pairwise matrix construction, the values assigned to a and b according to the Table 3 are determined by the Saaty’s scale.

  2. 2.

    The conventional eigenvalue method in the analytic hierarchy process allows for the measurement of the consistency of the author's preferences as represented in the comparison matrix and provides a way to evaluate the coherence of their judgments (Antonio Alonso & Teresa Lamata, 2006). The principal eigenvalue value (\({\lambda }_{max}\)) was analysed using eigen vector method.

    $$\mathrm{ZW}=\left[\begin{array}{c}\begin{array}{c}\begin{array}{c}{ x}_{11} , {x}_{{12 }^{,} \cdots } {x}_{\begin{array}{c}1n \\ \end{array}},\\ { x}_{21} , {x}_{{22 }^{,} \cdots }{ x}_{2n ,}\end{array}\\ \end{array}\\ \\ \vdots \vdots \ddots \vdots \\ { x}_{n1} , {x}_{{n2 }^{,} \cdots } {x}_{nn} ,\\ \end{array}\right]\mathrm{x}\left[\begin{array}{c}{ w}_{1}\\ { w}_{1}\\ \begin{array}{c}\vdots \\ { w}_{j}\end{array}\end{array}\right]$$

    Here W designate the normalized value

    $${\lambda }_{max}=\frac{1}{n}\Sigma \frac{\Sigma ZW}{{w}_{j}}$$
    (4)

    Here \(\Sigma ZW\) = Addition of Row, \({w}_{j}=\) Each value of Normalized weight, n = Number of factors.

    The groundwater potential mapping approach is based on the fact that the authors expertise determines the relative importance of each of the numerous thematic layers and their teachings. Thus, each element that impacts the quantity of groundwater in a particular region is assigned the appropriate rank and weight according to Table 4.

  3. 3.

    Consistency of the model

    1. o

      3(a). The consistency index indicates the degree to which the matrix is inconsistent and the degree to which our matrix is inconsistent in comparison to a consistent matrix. In the case of a fully consistent matrix, the average of the remaining eigenvalues is zero, resulting in a Consistency Index (CI) value of 0 (Pant et al., 2022)

      Consistency index of Z express by the formula of (Saaty, 2008)

      $$\mathrm{C}=\frac{{\lambda }_{\mathrm{max}}-n}{n-1}$$
      (5)

      Here, \({\lambda }_{\mathrm{max}}\) = biggest eigenvalue of the matrix and may be without problems decided from the mentioned matrix, and n = number of groundwater conditioning aspects.

    2. o

      3(b). The consistency ratio measures consistency index relative to a random index value. The random index represents the average inconsistency estimated in a random comparison matrix of the same size. (Saaty, 1980). Consistency ratio (CR) express by the formula, (Saaty & Vargas, 1980)

      $$\mathrm{CR}=\frac{(Consistency\;index)}{(\mathrm{Random\;consistency\;index})}$$
      (6)

      The random consistency index is given according to the Table 5. If the CR occur < 0.1, then the given weight for the matrix is reliable if the CR occur > 0.1 then need to check again the assigned weight for the matrix (Saaty, 1990; Sonwane & Ullah Usmani, 2021).

      In this work, a lot of existing literature (Agarwal et al., 2013; Bera et al., 2020; Castillo et al., 2022; Ifediegwu, 2022; Kaliraj et al., 2014; Kumar & Krishna, 2018; Mahato et al., 2021; Narayanamurthi & Ramasamy, 2022; Patra et al., 2018; Pinto et al., 2017; Rajasekhar et al., 2019; Rajesh et al., 2021) was carefully looked at before preparing the pairwise matrix and analysis of the consistency Tables 6 and 7. This was done so that we could get a good idea of how variables rank in different environments and in different places.

      Groundwater potential index is a non-dimensional number that can be used to predict the groundwater potential zones in a certain region (Zhang et al., 2021). Arc GIS weighted overlay function was used for this evaluation. The normalized weight of the aspect was multiplied by the weights of classes of every aspect after that summed the multiplication of all of the attributes and obtained the groundwater potential zone (GPZ) which can be evaluated according to (Muralitharan & Palanivel, 2015; Biju et al., 2018; Doke et al., 2021; Maity et al., 2022) using following equation.

      $$\mathbf{G}\mathbf{P}\mathbf{Z}=\mathrm{Total\;of\;the\;aspects\;normalised\;weight}\left(\mathrm{W}\right)\times \mathrm{weight\;of\;each\;aspects\;class}\left(\mathrm{I}\right)\dots .$$
      (7)
Table 4 Weight assignment to different parameters and reason behind that
Table 5 Satty’s Random inconsistency (RI) for various n criteria
Table 6 Constructing the pairwise comparisons matrix based on Mukherjee & Singh (2020) and Arulbalaji et al. (2019) and checking the thematic layers’ consistency, GM geomorphology, GL geology, LD lineament density, DD drainage density, LULC land use & land cover, S soil type, NDVI Normalized Differential Vegetation Index, SL slope, TWI Topographic Wetness Index, Rf rainfall
Table 7 The assigned and normalized weight of different classes for different  Parameters for groundwater potential zoning

Validation of the map revealing the groundwater potential zone

Several analyses, making use of the obtainable well depth data, have validated the groundwater potential zone (Dar et al., 2021; Naghibi et al., 2016). The Receiver Operating Characteristic ROC and Area Under Curve (AUC) have been used to evaluate groundwater potential maps for accuracy in several research projects (Boughariou et al., 2021; Castillo et al., 2022). The ROC curve shows how the false-negative (X-axis) and false-positive (Y-axis) rates balance out for each possible cut-off number. This compromise can be seen as a decrease in the false-negative rate while an increase in the false-positive rate. In ROC curve analysis, the capacity of a prediction system to properly predict previously defined “occurrences” is measured by an indicator called the area under the curve (AUC) (Gorsevski et al., 2006; Ozdemir, 2011; Kim et al., 2019). According to the AUC curve range varies from 0.5 to 1 in which “0.5–0.6,” “0.6–0.8,” “0.8–0.9,” and “0.9–1.0” which correspond to “poor,” “average,” “good,” “very good,” and “excellent” respectively (Mohammady et al., 2012; Pourtaghi & Pourghasemi, 2014; Razandi et al., 2015).

Result and discussion

In this study there are 10 major responsible factors such as geology, geomorphology, lineaments density, slope, topgraphic wetness index, drainage density, rainfall, soil and land cover and land use, NDVI used for their correlation to identify the groundwater potential zone.

Geology (GL)

The study area comprised Vindhyan supergroup and Quaternary sediment rock of the age from Proterozoic to Recent Quaternary sediments. There are two types of alluvium found in the Quaternary sediments: older alluvium and newer alluvium. The Vindhyan supergroup is further subdivided, with the Kaimur rock group consisting of Quartzites and covering about 13.2% (i.e., 394.651 km2) of the land area, and the Rewa rock group consisting of Shale and Sandstone covering about 4.29% (i.e., 128.455 km2) of the land area. The Laterite Mantle lies beneath the crust of quartzite rock. Older alluvium includes both the Banda and Varanasi alluvium; however, the Banda alluvium accounts for the greater proportion of the total area approximately 1914.61 km2 (i.e., 64.01%) and the Varanasi Alluvium occupies the area about 374.849 km2 (i.e., 12.53%). Over the Kaimur and Rewa Group, river material has accumulated to form the Banda Alluvium. The lower part is composed of yellowish to brown silt–clay, while the higher part is composed of coarse to fine sand and reddish-brown silt in abundance. The Newer Alluvium, which comprises Terraces, and channel Alluvium consist the area about 73.31 km2 (i.e., 2.45%) and 25.35km2 (i.e., 0.84%), respectively. Terraces Alluvium and channel Alluvium is manly composed of medium- to fine-grained sand and fine- to medium- grained sand with thin layers of silt and clay (Fig. 3). Highest weight assigned to Quaternary sediment with high permeability and poor holding capacity, and lower weight assigned to Vindhyan supergroup in which Rewa group is assigned by bigger weight than kaimur group because Rewa group contains more permeable Sandstone than kaimur group Quartzite (Kumar Dinkar et al., 2019; Singh & Srivastava, 2011).

Fig. 3
figure 3

Geological map of the Trans Yamuna Region, Prayagraj

Geomorphology (GM)

Geomorphology is the study of the structure and evolution of landforms, which is crucial to understanding the availability and distribution of underground water (Biswas et al., 2012; Waikar & Nilawar, 2014). The area under study consists of seven different geomorphological formations according to their origin. The pediplain–pediment complex, low and medium dissected structural lower plateau, and the younger and older alluvium plain are all the results of denudational processes, while others are the result of depositional processes, such as the younger and older alluvium plain. Most of the denudational hills are covered with vegetation and have a low rise. The study region is mostly covered by pediplain–pediment complex which comprises the 55.85% (i.e., 1671.7km2) of the total area. Second most covering part of the area is Alluvial plain which covers almost 27.7% (i.e., 829.33 km2) of the total area. The denudational hills were found at two locations which is situated in south east region in the study area. The low dissected plateau comprises the 4% (i.e., 120.7km2) of the total about and moderately dissected plateau cover the area 8.7% (i.e., 258.9 km2) of the whole area. There are some natural and artificial reservoirs and ponds in the study area which covers the area about 0.22% (i.e., 6.8 km2) of the whole area. Dams and check dams were built to make these lakes and ponds. Most of the time, croplands, lowlands, and hills with or without plants are all linked to each other (Fig. 4).

Fig. 4
figure 4

Geomorphology map of the Trans Yamuna Region, Prayagraj

Land use/land cover (LULC)

On the surface of the earth, land usage refers to things that people control, like urban areas, vegetation cover, and industries. On the other hand, land cover includes elements such as water bodies, barren soil, mountains, forests, and exposed rocks. Both land cover and land use are considered landforms (Murthy & Mamo, 2009). It demonstrates the importance of groundwater supply and development in any given region (Scanlon et al., 2005). Due to the lack of permeable surfaces, water infiltration abilities are often diminished in urban and arid environments with minimal vegetation cover. Both land that is covered in vegetation and land that is used for agriculture have a significant potential for water infiltration, although agriculture infiltrates water to a lesser degree than does land covered in vegetation. Bodies of water are ideal for recharging aquifers. For this study the LULC map was prepared with the help of supervised classification and classified into six classes. The majority of the studied area (1901 km2; 63.59%) is devoted to agricultural use, followed by scant vegetation (528.28 km2; 17.67%), bare ground (293.92 km2; 9.83%), urbanized land (169.83 km2; 5.68%), water bodies (63.29 km2; 2.11%), and dense vegetation (33.03 km2; 1.10%) (Fig. 5). This map was classified with help of Supervised classification method in ERDAS IMAGINE 2014. For the accuracy assessment of the LULC the kappa statistics is important method (Islami et al., 2022; Kafi et al., 2014). For this classified map kappa statistics is used and the accuracy for this map is 83.5%. For kappa statistics the 110 random points were generated and a matrix Table 8 was created for further calculations. The output of these calculations was given in the Table 9.

Fig. 5
figure 5

Land use and land cover map of Trans Yamuna Region, Prayagraj

Table 8 Table used for evaluating the accuracy of LULC
Table 9 Table containing values obtained after evalution for checking the accuracy of LULC

Soil (S)

It is of the utmost significance for the soil to play a part in the processes of percolation and infiltration as it pertains to the recharging of groundwater (Jasrotia et al., 2016). Varying varieties of soil allow water to move laterally or vertically at different rates due to changes in the soil’s distinguishing properties, such as size, texture, arrangement, and the pore spaces in between the soil grain. These variations are due to the varying degrees of permeability of the soil to the movement of water. These variations can be attributed to the soil’s ability to accommodate water movement (Kumar et al., 2022). On the basis of textural analysis of soil, the Trans Yamuna region is covered by mainly Clayey loamy, Loamy, silty loamy. Type of soil (Fig. 6). Mostly part of the region is dominated by the Loamy soil (1965.1 km2; 65.70%) area followed by silty Loamy soil (555.32 km2; 18.57%) and Clayey loamy soil (470.54 km2; 15.73%).

Fig. 6
figure 6

Soil texture map of Trans Yamuna Region, Prayagraj

Rainfall

Rainfall is one of the main things that affects groundwater recharge. Infiltration and percolation, which impact groundwater recharge, are affected by the intensity and duration of rainfall. Greater amounts of precipitation that fall in a shorter amount of time suggest less percolation of water and higher rates of surface water flow, and vice versa (Kabeto et al., 2022). The data from the year 1990 to 2020 is taken for generation the rainfall map. For this study, the rainfall map is classified into five classes such as 714–764 mm, 764–814 mm, 814–864 mm, 864–915 mm, and 915–965 mm rainfall. Rainfall map (Fig. 7) shows that very high to high rainfall occurs of south-east and very low precipitation occur in the north- west of the study area. By the (Fig. 8) it can be understood that the rainfall is minutely decreasing over the years in the Trans Yamuna Region.

Fig. 7
figure 7

Rainfall map of Trans Yamuna Region, Prayagraj

Fig. 8
figure 8

Trend of Rainfall Yearly Deviation by Mean from the year 1990 to 2020

Drainage density (DD)

The most important thing about hydrology is drainage density. Drainage pattern and density are caused by both surface and sub-surface features, such as the type of vegetation, structural features (Lineaments), and lithology of an area (Biju et al., 2018). Drainage density is calculated by dividing the total length of all stream segments by the area of the study area (Manap et al., 2014). It is based on following formula.

$$\mathrm{DD}=\frac{\mathrm{Stream\;Length}\left(\mathrm{L}\right)}{\mathrm{Basin\;Area}\left(\mathrm{A}\right)}$$

The area with a very high drainage density shows that the infiltration rate is low compared to the surface runoff, and vice versa. This depends on the permeability, which in turn depends on the structurally controlled rock type of the surface and subsurface geology (Edet et al., 1998). It is clear that if the Drainage density is high, the recharge rate will be low, and vice versa. The map of the drainage density is classified into the 5 classes such as 1.48–2.44, 1.23–1.48, 1.04–1.23, 0.84–1.04, and 0.34–0.84 km/km2, on the basis of Drainage density value as shown in the (Fig. 9).

Fig. 9
figure 9

Drainage density map of Trans Yamuna Region, Prayagraj

Lineament density (Ld)

The lineament density has a significant effect on groundwater recharge (Rejith et al., 2019; Sarmah & Das, 2018). Joints, faults, and other surface discontinuities are reflected in the surface’s lineaments, which are linear features visible from above (Han et al., 2018). Groundwater recharge in hard rock areas is done by the lineaments. High lineament density shows that the land is porous and permeable, which means that there is a lot of groundwater potential there and vice versa (Tolche, 2021). Lineament density (Ld) is defined as the total length of segmented liniments divided by the considered area (Yeh et al., 2016).

$$\mathrm{Ld}=\frac{\sum_{i=1}^{i=n}{L}_{i}}{A}$$

\(\sum_{i=1}^{i=n}{L}_{i}\) = Length of the segmented Lineaments. A = Total Area

The study area has undulating surface and contains exposed rock of Kaimur group this Kaimur group consists mainly sandstone. So, there are many fractures and joints present in this region which liable to recharge the groundwater. In this region the lineaments are oriented in the NE—SW direction which is shown by the (Fig. 10). This map is classified into the 5 classes such as 1.03–1.74, 0.65–1.03, 0.39–0.65, 0.21–0.39, and 0.01–0.21 (very low) km/km2 as shown in the (Fig. 11).

Fig. 10
figure 10

Orientation of the lineaments

Fig. 11
figure 11

Lineament density map of Trans Yamuna Region, Prayagraj

Normalized Differential Vegetation Index (NDVI)

NDVI is one of the most important parameters to delineate the groundwater potential zone. (Pande et al., 2021; Sar et al., 2015). NDVI provides a rough estimate of the amount of vegetation present and the groundwater potential zones over the area (Parizi et al., 2020).

NDVI was estimated as the following formula

$$\mathrm{NDVI}= \frac{NIR-RED}{NIR+RED}$$

NDVI values range from -1 to + 1, with values closer to 0 indicating less vegetation, such as barren land and the dark surface of bodies of water and values closer to + 1 indicating more dense vegetation (Hasanuzzaman et al., 2022). In the study region, the NDVI values vary from -0.09 to 0.41 and classified into the five classes such as -0.09–0.08, 0.08–0.13, 0.13–0.16, 0.16–0.2 and 0.2–0.41, which indicate the area has moderate vegetation conditions (Fig. 12).

Fig. 12
figure 12

Normalized Differential Vegetation Index map of Trans Yamuna Region, Prayagraj

Slope (SL)

Slope of any area is very essential criteria to identify the groundwater potential zone (Doke et al., 2021) which is effect directly to Surface runoff, infiltration, and percolation. High slope locations do not permit groundwater recharge and increase the slope's erosion rate due to increased surface runoff, whereas gentle slope areas are suitable for groundwater recharge (Das et al., 2017). This study area has high slope area that is very less compare to the flat area and the slope map is classified into the five classes such as 0–2, 2–5, 5–9, 9–18, and 18–60 as seen in the (Fig. 13). In the study region the Kaimur rocks have high slope compared to the Alluvium. In Alluvium region soil allows the water to be infiltrate and percolate.

Fig. 13
figure 13

Slope map of Trans Yamuna Region, Prayagraj

Topographic Wetness Index (TWI)

The TWI is a good factor for assessing the groundwater recharge. (Melese & Belay, 2022) TWI is developed in order to gain an understanding of the ways in which the structure of the terrain influences the dispersal of water and the location of areas where water accumulates. Flow direction, flow accumulation, and slope are used in mathematical formulas to figure out TWI. The higher value of the TWI designates the high ponding of water indicates the slope is gentle and helps the groundwater to be recharged (Ballerine, 2017).

TWI is calculated using the following mathematical formula,

$$\mathrm{I}=ln\frac{\alpha }{tan\beta }$$

where α is the area attributable to the upslope and β is the topographic gradient.

In the study region the TWI is classified into the five classes such as 2.83–7.52, 7.52–9.2, 9.2–11.35, 11.35–14.35, and 14.35–22.42, as shown in the (Fig. 14).

Fig. 14
figure 14

Topographic Wetness Index map of Trans Yamuna Region, Prayagraj

Groundwater potential zone map of Trans Yamuna Region

Consistency ratios were determined for each factor Table 6 and the classes of each factor layer Table 7 and the values showed that this study’s judgement matrices were valid (< 0.10) and had good consistency. With the help of ArcGIS, we integrated 10 thematic layers using weighted overlay in a GIS environment to produce a map of the groundwater potential zone. Linear combination equations were used to combine all 10 of the thematic layers by multiplying the weights of each theme by the weights of their corresponding classes to get the groundwater potential zone map using the Eq. 7 (Fig. 15). The estimated Groundwater Potential Zones of the Trans-Yamuna region were categorized into three separate classes such as low (11.928%; 354.884 km2), Moderate (76.44%; 2274.4 km2), High (11.267%; 345.943 km2).

Fig. 15
figure 15

Groundwater Potential Zones Map of Trans Yamuna Region

In the study region, all thematic layers influence the Groundwater Potential Zones differently. For High Groundwater Potential Zone, the Geomorphological classes, Alluvial Plain and Pediment Pediplain complex shows high impact (Fig. 16(a)) because both have relatively high porosity and permeability than other geomorphological classes in Trans-Yamuna region. The Geological classes, Banda Alluvium and Varanasi Alluvium performs good (Fig. 16(b)) because Alluvium is very high porous and permeable than other classes in the study region. The Lineament density classes such as low to moderate effect more to High Groundwater Potential Zones (Fig. 16(c)) because these classes cover most of study area than the higher lineament density classes which cover limited study area and it may drain the infiltrated water into drainages by which the percolation rate become lesser than the surface water run-off. The Drainage density classes, (0.34–0.84 km/km2) to (1.23–1.48 km/km2) depict high efficacy than remaining very high drainage density class (Fig. 16(d)) because low to moderate drainage density show less surface water run-off. The LULC classes, Agriculture, Sparse vegetation and Ponds shows high influence (Fig. 16(e)) because these classes allow water to percolate beneath the earth surface and these classes cover large area of the study region than other classes. The Loamy soil, one of class of soil play a good role for High Groundwater Potential Zone (Fig. 16(f)) because this class has capability to allow surface water to percolate. The all NDVI classes depicts good condition for High Groundwater Potential Zone in the study area (Fig. 16(g)) The (0–2) degree slope, the lowest class of Slope parameter behaves good for High Groundwater Potential Zone in the Trans – Yamuna region than other high degree slope classes (Fig. 16(h)) because it restricts the surface water run-off. The class of TWI (2.83–7.52) to (11.35–14.35) has high impact on High Groundwater Potential Zone (Fig. 16(i)) because these classes responsible for groundwater recharge and cover the high study region than remaining high TWI class which contain less influence. The (772–810 mm) to (852–896 mm) classes of Rainfall depict good efficacy on High Groundwater Potential Zone than most of high rainfall class (Fig. 16(j)) because topography of study region lesser the influence of high rainfall and trigger the surface water run-off.

Fig. 16
figure 16figure 16

(a to j): The relative influence of parameters classes on Groundwater Potential Zones

For Moderate Groundwater Potential Zone, the Geomorphological classes such as Pediment pediplain complex, Alluvium plain depict high influence while for Low Groundwater Potential Zone, moderately dissected plateau has high influence (Fig. 16(a)) because it has low porosity and permeability relative to Pediment pediplain complex and Alluvium plain. The Geological classes, Banda alluvium and Kaimur shows high impact on Moderate and Low Groundwater Potential Zone respectively (Fig. 16(b)) because Kaimur contains Quartzite mostly which is low porous and permeable relative to Banda alluvium. The lower to moderate classes of Lineament density show higher efficacy on Moderate as well as on Low Groundwater Potential Zone (Fig. 16(c)) because these classes cover large area of study region and High lineament density is less in the study area so it is not comparatively more effective to recharge the groundwater than other classes. The Drainage density classes, (1.04–1.23 km/km2) influences more to Moderate and Low Groundwater Potential Zones (Fig. 16(d)) because this class covers the both of Zones largely than the other drainage density classes. The agriculture and bare land class of LULC effect the Moderate and Low Groundwater Potential zones respectively (Fig. 16(e)) because agriculture covers the most of study region and allow the water to be percolate relative to bare land. The Soil classes, Loamy soil and Clayey loam soil influence the Moderate and Low Groundwater Potential Zones (Fig. 16(f)) because Loamy soil has more porosity and permeability than clayey loam soil. The NDVI classes such as (0.13–0.16) and (0.08–0.13) has more efficacy to Moderate and Low Groundwater Potential Zones (Fig. 16(g)) respectively because 0.13–0.16 NDVI class allow more water to recharge the groundwater than 0.08–0.13 NDVI class. The 0–2 to 2–5 degree classes of Slope effect the Moderate to Low Groundwater Potential Zones (Fig. 16(h)) respectively because 0–2 degree of slope restrain the surface water run – off than 2–5 degree class of Slope. The lower class 2.83–7.52 of TWI is responsible for the Moderate and Low Groundwater Potential Zones (Fig. 16(i)) The Rainfall classes, (772–810 mm) and (810–852 mm) shows large impact on Moderate and Low Groundwater Potential Zones (Fig. 16(j)) Respectively.

Although, by this map it can be predicted that the study region has the moderate Groundwater Potential Zones.

Map removal sensitivity analysis for groundwater potential zone

Map removal Sensitivity analysis evaluates the variability or unpredictability of model output result by assessing the consequences of removing individual thematic layers from the calculation of the groundwater potential map (Kindie et al., 2019; Thapa et al., 2018).

In this methodology, the approach involves removing each of the thematic layers individually and producing a new groundwater potential zone (GWZ) map by overlaying the remaining layers. Afterwards, the formula, which is utilized to compute a sensitivity index, which serves to evaluate the influence of eliminating each layer, given as follows.

$$\mathbf{S}\mathbf{I}=\frac{\left|\frac{{\boldsymbol{G}}{\boldsymbol{P}}{\boldsymbol{Z}}}{{\boldsymbol{N}}}-\frac{{\boldsymbol{G}}{\boldsymbol{P}}{{\boldsymbol{Z}}}^{\mathbf{^{\prime}}}}{{\boldsymbol{n}}}\right|}{{\boldsymbol{G}}{\boldsymbol{P}}{\boldsymbol{Z}}}\times 100$$

SI denotes the sensitivity index for the removal of a single map, GWP represents the Groundwater Potential Index determined using all the thematic layers, and GWP’ signifies the groundwater potential index computed by eliminating one thematic layer at a time. N and n represent the quantities of thematic layers utilized for computing GWP and GWP’.

The statistical findings from the map removal sensitivity analysis Table 10 indicate that the Geology and Geomorphology theme layers were the most important factors in estimating Groundwater Potential Zones. However, soil type, lineament density, drainage density, and rainfall were found to have a modest level of sensitivity.

Table 10 Map removal sensitivity analysis statistics

After removing the Geological theme layer, the Geomorphology and Lineament density criteria gave higher variation index values. This shows that the lithology of the study area has a big effect on Groundwater Potential Zones.

Validation of outcome map prepared by AHP model

A statistical validation of the groundwater prospect map predictions was carried out in this study using a Receiver Operating Characteristic (ROC) curve (Nandi & Shakoor, 2010; Senapati & Das, 2021) by analysing with depth of water level in metres of 189 wells data using help of ArcSDM toolbox (www.sdmtoolbox.org). This toolbox needs to be added to ArcGIS software. This well data point was attained from the Bhuvan NRSC (https://bhuvan-app1.nrsc.gov.in/gwis/gwis.php#) site. These well’s water level depth points were categorized as Good (< 30 m), Moderate (30 – 80 m) and Poor (> 80 m). These points were placed over the resulting map of GWPZ. In order to evaluate how accurate, the model is, the AUC value was determined to be between 0.5 and 1 (Manap et al., 2014; Naghibi et al., 2016). Being closer to the value 1 indicates that the model's accuracy is higher, whereas being closer to the value 0.5 indicates that the model's accuracy is lower. Although, the ROC curve predicted the authenticity of this study was 81.3%. it suggests that the model performed very well (Fig. 17).

Fig. 17
figure 17

ROC curve for validation of Groundwater Potential Zone

Conclusion

Groundwater is the main source of water for agriculture, domestic and industrial purposes in the Trans Yamuna region. Geospatial techniques with the aid of standard and remotely sensed geospatial data is effective in lessen the time, money, effort, and manpower, thus allowing for appropriate decision-making in regards to groundwater resource management and development. However, for the study region, the groundwater potential zones were determined through the application of an integrated remote sensing and GIS technique based on the AHP method. The outcome of the validation indicates that AHP is a credible method for determining the groundwater potential zones of the study area. The current study has a high level of scientific accuracy (accuracy = 81.3%) and may be helpful to the locals in selecting valuable well water drilling regions, as well as the authorities engaged in water resource management and land use planning. The study area exhibits an approximate 11.928% low zone of Groundwater potential and about 11.267% area is covered by the high Groundwater Potential Zone. Most of the area covered by the Moderate groundwater potential zone which is about 76.44% of the whole area. The final result is based on how each of the input parameters affected the AHP calculation. However, more discrepancies can be found between the final GPZ map and the well data for validation in the Koraon region than in any other region of study area. This region has sandstone rock, older alluvium, high to moderate Drainage Density and high rainfall.

Thus, large area of Koraon region has moderate groundwater potential zones and less area is covered by low groundwater potential zones. Preliminary Draft Report District Survey Report-Allahabad (In-Situ Rock, 2018) reports that groundwater flows in the Koraon region in north-west (high water table contour to low water table contour). Lineament trend is also in north-west direction and it is possible that same trend may exist in the subsurface. Therefore, the water table depth (mbgl) is greater in this region which is the cause of mismatch between final GPZ map and well data for validation. According to this study, sustainable agricultural and economic development in the study region depends on prudent groundwater management. Rainwater harvesting and artificial recharge for the aquifer are needed to avoid the scarcity of water in the study region in the coming years. Plantation is needed in the Trans Yamuna region because it enhances the water holding capacity of the soil and allows the water to be infiltrate and it prevents the water run-off and erosion of the soil.