Introduction

Water is critical to human life; but, due to diminishing of water resource available on the surface, groundwater is now the primary source of fresh available water for home, agricultural and other purposes. The utilization of subsurface water has increased due to the ever-increasing need for water (Das et al. 2018; Das 2019; Ajay Kumar et al. 2020). Groundwater (especially from both tube and dug wells) serves as primary origin of water for irrigation, commercial enterprise, and domestic needs, particularly in the arid zone where yearly rainfall is minimal. Agricultural uses account for 42% of total groundwater worldwide. Both environments of the villages as well as in cities, groundwater acts as crucial sustainable sources for dependable growth of the economy and water availability for drinking (Kaliraj et al. 2014; Andualem and Demeke 2019; Kumar et al. 2020). Groundwater is a dynamic and renewable natural resource, although its availability is limited in hard rock landscapes such as red and lateritic zones. Water resource enhancement is required in India since it is critical to the country's agro-economy development. In addition to that, above 90% of people living in the rural areas and approximately 30% of the people living in the urban areas uses groundwater as the main source of household and drinking requirements (Ghosh et al. 2016; Abrams et al. 2018).

Groundwater is retained in the pore of rock and soil under the water table. This is a valuable as well as essential elements of the natural hydrological cycle. Its accessibility makes it a great asset for residential, agricultural, and social development projects (Agarwal et. al., 2013; Choubin et. al., 2019). Water scarcity has become a problem in densely populated, urbanized areas around the countries throughout the globe, including India, Africa, and China, as the requirement of domestic, Industrial, and agricultural purposes has increased (Arshad et. al., 2020). Several researchers from around the world have used techniques like frequency ratio, logistic regression model techniques, random forest model, decision tree model, artificial neural network, and evidential belief function to locate potential groundwater recharge zones (Hutti and Nijagunappa 2011; Fenta et. al., 2014). Thus, the remote sensing technique offers systematic, comprehensive, and quick repeating region coverage, making it an effective approach for acquiring quick spatiotemporal data from a large area (Al- Rahmati et al. 2015; Gnanachandrasamy et al. 2018; Al-Djazouli et al., 2020; Koli et al., 2020).

The Geographic Information System (GIS) provides a blueprint for dealing with massive and complicated sensing data. Satellite data is widely available and used, and it is coupled with traditional maps and consideration of topographical approaches, has made fundamental data for estimating potential groundwater locations much easier to come by (Yeh et al. 2016; Raju et al. 2019). Using the Integrated Analytical Hierarchy Process (AHP) approach in conjunction with Remote Sensing (RS) and GIS, some studies have effectively identified prospective groundwater recharge zones. Furthermore, by combining multi-criteria decision analysis (MCDA) with RS and GIS techniques, AHP has now been effectively used in several studies of watershed management and groundwater investigation (Mukherjee et al. 2012; Kumar and Krishna 2018; Arulbalaji et al. 2019; Çelik 2019).

AHP technique is recognized as a basic, easy, effective, & reliable tool for MCDA (Hussein et al. 2017; Benjmel et al. 2020; Dar et al. 2020). Due to the obvious increased exploitation of this surface water resources, researchers are concentrating their efforts on creating cost-effective approaches for detecting potential groundwater recharge regions, which could help with transforming sustainable water resource management. It's critical to locate, prepare for the greatest possible use and sustainability of Groundwater Potential Zones (GWPZ) (Magesh et. al., 2012; Yeh et. al., 2016; Patra et. al., 2018). Many geological and geophysical approaches for establishing the existence of groundwater table and aquifers are thought to be significantly more trustworthy, but they are both costly and long delayed methods for assessing the availability of groundwater resources in a given location (Pinto et al. 2017; Nasir et. al., 2018). RS combined with GIS has become a cost-effective and efficient method for exploring groundwater resources, investigation, and operating procedures (Awawdeh et. al., 2013; Fashae et. al., 2014; Maity and Mandal 2017; Kanagaraj et. al., 2019).

In 1987 for the first time in India, National Remote Sensing Agency (NRSA) used RS and GIS techniques to delineate the GWPZ. After that many researches in India used the combination of RS and GIS for delineation of GWPZ. For delineation of GWPZ, most of the researches used a maximum of 8 to 10 thematic maps. The thematic maps such as geomorphology, lithology, slope, hydrological soil group (HSG), drainage density (DD), Land use and Land cover (LULC), rainfall and lineament density (LD) were commonly used by many researchers (Fenta et al. 2014; Kaliraj et al. 2014; Rahmati et al. 2015; Jothibasu and Anbazhagam, 2016; Pinto et al. 2017; Abrams et al. 2018; Patra et al. 2018; Jahan et al. 2019; Murmu et al. 2019). Apart from these thematic maps, few researchers also used distance from river, pre-monsoon groundwater depth, post-monsoon groundwater depth and topography thematic map of the study area (Das 2019; Ajay Kumar et al. 2020; Al-Djazouli et al. 2020; Dar et al. 2020; Koli et al., 2020). After going through many studies, it was found that, most of the researchers developed the suitability map of GWPZ by considering maximum of 8 to 10 thematic maps. And none of the researchers used multicollinearity check and sensitivity analysis as shown in Table 1.

Table 1 Details of literature review for GWPZ

Although the delineation of GWPZ using GIS and multi-criteria analysis saves a lot of time as compared to other direct methods such as collecting the data manually from the field, survey of the area, drilling in order to collect the soil sample as well as lithology data, etc., it comes with some negative impacts due to human activities as well as other reasons like field data collection for validation, resolution of thematic maps, assigning weights to thematic maps (Abrams et al. 2018; Jahan et al. 2019; Djazouli et al., 2020). Collecting accurate field data such as well yield, coordinates, etc., for assessing the validity of suitability map plays a vital role (Hussein et al. 2017; Kanagaraj et al. 2019). GWPZ suitability map also influenced by the resolution of the thematic maps (Das et al. 2018). By considering coarser resolution thematic maps, the accuracy of the suitability maps reduces, whereas with finer resolution thematic maps, the accuracy of the suitability map increases (Shekhar and Pandey 2015; Yeh et al. 2016; Lakshmi and Reddy 2018). Weights assign to each of the thematic maps shows significant impact on the suitability map (Nasir et al. 2018; Dar et al. 2020). By considering inappropriate weights for thematic maps, it would result in changes of the sub classes (i.e., Very high, High, Moderate and Low) area of the suitability map (Arshad et al. 2020). Hence, it is necessary to assign appropriate weight to each of the thematic maps by consulting with expertise related to this field as well as based on literature review (Ghosh et al. 2016; Murmu et al. 2019; Abrams et al. 2018; Al-Djazouli et al. 2020).

The objective of the present study includes preparation of GWPZ using 14 thematic maps in ArcGIS software by considering AHP. The novelty of the study includes inclusion of maximum number of thematic maps, i.e., 14 numbers than that of previously used maximum of 10 thematic maps followed by multicollinearity check as well as sensitivity analysis. Inclusion of multicollinearity check and maximum number of thematic maps improves the accuracy of the GWPZ map. The GWPZ map offers remarkably reliable data that can aid in long-term efficient groundwater resource management in the water shortage areas.

Materials and methods

Study area

The present study area involves the Jamsholaghat sub-basin of the Subarnarekha Basin. Study area extends over provinces of West Bengal and Odisha having total catchment area of 552 km2. Extends of its longitude and latitudes are 86°30′ to 86°50′ E and 22°04′ to 22°32′ N, respectively. Delineated Jamsholaghat sub-basin in Subarnarekha Basin is shown in Fig. 1.

Fig.1
figure 1

Delineated Jamsholaghat sub-basin in Subarnarekha Basin

Data procurement

Digital elevation model (DEM) data were procured from Alaska search facility, Earth data which is available online at https://search.asf.alaska.edu/#/, having resolution of 12.5 m. Landsat 8 data were procured from Bhuvan, National Remote Sensing Centre (NRSC) which is available freely at http://bhuvan.nrsc.gov.in. Rainfall data were acquired from Center for Hydrometeorology & Remote Sensing (CHRS) available at https://chrsdata.eng.uci.edu/. Lithology data having scale of 1:50,000 were obtained from the Geological Survey of India (GSI) available at https://bhukosh.gsi.gov.in/Bhukosh/Public. The soil type and soil depth map having scale of 1:50,000 was procured from the National Atlas and Thematic Mapping Organization's district planning map series.

Methodology

It is crucial to identify the GWPZ in order to install the relief well for a particular basin. In the present study, fourteen thematic maps (i.e., Lithology, Slope, LULC, DD, LD, Distance from river, Rainfall, HSG, Geomorphology, Topographic Wetness Index (TWI), Topographic Position Index (TPI), Profile Curvature, Plan Curvature and Roughness) were prepared using ArcGIS software. Then, the multicollinearity check of 14 thematic maps were carried out followed by overlaying of thematic maps in ArcGIS after assigning weights and ranks using AHP, then followed by sensitivity analysis and validation of GWPZ map by considering the Receiver Operating characteristic Curve (ROC). The flowchart for developing the GWPZ is shown in Fig. 2.

Fig. 2
figure 2

Flowchart for delineating the groundwater potential zones

LULC map shows the pictorial representation of the study area having agricultural land, forests, water bodies, wetlands, built up area, etc. (Jahan et al. 2019). Landsat 8 map was downloaded from Earthexplorer website. Using supervised classification in ArcGIS software, land use map was generated. At first, the band numbers of the red, green and blue color were changed from high to low followed by classification of image based on land types using image classification tools. Accuracy of LULC map increases by increasing the number of training sample for each category of land forms. After saving the training samples, maximum likelihood classification was used in order to produce the LULC map in ArcGIS software. The present study area was broadly classified into five categories, i.e., Forest, Agricultural, Built-up, Barren/Waste and Water Bodies, consisting of 15%, 66%, 10%, 4% and 5% of the study area, respectively.

The rate of infiltration and percolation through the surface to the subsurface and its aquifers is mostly affected by particles of soil (Murmu et al. 2019). According to the United States Department of Agriculture (USDA), HSG was classified into four groups, i.e., A, B, C, and D. At first a new shapefile was created followed by drawing the polylines based on soil groups in the shapefile by using editor tool in ArcGIS software. Then, soil groups were entered in the attribute table for different polygons those were being generated by joining the polylines for respective soil groups. After merging the polygons of same soil groups in attribute table using editor tool, Soil map was developed. Study area consists of mainly four types of soil group, i.e., A, B, C and D, consisting of 19%, 9%, 42% and 30% of the Jamsholaghat sub-basin, respectively.

The slope is quite important in recharging the groundwater as it controls the surface runoff (Lakshmi and Reddy 2018). Using the DEM, Slope map was generated for the study area. This was done by choosing the Surface option from Spatial Analyst Tools. The developed slope map was divided into five categories based on slope in degrees, i.e., Very Gentle, Gentle, Moderate, Moderate Steep, and Steep, accounting for 40%, 49%, 9%, 1%, and 1% of Jamsholaghat sub-basin, respectively.

The density of drainage shows the contiguousness of channel spacing as well as the composition of the surface material, providing a quantitative estimate of average stream channel length for the entire basin (Jothibasu and Anbazhagan 2016). DD map was generated from DEM of the study area. It was done by selecting “Hydrology” option from “Spatial Analyst Tools”. The DEM was first geo-referenced and converted to a projected coordinate system, followed by generating the flow direction map. It was then followed by generating the flow accumulation map that was generated by eliminating the values which were below 500 in the flow accumulation map. Then stream network of the study area was generated by using flow direction and flow accumulation maps. Using this stream network, stream order of the sub-basin was developed followed by use of line density option so as to prepare the DD map in ArcGIS software. DD map of the study area consists of six ranges, i.e., < 0.9, 0.91–1.76, 1.76–2.61, 2.62–3.45, 3.46–4.30 and 4.31–5.15, consisting of 1%, 11%, 40%, 40%, 7% and 1% of Jamsholaghat sub-basin, respectively.

A lithology map of a research area depicts the features of several types of rocks found in the Earth's subsurface (Kumar et al. 2014). Lithology map of the present study area was downloaded from GSI by considering the latitude and longitude extend of the study area. Clip function was used in ArcGIS software in order to extract the required study area. In terms of improving the map's accuracy, lithology data was collected from Central Ground Water Board (CGWB), Bhubaneswar and verified with lithology map downloaded from GSI. Based on field data, lithology map was rectified and redeveloped. Study area consists of mainly five types of rock formations, i.e., Granite, Gravel, Laterite, Quartz and Schist, consisting of 36%, 1%, 20%, 2% and 41% of the study area, respectively.

The primary structural elements such as cleavages, fractures, discontinuity surfaces, and faults are expressed by the lineaments having the property of linearity (Shekhar and Pandey 2015). Landsat 8 map was downloaded from Earthexplorer website. The image was enhanced in PCI Geomatica using the "Enhancements" tool. The Librarian Algorithm was then used to extract Lineament (Çelik 2019). The line density function in the spatial analyst tool was selected in order to prepare the LD map in ArcGIS software. The study area's LD map was categorized into five groups, i.e., 0.5, 0.5–1.1, 1.2–1.6, 1.7–2.2, and 2.3–2.8 km/km2, with 76%, 12%, 6%, 4%, and 2% of Jamsholaghat sub-basin, respectively.

Geomorphology is the study of the earth's form (landform), as well as its characterization and origins. Geomorphology map of the Jamsholaghat sub-basin was downloaded from GSI by considering the latitude and longitude extend of the study area (Hussein et al. 2017; Patra et al. 2018). Clip function was used in ArcGIS software so as to extract the required study area. In terms of improving the map's accuracy, lithology data was collected from CGWB, Bhubaneswar and verified with geomorphology map downloaded from GSI. Based on field data, geomorphology map was rectified and redeveloped. The study area was mainly categorized into five groups, i.e., Hills and Valleys, Alluvial Plain, Pediment Pediplain complex, Flood plain and water bodies, accounting for 4%, 1%, 82%, 10% and 3% of Jamsholaghat sub-basin, respectively.

Distance from river map was developed in ArcGIS software. At first, flow accumulation was set greater than 1000 using map algebra function in spatial analyst tool followed by generating the stream network. Based on distance, multiple ring buffer function was used in order to develop the distance from river map. Study area was categorized into six groups, i.e., < 300, 301–600, 601–900, 901–1200, 1201–1500 and > 1500 m, consisting of 5%, 9%, 13%, 17%, 21% and 35% of Jamsholaghat sub-basin, respectively.

Rainfall recharges groundwater, making it an active natural resource (Arulbalaji et al. 2019). Rainfall data from the different rain gauges of Indian Meteorological Department (IMD) situated inside the buffer zone and its nearby regions of the research area was collected for 30 years (i.e., 1987–2016). Rainfall map was developed in ArcGIS software. At first, annual average rainfall data was estimated in excel for all the rain gauges followed by adding those data to ArcGIS based on latitude and longitude of rain gauges. In ArcGIS software, the spatial distribution map of rainfall was created using the Inverse Distance Weight (IDW) interpolation technique. The rainfall map of the present research area was classified into five groups, i.e., 1420–1471, 1472–1523, 1524–1575, 1576–1626 and 1627–1678 mm, consisting of 6%, 15%, 32%, 40% and 7% of Jamsholaghat sub-basin, respectively.

TWI map was developed in ArcGIS software. At first, slope map was generated using spatial analyst tool. Equation 1 was used in raster calculator function available in map algebra in order to prepare the TWI map (Magesh et al. 2012; Raju et al. 2019).

$${\text{TWI}} = \ln \frac{\alpha }{\tan \beta }$$
(1)

where α is upslope and β is topographic gradient. TWI map of the present research area was categorized into four groups, i.e., very low, low, moderate and high, consisting of 52%, 25%, 15% and 8% of Jamsholaghat sub-basin, respectively.

Using the Jenness method and the Topography tools extension, the TPI thematic map was created in ArcGIS software. TPI map of the present research area was categorized into five groups, i.e., [− 9]–[− 1.1], [− 1]–[− 0.29], [− 0.28]–[0.3], [0.3]–[1.1] and [1.2]–[1.3] consisting of 5%, 29%, 34%, 28% and 4% of Jamsholaghat sub-basin, respectively.

The roughness index measures the variation in elevation among neighboring cells in a DEM. Roughness was calculated using Eq. 2 (Ajay Kumar et al. 2020; Dar et al. 2020).

$${\text{Roughness}} = \frac{{{\text{FS}}_{{{\text{mean}}}} - {\text{FS}}_{{{\text{min}}}} }}{{{\text{FS}}_{{{\text{max}}}} - {\text{FS}}_{{{\text{min}}}} }}$$
(2)

where FSmin is minimum focal statistic, FSmax is maximum focal statistic and FSmean is mean focal statistic. Using the Neighborhood tool, minimum (FSmin), mean (FSmean) and maximum (FSmax) focal statistical thematic layers were created followed by creating the FSmin, FSmax and FSmean using Raster Calculator function in Map Algebra tool then using Eq. 2 (Kanagaraj et al. 2019), Roughness map was developed. Roughness of the present research area was categorized into five groups, i.e., < 0.31, 0.32–0.44, 0.45–0.56, 0.57 – 0.68 and 0.69–0.89, accounting for 6%, 21%, 49%, 19% and 5% of Jamsholaghat sub-basin, respectively.

The maximum slope's direction is parallel to the profile curvature. Profile curvature map of the present research area was developed in ArcGIS by using curvature function in 3D analyst tool (Nasir et al. 2018; Patra et al. 2018). Profile curvature map was categorized into three groups, i.e., convex, flat and concave, consisting of 34%, 54% and 15% of Jamsholaghat sub-basin, respectively.

Plan curvature is perpendicular to the maximum slope direction (Murmu et al. 2019; Al-Djazouli et al. 2020). Plan curvature map of the present research area was developed in ArcGIS by using curvature function in 3D analyst tool. Profile curvature map was categorized into three groups, i.e., convex, flat and concave consisting of 24%, 55% and 21% of Jamsholaghat sub-basin, respectively.

Multicollinearity checks for GWPZ

Multicollinearity is a type of problem related to statistics where at the minimum one input variable is highly correlated with other input variables of the model. It will result in a substantial level of precision in the output of the model. As a result, check for multicollinearity among the input variables before proceeding with the regression model is crucial (Montgomery et al. 2013). The parameters of multicollinearity (i.e., variance inflation factor (VIF) and tolerance) were estimated using Eqs. 3 and 4.

$${\text{Tolerance of the ith predictor variable }}\left( {{\text{T}}_{{\text{i}}} } \right) = 1 - R_{{\text{i}}}^{2}$$
(3)
$${\text{VIF of the ith predictor variable }}\left( {{\text{VIF}}_{{\text{i}}} } \right) = \frac{1}{{T_{{\text{i}}} }}$$
(4)

Multicollinearity problems exist if the tolerance value is less than 0.10 or the VIF is greater than equal to 10 (Montgomery et al. 2013). To survey the multicollinearity problem between all the thematic maps, 500 points (N = 500) were chosen randomly from the Jamsholaghat sub-basin and data was extracted from ArcGIS. Statistical Package for Social Science (SPSS) (v26) was used in order to check the multicollinearity problem.

AHP for GWPZ

AHP is a specific methodologies for determining the suitability of structures for their intended locations, and it is based on Prof. Saaty's MCDA (1987). Within a collection of reciprocal matrices, the AHP approach performs a comparison of different variables. The significance of GWPZ was used to assign a ranking from 1 to 9. In this ranking, a '1' denotes a limited region where no building is recommended, while a '9' indicates an outstanding zone for GWPZ (Saaty 1987). The 'Consistency Ratio (CR),' as advocated by Saaty (1990), was computed by using Eq. 5 to assess the consistency of rankings and weights allocated to distinct thematic maps and their sub-classes. A CR of less than equal to 0.1, according to Saaty (1990), suggests an acceptable decision to proceed with the AHP assessment.

$${\text{Consistency Ratio }}\left( {{\text{CR}}} \right) = { }\frac{{{\text{CI}}}}{{{\text{RCI}}}}$$
(5)
$${\text{Consistency Index}}, {\text{CI}} = \frac{{\lambda_{\max } - n}}{n - 1}$$
(6)

where n is the number of thematic maps, RCI is random consistency index and λmax is principal eigen value.

Overlay analysis for GWPZ

After allocating weights and ranks to all the thematic map as well as its sub-classes, the GWPZ map was created by considering the weighted overlay analysis in ArcGIS software thereby using Spatial analyst tool. Equation 7 shows the formula of overlay analysis.

$${\text{GWPZ}} = \mathop \sum \limits_{i = 1}^{n} (W_{i} \times R_{i} )$$
(7)

where Wi is thematic map weight and Ri is sub-classes rank.

Sensitivity analysis for GWPZ

The sensitivity analysis shows the impact of each input maps over output map. For the present study, two sensitivity analyses (i.e., map removal and single parameter) were used. Sensitivity index (SI) of map removal sensitivity analysis was estimated using Eq. 8 (Arshad et al. 2020).

$${\text{SI}} = \frac{{\left| {\left( {\frac{{{\text{GWPZ}}}}{N}} \right) - \left( {\frac{{{\text{GWPZ}}^{\prime } }}{n}} \right)} \right|}}{{{\text{GWPZ}}}} \times 100$$
(8)

where \({\text{GWPZ}}^{\prime }\) is GWPZ developed by rejecting one of the thematic map at a time, n is the total number of thematic maps considered in order to develop the \({\text{GWPZ}}^{\prime }\) and N is the total number of thematic maps considered in order to generate the GWPZ. Weight factor (W) of single parameter sensitivity analysis was estimated using Eq. 9 (Kumar et al. 2020).

$$W = \frac{{P_{{\text{r}}} P_{{\text{w}}} }}{{{\text{GWPZI}}}} \times 100$$
(9)

where Pr is ranks of each thematic map, Pw is Weights of thematic map and GWPZI is groundwater potential zone index estimated by considering all of the thematic maps.

Results and discussion

Lithology, Slope, LULC, DD, LD, Distance from stream, Rainfall, HSG, Geomorphology, TWI, TPI, Profile Curvature, Plan Curvature and Roughness maps of the Jamsholaghat sub-basin are shown in Figs 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13, 14, 15, 16. In order to identify the suitable locations for relief well, GWPZ map was delineated.

Fig. 3
figure 3

LULC map of the study area

Fig. 4
figure 4

HSG map of the study area

Fig. 5
figure 5

Slope map of the study area

Fig. 6
figure 6

DD map of the study area

Fig. 7
figure 7

Lithology map of the study area

Fig. 8
figure 8

LD map of the study area

Fig. 9
figure 9

Geomorphology map of the study area

Fig. 10
figure 10

Distance from river map of the study area

Fig. 11
figure 11

Rainfall map of the study area

Fig. 12
figure 12

TPI map of the study area

Fig. 13
figure 13

TWI map of the study area

Fig. 14
figure 14

Roughness map of the study area

Fig. 15
figure 15

Profile curvature map of the study area

Fig. 16
figure 16

Plan curvature map of the study area

Multicollinearity checks for GWPZ

Table 2 shows the parameters of the multicollinearity investigation. The findings shows that for all the thematic maps, the values of VIF were below the upper limit, i.e., 10 and the values of tolerance value were above the lower limit, i.e., 0.1. Hence, from the above results it was clear that there was no multicollinearity problem exists between all of the thematic maps.

Table 2 Collinearity statistics for all the parameters of GWPZ

AHP for GWPZ

AHP method was used in order to assign the weightage to each thematic map as well as their sub-classes. Pairwise comparison matrix and normalized pairwise comparison matrix for Jamsholaghat sub-basin are shown in Tables 3 and 4, respectively. Normalized pairwaise comparison matrix values were estimated by dividing value of each cell to that of column total value. Criteria weights were estimated by taking the average of row values. Computation of consistency for Pairwise comparison matrix was done by using Eqs. 7 and 8. Consistency index was estimated as 0.02 and consistency ratio was 0.01. As consistency ratio is 0.01 which is less than 0.1, hence, it can be concluded that, Pairwise comparison matrix is consistence and the weights estimated through Table 4 can be used in order to assign the weights to all the thematic maps. Weight and rank value for all the thematic maps are shown in Table 5. So as to increase the accuracy, each of the theme maps' sub-classes was allocated ranks according to their priority level by considering expert opinion/literature review.

Table 3 Pairwise comparison matrix for GWPZ
Table 4 Normalized pairwise comparison matrix for GWPZ
Table 5 Weight and rank of all thematic maps for GWPZ

LULC map for GWPZ

LULC map was assigned 8.1% weight according to AHP. In the LULC map, each sub-class, i.e., Forest, Agricultural, Built-up, Barren/Waste and Water bodies, was assigned ranks (i.e., from 1 to 9) as 8, 7, 2, 3 and 9, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. Water bodies were assigned highest rank due to highest infiltration followed by forest and agricultural land due to little bit less infiltration than water bodies. Built-up lands were assigned lowest rank as it has highly impermeable layer and least infiltration.

HSG map for GWPZ

HSG map was assigned 8.1% weight according to AHP. In the HSG map, each sub-class, i.e., A, B, C and D, was assigned ranks (i.e., from 1 to 9) as 9, 7, 5 and 3, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. Group A soil was assigned highest rank due to high infiltration rate, followed by Group B due to moderate infiltration, followed by Group C due to low infiltration. Group D was assigned lowest rank due to very low infiltration.

Slope map for GWPZ

Slope map was assigned 5.9% weight according to AHP. In the slope map, each sub-class, i.e., Very Gentle, Gentle, Moderate, Moderate Steep and Steep, was assigned ranks (i.e., from 1 to 9) as 9, 7, 5, 2 and 1, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS. The slope of the surface has a significant impact on surface runoff and infiltration rates. Higher slope produces low recharge, this is due to water flows with high velocity over steep slope and has less infiltration time through the surface toward subsurface. Flat and mild slopes receive the highest ranking. The low rank was given to slopes that are moderately steep and steep.

DD map for GWPZ

DD map was assigned 6% weight according to AHP. In the DD map, each sub-class, i.e., < 0.9, 0.91–1.76, 1.76–2.61, 2.62–3.45, 3.46–4.30 and 4.31–5.15, was assigned ranks (i.e., from 1 to 9) as 9, 7, 5, 3, 2 and 1, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. Permeability and DD are inversely related to each other. It means if DD is high, then it would result in high permeability and vice versa. Hence, it plays a vital role in obtaining the GWPZ map. As a result, a low DD was attributed to a high rank, and a high rank was allocated to a low rank.

Lithology map for GWPZ

Lithology map was assigned 16.9% weight according to AHP. In the lithology map, each sub-class, i.e., Granite, Gravel, Laterite, Quartz and Schist, was assigned ranks (i.e., from 1 to 9) as 5, 9, 7, 1 and 3, respectively. This is done by reclassify function in spatial analyst tool using ArcGIS software. Hydraulic conductivity is a key aquifer property that determines a lithology's groundwater recharge and storage potential. Higher the hydraulic conductivity higher the permeability and higher the infiltration. Based on this fact, highest rank was assigned to Gravel bed followed by Laterite, Granite, Schist and Quartz.

LD map for GWPZ

LD map was assigned 8.1% weight according to AHP. Each sub-class of the LD map, i.e., < 0.5, 0.5–1.1, 1.2–1.6, 1.7–2.2 and 2.3–2.8 km/km2, was assigned ranks (i.e., from 1 to 9) as 2, 4, 7, 8 and 9, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS. The density of lineaments was ranked according to their proximity. It was discovered that as one moves away from the lineaments, the level of groundwater potential reduces. High rank was assigned for high density due to high porosity and infiltration, whereas low rank was assigned for low density classes due to low porosity and infiltration.

Geomorphology map for GWPZ

Geomorphology map was assigned 10.7% weight according to AHP. In the geomorphology map, each sub-class, i.e., Hills and Valleys, Alluvial Plain, Pediment Pediplain complex, Flood plain and water bodies, was assigned ranks (i.e., from 1 to 9) as 2, 7, 6, 8 and 9, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. Water bodies were assigned highest rank due to high infiltration. Low rank was assigned to Hills and valleys due to less infiltration and steep slope.

Distance from river map for GWPZ

Distance from river map was assigned 13.6% weight according to AHP. In the distance from river map, each sub-class, i.e., < 300, 301–600, 601–900, 901–1200, 1201–1500 and > 1500 m, was assigned ranks (i.e., from 1 to 9) as 9, 8, 7, 6, 5 and 4, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. Closer the distance from river, higher the groundwater recharge and higher the infiltration whereas away from the river results in decrease in groundwater recharge and infiltration. Therefore, high rank was assigned to closer distance areas and lower rank was assigned to far away areas.

Rainfall map for GWPZ

Rainfall map was assigned 4.5% weight according to AHP. In the rainfall map, each sub-class, i.e., 1420–1471, 1472–1523, 1524–1575, 1576–1626 and 1627–1678 mm, was assigned ranks (i.e., from 1 to 9) as 5, 6, 7, 8 and 9, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. Infiltration is mainly influenced by two factors, i.e., intensity and rainfall duration. If the intensity of rainfall is high and duration of rainfall is short then it would result in high runoff whereas if the intensity of rainfall is low and duration of rainfall is long then it would result in low runoff. There were strong linear positive connections between groundwater recharge and rainfall. High rank was allocated to the regions with high rainfall, while low rank was allocated to the regions with low rainfall.

TWI map for GWPZ

TWI map was assigned 4.4% weight according to AHP. Each sub-class of the TWI map, i.e., high, moderate, low and very low, was assigned ranks (i.e., from 1 to 9) as 9, 7, 5 and 3, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. In general, low TWI area consist of hill and mountains having steep slope results in high runoff. However, high TWI area consist of flat regions with flat slope results in low runoff. Hence, High rank was given to higher TWI and low rank was given to lower TWI.

TPI map for GWPZ

TPI map was assigned 3.4% weight according to AHP. Each sub-class of the TPI map, i.e., [− 9]–[− 1.1], [− 1]–[− 0.29], [− 0.28]–[0.3], [0.3]–[1.1] and [1.2]–[1.3], was assigned ranks (i.e., from 1 to 9) as 9, 7, 5, 3 and 2, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. TPI differentiate the elevation of a cell to that of average elevation of the cell's surrounding neighborhood in the DEM. It determines the position of the topographic slope. Positive numbers show that a cell is higher than its neighbors, while negative values show that it is lower. It shows an inverse relationship with groundwater recharge. Therefore, high rank was given to lower TPI value and low rank was given to higher TPI values.

Roughness map for GWPZ

Roughness map was assigned 3.4% weight according to AHP. In the roughness map, each sub-class, i.e., < 0.31, 0.32–0.44, 0.45—0.56, 0.57–0.68 and 0.69–0.89, was assigned ranks (i.e., from 1 to 9) as 9, 7, 5, 3 and 2, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. The topography's undulation is often expressed by the roughness index. Roughness shows the linear relationship with undulation. Therefore, higher roughness implies higher surface runoff and low groundwater recharge. Roughness shows an inverse relationship with the recharge of groundwater. Hence, high rank was allocated to low roughness whereas low rank was allocated to high roughness.

Profile and plan curvature map for GWPZ

Profile and plan curvature map were assigned 3.4% and 3.5% weights according to AHP, respectively. For both profile and plan curvature maps, each sub-class, i.e., convex, flat and concave, was assigned ranks (i.e., from 1 to 9) as 3, 5 and 7, respectively. This was done by reclassify function in spatial analyst tool using ArcGIS software. In general, Water shows the deceleration trend toward the surface that is convex and accumulation trend toward the surface that is concave. Convex surface shows an inverse relationship with groundwater recharge whereas concave surface shows direct relationship with the recharge of groundwater. Therefore, higher rank was allocated to concave surface and lower rank was allocated to convex surface.

Suitability map of GWPZ

GWPZ map of the study area was prepared by overlaying Lithology, Slope, LULC, DD, LD, Distance from stream, Rainfall, HSG, Geomorphology, TWI, TPI, Profile/Plan Curvature, and Roughness maps in ArcGIS software by considering the weights as well as ranks given to all the 14 thematic maps and its sub-classes according to AHP. For weighted overlay method in ArcGIS, Eq. 10 was used.

$$\begin{aligned} {\text{GWPZ}} = \,\,& 0.169 \times {\text{Lithology}} + 0.136 \times {\text{Distance from river}} + 0.107 \times {\text{Geomorphology}} + 0.081 \times {\text{HSG}} \\ + & 0.081 \times {\text{LULC}} + 0.081 \times {\text{LD}} + 0.06 \times {\text{DD}} + 0.059 \times {\text{Slope}} + 0.045 \times {\text{Rainfall}} + 0.044 \times {\text{TWI}} \\ & + 0.034 \times {\text{TPI}} + 0.034 \times {\text{Roughness}} + 0.034 \times {\text{Profile Curvature}} + 0.035 \times {\text{Plan Curvature}} \\ \end{aligned}$$
(10)

According to the GWPZ map, the Jamsholaghat sub-basin was categorized into four zones, i.e., very high, high, moderate and low GWPZ consisting of 22%, 45%, 26% and 7% of the Jamsholaghat sub-basin, respectively. Figure 17 shows the GWPZ prepared using all thematic maps.

Fig. 17
figure 17

Suitability map of GWPZ for Jamsholaghat

From the Fig. 17, it was clear that 67% of the area consist of very high and high GWPZ categories. This was due to favorable conditions such as lithology (laterite and quartz), geomorphology (flood plain and water bodies), slope (very gentle and gentle), high/moderate rainfall, LULC (agricultural and forest) and high LD of the present research area, i.e., Jamsholaghat sub-basin. However, 33% of the area consist of moderate and low GWPZ categories. This was due to unfavorable conditions such as hills and valley, steep slope, low LD, and low rainfall.

Sensitivity analysis of GWPZ

Table 6 shows the estimation of map removal sensitivity analysis, whereas Table 7 shows the estimation of single parameter sensitivity analysis. From the Table 6, it was clear that Lithology and distance from river maps have great impact followed by Geomorphology, HSG, LULC, LD and DD have moderate impact and Roughness and Curvature maps have lowest impact in GWPZ computation. The values of variation index were high upon elimination of lithology map after that distance from the river map, geomorphology map. It reveals that the lithology map has a significant impact on GWPZ of the present research area. Table 7 reveals the effective weight in percentage of all the thematic maps. The result shows that, higher the empirical weight higher the effective weight.

Table 6 Statistical analysis of map removal sensitivity analysis for GWPZ
Table 7 Statistical analysis of single parameter sensitivity analysis for GWPZ

Table 8 reveals the changes in percentage of areas for different groups (i.e., Very high, high, moderate and low) of GWPZ after removing one thematic map at a time. The result reveals that exclusion of lithology map and distance from river map influences the GWPZ highly by increasing the low suitability area 42.5% and 30.5%, respectively. However, exclusion of TWI, TPI, Roughness and Curvature maps shows lower impact to GWPZ.

Table 8 Changes of the GWPZ with removal of the thematic map

Validation of GWPZ

A total of 33 well yield data were collect in order to validate the GWPZ using ROC. Out of 33 wells, only 3 wells were shown disagreement to the GWPZ map generated using ArcGIS software. The existing well over GWPZ map is shown in Fig. 18. The well yield data was compared to the produced GWPZ map using SPSS software for the validation process. The cross-validation of GWPZ map using ROC curve is shown in Fig. 19. The Area Under the Curve (AUC) is a statistic that measures how effectively a parameter can differentiate among two groups. The AUC of the GWPZ map is 0.84, indicating that the AHP approach produced very accurate predictions.

Fig. 18
figure 18

Existing well over GWPZ for Jamsholaghat

Fig. 19
figure 19

Cross-validation of the GWPZ map using ROC Curve

Conclusion

In Jamsholaghat sub-basin flood and drought situation occurs often during monsoon and non-monsoon season due to heavy rainfall, high runoff, low infiltration and also due to lack of proper storage facility. In this study, an attempt was made in order to locate the GWPZ by considering combination of AHP, GIS approach and remote sensing. Important part of GWPZ was to locate the most suitable location thereby increasing water storage capacity in aquifer, improving the groundwater table and reducing the impact to environment due to drought during non-monsoon period. Numerous procedures exist, some were insufficient for a particular zones due to provincial conditions, time-consuming and others because of social and political problems. However, use of GIS and remote sensing along with multicollinearity check and AHP methods improves the accuracy of the output as well saves a lot of time.

Previously attempts were made in order to develop the GWPZ map (Agarwal et al. 2013; Abrams et al. 2018; Andualem and Demeke 2019; Çelik 2019; Murmu et al. 2019 and Kumar et al. 2020) but maximum of 10 number of thematic maps were used (Awawdeh et al. 2013; Fashae et al. 2014; Maity and Mandal 2017; Pinto et al. 2017; Gnanachandrasamy et al. 2018; Choubin et al. 2019; Raju et al. 2019; Al-Djazouli et al. 2020 and Koli et al., 2020). Final accuracy of the GWPZ map lies between 0.7 and 0.78 (Nijagunappa, 2011; Magesh et al. 2012; Fenta et al. 2014; Kaliraj et al. 2014; Yeh et al. 2016; Das et al. 2018; Kumar and Krishna 2018; Lakshmi and Reddy 2018; Nasir et al. 2018; Patra et al. 2018; Arulbalaji et al. 2019; Hutti and Jahan et al. 2019; Raju et al. 2019; Arshad et al. 2020; Benjmel et al. 2020) without considering the multicollinearity check and sensitivity analysis. However, inclusion of multicollinearity check as well as sensitivity analysis improves the accuracy of the thematic map up to 0.84.

The novelty of the present study was to consider multicollinearity check and sensitivity analysis for GWPZ map. Furthermore, 14 thematic maps were being consider in order to delineate the GWPZ map. It was found that multicollinearity checks plays crucial role in selecting suitable locations as it checks uncertainty among the data hence, it increases the accuracy of the result. Sensitivity analysis shows the importance of each thematic map over others thereby validating the weights assigned using AHP to each thematic map.