1 Introduction

Floods are a naturally occurring phenomenon that affects human settlements and agricultural areas worldwide. While often associated with negative consequences, the presence of floods can result in environmentally sustainable benefits for areas that have been affected by prolonged droughts (Arya & Singh, 2021; Desalegn & Mulu, 2020). Floods, impacting numerous individuals worldwide, are amongst the most prevalent and catastrophic natural calamities that occur frequently, and their consequences are not limited to the loss of human lives but also result in significant economic and environmental damages (Abu El-Magd et al., 2020). Climate change is one of the major problem world is facing now a days. Due to climate change rainfall duration is slowly reducing and rainfall intensity is slowly increasing, as a result of that flood frequencies are increasing (Ehteram et al., 2022; Lai et al., 2022; Polong et al., 2023). According to the United Nations, floods are responsible for 70% of all natural disasters worldwide, and in the past few years, Floods have become more frequent and intense as a result of changes in the climate and activities by humans (Vörösmasrty et al., 2005). Experts around the world have collaborated to conduct organized attempts to chart out the likelihood of floods and simulate flooding scenarios (Hussain et al., 2021; Ogato et al., 2020; Velasco et al., 2016). Floods are mostly caused by protracted periods of heavy, intense rainfall in a specific location. Insufficient capabilities of embankments, drainage channels, and waterways as well as improper disposal of solid waste, are significant factors contributing to early monsoon season floods (Arrighi, 2021; Singh et al., 2020).

Several researchers have utilized the Analytical Hierarchical Process (AHP) developed by Saaty (1987), in conjunction with the Geographic Information System (GIS), to conduct flood analysis (Amare & Okubay, 2019; Chen et al., 2018; Memon et al., 2020; Morea & Samanta, 2020; Yang et al., 2018). Accurately identifying the causative factors of floods is crucial due to their associated impacts. Flood risk mapping is a complex process that requires the integration of various data sources and analytical techniques. Several approaches have been proposed for flood risk mapping, including empirical, statistical, and physical models. The Flood Risk Index (FRI) map is designed to display various classes of seriousness zones based on predetermined characteristics and weights. This map has been developed using AHP and GIS datasets to provide accurate and reliable information. It is essential to understand that the FRI map's severity zones are determined according to the selected criteria and their corresponding weights. Identifying flood-prone areas and implementing measures to mitigate risks is crucial for disaster management, particularly in developing countries facing increasing climate change-related challenges (Chakraborty & Mukhopadhyay, 2019). While structural mitigation of flooding methods may require extensive ground elevation and cross-sectional data, non-structural approaches combined with geospatial data and the Multi-criteria Decision Analysis (MCDA) approach are commonly used to identify high-risk areas in regions with limited data. As a result, mapping flood risk areas has grown crucial for disaster planners to do effectively mitigate and manage the impact of disasters (Dayala et al., 2020; Merz et al., 2021; Nasiri et al., 2016; Rincon et al., 2018). The Srikakulam district near the coastal region of Andhra Pradesh, India, has experienced significant flooding, with occurrences in the years 1970, 1979, 1987, 1994, 1998, 2002, 2006, 2009, 2011, 2012, 2014, 2016, 2018, 2020, 2021 and 2022. This happens due to presence of two rivers i.e., Nagavali River and Vamsadhara River. Insufficient historical data is a considerable obstacle in flood analysis and the sustainable management of watersheds that are predisposed to flooding.

Despite the prevalence of flooding worldwide, there has been insufficient research conducted on data-poor sub-basins when it comes to analyzing flood data (Gandini et al., 2020). This includes critical information such as cross-sectional discharge data, satellite images of flood inundation, and observed flood marks from past events that can lead to flooding in developing nations. Empirical models are appropriate for areas with a lengthy history of flooding since they are based on previous flood information and statistical evaluation (Samela et al., 2018). However, in data-scarce regions, empirical models may not be effective due to the lack of adequate data. Statistical models use statistical techniques to analyze the relationship between flood variables and additional elements that raise the probability of flooding, such as rainfall, land use, and topography (Petroselli et al., 2019; Vojtek & Vojtekova, 2019). Statistical models are suitable for data-scarce regions as they require only a few key variables to predict flood risk. Physical models are based on the principles of hydraulics and hydrology and simulate the flood behavior of a river system (Baquedano & Ferreira, 2021; Martinez-Gomariz et al., 2021; Requena et al., 2018). Physical models require detailed data on river morphology and hydrological parameters, which may not be available in data-scarce regions.

GIS is an influential tool for geographical information analysis and management and has been widely used for flood risk mapping. GIS can integrate different data from various sources, such as topographic maps, land use maps, and rainfall data, and provide a spatially explicit representation of flood risk (Bonazza et al., 2018; Doorga et al., 2022; Ferreira & Santos, 2020; Gazi et al., 2019; Miranda & Ferreira, 2019; Quinn et al., 2019; Teng et al., 2017; Xiao et al., 2021). MCDA is an evaluative technique that can be utilized to assess the comparative significance of diverse criteria and sub-criteria in the context of flood risk assessment. MCDA can be used to assign weights to different flood risk factors and develop a composite flood risk index (Baquedano & Ferreira, 2021; Mundhe, 2019; Prieto et al., 2020; Samela et al., 2018).

Hydrological parameters, crucial in understanding the flow and behavior of water systems, benefit significantly from the integration of RS and GIS (Palacio-Aponte et al., 2022). RS technologies, such as thermal infrared imagery and radar, provide valuable data on soil moisture, river morphology, and precipitation patterns. GIS processes this information to model and simulate hydrological processes, contributing to the identification of potential flood risk zones and the development of effective floodplain management strategies (Hermas et al., 2021; Schroeder et al., 2016).

In disaster management and resilience planning, the collaboration between RS and GIS is a game-changer. The dynamic mapping capabilities of GIS, fueled by continuously updated RS data, empower authorities to make informed decisions during emergency response and recovery phases (Khosravi et al., 2019). By incorporating real-time RS observations into GIS, emergency responders can assess the evolving situation on the ground, optimize resource allocation, and implement targeted interventions to mitigate the impact of flooding events (Rincon et al., 2018; Shahabi et al., 2021).

The integration of RS and GIS in flood risk mapping represents a synergistic approach that harnesses the strengths of both technologies (Kienberger et al., 2009). This collaboration not only enhances the accuracy of flood risk assessments but also provides a holistic understanding of the complex interactions between various environmental factors. As the technology continues to advance, the RS-GIS partnership is poised to play an increasingly vital role in shaping resilient communities and mitigating the devastating consequences of flooding worldwide (Solomun et al., 2021; Vojtek & Vojtekova, 2019).

In areas where data is scarce, flood risk assessment and management become challenging owing to the inadequacy of necessary facts and information required for accurate flood risk mapping (Amare & Okubay, 2019). To address this issue, Two commonly used technologies for mapping flood risk are GIS and MCDA, their integration can provide a comprehensive approach for evaluating and managing flood risk in data-scarce regions (Martinez-Gomariz et al., 2021; Morea & Samanta, 2020). The Geospatial technique was utilized to create the Flood Vulnerability Index (FVI) and Flood Hazard Index (FHI) for Andhra Pradesh's Srikakulam district in India. These indices were designed to assist local stakeholders and watershed managers in managing excessive water and drought situations. The criteria and sub-criteria for the Flood Risk Index (FRI) map were analyzed using the AHP and MCDA techniques (Abu El-Magd et al., 2020; Desalegn & Mulu, 2020; Xiao et al., 2021). Nonetheless, creating an inundation map of flood for a farming rural zone necessitates an extremely precise digital elevation model, stage-discharge information, and surveyed transverse profiles, which are not available in the current research region. Consequently, the current research work endeavored to produce a flood risk map utilizing the GIS-AHP method in the deficiency of these datasets.

The aim of this study is to create a FRI by utilizing 18 thematic maps in ArcGIS software and applying the AHP. This research stands out due to its utilization of a larger number of thematic maps, specifically 18, compared to the previous maximum of 10 to 12 maps. Furthermore, the study incorporates a thorough assessment of multicollinearity and sensitivity analysis. By including these checks and using a greater number of thematic maps, the accuracy of the FRI map is significantly enhanced. Consequently, the FRI map provides highly reliable data that can greatly contribute to effective long-term watershed management in flood-prone areas.

2 Materials and methods

2.1 Research region

The study conducted in the Srikakulam District of Andhra Pradesh focuses on the Nagavali and Vamsadhara fluvial systems within the coastal tracts. This particular research region spans approximately 5840 km2, located between latitudes 18° 40′ N–18° 09′ N and longitudes 84° 10′ E–83° 39′ E. The Nagavali and Vamsadhara rivers are two prominent river systems in northern Andhra Pradesh, distinctly marked by the Eastern Ghats, with a gradual slope from north to south. Originating from a hill near Lakhbahal village in the Kalahandi District of Odisha, the Nagavali river traverses through Nakrundi, Kerpai, Kalyansinghpur, and Rayagada in Odisha, before merging into the Bay of Bengal near Kallepalli village in Srikakulam, Andhra Pradesh. The total length of the Nagavali river is 256 km, with 160 km falling within Odisha and the remaining portion flowing through Andhra Pradesh. In contrast, the Vamsadhara river stretches for a distance of 254 km until it meets the Bay of Bengal at Kalingapatnam in Andhra Pradesh. The defined study region's Digital Elevation Model (DEM) is illustrated in Fig. 1.

Fig. 1
figure 1

DEM of the study area

2.2 Data procurement

In order to acquire the necessary data for the study, multiple sources were utilized to ensure access to reliable and diverse information. The 12.5 m resolution of DEM data was obtained from the Alaska Search Facility in Earth Data, which is conveniently accessible online at https://search.asf.alaska.edu/#/. Conversely, Landsat 8 data were sourced from Bhuvan, the National Remote Sensing Centre (NRSC), freely available at http://bhuvan.nrsc.gov.in. Rainfall data were acquired from the Center for Hydrometeorology & Remote Sensing (CHRS) and can be accessed at https://chrsdata.eng.uci.edu/. To gather information regarding lithology and road networks, data at a scale of 1:50,000 were procured from the Geological Survey of India (GSI), accessible at https://bhukosh.gsi.gov.in/Bhukosh/Public. The soil type and soil depth map, also at a scale of 1:50,000, were obtained from the National Atlas and Thematic Mapping Organization's district planning map series. Population density data were collected from the Census of India, National Statistical Office, while crop production data were procured from the Agricultural Department in Srikakulam, Andhra Pradesh.

2.3 Methodology

In order to effectively manage floods and minimize the impact on infrastructure and human lives, it is essential to identify the Flood Risk Index (FRI) map. The primary goal of current study is to evaluate the possible threat of flood risk in the Srikakulam district by employing spatial analytical tools and the AHP. By integrating flood vulnerability and hazard maps, the FRI was computed. The determination of main classes and sub-classes weights was based on an extensive literature review and expert input. In contest to spatial resolution of map, the pixel size having matrix with 6344 columns and 5597 rows and a cell size of 12.5 m × 12.5 m was used for all the thematic maps. For conversion of pixel size, “Nearest Neighbor method” was used for discrete data (i.e., Lithology, Land Use and Land Cover (LULC), Distance from River (Dr), Population Density, Crop Production and Road River Interaction) whereas “Cubic method” was used for continuous data (i.e., Hydrological Soil Group (HSG), Slope, Drainage Density (DD), Rainfall, Permeability, Runoff, Normalized Difference Vegetation Index (NDVI), Normalized Difference Built-up Index (NDBI), Modified Normalized Difference Water Index (MNDWI), Topographic Wetness Index (TWI), Profile and Plan curvature).

2.4 Development of FRI map

To generate the spatial database for the FRI map, various meteorological and topographical datasets were utilized. These datasets were then integrated with a GIS platform. The construction of the FRI map involved the consideration of a total of eighteen factors. These factors were carefully selected and analyzed to ensure a comprehensive assessment of flood risk. The present study employed fifteen thematic maps for Flood Hazard Index (FHI) determination. Additionally, three thematic maps, namely Population Density, Crop Production, and Road-River Interaction, were prepared for Flood Vulnerability Index (FVI) analysis. All of these maps were generated using ArcGIS software. To ensure the reliability of the analysis, a multicollinearity check was conducted on all 18 thematic maps. Subsequently, the thematic maps were overlaid in ArcGIS after allotting ranks and weights using the AHP. Sensitivity analysis was then performed for both the FHI and FVI maps. The FRI map for the research region was produced by utilizing the weighted sum technique in ArcGIS software to combine the FHI and FVI maps as shown in Eq. 1.

$${\text{FRI}}={\text{FHI}}\times {\text{FVI}}$$
(1)

To validate the map of FRI, the Receiver Operating Characteristic (ROC) curve was considered. This comprehensive approach, utilizing advanced GIS techniques and analytical methods, facilitated the development of an accurate and validated FRI map, which is crucial for effective flood management and mitigation strategies. The diagram illustrating the process of creating the FRI can be observed in Fig. 2.

Fig. 2
figure 2

Flowchart for delineating the FRI map

2.5 Development of FHI map

During the development of the FHI map, determining the underlying reasons for flood hazards was of utmost importance. In this specific research, multiple factors were considered significant in creating the map. These factors include Runoff, Slope, LULC, Lithology, NDVI, MNDWI, NDBI, HSG, DD, Rainfall, Dr, TWI, Profile, and Plan Curvature. The selection of these factors was based on a thorough literature review and their relevance to the research region (Chen et al., 2018; Doorga et al., 2022; Hussain et al., 2021). The following sections were presented the comprehensive approach employed to generate the digital maps with spatial information for all the chosen variables.

2.6 Runoff map

Runoff refers to the movement of water, such as rainwater or melted snow, over the Earth's surface. It happens when the amount of precipitation is greater than the soil's ability to retain or absorb water. Instead, the excess water flows over the land, typically following the natural slope or topography, and collects in streams, rivers, lakes, or other bodies of water (Arya & Singh, 2021). Initially, the catchment area was partitioned into multiple sub-catchments based on Hydrological Soil Group (HSG). Subsequently, the catchment area was further divided into several sub-catchments based on drainage points. In every sub-catchment, the runoff depth was computed through the employment of the Soil Conservation Service (SCS) curve number technique. The estimation of runoff depth involves considering parameters such as curve number, HSG, rainfall, and LULC characteristics specific to the research region. In the framework of this investigation, the runoff map was sorted into four distinct groups based on the depth of the runoff. The method utilized for classifying these groups was the "Natural Break" technique in ArcGIS.

Within the runoff map, the sub-classes (ranging from 29 to 38 mm, 39 to 71 mm, 72 to 111 mm, and 112 to 145 mm) were allocated ranks from 1 to 9. To accomplish this, the reclassify function within the spatial analyst tool of ArcGIS software was utilized. The reclassification procedure involved assigning a rank of 1 to the lowest runoff values, while higher runoff values received higher ranks. This ranking scheme was based on the underlying principle that higher runoff poses a greater risk of flooding, thus deserving a higher rank, whereas lower runoff corresponds to a lower risk of flooding, warranting a lower rank.

2.7 Slope map

The slope affects surface runoff, which in turn affects groundwater recharge. To assess the slope characteristics in the research region, a Slope map was generated using the DEM (Amare & Okubay, 2019). The Spatial Analyst Tools in GIS software were utilized, selecting the Surface option to generate the slope map. Following the generation of the slope map, it was then divided into five distinct classifications based on the measured slope inclination in degrees. The classification method employed for these groups was the "Natural Break" technique within the ArcGIS framework.

The slope map was classified into sub-classes such as Very low, Low, Moderate, High, and Very High, which were assigned ranks ranging from 1 to 9. This reclassification process was accomplished using the reclassify function within the ArcGIS spatial analyst tool. The ranking scheme was determined based on the understanding that the slope of the surface plays a crucial part in influencing both surface runoff and infiltration rates. Steep or high slopes tend to result in rapid rainwater runoff due to increased velocity. Conversely, flat or low slopes are more susceptible to quick waterlogging or flooding situations and have higher rates of infiltration. Hence, the highest rank was given to the sub-class representing very low or flat slope, indicating a lower risk, while the sub-class representing very high slope received the lowest rank, signifying a higher risk.

2.8 LULC map

The LULC map provides a visual representation of various Land cover-use within the research region, including agricultural land, wetlands, forests, built-up areas and water bodies, among others (Bonazza et al., 2018). For this research, the Earth Explorer website was used to gather the maps of Landsat 8. Landsat 8 data were collected on 12th April, 2022 with 0% cloud cover. The spatial resolution of Landsat image used was 30 m by 30 m. The image was first classified into different sub-classes using supervised image classification followed by resampling in ArcGIS. During dry season the Landsat images were collected. For image classification, the band numbers of the blue, green, and red colors were adjusted from low to high values. The image classification tools were then utilized to classify the image according to various land types. Enhancing the training sample numbers for each land category increased the accuracy of the LULC map. As the study area is small, for each sub-classes a 50 training samples were selected. This research area underwent classification into five primary classes, namely Forest, Barren, Agricultural, Water Bodies, and Built-up. The method employed for classifying these groups was the "Land use unit" in the ArcGIS framework. Hence, a total of 250 training samples were selected. Polygons were used to obtain the samples. After preserving the samples of training, the Maximum Likelihood classification technique was utilized in ArcGIS software to develop the final LULC map. For validation purpose, for each categories 50 points were selected and using Error matrix, an overall accuracy of 86% and kappa coefficient of 0.84 were obtained.

Within the LULC map, various sub-classes such as Forest, Barren/Waste, Agricultural, Water bodies and Built-up were given ranks ranging from 1 to 9. The LULC plays a significant role in influencing hydrological processes, including precipitation, infiltration, base flow, interflow and evapotranspiration. Built-up, barren land and water bodies tend to be more susceptible to floodwater due to higher runoff, whereas agriculture and forests exhibit relatively lower runoff characteristics. As a result, water bodies were assigned the highest rank, indicating their significant contribution to floodwater. This was followed by built-up areas and barren land cover, both of which possess increased vulnerability to flooding. Forest regions, on the other hand, were assigned the lowest rank, reflecting their ability to mitigate runoff through their vegetation and infiltration capacities.

2.9 Lithology map

The lithology map provides information about the different rock types present in the subsurface of a research region (Rincon et al., 2018). In the current research, the lithology map specific to the research region was obtained from the GSI by utilizing the coordinate extends of the research region. To enhance the accuracy of the lithology map, additional information was gathered from the Central groundwater board (CGWB) situated in Visakhapatnam, Andhra Pradesh, India. Subsequently, this information was compared and authenticated with the lithology map procured from the GSI. Using the information gathered from the field, the lithographic map was corrected and improved. The predominant geological features within the research area can be classified into four distinct rock structures. The classification method employed for these groups utilized the "Lithological unit" in the ArcGIS platform.

Within the lithology map, different sub-classes such as Granite, Gneiss, Laterite, and Quartz were given ranks ranging from 1 to 9. This reclassification was performed using the reclassify function within the ArcGIS software's spatial analyst tool. Lithology, in combination with topography, plays a crucial role in influencing flooding patterns. For instance, when impermeable lithology underlie low-lying areas or valleys, they hinder drainage and contribute to water accumulation, thereby increasing the risk of flooding. Conversely, permeable lithology underlying elevated areas facilitate natural drainage and reduce the likelihood of flooding. Considering these factors, the highest rank was assigned to Granite due to its impermeability and favorable influence on flood. Gneiss received the second-highest rank, followed by Laterite and Quartz. This ranking scheme was based on the understanding that lithology with greater impermeability, such as Granite, have a positive impact on flood risk, while lithology with lower impermeability, such as Laterite and Quartz, have a comparatively lesser influence.

2.10 NDVI map

NDVI is an index commonly used in remote sensing and vegetation studies to assess the health, density, and vigor of vegetation cover (Martinez-Gomariz et al., 2021). NDVI was estimated by considering the red and near-infrared (NIR) bands of remote sensing imagery, typically obtained from satellites or aerial sensors. By applying Eq. 2 to the Landsat 8 imagery through the raster calculator function available in ArcGIS software, the NDVI map for the research region was prepared.

$${\text{NDVI}}=\frac{{\text{NIR}}-{\text{RED}}}{{\text{NIR}}+{\text{RED}}}$$
(2)

The NDVI map was subdivided into five groups based on the NDVI values. The classification method employed for these groups was the “Equal Interval” technique within the ArcGIS platform. Within the NDVI map, various sub-classes such as Very Low, Low, Moderate, High, and Very High were assigned ranks varying from 1 to 9. This reclassification process was achieved using the reclassify function within the ArcGIS software's spatial analyst tool. The NDVI is a measure that reflects the density and health of vegetation in an area. Negative value of NDVI generally represents the water, while positive value of NDVI represents the vegetation. The relationship between NDVI and flooding is inversely correlated. Higher value of NDVI represents a greater existence of healthy vegetation, which can contribute to better water absorption, reduced surface runoff, and increased infiltration capacity. Therefore, areas with higher NDVI values tend to have a lower probability of flooding. Based on this understanding, the highest rank was allocated to the Very Low NDVI sub-class, indicating a reduced flood risk, whereas the Very High NDVI sub-class assigned the lowest rank, suggesting a higher vulnerability to flooding.

2.11 NDBI map

NDBI is a remote sensing index commonly used to detect and quantify built-up areas within an urban or rural landscape (Ferreira & Santos, 2020). NDBI was derived from satellite or aerial imagery and is calculated using the NIR and short-wave infrared (SWIR) spectral bands. Equation 3, which refers to the formula for calculating the NDBI, was implemented using the Landsat 8 imagery through the raster calculator function in ArcGIS software. This process involved performing the necessary calculations on the respective spectral bands to derive the NDBI values for each pixel in the research region.

$${\text{NDBI}}=\frac{{\text{SWIR}}-{\text{NIR}}}{{\text{SWIR}}+{\text{NIR}}}$$
(3)

The NDBI map underwent segmentation into five groups, categorized according to NDBI values. The classification method utilized for these groups was the "Equal Interval" technique within the ArcGIS platform. Within the NDBI map, different sub-classes such as Very Low, Low, Moderate, High, and Very High were assigned ranks varying from 1 to 9. This reclassification process was accomplished using the reclassify function within the ArcGIS software's spatial analyst tool. The NDBI is a measure that assesses the extent and density of built-up or urban areas in a region. Negative values of NDBI generally represents the existence of water bodies, whereas positive value of NDBI represents the existence of vegetation or built-up areas within urban settings. The relationship between NDBI and flooding is also inversely correlated. Higher NDBI values indicate a greater density of built-up areas, which can result in increased surface runoff and reduced infiltration capacity. Therefore, areas with higher NDBI values tend to have a higher probability of flooding. Conversely, lower NDBI values indicate the presence of water bodies or less dense urban areas, which typically have a lower risk of flooding. As a result, the ranking scheme was established, with the Very Low NDBI sub-class assigned the highest rank, indicating a lower probability of flooding, whereas the Very High NDBI sub-class allocated the lowest rank, suggesting a higher vulnerability to flooding.

2.12 MNDWI map

The MNDWI is a remote sensing index that is used in aerial or satellite photography to locate and map water bodies. MNDWI was particularly effective in differentiating water bodies from other land cover types, such as vegetation and built-up areas (Xiao et al., 2021). Equation 4, was used for estimating the MNDWI map by considering Landsat 8 image through raster calculator function in ArcGIS.

$${\text{MNDWI}}=\frac{{\text{GREEN}}-{\text{SWIR}}}{{\text{GREEN}}+{\text{SWIR}}}$$
(4)

The MNDWI map was divided into five groups based on MNDWI values. The classification method employed for these groups was the “Equal Interval” technique within the ArcGIS platform. Within the MNDWI map, different sub-classes such as Very Low, Low, Moderate, High, and Very High were assigned ranks varying from 1 to 9. This reclassification process was conducted using the reclassify function within the ArcGIS software's spatial analyst tool. The MNDWI is a remote sensing index that helps identify water bodies based on the spectral properties of the imagery. Negative values of MNDWI generally represents the existence of vegetation, whereas positive values of MNDWI represents the existence of water bodies. In contrast to the previous indices discussed, the relationship between MNDWI and flooding is directly correlated. Higher MNDWI values suggest a greater presence of water bodies, indicating a higher flooding probability. This is because regions with higher values of MNDWI are more likely to have increased surface runoff and reduced infiltration capacity, which contribute to flood risk. Consequently, the ranking scheme was established, with the Very Low MNDWI sub-class assigned the lowest rank, indicating a lower probability of flooding, while the Very High MNDWI sub-class allocated the highest rank, indicating a higher vulnerability to flooding.

2.13 HSG map

The penetration and seepage velocities from the top layer to the underground, encompassing water-bearing layers, are largely impacted by the constitution of the Soil. The HSG has been sorted into four groups by the United States Department of Agriculture (USDA), which are A, B, C, and D (Teng et al., 2017). To represent these soil groups, a new shapefile was created, and polylines were drawn based on the soil groups considering the editor tool in present in the ArcGIS. The attribute table was then utilized to assign the corresponding soil groups to the generated polygons by joining the polylines. A map of the soil was created by consolidating the polygons belonging to identical soil groups in the attribute table with the use of the editor tool.

Within the research region, the prevalent soil groups were categorized into four types. The classification method employed for these groups was the "Soil group unit" technique within the ArcGIS platform. Within the HSG map, the different sub-classes, namely A, B, C, and D, were allocated ranks ranging from 1 to 9. This reclassification process was performed using the reclassify function within the ArcGIS software's spatial analyst tool. The soil types present in an area plays a critical part in determining its susceptibility to flooding. The capacity of the soil to hold water and its ability to absorb water into its layers have a significant effect on the extent of flooding. In this regard, the ranking scheme was established. The highest rank was assigned to Group D soil due to its low infiltration rate, indicating a higher potential for surface runoff and reduced infiltration capacity. Following that, Group C soil received the next highest rank due to its moderate infiltration capacity. Group B soil was assigned a lower rank than Group C due to its higher infiltration capability. Lastly, Group A soil, characterized by very high infiltration capacity, was assigned the lowest rank. This ranking reflects the relationship between soil type and flood susceptibility, with Group D being the most vulnerable to flooding and Group A being the least susceptible due to its high infiltration capacity.

2.14 DD map

The density of drainage is a significant parameter that indicates the spacing and composition of stream channels, offering insights into the mean stream channel length throughout the entire basin (Yang et al., 2018). In order to generate the DD map for the research region, the DEM was utilized. The process involved selecting the "Hydrology" option within the "Spatial Analyst Tools" in ArcGIS software. Initially, DEM was referenced geographically and transformed to a projected coordinate system. Then, the map of flow direction was produced, which was succeeded by the development of the map of flow accumulation. To ensure accuracy, values less than 500 pixels in the flow accumulation map were excluded. The stream system of the research region was subsequently produced utilizing the flow direction and flow accumulation maps. By utilizing the produced stream network, the stream hierarchy of the sub-basin was established. Furthermore, the line density feature in ArcGIS software was utilized to create the DD map.

The DD map illustrates the concentration of drainage in the research area, segmented into five classifications. The classification method employed for these groups was the "Equal Interval" technique within the ArcGIS platform. Within the DD map, different sub-classes such as Very Low, Low, Moderate, High, and Very High were assigned ranks varying from 1 to 9. This reclassification process was carried out using the reclassify function within the ArcGIS software's spatial analyst tool. There exists an inverse relationship between flood hazard and DD. Higher values of DD indicate a lower risk of flooding, while lower values of DD indicate a higher flood risk. Therefore, DD plays a crucial role in generating the FHI map. In light of this relationship, a ranking scheme was established. A high DD, indicating a higher potential for efficient drainage and reduced flood risk, was assigned a low rank. On the other hand, a low DD, indicating a lower ability for efficient drainage and a higher flood risk, was assigned a high rank. By incorporating DD into the analysis, the resulting FHI map benefits from this vital information, enabling a comprehensive assessment of flood hazards.

2.15 Rainfall map

Surface runoff is significantly influenced by rainfall, making it a vital natural resource (Memon et al., 2020; Ray, 2023). In this study, precipitation data covering a time frame of three decades (1993–2022) was gathered from various rain gauges operated by the Indian Meteorological Department (IMD) located in the vicinity and adjacent areas of the study site's buffer zone. To examine the geographical arrangement of precipitation, an ArcGIS software was utilized to generate a map displaying the distribution of rainfall. Initially, the annual average rainfall data for all the rain gauges was estimated in Excel. Afterwards, the information was integrated into ArcGIS utilizing the latitude and longitude coordinates of the precipitation measuring devices. In order to display the spatial arrangement, an interpolation technique known as Inverse Distance Weight (IDW) was executed in ArcGIS, leading to the development of the rainfall map for the designated study area.

The rainfall map was categorized into five distinct groups. The classification technique utilized for these groups was the "Natural Break" method within the ArcGIS platform. Within the map, different sub-classes representing specific rainfall ranges, such as 918–1043 mm, 1044–1100 mm, 1101–1158 mm, 1159–1222 mm, and 1223–1342 mm, were assigned ranks ranging from 1 to 9. This reclassification process was performed using the reclassify function within the ArcGIS software's spatial analyst tool. Infiltration, which refers to the process of water seeping into the soil, is affected by mainly two key parameters: intensity and duration of rainfall. Higher intensity combined with shorter duration leads to increased runoff, while lower intensity coupled with longer duration results in decreased runoff. There exists a strong linear positive relationship between floods and rainfall. Regions experiencing high rainfall were assigned a high rank, indicating a greater likelihood of flooding, while areas with less rainfall were allocated a lesser rank, suggesting a lower flood risk. This ranking scheme reflects the influence of rainfall on the runoff process and its impact on flood occurrence. Regions with higher rainfall amounts are more susceptible to floods due to increased runoff, while regions with lower rainfall amounts are less prone to flooding due to reduced runoff.

2.16 Distance from river map

With the aid of the ArcGIS program, a map indicating the distance from the river was created for the study area. The procedure consisted of various stages. At first, the flow accumulation was modified to a threshold of 500 pixels by utilizing the map algebra function in the spatial analyst tool. Subsequently, the stream network was produced. To formulate the distance from river map, the multi-ring buffer function was employed, which was based on the distances that were calculated (Singh et al., 2020).

The research area underwent division into six distinct groups based on distances. The classification method utilized for these groups was the "Manual" technique within the ArcGIS platform. Within this map, different sub-classes representing specific distance ranges from the river, such as 500 m, 1000 m, 2000 m, 3000 m, and distances greater than 3000 m, were assigned ranks ranging from 1 to 9. This reclassification process was performed using the reclassify function within the ArcGIS software's spatial analyst tool. The distance from a river has a significant influence on flood hazard and infiltration. Areas located closer to the river generally experience higher flood hazards and greater infiltration rates. On the other hand, areas farther away from the river tend to have lower flood hazards and reduced infiltration. As a result, higher ranks were assigned to areas in close proximity to the river, indicating a higher flood hazard and infiltration potential. Conversely, lower ranks were assigned to areas located farther away from the river, indicating a decreased flood hazard and infiltration potential. This ranking scheme captures the relationship between the distance from the river and the associated flood risk. Areas closer to the river are more susceptible to flooding due to their proximity, which can lead to higher flood hazards and infiltration rates. In contrast, areas located at greater distances from the river are generally less prone to flooding, resulting in reduced flood hazards and infiltration.

2.17 Permeability map

Permeability of soil refers to its ability to allow the movement of water or other fluids through its pores or spaces. It is a crucial property that affects the drainage and infiltration characteristics of soil (Dayala et al., 2020; Velasco et al., 2016). The permeability map was prepared by considering soil map and lithology map of the research region. Both maps were overlapped in the ArcGIS by assigning weightage to each thematic maps as well as its sub-classes by considering AHP. The permeability map was segmented into four categories. The classification method used for these groups was the "Equal Interval" technique within the ArcGIS platform. Within this map, different sub-classes representing varying levels of permeability, such as Low, Moderate, High, and Very High, were assigned ranks varying from 1 to 9. This reclassification process was conducted using the reclassify function within the ArcGIS software's spatial analyst tool. Permeability refers to the soil's capacity to allow the movement of water or other fluids through its pores or spaces. It is a crucial characteristic that significantly influences the drainage and infiltration properties of the soil. In the context of flood susceptibility, permeability and flood are inversely related. Soils with low permeability tend to impede water movement, leading to increased surface runoff and higher flood potential. Conversely, soils with high permeability facilitate the movement of water, enhancing drainage and reducing the likelihood of flooding. Therefore, a high rank was assigned to areas with low permeability, indicating reduced water movement and higher flood susceptibility. Conversely, a low rank was assigned to areas with high permeability, indicating enhanced water movement and lower flood susceptibility.

2.18 TWI map

ArcGIS was used to build the TWI map. Initially, the slope map was prepared utilizing the spatial analyst tool, and afterward Eq. 5 was employed in the raster calculator feature, utilizing map arithmetic, to produce the TWI map (Ray, 2023). The equation incorporates the upslope (α) and topographic gradient (β) parameters.

$${\text{TWI}}={\text{ln}}\frac{\mathrm{\alpha }}{\mathrm{tan\beta }}$$
(5)

The research area's TWI map underwent categorization into five groups. The classification approach employed for these divisions was the "Equal Interval" technique, implemented within the ArcGIS platform. Sub-classes within the TWI map, such as very high, high, moderate, low, and very low, were allocated ranks ranging from 1 to 9. This reclassification process was performed using the reclassify function within the ArcGIS software's spatial analyst tool. The TWI provides an indication of wetness within a landscape based on topographic characteristics. In general, areas with a low TWI, such as hilly and mountainous regions with steep slopes, tend to experience high runoff and are considered low-risk zones for flooding. On contrast, regions with a higher values of TWI, such as flat regions with gentle slopes, exhibit low runoff and are associated with a higher risk of flooding. Consequently, higher ranks were allocated to regions with higher values of TWI, representing flatter regions with lower runoff potential and a higher risk of flooding. Conversely, lower ranks were allocated to regions with lower TWI values, representing hilly or mountainous areas with steeper slopes and a lower risk of flooding.

2.19 Profile and plan curvature map

The profile curvature denotes the orientation of the highest incline, which is identical to it (Amare & Okubay, 2019). The map of profile curvature for the current study region was produced in ArcGIS by utilizing the curvature function in the 3D analyst tool. The plan curvature indicates the curvature that is at right angles to the direction of the steepest slope (Quinn et al., 2019). The map of plan curvatures for the current study site was produced in ArcGIS by utilizing the 3D analyst tool's curvature feature.

The categorization of the profile and plan curvature map involved dividing it into three groups. The classification method applied to these groups was the "Equal Interval" technique within the ArcGIS platform. Each sub-categories of the profile and plan curvature maps, namely low, moderate, and high, were allocated ranks varying from 1 to 9, with ranks of 1, 5, and 9 respectively. The classification was accomplished through the utilization of the reclassify feature present in the spatial analyst tool embedded in the ArcGIS. In general, water exhibits a deceleration trend towards convex surfaces and an accumulation trend towards concave surfaces. A convex surface is inversely related to flooding, while a concave surface has a direct relationship with flooding. Consequently, a higher rank was assigned to the high sub-classes, representing concave surfaces, while a lower rank was assigned to the low sub-classes, representing convex surfaces.

2.20 Development of FVI map

The selection of parameters was based on a number of literature review, considering their significance in the domain of flood vulnerability as well as data availability. To conduct the flood vulnerability analysis, distinctive parameters were chosen. The social parameter was determined as the population density (number of individuals per square kilometer), the economic parameter as cropland productivity (kilograms per hectare), and the physical transportation parameter as the density of road-river intersection points per unit area (Arya & Singh, 2021; Bonazza et al., 2018; Chen et al., 2018; Gandini et al., 2020; Hussain et al., 2021; Requena et al., 2018). These indicators were deemed relevant and appropriate for assessing flood vulnerability in the research region.

2.21 Crop production map

The region's industrial and economic sectors heavily rely on agriculture, which highlights the importance of the proportion of cropland. This parameter is crucial when calculating the economic losses incurred by the region due to disasters (Prieto et al., 2020). To obtain data on the mean crop production in a year for the research region, information was collected from the Agricultural Department in Srikakulam, Andhra Pradesh, India. The mean yields of all currently grown crops in the research region were aggregated to determine the aggregate yield of crop per hectare for that specific tehsil.

A Crop Production Map was generated, categorizing the tehsil-wise total yield into five ranges. The classification method utilized for these ranges was the "Equal Interval" technique within the ArcGIS platform. Within the crop production map, sub-classes representing different yield ranges, namely 2389–3089, 3090–3970, 3971–4952, 4953–5808, and 5809–6403 kg per ha, were assigned ranks ranging from 1 to 9. This ranking was achieved using the reclassify function in the spatial analyst tool within ArcGIS software. In accordance with flood vulnerability, higher crop productions were assigned the highest rank, indicating greater susceptibility to flooding. Conversely, lower crop productions were assigned the least rank, indicating lower vulnerability to flood hazards.

2.22 Population density map

The population factor holds significant importance in conducting a comprehensive flood vulnerability study. This particular factor plays a crucial part in determining the vulnerable locations to social losses and damages caused by floods (Doorga et al., 2022). To gather the necessary population data for the research region, the Census of India for the year 2021 was utilized. Considering population as a vital element allows for a comprehensive evaluation of the impact on the community in flood vulnerable areas due to flood-related events. The population density map was prepared in ArcGIS software by considering tehsil-wise population density.

The population density map underwent classification into five categories. The classification method employed for these groups was the "Equal Interval" technique within the ArcGIS platform. Within this map, different sub-classes representing population density ranges, namely 214–515, 515.1–653, 653.1–797, 797.1–1064, and 1064.1–1276 persons per sq. km., were assigned ranks ranging from 1 to 9. These rankings were assigned using the reclassify function within the spatial analyst tool in ArcGIS software. Considering flood vulnerability, the highest rank was assigned to areas with high population density, indicating their greater susceptibility to flooding. Conversely, lower ranks were allocated to regions with lower population density, indicating their lower vulnerability to flood hazards.

2.23 Road river intersection map

Road-river intersections were recognized as high-risk zones when it comes to flood occurrences. The impact of waterlogging at these junctions was demonstrated through an examination of the frequency of road-river intersections per unit area in the research region (Merz et al., 2021). The road network for the area of study was established using information related to road network using the GSI as a reference point. To build the shapefiles of the road and river networks, ArcGIS's intersection function was used, followed by the utilization of the point density tool to analyze the intersection points.

The map depicting road-river interactions was divided into four categories. The classification method applied to these groups was the "Natural Break" technique within the ArcGIS platform. Within this map, different sub-classes representing road river intersection densities, specifically 0–2, 3–5, 6–8, and 9–11 points per sq. km., were assigned ranks ranging from 1 to 9. These rankings were determined using the reclassify function within the spatial analyst tool of ArcGIS software. Considering flood vulnerability, the highest rank was assigned to regions with a high density of road river intersections, indicating their greater susceptibility to flooding. Conversely, lower ranks were assigned to areas with lower road river intersection densities, implying their lower vulnerability to flood hazards.

2.24 Multicollinearity checks for FHI and FVI

Multicollinearity is a statistical issue that occurs when one or more input variables in a model are highly correlated with each other. This can lead to challenges in accurately estimating the output of the model. Hence, prior to implementing a regression model, it is crucial to evaluate the multicollinearity between the input variables. (Montgomery et al., 2013; Ray, 2023). The variance inflation factor (VIF) and tolerance factor are two frequently used metrics for assessing multicollinearity. The tolerance was calculated using Eq. 6, while the VIF was calculated using Eq. 7. These measures help assess the extent of multicollinearity present in the model and guide further analysis and decision-making.

$$\mathrm{Tolerance\,of \,the\,}{{\text{i}}}^{{\text{th}}}\mathrm{ \,parameter }\left({{\text{T}}}_{{\text{i}}}\right)=1-{{\text{R}}}_{{\text{i}}}^{2}$$
(6)
$$\mathrm{VIF\,of \,the\,}{{\text{i}}}^{{\text{th}}}\mathrm{\,parameter }\left({{\text{VIF}}}_{{\text{i}}}\right)=\frac{1}{{{\text{T}}}_{{\text{i}}}}$$
(7)

To evaluate the existence of multicollinearity within the thematic maps, a random sample of 500 points was chosen from the research region. The data was extracted using ArcGIS and analyzed through the Statistical Package for Social Science (SPSS) to identify any multicollinearity concerns. Multicollinearity is deemed problematic if the tolerance value is below 0.10 or if the VIF is equal to or greater than 10. This examination assists in recognizing any significant correlations among the thematic maps, which could impact the dependability and precision of the regression model's outcomes.

2.25 AHP for FHI and FVI

AHP, a methodology developed by Prof. Thomas L. Saaty in 1987, offers a unique approach for evaluating the appropriateness of structures in their designated places. It utilizes the concept of MCDA. The AHP approach makes use of a collection of reciprocal matrices to compare different variables (Saaty, 1987, 1990, 2008). The importance of FHI and FVI is measured on a scale of 1 to 9 to establish a ranking. A rating of '1' indicates a restricted area where flood occurrence is very low, while a '9' signifies an exceptional zone for FHI and FVI which indicates high risk of flood. To evaluate the consistency of weights and ranks assigned to different thematic maps and their sub-classes, the 'Consistency Ratio (CR)' is estimated by considering Eq. 8, as suggested by Saaty (1990). If the CR is equal to or less than 0.1, It signals a decision to move on with the AHP evaluation that is appropriate, according to Saaty (1990).

$$\mathrm{Consistency\,Ratio }\left({\text{CR}}\right)= \frac{{\text{CI}}}{{\text{RCI}}}$$
(8)
$$\mathrm{Consistency\,Index},\mathrm{ CI}= \frac{{\uplambda }_{{\text{max}}}-{\text{n}}}{{\text{n}}-1}$$
(9)

where RCI stands for random consistency index, which is a pre-determined value by considering on the number of criteria being considered, n represents the number of thematic maps, and λmax represents principal Eigen value, which is obtained through the Eigen value analysis of the pairwise comparison matrix.

2.26 Overlay analysis for FHI and FVI

Following the allocation of weights and rankings to every thematic map and their respective subcategories, the FHI and FVI maps were generated through a weighted overlay analysis conducted on ArcGIS software, specifically by utilizing the Spatial Analyst tool (Desalegn & Mulu, 2020). The formula for the overlay analysis as shown in Eq. 10 combines the different thematic maps based on their weights to calculate the final index value for each location:

$$\mathrm{FHI\,or\,FVI}={\sum }_{{\text{i}}=1}^{{\text{n}}}{({\text{W}}}_{{\text{i}}}\times {{\text{R}}}_{{\text{i}}})$$
(10)

where Wi stands for weight of the thematic map and Ri stands for rank of sub-classes. By multiplying each thematic map with its corresponding weight and summing up these weighted values, the overlay analysis produces the FHI and FVI maps. These maps provide an integrated representation of the hazard and vulnerability factors considered in the flood assessment analysis.

2.27 Sensitivity analysis for FHI and FVI

The sensitivity analysis unveils the influence of individual input thematic maps on the output thematic map. In the current research work, two distinct approaches were employed for sensitivity analysis, namely single parameter and map removal. To quantify the sensitivity, the Sensitivity Index (SI) was computed using Eq. 11 in the map removal sensitivity analysis (Ray, 2023). This index allows for the estimation of the extent to which the removal of a particular map affects the overall analysis results.

$${\text{SI}}=\frac{\left|\left(\frac{\mathrm{FHI\,or\,FVI}}{{\text{N}}}\right)-\left(\frac{{{\text{FHI}}}^{\mathrm{^{\prime}}}or\,{{\text{FVI}}}^{\mathrm{^{\prime}}}}{{\text{n}}}\right)\right|}{\mathrm{FHI\,or\,FVI}}\times 100$$
(11)

In the context of the sensitivity analysis, the terms \({{\text{FHI}}}^{\mathrm{^{\prime}}}or\,{{\text{FVI}}}^{\mathrm{^{\prime}}}\) represent modified versions of FHI or FVI, respectively, where one thematic map is excluded at a time. The variable 'n' represents the total number of thematic maps considered to create \({{\text{FHI}}}^{\mathrm{^{\prime}}}or\,{{\text{FVI}}}^{\mathrm{^{\prime}}}\), while 'N' represents the total number of thematic maps utilized to prepare the original FHI or FVI. In the single parameter sensitivity analysis, the weight factor (W) is determined using Eq. 12. This equation allows for the estimation of the influence of individual parameters on the overall analysis results.

$${\text{W}}=\frac{{{\text{P}}}_{{\text{r}}}{{\text{P}}}_{{\text{w}}}}{\mathrm{FHI\,or\,FVI}}\times 100$$
(12)

where Pr represents thematic map rank, Pw represents thematic map weight and FHI or FVI is flood hazard index or flood vulnerability index computed by using all of the thematic maps.

3 Results

All the fifteen thematic maps i.e., Runoff, Slope, LULC, Lithology, NDVI, NDBI, MNDWI, HSG, DD, Rainfall, Distance from River, Permeability, TWI, Profile, and Plan Curvature, were developed for preparation of FHI map. In addition to this, three more thematic maps i.e., population density, crop production and road river interaction, were prepared for FVI map. Finally, both FHI and FVI maps were multiplied in order to develop the FRI map (Table 1).

Table 1 Details of literature review for FRI

3.1 Multicollinearity checks for FHI and FVI

Presented in Tables 2 and 3 are the parameters from the investigation on multicollinearity concerning FHI and FVI. The outcomes reveal that all the thematic maps exhibit VIF values below the prescribed upper limit of 10, and their tolerance values surpass the designated lower threshold of 0.1. Consequently, based on the aforementioned findings, it becomes apparent that no detectable issue of multicollinearity exists among the evaluated thematic maps.

Table 2 Collinearity statistics for all the parameters of FHI
Table 3 Collinearity statistics for all the parameters of FVI

3.2 AHP for FHI and FVI

To determine the weightage for every thematic map and their sub-categories, the AHP method was employed. For FHI, Tables 4 and 5 display the pairwise comparison matrix and the corresponding normalized pairwise comparison matrix for the research region. Similarly, for FVI, Tables 6 and 7 demonstrate the respective pairwise comparison matrix and normalized pairwise comparison matrix. The values in the normalized pairwise comparison matrix were calculated by dividing the value of each cell by the column total value. The criteria weights were obtained by averaging the values in each row. Consistency computation for the pairwise comparison matrix was conducted using Eqs. 8 and 9. For FHI, the consistency index and consistency ratio were determined as 0.06 and 0.04, respectively. In the case of FVI, the values were 0.02 and 0.03, respectively. Since the consistency ratio values of 0.06 and 0.02 are less than 0.1, it can be concluded that the pairwise comparison matrices are consistent. Thus, the weights estimated through Table 5 and 7 can be utilized to assign weights to all the thematic maps. Table 8 presents the weight and rank values for all the thematic maps concerning FHI, while Table 9 displays the corresponding values for FVI. In order to enhance accuracy, the sub-classes of each thematic maps were assigned ranks based on their priority level, considering expert opinion and literature review.

Table 4 Pairwise comparison matrix for FHI
Table 5 Normalized pairwise comparison matrix for FHI
Table 6 Pairwise comparison matrix for FVI
Table 7 Normalized pairwise comparison matrix for FVI
Table 8 Weight and rank of all thematic maps for FHI
Table 9 Weight and rank of all thematic maps for FVI

3.3 Development of FHI

3.3.1 Runoff map

The weighting process in accordance with the AHP assigned a weight of 16.4% to the runoff map. Within the context of this study, the map depicting runoff was categorized into four distinct groups according to the depth of the runoff. These categories are defined as (29–38) mm, (39–71) mm, (72–111) mm, and (112–145) mm. The corresponding areas occupied by each category within the research region are approximately 24%, 26%, 29%, and 21%, respectively. Figure 3 depicts the runoff map for the research region.

Fig. 3
figure 3

Runoff map of the study area

3.3.2 Slope map

In accordance with the AHP, the slope map was allocated a 16.4% weight. The resultant slope map was subsequently segregated into five discrete classifications according to the slope inclination measured in degrees. These categories are named Very Low, Low, Moderate, High and Very High. In the research region, the distribution of these categories accounts for approximately 65%, 21%, 7%, 5%, and 2% respectively. Figure 4 illustrates the slope map representing the characteristics of the research region.

Fig. 4
figure 4

Slope map of the study area

3.3.3 LULC map

According to the AHP, the LULC map was assigned a 11.1% weight. The research area was classified into five primary classes: Forest, Barren, Agricultural, Water Bodies and Built-up. These categories make up approximately 25%, 10%, 55%, 4% and 6% of the research area, respectively. Figure 5 displays the LULC map representing the research region.

Fig. 5
figure 5

LULC map of the study area

3.3.4 Lithology map

The AHP assigned a weight of 11.1% to the lithology map. The research region primarily comprises of four categories of rock structures: Granite, Quartz, Gneiss, and Laterite. These formations constitute approximately 1%, 17%, 54% and 28%, of the research region, respectively. Figure 6 displays the Lithology map, depicting the distribution and characteristics of various rock formations within the research region.

Fig. 6
figure 6

Lithology map of the study area

3.3.5 NDVI map

The AHP allocated a weight of 7.2% to the NDVI map. After generating the NDVI map, it was further divided into five groups based on the NDVI values. These groups are designated as Very Low, Low, Moderate, High, and Very High. Within the research region, these categories represent approximately 17%, 33%, 28%, 19%, and 3% of the total area, respectively. Figure 7 visually represents the NDVI map, depicting the distribution and intensity of vegetation across the research region.

Fig. 7
figure 7

NDVI map of the study area

3.3.6 NDBI map

The AHP assigned a weight of 7.2% to the NDBI map. After generating the NDBI map, it was further divided into five distinct categories by considering the NDBI values. These distinct categories are designated as Very Low, Low, Moderate, High, and Very High. Within the research region, these categories represent approximately 14%, 19%, 23%, 26%, and 18% of the total research region, respectively. Figure 8 visually depicts the NDBI map, illustrating the spatial distribution and intensity of built-up areas across the research region.

Fig. 8
figure 8

NDBI map of the study area

3.3.7 MNDWI map

The AHP allocated a weight of 7.2% to the MNDWI map. Upon generating the MNDWI (Modified Normalized Difference Water Index) map, it underwent classification into five distinct categories based on the MNDWI values. These groups were labeled as Very Low, Low, Moderate, High, and Very High, representing approximately 21%, 38%, 33%, 5%, and 3% of the total research region, respectively. The MNDWI map is visually depicts in the Fig. 9.

Fig. 9
figure 9

MNDWI map of the study area

3.3.8 HSG map

According to the AHP, the HSG (Hydrologic Soil Group) map was assigned a weight of 4.6%. In the research region, the dominant soil groups consist of four groups: A, B, C, and D, covering approximately 17%, 21%, 29%, and 33% of the total research region, respectively. The HSG map for the research region, showcasing the spatial representation and distribution of different HSG categories is illustrated in Fig. 10.

Fig. 10
figure 10

HSG map of the study area

3.3.9 DD map

The AHP assigned a weight of 4.6% to the DD map. The DD map displays the concentration of drainage in the research area and is sorted into five classifications: very Low, Low, Moderate, High, and very High. These groups represent approximately 19%, 24%, 27%, 20%, 7%, and 10% of the total research region, respectively. The DD map of the research region is shown in the Fig. 11.

Fig. 11
figure 11

DD map of the study area

3.3.10 Rainfall map

The AHP assigned a weight of 3.1% to the Rainfall map. The rainfall map was classified into five distinct groups based on rainfall ranges, namely 918–1043 mm, 1044–1100 mm, 1101–1158 mm, 1159–1222 mm, and 1223–1342 mm. These categories represent approximately 13%, 40%, 25%, 18%, and 3% of the total research region, respectively. For a visual representation of the rainfall distribution, refer to Fig. 12.

Fig. 12
figure 12

Rainfall map of the study area

3.3.11 Distance from river map

The AHP allocated a weight of 3.1% to the Distance from River map. The research region was divided into six distinct groups: 500 m, 1000 m, 2000 m, 3000 m, and distances greater than 3000 m. These categories represent approximately 6%, 11%, 19%, 26%, and 39% of the total research region, respectively. Distance from river map is shown in the Fig. 13.

Fig. 13
figure 13

Distance from river map of the study area

3.3.12 Permeability map

The AHP assigned a weight of 2.3% to the Permeability map. The Permeability map is divided into four categories: Low, Moderate, High, and Very High. These groups represent approximately 17%, 21%, 29%, 7%, and 33% of the total research region, respectively. The permeability map of the research region is shown in the Fig. 14.

Fig. 14
figure 14

Permeability map of the study area

3.3.13 TWI map

The AHP assigned a weight of 2.3% to the TWI map. The TWI map of the research area was divided into five categories: Very Low, Low, Moderate, High, and Very High, accounting for 23%, 42%, 15%, 17%, and 3% of the research region, respectively. Figure 15 displays the TWI map of the research region.

Fig. 15
figure 15

TWI map of the study area

3.3.14 Profile and plan curvature map

The weights assigned to both the plan and profile curvature maps were 1.7% each, based on the AHP. The profile curvature map was classified into three categories: low, moderate and high, representing 10%, 69%, and 21% of the research region, respectively. The plan curvature map was classified into three categories: low, moderate and high, representing 9%, 73%, and 18% of the research region, respectively. Figures 16 and 17 displays the profile and plan curvature map of the study area, respectively.

Fig. 16
figure 16

Plan Curvature map of the study area

Fig. 17
figure 17

Profile Curvature map of the study area

3.4 FHI

The FHI map for the research region was developed in ArcGIS software by overlaying several thematic maps, including Runoff, Slope, LULC, Lithology, NDVI, NDBI, MNDWI, HSG, DD, Rainfall, Distance from River, Permeability, TWI, Profile, and Plan Curvature. The weights and ranks assigned to each of these 15 thematic maps and their sub-classes, using AHP methodology, were considered during the overlay process. The weighted overlay method in ArcGIS was employed for this purpose. By applying the weighted overlay method and considering the assigned weights and ranks, the FHI map was generated. This map serves as a comprehensive representation of flood hazard potential in the research region, incorporating multiple factors and their respective influences on flood occurrence.

According to the analysis of the FHI map, the research area was categorized into five separate regions: very high, high, medium, low, and very low FHI. These zones accounted for 41%, 25%, 24%, 8%, and 2% of the research region, respectively. The FHI map, shown in Fig. 18, was created by incorporating information from all 15 thematic maps.

Fig. 18
figure 18

FHI map of the study area

From the analysis of Fig. 18, it was evident that 66% of the region fell into the high and very high FHI classifications. This can be attributed to favorable factors such as low and very low runoff, gentle slopes, presence of water bodies and built-up areas in the LULC data, and the predominant lithology of granite and gneiss in the Srikakulam district of Andhra Pradesh, India, where the research was conducted. However, 34% of the region was classified as low and moderate FHI zones. This can be attributed to unfavorable factors such as presence of mountains, steep or high slopes, low runoff, and the presence of forests and agricultural land.

3.5 Sensitivity analysis of FHI

Table 10 displays the results of the map removal sensitivity analysis, while Table 11 presents the outcomes of the single parameter sensitivity analysis. Based on Table 10, it is evident that the runoff and slope maps have the most significant impact on the FHI, followed by the LULC, lithology, NDVI, NDBI, and MNDWI maps, which have a moderate impact. Conversely, the Curvature and TWI maps exhibit the lowest impact on FHI computation. The elimination of the runoff and slope maps results in substantial variations as indicated by high values of the variation index. This signifies that runoff and slope shows a vital role in developing the FHI for the research region.

Table 10 Statistical analysis of map removal sensitivity analysis for FHI
Table 11 Statistical analysis of single parameter sensitivity analysis for FHI

Table 11 provides the percentage of effective weights for all thematic maps. The results demonstrate that higher empirical weights correspond to higher effective weights, implying their greater influence on the FHI calculation. Table 12 illustrates the changes in the percentage of areas within different FHI categories (i.e., very high, high, moderate, low, and very low) after the removal of individual thematic maps. The outcomes indicate that excluding the runoff map and slope map significantly impact the FHI by increasing the area classified as low suitability by 35.3% and 27.6%, respectively. However, the exclusion of the TWI and Curvature maps exhibits a minor influence on the FHI.

Table 12 Changes of the FHI with removal of the thematic map

3.6 Development of FVI

3.6.1 Crop production map

The crop production map was allocated a weight of 63% based on the AHP. Using ArcGIS software, a Crop Production Map was created, classifying the tehsil-wise total yield into five categories: 2389–3089, 3090–3970, 3971–4952, 4953–5808 and 5809–6403 kg per ha. These categories represent 4%, 13%, 17%, 25%, 30% and 28% of the research region, respectively. Figure 19 showcases the Crop Production Map of the research region.

Fig. 19
figure 19

CP map of the study area

3.6.2 Population density map

The population density map was given a weight of 26% based on the AHP. The population density map was classified into five categories: 214–515, 515.1–653, 653.1–797, 797.1–1064 and 1064.1–1276 persons per sq. km., representing 14%, 24%, 17%, 36% and 10% of the research region, respectively. The population density map of the research region is shown in Fig. 20.

Fig. 20
figure 20

PD map of the study area

3.6.3 Road river intersection map

The road river intersection map was given a weight of 11% based on the AHP. The map depicting road-river interactions was divided into four categories: 0–2, 3–5, 6–8 and 9–11 points per sq. km. These categories represent 68%, 24%, 7% and 1% of the research region, respectively. Figure 21 showcases the road-river interaction map of the research region, offering a visual representation of these categories.

Fig. 21
figure 21

RRI map of the study area

3.7 FVI

The FVI map for the research region was generated using ArcGIS software by overlaying three thematic maps: population density, road river intersection and crop production. The ranks and weights allocated to all of these maps and their sub-categories, based on the AHP, were considered during the overlay process. The weighted overlay method in ArcGIS was employed for this purpose. The production of the FVI map was achieved through the utilization of the weighted overlay technique, which involved the integration of the designated weights and ranking. This map provides a comprehensive representation of flood vulnerability in the research region, taking into account the influences of population density, crop production, and road river interaction.

According to the FVI map analysis, the research region has been classified into five distinct regions: very high, high, moderate, low, and very low FVI. These regions account for 20%, 22%, 29%, 13%, and 16% of the research region, respectively. The FVI map, shown in Fig. 22, was created by incorporating information from all three thematic maps.

Fig. 22
figure 22

FVI map of the study area

From the analysis of Fig. 22, it is evident that 42% of the area falls into the very high and high FVI categories. This can be attributed to favorable conditions such as higher crop production, high population density, and a significant number of road river intersections in the research area. However, 58% of the area is categorized as moderate and low FVI zones. This can be attributed to unfavorable conditions such as lower crop production, lower population density, and a lower density of road river intersections.

3.8 Sensitivity analysis of FVI

Table 13 presents the results of the map removal sensitivity analysis, while Table 14 displays the outcomes of the single parameter sensitivity analysis. Based on Table 13, it is evident that the crop production map has the greatest impact on the FVI, followed by the population density map, while the road river intersection map has the lowest impact on FVI computation. The results of the variation index indicate that the exclusion of the crop production map results in significant variations, followed by the elimination of the population density map. This indicates that the crop production map has a substantial impact on the FVI for the research region. Table 14 provides the effective weights of all thematic maps expressed as percentages. The results demonstrate that higher empirical weights correspond to higher effective weights, indicating their greater influence on the FVI calculation.

Table 13 Statistical analysis of map removal sensitivity analysis for FVI
Table 14 Statistical analysis of single parameter sensitivity analysis for FVI

Table 15 illustrates the changes in the area percentage within different FVI categories (i.e., very high, high, moderate, low, and very low) after the removal of individual thematic maps. The outcomes indicate that excluding the crop production map significantly impacts the FVI by increasing the area classified as low FVI by 41.3%. However, the exclusion of the road river intersection map exhibits a lesser impact on the FVI.

Table 15 Changes of the FVI with removal of the thematic map

3.9 Development of FRI

To generate a comprehensive FRI map for the research region, a combination of FHI and FVI maps was employed. By leveraging the power of the weighted sum method in ArcGIS software, these maps were integrated together to develop the final result. Figure 23 provides an overview of the FRI map for the research region. The FRI map was divided into five divisions: high-risk, high, moderate, low, and very low zones, encompassing 28%, 20%, 16%, 28%, and 8% of the research region, respectively. Notably, 48% of the research region falls within the very high-risk and high-risk zones, indicating favorable conditions such as high runoff, flat slopes, the presence of water bodies and built-up areas, high rainfall, and dense crop production and population. Conversely, 36% of the research region is categorized as low and very low risk, attributed to factors like low runoff, steep slopes, the presence of forests and agricultural land, low rainfall, and lower crop production and population density.

Fig. 23
figure 23

FRI map of the study area

In Table 16, the village and area mapping based on the FRI classification is presented, providing a detailed understanding of the risk distribution across the research region. Furthermore, Fig. 24 depicts a histogram comparison of the FHI, FVI and FRI percentages for different categories within the research region.

Table 16 Villages and area mapping based on FRI classification
Fig. 24
figure 24

Graph showing percentage of study area corresponding to FHI, FVI and FRI

These findings contribute to a comprehensive understanding of the flood risk landscape, aiding in the formulation of appropriate mitigation and disaster management strategies. By identifying areas with high vulnerability and risk, decision-makers can focus resources and interventions where they are most needed, ensuring the safety and well-being of the population and minimizing potential damage caused by flooding events.

3.10 Validation of FRI map

A critical procedure called "model validation" entails methodically contrasting a model's outputs with unbiased real-world data. Assessing the level of concordance between the recorded data or real conditions and the quantitative and qualitative outcomes of the model is the aim. Scholars often use a variety of models to evaluate flood risk in various parts of the world. To make sure the model's outputs appropriately reflect the circumstances in the actual world, it is crucial to validate them.

To validate the FRI map, a total of 50 points were carefully selected, with 10 points allocated to each category. The validation process involved utilizing the ROC analysis. In the ROC curve, the x-axis illustrates the False Positive Rate (FPR), which quantifies how frequently the classifier erroneously classifies negative instances as positive. Meanwhile, the y-axis represents the True Positive Rate (TPR), also known as sensitivity, which assesses the model's proficiency in accurately identifying positive instances. To verify the accuracy of the ground truth points, field visits were conducted, and data regarding flooding zones (based on previous flooding records) were collected from the Andhra Pradesh State Disaster Management Authority (APSDMA) in Andhra Pradesh, India. Among the 50 points, only 3 points exhibited discrepancies compared to the FRI map prepared using ArcGIS software. These ground truth points, along with the FRI map, were visually represented in Fig. 25. In Fig. 25, the ground truth points were represented by different shapes whereas different color indicates classified classes of FRI using ArcGIS. To validate the FRI map, a comparison between the ground truth points and the produced FRI map was performed with the use of SPSS software. The validation process included cross-validation using the ROC curve, as depicted in Fig. 26. The Area Under the Curve (AUC) statistic, which determines the effectiveness of a variable in distinguishing between two groups, were calculated. The AUC in a ROC curve is a measure of the overall performance of a classification model. It quantifies the model's ability to distinguish between positive and negative cases. A higher AUC indicates better performance, with 1 being perfect. An AUC of 0.5 suggests a model that performs no better than random guessing, while values above 0.5 show some level of discrimination. The closer the AUC is to 1, the better the model's ability to correctly classify positive cases while minimizing false positives. In case of FRI map, the AUC value was found to be 0.89, indicating that the AHP approach employed in the model resulted in highly accurate predictions.

Fig. 25
figure 25

FRI map with ground truth points

Fig. 26
figure 26

Cross-validation of the FRI map using ROC curve

4 Discussion and conclusions

The research region under investigation experiences frequent flood and drought situations, both during the monsoon and non-monsoon seasons. These occurrences are primarily caused by factors such as heavy rainfall, high runoff, flat slopes, low infiltration, low permeability, and a lack of proper flood management systems (Desalegn & Mulu, 2020; Gazi et al., 2019). In this study, the aim was to identify FRI zones using a combination of the AHP, GIS approach, and remote sensing techniques. The identification of high-risk locations within the FRI zones was of utmost importance to establish effective flood management systems (Velasco et al., 2016). The implementation of such systems would help minimize losses resulting from floods, improve water storage facilities in the research region to meet domestic, irrigation, and industrial needs during non-monsoon periods, and reduce the adverse environmental impacts caused by droughts (Yang et al., 2018; Abu El-Magd et al., 2020; Gandini et al., 2020). Several existing procedures were considered during the study; however, many of them proved inadequate for specific zones due to regional conditions, time constraints, social factors, or political challenges (Arrighi, 2021; Baquedano & Ferreira, 2021; Chen et al., 2018; Doorga et al., 2022; Memon et al., 2020; Merz et al., 2021; Petroselli et al., 2019; Requena et al., 2018). By incorporating GIS and remote sensing technologies, along with methods to address multicollinearity and the AHP technique, the accuracy of the output was significantly improved (Arya & Singh, 2021; Chakraborty & Mukhopadhyay, 2019; Martinez-Gomariz et al., 2021; Ray, 2023). Additionally, this integrated approach saved a considerable amount of time in the analysis process (Amare & Okubay, 2019; Samela et al., 2018).

In previous attempts to develop the FRI map, a maximum of 10 to 12 thematic maps were utilized. However, the final accuracy of these maps ranged between 0.73 and 0.77 as shown in Table 1. It is important to note that these accuracies were achieved without considering the factors of multicollinearity check and sensitivity analysis as shown in Table 1. To enhance the accuracy of the FRI map, the current study incorporated additional steps such as multicollinearity check and sensitivity analysis. By including these measures, the accuracy of the thematic map was significantly improved, reaching a value of 0.89. This indicates a substantial enhancement in the accuracy and reliability of the FRI map compared to previous attempts (Dayala et al., 2020; Ferreira & Santos, 2020; Hussain et al., 2021; Mundhe, 2019; Requena et al., 2018; Samela et al., 2018; Vojtek & Vojtekova, 2019). The multicollinearity check helps identify and address the issue of collinear relationships between predictor input thematic maps, ensuring that they are not redundantly contributing to the model. By eliminating multicollinearity, the FRI map becomes more robust and reliable in capturing the true flood risk patterns in the research region. Additionally, sensitivity analysis allows for a comprehensive evaluation of the model's performance by examining the impact of varying input thematic maps. This analysis ensures that the FRI map's outputs are not overly sensitive to specific input thematic maps, strengthening its credibility and accuracy (Ray, 2023).

The present study introduced a novel approach by incorporating multicollinearity checks and sensitivity analysis into the development of the FRI map. Unlike previous attempts that used a limited number of thematic maps as shown in Table 1, this study considered a total of 18 thematic maps for delineating the FRI map. The inclusion of multicollinearity checks was found to be crucial in identifying high-risk zones. By addressing multicollinearity, the study minimized uncertainty within the data, thereby enhancing the accuracy of the FRI map's outcomes. This step helped ensure that the identified high-risk zones were based on solid and independent information, improving the reliability of the results (Ray, 2023).

Additionally, sensitivity analysis played a significant role in the study. The AHP was utilized to evaluate the importance of various thematic maps relative to one another, thereby confirming the accuracy of the assigned weights for each thematic map (Amare & Okubay, 2019; Bonazza et al., 2018; Xiao et al., 2021). By conducting sensitivity analysis, the study was able to ascertain the relative significance and contribution of each thematic map to the overall FRI map. This analysis provided valuable insights into the relative influence of different variables and helped validate the weighting scheme derived from the AHP process (Prieto et al., 2020; Ray, 2023; Singh et al., 2020; Teng et al., 2017).

In summary, the inclusion of multicollinearity checks and sensitivity analysis in the study brought several benefits. Multicollinearity checks increased the accuracy of the FRI map by reducing uncertainty within the data. Sensitivity analysis, on the other hand, validated the importance of each thematic map and the weights assigned to them, enhancing the overall credibility of the FRI map's results. Together, these approaches improved the accuracy, reliability, and robustness of the FRI map, making it a valuable tool for flood risk assessment and management.