1 Introduction

In developing world, as the needs of human beings increase, new residential areas and commercial facilities expand near the waterfront sides. Hence, this brings a direct confinement of the waterway and leads to increasing of flood hazards. Floods are one of the major natural hazards which seriously threaten the infrastructures, cultivated agricultural lands, transportation networks in both urban and countryside areas and cause losses of significant economic activities and lives (Akay 2021). So, policymakers have to face challenging decisions on irresistible and increasing economic activities and its conflicting results of flooding hazards. Being aware of this problem, researchers are working on finding solutions to predict the spatial distribution of flood susceptibility. Flood susceptibility maps in terms of the risk groups may be considered as a design criterion or input to assess the flood vulnerability of the structures to be constructed, where necessary. For instance, an engineer could decide about the construction site of a new residential area by taking into consideration of the flood susceptibility maps.

Watershed characteristics are spatially obtained by the extraction of the available data using geographic information system (GIS) which is an important and effective tool, and adopted the contribution as an attribute in susceptibility assessment. Researchers focus on different techniques, based on statistical, fuzzy logic (FL), multi-criteria decision making (MCDM), machine learning, and meta-heuristic algorithms etc. to estimate the spatial distribution of flood susceptibility of flooded and non-flooded zones. Moreover, those methods have attracted attention and been employed for not only flood susceptibility but also for landslide susceptibility, groundwater potential, avalanche susceptibility, and wildfire probability mapping etc. (Mogaji et al. 2015; Arabameri et al. 2019; Choubina et al. 2019; Jaafari et al. 2019; Rahmati et al. 2019; Moayedi et al. 2020).

Researchers performed various methods including frequency ratio (FR) (Tehrany et al. 2019; Arabameri et al. 2019; Sahana and Patel 2019; Vafakhah et al. 2020; Mirzaei et al. 2021; Wang et al. 2021), weights of evidence (WoE) (Tehrany et al. 2014), index of entropy (IoE) (Haghizadeh et al. 2017; Wang et al. 2021), evidential belief function (EBF) (Tehrany and Kumar 2018; Arabameri et al. 2019; Bui et al. 2019), logistic regression (LR) (Nandi et al. 2016; Tehrany and Kumar 2018; Tehrany et al. 2019), statistical index (SI) (Tehrany et al. 2019), support vector machine (SVM) (Tehrany et al. 2014; Shafapour Tehrany et al. 2019), naïve Bayes tree (NBT) (Khosravi et al. 2019; Chen et al. 2020), random forest (RF) (Chen et al. 2020; Vafakhah et al. 2020; Mirzaei et al. 2021), fuzzy logic (FL) (Sahana and Patel 2019), fuzzy DEMATEL ANP (Kanani-Sadat et al. 2019), artificial neural network (ANN) (Kia et al. 2012), analytical hierarchy process (AHP) (Hammami et al. 2019; Kanani-Sadat et al. 2019), Technique for Order Preference by Similarity to an Ideal Solution (TOPSIS) (Arabameri et al. 2019; Khosravi et al. 2019), VlseKriterijumska optimizacija I Kompromisno Resenje (VIKOR) (Arabameri et al. 2019; Khosravi et al. 2019), SAW (Khosravi et al. 2019), adaptive neuro-fuzzy inference system (ANFIS) (Hong et al. 2018; Vafakhah et al. 2020), and gradient boosting tress and multilayer perception algorithms (Costache et al. 2020; Ahmadlou et al. 2021; Wang et al. 2021) for flood susceptibility assessment.

In recent years, results of flood susceptibility mapping studies obtained from aforementioned methods were compared with each other, accuracy of prediction results were discussed and the methods giving the best accuracy were determined, accordingly (Khosravi et al. 2016; Arabameri et al. 2019; Kanani-Sadat et al. 2019; Shafapour Tehrany and Kumar 2018; Shafapour Tehrany et al. 2019; Shana and Patel 2019; Al-Abadi and Al-Najar 2020; Chen et al. 2020). Since there is yet no consensus worldwide on the accuracy of the model structure and limitations of the models and the characteristics of the study area, flood susceptibility mapping studies are still ongoing to improve the accuracy of the spatially predicted results (Costache et al. 2020). Hence, those methods are integrated creating the hybrid or ensemble models to get higher prediction performance (Shafapour Tehrany et al. 2014; Bui et al. 2019; Shafapour Tehrany et al. 2019; Costache et al. 2020). Researchers studying on flash flood susceptibility mapping coupled the EBF with LR, SVM, and other methods in order to improve the accuracy of the predictions and also avoid computational affords (Bui et al 2019; Chowdhuri et al 2020; Costache et al 2020; Talukdar et al 2020). Vafakhah et al. (2020) compared FR, ANFIS, and RF models in predicting flood susceptibility mapping, and the authors found RF model as the best predicting model. Mirzaei et al. (2021) adapted FR, RF, generalized additive model (GAM), and extreme gradient boosting (EGB) models to estimate the spatial distribution of flood susceptibility. Results indicated that RF and GAM models gave the best performance. Ahmadlou et al. (2021) implemented multilayer perceptron (MLP) and autoencoder–MLP models in flood susceptibility mapping. Authors stated that autoencoder-MLP gave outperformed. Wang et al. (2021) hybridized FR, IoE, and probability density (PD) models with MLP, and classification and regression tree (CART) models. MLP-PD was found as the most efficient model in flood susceptibility prediction in the study. Pham et al. (2021) employed best first decision tree (BFT), and bagging, decorate and random subspace ensemble models of BFT. Decorate-BFT was identified as superior model in flood susceptibility predictions. Dodangeh et al. (2020), Mehrabi et al. (2020), Pourghasemi et al. (2020), Arora et al. (2021), and Shahabi et al. (2021) adopted ensemble meta-heuristic algorithms and ANFIS, SVM machine learning techniques in order to optimize parameters in development of their predictions. All the researchers concluded that hybrid or ensemble techniques changed the predictions and contributed the accuracy of the model results. Researches on improvement of the model predictions using optimization and ensemble machine learning algorithms are still ongoing.

The main objective of this study is to contribute to the recent efforts on comprehensive comparison of statistical, FL and MCDM methods. To the best knowledge of the author, a comprehensive study was not implemented using this methodology. For this, some fundamental statistical approaches such as FR, WoE, EBF, IoE, and principal component analysis (PCA) were used. Moreover, some hybridization techniques using those methods in FL and MCDM methods such as AHP, TOPSIS, and VIKOR were tested in order to further understand the capability of the model outcomes. Flood hazards susceptibility maps were predicted separately and compared with each other.

2 Materials and methods

2.1 Study area

Gokirmak Basin sited in the northern part of Turkey has a drainage area of 1375 km2, drains through Ulus town and Bartin city center and then discharges into the Black Sea. The annual rainfall of the basin varies between 850–1250 mm; and the average slope of the basin is 35%. Increase in population and economic activities in the region leads to rapid urbanization causing the curve number of the region to increase, as well. So, the basin frequently faces flash floods due to increase in excessive rainfall, steep topography, deterioration in the basin, and the characteristics of the basin. Some major flash flood disasters took place in the study area, one of which occurred in 1998 that is still in the memories of the local people who experienced it (Akay and Baduna Koçyiğit 2020). Several studies including hydrologic modeling of the hydrological processes in the basin were conducted to further increase the understanding of the runoff of excessive rainfall (Baduna Koçyiğit et al. 2017; Akay et al. 2018). Flash floods occurring in Gokirmak Basin might seriously damage the residential areas, infrastructures, and suspend commercial activities and transportation services causing economic loss each year.

Rapid urbanization also requires infrastructure investments that should be insured against flash flood hazards. Engineers should consider the hydrologic vulnerability and risk assessment of flash floods regarding the safety of any structure in design stage, as well. Hence, a reliable flood hazards susceptibility mapping obtained using different statistical and hybrid MCDM models for Gokirmak Basin carried out in this study is thought to be crucial for especially local authorities for water management activities in the area.

2.2 Methodology

The methodology adopted in this study to obtain flood hazards susceptibility maps is given in Fig. 1.

Fig. 1
figure 1

Flood susceptibility mapping methodology

2.3 Data

Data used in this study consist of flood inventory and spatial distribution of flood conditioning factors. Flood inventory data include hazards occurred at past flood events reported by experts and local authorities while flood conditioning factors are the parameters that directly affect occurrence of flash floods.

2.3.1 Flood inventory

Past floods are one of the most important indicators, which show that similar consequences or hazards may be experienced as a result of future possible floods in the same environment depending on the meteorological, climatological, hydrological conditions of the watershed at that time. So, past flood hazards reported by the authorities at different time intervals were considered in this study to increase the estimation capability of flood hazards susceptibility spatial modeling using various techniques. Besides, flood inventory data also provide high flood susceptible zones in the watershed. On the other hand, non-flooded zones which are generally constituted from hills contribute to test the accuracy of the generated flood hazards susceptibility mapping (Tehrany and Kumar 2018). In the study, flooded and non-flooded points collected from past flood events were introduced into ArcGIS and analyzed, accordingly. 139 flooded and 140 non-flooded points were considered for the generation of flood hazards susceptibility maps, of which randomly selected 98 points were used for training while the rest were used for testing of the generated map (Fig. 2).

Fig. 2
figure 2

Study area and flooding pixels

2.3.2 Flood conditioning factors

To determine the dominant flood conditioning factors is a crucial issue regarding the flood hazards susceptibility mapping. Accuracy of both inventory data and well-defined flood condition factors plays an important role on the accuracy of generated flood hazards susceptibility maps since they are involved directly. Analysis of flooded or non-flooded points is based on the extraction of attributes of relevant flood conditioning factors, and then processing and incorporating by overlapping those attributes. In the literature, some common physical characteristics such as lithology, land use, land cover, geological conditions, precipitation, and drainage characteristics of the watershed are considered in assessment of the landslides or flash floods susceptibility and ground water potential. Three meteorological observation stations collect meteorological data in the basin (Akay et al. 2018). According to the report analyzed from those stations, the basin takes average annual precipitation of 850 mm, but sometimes it increased to 1250 mm. However, unfortunately, these stations seriously suffer from the scarce and lack of continuous data. Such kind of basins is called as poorly gauged or ungauged basins. Moreover, this is a systematic problem that does not significantly change, even reversely affect the flood hazard maps, in poorly gauged watersheds, worldwide. Since the watershed is poorly gauged, precipitation which is the main component of flood hazards could not be used in prediction of spatial distribution of flood hazards. In similar studies, precipitation was left out in prediction of flood susceptibility (Mirzaei et al. 2021; Pham et al. 2021). So, in this study, available aforementioned types of data of the study area which are compatible with similar studies conducted in the literature were selected as flood conditioning factors to estimate the flood susceptibility maps (Mirzaei et al. 2021). Those data may be categorized as local and concentrated characteristics of the watershed. Local attributes of the watershed that are independent of the upstream characteristics are considered to capture the heterogeneities in the watershed while concentrated characteristics of the watershed consider the contribution of the most upstream points. In this study, local parameters such as elevation, slope, aspect, plan curvature, distance from stream and soil groups, and concentrated characteristics such as topographic wetness index (TWI), sediment transport index (STI), drainage density (Dd), elongation ratio (Er), and stream order were considered as flood conditioning factors. Elevation, slope, aspect, plan curvature, land cover, and distance from stream, TWI, STI, and Dd parameters are commonly utilized to assess flood, landslide susceptibility and groundwater potential of the watershed. Er; just as Dd; or stream order are related with shape and drainage properties of the watershed and reflect the flash flood vulnerability, but those parameters have not been used in flash flood susceptibility mapping studies to the best knowledge of the author. All flood conditioning factors except soil groups were processed in ArcGIS using digital elevation model (DEM) with a resolution of 10 m obtained from 1:25,000 scaled topographic maps. Watershed consists of alluvial (A), gray brown podzolic (G), colluival (K), brown forest (M), non-calcareous brown forest (N), and red yellow podzolic (P) great soil groups. Slope, aspect and plan curvature grids were obtained by using only DEM of the watershed, itself. Flow direction and flow accumulation cells were determined from DEM after operation of fill sinks. TWI and STI concentrated factors were calculated as a function of flow accumulation and slope grids. Stream network of the watershed was extracted based on the value of threshold cells, taken as 1000 in this study, of flow accumulation (Arabameri et al. 2019). Studies in close watersheds indicated that the drainage network is sensitive to the threshold value of flow accumulation. However, it does not improve the model accuracy when the threshold value decreases more (Akay 2021). Stream order of the extracted network was designated based on Strahler’s method. Concentrated cells of each designated stream order were defined as a domain of the class. Line density of the extracted stream following the stream segmentation, and drainage line processing, respectively, was used for determination of the drainage density spatial distribution of the watershed. Elongation ratio of the watershed could be spatially calculated by combining the flow accumulation and flow length obtained using flow direction of the watershed. Elongation ratio presented as a grid format herein is one of the most important parameters in determining the flash flood characteristics of the watershed. Previous studies on morphometric analysis emphasized the vital contribution of watershed shape characteristics (Akay and Baduna Koçyiğit 2020).

Multicollinearity test of flood conditioning factors was conducted to detect if repeating variables may occur in independent variables. In case of occurrence, one of the repeating variables may be left out considering the value of variance inflation factor (VIF). VIF value more than 10 is accepted to be exist multicollinearity problem (Talukdar et al. 2020). In ArcGIS, elevation, slope, distance from stream, TWI, STI, Er and Dd conditioning factors were classified based on natural break method in five sub-classes while aspect, curvature, soil groups and order conditioning factors were classified based on supervised classification method. Since Strahler’s order of the main stream is seven, while the number of soil groups in the watershed is six, the number of sub-classes of the relevant factors was classified, accordingly. Aspect in nine classes based on the surface gradient direction and curvature in three classes based on the tangent to the DEM were created. Figure 3 indicates the flood conditioning factors in the watershed.

Fig. 3
figure 3

Flood conditioning factors of the watershed

2.4 Flood susceptibility mapping

Besides the flood inventory data and values of flood conditioning factors at those points, accuracy of the generated flood hazards susceptibility maps also depends on how the flood conditioning factors incorporate with well-defined flood hazards. In similar studies, a basic assumption is that future flash flood hazards will occur under the same conditions as past flash floods. Flash floods are directly related with the spatial information of flood conditioning factors. The flood susceptibility maps of the study area were obtained by using various methods such as FR, EBF, WoE and IoE. Moreover, hybrid integration of those approaches with PCA, FL, AHP, TOPSIS and VIKOR was established in order to examine the accuracy of maps. For example, FL with FR and WoE, separately, PCA with EBF, AHP with IoE and EBF, and TOPSIS and VIKOR with AHP, and PCA weights, separately, were hybridized and flood susceptibility maps were estimated. This new challenge on preparation of accurate flood hazards susceptibility maps requires a detailed analysis.

2.4.1 Frequency ratio (FR)

FR is one of the most popular statistical methods used in mapping studies. FR approach is based on the observed relationships between the distribution of flash floods and each flash flood conditioning factor, to disclose the correlation between flood hazard pixels and the factors in the study area (Lee and Pradhan 2006). FR at a class of a flood conditioning factor is defined as the ratio of the flooded point percent at a class to the class pixel percent in the domain. FR values of each flood conditioning factor are computed and the flood susceptibility mapping is obtained by summing up the raster values of each FR of flood conditioning factors in ArcGIS.

2.4.2 Evidential belief function (EBF)

EBF is a bivariate statistical method which is dominated by four functions as belief (Bel), disbelief (Dis), uncertainty (Unc), plausibility (Pls), and based on the importance of class in a flood conditioning factor. EBF is a probability-based approach and bounded by Bel, and Pls functions as minimum and maximum probabilities, respectively. The EBF model was designed to process flood hazards information at different levels of knowledge. Moreover, no other assumptions are required to represent knowledge since the uncertainty responses is directly allowed directly in representation of system (Althuwaynee et al. 2012). Flood hazards probability of a class of a flood conditioning factor may be approximated by Bel and Pls functions while non-occurrence of flood at any classes of a flood conditioning factor is defined as Dis function. As the Bel value increases, the probability of flood occurrence also increases. Bel and Dis functions may be calculated as given in Eqs. 14.

$$ {\text{Bel}} = \frac{{w_{Cij({\text{flood}})} }}{{\sum {w_{Cij({\text{flood}})} } }} $$
(1)
$$ w_{Cij({\text{flood}})} = \frac{{\frac{{N(C_{ij} \cap D)}}{{N(C_{ij} )}}}}{{\frac{{N\left( D \right) - N(C_{ij} \cap D)}}{{N\left( T \right)-N(C_{ij} )}}}} $$
(2)
$$ {\text{Dis}} = \frac{{w_{{Cij({\text{non-flooded}})}} }}{{\sum {w_{{Cij({\text{non-flooded}})}} } }} $$
(3)
$$ w_{{Cij({\text{non-flooded}})}} = \frac{{\frac{{N(C_{ij} )-N(C_{ij} \cap D)}}{{N(C_{ij} )}}}}{{\frac{{N(T)-N\left( D \right)-N(C_{ij} ) + N(C_{ij} \cap D)}}{{N\left( T \right)-N(C_{ij} )}}}} $$
(4)

where Cij is the jth class of conditioning factors (Ci), and wcij indicates the weight of Cij. N(T), N(Cij), N(D), and N(Cij ∩ D) denote the total pixel in the domain, total flood pixel in the domain, total pixel in the class, and total flood pixel in the class, respectively.

Since the summation of maximum occurrence and non-occurrence of flood hazard probabilities (Pls and Dis) are unity, and the difference of maximum and minimum occurrence probabilities are defined as uncertainty, the governing equations are represented as follows:

$$ {\text{Unc}} = 1-{\text{Bel}}-{\text{Dis}} $$
(5)
$$ {\text{Pls}} = 1-{\text{Dis}} $$
(6)

The Bel, Dis, Unc, and Pls functions at a prescribed class of each flood conditioning factor were calculated and processed in ArcGIS. The combination of each flood conditioning factor was conducted based on Dempster–Shafer theory using ArcGIS environment.

2.4.3 Weights of evidence (WoE)

WoE is a bivariate statistical method based on Bayesian likelihood theorem and combines information from all the flood hazards conditioning factors. In WoE method, in natural logarithm format, occurrence (Wi+) and non-occurrence (Wi) of flood hazards considering the flood inventory data at a prescribed class of a flood conditioning factor are described as likely to occur and not to occur, respectively (Eqs. 7, 8).

$$ W_{i}^{ + } = \ln \frac{{P\left( {B{{\backslash }}A} \right)}}{{P(B{{\backslash }}\overline{A} )}} $$
(7)
$$ W_{i}^{ - } = \ln \frac{{P\left( {\overline{B} {{\backslash }}A} \right)}}{{P(\overline{B} {{\backslash }}\overline{A} )}} $$
(8)

where P is the conditional probability, A is the presence of flood pixels at B flood conditioning factor while over line bar denotes the absence of A and B.

The difference between Wi+ and Wi is expressed as contrast (C). The variances of Wi+ and Wi are calculated as given in Eqs. 9, and 10 in terms of the number of flooded and non-flooded pixels in a class (N). Standard deviation of the contrast (S(C)) is calculated by Eq. 11. So, weight of each class (W) can be calculated by Eq. 12.

$$ S^{2} (W_{i}^{ + } ) = \frac{1}{{N\left( {B{{\backslash }}A} \right)}} + \frac{1}{{N(B{{\backslash }}\overline{A} )}} $$
(9)
$$ S^{2} (W_{i}^{ - } ) = \frac{1}{{N(\overline{B} {{\backslash }}A)}} + \frac{1}{{N(\overline{B} {{\backslash }}\overline{A} )}} $$
(10)
$$ S(C) = \sqrt {S^{2} (W_{i}^{ - } ) + S^{2} (W_{i}^{ + } )} $$
(11)
$$ W = \frac{C}{S(C)} $$
(12)

W values of each class of flood conditioning factors were calculated and the flood hazards susceptibility map was created by summing all the flood conditioning factors in ArcGIS environment.

2.4.4 Index of entropy (IoE)

IoE proposed by Shannon (1948) is revealed to determine the disorderliness in the distribution of values of variables derived from information theory as a measure of redundancy in data (Arora et al. 2019). This approach was then adopted in flood susceptibility studies to understand the weight of contribution of conditioning factors (Haghizadeh et al. 2017). Given Eqs. (1318) were followed to calculate the parameters, and the flood hazards susceptibility mapping is obtained by Eq. 19 in ArcGIS environment.

$$ P_{ij} = FR $$
(13)
$$ (P_{ij} ) = \frac{{P_{ij} }}{{\sum\nolimits_{j = 1}^{Sj} {P_{ij} } }} $$
(14)
$$ H_{j} = - \sum\limits_{{{i} = {1}}}^{{{Sj}}} {\;{(P}_{{{ij}}} )} \log_{2} {(P}_{{{ij}}} ) $$
(15)
$$ H_{j\max } = \log_{2} {(S}_{{j}} ) $$
(16)
$$ I_{j} = \frac{{{H}_{{{j\max}}} -H_{j} }}{{{H}_{{{j\max}}} }} $$
(17)
$$ W_{j} = I_{j} \;P_{ij} $$
(18)
$$ IoE = \sum\nolimits_{i = 1}^{\;n} {\frac{z}{{m_{i} }}\;} CW_{j} $$
(19)

where Pij = frequency ratio, (Pij) = probability density, Hj and Hjmax = entropy values, Sj = number of classes, Ij = information coefficient, Wj = weight of the conditioning factor, i = number of particular conditioning factor map, z = number of classes within the conditioning factor with the greatest number of classes, mi = number of classes within particular conditioning factor map, and C = value of the class after secondary classification (Arora et al. 2019).

2.4.5 Principle component analysis (PCA)

PCA is generally conducted to understand the underlying structure in which how the data was contributed to the total variance explained by factor analysis. In this study, flood conditioning factors were extracted based on the initial eigenvalues of components using SPSS. Therefore, a sample of 500 random points in the study area was generated in ArcGIS (Arabameri et al. 2019). Values of flood conditioning factors at those points were extracted using ArcGIS, and the data were processed based on the initial eigenvalue of 0.95 which explains the total variance as 68%, in SPSS. So, five components were found to be more explanatory. Since the extracted components by PCA method did not have strong cross-correlation coefficients, the rotated factor loading matrices were used (Malik et al. 2019). Flood conditioning factors with the greatest correlation coefficient for each component were determined from the rotated component matrix. Weights of the component were determined as the ratio of the cross-correlation coefficient of that component and the total cross-correlation coefficients (Malik et al. 2019). Then, all classes of dominant flood conditioning factors were ranked starting from 2 to 10 based on the Bel value and raster maps were created, accordingly (Benjmel et al. 2020). So, flood hazard susceptibility map was processed by the summation of the multiplication of the ranks and factor weights.

2.4.6 Fuzzy logic (FL)

According to FL theory, membership value of flood hazards conditioning factors has varying degrees of support and confidence in the range of 0 and 1. Fuzzy classes were obtained by considering the maximum and minimum FR and WoE values of a factor by corresponding that 0 is set for minimum and 1 is set for maximum. A value of one indicates full membership while zero demonstrates minimum membership of the class. Inner class values bounded with the minimum and maximum are computed based on the membership functions. In the study, flood conditioning factors at each class calculated using FR and WoE, separately, were transformed based on a linear membership type of raster maps varying between 0 and 1 in ArcGIS environment. Fuzzified raster maps of flood conditioning factors were overlaid by gamma method which is a function of fuzzy algebraic product and fuzzy algebraic sum in which a default value of gamma of 0.9 was used, converging to fuzzy and method.

2.4.7 Analytical hierarchy process (AHP)

In AHP method, criteria weights of the flood conditioning factors were implemented by hybridization of IoE and compound factor. Wj values of each conditioning factors were sorted in ascending order and then graded from 1 to 10 based on compound factor method. Pair-wise comparison matrix of flood conditioning factors was created based on those compound values and the classification scale proposed by Saaty (1980). The criteria weight vectors of flood conditioning factors were obtained from pair-wise comparison matrix.

The inconsistency index (Ii) of pair-wise comparison matrix was computed to examine the consistency of flood conditioning factors (Eqs. 20, 21).

$$ I_{i} = \frac{{\lambda_{{{\max}}} { - n}}}{{{n - 1}}} $$
(20)
$$ I_{r} = \frac{{I_{{i}} }}{{R_{{i}} }} $$
(21)

where λmax = the largest eigenvalue of the pair-wise comparison matrix, n = number of flood conditioning factors, Ir = inconsistency ratio, and Ri = random index. For n = 11, Ri value can be approximated as 1.51. It should be noted that consistency of the factors is trustworthy when Ir is less than 0.10 (Saaty 1980). In this study, consistency of factors is satisfied since Ir value was calculated as 0.008.

Flood hazard susceptibility map was then processed by summing up the multiplication of the ranks, as mentioned in PCA section by criteria weights.

2.4.8 Technique for order preference by similarity to an ideal solution (TOPSIS)

Flood hazards susceptibility map of the study area was processed by TOPSIS method using the criteria weights obtained from AHP and PCA methods, separately. Decision matrix was created using flood conditioning factor values at generated random 500 points, as aforementioned in PCA sub-section, and then the decision matrix was normalized. Weighted normalized decision matrices were calculated using factor weights obtained from both AHP and PCA, separately. Of using Euclidean distance, positive and negative ideal solutions were computed considering the benefit or cost exhibition of the flood conditioning factors based on linear fitting of class number and Bel function. When the tendency line of the Bel value and the class number has positive slope, this factor was then thought to be cost. Watershed order, TWI and Er factors exhibited cost performance while the rest of the factors exhibited benefit performance. The closeness coefficients of random points were then determined. Flood susceptibility map of the study area was created by interpolating a raster surface from values of closeness coefficients of random points using Inverse Distance Weighting (IDW) method in ArcGIS since IDW method performed better results than Krigging method.

2.4.9 VlseKriterijumska optimizacija I Kompromisno Resenje (VIKOR)

Similar procedures adopted in TOPSIS method was followed, but some differences in techniques were employed in VIKOR method by definition. Instead of closeness coefficient determined from Euclidean distance of positive and negative ideal solutions, a compromising solution was introduced by using the utility, regret indexes, and the weight of the compromise strategy (Opricovic and Tzeng 2004).

2.5 Validation of flood hazards susceptibility maps

After creation of flood hazards susceptibility maps using various methods, the model results at flooded and non-flooded points were extracted from the model to obtain both training and testing data to be used in assessment of the prediction capability of the models using various statistical indicators. Thus, a receiver operating characteristic curve (ROC) analysis was conducted to examine the sensitivity, specificity, and accuracy of the estimated results involved assessment of not only more susceptible zones to flooding, but less susceptible zones, as well. In ROC analysis, 1-specificity in x-direction (Eq. 22) vs. sensitivity in y-direction (Eq. 23) was presented, and some statistical indicators as specificity, sensitivity, accuracy (Eq. 24), and kappa index (Eq. 25) were computed as follows:

$$ x = 1-{\text{specificity}}\; = \;1 - \frac{{\text{TN}}}{{{\text{TN}} + {\text{FP}}}} $$
(22)
$$ y = {\text{sensitivity}}\; = \frac{{\text{TN}}}{{{\text{TP}} + {\text{FN}}}} $$
(23)
$$ {\text{Accuracy}}\; = \frac{{\text{TP}} + {\text{TN}}}{{{\text{TP}} + {\text{TN}} + {\text{FP}} + {\text{FN}}}} $$
(24)
$$ Kappa\; = \;\frac{{P_{c} - P_{\exp } }}{{1 - P_{\exp } }} $$
(25)

where TN = true negatives, FP = false positives, TP = true positives, FN = false negatives, Pc = number of pixels to be classified correctly, and Pexp = expected agreement. As the ROC parameters increased, the prediction capability of the models implemented also increased. Furthermore, Kappa index was assessed quantitatively and qualitatively: poor (0–0.4), moderate (0.4–0.55), good (0.55–0.85), excellent (0.85–0.99), and perfect (0.99–1) (Monserud and Leemans, 1992; Kanani-Sadat et al. 2019).

The area under the ROC curve (AUC) is also associated with efficiency of the predicted susceptibility maps especially for flooded pixels. The AUC values can be interpreted as: weak (0.5–0.6), moderate (0.6–0.7), good (0.7–0.8), very good (0.8–0.9), and excellent (0.9–1) (Yesilnacar and Topal 2005).

Model results were also classified using natural break method in ArcGIS in five categories as very low (VL), low (L), moderate (M), high (H), and very high (VH) flood hazards susceptibility. This enabled to further understand the behavioral compatibility of spatial distribution of the predictions. So, another significant indicator, seed cell area index (SCAI) given in Eq. 26 was used to determine the categorized class density per flood pixel density in the class. Based on the SCAI values in a flood hazards susceptibility class, it is a reasonable judgment to make that the values of SCAI increase as the susceptibility class inversely decreases. Hence, a linear relationship between the susceptibility class and the SCAI values was fitted to decide about the method, which is more compatible with SCAI definition. It is also surmised that the greater negative inclined-fitted line with a reasonable coefficient of determination may give better classified maps.

$$ {\text{SCAI}} = \frac{{\text{Percentage}}\;{\text{of}}\;{\text{the}}\;{\text{class}}\;{\text{pixels}}}{{{\text{Percentage}}\;{\text{of}}\;{\text{the}}\;{\text{flood}}\;{\text{hazards}}\;{\text{pixels}}\;{\text{on}}\;{\text{each}}\;{\text{class}}}} $$
(26)

3 Results and discussion

Multicollinearity test was conducted to specify the VIF values of each conditioning factor (Table 1). Since VIF values are less than 10, all the conditioning factors were considered in flood hazards susceptibility mapping. Class ranges of flood conditioning factors obtained from natural break method, pixels in domain and flooding pixels in the class are presented in Table 2 which also shows the results of FR and EBF methods. It can be said that flood pixels were concentrated on the specific class in elevation, slope, curvature, distance from river, TWI, and STI factors. For instance, there is a strong belief and high sensitivity that flood occurs in domain for elevation < 474.12, slope < 16.61º, curvature not convex, distance < 1589 m, TWI > 11.47, STI < 17.46. Moreover, flat surfaces, and Er > 0.50 pixels are found to be significant in producing flooding. Those results are also compatible with FR results. Greater FR or Bel values mean more prone area to flooding at that class. Flood hazards susceptibility maps obtained from FR, and EBF methods are given in Figs. 4 and 5, respectively.

Table 1 Multicollinearity test of flood hazards conditioning factors
Table 2 Flood conditioning factors at prescribed classes, and results of FR and EBF methods
Fig. 4
figure 4

Flood hazards susceptibility map using FR method

Fig. 5
figure 5

Flood hazards susceptibility map using EBF method

Table 3 shows the WoE and IoE results at prescribed classes. WoE results regarding the dominant and significant classes are also compatible with EBF and FR findings. However, some classes at a flood conditioning factor have a negative impact on flood. Final weights of IoE suggest that STI has a maximum weight of 25.87 while Dd has a minimum weight of 0.04 on flood conditioning. Flood hazards susceptibility maps processed with results of WoE and IoE methods are given in Figs. 6 and 7, respectively.

Table 3 WoE and IoE results of flood conditioning factors
Fig. 6
figure 6

Flood hazards susceptibility map using WoE method

Fig. 7
figure 7

Flood hazards susceptibility map using IoE method

Since the extracted values of 500 random points in the study area had inadequate cross-correlation in PCA method, the rotated component matrix was considered to specify five significant components (Table 4). Distance from river, TWI, Er, Dd, and aspect were found to be more significant for components 1, 2, 3, 4, and 5, respectively. Moreover, these flood conditioning factors were found to be in a good correlation. The Spearman correlation coefficients of five components were used to determine the weights of flood conditioning factors (Table 5). Weights of the parameters were closer to each other and varied between 0.17 and 0.24. Flood hazards susceptibility map implemented with more significant flood conditioning factors is given in Fig. 8.

Table 4 Significant components in PCA method
Table 5 Weights of the flood conditioning factors of PCA method
Fig. 8
figure 8

Flood hazards susceptibility map using PCA method

FR values and C/S(C) of WoE values at each class were fuzzified using linear membership type, separately. Flood hazards susceptibility maps of FL-FR and FL-WoE were overlaid using gamma and membership functions as shown in Figs. 9 and 10, respectively.

Fig. 9
figure 9

Flood hazards susceptibility map using FL-FR method

Fig. 10
figure 10

Flood hazards susceptibility map using FL-WoE method

Pair-wise comparison of flood conditioning factors in AHP method based on Wj values in IoE method, and compound factor, using Saaty’s scale (Saaty 1980), is given in Table 6. Weights of the flood conditioning factors given in Table 8 were computed by using ranks which varied between 2 and 10, given in Table 7. Flood hazards susceptibility maps based on AHP method were generated as shown in Fig. 11. In flood hazards susceptibility mapping, STI was found to have the most dominant impact while aspect and Dd had the least impact.

Table 6 Pairwise comparison of flood conditioning factors
Table 7 Ranking of classes of each flood conditioning factor
Table 8 AHP weights of flood conditioning factors
Fig. 11
figure 11

Flood hazards susceptibility map using AHP method

Flood hazards susceptibility maps determined by Euclidean distance-based TOPSIS method coupled with AHP and PCA weights were generated, as shown in Figs. 12, and 13, respectively. Flood hazards susceptibility maps obtained by VIKOR method also enabled comparison of the performance of two different decision models (Figs. 14, 15).

Fig. 12
figure 12

Flood hazards susceptibility map using TOPSIS-AHP method

Fig. 13
figure 13

Flood hazards susceptibility map using TOPSIS-PCA method

Fig. 14
figure 14

Flood hazards susceptibility map using VIKOR-AHP method

Fig. 15
figure 15

Flood hazards susceptibility map using VIKOR-PCA method

Figures 16 and 17 present the ROCs of flood hazards susceptibility maps estimated for training and testing processes generated from aforementioned methods. Sensitivity, specificity, accuracy, kappa index and AUC values computed from the extracted values from flooded and non-flooded points for both training and testing data are given in Table 9. Findings of the method which involved the training process based on statistical measures can be interpreted as follows: Sensitivity was calculated the best for IoE, and followed by AHP and EBF. Specificity of FL-FR for training and FL-WoE for testing was found to be the best performing model in their processes. Accuracy of EBF and IoE models for training, FR, and IoE for testing performed the best. Kappa index of IoE model gave the best result. AUC of PCA gave the best results. When excluding TOPSIS and VIKOR methods, AUC of other methods was assessed as excellent.

Fig. 16
figure 16

ROCs of training data of the applied methods

Fig. 17
figure 17

ROCs of testing data of the applied methods

Table 9 Statistical indicators computed from extracted values from flood hazards susceptibility maps

A framework assessment should be conducted to determine the efficiency of the methods in generating flood hazards susceptibility maps using statistical measures since values of those are conflicting, and do not concentrate on the same model. A simple-relative approach based on the compound factor was adopted such that statistical measures for both training and testing were graded from one for the best model to twelve for the worst. In case the model results gave the same statistical measure values, they were successively ranked and re-ranked using the average of successive grades. Ranks of the statistical measure were taken as the average of training and testing grades. Final grades were determined by summing up the grades of statistical measures (Table 10). It can be concluded from that assessment, and the linear membership of the grades in four categories that it can be concluded that IoE method gave the best results of all (Arabameri et al. 2019). Moreover, other statistical methods, FR, EBF, PCA and AHP, also gave very good results like IoE that cannot be contradistinguished. On the other hand, results of WoE and FLs models might be evaluated as good while TOPSIS and VIKOR models gave the poorest results.

Table 10 Overall assessments of performed models

Another assessment criterion for the interpretation of generated flood hazards susceptibility maps is based on the classification of five susceptibility levels determined by natural break method in ArcGIS (Table 11). In a well-classified flood hazards susceptibility map, an increasing flood hazards potential trend of SCAI with decreasing susceptibility level is expected, and this may be measured by fitting a line whose negative slope tends to become perpendicular. IoE method was not successful at capturing flood hazard pixels since flood intensity concentrated on moderate susceptibility class. In FR method, flood hazard pixels concentrate on high class, but SCAI values are in the desired range. In EBF method, flood hazards pixels concentrate on very high class that is a huge amount of the drainage area. PCA or AHP methods may be evaluated as reasonable models predicting the susceptibility classes.

Table 11 SCAI values at each class of flood conditioning factors of methods

In overall assessment, hybridization of IoE and EBF with AHP and EBF with PCA gave very good results since IoE and EBF themselves predicted flood hazards susceptibility map very good. PCA method slightly improved the prediction of flood hazards susceptibility maps in comparison with AHP. However, hybridization of AHP and PCA did not contribute to TOPSIS- and VIKOR-based flood hazards susceptibility maps. Moreover, dimension reduction by PCA method in overlaying flood conditioning factor maps inversely impacted the results in TOPSIS and VIKOR methods. Relatively poor predictions of TOPSIS and VIKOR methods may be due to the number of random points implemented by Arabameri et al (2019) adopted in this study may not be compatible with grid cell size of DEM and scale of the map used. The model prediction capability may be improved by increasing the number of random points, implementing different distance measures in TOPSIS and changing strategy coefficients in VIKOR methods in order to overcome under-fitting problem. That may even be more reasonable to improve the PCA model results. Sensitivity analysis of criteria weights of AHP and PCA may also further contribute to optimize the results of flood conditioning factors. Adaption by hybridization of FR and WoE methods inversely impacted FL methods. Especially FR method was seriously influenced by fuzzification method. Extensive research on membership function type may help to improve the prediction results. Probabilistic approaches using various operators may also improve the fuzzy logic results.

It is assumed that future flash flood hazards would occur under the same conditions as past flash floods. Moreover, flash floods are correlated with the spatial information of flood conditioning factors. Using those assumptions in adaptation of some models in order to predict the spatial distribution of flood hazards susceptibility that is a very complex phenomenon may result in some errors. Those assumptions may also lead to previously unseen generalization errors. Model results may misclassify the training data that causes to decrease in accuracy of the predictions. Random selection of flood hazard pixels which may not be representative samples may not consider the internal dynamics of flood hazards in the watershed. So, it should be kept in mind that both the quality of conditioning factors, representativeness of flood hazard pixels and the implemented model are crucial in prediction of model results with low generalization and training error. A model may predict flood hazards susceptibility map with low training error, but high test error. This over-fitting problem may be overcome by increasing the number of representative training data with low generalization error (Han et al 2012). In the study, since adopted methods excluding TOPSIS and VIKOR gave good results and those results was improved in testing process, the predicted results were found to be satisfactory.

Many studies were conducted to improve prediction capability of flood susceptibility. Some of the researchers adapt hybridization techniques while some adapts ensemble models. Basic statistical models may be considered as a strong tool to develop prediction results and reduce computational efforts. Results of the recent studies also revealed that hybridization with statistical model improved predictions. For instance, Wang et al. (2021) stated that probability density model hybridized with MLP gave the most efficient prediction results based on considering ROC analysis. New ensemble models may enable to be constructed on these statistical models. It should be kept in mind that some of the ensemble algorithms were re-modified based on inclusion of new assumption of the existing algorithms. On the other hand, statistical representation of class values in order to improve prediction results, and avoid computational effort may be employed in ensemble machine learning models in natural hazards management studies. So, hybrid and ensemble models can be combined in order to estimate the susceptibility. The outcomes of such studies may be discussed and those basic studies may be addressed in near future.

4 Conclusion

It is important to guarantee an asset management in terms of expectation of return of investments in a projected life cycle. So, risk perception for assets and new investments may be introduced in planning and design stages coupled with a management strategy according to a risk category. Especially, natural hazards seriously threaten the infrastructures, and even lives of human beings. So, it is inevitable to estimate nationwide flood hazards susceptible zones. In this study, a study area which frequently exposed to flash flood hazards was chosen and flood hazards data and flood conditioning factors of the study area were coupled with GIS-based approach to generate flood hazards susceptibility maps.

It is also important and challenging to decide about the method and the data to be used, and how to combine these since the methods involve interpretation and processing of data. So, various methods such as FR, EBF, WoE and IoE, and hybrid integration of PCA, FL, AHP, TOPSIS, and VIKOR were used to generate flood hazards susceptibility maps. In the study, generated flood hazards susceptibility maps were validated for both training and testing data sets using sensitivity, specificity, accuracy, kappa index, AUC, and SCAI. Some methods gave very good results while the other methods gave relatively poor results. Since the whole statistical measures did not point out the same method, a simple approach was adopted to decide the most acceptable method. So, PCA and AHP methods were found to be more accurate at prediction of flood hazards susceptibility mapping with regard to ROC analysis and SCAI variation. This study may be evaluated as a contribution to the similar efforts on the estimation of accurate flood hazards susceptibility maps by hybridization methods and assists policymakers who have to face challenging decisions on land use. Strategies to improve TOPSIS and VIKOR predictions may be conducted using ensemble algorithms using non-stationary models. Generated flood hazards susceptibility maps may be commonly used in design, operation, and inspection stages of any structure.