Introduction

Generally, flash flood is defined as a rapid beginning of flood in a short time that mostly has a high peak discharge; thus, this flood is usually caused by the rainfall that has a 1-h duration (Elkhrachy 2015). Annually, a variety of natural disasters such as floods, earthquakes, and landslides cause great damages to lives and properties throughout the world (Tierney et al. 2001; Tehrany et al. 2015a). The number of significant flood events in the world has meaningfully increased during the last three decades (Kourgialas and Karatzas 2011). Flood, which is the outcome of complex geological, geomorphological, and hydrological conditions, is probably the most devastating, widespread, and abundant natural disaster in the world (Gashaw and Legesse 2011) that leaves harsh socio-economic and environmental consequences behind (Wu and Sidle 1995; Glade 1998). The confluence of rivers is the prime place of concern for flood hazards (De Moel and Aerts 2011). As with the road flooding, the main concerns revolve around the damages mainly caused by high water levels after heavy rain showers in areas where rivers and roads intersect or where culverts and waterways are blocked by brushwood (Plate 2009). Intervention in natural conditions by human activities, such as road construction and timber harvesting, could increase the risk of occurrence of floods (Montgomery 1994; Chung et al. 1995). As a result of burgeoning population growth and the subsequent pressure, nature and behavior of floods have altered. Susceptibility to flood refers to the natural tendency to produce flood.

It should be noted that although flooding is inevitable and avoidance of it is impossible, the assessment and management of future floods can be achieved through proper analysis and forecasting methods (Cloke and Pappenberger 2009; Tehrany et al. 2015b). Identification of prone areas to floods or preparation of susceptibility maps of flooding is an important tool to mitigate future flood damages. Hence, by identifying locations with low susceptibility to flooding, suitable areas for developmental activities could be recognized (Sarhadi et al. 2012). Kourgialas and Karatzas (2011) state that flood management strategy consists of three phases: (1) pre-flood measures, (2) flood forecasting, and (3) post-flood activities; while Konadu and Fosu (2009) believe that the management of flood can be accomplished through four stages: anticipation, preparation, prevention, and damage assessment. Schanze (2006) divide flood risk management into flood risk assessment and mitigation. Obviously, strategies about impact of floods require identification of prone areas (Tehrany et al. 2013) to facilitate quick response, decrease the impact of possible flood events, and provide a means for early warning (Kia et al. 2012).

Previous works

In recent years, several approaches have been developed by hydrologists to model flood risk and hazard (Jayakrishnan et al. 2005; Bahremand et al. 2007). In general, it should be noted that traditional hydrological methods have not been able to meet the needs of comprehensive flooding susceptibility assessment in regional studies (Li et al. 2012; Tehrany et al. 2015b); these methods are mainly based on linear assumptions that, in this case, are inappropriate for watershed studies with non-linear structures (Liu and De Smedt 2004; Tehrany et al. 2013). Application of RS and GIS has been a great evolution in environmental sciences such as landslides, groundwater resources, and flood susceptibility mapping and has also provided new insights into flood assessment studies. In general, several researchers have developed various techniques in the field of environmental studies by the application of GIS and RS.

Among different methods, the most popular and widely used models based on GIS in zoning different natural disasters include frequency ratio (Lee et al. 2012; Tehrany et al. 2013; Youssef et al. 2014), weights-of-evidence (Mohammady et al. 2012; Pourghasemi et al. 2012a), logistic regression (Pradhan 2010a; Akgun 2012; Felicisimo et al. 2012; Nampak et al. 2014), analytical hierarchy process (Chen et al. 2011; Althuwaynee et al. 2014; Kazakis et al. 2015), artificial neural network (Oh and Pradhan 2011; Kia et al. 2012), support vector machine (Tehrany et al. 2014b), decision trees (Tehrany et al. 2013), Shannon’s entropy (Bednarik et al. 2010; Pourghasemi et al. 2012c; Sharma et al. 2013; Jaafari et al. 2014), weighted factor (Yalcin et al. 2011), statistical index (Pourghasemi et al. 2013), and fuzzy logic (Pourghasemi et al. 2012b).

Kazakis et al. (2015) in a study entitled the assessment of flood hazard areas at a regional scale in Greece used index-based approach and AHP models. Finally, a comparison between the flood maps was produced and the historical events showed that the model was capable of flood hazard mapping.

Tehrany et al. (2015a) combined the two support vector machine (SVM) and frequency ratio (FR) models in the Kelantan Watershed. The results showed that the hybrid model had slightly better performance than the DT model.

Tehrany et al. (2015b), using SVM models with four different functions, prepared the susceptibility maps of flooding in the Kuala Terengganu Watershed in Malaysia. They finally stated that SVM-RBF model with the AUC of 84.97% had the highest level of accuracy among four different functions. By measuring Cohen’s kappa, the researchers claimed that, except for surface runoff factor, other factors had a positive impact on flooding.

Youssef et al. (2016) performed an analysis on the susceptibility to flooding in Jaddah, Saudi Arabia, by applying FR and LR methods and their combination. They stated that the hybrid approach, with the AUC of 91.3%, had been more reliable and accurate than the FR model with the AUC of 89.6%.

According to the literature mentioned above, the Shannon’s entropy data-mining technique is one of the new methods applied in environmental sciences such as groundwater and landslide studies. Yet, this approach is new to the flooding assessment studies. Application of the statistical index (SI) and weighting factor (Wf) is completely novel in the field of flood susceptibility zoning, while it has been commonly used in the landslide hazard assessment. Hereby, the aim of this study is flood susceptibility mapping in the Haraz Watershed, Mazandaran Province, by deploying Shannon’s entropy, statistical index, and weighted factor methods. The efficiency of these models will also be taken into account. One of the other objectives of this study is to determine which factors have the highest impact on the incidence of flooding in the study area. This watershed witnesses many floods annually that cause widespread losses and property damages. Despite the fact, no appropriate measures have been contemplated in these areas to prevent or mitigate the damages.

Study area

The Haraz Watershed is located in Amol County, between 51° 43′ to 52° 36′ E and 35° 45′ to 36° 22′ N. The total area enclosed in this watershed measures roughly 4015 km2. The elevation ranges from 328 m at the bottom to about 5595 m at the rim. Nour, Akhensar, Shirkolaroud, and Namarestagh are some of the main rivers flowing in the area. Major elevations of the area include the mounts of Damavand, Shimkouh, Emamzadeh Ghasem, Yakhli, Zanjirband, and Lou. The main settlement areas are Polor, Nashal, Tiran, Rineh, Kandovan, Ab-e-Ask, Gazanak, Baijan, Belghalm, Belde, and Nour. Figure 1 provides an overview of the study area. Due to extreme flood incidences and the resulting life and property damages, the Haraz Watershed has been chosen as the case study area.

Fig. 1
figure 1

Location of flooding point. Case study position and its hillside

Methodology

According to Tehrany et al. (2014b), the analysis of susceptibility to flooding is one of the most imperative issues in river hydrology, which has been carried out in the current study by considering three methods of SE, SI, and Wf in the Haraz Watershed. The flowchart of this research is presented in Fig. 2.

Fig. 2.
figure 2

Flow chart of methodology in this research

Data collection

Flash flood historical mapping

Flood inventory maps are necessary for the study of the relationship between floods and their causative factors. To prepare flood susceptibility map, the first step is to obtain the appropriate data and create a spatial database. It is vital to provide the necessary data with high accuracy (Jebur et al. 2013). In this study, flood inventory map was prepared by the historical floods of 2002, 2006, and 2013 according to the documentary sources and extensive field inventories. In the meantime, a total of 211 flood points on the scale of 1:25,000 were identified, as shown in Fig. 1.

Causative factors

To develop a methodology for evaluating the susceptibility to flooding, it is necessary to determine the causative factors and their relationship with flood occurrences (Liu and De Smedt 2005; Pradhan 2009). In fact, regional assessment of floods should be practical and applicable for the study area; thus, the input parameters should represent reliable and simply achievable factors (Oh and Pradhan 2011). For this study, ten flood causative factors were selected based on literature review, as follows: slope angle, plan curvature, altitude, topographic wetness index (TWI), stream power index (SPI), distance from river, rainfall, geology, land use, and normalized difference vegetation index (NDVI) (Figs. 3a–j). All factors were subsequently converted into raster maps with the pixel sizes (resolution) of 20 × 20 m.

Fig. 3
figure 3figure 3

Flood causative factors in Haraz Watershed: a slope angle, b plan curvature, c altitude, d topographic wetness index TWI, e stream power index SPI, f distance from river, g rainfall, h geology, i land use, and j NDVI

Digital elevation model (DEM), as a principal source for extracting topographic factors, is one of the most important flood causative factors which has been used in various research works (Tehrany et al. 2013; Youssef et al. 2014). Topography plays an absolutely crucial role in spatial variability of hydrological conditions such as soil moisture and groundwater flow. Extracted topographical factors include slope, plan curvature, altitude, TWI, and SPI. One of the most important factors affecting flood in the area is slope (Tehrany et al. 2013). As slope steepness increases, the rate of water infiltration is reduced and the velocity of water increases; hence, huge volume of runoff reaches the river and flat area and leads to flooding. Thus, any increase in slope gradient could result in more runoff and increase in its velocity, but floods commonly take place in areas with less slopes. Slope map of the region was divided into five classes ranging from 0° to 66.8°, which are shown in Fig. 3a. Plan curvature is another important factor for flooding. In this regard, flat areas have become exposed to the highest potential of flood occurrence. The plan curvature represents the morphology of the topography; therefore, to map the plan curvature in GIS, negative values were treated as Concave, zero as flat and positive numbers as convex surfaces (Oh and lee 2010). As noted, the plan curvature map was divided into three classes, which are shown in Fig. 3b. Another important factor for flood occurrence is altitude. The research conducted by Botzen et al. (2013) states that the occurrence of flood is almost impossible in high altitudes. Altitude of the area was accordingly classified into nine classes ranging from 328 to 5595 m, as illustrated in Fig. 3c. TWI represents the accumulation of flow at any point in the catchment area as the result of the runoff tendency to travel down the slope by the force of gravity (Gokceoglu et al. 2005), and this is closely related to the soil moisture content.

SPI represents the erosive power of water flow (Jebur et al. 2014). TWI and SPI maps were prepared in SAGA-GIS Software and finally divided into ten classes, which are shown in Fig. 3d, e. Another very important factor that is considered in this research is distance from the river (Fig. 3f), as it affects the velocity and extent of floods in the region (Glenn et al. 2012). Precipitation map was prepared from 20-year rainfall data records of 17 rain gauges (Khosravi et al. 2016) and then was classified into nine classes (Fig. 2g). Geological layers were acquired in the shape file format from the Regional Water Department of Sari City. This factor was reproduced from the geological map at a scale of 1:100,000. Geological map was classified into three classes including Mesozoic, Paleozoic, and Cenozoic (Fig. 2h). Geological characteristics of the study area are summarized in Table 1. Most of the mountains and reliefs in the region, such as Damavand, are within the Cenozoic class. Using the Operational Land Imager (OLI) sensor images of the Landsat 8 satellite, land use and NDVI maps were prepared for year 2013 in ENVI 5.1 software. For land use mapping, neural network algorithms and supervised classification were applied. Land use map was divided into seven classes of water bodies, residential, rangeland, orchards, irrigated land, forest, and bare land (Fig. 2i). The majority of the watershed was devoted to the rangeland class. The range of NDVI index values varied between +1 and −1 that show a measure of vegetation characteristic of an area. The negative values indicating the water, values close to zero (−0.1–0.1) show the barren areas, low positive values (0.2–0.4) represent the grasslands and shrub, and high values show the rainforests and dense forest. NDVI index map was divided into ten classes (Fig. 2j).

Table 1 Lithology of the Haraz Watershed

Application of BSA

Shannon’s entropy model

Entropy is a measure of disorder, instability, imbalanced behavior, energy distribution, and uncertainty in a system (Yufeng and Fengxiang 2009; Pourghasemi et al. 2012a). In sum, entropy means the quantity of irregularities between causes and results or decisions on various debated topics (Wan 2009). Entropy is the representation of evenness, in which the groups are equally and uniformly distributed among organizational units (Massey and Nancy 1988). Entropy index is regarded as a measure of the average difference of the proportion of unit groups from the total system (Theil 1972; Naghibi et al. 2014). Shannon entropy is function of probability distribution and the criterion for measurement of the amount of its uncertainty (Hosseinpoor Milaghardan and Abbaspoor 2015) as follows:

$$ {E}_i=S\left({P}_1,{P}_2,,.....{P}_n\right)=-M{\displaystyle \sum {P}_i\ Ln\ {P}_i}\kern0.5em i=1,2,\dots, n $$
(1)

where P i is probability distribution function and M is the constant value for adjustment of entropy (between 0 and 1) (Eq. 1). If all the values of P i equal each other (P 1 = P 2 = … = P n), then for all of the i and j values (Asgharizadeh and Nasrolahi 2006), calculation is as follows:

$$ {P}_i=1/n $$
(2)

If there are n alternatives and k criteria in the decision matrix, the result of the matrix for the jth criterion is calculated as follows:

$$ \begin{array}{ccc}\hfill {P}_{ij}=\frac{f_j\left(\mathrm{ai}\right)\ }{{\displaystyle {\sum}_{i=1}^n{f}_j\left(\mathrm{ai}\right)}};\hfill & \hfill j=1,2,\dots, n\hfill & \hfill \forall j\hfill \end{array} $$
(3)

There is a one-to-one relationship between the quantity and value of the system entropy and the level of irregularity that is called Boltzmann principle and is commonly used to display the thermodynamic condition of a system (Yufeng and Fengxiang 2009). Shannon has been modified based on the Boltzmann model, and it uses the entropy model for the theory of information (Pourghasemi et al. 2012a). Several important factors introduce excess entropy to the system. Entropy values can be used to calculate the target weight of the index system (Yang et al. 2010). Shannon entropy (SE or E j ) can be calculated as:

$$ {E}_i=-M{\displaystyle \sum_{i=1}^n\begin{array}{ccc}\hfill {P}_{ij}\hfill & \hfill Ln\;{P}_{ij};\hfill & \hfill {\forall}_j\hfill \end{array}} $$
(4)
$$ M=\frac{1}{Ln\kern0.5em n} $$
(5)

where n is the number of that; in this study, it is the number of classes for each causative factor. Degree of diversification (d j ) shows how much proper information the jth criterion gives to the decision maker.

$$ \begin{array}{cc}\hfill {d}_j=1-{E}_j;\hfill & \hfill \forall \mathrm{j}\hfill \end{array} $$
(6)

Finally, the weight could be calculated based on Eq. 7 as follows:

$$ \begin{array}{cc}\hfill {V}_j=\frac{d_j}{{\displaystyle \sum_{j=1}^n{d}_j}};\hfill & \hfill {\forall}_{\mathrm{j}}\hfill \end{array} $$
(7)

Statistical index model

The SI bivariate statistical method has been introduced by van Western (1997) for landslide susceptibility mapping. It requires selection and mapping of the important parameters and their classification into relevant classes, flood inventory mapping, overlaying the flood inventory map with each causative factor, determination of the density of floods in each class of parameters, and calculation of weight values. Subsequent steps are assigning the values to various parameters, overlaying layers, and preparation of the final susceptibility map for each land unit (Aleotti and Chowwdhury 1999; Yalcin 2008). Weight values for each class of parameters are calculated as the natural logarithm of flood density of the class divided by the total flood density of the map (Van Western 1997; Rautela and Lakhera 2000; Cevik and Topal 2003; Pourghasem et al. 2013). Implementation of SI method is based on the following equation (Van Westen 1997):

$$ {W}_{SI}=Ln\left(\frac{E_{ij}}{E}\right)=Ln\left(\frac{L_{ij}/{L}_T}{P_{ij}/{P}_L}\right) $$
(8)

where W SI is the assigned weight to the given class i of the parameter j; E ij is the flood density in the class i of the parameter j; E denotes the total flood density within the entire map; L ij is the number of flooding locations in the class i of the parameter j; L T is total number of floods in the entire map; P ij is the number of pixels in the class i of the parameter j; and P L is the total number of pixels in the entire map.

This method is based on statistical correlation between the flood inventory map and characteristics of various parameters (Yalcin 2008). In this study, all of the parameters were crossed with the flood inventory map, and the resulting flood density in each class of parameters of interest was calculated. The correlation outcomes were sorted in the resultant raster, and flood density was calculated for the classes of parameters. The SI values were calculated for each of the features. Finally, flood susceptibility mapping was performed by overlaying the different layers.

Positive values of W SI indicate the proper and robust relationship between the classes and flooding distribution such that the stronger the relationship is, the higher the score will become. Negative values for W SI mean that there is no correlation between the class and the flood occurrence.

Weighting factor model

As the FR model for determination of its weights used the SE model, Wf model utilities the SI value to determination of its weights. In other words, Wf method can be regarded as the modified version of the statistical index (Cevik and Topal 2003; Oztekin and Topal 2005; Yalcin 2008). In these methods, the Wf weights are calculated by the procedures that SI considers for determination of each attribute and each SI weight. Firstly, flood inventory map is rasterized, and then, it is crossed with other rasterized parameters. SI values are calculated for each flood pixel in each layer. Then, the values of all the pixels belonging to each layer are summed. Eventually, the weighting factor values for each layer are calculated in a range of 1 to 100 by the following equations (Yalcin 2008; Yalcin et al. 2011).

$$ TSI={\displaystyle \sum_{i=1}^nSI\times \mathrm{S}.pix} $$
(9)
$$ {W}_{Wf}=\frac{\left({TSI}_{\mathrm{value}}\right)-\left({\mathrm{MinTSI}}_{\mathrm{value}}\right)}{\left({\mathrm{MaxTSI}}_{\mathrm{value}}\right)-\left({\mathrm{MinTSI}}_{\mathrm{value}}\right)}\times 100 $$
(10)

where TSI is the total value of the flooding pixels of a given class of parameters of interest, n the number of classes of the parameter, W wf the weight factor for each layer of maps (final weight), MinTSI the minimum value of the total weights among the layers, and MaxTSI maximum value of the total weights among the layers. Wf values of each layer are multiplied by SI values of the classes of the maps. At last, all the factors are added together and flood susceptibility map is prepared according to the Wf method (Yalcin 2008; Yalcin et al. 2011).

Result and discussion

FSM using SE model

Shannon’s entropy results with the relationship between the occurrence of floods and the flood factors are shown in Table 2. Slope angle, plan curvature, and altitude were given weights equal to 0.719, 0.134, and 1.711, respectively, which reflect the fact that among topographical factors, the most important factor is altitude followed by slope angle and plan curvature.

Table 2 Spatial relationship between flood causative factors and flooding locations using SE

TWI and SPI were assigned weights of 0.388 and 1.054, indicating that the SPI was the most important factor among the hydrological factors. The calculated weights for other causative factors were as follows: distance from river (1.236), rainfall (0.365), geological condition (0.008), land use (1.09), and NDVI (1.89). Overall, the most and the least important factors for flooding were distance from river and geology, respectively. Finally, the flood susceptibility mapping (FSM) prepared by SE (Fig. 4a; Eq. 11) showed that very low, low, medium, high, and very high covered 13.57, 26.66, 30.04, 21.53, and 8.17% of the area, respectively.

$$ \begin{array}{l}{FSM}_{SE}=0.719\times \mathrm{slopeangle}+0.134\times \mathrm{plancurvature}+1.711\times \mathrm{altitude}+0.388\times TWI+1.084\times SPI+\hfill \\ {}2.11\times \mathrm{distancefromriver}+0.554\times \mathrm{rainfall}+0.008\times \mathrm{geology}+1.091\times \mathrm{landuse}+1.899\times \mathrm{NDVI}\hfill \end{array} $$
(11)
Fig. 4
figure 4

Flood susceptibility mapping using a SE model, b SI model, and c WF model. F equal to flood and N equal to non-flood location

FSM using SI model

Weights derived by this procedure for each class of each layer are shown in Table 3. According to Table 3, for slope angle classes of 0–5.7 and 5.7–15.9, positive weight values of 1.78 and 0.78 have been obtained, respectively, while higher slope classes have gained negative weights, indicating that higher correlation values between the occurrence of floods and the class would result in larger positive values and lower correlation values between the occurrence of floods and the class would result in low and negative values.

Table 3 Spatial relationship between flood causative factors and flooding locations using SI and Wf

Naturally, because the flood occurs in an area with low slope, the weights that are obtained using SI method should be positive for low slope with decrease in slope and the SI value should increase. The value for the convex class of the plan curvature parameters was negative, while it was positive for the other two classes (concave = 0.51 and flat = 0.14). With respect to these weights, it can be concluded that the concavely curved and flat surfaces are more likely prone to flooding than the convex surfaces, which corresponds to the findings of Tehrany et al. (2015a, b). In case of altitude, 328–2456 and 2456–2770 m obtained positive and negative weights, respectively, while no flood occurrence was recorded for higher altitudes. The highest weight was for the low-altitude class (1141–328 m), which is consistent with the normal behavior of flood in reality. For lower values of TWI Factor, negative weights were obtained; while for higher values, positive weights were obtained. In general, we can say that by increasing TWI, the probability of flooding increases. The highest weight obtained was equal to 1.69, corresponding to the TWI of 7.1–8. The flooding locations recorded for model training for the SPI factor occurred in the range of 0 to 1,810,539 with no registered floods for higher values. In the domain of flooding locations, by increasing the SPI value, high weight was acquired. For lower SPI values in the range of 0 to 27,432, negative weight was obtained, while the maximum weight occurred in the range of 1,234,458 to 1,810,539.

The sixth factor considered in this study was distance from river, which according to the review of literature, was one of the most important factors affecting floods (Tehrany et al. 2015b). For this parameter, only the first class (i.e., 0 to 500 m) gained positive weight, while other classes were negatively weighted; this reduction of weight was held by succeeding farther from the river. Thus, by this interpretation, with increasing distance from the river, the probability of flooding decreases.

For the rainfall factor, increase in rainfall resulted in smaller weights, meaning reduced probability of flooding by increasing rainfall. This is best explained by the fact that higher altitude receives higher precipitation, while these areas are marked with much less probability of flooding. Positive weights were obtained for three classes of 183–267, 267–329, and 329–375 mm, turning into negative values for the next three classes from 375 to 468 m with no registered flooding for precipitation exceeding 468 mm.

Haraz Watershed geology consists of Paleozoic, Mesozoic, and Cenozoic formations weighted 0.23, 0.02, and −0.06, respectively. By this notion, the geology formations of Paleozoic are the primary factor affecting flooding due to higher positive weight, with Cenozoic unit as the least important. For land use, the maximum weights were of the water bodies at 3.57, followed by bare lands at 2.1, and finally residential areas at 1.04. For rangeland with a weight of −0.06 and forest with −0.58, there is negative correlation between these factors and floods. Despite the presence of 132 out of 151 flood locations in the rangeland areas, the weight value was negative due to extensive rangeland cores compared with the distribution of flooding locations. In orchards and agricultural land use types, no flood situations have been registered. The last intended factor was NDVI, which for Haraz Watershed obtained a range of −0.69 and −0.58. No special relationship cannot be established between these parameters and the positive and negative weights. In the ten classes related to this factor, the weight for the first two classes was positive; for the next three classes was negative; for the next four classes was positive; and for the final class was negative. The highest weight was for the first class with a range of −0.35 to −0.69 and with the weighting of 3.38. Finally, according to Eq. 12, map of flood susceptibility by the SI method (Fig. 4b) was created:

$$ \begin{array}{l}{FSM}_{SI}=\mathrm{slope}\;{\mathrm{angle}}_{SI}+\mathrm{plan}\;{\mathrm{curvature}}_{SI}+{\mathrm{altitude}}_{SI}+{TWI}_{SI}+{SPI}_{SI}+\mathrm{distancefrom}\;{\mathrm{river}}_{SI}\hfill \\ {}+{\mathrm{rainfall}}_{SI}+{\mathrm{geology}}_{SI}+{\mathrm{landuse}}_{SI}+{\mathrm{NDVI}}_{SI}\hfill \end{array} $$
(12)

The map was classified based on the natural break classification scheme into low, low, moderate, high, and very high susceptibility classes.

Most of the area, by 31.17%, was devoted to the low susceptibility to flooding, while the least area was identified by the very high susceptibility of 0.58% (22.07 km2). The area with high susceptibility to flooding covered 662.2 km2 that equals 17.57% of the study area.

FSM using Wf model

The last method considered for assessment of susceptibility to flooding in the Haraz Watershed was the Wf approach. The weights obtained by this approach can be attained from the SI method; put simply, WF method receives its weights from the SI method and the final weight is calculated as the combination of the both weights shown in Table 3. The Wf method, same as SE method, is applicable for the calculation of the weights of the layers; but Wf method ranges from 0 to 100. In this method, the most important factors for the flooding were distance to river with a weight of 100, altitude with a weight of 71.4, TWI with a weight of 42.2, and slope angle with a weight of 40.1. The least important factor in the flooding, according to the results of the recent method, was geology. The map obtained from Eq. 13 was classified into five classes of susceptibility to flooding (Fig. 4c). Percent areas of the susceptibility classes of very low, low, moderate, high, and very high were 33.89, 28.25, 22.43, 9.3, and 6.1%, respectively.

$$ \begin{array}{l}{FSM}_{Wf}=40.15\times \mathrm{slope}\;{\mathrm{angle}}_{SI}+10.58\times \mathrm{plan}\;{\mathrm{curvature}}_{SI}+71.42\times {\mathrm{altitude}}_{SI}+42.21\times {TWI}_{SI}+19.26\times {SPI}_{SI}+\hfill \\ {}100\times \mathrm{distance}\;\mathrm{from}\ {\mathrm{river}}_{SI}+14.63\times {\mathrm{rainfall}}_{SI}+1\times {\mathrm{geology}}_{SI}+9.33\times {\mathrm{landuse}}_{SI}+27.66\times {\mathrm{NDVI}}_{SI}\hfill \end{array} $$
(13)

Validation of the FSMs

Validation is one of the primary and important steps in the development and evaluation of the obtained maps from different methods (Pourghasemi et al. 2012a), without which no models would be regarded as scientifically sound (Chang and Fabbri 2003; Nampak et al. 2014). Efficiency of flood susceptibility model is usually estimated by independent information that is not available to build a model. The total of 211 flood locations recorded in the Haraz Watershed were randomly divided into two groups, 70% (151 location) and 30% (60 location), which were respectively used for the purposes of modeling and validation. In the current study, the following two methods were used for validation of the produced maps of susceptibility to flooding by the SE, SI, and Wf methods.

Histograms

To implement this method, in addition to the recorded floods, non-flooded locations were also utilized. For recording of 211 non-flooded locations as random spots on the hills, mountains, and places, where there were no flood events, Google Earth images were used (Tehrany et al. 2014a). Non-flooded locations were divided into two classes of 70% (151 positions) and 30% (60 locations); 30% of flooded and non-flood locations were used for the preparation of the histogram. Initially, susceptibility map was broken into ten levels (Dai and Lee 2002; Shirzadi et al. 2012); then, it overlaid the map of flooded and non-flooded locations and the final histogram was produced (as illustrated in Fig. 5). If the achieved maps are accurate, 30% of flood locations should be located in high-susceptibility classes (right wing of the graph) and 30% of non-flooded location should be located in low-susceptibility classes (left wing of the graph) (Meng et al. 2015). Therefore, no flooding locations must fall into the low to very low classes, and this also holds for the highly accurate and valid non-flooded locations. Thirty percent of the non-flooded locations must fall into the lower tails of the graph and must not fall into the high to very high classes; otherwise, the results of the model would not be sufficiently robust.

Fig. 5
figure 5

Histogram of the predicted flood susceptibility for the validation data set (flooding and non-flooding location)

The results indicated that the Wf and SI models were comparatively more accurate, since most of the flooding locations fell into the high to very high susceptibility classes. However, given the other two classes, it is clear that the SI model is particularly accurate (there were flooding locations in the moderate class in the Wf method, while none was observed in the SI). SE model did not work properly (in comparison to SI and Wf) as there were flooding locations, even in the fourth class, contributing to the low and very low susceptibility classes.

ROC curve

This method is one of the most popular (Pradhan 2010a, b; Pourghasemi et al. 2013; Tehrany et al. 2015a, b; Youssef et al. 2016) techniques to evaluate efficiency of the models, since it estimates the result of the model and quantitatively calculates efficiency. Prediction rate and success rates should be assessed as a necessary part of any program (Pourghasemi et al. 2012b). Thus, to evaluate the effectiveness and validity of FSMs, both success rate and prediction rate curves were produced using 70 and 30% of flooding locations, respectively (Youssef et al. 2016). Training data sets of floods were used in the production of flood maps; thus, success rate could not be used to assess the ability of the model (Pourghasemi et al. 2012b; Tehrany et al. 2015b; Youssef et al. 2016). Prediction rate, which is now widely used to measure and evaluate the performance and ability of the models (Lee et al. 2007; Tien Bui et al. 2011), was applied to efficiency of FSMs. Prediction rate indicates how well the model can predict floods in a given area. In this regard, 30% of non-flood locations that have been generated in the histogram section have been used there also. AUC of prediction rate can quantitatively measure the accuracy of model’s prediction (Pradhan and Lee 2010). Figure 6a, b denotes success rate and prediction rate, respectively, for SE, SI, and Wf models. The range of AUC is between 0.5 and 1, where one represents the highest accuracy indicating that the model is able to predict natural hazards without any bias (Pradhan 2010a).

Fig. 6
figure 6

Validation of FSMs in Haraz Watershed: a success rate curve and b prediction rate curve

For the preparation of success rate and prediction rate, susceptibility maps with 70 and 30% of flooding locations were overlaid and, eventually, the cumulative percentage of flood occurrence (in descending order, from highest to lowest) was calculated. The success and prediction rate plots were prepared with the x-axis representing the cumulative percentage of flood occurrence and the y-axis representing the flood probability index.

According to the success rate curve and based on the model validation results, the AUC values for SE, SI, and Wf models were respectively measured 0.9253, 0.9971, and 0.981, which were responsible for 92.53, 99.71, and 98.1% accuracy achievement. According to prediction rate curve and based on the model validation results, AUC values for SE, SI, and Wf models were measured 0.9142, 0.9872, and 0.976, respectively, which were responsible for 91.42, 98.72, and 97.6% accuracy achievement. Yesilnacar (2005) believes that quantitative relationship between the AUC and the accuracy of the model’s prediction can be classified into the following classes: weak (0.5–0.6), moderate (0.6–0.7), good (0.7–0.8), very good (0.8–0.9), and excellent (0.9–1). Overall, bivariate statistical models are very suitable for large size of study areas that have a limited data (Zhang et al. 2016) and are very easy to run in a GIS environment (Oh et al. 2011) but have a disadvantage that equal weights are assumed for different effective factors (Pourghasemi and Kerle 2016). Meanwhile, this problem is solved by Shannon’s entropy approach. With this interpretation, results of all of the models fall in the excellent category and propose for land use planning and flood management department in the Haraz Watershed, Mazandaran Province.

Conclusion

Flood is one of the world’s most hazardous natural events that human societies have faced for decades, while many measures have been done to control and mitigate it. One of the critical steps in this regard is preparation of flood susceptibility maps for future flooding; the most important item for preparation of FSMs is having a reliable and accurate method. The main aims of this study were as follows: (1) floods susceptibility mapping in the Haraz Watershed, Mazandaran Province (due to annual severe flood events) by various methods, namely, FR, SE, SI, and Wf; (2) assessment of models’ efficiency in producing flood susceptibility maps; and (3) determination of the most important factors for the occurrence of floods in the study area.

For preparation of flood susceptibility map, initially, the ten causative factors were determined based on the literature review as slope angle, plan curvature, altitude, TWI, SPI, distance from river, rainfall, geology, land use, and NDVI. Then, 211 flood locations (according to field surveys and documentary sources) were identified, of which 70 and 30% were designated, respectively, for the purpose of modeling and validation. Three methods, including SE, SI, and Wf, were used and the relationships between flood events and each of the ten factors were evaluated. In order to evaluate their performance, the models’ efficiencies were compared with that of the FR model, which is one of the popular bivariate statistical techniques in this field.

The histogram and ROC curve methods were used for assessment of the models’ ability. In the histogram method, the best model for identification of flooding and non-flooding locations was SI method. By using the AUC, success rate and prediction rate were quantitatively measured for assessment and comparison of FSMs achieved by the four models. The largest area under the success rate curve belonged to the SI (99.71%), followed by the Wf (98.1%) and SE (92.53%).

The results showed that SI method particularly better fitted the training data sets and obtained higher prediction rate by 98.72%. Following the SI method, the AUC values for the Wf and SE methods were measured 98.1 and 92.53%, respectively. The results of four models in the ranking of results were excellent. The results of this research showed that the SI and other models could be regarded as efficient and effective methods for FSMs in GIS environment. Another objective of the current study was determination of the most important causative factor in the Haraz Watershed. Based on the results of the Wf, the order of factors was distance from the river, TWI, and slope angle, while it changed for the SE method to distance from the river, land use, and altitude classes. For either method, the least significant factor was geology.

The results of this study, in addition to utilization as a base map for flood hazard and flood risk mapping, can be used as a contingency plan for the governmental departments of water, disaster management, planning, and others to take appropriate measures to prevent and mitigate floods and their consequent damages in the future.