Introduction

Paschim Medinipur district is one of the most vulnerable district as far as floods are concerned. The district has four river basins i.e. Kangsabati, Kaliaghai, Silabati and Subarnarekha. Kangsabati, Kaliaghai, and Silabati catchment are more vulnerability of flood then Subarnarekha. The floods have been distributed starting from the monsoon month of July to post monsoon month of October. Most devastating floods have been occurred in the month of October, September, August and July and their impacts had been so severe that it could be able to cripple the economy of the district. Most devastating floods occurred in the year of 1823, 1834, 1840, 1907, 1914, 1919, 1959, 1978, 1997, 2007 and 2015. The poor and marginalized people like the small and landless farmers, agricultural labourers and daily wage earners are most suffer by these events. The vulnerability analysis and risk assessment is recognised as an important tool to evolve disaster mitigation and preparedness plans in the hazard prone areas. For disaster prevention, mitigation and preparedness, it is important to identify vulnerable areas and factor configuring disaster risk where the impact could reach to severe magnitude for the affected communities. It is expected that the information will help preparedness planning and mitigation and for raising the level of awareness about the disasters in the community.

Objective of the study

The main objective of the study to develop an approach for identifying, categorizing and quantifying elements at risk and their vulnerabilities to floods. Also analysis the people’s vulnerability and risk in the flood plain area due to flood and coping strategies devoted by them.

Location of the study area

Paschim Medinipur district, located in the Southern part of West Bengal, between 22°57ʹ10ʺ North and 21°36ʹ35ʺ North latitude and 88°12ʹ40ʺ E and 86°35ʹ55ʺ East longitude. Geographical area of the district is 9295 sq. km. The district is further divided into 4 sub-divisions, 29 blocks and 8 Municipalities (see Fig. 1).

Fig. 1
figure 1

Geographical location of the study area

Concept of vulnerability and risk

The vulnerability describes the degree to which an area, people, physical structures or economic assets are exposed to loss, injury or damage caused by the impact of the hazard. Risk is viewed as the probability of occurrence or the degree of loss of a specified element expected from a specific hazard. Risk as the overlapping areas of three factors- hazard, exposure and vulnerability- that act simultaneously to generate the risk of natural hazards, which can be expressed as:

$${\text{Risk}}={\text{hazard}} \times {\text{vulnerability}}$$
(1)
$${\text{Risk}}={\text{hazard}} \times {\text{exposure}} \times {\text{vulnerability}}$$
(2)

while hazards are a potential threat to populations and the environment, risk is the interplay between hazard and vulnerability.

Function of AHP

The hierarchical decision model is the design phase of AHP in which the top level exhibits the overall goal of the decision. Indicators can be criteria in the upper level of the model and each of them is further broken down into sub-criteria. Each pair of criteria or sub-criteria element is compared in terms of its relative importance using a 9-points system from 1 (if two indicators equally contribute to the objective) to 9 (when one indicator is strongly favoured over another to meet the objective), (see Table 1) and forms a comparison matrix namely, a pairwise comparison (see Table 2). A score of 1 denotes equal importance, a score of 3 refers to weak preference, while scores 5 and 7 represent obvious and strong preference. The even numbers (i.e., 2, 4, 6 and 8) are used when a compromise is needed between the odd numbers.

Table 1 Semantic scale of the AHP method.
Table 2 Example of a pairwise comparison matrix

The local priority (weight) for a criterion is computed from a pairwise comparison matrix by normalizing the points in the columns (divide a cell value by the sum of a column) and averaging the normalized points in the row of the criterion. The consistency of the comparisons is evaluated by calculating a Consistency Ratio (CR). If the CR is equal to or less than 0.1, the comparisons are considered consistent, otherwise it would be revised. The CR is defined by the following equation:

$${\text{CR }}={\text{ Consistency Index/ Random Index}}$$
(3)

The Random Index (RI) refers to a randomly generated reciprocal matrix from the 9-point scale and it can be obtained by referring to the RI table. Saaty (1980) provides a function of n in relationship (see Table 3). The consistency index (CI) is defined as:

Table 3 Random average consistency indexes for various n
$${\text{CI }}={\text{ }}\left( {{\lambda _{{\text{max}}}}-{\text{ n}}} \right)/{\text{ }}\left( {{\text{n}} - {\text{1}}} \right)$$
(4)

where, λmax is the largest Eigen value derived from the comparison matrix and n is the number of criteria. A WLC is conducted by multiplying indicators by the corresponding weights and aggregating all weighted layers. In this study, the following equation was used:

$${\text{VI }}=\begin{array}{*{20}{c}} i \\ {\mathop \sum \nolimits^{} } \\ {i=1} \end{array}\begin{array}{*{20}{c}} i \\ {\mathop \sum \nolimits^{} } \\ {i=1} \end{array}{W_i}{W_{ij}}x$$
(5)

where, VI is the Vulnerability Index for physical or social indicator, wi and wij are the weights for the ith and jth sub-criterian and x represents the value of an indicator.

Data base and conceptual framework of methodology

Data regarding physical vulnerability assessment were collected from a number of sources. An elevation data collected from SRTM dataset. Land-use and land-cover data obtained from the LISS 3 image and river network vector files/distance to active channel were collected from MBGIS. Geology data were collected from the geology map of Medinipur. To determine the indicators for social vulnerability and coping capacity, a range of variables was constructed from diverse sources. Socio-economic and demographic attributes were collected from the PHED, Govt. Of West Bengal and District Annual Plan, 2015–2016 which represent the population census of 2011.

Analytic hierarchy process (AHP) was first introduced by Thomas L. Saaty in the year of 1980. The AHP is based on multi-criteria that prioritize the criteria identified by different groups of people involved in the decision-making process in order to arrive at the best decision (see Fig. 2).

Fig. 2
figure 2

Methodology flow chart showing the overall methods of the study

Physical vulnerability analysis

Elevation

Elevation is the most important criteria because it has a tremendous effect on flood hazard in the study area, as most of the areas in southern and south-eastern part of the study area are low lying lands. As expected the high hazard category is less than 30 m. (see Fig. 3). Therefore, elevation of less than 30 m. was given the highest weight, while elevation greater than 270 m. was assigned to the lowest weight (see Table 4).

Fig. 3
figure 3

Elevation in each hazard zone, by area

Table 4 Physical vulnerability decision hierarchy model

Land use and land cover

The land use and land cover data comprises of five categories namely, cultivated rural settlement, wetland and water bodies, vegetation and bare soils and built-up area. Since the potential effects of floods high for the cultivated category in terms of agricultural loss due to floods has a considerable impact on the rural people and also the local economy. That is the causes of cultivated category getting relatively highest weight (see Table 4).

Distance to active channel

People and property are in considerable danger up to 1.5 km away from the existing river networks. However, the maximum weight was allocated to the distance less than 0.5 km from the rivers and the lowest weight was given to the category with a distance greater than 1.5 km. (see Fig. 4; Table 4).

Fig. 4
figure 4

Distance categories according to different flood hazard classes, by area

Geology

Geologically, the area can be divided into four categories. Younger intrusive and DalmaVolcanics is a highly elevated area and it is not usually inundated during the monsoon season. Therefore, it was assigned to the lowest weight. Flood plain deposit is areas where flood is a normal phenomena. This was given the highest importance followed by the sand with silts and clays category.

Regarding the importance of the criteria at the upper level (level 2), elevation was considered the most important criteria and given the highest weight. The second most important was given to land use category because consequent land use change is exacerbating river water flooding in the study area. Therefore, two other parameters, distance from the river and geology are given less importance in the second decision level (see Table 5). After deriving the weights for each variable using the AHP, WLC was used. So all variables could be weighted and combined to derive a PVI. Before performing the WLC, the values for elevation and distance to active channels were standardized using the below equation:

Table 5 Pairwise comparison matrix for physical indicators
$$~~\rho {\text{ }}=\frac{{{\text{max}} - x}}{{{\text{max}} - {\text{min}}}}$$
(6)

where, ‘ρ’ is standardized value, min and max represent the minimum and maximum values for each dataset respectively and ‘x’ is the cell value (see Fig. 5).

Fig. 5
figure 5

Physical variables

Social vulnerability analysis

Population density

Population density is regarded as one of the most important indicators in determining social vulnerability. Therefore, the indicator of population density was given higher importance at all levels of the decision hierarchy. Higher importance was allocated for higher density, while lower importance was given to lower density (see Table 6).

Table 6 Decision hierarchy model for socioeconomic and infrastructure vulnerability

Children and elderly population/dependent population

Age group of 0–14 and above 60 are generally dependent population and they are highly vulnerable to natural hazards because of their restricted mobility and difficulty with evacuation during emergencies (Cutter and Emrich 2009). A total number of populations for these groups was extracted and classified into four categories: aged 0–4, 5–9, 10–14 and over 60. Children in the age group of 0–4 in the study area were given maximum weight in the lowest decision hierarchy (See Table 6) and the age group of 5–9, 60 + were given subsequent importance in the pairwise comparison (see Table 7).

Table 7 Pairwise comparison matrix for social vulnerability indicators

Gender

Gender is an important consideration when depicting vulnerability to natural hazards (Cutter et al. 2003; Liverman 1990). Particularly in developing, moreover, most of the females in Paschim Medinipur district devote their lives to household activities from an early age and have fewer opportunities to participate in educational and social activities then males, making the female population highly vulnerable to natural hazards. Therefore, higher importance was given to females in the gender variable (see Figs. 6, 7, 8, 9; Table 6).

Fig. 6
figure 6

Distribution of Demographic Indicators

Fig. 7
figure 7

Distribution of socioeconomic indicators

Fig. 8
figure 8

Distribution of lifeline and infrastructure indicators

Fig. 9
figure 9

Decision hierarchy model for social vulnerability assessment

Poverty

The linkage between hazards, vulnerability and poverty are complex (Few 2003); however, many studies assert that low-income people suffer the most from natural hazards (Dasgupta 2007). Higher importance and subsequent weighting were allocated to the greater proportion of poor people in each community and the lowest weight given to the lowest proportion (see Table 6).

Types of housing

Most of the houses in rural area are katcha type and many houses built on floodplains and in close proximity to water bodies tend to exhibit greater vulnerability. Three types of housing information are available in the study area: katcha, semi-pucca and pucca. These housing types were incorporated into the vulnerability evaluation. As katcha houses are poor in quality and subject to severe damage from floods, they were given higher Weight (see Table 6) than the other two categories in the pairwise comparison matrix (see Table 7).

Illiteracy

The number of illiterate population (males and females) was used with the assumption that illiterate people are dependent on others to prepare and evacuate themselves during a forthcoming disaster. Thus, the highest illiteracy rates of the district were given the highest importance when allocating weight in the decision hierarchy (see Table 6).

Sources of water

Mixing of drain water well water also polluted in the time of flood. So drinking water exceeded standard limits and pure drinking water becomes scarce in the time of floods. There are a number of water sources for drinking purposes namely; taps, tube wells, wells, ponds and rivers are available in the study area. Therefore, other water sources received the highest weighting, as people relying on this source like to be at a greater risk of water-borne disease. Well, tube well and tap water’s weights were allocated with subsequent importance (see Table 6).

Sanitation

The sanitation of Paschim Medinipur district is not encouraging, as thousands of people are at risk of communicable diseases after a flood event. The no sanitation category was given the most weight, followed by the other mean of sanitation (see Table 6).

Road density

Infrastructure facilities such as roads and rail networks are in the category of “lifeline” and play a significant role in evacuation and post event relief and recovery. As these networks are important in determining the lifeline of a community, the density of roads was calculated from the Geo-Eye image with the GP boundary and weighted using the AHP (see Table 6).

Table 7 represents the pairwise comparison matrices for demographics, socioeconomic and infrastructure and lifeline criteria with their corresponding consistency ratios. The census values for the indicators were first converted into ratios from count and subsequently the weights were assigned to the ratio. The normalization was then performed using the following equation:

$$S=\frac{{\hbox{max} - x}}{{\hbox{max} - \hbox{min} }}$$
(7)

where, S is the normalized value, min and max represents the minimum and maximum values of the indicator variable respectively and ‘x’ is the weighted value of the community. The derived social vulnerability index (SVI) indicated that the higher the value, the greater the vulnerability of a particular community.

Coping capacity analysis

The selected indicators for coping capacity analysis are the total literacy rate, per capita hospitals, per capita flood shelters and flood awareness (see Figs. 10, 11), as shown in the decision hierarchy model (see Table 8). For these indicators, higher importance was given to the sub-criteria, with higher values in the pairwise comparison computation (see Table 9) so that areas with better coping capacity could be mapped.

Fig. 10
figure 10

Decision hierarchy model for coping capacity analysis

Fig. 11
figure 11

Distribution of coping capacity indicators

Table 8 Decision hierarchy model for coping capacity criteria
Table 9 Pairwise comparison matrix for coping capacity indicators

Flood awareness

The classic natural hazard paradigm shows that prior experience of hazards plays a crucial role in mitigating the negative consequences of floods and preparing against disaster (Paul 1997). Therefore, the flood frequency variable was derived by overlaying multi-temporal flood maps from 2011, 2013, 2015 and 2016 which are collected from BHUVAN disaster series and on the community database. The highest weight was given to this variable among other coping capacity variables in the second level of the decision hierarchy, as disaster preparedness is an important factor for a community to cope with natural disaster. A pairwise comparison matrix prepared for Coping Capacity Index (CCI) by AHP method.

Literacy rate

Education is an important human wealth that allows people to make appropriate decision and take pertinent mitigation measure to withstand or recover from natural disaster (D’Oyley and Blunt 1994). It also affects people’s lifetime earnings (Cutter et al. 2003). Literature suggests that households with higher levels of education cope better than households with lower level of education (Paul and Routray 2010). To examine this proposition, the total literacy rate was used and weighted for each community to assist in understanding their coping capacity (see Table 8). The highest weight was assigned to communities with the highest literacy rate for both males and females, and lowest weight was assigned to those with lower literacy rate.

Per capita shelters and hospitals

The availability of shelters and health-care facilities in hazard-prone areas is a vital indicator of coping capacity of a particular community. A database containing the locations of educational institutes, community, religious centres and different medical facilities calculate as the per capita shelters and hospitals in the study area. The higher the numbers of shelters and hospitals in each community, the higher the weight in terms of the third level of the decision hierarchy; however, the highest weight was given to the per capita shelter indicator in the second level of the decision hierarchy (see Table 8). The resulting Coping Capacity Index (CCI) indicates that the higher the index, the better the coping capacity of a particular community.

Derivation of Composite Vulnerability Index (CVI)

A composite vulnerability map was produced by combining vulnerability and coping capacity maps using WLC technique. An equal weighting scheme was adopted for this purpose (Cutter et al. 2003). To combine three maps and estimate CVI, first of all, mean PVI score for each community was derived by intersecting it with the community boundary file. Then, the CCI was linearly inversed so that the lower the CCI value, the higher the contribution of a particular community to flood vulnerability. The following equation was used to derive the overall vulnerability or CVI:

$${\text{CVI }}={\text{ }}0.{\text{33}} \times {\text{PVI }}+{\text{ }}0.{\text{33}} \times {\text{SVI }}+{\text{ }}0.{\text{33}} \times {\text{CCI}}$$
(8)

It should be noted that the inversed CCI = max − CCI.

Flood risk assessment

A flood risk map is one of the most important measures for preventing flood loss potential (Yoshino and Yoshikawa 1985). A flood risk map based on fine spatial units (e.g., at the community level) is particularly useful for planners and emergency managers when developing appropriate counter measures. Flood risk is defined in this study as a function of flood hazard and vulnerability (Apel and Aronica 2009). The following equation is used to derive flood risk map in the study area (Fig. 12):

Fig. 12
figure 12

Flood hazard map

$${\text{Flood Risk }}={\text{ hazard}} \times {\text{vulnerability}}$$
(9)

Results and discussion

Physical vulnerability

The PVI map revealed that areas closer to water bodies or near to river with a low elevation (< 5 m.) constitute higher vulnerability to flood (see Fig. 13). Due to their low elevation and close proximity to river networks communities in these areas are highly susceptible to extreme flood loss also (see Table 10). For instance, more than 50% of the population is vulnerable with respect to their locational attribute, as mean distance from active channels. In study area Southern and South-eastern part are in the high to very high vulnerable zones (0.82–1.00) in terms of locational and demographic characteristics. In south-eastern direction Ghatal sub-division are highly affected by recurrent flood due to absolute locational factor and also in terms of densely populated and productive nature of the soil. A small percentage of population (2.02%) is in the very low physical vulnerability zone because these zone is highly altitude (> 90 m). So, those are the locational factor is an important indicator in estimating flood vulnerability in Paschim Medinipur District.

Fig. 13
figure 13

Spatial patterns of physical and social vulnerability and coping capacity

Table 10 Distribution of variables in relation to physical vulnerability

Social vulnerability

The SVI map revealed that some of the Eastern, Southern and South-eastern directions of the study area are in the low (0.24–0.36) to very low (0–0.23) social vulnerability in respect of less poverty, high literacy and lower percentage of katcha houses, implying that people in this zone are able to absorb losses relatively quickly. Very high social vulnerable zone (0.78–1) are distributed along the south and south-eastern margin and also northern and north-western part of the study area. Table 11 explains that the social vulnerability of high and very high vulnerable zones is due to high levels of poverty, a large number of katcha houses and high population density. In contrast, access to safe water and sanitation is very low, suggesting a degree of vulnerability to floods. The resulting map also revealed that few communities in western and middle part of the study area very low social vulnerability (0–0.23).

Table 11 Distribution of variables regarding the SVI

Coping capacity

A coping capacity map showed different results compared to those of vulnerability maps, partly because of the integration of the flood awareness variable into the derivation of the coping capacity of communities. However, a coping capacity indicator can be conceptualized as a form of the adaption capacity of individuals or a community in the study area (Few 2003). Communities that experience floods almost every year exhibit a very high coping capacity (> 0.87) and which communities did not experience floods in a regular way, they exhibit a very low coping capacity (< 0.31).

Composite vulnerability

This study conceived that the coping capacity is part of vulnerability assessment (UNDP 1992) and should be included in the estimation of the overall vulnerability of a particular area. A CVI was therefore mapped out by combining PVI, SVI and CCI (see Fig. 14) and further reclassified according to the severity of vulnerability. The population distribution in each zone indicated that 27 and 15% of people are located in the high and very high zones of vulnerability respectively. The percentage of katcha houses, which are usually home of poor people, is higher in the very high vulnerable zone (39%) (see Table 12).

Fig. 14
figure 14

Spatial pattern of Composite Vulnerability Index

Table 12 Distribution of variables relating to CVI
Fig. 15
figure 15

Flood risk map

Flood risk mapping

Flood risk map was derived by multiplying the mean hazard score and composite vulnerability map. While hazard represent physical process, flood vulnerability indicates susceptibility to damage and the risk of human lives. To understand the human dimensions of flood risk, population and other parameter were intersected with the derived risk map and statistical analysis was conducted. Most of the elevated places have been no risk and low elevated areas are likely to be higher at the risk spectrum (see Fig. 15).

Using population census data from 2011 and the socio-economic distribution, Table 13 summarizes vulnerable subgroups into different flood risk zones. A total of 24.25% of the population lives in high to very high flood risk zones, which is further compounded by the number of females, young (0–14), elderly (> 60) and illiterate people. At least 17.76% of housing units (mud house) are located in high to very high risk zones. As noted earlier, the study is based on 2011 census data and does not take into account potential population growth in the study area, which may underestimate the actual population at risk. However, the flood risk map developed can be used as base line information for the development of adaption measures to mitigate future losses driven by flood hazard.

Table 13 Distribution of variables in different flood risk zones