1 Introduction

Floods rank among the most serious natural disasters in the world. Currently, floods tend to occur more frequently than other disasters (i.e., earthquakes, forest fires, typhoons, heavy snows, and droughts) (Atangana Njock et al. 2020; He et al. 2011). According to recent estimates, the economic losses caused by floods account for 40% of the total losses caused by all natural disasters (Xia et al. 2008). And several studies indicated that frequent flood events had led to an increasing flood risk in the whole world (Hu et al. 2018; Lyu et al. 2019a, b, 2020a; Werren et al. 2016;; Wu et al. 2018). Based on the statistics from the Emergency Disasters Database (EM-DAT, CRED, http://www.emdat.be/), 1,483 floods occurred from 2000 to 2020 in the Belt and Road region, accounting for 44.9% of the total floods around the world. However, most of the countries along the Belt and Road region are developing countries with underdeveloped economies and weak disaster resilience lacking the material reserves and emergency relief capabilities which are needed to respond to disasters (Cui et al. 2020). According to the EM-DAT, the disaster losses in the developing countries along the Belt and Road are more than twice the global average, and the flood-induced mortality is much higher than the global average (Ge et al. 2020). For example, between 2011 and 2013, the Philippines suffered recurrent floods that caused more than 100 fatalities per year, while India and Nepal recorded 6,648 flood fatalities in 2013. Additionally, frequent floods are a major threat to investment and engineering safety and regional development in the Belt and Road region (Ge et al. 2020). More seriously, under future climate change scenarios, the incidence and intensity of flood disasters are likely to increase significantly (Chen et al. 2015;; Fang et al. 2016; Jongman et al. 2012; Ntajal et al. 2017). In this sense, it is of paramount importance and necessity to establish a scientific basis for flood prevention and mitigation in the Belt and Road region.

In the context of the increasing international attention to disaster prevention and mitigation, improving the understanding of vulnerability will contribute to the development of prevention and mitigation measures of floods (Jongman et al. 2015). Theory and previous studies suggest that vulnerability assessment is of great significance to regional and national disaster prevention and mitigation work (Ding et al. 2016; Jongman et al. 2012; Tanoue et al. 2016). Vulnerability studies are currently gaining momentum, and the concept of vulnerability has been widely used in many fields and disciplines, as well as at different spatial levels. Yet, this term varies by discipline and research field (Fraser et al. 2006; Janssen et al. 2006; Liverman and O'Brien 1991; Metzger et al. 2006). For the past decades, the understanding of flood vulnerability has improved due to the use of combined indicators, case studies and analogues, stakeholder-driven processes, and scenario construction methods (Fekete 2012; Malone and Engle 2011). Currently, flood vulnerability is frequently conceptualized as a component that consists of exposure to disturbance or external stress, sensitivity to disturbance, and the capability to adapt or the coping capacity (Adger 2006; Weis et al. 2016; Zhou et al. 2018). In addition, previous studies have shown that flood vulnerability is positively correlated with exposure and negatively correlated with the disaster reduction capability (Rawat et al. 2012). Throughout history, the disaster reduction capability has primarily included susceptibility, resilience, and coping capacity (Bodoque et al. 2016; Ding et al. 2016; Gallopín 2006; Spitalar et al. 2014; Xiong et al. 2019a). Although the concept of the disaster reduction capability varies according to different authors, in this study, it consists of the susceptibility and coping capacity which are both the measure of the physical and social nature of the carrier. Thus, a vulnerability assessment model that consists of the social exposure, susceptibility, and coping capacity was developed.

The four main types of methods that have been used to assess flood vulnerability include (1) the historical data-based approach (Adikari et al. 2010; Shiet et al. 2011), (2) the scenario simulation method (Li et al. 2010; Shi 2013), (3) the vulnerability curve method (Zong 2013), and (4) the index-based system (Ding et al. 2016; Hoque et al. 2019). The historical data-based approach, which responds to the combined effects of the natural and social vulnerability, is highly practical for vulnerability distillation. However, this method is directly influenced by the level of informativeness of the disaster documentation (Lyu et al. 2020b). The scenario simulation method is a quantitative prediction which is based on the scenario model. Scenario-based vulnerability analysis is typically used to predict flood vulnerability in small areas, but the application of this method is limited by the fact that floods usually occur at the regional scale (Sampson et al. 2012). As for the vulnerability curve method, it primarily focuses on assessing the relationship between the intensity of hazards and the degree to which various hazard-bearing individuals are affected (Shi 2013). However, this method requires field statistics and questionnaires; therefore, it is not suitable for replication and implementation on a large scale. Among the four methods, the index-based system is the most widely applied method in flood vulnerability assessment (Zhang 2014). Notably, the index-based system has the advantages of convenient data acquisition, simple modeling and calculations, and the ability to reflect the regional vulnerability situation at a macroscopic level. Thus, the index-based system was applied in this study. To quantify the indexes more precisely, quantitative research has progressed from the investigation of a single method to that of multiple methods. Generally, these methods include multiple-criteria decision making (MCDM) methods (Lyu et al. 2020b) and machine learning methods (Xiong et al. 2019b). Among the MCDM methods, the triangular fuzzy number-based analytic hierarchy process (TFN-AHP) is one of the most widely used and mature approaches (Lyu et al. 2020b). Additionally, compared with other machine learning methods, the support vector machine (SVM) is a supervised machine learning technique based on statistical learning theory and the principle of structural risk minimization. The SVM not only avoids the effects of human factors, but it also expands the amount of information. Thus, in this study, the TFN-AHP and SVM were used as the quantification methods for the index-based system.

However, the knowledge base regarding large-scale flood risk and vulnerability is quite limited (Jongman et al. 2012). Traditionally, flood vulnerability assessments have been limited to national (de Moel and Aerts 2011; Xiong et al. 2019a) and even regional scales (Bouwer et al. 2010; Lyu et al. 2020b). Based on the physical and social aspects, some scholars have conducted temporal and spatial analyses of the vulnerability of flood disasters in the USA (Cutter and Finch 2008; Spitalar et al. 2014). They concluded that indicator-based vulnerability assessments are the key factors in emergency preparedness, disaster mitigation plannings, immediate responses, and long-term disaster recoveries. At the regional scale, many studies have focused on quantitative methods of vulnerability assessments. This differs from the statistical analyses applied at the national scale. For instance, Ding et al. selected 13 index factors to conduct an assessment of the debris flow vulnerability in the upper reaches of the Min River (Ding et al. 2016). In addition, Lyu et al. applied the AHP and the TFN-AHP in an inundation risk assessment of the metro system in Shenzhen (Lyu et al. 2020b). Their results showed that the TFN-AHP had a higher accuracy than AHP. Obviously, previous studies suggest that the indicator-based system is currently the dominant flood vulnerability evaluation method. However, the application of the methodologies and data used in regional studies has been difficult to apply at larger scales (Hu et al. 2018; Jongman et al. 2012). In particular, most countries along the Belt and Road region are so underdeveloped that they lack information sharing and international disaster reduction cooperation mechanisms, resulting in many of their fundamental data being inaccurate (Ge et al. 2020). Thus, research regarding the flood vulnerability in the Belt and Road region requires more in-depth investigations, because this region has experienced increasing climate extremes in recent years and is extremely vulnerable to flood disasters currently (Komolafe et al. 2019; Lugeri et al. 2010; Sam et al. 2017).

Given these problems, in this study, a flood vulnerability assessment based on the need for flood disaster prevention and mitigation throughout the Belt and Road region was conducted. The primary objectives of this study are as follows: (1) establish an index system for evaluating the vulnerability of flood disasters throughout the Belt and Road region, including exposure and disaster reduction capability; (2) build a new vulnerability assessment model for large-scale studies based on the TFN-AHP and the SVM to obtain the results of flood vulnerability assessment throughout the Belt and Road region; (3) map and analyze the spatial characteristics of the flood exposure, disaster reduction capability, and vulnerability based on the assessment results.

2 Materials

2.1 Study area

The principle of openness and generosity of the Belt and Road Initiative has led to the absence of precise boundary in the region. Targeting urbanization, we considered 65 developing countries (Liu et al. 2019; Zhou et al. 2020), including China, as the study area (Fig. 1). The countries covered almost all Asian countries except Japan, North Korea and South Korea. In Africa, we only selected Egypt. As for Europe, these countries to the east of Finland, Germany and Austria are all included. Generally, the Belt and Road region (approximately 50,640,400 km2) accounts for 38.5% land area, 30% gross domestic product (GDP), and 62.3% population of this world, respectively (Hafeez et al. 2019). As shown in Fig. 1, this region has an undulating terrain. The highest altitude is greater than 8500 m, while the lowest altitude is less than 0 m. In addition, this region is dominated by eight types of climate, including both monsoons and continental climate characteristics, and the regional distribution of water resources is uneven (Zhou et al. 2020). In terms of precipitation, the average annual total precipitation from 2000 to 2018 increased from 0.92 mm in the northwest to 6,067.71 mm in the southeast in this area. Due to the diverse natural environments, the types of terrains, and the uneven distribution of precipitation, natural disasters, especially droughts and floods, occur frequently in this area. According to the incomplete statistics from the Emergency Disasters Database (EM-DAT, CRED, http://www.emdat.be/), 1561 floods have occurred throughout the Belt and Road Region since the beginning of the twenty-first century, while the number of floods from 1975 to 2000 was only 700 (Hu et al. 2018).

Fig. 1
figure 1

Location of the Belt and Road region

2.2 Datasets and sources

In this study, data related to the flood vulnerability were classified into four aspects: (1) the data related to society and the economy, including population data, land use data, and GDP data. These data were obtained from the World Pop (https://www.worldpop.org/), the National Aeronautics and Space Administration, NASA (https://ladsweb.modaps.eosdis.nasa.gov/), and the Dryad (https://datadryad.org/stash/dataset/doi:10.5061/dryad.dk1j0). Notably, the GDP data on the Dryad came from the Gridded global datasets for Gross Domestic Product and Human Development Index over 1990–2015 (Kummu et al. 2018). (2) The data connected with infrastructure, including data from hospitals, shelters, and road density data, were downloaded from OpenStreetMap (https://www.openstreetmap.org/). (3) The digital elevation model (DEM) data were obtained from the Shuttle Radar Topography Mission (SRTM) data (http://srtm.csi.cgiar.org/srtmdata/). (4) The impervious surface data were collected from the Global Human Settlement Layer, GHSL (https://ghslsys.jrc.ec.europa.eu/). In summary, Table 1 presents the characteristics of the datasets used in this study.

Table 1 Sources and resolutions of the datasets

2.3 Establishing the assessment index system

2.3.1 Assessment unit

Generally, the assessment results are directly affected by the size and boundary of the assessment units, which are the fundamental elements of vulnerability assessment (Cascini 2008; Li et al. 2012). The assessment units that previous studies have commonly used include regional units, grid units, slope units, topographic units, and uniform condition units (Liu et al. 2017b; Zhou et al. 2014). In this study, the exposure assessment indexes (population density, economic density, building density, and farmland density) were expressed completely as raster maps. In addition, the assessment indexes of the disaster reduction capability (hospital density, shelter density and road density) were expressed as vector maps. The other indexes (impervious surface, the dependent population, female population, and GDP per capita) were expressed in raster format. In order to make a more detailed analysis of all of these indexes, the total study area was divided into 646,191 grid cells with a spatial resolution of 0.1° × 0.1° (12 km*12 km approximately).

2.3.2 Index system

The establishment of an assessment index system is a key step in building a flood vulnerability assessment. This index system was established after summarizing the results of previous studies which have detailed descriptions of the vulnerability (Ding et al. 2016; Weis et al. 2016; Xiong et al. 2019a). In the numerous previous studies reviewed, vulnerability was most commonly conceptualized as a component consisting of exposure to a disturbance or external stress, sensitivity to a disturbance, and the capability to adapt (Adger 2006; Gallopín 2006). In this study, the disaster reduction capability was regarded as a combination of the susceptibility and coping capacity. Therefore, the flood vulnerability was divided into two portions: the exposure and disaster reduction capability. As shown in Fig. 2, eleven assessment indexes were selected. The exposure indexes consist of population density, economic density, building density, and farmland density. The disaster reduction capability indexes include impervious surface, dependent population, female population, hospital density, shelter density, road density, and GDP per capita.

Fig. 2
figure 2

Flood vulnerability assessment index system for the Belt and Road region

3 Methodology

3.1 Exposure assessment

3.1.1 Calculation of the exposure indexes

According to previous studies, in areas with higher exposure, the higher the vulnerability, and the greater the losses a flood will cause. Recently, flood vulnerability was divided into two parts: physical vulnerability and social vulnerability (Erena and Worku 2019; Hoque et al. 2019); however, some studies have only analyzed one level, the physical or social level (Rani et al. 2015; Rimba et al. 2017; Susan and Cutter 2003; Zhang and You 2014). In this study, the primary focus was the social level. To some extent, the comprehensive impact of hazards and disasters on the environment was regarded as a threat or danger, while exposed units, such as real estate, farmland, and a human presence, were regarded as characteristics of the regional socioeconomic system (Huang et al. 2011). Thus, in this study, the population density, economic density, building density, and farmland density were used to calculate the social exposure. In addition, because the exposure factors were measured at different scales, certain reclassifications were necessary to convert the exposure factors into five comparable units or exposure classes: 5 (extremely high exposure), 4 (high exposure), 3 (moderate exposure), 2 (low exposure), and 1 (extremely low exposure) (Mahmoud and Gan 2018). The specific indicators of exposure were defined as follows:

  1. (1)

    Population density

    Population is directly related to flood disasters, and the consequences of flood disasters are also larger in densely populated areas. In this study, the population density represents the population per assessment unit (D = Pi/Si), where D is the population density; Pi is the population of region i; and Si represents the area of region i.

  2. (2)

    Economic density

    In this study, economic density refers to the GDP per assessment unit (D = Gi/Si), where \(D\) is the economic density; Gi is the GDP of region i; and Si represents the area of region i. Proverbially, the GDP is closely connected to flood disasters. In most cases, regions with higher GDP have stronger disaster adaptation capacity, but the consequences of a flood disaster are also likely to be greater in these regions.

  3. (3)

    Building density

    Building density refers to the building area per assessment unit (D = Bi/Si), where D is the building density; Bi represents the area of buildings in region i; and Si is the area of region i. Here, the building density was characteristic of using the ratio of the building area to the Belt and Road region, while neglecting the different heights of the buildings.

  4. (4)

    Farmland density

    The farmland density refers to the farmland area per assessment unit (D = Fi/Si), where D is the farmland density, Fi represents the area of farmland in region i; and Si is the area of region i. In this study, the building density and farmland density factors were both extracted from the land use data, and they indicate the importance of local construction and agricultural activities, respectively.

3.1.2 Triangular fuzzy number-based AHP

In this study, the triangular fuzzy number-based analytic hierarchy process (TFN-AHP) was used to calculate the exposure. In the application of the traditional AHP, since the pair comparison is based on the opinions of the decision makers, the value that represents the relative importance is subjective. The bias that is caused by this subjective importance leads to the uncertainty in the evaluation weight. In comparison, the TFN-AHP is based on the application of a triangular fuzzy number in the traditional AHP, which can reduce the impact of this uncertainty (Lyu et al. 2019a). Instead of the definite number the AHP uses, in the TFN-AHP, a triangular fuzzy number is used to reveal the degrees of importance of the evaluation factors. More detailed, the median value of the triangular fuzzy number denotes the largest probability of the evaluation model, while the minimum and maximum values represent the ambiguity corresponding to the largest probability (Cheng et al. 2020; Lyu et al. 2019a, 2020a; Yang et al. 2013). Generally, the TFN-AHP can assess vulnerability with the largest possibility to improve the accuracy of the results. This is the main reason that the TFN-AHP is selected to calculate exposure in this study. In the application of the TFN-AHP, the triangular fuzzy number is defined as P, and it can be calculated as follows:

$$ P = (l,m,\mu ) \, (l \le m \le \mu ) $$
(1)

where P is the triangular fuzzy number; m is the most likely value; and l and μ are the minimal and maximum values, respectively. From this we can see that the most probable value for m is jointly determined by the parameters l and μ. The meaning of the triangular fuzzy number and the detailed methods of the parameter calculation have been discussed in previous studies (Lyu et al. 2019a; Zhang and You 2014). In this study, the TFN-AHP was used to confirm the relative significance of each pair of factors by using a 9-point system from 1 (equal importance) to 9 (extreme importance) (Table 2).

Table 2 Linguistic variables and the corresponding triangular fuzzy numbers

In the application of the TFN-AHP, the triangular fuzzy numbers were used in the judgment matrix. Notably, it is of vital importance to construct the triangular fuzzy judgment matrix used in the TFN-AHP. Equation (2) shows a fuzzy judgment matrix of the triangular fuzzy numbers. According to the normalized indexes of the exposure assessment, the corresponding weights of these indexes were calculated using the TFN-AHP, as shown in Table 3.

$$ \left( \begin{gathered} \, \left[ {1, \, 1, \, 1} \right] \, \left[ {l_{12} ,m_{12} ,\mu_{12} } \right] \, \cdots \, \left[ {l_{1i} ,m_{1i} ,\mu_{1i} } \right] \, \cdots \, \left[ {l_{1n} ,m_{1n} ,\mu_{1n} } \right] \hfill \\ \left[ {\frac{1}{{\mu_{12} }},\frac{1}{{m_{12} }},\frac{1}{{l_{12} }}} \right] \, \left[ {1, \, 1, \, 1} \right] \, \cdots \, \left[ {l_{2i} ,m_{2i} ,\mu_{2i} } \right] \, \cdots \, \left[ {l_{2n} ,m_{2n} ,\mu_{2n} } \right] \hfill \\ \, \cdots \, \cdots \, \ddots \, \cdots \, \cdots \, \cdots \hfill \\ \left[ {\frac{1}{{\mu_{1i} }},\frac{1}{{m_{1i} }},\frac{1}{{l_{1i} }}} \right] \, \left[ {\frac{1}{{\mu_{2i} }},\frac{1}{{m_{2i} }},\frac{1}{{l_{2i} }}} \right] \, \cdots \, \left[ {1, \, 1, \, 1} \right] \, \cdots \, \left[ {l_{in} ,m_{in} ,\mu_{in} } \right] \hfill \\ \, \cdots \, \cdots \, \ddots \, \cdots \, \cdots \, \cdots \hfill \\ \left[ {\frac{1}{{\mu_{1n} }},\frac{1}{{m_{1n} }},\frac{1}{{l_{1n} }}} \right] \, \left[ {\frac{1}{{\mu_{2n} }},\frac{1}{{m_{2n} }},\frac{1}{{l_{2n} }}} \right] \, \cdots \, \left[ {\frac{1}{{\mu_{in} }},\frac{1}{{m_{in} }},\frac{1}{{l_{in} }}} \right] \, \cdots \, \left[ {1, \, 1, \, 1} \right] \hfill \\ \end{gathered} \right) $$
(2)
Table 3 Judgment matrix and normalized weights of the exposure indexes

3.2 Disaster reduction capability assessment

3.2.1 Calculation of the disaster reduction capability indexes

Previous studies suggest that the higher the disaster reduction capability is, the lower the losses caused by flood disasters will be. In a general sense, susceptibility is the degree or the depth to which the disaster-bearing body is affected by the disaster of the same intensity. Numerous studies have suggested that not all individuals and groups exposed to a hazard are equally susceptible (Hoque et al. 2019; Huang et al. 2011; Liu and Wang 2013). Therefore, data of the impervious surface, dependent populations, and female populations were collected as susceptibility factors. In this study, the coping capability primarily relies on the capability of people, organizations and systems to manage the effects of disasters using the available skills and resources (Rana and Routray 2018). At this level, four coping capability criteria, namely the hospital density, the shelter density, the road density and the GDP per capita, were selected here. The specific disaster reduction capability indicators of flood disasters were selected as follows:

  1. 1

    Susceptibility

  2. (1)

    Impervious surface

Urbanization has an adverse impact on the environment, especially when the urban growth rate and the carrying capacity do not co-evolve, including the management capacity, natural resources, and basic service provisions (Lee et al. 2015). One of the effects of urbanization is an increase in impervious surface, which makes peak discharges intense and rapid.

  1. (2)

    Dependent population

Children (0–15 years old) and the elderly (people over 65) were considered to be dependent people, who often need support from other groups when floods are occurring (Li et al. 2012; Sharma et al. 2018). To some extent, children and the elderly often have less access to information and more difficulty in taking personal actions to prepare for floods. In this study, the index of the dependent population was produced using population data.

  1. (3)

    Female population

Females are affected by floods in numerous ways due to their limited mobility and difficulty in evacuation during emergency cases (Eric and Thomas 2007; Hoque et al. 2019). Generally speaking, the female population is more vulnerable than the male population.

  1. 2

    Coping capacity

  2. (1)

    Hospital density

The hospitals in this study refer to the large hospitals in modern cities, not those in small regions. Proverbially, the higher the hospital density in a city, the higher the capacity of the people coping with floods and recovering from flood disasters. Moreover, the hospital density refers to the number of hospitals per assessment unit (D = Hi/Si), where D is the hospital density; Hi is the number of hospitals in region i; and Si represents the area of region i.

  1. (2)

    Shelter density

In this study, shelters were defined as the emergency shelters where people can take refuge from flood disasters. The shelter density refers to the number of shelters in an assessment unit (D = Shi/Si), where D is the shelter density; Shi is the number of shelters in a region i; and Si denotes the area of region i.

  1. (3)

    Road density

Here, roads primarily refer to freeways, provincial highways, and national highways. It is assumed that the higher the road density is, the higher the capacity of the people coping with floods will be. Road density refers to the total length of all types of roads in per assessment unit (D = Ri/Si), where D is the road density; Ri is the total length of all roads in region i; and Si represents the area of region i.

  1. (4)

    GDP per capita

The GDP per capita was used to evaluate the economic development level of a grid unit. Throughout history, the GDP per capita is an appropriate indicator of the status, rate, and level of economic development. Generally speaking, the higher the GDP per capita is, the stronger the coping capacity of a region. In this study, the GDP per capita data were calculated by dividing the GDP data by the population data.

3.2.2 Support vector machine

In this study, initially the weights in the indexes of the disaster reduction capability were not clear. In addition, though the TFN-AHP using the fuzzy theory can reduce the subjective influence of an artificial weight determination, it is actually a method of manual weighting which cannot perform completely objectively in the vulnerability assessment. Since TFN-AHP does not completely avoid human intervention, thus, the support vector machine (SVM) was used to calculate the disaster reduction capability. The SVM is a new method developed in recent years that is based on a nonlinear transformation, and it is widely used in flood risk assessment (Xian 2010). Compared with previous models, the SVM method not only avoids the effects of human factors, but it also expands the amount of information (Xiong et al. 2019a). Notably, the SVM is also a supervised learning binary classifier that is based on the structural risk minimization principle (Wan and Lei 2009; Yao et al. 2008), and it can explore the hidden relationships between inputs and outputs (Xu et al. 2012). The mechanism of a flood vulnerability assessment is complex due to the effects of incomplete information and the numerous uncertainties. To reduce the impact of these uncertainties, the SVM model was used to calculate the internal rules based on many complex fuzzy input and output variables. The primary steps of the algorithm are as follows:

  1. (1)

    Assume a training set of a known sample set is T = {x1, x2, …, xn, y}, where xi is the ith input data (xi ∈ Rn), and \(y\) is the output data, i = 1, 2, …, n.

  2. (2)

    Then, these data are classified into two categories, and n-dimensional hyperplanes are used to obtain the maximum interval. This is demonstrated in Eqs. (3) and (4).

    $$ \frac{1}{2}\left\| {\left. w \right\|} \right.^{2} $$
    (3)
    $$ {\text{Subject to }}y_{i} ((w \cdot x_{i} ) + b) \ge 1 $$
    (4)

    where ||w|| represents the norm of the hyperplane normal; b denotes a scale base; and (·) is the scalar product operation.

  3. (3)

    Using the Lagrange multiplier, the definition of the cost function is as follows:

    $$ L = \frac{1}{2}\left\| {\left. w \right\|} \right.^{2} - \mathop \sum \limits_{i = 1}^{n} \lambda_{i} (y_{i} ((w \cdot x_{i} ) + b) - 1) $$
    (5)

    where λi is the Lagrange multiplier. The solution can be obtained by using the dual minimization of Eq. (6) with w and b (Cherkassky 1997).

  4. (4)

    For inseparable situations, the constraint conditions can be modified by introducing slack variables, ξi (Cherkassky 1997):

    $$ y_{i} ((w \cdot x_{i} ) + b) \ge 1 - \xi_{i} $$
    (6)

Therefore, Eq. (7) becomes:

$$ L = \frac{1}{2}\left\| {\left. w \right\|} \right.^{2} - \frac{1}{vn}\mathop \sum \limits_{i = 1}^{n} \xi_{i} $$
(7)

where v ∈ (0,1], which is introduced to explain the misclassification (Franklin 2005; Xu et al. 2012).

In this study, a kernel function, K (xi, xj), was used to make the nonlinear decision boundary clear (Cherkassky 1997). In recent studies, the linear kernel (LN), the polynomial kernel (PL), the sigmoid kernel (SIG), and the radial basis function kernel (RBF) are the kernel types most used for SVM analysis. The RBF was used in this study:

$$ K(x_{i} ,x_{j} ) = e^{{ - \gamma (x_{i} - x_{j} )^{2} }} ,\gamma > 0 $$
(8)

where γ is a parameter of the kernel function.

3.3 Vulnerability assessment

Previous studies have verified that flood vulnerability is positively correlated with the degree of exposure and negatively correlated with the disaster reduction capability (Ding et al. 2016; Rawat et al. 2012). Thus, the vulnerability was calculated by using various data types as proxies for the exposure and disaster reduction capability. Based on previous studies (Ding et al. 2016; Liu and Lei 2003; Xiong et al. 2019a), the vulnerability model was applied using Eq. (9). In this study, using the natural break point method, the results of the exposure, the disaster reduction capability, and the vulnerability were divided into five levels: very low, low, moderate, high, and extremely high (Hoque et al. 2019; Tehrany et al. (2013); (2014).

$$ V = E(1 - \sqrt {\text{Re}} ) $$
(9)

where V is the vulnerability; E is the exposure; and Re is the disaster reduction capability.

3.4 Assessment procedure

As indicated in Fig. 3, the flowchart consists of three major phases. Firstly, the data that were collected from several websites were preprocessed. The primary step in this phase was normalizing the data and inputting these data into the grids. Subsequently, in the second phase, the data were divided into two aspects: the exposure index and the disaster reduction capability index. After that, the weights of the exposure factors were calculated using the TFN-AHP based on ArcGIS 10.6. As for the calculation of the disaster reduction capability, the factors were input into the SVM model using the R software. In the third phase, the results of the flood vulnerability were mapped and the spatial analysis was conducted.

Fig. 3
figure 3

Flowchart of the flood vulnerability assessment throughout the Belt and Road region

4 Results

4.1 Exposure assessment

In this study, in order to intuitively quantify the results, the exposure was calculated and generated based on ArcGIS 10.6. Then, the results of the exposure were placed into 646,191 grid cells in the ArcGIS 10.6. As illustrated in Table 4, the statistical results of each level of the quantity, area, and proportion in the exposure assessment can be seen. It was discovered that most regions (43,781,500 km2) have extremely low exposure, accounting for approximately 86% of the total area. The area of the low exposure regions is 3,446,200 km2, which is larger than the entire area of India. Furthermore, 25,977 grid cells (2,035,700 km2) have moderate exposure, 13,053 grid cells (1,021,100 km2) have high exposure, and 4523 grid cells (355,900 km2) have extremely high exposure. The three levels of exposure only account for approximately 0.70% of the Belt and Road region.

Table 4 Exposure of floods for the grid cells throughout the Belt and Road region

As shown in Fig. 4, a map of the flood exposure throughout the Belt and Road region was produced. It can be seen that the exposure in most areas is low and extremely low. However, in this study, our attention was focused more on those areas with higher exposure, which are likely to suffer more economic and population losses when floods occur. From the perspective of continental scale, the higher exposure areas are primarily located in the eastern and southern parts of Asia, including eastern China, northern India, the Indus Valley in Pakistan, the central region of Indonesia, and most of Bangladesh (Fig. 4). Locally, the areas with extremely high exposure are primarily distributed in the capitals and major cities of the countries in the Belt and Road region, such as Beijing, Shanghai, Chengdu, New Delhi, Bangkok, Jakarta, Moscow, Abu, Riyadh, Kyiv, Baghdad, and Cairo. Obviously, these areas with high exposure are densely populated and economically developed. This is consistent with the larger proportions of the population density and economic density in the exposure indicators.

Fig. 4
figure 4

Spatial distribution of the exposure throughout the Belt and Road region

4.2 Disaster reduction capability assessment

The statistical results of each level of the quantity, area, and proportion of the disaster reduction capability are presented in Table 5. It was found that the disaster reduction capability in most regions is extremely low (407,853,000 km2) and low (7,869,200 km2), accounting for 80.53% and 15.54% of the total area, respectively. The areas with moderate disaster reduction capability merely only account for 3.26%. Notably, the areas with high and extremely high disaster reduction capacity both account for less than 1% of the total area. There are only 3,683 grid cells (287,200 km2) and 648 grid cells (49,300 km2) having high and extremely high disaster reduction capability, accounting for 0.57% and 0.10%, respectively.

Table 5 Disaster reduction capability of floods for the grid cells throughout the Belt and Road region

As for the spatial distribution patterns, Fig. 5 shows that more than half of the areas have low disaster reduction capability, which is consistent with the above statistical results. The areas with moderate disaster reduction capability are primarily located in the western and eastern parts of the Belt and Road region. Among these areas, only a portion are distributed in eastern China, southern Nepal, the central region of Indonesia and the Cairo region of Egypt, while the vast majority are concentrated in most of the regions of Europe. Notably, few areas have high disaster reduction capability. As shown in Fig. 5, these areas are mainly concentrated in the major cities in the Belt and Road region, such as Beijing, Shanghai, Hong Kong, Taipei, Bangkok, Jakarta, New Delhi, Riyadh, Abu Dhabi, Cairo, Moscow, and several major cities in Europe.

Fig. 5
figure 5

Spatial distribution of the disaster reduction capability throughout the Belt and Road region

4.3 Vulnerability assessment

According to the assessment results of the exposure and disaster reduction capability, the vulnerability of each grid cell was calculated based on Eq. (9). The statistical results of each grade of the quantity, area, and proportion of the vulnerability are presented in Table 6. As can be seen, vulnerability of most regions (43,817,500 km2) is extremely low, accounting for 86.53% of the total area. Furthermore, 41,938 grid cells (3,287,800 km2) have low vulnerability, and 22,229 grid cells (1,744,200 km2) have moderate vulnerability. In the areas with high and extremely high vulnerability, the disaster distribution density is 1,236,100 km2 and 554,800 km2, which account for 2.36 and 1.19% of the total area, respectively.

Table 6 Vulnerability of floods for the grid cells throughout the Belt and Road region

According to Eq. (9), the vulnerability to floods is positively related to the exposure and negatively related to the disaster reduction capability. Similar to the distribution of the exposure (Fig. 4), most areas have low and extremely low vulnerability, occupying most of the proportion of the Belt and Road region (Fig. 6). In relative terms, the areas with high and extremely high vulnerability are primarily distributed in the southern and eastern parts of China, northern India, most of Bangladesh, the Indus Valley in Pakistan, the Nile River Basin in Egypt, the central region of Indonesia, and some major cities in the Belt and Road region. These highly vulnerable areas coincide with the areas characterized by frequent economic activities and dense populations, such as eastern China and northern India, in particular. In contrast to the positive correlation between the exposure and vulnerability, the disaster reduction capability decreases the vulnerability to floods. Thus, those areas with higher disaster reduction capability have lower levels of vulnerability, such as Bangkok of Thailand, Kuala Lumpur of Malaysia, Riyadh in the Kingdom of Saudi Arabia, the Kyiv of Ukraine, and Moscow in Russia.

Fig. 6
figure 6

Spatial distribution of the vulnerability throughout the Belt and Road region

5 Discussion

5.1 Assessment methodology

In this study, a multi-dimensional analysis of the indexes that were used to calculate the flood vulnerability throughout the Belt and Road region was proposed. In the past decades, many studies have emphasized the importance of quantifying the exposure and disaster reduction capability (Kim and Choi 2011; Li et al. 2010; Xiong et al. 2019a). Thus, the model was extended to a more comprehensive assessment of the flood vulnerability in the Belt and Road region, which included social exposure, susceptibility, and coping capacity. Based on the index-based system, we conducted a flood vulnerability assessment of the Belt and Road region. In addition, we proposed a vulnerability assessment model based on TFN-AHP and SVM model. To our knowledge, there are currently few studies combining TFN-AHP with SVM model for the flood vulnerability assessment. Compared with previous models, the TFN-based weight is a triangular fuzzy number, not a definite number, which can reduce the impact of these subjectivities on the vulnerability assessment (Lyu et al. 2020b). In addition to the characteristics of the general machine learning methods, the SVM model is a supervised machine learning technique based on statistical learning theory and the principle of structural risk minimization (Bui et al. 2012; Tehrany et al. 2014,2015; Yao et al. 2008).

5.2 The spatial patterns of the exposure, the disaster reduction capability, and the vulnerability to floods.

To our knowledge, there has been no specific attempt to map flood vulnerability by simultaneously considering exposure and disaster reduction capability in the Belt and Road region with high spatial resolution. Our results enhance insight into flood vulnerability drivers, levels of vulnerability to geospatial change, and possible economic and demographic losses. The findings emphasize that countries and regions with dense populations and high economic levels should pay more attention to their disaster prevention and mitigation capabilities, as they tend to have higher vulnerability. Currently, many of the large-scale flood vulnerability studies analyze only at national scale and do not consider refined indicators (Jongman et al. 2015; Tanoue et al. 2016). The findings of this paper provide a reference for a deeper understanding of the vulnerability distribution of the Belt and Road region and the methodology of large-scale vulnerability studies. However, due to the large area of this region, the maximum and minimum values of the data vary greatly, leading to the result that low and extremely low vulnerability occupy most of the area. Future researches could consider these aspects and explore new quantitative and grading methods for large-scale flood vulnerability assessment.

In this study, the results indicate that the vulnerability is positively related to exposure and negatively correlated with the disaster reduction capability (Ding et al. 2016; Xiong et al. 2019a). In fact, there is no vulnerability when the exposure is equal to 0 because “a hazard is not hazardous unless it threatens something; and vulnerability does not exist unless some elements at risk are threatened by something” (Ding et al. 2016). During the assessment procedure, the weights of the assessment indexes have an important influence on the assessment result. Notably, the population density and economic density are key indexes of exposure, while the other indexes are less important (Table 3) (Ding et al. 2016). As shown in Fig. 4, the spatial pattern of the exposure shows that areas with higher population density and economic density have the higher exposure to flood disasters (Ding et al. 2016; Hoque et al. 2019; Tanoue et al. 2016). For instance, in the eastern parts of China, northern India, and the central region of Indonesia, the exposure levels are relatively higher due to the impact of their dense populations and high economic densities (An et al. 2020). In contrast to other areas, numerous capitals and major cities of these countries along the Belt and Road region have higher exposure (i.e., Beijing, Shanghai, Guangzhou, Chengdu, Taipei, Hanoi, Manila, Bangkok, Bandung, New Delhi, Kolkata, Dhaka, Moscow, Cairo, and Abu Dhabi (Fig. 4)). Because capitals are often the political, economic, and cultural centers of their countries. In addition, the exposure of most areas of the Belt and Road region is low and extremely low (Fig. 4), accounting for approximately 93% (Table 4). The reason for this phenomenon is that the maximum and minimum values of these indicators vary too much, although all of the exposure indicators were normalized and reclassified.

With respect to the disaster reduction capability, the findings indicate that the disaster reduction capability of most areas of the Belt and Road region is low, which corresponds to the results of Ge (2020) and Cui (2020), who demonstrated the weak disaster prevention and mitigation capabilities throughout the Belt and Road region. As shown in Fig. 5, most areas of the Belt and Road region have extremely low (40,785,300 km2) and low (7,869,200 km2) disaster reduction capability, accounting for 80.53% and 15.54% of the total area, respectively (Table 5). Notably, most of the countries in this region have underdeveloped economies, low levels of education, and high population densities, leading to the weak disaster prevention and resilience capabilities (Cui et al. 2020). In contrast, the more developed cities and regions have higher disaster reduction capability (i.e., Beijing, Shanghai, Chengdu, Guangzhou, Taipei, Hanoi, Bangkok, Kuala Lumpur, Dhaka, New Delhi, Moscow and Abu Dhabi). Regionally, the areas with higher disaster reduction capability are primarily distributed in the European countries, while a small amount is located in eastern China, the central region of Indonesia, the Cairo region of Egypt, and several cities in western Russia (Fig. 5). The people in these areas can respond quickly to flood disasters and have the strong resistance to disaster losses (Jongman et al. 2012; Okazawa et al. 2011). In addition, the spatial patterns of the disaster reduction capability reveal that the areas with high hospital densities, high shelter densities, high road densities and high GDP per capita have a lower incidence of flood disasters (Hoque et al. 2019), especially in the European countries (Fig. 5).

The results presented here support some of the important findings of previous studies. Similar to the observations reported in several studies (Fang et al. 2015; Liu et al. 2017a; Liu et al. 2018), the resulting vulnerability throughout the Belt and Road region (Fig. 6) indicates that the distribution characteristics of the vulnerability are similar to that of the population density and GDP. In general, the spatial distribution of the vulnerability (Fig. 6) is similar to that of the exposure (Fig. 4), which is in agreement with Eq. (9). Compared to earlier models which calculate vulnerability through the direct addition of the exposure, susceptibility, and coping capacity, the information used in the approach proposed in this study is expanded (Ding et al. 2016). It can be seen from Fig. 6 that the vulnerability of most regions (43,817,500 km2) is extremely low, accounting for 86.53% of the total area. Regionally, those areas with extremely high vulnerability are primarily distributed in eastern China, northern India, the Indus Valley in Pakistan, and most of Bangladesh, which are densely populated and economically concentrated. When these areas suffer from floods, the flood-induced mortality and economic losses in these areas will be more severe than in other areas. Additionally, the level of vulnerability in many major cities (Fig. 6) is lower than that of the exposure (Fig. 4) due to their higher disaster reduction capabilities (Xiong et al. 2019a). As shown in Fig. 6, the typical regions and major cities are Bangkok, Kuala Lumpur, Bangalore, Cairo, Riyadh, Moscow, and Kyiv. Here, the response to floods mainly depends on the various flood defense structures. However, there will be many difficulties in mitigating flood vulnerability if people only rely on flood defense structures. Methods, such as “living with rivers” and “making space for water,” not only provide space for storing flood water, but also protect natural habitats (Johnson et al. 2007).

5.3 Importance and limitations

The United Nations Disaster Reduction Agency (UNISDR, 2011) recently stated that the exposure of economies affected by floods is increasing in all regions of the world, while the risk of death in developed countries is declining due to the increased income and improved governance (i.e., the emergency planning and preparations) (Jongman et al. 2012). However, most of the countries in the Belt and Road region are developing countries with limited levels of economic, social, technological and educational development, leading to their disaster prevention capabilities and mitigation foundations and abilities are weak relatively (Ge et al. 2020). In addition, these countries are focusing more on building their economies, while ignoring the constructions of disaster prevention and mitigation systems (Gray and Mueller 2012; Jongman et al. 2012; Tanoue et al. 2016). In the cities and regions that are more susceptible to disasters, regardless of whether the risks are to human lives or the economies, the preventions and mitigations of floods should be emphasized. Subsequently, flood awareness should be increased to deal with flood risks in a sustainable way (Johnson et al. 2007). Currently, more reliable vulnerability studies need to be conducted for the entire Belt and Road region, and this study contributes to this need.

Despite progress being made via this study, there are limitations to this study. Initially, a larger resolution was considered for use (i.e., 1 × 1 km or 0.5 × 0.5 km) (Ding et al. 2016), but later it was found that there were millions of grids corresponding to these resolutions, which was a huge test for the experimental equipment used. Regrettably, we only conducted the spatial analysis of the vulnerability, not the temporal analysis. In this study, due to the large area of the Belt and Road region and the availability of data, some common indicators were not included (i.e., the literacy rate, cultural level, existing medical conditions, and unemployment rates) (Ding et al. 2016; Hoque et al. 2019; Lee et al. 2015). Additionally, the GDP per capita data used were calculated by GDP and population data and may decrease the accuracy of the results. In fact, there is still a lack of a consistent set of metrics to measure flood vulnerability, because the understandings of vulnerability are different among scholars (Janssen et al. 2006; Liverman and O'Brien 1991). However, the index system is a vital step in flood vulnerability assessment (Ding et al. 2016; Lyu et al. 2020b); thus, future studies should consider the influences of these factors, and this aspect should prove to be an interesting topic.

6 Conclusions

We proposed a quantitative model of indexes based on the TFN-AHP and the SVM model. A map of the Belt and Road region of flood vulnerability profile was generated. Firstly, we prepared a database including 11 flood influential factors that were classified into five levels. Then, the exposure and disaster reduction capability were quantified based on the TFN-AHP and SVM model, respectively. Finally, the spatial distribution of the exposure, the disaster reduction capability, and the vulnerability in the Belt and Road region were generated and analyzed. Although this study has limitations in terms of the temporal scale and spatial resolution, the results for the high-vulnerability regions will hopefully encourage local government officials to pay attention to “hot spots” where more accurate analyses need to be conducted. Through the analysis of exposure, disaster reduction capability and vulnerability results, the following conclusions can be drawn: (1) The high-exposure areas are primarily distributed in areas with higher population densities, more built-up land, and relatively developed economies. (2) The disaster reduction capabilities of the countries along the Belt and Road are inextricably linked to their socioeconomic development. The areas with high disaster reduction capability are mainly concentrated in the major cities in the Belt and Road region (i.e., Beijing, Shanghai, Hong Kong, Taipei, Bangkok, Jakarta, New Delhi, Riyadh, Abu Dhabi, Cairo). (3) The vulnerability patterns are largely determined by exposure, while the disaster reduction capability may alleviate flood pressure at local scales. The high-vulnerable areas (accounting for 3.54%) are primarily distributed in southern and eastern China, northern India, most of Bangladesh, the Indus Valley in Pakistan, the Nile River Basin in Egypt, the central region of Indonesia, and the major cities in the Belt and Road region. Obviously, these highly vulnerable regions and cities coincide with the areas that are characterized by frequent economic activities and dense populations, indicating that areas with high populations and economic densities may be more susceptible to flood disasters in the future.