1 Introduction

In recent years, meteorological disasters have become increasingly frequent as a result of global warming. In the meanwhile, compared with the rapid urbanization process, the less developed water system planning and underground pipe network construction in China lead to frequent rainstorms and waterloggings, imposing a serious effect on the traffic, tourism, routine work and life, even bringing huge threats to the personal safety of inhabitants.Affected by climate and topography,the Yangtze river basin always suffers from long periods of heavy rain in summer and is one of the most effected areas with rainstorm in China.Footnote 1 It is even more difficult for Nanjing to escape it. Therefore, urban rainstorm waterlogging has become a common concern in all sectors of society, since it may lead to congestion and submergence of areas that are prone to be submerged. And this is why Nanjing rainstorm event is a major disaster worthy of consideration. However, the form in carrying out emergency rescue work targeted at the high-risk places. All these formidable site condition makes it difficult for governments and emergency organizations to know the detail about the disaster site and the public emotion, which also poses a difficulty been hot and difficult issues in the process of the disaster emergency management.

However, the bloom of social media big data, such as microblogging, and data mining technologies, provides a viable channel for us to know disaster details and the public emotion. China owns the most netizens in the world: by the December 2015, Chinese citizens had amounted to 731 million and the popularity rate of the Internet had reached 53.2%; phone netizens had amounted to 620 million, the percentage of which increases to 90.1%. With the enlargement of netizen scale, the usage of social platforms, like microblogging and wechat, shows a gradually rising trend, for example, Sina Weibo, launched in China in August, 2009. In 2010, four largest portal websites dedicated to the development and publicity of microbloggings, especially Sina Weibo, achieving the greatest success, with active users amounting to 50 million within 1 year in 2010. Statistics suggest that the number of microblogging messages that Chinese netizens post and forward every day are 250 million. As a kind of social tool, microblogging has become a major platform for more and more people to express personal emotions and record their life. Featured by real time and timeliness, microblogging serves as a platform for the extremely rapid spread of hot issues. Therefore, microblogging contents could be a valid source for the public opinion analysis, emotion extraction and personal opinion mining. Owing to the low efficiency of information dissemination of traditional media, in the case of waterlogging disaster brought by rainstorm, relevant departments are unable to obtain the first-hand information and elicit quick response. With strong timeliness, microblogging could be employed by the public to post disaster details once natural disaster happens, which would provide a means to extract the disaster information through social big data.

Meanwhile, with global warming and the rapid progress of urbanization in China, urban rainstorm and waterlogging disasters have become frequent, which has drawn scholars’ attention to the emergency management of urban waterlogging disasters (Kaur and Singh 2016; Song and Du 2017). However, once a rainstorm disaster occurs in the city, the finding of the disaster and the waterlogging points in the emergency situation and the building of the traffic emergency network are undoubtedly important theoretical and practical problems in the emergency management of the disaster.

In this paper, we focus on the extraction of microblogging disaster data and the optimization of post-disaster rescue routes. These two parts were combined skillfully to solve the problem of rapid collection and application of disaster information. When the heavy rain came and the road was cut off, the smooth flow of road traffic will be affected by waterlogging points. Using the microblogging data, the waterlogging points can be found precisely. Based on the construction of location-routing problem model and the application of the NSGA III algorithm, the new road traffic route can be redesigned timely. Then, this work of article is novel and will be of great significance to the emergency rescue of sudden disasters such as rainstorms.

The rest of the paper is as follows: the second part is the literature review; the third part is the research of finding the waterlogging points and the disasters by using microblogging data; the fourth part is the model construction; the fifth part is the simulation calculation; and the last is the summary.

2 Literature review

2.1 Disaster information extraction of microbloggings

Currently, studies on the microblogging disaster extraction mainly center on the earthquake area. Once an earthquake happens, rescue must be conducted within the shortest time in order to reduce the loss to the minimum. Due to the lack of timeliness of traditional media, it is difficult for relevant departments to control the disaster and rescue in time and accurately. Microbloggings enjoy strong timeliness. After an earthquake, the public generally post various conditions that they are facing through microblogging and other social media. By analyzing microblogging big data, disaster details could be known, including the location and casualty of the earthquake.

In recent years, the attention and interest on using microblogging big data in disaster emergency logistics are increasing. Xu et al. (2015) puts forward the concept of position microblogging. Sakaki et al. (2010) and other researchers put forward the real-time earthquake monitoring system based on the Twitter through the Bayesian Decision Method: take keywords as the evidence. In practical application, the system has successfully detected more than 80% earthquakes and forecasted earlier than the local earthquake organizations. The system can not only detect the occurrence of earthquakes, but also can estimate the location of the earthquake on account of the location information contained in the microblogging data. Choi and Bae (2015) introduced how Social Big Board, a real-time monitoring system based on the social big data, represented the disaster and its spreading trend on the map for further disaster estimation, forewarning and precaution development by analyzing earthquake-relevant contents posted on Twitter after the Haiti Earthquake happened in 2010. Yan et al. (2010) believed that microblogging contents play a vital role in extracting disaster information by analyzing the microblogging message, developing trend of various themes and propagation process through various analysis methods by the example of Yushu Earthquake in 2010.

In a word, by mining microblogging data, earthquake information could be known and it is efficient to prevent and reduce the loss that might be caused by earthquakes. And some paper about using data from Twitter and authority to analysis flooding disaster. Nevertheless, the microblogging big data has not been applied to the emergency management research of urban rainstorm waterlogging disaster.

2.2 Emotion analysis of microbloggings

By mining and analyzing microbloggings contents, relevant departments could know the emotion of the public timely so that they could take corresponding measures to control the public emotion. By now some scholars have developed relevant research by taking advantage of microblogging big data. For instance, Zheng et al. (2012) raised an integrated microblogging data process method and detecting algorithm about news topics. Li et al. (2013) presented the topic propagation evolution model that could describe the fission propagation mode with immunization rate. Hou (2013) made an integrated analysis on the evolution process of the public emotion about the unexpected group events by the example of Shenfang Lindgrenite Event. Yang (2014a, b) raised the method to monitor the public emotion about the emergency on the internet on the basis of the microblogging data.

Microblogging emotion analysis is conducted through three steps: text preprocessing, emotion information extraction and emotion classification. This kind of research is quite common. For instance, Zheng (2014) came up with the concept of emotion polarity: classify the emotion polarity into positive, neutral and negative by comparing the number of positive and negative emotion vocabularies in the text. Zhou et al. (2013) put forward the method to build Chinese emotion dictionary based on the HowNet and SentiWordNet. The experiment represented that the dictionary is superior to those emotion dictionary with general polarity, which provides valid dictionary resources for emotion analysis studies. Preethi et al. (2015) introduced a generalized model that could be adopted to determine the possibility of an event happens in the future by estimating the causal relationship between the event and its corresponding emotion reaction on the basis of timeliness emotion analysis of Twitter. Take the Sandy Hurricane happened in October, 2012 as an example, Neppalli et al. (2017) visualized the hurricane-related emotions on the map based on the Twitter big data. The study showed that the public emotions are not only related to their geographical position, but also the distance range that the hurricane has affected. Although currently there are abundant studies about emotion analysis on the microblogging big data, the emotion analysis on the urban rainstorm waterlogging based on the microblogging big data are quite rare.

On the basis of the above research, the paper extracts and analyzes the Sina Weibo data during the heavy rain occurrence to further analyze the disaster information and the public emotion, select urban rainstorm waterlogging areas with high risks and provide empirical support for disaster emergency management by the example of the heavy rainfall on June 18th and July 7th in 2016.

2.3 Urban emergency logistics in waterlogging disasters

The decision-making in the presence of urban rainstorm disasters is a focus of attention for scholars. People like Chen et al. (2016) build an urban waterlogging disaster emergency shelter to improve the flexibility and capacity for decision making in emergency management departments; Deng et al. (2016) put forward suggestions on how Shanghai could prevent and cope with waterlogging. Hu et al. (2015) constructed the two-stage stochastic mixed integer programming model at the aim of minimizing the total cost of logistics; Su et al. (2016) presented an integrated simulation method to analyze the effects of urban rainstorm and waterlogging on traffic congestion. However, the research on how to carry out disaster relief material delivery on flooded roads after urban rainstorm and waterlogging is rare. But it is a key issue that needs to be solved in real life as well as a difficult and hot issue that people are concerned about. Therefore, this paper, by proposing an optimization model of post-disaster material delivery and carrying out the demonstration by taking Nanjing as an example, is of great theoretical and practical significance.

Disaster relief supplies can be classified as a sort of location-routing problem. In 1970s, Cooper (1972, 1976) combined the distribution center location problem with the vehicle routing problem, and put forward the transportation-positioning problem. People like Rahman and Smith (2000) suggested that the key to solving the location-routing problem was constructing a reasonable optimization model. During the delivery of emergency supplies, the effect of the relief effort depends on the ability to deliver emergency supplies to affected areas with the minimum amount of time and minimal cost of emergency relief. People such as Li and Zhou (2014) and Moshref-Javadi and Lee (2016) constructed the rescue model aiming at spending the shortest time; people like Pranmudita et al. (2014) constructed the model with the aim of minimizing the total cost; people including Wei and Özdamarb (2007), Rath and Gutjahr (2014), Wang et al. (2014), Burkart et al. (2016), Khalipourazari and Khamseh (2017), Tayal and Singh (2017), Chang et al. (2017) and Song and Du (2017) constructed multi-objective models considering a variety of mixing factors to solve the multi-objective problems.

The difficulty of research lies in how to obtain optimal or satisfactory solutions of these models. Scholars often use heuristic algorithms, among which the two-stage heuristic algorithms are used more frequently (Mete and Zabinsky 2010; Han et al. 2011; Ahmadi et al. 2015). Some other algorithms such as simulated annealing algorithm and ant colony algorithm are also used by scholars (Jotshi et al. 2009; Yu et al. 2010; Dalfard et al. 2013; Ting and Chen 2013). Genetic algorithm is one of the most common used algorithms, but the hybrid optimization algorithm is more adopted by combining basic genetic algorithm with other algorithms due to the complex design and long running time of basic genetic algorithm (Wu et al. 2002; Wang et al. 2015; Asgari et al. 2017). For instance, by comparing and studying, scholars (Rabiee et al. 2012; Fallah-Mehdipoura et al. 2012; Bandyopadhyay and Bhattacharya. 2013; Kalaivani et al. 2013; Pasandideh et al. 2013; Pranmudita et al. 2014; Song et al. 2017) found that non-dominated sorting genetic algorithm had obvious advantages among the current multi-objective optimization algorithms; The improved NSGA-III algorithm, in particular had great advantage in running time and solution quality. Therefore, in recent years, the improved NSGA-III algorithm has been applied to the research of emergency and other multi-objective planning in many literatures (Mohapatra et al. 2015; Duan et al. 2016; Yu et al. 2017; Zhou et al. 2017; Feng et al. 2017). Based on the above considerations, the algorithm was also selected in this paper.

Based on the existing literature review, there exists very little research in which a city in taken as an example to study emergency material delivery under the circumstance of rainstorm flood disaster. As far as the researchers’ vision is concerned, there has not yet been research of building the LRP model based on the disaster information obtained via the microblogging. Therefore, in the paper, information on the disasters is obtained through microblogging data for the first time, and then the paper presented the LRP model in the context of rainstorm and waterlogging disaster, and by taking Nanjing as an example, used non-dominated sorting algorithm NSGA-III to get solutions. The strategy and method of the research has enriched the existing research literature of disaster management, and the conclusion has provided an empirical reference for Nanjing to cope with the rainstorm and waterlogging disaster.

3 Obtainment of disaster information via microblogging data

3.1 Research steps

On the basis of the Sina Weibo content, take the following three steps to extract the urban rainstorm waterblogging information from microbloggings of numerous users:

First step: data extraction and IP address translation. Take the persistent strong rainfall from the mid-June to the beginning of July in 2016 as an example, 6,385 microbloggings are extracted from June 18th to July 7th. Owing to the long time quantum, microblogging contents are divided into two parts and processed separately to specifically reflect the variation trend of waterlogging during this period in Nanjing: from June 18th to June 30th and from July 1st to July 7th. The paper will analyze the seriousness level of waterlogging in all administrative regions of Nanjing and find out areas prone to be submerged during this strong rainfall via microblogging contents. Since the IP address contained by the microblogging cannot directly reflect the administrative region of the blogger, all IP addresses are translated into physical addresses through IP address convertor. Then all microbloggings can be classified according to administrative regions for further analysis.

Second step: repetition and noise elimination, word segmentation and index establishment. It is necessary to eliminate the repetition and noise, segment words and establish the index because of the loud noise and plenty of invalid repetition in the original microblogging data. ICTCLAS is used for word segmentation to process microblogging contents. The result of the word segmentation is classified according to the rainfall reasons, the description about the ponding of the public, the effect of the waterlogging on traffic and the emotion reaction of the public (see the attachment Table 9 for details); the waterlogging information and emotion description are rated and coded in order to compare the strong rainfall information and emotion reaction in various administrative regions of Nanjing (see attachment Tables 10, 11 for details).

Third step: contrastive analysis of these two phases. According to the word segmentation result, for one thing, the variation trend of the waterlogging in the two phases and watrerlogging differences in various administrative regions of Nanjing can be determined and areas prone to be submerged can also be found; for another, the varying regularity of the public’s emotions during the urban waterlogging could be analyzed and the emotion difference of the public in various administrative regions could be achieved.

Targeting at the urban rainstorm waterlogging disaster, the paper designs the following extraction process (Fig. 1).

Fig. 1
figure 1

The extraction process of urban rainstorm waterlogging information based on the microblog big data

3.2 Research methods

The present paper adopts the word segmentation to analyze the enormous microblogging text data. Words are processed by the currently universal ICTCLAS, which enjoys high accuracy and can label the part of speech and its corresponding weight and frequency of each word. The main purpose of the paper is to distinguish the disaster degree of waterlogging in all administrative regions of Nanjing. After the word segmentation, the urban rainstorm waterlogging reflecting table and disaster classification coding table could be established and then disaster values of all administrative regions could be normalized by the word frequency and weight of the strong rainfall description.

3.3 Empirical results

3.3.1 A brief introduction about the disaster

In 2016, affected by El Nino, the rainfall in the Yangtze River basin was significantly more than that in previous years, and brought great influence to the entire river basin. Nanjing was one of the most affected cities in the rainstorm, where the rainfall was of high and sudden intensity and long duration. Nanjing began to rain form June 5th, 2016 and it nearly did not stop until the beginning July. From the mid-June to the end of June, the rain was normal, mainly thundershower and moderate rain; at the beginning of July, the rain started to be stronger, mainly moderate and torrential rain. This persistent rain refreshed the rainfall amount record of Nanjing, amounting to 441 mm on June 27th.

The persistent strong rainfall directly caused serious ponding in urban areas of Nanjing, so that the emergency office of municipal government kept raising the rating of flood prevention and emergency response. There were numerous ponding in urban areas: the depth of ponding in Nanjing University of Science and Technology amounted to 1.5 m; the rainwater submerged the garage in the Yinchengdongyuan housing estate; numerous cars were submerged; the metro station of the Mingfa Square was submerged by rainwater, the depth of ponding on the Saihongqiao Street amounted to almost 2 m, which imposed great negative influence on citizens’ property safety and transportation.

From the result of word segmentation of microbloggings, people gave the following description: convective weather, convection, violent typhoon, Nepartak, plum rains, plum, and so on. Since June belongs to rainy seasons, adding to that strong convective weather and typhoon are common in summer, these two factors led to the lasting rainstorm with huge amount of rainfall. People also offered the following description about the disaster: water level rises of Qinhuaihe River; dragon water on the city wall; traffic paralysis; the Yangtze River exceeds the warning level; rapid increase of water level of Xuhe River. The disaster brought huge challenge to the city water prevention system of Nanjing.

3.3.2 Contrastive analysis on the waterlogging level

How to evaluate the waterlogging level in various administrative regions of Nanjing is an important issue for research. Generally, people regard disaster degree as the rating standard to measure the loss of the natural disaster. Cheng et al.(2011) put forward the synthetical disaster index (SDI), which can be used to normalize a series of disaster evaluation factors and obtain quantitative disaster evaluation index by weighted addition. Ma (2006) advocated to adopting the disaster degree rating standard of dual-factor evaluation at the early stage: the death roll and the value of the social property loss in a disaster. In the study of the disaster degree of floods, some scholars presented indicators that reflect the size of the disaster area (including the size, population and the number of counties that the disaster has affected). By three-factor analysis, Mao (2000) built the flood disaster index (including the affected area, affected population and food reduction) to evaluate the disaster degree of flood, waterlogging and flood-waterlogging disaster in Dongtinghu Region.

It can be known from the microblogging content that this torrential rain did not cause any casualties; however, the direct and indirect economic loss caused by waterlogging was unlikely to be calculated through micrblog contents. Therefore, the disaster rating of the urban rainstorm waterlogging is mainly influenced by the amount of rainfall, ponding range, ponding depth and traffic jam. Based on the previous research, the paper summarizes the following formula to calculate the disaster rating of urban rainstorm waterloggingFootnote 2:

$$ {\text{Disaster}}\,{\text{index}} = {\text{ponding}}\,{\text{range}}*30\% + {\text{ponding}}\,{\text{depth}}*30\% + {\text{traffic}}\,{\text{jam}}*40\% $$

Based on the above formula, the value of rainstorm waterlogging disaster is quantized and classified into five ratings: low-grade, mild, ordinary, serious, grave. The results can be seen in the following two figures (Figs. 2, 3).

Fig. 2
figure 2

The rainstorm waterlogging disaster from June 18th to June 30th

Fig. 3
figure 3

The rainstorm waterlogging disaster from July 1st to July 7th

In the last 10 days of June, ponding in Qinhuai Region and Xuanwu Region was ordinary—even more serious than the other regions; ponding in Pukou Region, Lishui Region and Gaochun Region was mild. In the first 10 days of July, the rain became strong, mainly heavy rain, rainstorm and heavy rainstorm. Adding to the previous persistent rain, the increase of rainfall directly led to severe ponding in various administrative regions of Nanjing. There were plenty of direct descriptions about the waterlogging in contents of microbloggings. For instance, see the sea; the deepest depth of ponding amounts to 1.5 m; hundreds of cars were submerged; and so on. The ponding in Nanjing was more serious than before as whole: the ponding in Qinhuai Region, Gulou Region and Jiangning Region was grave; the ponding in Jianye Region and Yuhuatai Region was serious; the ponding in Pukou Region, Gaochun Region and Lishui region was mild. At the same time, urban places prone to be submerged can be determined by the ponding depth based on the direct description of ponding depth on specific streets in the microblogging, such as “the deepest depth of ponding was 1.5 m”, “the deepest depth of ponding was more than 2 m”, “the depth of ponding was from 60 to 80 cm”, and so on. Areas prone to be submerged mainly centered in Xuanwu Region, Qinhuai Region, Gulou Region, Jianye region, Yuhua Region and Pukou Region. Among these areas, the ponding in Saihongqiao Street was the most grave, with depth of 2 m; the ponding depth in Nanjing University of Science and Technology was 1.5 m; the rainwater submerged the metro station in Mingfa Square, which greatly affected the normal function of metro system.

The ponding in Qinhuai Region was the most serious. Nanjing University of Science and Technology, Zijin East Road and HoubiaoYing Road all become places prone to be submerged. Located at the south part of Zijin Mountain, all these places were low-lying. Adding to the enormous flood from the Zijin Mountain, lots of ponding failed to discharge in time, which made the ponding worse. The layout of areas prone to be submerged is showed as follows (Fig. 4).

Fig. 4
figure 4

The layout of areas prone to be submerged from July 1st to July 7th

3.3.3 The subjective emotion of the public

After the urban rainstorm waterlogging, the emotion of the public can be estimated by analyzing contents of microbloggings, so that corresponding emergency management could be conducted on the basis of the public opinion. After the strong rain happened in June 2016 and subsequent ponding, microblogging users expressed their emotion reactions immediately, for example: “smile”, “titter”, “full of tears”, “cry”, “curse in rage”, and so on. Based on the emotion analysis on microblogging contents, the present paper divides the emotion reaction of the public into five categories: exultant, happy, ordinary, unhappy and terribly upset.

From June 18th to June 30th, the public basically had no negative emotions. Although there were some negative vocabularies, such as “tears”, “cry”, “heartbroken”, and the like, the percent of these words was quite small. Instead, there appeared some positive vocabularies like “haha”, “titter”, “smile” and “handclap”. It can be seen that there was not obvious negative emotion among the public during this phase.

From July 1st to July 7th, the public began to show strong negative emotions compared with the previous phase: “whine”, “sad”, “fear”, “full of tears” and “sorrow”. The percentage of negative vocabularies was large. There were a few positive vocabularies with low percentage. Therefore, the public expressed intense negative emotion during this phase, mainly shown as anger, panic, fear, sadness, and annoyance. The negative emotion in Qinhuai Region and Gulou Region was the most intense, rating as terribly upset; the negative emotion in Jianye Region and Xuanwu Region rating as unhappy; the negative emotion in Lishui Region and Gaochun Region was the least intense, rating as exultant.

The emotion reactions of the public toward the urban rainstorm waterlogging in June and July can be seen in the following figures (Figs. 5, 6).

Fig. 5
figure 5

Emotion reactions of the public from June 18th to June 30th

Fig. 6
figure 6

Emotion reactions of the public from July 1st to July 7th

4 Construction of location-routing problem model

According to the disaster information obtained in the third part, the LRP model is built, and then the road traffic data of Nanjing is used to optimize the road traffic after interruption.

4.1 Problem description

The location-routing problem of emergency logistics after rainstorm and waterlogging disaster is described as follows. The location of emergency logistics distribution center, the construction cost and capacity, the capacity of distribution vehicle and the location of the affected point (area) of material rescue have been known. Besides, the demand for various materials can be collected and predicted by the logistics information platform. Based on the information above, the emergency distribution center needs to be determined to make sure that the total demand of the affected points on each distribution route starting from a distribution center is less than the capacity of the distribution center, and that the amount of transport supplies on a single route is less than the total capacity of all the vehicles on the route. Transportation route should be planned reasonably based on the above-all two premises with an aim of making sure the transport time is the shortest and the cost is the least in the whole transport network.

4.2 Basic hypotheses and symbol description

  1. (1)

    Hypothesis about the background of Web design

    In the design of the city’s logistics network, exclusive areas which the urban road network can reach are taken into consideration, and the road transport is the exclusive choice for transportation. It is assumed that the road network condition is static; however, due to the complexity of real condition, network circuitous factors are added to the model.

  2. (2)

    Hypothesis about distribution centers

    Assume that the distribution centers are more than two and the capacity is limited and known. After waterlogging disaster, a certain number of distribution centers are started for material distribution according to the demand of the affected areas; the total demand of affected points on all distribution routes from a distribution center should be less than the maximum capacity of the distribution center.

  3. (3)

    Hypothesis about transport vehicles

    Assume that the vehicles are of the same size and have limited capacity and that the same vehicle can be used to deliver different demand materials; the transport vehicle starts from a distribution center and returns to it after distributing to all the affected points on a route to wait for the next round of delivery. For the sake of simplification, the transport vehicles on each route are considered as a whole, and the total demand of the route is determined before allocating specific number of vehicles; the vehicle load on a route should be greater than or equal to the total demand of all affected points on the route.

  4. (4)

    Hypothesis about affected points

    According to real situation, although the demand of affected points can be predicted by corresponding logistics information system, it is not accurate. Therefore, the material demand of affected points can be signified by triangular fuzzy numbers and be transformed into real numbers in the interval [0, 1] by function in operations (Fig. 7).

    Fig. 7
    figure 7

    Conceptual model

The established model symbols are illustrated below:

  1. (1)

    Collection setting

    P:

    Distribution CENTER COLLECTION, P = {1, 2, …, p};

    Q:

    Affected points collection, Q = {p + 1, p + 2, …, p + q};

    R:

    Node collection, R = P ∪ Q = {1, 2, …, p + q};

    V:

    Vehicle collection, V = {1, 2, …, k}

  2. (2)

    Parameter setting

    \( F_{i} \):

    Construction cost of material distribution center i,\( i \in P \);

    \( \delta_{i} \):

    Maximum storage capacity of i, \( i \in P \);

    \( n_{k} \):

    Maximum carrying capacity of vehicle k, \( k \in V \);

    \( C_{k} \):

    Fixed use cost of vehicle k, \( k \in V \);

    \( d_{ij} \):

    Distance between i and j, \( i,j \in R \);

    \( v_{k} \):

    The average travel speed of vehicle k, \( k \in V \);

    \( c_{ijk} \):

    The transportation cost of per unit distance of vehicle k from i to j, \( i,j \in R \), \( k \in V \);

    \( \partial_{ij} \):

    The circuitous factor of node i to node j, \( \omega_{ij} \ge 1 \), \( i,j \in R \), when \( i,j \in R \), \( \partial_{ij} = 0 \);

    \( MT_{j} \):

    The time limit for the delivery of goods to affected point j, \( j \in Q \);

    \( T_{jk} \):

    The time when vehicle k reaches node j, \( j \in R \), when \( j \in P \), \( T_{jk} = 0 \);

    \( t_{ijk} \):

    The time vehicle k takes from node i to node j, \( i,j \in R \) , \( k \in V \), \( t_{ijk} = d_{ij} /v_{k} \);

    \( \tilde{D}_{j} \):

    The demand of affected point j is expressed by triangular fuzzy numbers \( \tilde{D}_{j} \left( {\alpha_{j} ,\beta_{j} ,\gamma_{j} } \right) \); \( \alpha_{j} \), \( \beta_{j} \) and \( \gamma_{j} \) can be collected by the emergency logistics information system under the circumstance of disaster;

    \( p_{1} \):

    Confidence coefficient limited by emergency vehicle capacity, \( p_{1} \in \left[ {0,1} \right] \);

    \( p_{2} \):

    Confidence coefficient limited by distribution center, \( p_{2} \in \left[ {0,1} \right] \)

  3. (3)

    Variable setting

    When emergency distribution center i\( (i \in P) \) is started, xi = 1, or else xi = 0;

    When delivery vehicle k is called from emergency distribution center I \( (i \in P,k \in V) \), yik = 1, or else yik = 0;

    When the delivery vehicle drives from node i to node j\( (i \ne j,i \in P,k \in V) \), zijk = 1, or else zijk = 0.

4.3 Model establishing

Objective functions:

$$ \hbox{min} Z_{1} = \sum\limits_{i \in Q} {\sum\limits_{k \in V} {MT_{ik} } } $$
(1)
$$ \hbox{min} Z_{2} = \sum\limits_{i \in P} {F_{i} x_{i} + \sum\limits_{i \in R} {\sum\limits_{j \in R} {\sum\limits_{k \in V} {c_{ijk} \partial_{ij} d_{ij} z_{ijk} + \sum\limits_{i \in Q} {\sum\limits_{k \in V} {c_{j} y_{ik} } } } } } } $$
(2)

Constraint conditions:

$$ V_{1} \left( {\sum\limits_{i \in R} {\sum\limits_{j \in Q} {z_{ijk} \tilde{D}_{j} \le n_{k} } } } \right) \ge p_{1} ,\quad \forall k \in V $$
(3)
$$ V_{2} \left( {\sum\limits_{i \in R} {\sum\limits_{j \in Q} {\sum\limits_{k \in V} {y_{lk} } z_{ijk} \tilde{D}_{j} \le \delta_{l} x_{l} } } } \right) \ge p_{2} ,\quad \forall l \in P $$
(4)
$$ T_{jk} = T_{ik} + t_{ijk} z_{ijk} ,\quad \forall i,j \in R,\;\forall k \in V $$
(5)
$$ MT_{j} \ge T_{jk} ,\quad \forall j \in Q,\;\forall k \in V $$
(6)
$$ \sum\limits_{j \in Q} {\sum\limits_{k \in V} {z_{ijk} - x_{i} \ge 0,} \;when\;} x_{i} = 1\;and\;\forall i \in P $$
(7)
$$ \sum\limits_{j \in Q} {z_{ijk} - x_{i} \le 0,whenx_{i} = 0 \, and\quad \forall i \in R,\forall k \in V} $$
(8)
$$ \sum\limits_{j \in R} {z_{ijk} - } \sum\limits_{j \in R} {z_{jik} } = 0,\quad \forall i \in R,\;\forall k \in V $$
(9)
$$ \sum\limits_{k \in V} {z_{ijk} = 0,\quad \forall i,j \in P} $$
(10)
$$ \sum\limits_{i \in R} {\sum\limits_{k \in V} {z_{ijk} } } > 0,\quad \forall j \in R $$
(11)
$$ x_{i} = \left\{ {0,1} \right\},\quad \forall i \in P $$
(12)
$$ y_{ik} = \left\{ {0,1} \right\},\quad \forall i \in P,\;\forall k \in V $$
(13)
$$ z_{ijk} = \left\{ {0,1} \right\},\quad \forall i,j \in R,\;\forall k \in V $$
(14)

Objective functions (1) and (2) are the general goals of the LRP model.

In the main objective function, (1) represents the total amount of time of delivering all materials, but the time of returning to the distribution center is not included. (2) represents the total cost of emergency rescue including the construction cost of the emergency distribution center and the cost of transport vehicles. The constraint condition (3) ensures that the demand of the affected point is not greater than the probability of the vehicle capacity but is not less than the given probability P1. Condition (4) ensures that the demand of affected points is not greater than the probability of the capacity of the emergency distribution center and not less than the given probability P2. Condition (5) gives explanation for Tjk. Condition (6) ensures that the time that the vehicle actually reaches the affected point is less than the specified time limit. Condition (7) ensures that the distribution center must deliver supplies to the affected areas after started. Condition (8) ensures that the distribution center does not deliver supplies without being started. Condition (9) ensures that the vehicle starting from a distribution center eventually returns to it. Condition (10) ensures that there is no more than one emergency distribution center on an independent rescue route. Condition (11) ensures that every affected point can get supplies. Conditions (12), (13) and (14) are integer variables, which stipulate the value of the variable is zero or one.

Constraint conditions (3) and (4) contain the triangular fuzzy number \( \tilde{D}_{j} \), which needs to be processed by real numbers. Suppose the demand of affected point j to be \( \tilde{D}_{j} \left( {\alpha_{j} ,\beta_{j} ,\gamma_{j} } \right) \), the remaining capacity of transport vehicle k after delivering to be \( \Delta \tilde{\varLambda }_{k} \), and the remaining capacity of distribution center i after serving all the affected points within to be \( \Delta \tilde{\varOmega }_{i} \). \( \Delta \tilde{\varLambda }_{k} \) and \( \Delta \tilde{\varOmega }_{i} \) are both triangular fuzzy numbers, namely,

$$ \begin{aligned} \Delta \tilde{\varLambda }_{k} & = \left( {\Delta \varLambda_{k1} ,\Delta \varLambda_{k2} ,\Delta \varLambda_{k3} } \right) \\ & = \left( {n_{k} - \sum\limits_{l \in R} {\sum\limits_{j \in Q} {z_{ljk} \gamma_{j} ,n_{k} - \sum\limits_{l \in R} {\sum\limits_{j \in Q} {z_{ljk} \beta_{j} ,n_{k} - \sum\limits_{l \in R} {\sum\limits_{j \in Q} {z_{ljk} \alpha_{j} } } } } } } } \right) \\ \end{aligned} $$
(15)
$$ \begin{aligned} \Delta \tilde{\varOmega }_{i} & = \left( {\Delta \varOmega_{i1} ,\Delta \varOmega_{i2} ,\Delta \varOmega_{i3} } \right) \\ & = \left( {\delta_{i} x_{i} - \sum\limits_{l \in R} {\sum\limits_{j \in Q} {\sum\limits_{k \in V} {z_{ljk} \gamma_{j} } ,\delta_{i} x_{i} - \sum\limits_{l \in R} {\sum\limits_{j \in Q} {\sum\limits_{k \in V} {z_{ljk} \beta_{j} } ,\delta_{i} x_{i} - \sum\limits_{l \in R} {\sum\limits_{j \in Q} {\sum\limits_{k \in V} {z_{ljk} \alpha_{j} } } } } } } } } \right) \\ \end{aligned} $$
(16)

and:

$$ V_{1} \left( {\Delta \tilde{\varLambda }_{k} \ge 0} \right) = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {\Delta \varLambda_{k2} > 0} \hfill \\ {\frac{{\Delta \varLambda_{k3} }}{{\Delta \varLambda_{k3} - \Delta \varLambda_{k2} }},} \hfill & {\Delta \varLambda_{k2} \le 0 \le \Delta \varLambda_{k3} } \hfill \\ {0,} \hfill & {\Delta \varLambda_{k3} < 0} \hfill \\ \end{array} } \right. $$
(17)
$$ V_{2} \left( {\Delta \tilde{\varOmega }_{i} \ge 0} \right) = \left\{ {\begin{array}{*{20}l} {1,} \hfill & {\Delta \varOmega_{k2} > 0} \hfill \\ {\frac{{\Delta \varOmega_{k3} }}{{\Delta \varOmega_{k3} - \Delta \varOmega_{k2} }},} \hfill & {\Delta \varOmega_{k2} \le 0 \le \Delta \varOmega_{k3} } \hfill \\ {0,} \hfill & {\Delta \varOmega_{k3} < 0} \hfill \\ \end{array} } \right., $$
(18)

By the above transformation, constraint conditions (3) and (4) are respectively equivalent to:

$$ V_{1} \left( {\Delta \tilde{\varLambda }_{k} \ge 0} \right) \ge p_{1} ,\quad \forall k \in V $$
(19)
$$ V_{2} \left( {\Delta \tilde{\varOmega }_{i} \ge 0} \right) \ge p_{2} ,\quad \forall l \in P $$
(20)

5 Stimulation application

5.1 Algorithm

Non-dominated sorting genetic algorithm NSGA, based on the optimality concept of Pareto, was proposed by Srinivas and Deb (1995). Deb et al. (2000) proposed the improved algorithm NSGA-II, which expanded the sample space, by introducing the elite strategy and reduced the computational complexity of the algorithm by using rapid non-dominated sorting method. The steps are taken as follows.

  1. (1)

    Supposing that there is a population A of size N, after applying genetic operators (selection, recombination and mutation) to which, population B of size N is obtained. Population C of size 2N can be obtained by combining population A and B.

  2. (2)

    Rank population C in a non-dominant way to obtain individuals of non-dominant layers 1, 2, 3 …, and add all the individuals in order to set D of the next generation until the size of set D is larger than N. Record the non-dominant layer as L, and then select K individuals from layer L to make the sum of K and all of the previous layer individuals equal to N.

  3. (3)

    Operation of function standard quantization

    First, calculate the minimum on each target dimension i in M objective functions to obtain the corresponding minimum zi of the goal i. The set of zi is the ideal point set, perform quantitative operations on which and the formula is: \( f_{i}^{{\prime }} \left( x \right) = f_{i} \left( x \right) - z_{i}^{\hbox{min} } \). Then seek for the extreme point by traversing each objective function with \( ASF\left( {X,W} \right) = MAX_{i = 1:m} \frac{{f_{i}^{{\prime }} \left( x \right)}}{{W_{i} }} \) to obtain individuals of the minimum ASF, which are the extreme points. Based on the specific function values of these points, the intercepts on the corresponding axes are calculated and recorded as ai. Finally, the normalization operation is carried out according to the following formula: \( f_{i}^{n} \left( x \right) = \frac{{f_{i}^{{\prime }} \left( x \right)}}{{a_{i} }} \).

  4. (4)

    Associating individuals with reference points

    The corresponding divisiory reference points are obtained with recursive method to construct the vector quantity of the reference points. The individual of each population is then traversed to find the nearest reference point for each population and meanwhile record the information of reference points and the corresponding shortest distance.

  5. (5)

    Selecting the sub-generation and deleting reference points

    After non-dominated sorting, assume that from the first non-dominated layer to layer L the total number of members of a population is larger than population size N, and then define St + 1 as a collection that contains all the individuals in L. St + 1 needs to be screened because its number is larger than the pre-set number of members of the population. First, traverse each reference point to view the times that it is cited by St + 1 except layer L, and then find out the reference point with the least cited times, which is the one that is associated with least amount of population individuals, and record the cited times as pi. Next, the following points need to be discussed. (1) Assuming pi = 0, if individuals in layer 1, 2, ……, L are associated with the vector quantity of the reference point, find out the point of the shortest distance and extract and then add it to the selected next population, setting pi = pi + 1; otherwise, delete the vector quantity of the reference point. (2) If pi > 0, select the reference point of the shortest distance. The program does not end until the size of population becomes N.

5.2 Application background analysis

In 2016, affected by El Nino, the rainfall in the Yangtze River basin was significantly more than that in previous years, and the water level was much higher than the annual average. Since June 27, there had been a wide range of continuous heavy rain in areas along and southwest parts of the middle and lower reaches of the Yangtze River, which had stricken 2.772 million people from Jiang Su and other 10 provinces (municipalities directly under the central government), 37 cities (autonomous prefecture) and 133 counties (cities or areas). The rainstorm caused 14 dead and 20 missing as well as direct economic loss of 3.14 billion yuan. Nanjing was one of the most affected cities in the rainstorm, where the rainfall was of high and sudden intensity and long duration. The precipitation in many areas including the main urban area, Gaochun and Lishui all surpassed the peak of the same period in history. By July 7, the rainfall in the main urban area reached 235.5 mm. At least dozens of sections of road were encountered with traffic interruption because of serious seeper in urban area; subway line 3 was forced to shut down for water inflow; the water surface in serious waterlogging stricken areas even drew close to the height of traffic light. Water flowed backward in parts of the road, which had caused serious threat to people’s life and property. Under this circumstance, how to carry out the distribution of emergency supplies became an important problem to be solved urgently.

5.3 Data collecting and dealing

Assume under the rainstorm waterlogging disaster, each candidate distribution center has been established. The only question is to decide which centers are to be started supposing that the distribution center has enough supplies to meet the demand of the whole city.

  1. (1)

    Demand data of the affected areas

    After severe rainstorm and waterlogging, the number of people affected by the disaster determines the amount of supplies. The higher the population intensity is, the greater the demand for supplies is. The eleven districts of Nanjing were selected as the affected areas, and the regional permanent population, administrative area and population density were shown in Table 1.

    Table 1 Data of population and area of district in Nanjing

According to the density of affected population and the number of resident population as well as the actual data, the demand for all sorts of materials that the affected areas of Nanjing needed in rainstorm could be predicted. Rainstorms typically occur during summer, so the supplies needed are food, water, tents and emergency medicine. Referring to Yang Yang’s design (2014), considering both the minimum and maximum amount of these items per person needed per day and the number of the resident population of each district, the demand for all sorts of materials that the affected points in each area needed were figured out as shown in Table 2.

Table 2 Supplies demand of the affected areas (tons)
  1. (2)

    Data of emergency distribution center

    According to the actual situation of Nanjing, the five districts of Qinhuai, Gulou, Jianye, Pukou and Jiangning were selected to build candidate emergency distribution centers. Old urban areas of Nanjing including Qinhuai and Gulou district, owing to its large resident population and outdated planning of drainage systems, the occurrence of waterlogging is really frequent. Jianye district, which is near the Yangtze River and located in the west of Qinhuai River, was low-lying and the pressure on drainage system was high. However, it has developed into the vice center of Nanjing. Pukou and Jiangning districts, respectively located at the core of the north and south of Nanjing, can radiate the surrounding areas. The commissioning cost and inventory capacity of the candidate emergency distribution centers are assumed data, and the location is where each district government lies. The specific data are shown in Table 3.

    Table 3 Data of candidate emergency logistics distribution center

It is assumed that the location of each distribution center and the affected point is determined, as shown in Fig. 8. Code A–K corresponds to the affected points of eleven districts.

Fig. 8
figure 8

Location of distribution center and affected points

According to Baidu map, the shortest practical distance between points is figured out by kilometer, among which, 1–5 in Table 4 are the corresponding candidate emergency distribution centers in Table 3. Since there is only one distribution center per route, it is assumed that the distance between each distribution center is finite. The data of distance between each node is shown in Table 4.

Table 4 Distances between each node

According to the actual situation of Nanjing, other parameters are set. The average speed of the vehicle is set as \( v_{k} = 50\,{\text{km/h}} \), the driving cost of unit distance as cijk = 10.5 yuan/km, vehicle carrying load as \( n_{k} = 20 \) tons, fixed using cost of vehicle as \( C_{k} = 450 \). And set \( p_{1} = 0.88 \), \( p_{2} = 0.85 \), \( \partial_{ij} = 1.2 \).

5.4 Solution to model application

  1. 1.

    First, assume that all routes are accessible. The shortest time is the main objective function, so the calculation result of the shortest time is the final result. With Java programming, after the iteration of 10,000 times, the time required for delivery was 1.974 h and the total cost was 175,294.75 yuan. The emergency distribution center, route arrangement as well as the maximum material demand and the needed relief vehicles of each route are shown in Tables 5 and 6.

    Table 5 Emergency distribution centers and route arrangement
    Table 6 Distributed materials and vehicles required on each route

    The unloading time of vehicles after reaching the affected points should be taken into consideration. If there are multiple affected points on a route, the relief vehicles required on this route are the sum of the vehicles needed for these affected points. For instance, on route 6, five vehicles are issued from the distribution center of Jiangning district, of which two stop and are unloaded at the affected point of Lishui district, and the other three continue to transport to Gaochun district. The route arrangement is shown in Fig. 9 below.

    Fig. 9
    figure 9

    Route arrangement without route interrupts

  2. (2)

    According to the actual situation analyzed above, the old urban areas of Nanjing such as Qinhuai and Gulou districts are vulnerable to waterlogging. It is assumed that the road between the affected point of Qinhuai district and the affected point of Gulou district is interrupted by the south road, which causes the road between the two affected points is not directly accessible, so the route needs to be re-planned and re-calculated. The time needed for the delivery is 1.974 h, and the total cost is 19,0566.7 yuan, which is 1,5271.95 yuan more than the cost of the initial calculation. The emergency distribution center, route arrangement as well as the maximum material demand and required relief vehicles of each route are shown in Tables 7 and 8. The route arrangement is shown in Fig. 10 below.

    Table 7 Emergency distribution centers and route arrangement
    Table 8 Distributed materials and required vehicle
    Fig. 10
    figure 10

    Route arrangement with route interrupts

6 Conclusion

The present paper introduces the process and methods to analyze the urban rainstorm waterlogging and its corresponding emotions of the public on the basis of microblogging big data. First of all, eliminate the repetitive contents of original microblogings, translate all IP addresses into physical location and classify all microbloggings according to administrative regions of Nanjing. Second, Segment words and label the part of speech of each word of all microbloggings by ICTCLAS. Third, establish the urban rainstorm waterlogging reflecting table and waterlogging disaster and subjective emotion classification coding table. Last but not least, divide microbloggings of all administrative regions into two phases: from June 18th to June 30th and from July 1st to July 7th, extract the rainstorm waterlogging and emotion vocabularies and calculate their percentages separately to determine the disaster degree of waterlogging and the emotion reaction of the public in various administrative regions of Nanjing. On the ground of descriptions about the ponding depth of a certain location that frequently appeared in microbloggings, areas prone to be submerged during rainstorm can be determined by classifying the depth of ponding.

With the waterlogging points in the presence of disasters found in the first part, the re-optimization is made to develop traffic road. This paper constructed the location-routing problem model of urban emergency logistics in the situation of rainstorm and waterlogging disaster, and by taking Nanjing as an example, found out the dynamic emergency distribution path of Nanjing in the situation of waterlogging disaster with NSGA-III algorithm. Meanwhile, assuming that the route between Gulou and Qinhuai is interrupted, the paper re-planned and re-calculated the distributing route. The feasibility of the location-routing problem model was verified in the case.

However, what needs to be noted is that the design of the location-routing problem model contains many hypotheses such as distribution centers are fixed, route network are closed, and that a vehicle can only running between the distribution center and its destination. However, in reality, there exists temporary distribution centers, much wider road network and changeable parameter values, therefore, the orientation of future research be targeted on how to make the model more realistic.

Finally, with the approaching big data and rapid development of big data analysis technologies, microblogging and other social media will gradually become an important source of disaster information extraction due to its timeliness, huge data amount and abundant contents. These information could timely provide the location of disaster areas, the material requirements and the damaged routes. So they are of importance for disaster relief, especially for the delivery of emergency relief supplies and planning of transportation routes. In the future, there are also lots of problems about disaster management that could combine with the big data analysis technologies to develop the ability and level of disaster relief.