1 Introduction

Social media, more specifically twitter and microblog, play more and more important roles in emergency management (Starbird and Palen 2012; Merchant et al. 2011; Pennington-Gray et al. 2013). Applying social media data to crisis management has become a growing trend since social media are regarded as knowledge management systems in 2010 Haitian earthquake (Couling 2014; Yates and Paquette 2011). Current studies mainly focus on four aspects. First, social media data were applied to early disaster detection and early warning (Sakaki et al. 2010; Abel et al. 2012; Abdelhaq et al. 2013). Each user shared what he/she observed and experienced through social media anytime and anywhere. Social media offered early warning information and sometimes reported breaking news more rapidly and timely than mainstream media (Guan and Chen 2014). Second, researches were looking into the mechanism of online disaster-related individual and group behaviors, such as user motivation, tweet- or retweet-behavior, and various factors on social media behaviors from the perspective of twitter and microblog contents (Correa et al. 2010; Stieglitz and Dang-Xuan 2013; Rudat et al. 2014; Morchid et al. 2014; Pennington-Gray et al. 2013). Third, researches tried to discover the mechanism of online social network and information propagation during disaster (Starbird et al. 2010; Bakshy et al. 2012). As an important information spreading platform, retweet attracted much attention, such as waiting time of a retweet from the perspective of the users, external context, and prediction whether a tweet will be retweeted based on its owner and content (Spiro et al. 2012; Starbird and Palen 2012; Zaman et al. 2010). Fourth, social media also supported emergency decision-making, and served as crowdsourcing platform for disaster relief mission (Gao et al. 2011a, b), emergency preparedness (Merchant et al. 2011), situation awareness (Yin et al. 2012), and disaster response planning through knowledge sharing (Yates and Paquette 2011). These studies show a close relationship between social media and disaster detection, users’ behavior, information propagation, decision-making. However, with twitter data being widely used, Chinese microblog (e.g., Sina Weibo) has been paid relatively less attention in emergency management.

Sina Weibo, one of the most popular Chinese microblogs, is a Twitter-like Chinese microblog (Gao et al. 2012; Chen et al. 2011). Sina Weibo was launched by Sina in August 2009 while Twitter was established in March 2006. Each user can release and receive short messages less than 140 characters by PCs, mobile phones or other media anytime and anywhere. When one user follows other users, he/she can forward and comment on the microblog messages. When many users have heated discussion on a same topic, a hot topic can be established by using a hastag, which is marked by “#”. The open API (an open Application Programming Interface) of Sina Weibo provides an opportunity to capture microblog data.

Sina Weibo has a strong influence on emergency management. Some researchers have started to enhance emergency response using Sina microblog data. Statistical analysis was conducted to explore some characteristics of hot social events based on Sina Weibo data (Guan et al. 2014). The evolving process of collective emotions was recognized using Sina Weibo data analysis (Xiong et al. 2013). A keyword-based semantic analysis was conducted on Sina microblog data to obtain public opinions of online collective behaviors and to support decision-making (Ma et al. 2014). Content, trend and information spreading were analyzed based on Sina microblog data for 2010 Yushu Earthquake (Qu et al. 2011). Qualitative methods are generally employed.

Crowdsourcing has emerged for many years. Crowdsourcing means distributing a query to several Twitter users to aggregate the results and exploit the wisdom-of-crowds effect (Demirbas et al. 2010; Doan et al. 2011). It includes three elements: crowd, initiator and process (Estellés-Arolas and González-Ladrón-de-Guevara 2012). Crowdsourcing is widely used because it is generally believed that the aggregated opinion of many non-experts from the local community with local knowledge will approach the quality of the opinion of an expert (Eskenazi et al. 2013). The wisdom of crowds also supports emergency preparedness and emergency response. Rapid development of social network offers an important platform for the expression of public wisdom and for data mining and information extraction during crisis. Twitter, Ushahidi, and Sahana are some widely used crowdsourcing applications in western countries (Chan 2014). These applications can provide accurate and timely information during crisis, which is the foundation of a disaster relief information system (Gao et al. 2011a, b).

The consequences of the disasters, such as, casualty, destruction of infrastructure systems, loss of properties, and chaos of daily life, challenges to disaster response of various levels of government agencies. Social media are very helpful for information dissemination, situation awareness, early warning, risk assessment, damage evaluation, command and control. During crisis, the affected people use Twitter, Facebook, Microblog to describe situation and keep in touch with others. Each user acts as a moving sensor. A large number of the affected people offer massive information; therefore, social media imply tremendous valuable information to support emergency response. In this study, a two-stage model is proposed to assess risk and damage of a disaster. Degree of risk and degree of damage are constructed as two composite scores to measure risk level before and after disaster. Composite measures can be simplified to two single measures based on microblog data since there is a close relationship between social media data and severity of impact. Degree of risk is used to describe all potential risks when a disaster is coming, while degree of damage describes its impact on local society, infrastructure systems, and mentality of the affected people. They are estimated by an index system. All indices in the index system are extracted from Chinese microblog data collected from Sina Weibo. From the spatial perspective, the degrees of risk and damage are distinguished at different regions. According to the degrees of risk and damage at different regions, risk-map and damage-map can be drawn to support disaster preparedness and response. Typhoon Haiyan (one of the strongest tropical cyclones ever recorded in the Philippines history) is selected as an example to provide an empirical illustration of the new model.

2 Methodology

2.1 Fuzzy comprehensive evaluation

Fuzzy comprehensive evaluation is a quantitative assessment model to solve a complex problem described by a series of indices. Each index is assessed a score and assigned a weight based on its contribution to the problem. It is widely used in the quantitative assessment of many complex systems, such as urban response ability (Tian and Yang 2008) and power system (Zhao et al. 2008). With rapid growth of online collectives, the model is also introduced to unconventional emergency. It focuses on the statistical analysis rather than content analysis on social media data (Lan 2011). This paper concentrates on the semantic analysis on microblog data to extract wisdom of crowds and support disaster response. The framework of research method is shown in Fig. 1. A large number of users express their opinions on Sina Weibo. Based on semantic analysis on the collected data, risk and damage can be evaluated using the index model to support emergency decision-making.

Fig. 1
figure 1

Framework of crowdsourcing for disaster response

2.2 Proposed evaluation model

Public opinion online can facilitate situation awareness and damage assessment. People, both disaster managers and general public express their opinions on social media before, during and after disaster. Government officials stay in the decision-making roles in disaster management. The messages from government officials are removed from the datasets. Messages posted by general public are treated as the crowdsourcing data to collect public opinion. Before disaster, public opinion expresses individuals’ risk perception through their descriptions on disaster, their mentality and activities. For example, if many people in a region express that they bought some emergency goods before typhoon, such as food, water, and radio, it indicates that people in the area are relatively well prepared. During or after disaster, people tend to express the current situation, future perspective, response measures and their mentality to reflect the damage level. For instance, the extent of damage may be high in a place where many people are calling for help or saying lack of food. Therefore, risk and damage can be assessed during crisis using microblog data.

Based on microblog data mining and semantic analysis, a new model is developed to predict risk before disaster and assess damage after disaster. The model is an index system based on microblog data. It is quite different from traditional ones because the indices here are the terms or statements extracted directly from microblog data through semantic analysis. Semantic analysis is a content analysis method to infer the meaning of a message and mine the relation between messages (Ma et al. 2014). For an index system, four aspects are determined: (1) index selection, (2) score of index, (3) weight of index, and (4) evaluation function. These four aspects will be discussed one-by-one as follows.

2.2.1 Index selection

The messages on social media contain a large amount of information. The valuable information during crisis is divided into three categories: disaster description, public demands and mitigation measures. The mentioned information is described and reflected in the disaster-related messages to contribute to risk assessment and damage evaluation.

2.2.1.1 Disaster description

Physical parameters are direct variables to measure the intensity of a disaster. They mainly include disaster intensity (e.g., magnitude and intensity of a disaster, affected people and scope), tendency of intensity, disaster-related location or time, secondary or derivative disasters and destruction of infrastructures. The severity of a disaster is described by general public to represent their risk perception for the disaster or actual damage degree.

2.2.1.2 Public demands

The demands of the general public reflect their risk perception and damage level. The general public talk about their daily routines and what they are experienced on the Internet. Messages convey what they need through the Internet if they cannot obtain direct aids during disaster. The demands often include emergency supplies, mentality, activities and help from government. The demands indicate that their risk awareness and severity of situation surrounding them.

2.2.1.3 Mitigation measures

Mitigation measures are actions to reduce risks and avoid severe damage during disaster. If the general public are well prepared, such as prepared adequate food, water and other supplies, the risk level may be reduced. Some official measures, such as reporting instant news, offering alert information, communicating with the general public, and seeking psychological counseling, can also mitigate risk. The measures and the participation of governments in disaster response are reflected by public opinion. For example, if many people complain, called for help and lived without food, government response is considered as ineffective, and risk and damage levels are high. People also express the aids from governments which mitigate the risk and potential damage. Therefore, mitigation measures include government effective response and public well preparedness.

The new model is developed based on three-category information, as shown in Fig. 2. Three categories are disaster description, public demands and mitigation measures. Each category is composed of several subcategories which are the first-grade indices. The first-grade indices of disaster description include intensity (I 1), tendency of intensity (I 2), key locations or time (I 3), secondary or derivative disasters (I 4) and infrastructure destruction (I 5). Four first-grade indices of public demands are supply demands (I 6), mentality (I 7), activities (I 8) and organizations (I 9). Two first-grade indices of mitigation measures are government measures (I 10) and public preparedness (I 11). There are total eleven first-grade indices in this model. The following analysis is based on these first-grade indices.

Fig. 2
figure 2

Index model of risk prediction/damage assessment

Each index contains many terms which are the second-grade indices, and each term represents a similar meaning. For instance, the infrastructure systems (I 5) include the networks of power, water, gas, communication and road, et al. The ith index can be obtained and shown in Eq. (1).

$${\text{index}}_{i} = \left\{ {{\text{term}}_{i1} ,{\text{term}}_{i2} , \ldots ,{\text{term}}_{ij} , \ldots } \right\}\quad (i = 1,2, \ldots ,11;\;j = 1,2, \ldots ,n)$$
(1)

where index i is the ith index and term ij represents all potential terms contained in index i .

There is an important feature of Chinese language, which is synonym. It means that several words express one meaning. Then, a certain term is just a collection of different statements with similar meaning. For example, the statements “饿 (hungry)”, “没有食物 (no food)” and “肚子叫 (growling)” describe the meaning “缺乏食物 (lack of food)”. In this study, several statements with a high similarity are counted as a same term. The jth term in index i can be obtained and shown in Eq. (2).

$${\text{term}}_{ij} = \left\{ {{\text{statement}}_{ij}^{1} ,{\text{statement}}_{ij}^{2} , \ldots ,{\text{statement}}_{ij}^{k} , \ldots } \right\}\quad (i = 1,2, \ldots ,11;\;j = 1,2, \ldots ,n;\;k = 1,2, \ldots ,m){\kern 1pt}$$
(2)

where \({\text{statement}}_{ij}^{k} {\kern 1pt} {\kern 1pt} (k = 1,2, \ldots ,m)\) represents all statements that occur and have a similar meaning with term ij .

2.2.2 Score

Each statement (third-grade index) is estimated a score based on its contributions to the degree of risk or damage. Different statements reflect different degrees of risk or damage, which is the standard for measuring scores. When a certain emotional statement is referred in many messages, it indicates that many people have resonated. Once the emotion is pessimistic, some irrational behaviors are easy to be triggered and risk may be expanded. The score of this statement is therefore relatively high. Different statements have different contributions to risk and damage levels, and they will be assigned for different scores.

The expert evaluating method is used to estimate the score of each statement. The score of a statement is determined by the disaster experts based on its contribution to disaster risk or damage level. The score range is from −5 to 5. When a statement has opposite contribution to the risk or damage level, its score is assigned to a negative value. For example, the statement “power recovery” decreases the risk or damage level, so its score is a negative value. The larger the absolute value of the score is, the bigger its contribution to the degree of risk or damage is.

2.2.3 Weight

There are two kinds of weight. One is the weight between first-grade index and second-grade (term), which is denoted by w ij . The other is the weight between first-grade index and second-grade (statement), which is denoted by w k ij .

Based on term frequency weight method (Salton and Buckley 1988), the weight of an index can be calculated based on its frequency to a certain degree. For instance, a statement representing a necessity is repeated for many times, which shows the necessity is required by a large scale of people. It causes a relatively higher risk and more severe damage for lack of emergency supplies and poor preparation. Absolute frequency and relative frequency are widely used in Chinese natural language processing. However, absolute frequency is not appropriate since less frequent statements from crowd are not necessarily uncritical and more frequent statements are not always significant. Relative frequency is an alternative method. Term frequency-inverse document frequency (TF-IDF), as a widely used method of relative frequency calculation, characterizes the text better than term frequency (TF) method and is more appropriate to actual situation (Forman 2008; Xie et al. 2015).

TF-IDF method is a typical method of calculating the values of important terms in a document through an inverse proportion of the frequency of the term in a particular document to the proportion of that term over the entire document corpus (Chen and Xiao 2016; Ramos 2003). Equation (3) is the classical formula of TF-IDF used for term weighting.

$$w_{ij} = tf_{ij} \times \log \left( {\frac{N}{{df_{i} }}} \right)$$
(3)

where w ij is the weight for term i in document j, tf ij is the term frequency of term i in document j, N is the total amount of documents in the document corpus and df i is the number of document containing term i in the document corpus.

In this study, a microblog message is a document. Term i is not always contained in only one document (e.g., document j). For a document corpus, document j and other documents may contain term i. A weight shall be calculated for the term in each document containing it. The weight for the term in the document corpus is the average of the weights for the term in all documents containing it. The formula of the weight for a term in document corpus is calculated as Eq. (4).

$$w_{i} = \frac{1}{{df_{i} }} \times \left( {\sum\limits_{{j \in D_{i} }} {tf_{ij} } } \right) \times \log \left( {\frac{N}{{df_{i} }}} \right)$$
(4)

where w i is the weight for term i in document corpus and D i is the collection of documents containing term i.

Terms common in a single or a small group of documents tend to have higher TF-IDF values than common terms. For instance, the term “typhoon”, common in many messages in typhoon situation, will not be given high weight. In addition, some less frequent but important terms are not ignored because of their low df i values. TF-IDF can calculate the weight of a term which is appropriate to actual situation. Therefore, the method of TF-IDF is employed to assign the weight of a statement.

The value of w ij is simply set as unity in the case study for the limitation of data. It means that the importance of all terms (second-grade indices) is considered similar. Several cases have been used to test the method. The results have a relatively high quality and are more appropriate to actual situation. This method is applied to the case study.

2.2.4 Evaluation function

The degree of risk or damage is the total value of all indices. The degree of statement k ij is the result of multiplying its score by its weight. The degrees of term ij , index i are defined in Eq. (5) and Eq. (6), respectively.

$${\text{DTerm}}_{ij} = \sum\limits_{k = 1}^{m} {w_{ij}^{k} S_{ij}^{k} } \quad (i = 1,2, \ldots ,11;\;j = 1,2, \ldots ,n;\;k = 1,2, \ldots ,m)$$
(5)
$${\text{DIndex}}_{i} = \sum\limits_{j = 1}^{n} {w_{ij} {\text{DTerm}}_{ij} } \quad (i = 1,2, \ldots ,11;\;j = 1,2, \ldots ,n)$$
(6)

where DTerm ij is the degree of term ij , S k ij is the score of statement k ij , w k ij is the weight of statement k ij , DIndex i is the degree of index i and w ij is the weight of term ij .

The degree of risk or damage is the sum of the degrees of all indices, and it is defined in Eq. (7).

$${\text{DRisk/DDamage}} = \sum\limits_{i = 1}^{11} {{\text{DIndex}}_{i} } \quad (i = 1,2, \ldots ,11)$$
(7)

where DRisk is the degree of risk and DDamage is the degree of damage.

In conclusion, the degree of risk or damage is defined in Eq. (8).

$${\text{DRisk/DDamage}} = \sum\limits_{i = 1}^{11} {\sum\limits_{j = 1}^{n} {\left( {w_{ij} \sum\limits_{k = 1}^{m} {w_{ij}^{k} S_{ij}^{k} } } \right)} } \quad {\kern 1pt} (i = 1,2, \ldots ,11;\;j = 1,2, \ldots ,n;\;k = 1,2, \ldots ,m)$$
(8)

3 Case study

Recently, natural disasters happen frequently and cause huge losses for local society. On November 7, 2013, Typhoon Haiyan, one of the strongest tropical cyclones ever recorded, landed in Philippines at Guiuan (Eastern Samar). The storm’s maximum sustained wind was at 275 km/h, and the affected population was up to 9.6 million; subsequently, it grazed in Hainan at 16:00 on November 10 with the wind speed of 121 km/h; it landed in Guangxi at 5:00 on November 11 with the speed of 84 km/h. After it made landfall in China, its intensity was weakened. Its track was shown in Fig. 3. Without early situation awareness to support evacuation before it landed in China and effective damage assessment to help rescue after it made landfall, local society suffered heavy losses. Its early risk prediction and effective damage assessment caught wide attention. For this goal, a new model with statement frequency analysis of the most common statements and comparative analysis is proposed to evaluate the degrees of risk and damage based on the disaster-related microblog data.

Fig. 3
figure 3

Path of typhoon Haiyan

3.1 Data collection

Large quantities of microblog messages are collected for the life-cycle response. Sina Weibo is selected as the source to collect disaster-related microblog data. The keyword-based method is applied to capture microblog data by using Weibo API and some web page crawling algorithms. The terms “Haiyan”, “typhoon” and “typhoon Haiyan” are the keywords to collect microblog posts and the period of data collection is from 3 November, 2013 to 15 November, 2013. 3098 pieces of keyword-related microblog messages are collected. The detailed information of the collected data includes author’s name, ID, user’s region, content, post time and URL. However, some may be noise data, such as data keyword-related but not disaster-related or data posted by governments. In order to ensure the validity of the data as far as possible, manual sorting method is also used to filter noise data. All collected data are classified into related and unrelated categories manually. Then each piece of unrelated microblog posts is double-checked in case of wrong selection. Finally, 2539 pieces of related messages are remained with geographic location of the author and post time attached for the following analysis.

Hainan and Guangxi provinces are considered as two of the worst-hit provinces in China. When typhoon Haiyan landed in Philippines and is on its way to China, the number of related microblogs increases greatly in China, as shown in Fig. 4. It grazes in Hainan at 16:00 on November 10 and lands in Guangxi at 5:00 on November 11. Temporally, within the 24 h before and after its landfall, the microblog data are collected to help risk prediction and damage assessment, respectively. For Hainan, the period of data collection for risk prediction is from 16:00 on November 9 to 16:00 on November 10, and it for damage assessment is from 16:00 on November 10 to 16:00 on November 11. Corresponding to Hainan, the period of data collection for risk prediction is from 5:00 on November 10 to 5:00 on November 11, and for damage assessment, it is from 5:00 on November 11 to 5:00 on November 12 in Guangxi.

Fig. 4
figure 4

Trend of number of microblog posts in China in typhoon Haiyan

3.2 Data pre-processing

After collecting microblog messages from Sina Weibo, the collected messages are processed and analyzed, including classification, segmentation, stopwords removing and term frequency analysis.

The collected microblog messages are classified based on the geographic location of the users. A new user registers geographic information in Sina Microblog and it appears in the user’s homepage. Most users register their provinces but not offer detailed address information. Therefore, province is treated as the smallest study area in this study. Microblog messages posted by Hainan and Guangxi citizens are analyzed. The messages posted by Hainan and Guangxi citizens for four specific periods are picked out. Based on the post time, the selected messages are divided into four datasets: (1) before grazing in Hainan Province (16:00 on November 9 to 16:00 on November 10); (2) before landing in Guangxi Province (5:00 on November 10 to 5:00 on November 11); (3) after grazing in Hainan Province (5:00 on November 10 to 5:00 on November 11); (4) after landing in Guangxi Province (5:00 on November 11 to 5:00 on November 12).

Segmentation means dividing a complete sentence into a series of terms or phrases based on some Chinese language rules. In these terms and phrases, some words (called stopwords, e.g., a, the, an) have small significance for the expression of a sentence while their frequency is very high, so this kind of words are removed. Relative frequency analysis is conducted to count the frequency of a term and obtain its relative frequency. The terms are ranked based on their frequency, and the most common terms (e.g., in the top 100 in each dataset) are selected to evaluate the degree of risk or damage.

4 Results

4.1 Semantic analysis

Semantic analysis helps governments know more about what citizens concerned in different areas and at different stages. The concerns of citizens are inconsistent at different areas. Knowing what they need, some targeted measures can be executed to them. Figure 5 provides some of the most common terms in four datasets, and they are before grazing in Hainan (Dataset 1), before landing in Guangxi (Dataset 2), after grazing in Hainan (Dataset 3) and after landing in Guangxi (Dataset 4). Some terms, such as the name of typhoon or places (e.g., Haiyan, Hainan or Gaungxi), are frequently occurred in almost all messages. Their contributions to the degree of risk or damage are negligible, so they are removed when Fig. 5 is drawn.

Fig. 5
figure 5

The most common terms and results of risk and damage degrees. The terms “cry”, “candle”, “shocked” and “disappointed” are four emotions translated into their meanings

4.1.1 Semantic analysis before typhoon landing

Before typhoon Haiyan lands in Hainan, the terms “power outage”, “preparing goods”, “without water”, “hungry”, “call for help” and “alert”, “distance” are frequent in the microblogs posted by Hainan citizens. It indicates that Hainan citizens focus on preparation, necessities, infrastructure systems and alert information. However, Guangxi citizens pays more attention on the key places or time (e.g., “Yingge Town”, “16 o’clock”), wind speed (e.g., “391”, “530”) and their mentality (e.g., “candle”, “disappointed”) before it lands in Guangxi. In addition, with the high frequency of “@News Head”, citizens in both provinces are calling for help from the official organizations. It indicates the important role of official social media in disaster response.

4.1.2 Semantic analysis after typhoon landing

After Typhoon Haiyan grazes in Hainan, the terms “power outage”, “Ledong County”, “Wanquan River”, “rescue” and “without water” are frequent in the messages posted by Hainan citizens, which indicates that the attention of Hainan citizens is paid to the damage of infrastructure systems (e.g., power, water, road, etc.), necessities (e.g., food, water, etc.), affected area (e.g., Wanquan River) and rescue mission, as shown in Fig. 5. However, citizens in Guangxi concern the storm caused by the typhoon, recovery after the typhoon landed and desire to go work or attend class, because the words “rain storm”, “umbrella”, “attend class”, “go to work” and “power recovery” have high frequency. At the same time, the mentality of Guangxi citizens is relatively unstable, because the emotions “cry”, “shocked” and “candle” for praying are frequent in their messages. Based on further comparative analysis, it is clear that Hainan citizens pay more attention to the basic needs while citizens in Guangxi on the storm, recovery and desire for daily activities. The official social media, such as News Head, CCTV, are popular for citizens to call for help after disaster.

4.1.3 Comparison of semantic analysis between before- and after-landing

Before- and after-landing in Hainan, the concerns of citizens are on different aspects. For disaster description, citizens focus on real-time situation and the tendency before it lands in Hainan, while they pay more attention on the affected areas after it makes landfall. For mitigation measures, preparing measures and alert are important before it lands in Hainan while rescue mission is the most important after it makes landfall.

There is difference of citizens’ concerns between before- and after-landing in Guangxi. Before it makes landfall, citizens pay more attention on disaster description, such as typhoon speed, key time or places, while public demands (e.g., go out, go to work) and mitigation measures (e.g., power recovery) are more important after it lands in Guangxi.

4.2 Quantitative analysis

The paper provides a simple but complete demonstration of the system by using only one index in typhoon Haiyan to control the length of the paper and avoid repetition. The detailed illustration of the index model is shown in Table 1. The index-supply demands (I 6)—is chosen as an example to give the detailed information. The information includes the most common terms belonging to the index in the study period, the statements in each term, the score and weight of each statement, and the DTerm corresponding to each term.

Table 1 An example of the index model. The data are from semantic analysis before Typhoon Haiyan grazing in Hainan (Dataset 1)

The degree of the index-supply demands (I 6)—is estimated in Eq. (9), and the result is 3.50.

$${\text{DIndex}}_{6} = \sum\limits_{j = 1}^{n} {w_{ij} {\text{DTerm}}_{ij} } = 3.50$$
(9)

In the case study, the value of w ij is simply set as unity for the limitation of collected data.

4.2.1 Risk prediction before typhoon landing

When typhoon Haiyan is coming, risk analysis is conducted. Based on the index model, the risk degrees at different areas are estimated. Before it grazes in Hainan, the risk degree of Hainan is 18.5, while it is 16.6 before it lands in Guangxi, as the results shown in Fig. 5. It shows the risk of Hainan is higher than that of Guangxi. The reason may be when it enters Guangxi, its intensity is weaker (The wind speed decreases from 121 km/h when Typhoon Haiyan grazes in Hainan to 84 km/h when it lands in Guangxi), so public risk perception is decreased. When governments take measures, more attention is paid to the areas with high risk.

4.2.2 Damage assessment after typhoon landing

After typhoon Haiyan lands in an area, it results in a serious damage to local society and infrastructure systems. All information can be reflected through social media data to help estimate the degree of damage. After it grazes in Hainan, the degree of damage is 60.8, while the degree of damage is 42.6 after it lands in Guangxi, as shown in Fig. 5. Though typhoon Haiyan has landed in Guangxi while just grazes in Hainan, damage in Hainan is much more severe than that in Guangxi. Many measures, such as power recovery, alert and rescue, are taken into action by Hainan governments before and after Typhoon Haiyan grazing in Hainan, which indicates that the risk and damage levels in Hainan are relatively high. According to the official statistics from GDACS, the affected population is 1.1 million when Typhoon Haiyan grazes in Hainan while there are no people affected when it lands in Guangxi. So the assessment results based on the index model are consistent with the losses in actual situation. The index model using disaster-related social media data supports emergency decision-making.

4.2.3 Comparison of quantitative analysis between before- and after-landing

Before typhoon landing, the ratio of risk degrees in these two provinces is 1.12, while the ratio of damage degrees is 1.43 after typhoon landing. It means that the difference between Hainan and Guangxi becomes bigger during the evolution of typhoon Haiyan. Before typhoon landing, citizens cannot accurately know where the typhoon may land, so the difference of two provinces is relatively small. When Haiyan makes landfall, citizens have a knowledge of the situation to a certain degree. Because the damage of Hainan is more severe than that of Guangxi, the difference of damage degrees becomes bigger.

5 Conclusions

Risk prediction is crucial for the evacuation plan, and damage assessment is required for rescue plan in disaster response. This paper proposes a new method for risk prediction and damage assessment. Based on social media data, what people concerned can be obtained from their messages. An index system is developed from three perspectives: disaster description, public demands and mitigation measures. The difference between the new model and traditional ones is that the new index is extracted directly from the disaster-related microblogs. The score of each index is determined by emergency management experts. The weight is calculated based on the TF-IDF method, which is a classical term frequency weight method. With the evaluation function, the degree of risk or damage can be estimated. By this way, once the microblog data are collected, the degree of risk and damage can be evaluated quickly to provide rapid risk prediction and effective damage assessment.

Using typhoon Haiyan as an example, this paper demonstrates the use of the new model. As two worst-hit provinces in China, risk prediction and damage assessment are conducted in Hainan and Guangxi provinces based on social media data. Before typhoon makes landfall, the degree of risk in Hainan is higher than that in Guangxi. After it makes landfall, the degree of damage in Hainan is also higher than that in Guangxi. Through comparison with actual situation, the assessment result based on the index model is consistent with actual situation, which underlines the feasibility of implementation of the new model.