1 Introduction

The USA and its economy are challenged by weather- and climate-related disasters that impart large social and economic costs (Gall et al. 2011; Field et al. 2012; NCA 2014). Consequently, natural disaster cost estimates are referenced by a wide variety of users for varying purposes. However, there are notable differences in the uncertainty surrounding different natural disaster types reflecting the quality of the data available, methodology and assumptions (Kron et al. 2012). For example, in the USA, drought and flooding events have higher potential uncertainty values around their loss estimates due to less coverage of insured assets (Smith and Katz 2013). Conversely, severe local storm events have lower potential uncertainty around their loss estimates due to more complete insurance coverage of wind and hail damage.

Research examining natural disaster costs on social and economic systems is substantial. Example studies include: normalizing disaster loss trends over space and time using population and wealth variables (Downton and Pielke 2005; Pielke et al. 2008; Barthel and Neumayer 2012; Simmons et al. 2013), examining how developing countries and smaller economies often suffer more greatly due to natural disaster impacts (World Bank 2005; Hallegatte and Dumas 2009; IPCC 2014), and exploring how developed countries have more capacity to rebound from natural disasters impacts due to their wealth and financial systems (Rasmussen 2004; Toya and Skidmore 2007; Cavallo and Noy 2009). Other research seeks to quantify total, direct losses (i.e., both insured and uninsured) resulting from specific natural hazard events using independent estimation methodologies (ECLAC 2003; Munich Re 2014; Swiss Re 2014). However, there are few empirical studies that seek to quantify the uncertainty and confidence intervals surrounding natural disaster cost estimates (ex post). To better frame the data limitations associated with natural disaster loss estimates, a range of losses can be evaluated by conducting multiple analyses and varying certain input parameters to which the losses are most sensitive (FEMA 2015). This paper contributes to the literature by examining new approaches for better understanding the uncertainty surrounding three US natural disaster cost estimate case studies, by running Monte Carlo simulations to quantify the 95, 90 and 75 % confidence intervals. This research also performs a sensitivity analysis for one of the case studies examining which input data variables and assumptions are the most sensitive and contribute most to the overall cost uncertainty of the estimate.

The foundation for this research is the US billion-dollar weather and climate disaster report developed by the National Oceanic and Atmospheric Administration’s National Climatic Data Center. This analysis quantifies the loss from numerous weather and climate disasters including tropical cyclones, floods, drought and heat waves, severe local storms (i.e., tornado, hail, straight-line wind damage), wildfires, crop freeze events and winter storms (NCDC 2014). These loss estimates reflect direct effects of weather and climate events (i.e., not including indirect effects) and constitute total losses (i.e., both insured and uninsured). The insured and uninsured direct loss components include physical damage to residential, commercial and government/municipal buildings, material assets within a building, time element losses (i.e., businesses interruption), vehicles, boats, offshore energy platforms, public infrastructure (i.e., roads, bridges, buildings) and agricultural assets (i.e., crops, livestock, timber). These loss assessments do not take into account losses to natural capital/assets, healthcare-related losses, or values associated with loss of life. Only weather and climate disasters which cause losses of ≥1 billion dollars in calculated damage including Consumer Price Index (CPI) inflation adjustment are included in this dataset (Table 1). While this threshold is arbitrary, these billion-dollar events account for roughly 80 % of the total US losses for all combined severe weather and climate events (Munich Re 2012; NCDC 2014).

Table 1 Damage cost statistics from US billion-dollar disaster events (1980–2013) reflecting number of events, event frequency, CPI-adjusted loss (present year), percent of total losses and average event cost

These natural disaster cost assessments require input from a variety of public and private data sources including the Insurance Services Office (ISO) Property Claim Services (PCS), Federal Emergency Management Agency (FEMA), National Flood Insurance Program (NFIP) and Presidential Disaster Declaration (PDD) assistance, and the United States Department of Agriculture (USDA), National Agricultural Statistics Service (NASS) and Risk Management Agency (RMA), among others. Each of these data sources provides unique information as part of the overall disaster loss assessment. Previous research analyzed the suitability of these data sources including trends, data accuracy and potential biases (Smith and Katz 2013) and found the US billion-dollar disaster estimates had a consistent underestimation bias of roughly 10–15 %. This bias was corrected during a reanalysis of the disaster event loss data to reflect new loss totals (NCDC 2014). However, there are still uncertainty envelopes encompassing these reanalyzed disaster loss estimates, which this research will offer new approaches for quantifying.

An outline of the paper is as follows. Primary insurance loss data and assumptions of natural disaster loss estimates are described in Sect. 2. Next, a Monte Carlo simulation method for estimating uncertainty surrounding disaster loss estimates, focusing on specific disaster examples, is presented in Sect. 3. A sensitivity analysis examining which data and assumptions are most important within one of the three disaster case studies is examined in Sect. 4. Finally, Sect. 5 contains a discussion and conclusions on the uncertainty and sensitivity analysis results, to improve the US billion-dollar disaster cost analysis.

2 Primary insurance loss data and assumptions

A number of US insurance participation surveys have been performed over the last several decades for residential, automotive and commercial lines of insurance. The following paragraphs discuss the data from these surveys and how it informs our methodology to estimate the total, direct loss for a natural disaster event. One central adjustment is that insured loss payment data ($) are inflated by a factor representing the reciprocal of the insurance market participation rate for a specific type of insurance for each impacted state. Also, we test our disaster cost methodology calculations using a Monte Carlo analysis, which perturbs each data input value, to estimate 95, 90 and 75 % confidence intervals surrounding the overall loss estimate.

2.1 Residential insurance

Annual surveys on residential property insurance participation such as the Census American Housing Survey (1980–present), the Insurance Information Institute survey (2011–present) and All-Industry Research Advisory Council (1981) have indicated on average that >90 % of US homeowners have multi-peril property insurance for their residence and contents (Table 2). This coverage includes cost reimbursement from damage due to wind, hail, lightning, snow and ice, among others, but does not include coverage for inland or coastal flood damage (discussed in Sect. 2.4).

Table 2 American Housing Survey percentage of US households carrying residential property insurance

Research also shows that residences are often underinsured. In 2013, 60 % of homes were underinsured by an average of 17 %; in 2012, 61 % of homes were underinsured by an average of 18 % (Marshall & Swift/Boeckh 2013). Scaling these statistics to represent the full housing stock implies that homeowners are underinsured by an average of ~10 %. Therefore, in addition to the 10 % who are also uninsured, we assume that 80–90 % of home losses are covered by insurance policies. The remaining 10–20 % represent uninsured and underinsured property assets.

It is also important to note that wind and water insurance deductibles average about 10 % of all paid insurance claims (ISO Property Claims Service 2014; NFIP 2014) while insurance fraud payments represent about 10 % of total property insurance payments (National Insurance Crime Bureau 2014; Insurance Information Institute 2014). Therefore, we assume that these two effects are offsetting (i.e., +10 % for deductibles, −10 % for fraud, with respect to total insurance payments) and viewed as tangential to our core analysis since fraud and deductible loss data are not consistently available.

2.2 Automotive insurance

Annual US uninsured motorist surveys (1989–2012) by the Insurance Research Council (IRC 2014) have also found a relatively stable percentage ranging from 80 to 90 % of automobiles insured (Table 3). Across the USA, the estimated percentage of insured motorists has increased in recent years (IRC 2014). In 2001, the average of insured motorists across all states was 85.8 %, which increased to 87.8 % in 2011 (IRC 2014). Given the variability of automobile insurance over time and space, we assume that 80–90 % of automobiles are insured. The remaining 10–20 % represent uninsured and underinsured property assets.

Table 3 Percent of uninsured motorists by year averaged across the USA

2.3 Commercial insurance

Understanding damage to US businesses is more complex. For example, a survey by the National Association of Insurance Commissioners (NAIC 2007) found that over 90 % of the small businesses surveyed have property insurance. The percentage of companies with property insurance increases with the size of a company, which may reflect better risk management. However, many businesses lack business interruption insurance which is a large cost driver in the weeks and months following natural disaster events. For example, 32 % of firms with annual revenue <$1 million had such coverage versus 48 % of higher-revenue firms (Table 4). This is further validated by reports, indicating that 20–40 % of small businesses that close after a major natural disaster never reopen their business (IBHS 2007; Travelers 2014; III 2014). This can be due to a variety of reasons such as interruption of critical supplies and product distribution, power outages or other utility failures, loss of customer base and critical data, restricted or blocked access, and employees unable to report to work. Also, like homeowners, it was found that 75 % of commercial buildings (i.e., physical property) are underinsured by an average of 40 % (Marshall & Swift/Boeckh 2011; Travelers 2014). Marshall & Swift/Boeckh (MSB) has data for 2600 locations across the country and compares the information it has collected with actual reconstruction costs derived from its insurer clients’ claims experience and adjusts as appropriate. Accounting for this variability and uncertainty surrounding different forms of commercial insurance, we estimate that 40–60 % of natural disaster-induced business losses are covered by insurance.

Table 4 Study by NAIC (2007) found that commercial property, auto and business interruption insurance coverage varies by business size

In summary, we apply the 80–90 % range to both residential and automotive PCS loss data and a 40–60 % to commercial losses. We now take this research a step further to investigate the range of possible error by using a Monte Carlo analysis perturbing each of the data input values by ±3 and ±5 % (i.e., reflecting the insurance survey data uncertainty on which assumptions are based). These values were chosen based on regulatory audits that routinely confirm the reliability and accuracy of ISO/PCS estimates, finding that final adjusted PCS estimates are within 5 % accuracy (Kerney, 2010). Exploring different error levels allows us to better understand how the assumed insurance coverages affect the 95, 90 and 75 % confidence intervals surrounding our total loss estimate.

2.4 NFIP flood insurance

We also have data on flood insurance participation, which factors into our disaster loss analysis. Residential and commercial flood insurance is most widely provided by FEMA’s National Flood Insurance Program. Mortgage lenders require any residence within FEMA’s Special Flood Hazard Areas (SFHAs) to purchase flood insurance. The SFHAs are commonly referenced as those within the 100-year flood plain boundaries. However, the enforcement and participation are not uniform, and many at-risk properties do not have proper insurance coverage for flood or storm surge-related damage (Table 5). Annual polls on NFIP participation percentage by region show some consistency on a large scale across the Northeast, Midwest and Western regions. Other studies have found the NFIP policy participation across the USA is higher (26 %) for eligible parcels (PricewaterhouseCoopers 1999). However, these are still inadequately low participation rates, leading to higher flood cost uncertainty.

Table 5 Insurance Information Institute annual survey on NFIP flood insurance participation percentage by region for all households

There is also a spatial bias in flood insurance policy coverage as NFIP participation is 16 % in communities with 500 or fewer homes in the SFHA, 56 % in communities with 501 to 5000 homes in the SFHA, and 66 % in communities with >5000 homes in the SFHA zone (Dixon et al. 2006). Also, the same research found that the chances of purchasing flood insurance are higher for SFHA communities subject to coastal flooding/storm surge (63 %) compared to communities more at risk to riverine flooding (35 %). One additional factor is that flood insurance coverage outside the high-risk flood areas (SFHAs) is very low (<10 %). Yet, NFIP data show that 25 % of all flood insurance claims come from the low-to-moderate-risk areas beyond the 100-year floodplain, which are largely uninsured losses (FEMA 2014). There are also NFIP coverage limits for residential ($250k structure, $100k contents) and commercial ($500k structure, $500k contents) properties (FEMA 2014). For these varied reasons, we have defined NFIP policy participation for the Monte Carlo uncertainty analysis as 10–25 % for inland states and 25–50 % for coastal states.

2.5 USDA crop insurance

The USDA and associated private crop insurance programs represent over 2 million crop insurance policies across all states (USDA 2014). USDA data show that on average across all states, 70 % of eligible acres are insured and most producers select 70 % of crop yield to be covered (USDA 2012). Therefore, we approximate the total crop loss by applying a factor to the crop insurance claims data, that is, 100 % all possible crops/[(x % insured) × (50–75 % yield coverage)] = multiplier value(s) for estimating total crop damage costs. The yield coverage is important as crop insurance is paid only after the crop loss has surpassed the selected yield coverage. For example, if a producer selected 70 % coverage, the crop producer must first cover the first 30 % of crop loss. This is effectively like a deductible, but paid to no one—just an absorbed cost.

In addition to crop losses, we also incorporate the total livestock feeding cost (i.e., corn and hay for cattle) when it exceeds the 5-year national average for feedstock (i.e., dollars/per ton). Drought can limit the availability of corn and hay feed stocks, which increases the cost, forcing ranchers to sell off more cattle than they would during a non-drought year (e.g., increasing long-term meat production costs). Comparing the 5-year national versus state feedstock costs against those during a severe drought year offers a useful comparison.

2.6 Framing uncertainty

In summary, we have set up our disaster cost uncertainty analysis using the following insurance participation ranges for the each line of insurance (Table 6). The next section discusses the output from Monte Carlo simulations to determine the 95, 90 and 75 % confidence intervals surrounding our total loss estimate for the following selected US disaster events:

Table 6 Insurance participation rate ranges used in the Monte Carlo simulation
  • the historic US drought (2012),

  • the southeast tornado super-outbreak (late April 2011)

  • and Hurricane Ike (2008).

3 Method for estimating uncertainty surrounding natural disaster loss estimates

A Monte Carlo approach is employed to assess the uncertainty of disaster loss estimates. For each of the three events detailed below, the parameters of the equations as outlined in Smith and Katz (2013) were perturbed to produce a distribution of loss estimates. There are two general categories of input parameters: (1) insurance participation rates and (2) loss values. The defined possible ranges for insurance participation rates were justified in Sect. 2. We assumed either a ±3 or 5 % error for the loss values. For final ISO/PCS insurance loss results, a 3 % bound of uncertainty should be adequate. Regulatory audits routinely confirm the reliability and accuracy of PCS estimates. Historically, after such regulatory data audits, the final adjusted estimate has differed by at most 5 % (Booz Allen Hamilton personal communication 2013). We also assumed that the parameters were either uniformly or normally distributed within the ranges. Therefore, there are four cases examined for each of the following events:

  • ±3 % error on loss values with uniformly distributed parameters,

  • ±5 % error on loss values with uniformly distributed parameters,

  • ±3 % error on loss value with normally distributed parameters and

  • ±5 % error on loss value with normally distributed parameters.

For each case, 10,000 simulations were run by random draws within the defined ranges with the defined distributions. Confidence intervals were calculated by sorting the 10,000 loss estimates in ascending order and then using a percentile method to define the confidence region. For example, to construct the 95 % confidence region, the 250th and 9750th predictions were selected as the lower and upper bounds, respectively.

3.1 US drought

Our first example is the 2012 drought, which resulted in the most extensive drought impacts to affect the USA in decades (NCDC 2014). Moderate-to-extreme drought conditions affected more than half the country for the majority of 2012. The most costly drought impacts occurred across the central agriculture states, resulting in widespread harvest failure for corn, sorghum and soybean crops, among others. Using the USDA crop insurance and feed cost data with the disaster cost methodology described in (Smith and Katz 2013), the 2012 US drought cost to agriculture was estimated to be $30.0 billion (NCDC 2014).

To verify the uncertainty surrounding this estimate, this research examines four different cases of Monte Carlo simulation for this event. This was performed using a USDA ‘percent of eligible acres insured,’ as a separate crop value metric from the USDA ‘percentage of insurable crop value’ used to calculate the cost of the 2012 US drought, as described in Sect. 2. In all cases, the following loss values used are summarized in Table 7 and the range of crop insurance participation rates (p crop_yield_cov) are referenced in Table 6.

Table 7 USDA crop indemnity (loss) payout information for each state due to the combined effects of drought and heat in 2012

The final loss estimate was calculated by:

$${\text{Total}} = \mathop \sum \limits_{{i = {\text{all}}\;{\text{states}}}} \left[ {m_{i} v_{i} } \right] + v_{\text{incFeedCost}}$$

where

$$m_{i} = \frac{100}{{p_{{{\text{acres}}\_{\text{insured}}}} \times p_{{{\text{crop}}_{{{\text{yield}}_{\text{cov}} }} }} }}$$

The results of the Monte Carlo analysis are provided (Table 8). We see that the estimated $30 billion total, direct cost of the 2012 US drought (NCDC 2014) is reasonably close to the mean and median of the Monte Carlo simulation. Figures 1a, b and 2a, b present the histograms of the 10,000 estimates for each case, along with confidence interval bounds.

Table 8 Results (in $ million) of the four cases analyzing the total cost of the 2012 US drought on crops and livestock (in original year dollars)
Fig. 1
figure 1

a Case 1 results: 3 % perturbation on values, parameters normally distributed. b Case 2 results: 3 % perturbation on values, parameters uniformly distributed

Fig. 2
figure 2

a Case 3 results: 5 % perturbation on values, parameters normally distributed. b Case 4 results: 5 % perturbation on values, parameters uniformly distributed

A histogram of multiplier (m i ) values from a selection of states (i.e., Illinois, New York and North Dakota) affected by the 2012 US drought is shown in Fig. 3. Here we see the range of possible multiplier values applied to the state insurance payout amount. Given the inverse relationship of the multiplier to the percent of acres insured, we see that states with smaller percentages of acres insured (i.e., New York) will have larger associated multiplier values.

Fig. 3
figure 3

Histograms of multiplier values (m i ) for several states affected by the 2012 US drought disaster. All of these are from Case 3 (±5 % error, normally distributed parameters). Illinois had the largest insurance payout (top left). New York had the smallest percent of eligible acres insured (top right). North Dakota had the largest percent of eligible acres insured (bottom)

3.2 April 25–28, 2011, tornado outbreak

Our second example examines a historic tornado outbreak across numerous central and southern states in late April 2011. Several major metropolitan areas were directly impacted by strong tornadoes including Tuscaloosa, Birmingham and Huntsville in Alabama and Chattanooga, Tennessee, causing the estimated damage costs to soar. The total, direct cost for this event was estimated to be approximately $9.8 billion (NCDC 2014). To verify the uncertainty surrounding this estimate, this research examines four different cases of Monte Carlo simulation for this event. In all cases, the following loss values used are summarized in Table 9, indicating the data values for each state and insurance claim type used in the total, direct loss calculation.

Table 9 Loss values from the April 25–28, 2011, Tornado Outbreak

The final loss estimate was calculated by:

$$\begin{aligned} {\text{Total}}\;{\text{loss}} = & \mathop \sum \limits_{{i = {\text{all}}\;{\text{states}}}} \left[ {m_{{i,{\text{PCS}}_{\text{comm}} }} v_{{j,{\text{PCS}}_{\text{comm}} }} + m_{{i,{\text{PCS}}_{\text{res}} }} v_{{j,{\text{PCS}}_{\text{res}} }} + m_{{i,{\text{PCS}}_{\text{auto}} }} v_{{j,{\text{PCS}}_{\text{auto}} }} } \right. \\ \left. {\quad +\,v_{{i,{\text{FEMA}}_{\text{PDD}} }} \times \left( {v_{{i,{\text{FEMA}}_{\text{PDD}} }} > m_{{i,{\text{PCS}}\_{\text{total}}}} v_{{i,{\text{PCS}}_{\text{total}} }} } \right)} \right] \\ \end{aligned}$$

where \(m_{{i,{\text{PCS}}_{\text{comm}} }}\) is the multiplier for commercial PCS, \(m_{{i,{\text{PCS}}_{\text{res}} }}\) is the multiplier for residential PCS, \(m_{{i,{\text{PCS}}_{\text{auto}} }}\) is the multiplier for automotive PCS, and these multipliers are defined as:

$${\text{Multiplier}} = \frac{100}{{{\text{Insurance}}\;{\text{participation}} \;{\text{rate}}}}$$

For the comparison part of the equation, we define:

$$\begin{aligned} v_{{i,{\text{PCS}}\_{\text{total}}}} = \, \,& v_{{i,{\text{PCS}}\_{\text{comm}}}} + v_{{i,{\text{PCS}}\_{\text{res}}}} + v_{{i,{\text{PCS}}\_{\text{auto}}}} \;({\text{NOTE}}:{\text{no}}\;{\text{multipliers}}\;{\text{involved}}) \\ m_{{i,{\text{PCS}}_{\text{total}} }} = \,\, & \frac{{m_{{i,{\text{PCS}}_{\text{comm}} }} v_{{j,{\text{PCS}}_{\text{comm}} }} + m_{{i,{\text{PCS}}_{\text{res}} }} v_{{j,{\text{PCS}}_{\text{res}} }} + m_{{i,{\text{PCS}}_{\text{auto}} }} v_{{j,{\text{PCS}}_{\text{auto}} }} }}{{v_{{i,{\text{PCS}}\_{\text{total}}}} }} - 1 \\ \end{aligned}$$

The results of the Monte Carlo analysis are provided (Table 10). Table 6 contains the insurance participation rates. We see that the estimated $9.8 billion total, direct cost of the April 25–28 tornado outbreak (NCDC 2014) is reasonably close to the mean and median of the Monte Carlo simulation. Figures 4a, b and 5a, b present the histograms of the 10,000 estimates for each case, along with confidence interval bounds.

Table 10 Results (in $ million) of the four cases analyzing the April 2011 tornado outbreak total cost
Fig. 4
figure 4

a Case 1 results: 3 % perturbation on values, parameters normally distributed. b Case 2 results: 3 % perturbation on values, parameters uniformly distributed

Fig. 5
figure 5

a Case 3 results: 5 % perturbation on values, parameters normally distributed. b Case 4 results: 5 % perturbation on values, parameters uniformly distributed

3.3 September 2008: Hurricane Ike

In September 2008, Hurricane Ike caused extensive storm surge and wind damage in Texas. There was also caused considerable wind and flood damage across many other coastal and inland states (i.e., Louisiana, Arkansas, Tennessee, Illinois, Indiana, Kentucky, Missouri, Ohio and Pennsylvania). Severe gasoline shortages occurred in the southeast USA due to damaged oil platforms, storage tanks, pipelines and off-line refineries. After examining the loss data sources using our tropical cyclone cost methodology, it was determined that the total, direct cost of Ike was approximately $30.0 billion (NCDC 2014). To verify the uncertainty surrounding this estimate, this research examines four different cases of Monte Carlo simulation for this event. In all cases, the following loss values used are summarized in Table 11. Table 6 contains the insurance participation rates. Note that we consider coastal and inland states separately when including a multiplier to the FEMA NFIP loss values. Coastal states for this example are Alabama, Louisiana and Texas. The inland states are Illinois, Indiana and Missouri.

Table 11 Loss values from the Hurricane Ike disaster

The final loss estimate was calculated by:

$$\begin{aligned} {\text{Total}}\;{\text{loss}} = & \mathop \sum \limits_{{i = {\text{all}}\;{\text{states}}}} \left[ {m_{{i,{\text{PCS}}_{\text{comm}} }} v_{{j,{\text{PCS}}_{\text{comm}} }} + m_{{i,{\text{PCS}}_{\text{res}} }} v_{{j,{\text{PCS}}_{\text{res}} }} + m_{{i,{\text{PCS}}_{\text{auto}} }} v_{{j,{\text{PCS}}_{\text{auto}} }} + m_{{i,{\text{FEMA}}_{\text{NFIP}} }} v_{{i,{\text{FEMA}}_{\text{NFIP}} }} } \right. \\ \left. {\quad + v_{{i,{\text{FEMA}}_{\text{PDD}} }} \times \left( {v_{{i,{\text{FEMA}}_{\text{PDD}} }} > v_{{i,{\text{PCS}}_{\text{total}} }} } \right)} \right]\quad + v_{\text{state}} + v_{\text{USDA}} \\ \end{aligned}$$

where \(v_{{i,{\text{PCS}}\_{\text{total}}}} = v_{{i,{\text{PCS}}\_{\text{comm}}}} + v_{{i,{\text{PCS}}\_{\text{res}}}} + v_{{i,{\text{PCS}}\_{\text{auto}}}}\) (NOTE: no multipliers involved), \(m_{{i,{\text{PCS}}_{\text{comm}} }}\) is the multiplier for commercial PCS, \(m_{{i,{\text{PCS}}_{\text{res}} }}\) is the multiplier for residential PCS, \(m_{{i,{\text{PCS}}_{\text{auto}} }}\) is the multiplier for automotive PCS, \(m_{{i,{\text{FEMA}}_{\text{NFIP}} }}\) is the multiplier for FEMA NFIP and

$${\text{Multiplier}} = \frac{100}{{{\text{Insurance}}\;{\text{participation}}\;{\text{rate}}}}$$

The results of the Monte Carlo analysis are provided (Table 12). We see that the estimated $30 billion total, direct cost of Hurricane Ike is remarkably close to the mean and median of the Monte Carlo simulation. Figures 6a, b and 7a, b present the histograms of the 10,000 estimates for each case, along with confidence interval bounds.

Table 12 Results (in $ million) of the four cases analyzing the total cost of Hurricane Ike
Fig. 6
figure 6

a Case 1 results: 3 % perturbation on values, parameters normally distributed. b Case 2 results: 3 % perturbation on values, parameters uniformly distributed

Fig. 7
figure 7

a Case 3 results: 5 % perturbation on values, parameters normally distributed. b Case 4 results: 5 % perturbation on values, parameters uniformly distributed

4 Effects of sensitivity analysis on overall cost estimate

For each of the total, direct loss estimates for natural disaster events, there are a number of input parameters, each with an associated degree of uncertainty. We wanted to explore how the estimates respond to perturbations in the input parameters, so we performed a sensitivity analysis for a single case study. The following analysis is one-dimensional, that is, perturbing only one parameter at a time. We began with a base set of parameters θ, taken to be the mean of the ranges defined in the examples of Sect. 3. The associated loss estimate f(θ) was calculated based on this set of parameters. Then, we applied a 1 % perturbation to each parameter forming a new set of parameters θ i , for i = 1, … n, where n is the number of input parameters. Then, the associated loss estimates f(θ i ) were calculated. Finally, in order to compare sensitivities fairly, they were normalized by the magnitude of the values. The normalized sensitivity of f to the ith parameter, s i , can be written

$$s_{i} = \frac{{{\text{d}}f(\theta )}}{{{\text{d}}\theta }} = \left( {\frac{{f\left( {\theta_{i} } \right) - f(\theta )}}{{\theta_{i} - \theta }}} \right) \times \left( {\frac{\theta }{f(\theta )}} \right)$$

Taking the Hurricane Ike disaster as an example, we explored the sensitivity of the total loss estimate to each of the input parameters. In all, there are 76 inputs for this calculation. The results are presented in Fig. 8. We see that the most sensitive parameters for the Hurricane Ike estimate are the PCS commercial insured loss value and associated multiplier for Texas. We may interpret this to mean that a change in the PCS commercial value or multiplier for Texas would impact the total loss estimate of Hurricane Ike more than changing any other parameter. Therefore, if the goal is to reduce the uncertainty of the total loss estimate for this Hurricane Ike example, the best way forward is to reduce the uncertainty in the PCS commercial value and the PCS commercial insurance participation rate for Texas, and then continue with subsequent parameters in order of descending sensitivity.

Fig. 8
figure 8

Normalized sensitivity analysis of damage totals caused by Hurricane Ike by state and loss data category

5 Discussion and conclusion

The Monte Carlo simulations for all three of the natural disaster case studies provide additional confidence in our total, direct loss estimates (NCDC 2014) for these events since the estimates are within the confidence limits and close to the mean and median of the Monte Carlo simulations for each disaster example. The reason that some differences are present is because the total, direct loss estimates for each event, as reported in NCDC (2014), use a slightly different approach to loss calculation. Whereas in NCDC (2014) the PCS loss component values are grouped together, in this analysis, we treat the residential, commercial and automotive category losses separately. Here we are leveraging the information gathered through the survey review, as described in Sect. 2, to incorporate more granular insurance multiplier values than those used previously as well. In this analysis, we defined ranges for insurance participation and subsequently used these to calculate the associated multipliers.

The confidence intervals are of value because they provide greater clarity to the quality of the assessment. For each of the three case studies, we examined the impact of two main assumptions, the error present in the loss values and the distribution of the possible values. For the April 25–28, 2011, tornado outbreak and Hurricane Ike, the impact of assuming 3 or 5 % error on loss values was negligible according to hypothesis testing to determine whether the samples were from the same distribution (using Kolmogorov–Smirnov, Wilcoxon rank-sum and Wilcoxon signed-rank tests). However, these same hypothesis tests indicate that the choice of a normal or a uniform distribution of loss values was more important. This is especially noticeable when comparing histograms, where the simulations with the normal assumption tend to have longer upper tails than the simulations with the uniform assumption. Therefore, in the interest of being more conservative in the choice of confidence bounds, we recommend the confidence intervals as derived from assuming 5 % error on uniformly distributed loss values.

The 2012 US drought example had different results from the hypothesis testing. Here, the impact of both assumptions was important. The cases assuming uniformly distributed values resulted in wider and hence more conservative, confidence regions. Increasing the assumed error on loss values from 3 to 5 % also caused the confidence regions to enlarge and additionally caused a positive shift. Although the confidence regions from the four cases are not perfectly nested, all things considered assuming 5 % error on uniformly distributed loss values provides the most conservative confidence region estimate.

We also performed a sensitivity analysis examining how the separate loss variables contribute to the overall loss total for a specific case study. It is not surprising that when examining the Hurricane Ike disaster we found that the Texas insurance cost variables had the most sensitivity, as Texas experienced the largest loss values for all data categories: residential, automotive, commercial and NFIP flood-related losses. The offshore energy losses were also quite sensitive to the overall cost estimate. The total loss estimate for Hurricane Ike used 76 input parameters, each of which has some uncertainty which contributes to the uncertainty in the total loss estimate. The intent of the sensitivity analysis described in Sect. 4 was to isolate the most important parameters to focus on reducing the uncertainty, which will in turn have the greatest impact in reducing the uncertainty in the total loss estimate.

Although we explore Monte Carlo simulations for just several disaster examples, we seek to apply this methodology to all of the US billion-dollar weather and climate event loss estimates (NCDC 2014) to provide better context regarding disaster cost uncertainty. This research is a next step to enhance the value and usability of estimated disaster costs given data limitations and inherent complexities.