1 Introduction

Economic activities are frequently interrupted by natural disasters, like the 2005 Hurricane Katrina and severe winter storms in recent years. While most existing economic studies examine the impact of disasters on the national economy (see Cavallo and Noy (2011) and Kousky (2014) for a review), more and more researches start looking at regional economic impacts. In this study, we focus on the impact on per capita income. In this regard, most existing studies find that natural disasters have a significant negative impact in the short-run (Strobl 2011; Xiao 2011; Coffman and Noy 2012; Anttila-Hughes and Hsiang 2013).Footnote 1 However, the long-run impact on regional income is rarely discussed except in Coffman and Noy (2012) and Xiao (2011).

Xiao (2011) examines impacts of the 1993 Midwest flood on local economies and finds that per capita income drops significantly in the year of the event. In the long run, the impact on per capita income is negative but not significant.Footnote 2 Coffman and Noy (2012) apply the synthetic control method to examine the economic impact of hurricane Iniki on the Hawaiian island of Kauai. They find that hurricane Iniki has mainly affected population. It significantly reduces per capita income in the short run but not in long run. The significant short-run impacts and the insignificant long-run impacts seem to suggest that policy makers should focus on the short-run instead of the long-run.

Most existing regional studies on disaster impacts focus on one particular type of natural disasters, such as hurricanes (Anttila-Hughes and Hsiang 2013; Belasen and Dai 2014; Coffman and Noy 2012; Deryugina 2013; Strobl 2011), flooding (Husby et al. 2014; Xiao 2011) and wildfires (Nielsen-Pincus et al. 2014). The exception is Vu and Hammes (2010). They investigate impacts of multiple natural disasters on annual output and output growth over the period 1995–2007 in China. One concern about focusing on one particular type of disasters is the neglect of disaster incidences of other types, which raises a question about the validity of the control group. Should counties stricken by tornados but not by flooding be included in the control group in a study of flooding?

To address this concern, we investigate the disaster impact on per capita income considering multiple types of disasters occurred in the Contiguous US from 1990 to 2012. After controlling for the potentially long-lasting impact from previous disasters, we construct counterfactual control groups for counties hit by large natural disasters. We collapse the time series data into pre- and post-event periods to produce consistent standard errors (Bertrand et al. 2004). After controlling for the potential serial correlation problem in the difference-in-differences model, we find that comparing with the neighboring counties, the per capita income in counties directly stricken by large natural disasters are significantly reduced in the long-run.

The remainder of the paper is organized as follows. Section 2 describes the econometric model. Section 3 describes the data and the assignment of the treatment and control groups. Section 4 presents results and discusses robustness checks and  Sect. 5 concludes the paper.

2 Econometric models

It is necessary to clarify notations and terminologies used in this paper in order to facilitate the discussion. When a large natural disasterFootnote 3 occurs, we say a treatment event has occurred. Because the dataset includes disasters occurred in multiple years, it is important to differentiate two related but different concepts: year and period. In this study, the year that the treatment event occurs is called the treatment year (yr). The year preceding the treatment year (yr − 1) is referred to as the pre-event period. In order to investigate whether a disaster affects county per capita income j years after the treatment event, we define j as the impact duration, and call the jth year after the treatment year as the post-event period. For any given disaster, the post-event period refers to a different calendar year if the impact duration is different. For example, if a disaster occurred in 1997, the treatment year is 1997. If the impact duration of interest is 10 years (j = 10), the corresponding post-event period refers to the calendar year 2007. According to Cavallo et al. (2013), any impact observed with impact duration greater (less) than 5 years is considered as the long (short)-run impact. To avoid confusion, in the rest of the paper, we use “year” only for the calendar year and “period” only for the pre- or post-event period.

Because the serial correlation in the economic outcomes may underestimate the standard deviation of the estimators, and overestimate significance levels, we collapse the times series data into the pre- and post-event periods as suggested by Bertrand et al. (2004). For a given impact duration j, let y it  be the real per capita income for county i in period t. The regression model is:

$$\ln y_{it} = \beta_{1} {\text{DID}}_{it} + \beta_{2} X_{it} + \beta_{3} {\text{Period}}_{t} + c_{i} + u_{it} \quad {\text{for}}\quad t = 0\;{\text{or}}\;1$$
(1)

where DID it is a dummy variable that equals one for counties in the treatment group in the post-event period, and zero otherwise. X it is a vector of weather variables including annual precipitation and mean temperature, as well as their squared terms to control for nonlinear effects. c i is the county fixed effect used to control for unobserved county characteristics.Footnote 4 Period t is the time fixed effects to capture common changes from the pre-event period to the post-event period. u it is the idiosyncratic error. β 1 estimates the mean income difference between the constructed treatment and control groups due to the incidences of large disasters.

3 Data

3.1 Data description

The data on natural disasters come from the Spatial Hazard Events and Losses Database for the US (SHELDUS™ Version 12.0), which is maintained by the Hazards and Vulnerability Research Institute (HVRI) at the University of South Carolina.Footnote 5 SHELDUS is a county-level dataset covering 18 types of natural disasters including avalanche, coastal, drought, earthquake, flooding, fog, hail, heat, hurricane/tropical storm, landslide, lightning, severe storm/thunder storm, tornado, tsunami/seiche, volcano, wildfire, wind, and winter weather. This dataset covers natural disasters occurred from January 1960 to present.

In this research, the direct economic damage is measured by the value of property damage. However, property damages are not directly comparable across regions due to the regional differences in the economic development and living costs. For instance, the property damage from the destruction of a small house in Manhattan, New York, can be very different from the destruction of the same house in a small rural county. To address these differences across regions, we use the concept of relative property damage,Footnote 6 which is defined as the property damage divided by the per capita income in the pre-event period. It is used to measure the property damage as a multiple of average county income. For example, a relative damage value of ten means that the property damage is equivalent to ten times the county per capita income in the pre-event period. The use of relative damage is similar to the normalization using country GDP in Cavallo et al. (2013).Footnote 7 It also allows us to compare direct damages from disasters of different types.Footnote 8

Figure 1 shows the mean and moving average of the relative damage from 1969 to 2012. It is evident that disasters with large relative damages happen more frequently after 1990, suggesting a fatter upper tail in the distribution of relative damage. This is similar to the observation in Misiewicz (2011). Because of this structural change, we only consider disasters that occurred after 1990. The spatial distribution of relative damages in 2005 is plotted in Fig. 2. Most counties with the highest relative damage are located in Louisiana, Mississippi, Alabama and Florida where Hurricane Katrina occurred.

Fig. 1
figure 1

Temporal distribution of disaster relative damages from 1969 to 2012

Fig. 2
figure 2

Spatial distribution of disaster relative damage in 2005

Annual data on county per capita income are available for the period between 1969 and 2012 from the US Bureau of Economic Analysis (BEA).Footnote 9 Nominal values of per capita income are converted to real values using the US consumer price index (1982–1984 = 1) from the Bureau of Labor Statistics. Because real per capita income exhibits time trend and is non-stationary, we use the log real per capita income in our analyses.Footnote 10 From here on, we will refer the log real per capita income simply as income.

Annual weather variables are included because evidences show that temperature and precipitation can significantly affect regional economic outcomes (Brown et al. 2013; Dell et al. 2012; Horowitz 2009). The station-wide daily mean temperature (in centimeters) and precipitation (in Celsius degrees) are obtained from the Global Historical Climatology Network (GHCN) database at the National Oceanic and Atmospheric Administration (NOAA). We then aggregate daily weather data to get county-level annual total precipitation and mean temperature.

County rural–urban continuum codes are used in this analysis. They are updated decennially by the Economic Research Service (ERS) of the U.S. Department of Agriculture in 1983, 1993 and 2003. Following the definition of ERS, a county with rural–urban continuum codes less than or equal to three is considered as a metropolitan county. Otherwise, it is a non-metropolitan county.

3.2 Assignment of the treatment and control groups

We define large natural disasters as disasters with relative damage exceeding some cutoff values, and denote the cutoff value as k Footnote 11, which is chosen based on the probability distribution of relative damages from all disasters occurred between 1990 and 2012 in the US. For example, k = 250 means the disaster damage exceeds 250 times the per capita income of the previous year, which approximately corresponds to the 95th percentile of the relative damage distribution. If a county is hit by a natural disaster generating a relative damage greater than or equal to this cutoff value, the county is called a candidate treatment county with cutoff value k = 250.

The quality of a difference-in-differences estimation hinges on the quality of the control group. In this study, due to the lack of exogenous annual county-level data, the quality of the difference-in-differences estimation in essence depends on the degree to which the selected control group can help to control other confounding factors. We use county fixed effects to control the time-invariant unobserved county-specific characteristics. In order to control the unobserved time-varying factors, we resort to the Tobler’s first law of geography: “Everything is related to everything else, but near things are more related than distant things” (Tobler 1970) and construct the control group using neighboring counties.

We define the original dataset as the raw data. A county that receives a treatment event is called a candidate treatment county. A county sharing a common border with the candidate treatment county is called a candidate control county. In order to qualify as a control county, the candidate control county should satisfy the following requirements. Firstly, no treatment event occurs in the county in the same treatment year. This implies that the control county either has no disaster in the treatment year or when a disaster occurs, its relative damage is smaller than the cutoff level. Secondly, the county should have the same metropolitan and non-metropolitan classification as the candidate treatment county according to the rural-urban continuum code. Because metropolitan and non-metropolitan counties can differ significantly in the size and economic structure, this requirement avoids using a metropolitan county as a control for a non-metropolitan treatment county or vice versa. Thirdly, no treatment event should occur in a candidate treatment or control county in the 15 years before the treatment year, which ensures that the pre-event county income of the treatment or control group is not contaminated by previous treatment events. This implicitly assumes that the impact of large natural disaster does not last for more than 15 years. The sensitivity of the estimation results to this assumption will be discussed later. Finally, when we investigate whether the impact lasts for j years (i.e., impact duration equals to j), we also require that a candidate treatment or control county has no treatment event in the j years following the treatment year. This ensures that the county income in the post-event period is not contaminated by treatment events happened after the treatment year. This also implies that the sample size decreases when we try to capture long-run effects. For instance, if the cutoff value equals 150, separate datasets are created for each value of the impact durations (j = 1, 2,…, 10). Other things equal, the sample size for j = 1 is larger than that for j = 10, because the latter requires that both the treatment and control group should be free of treatment events for more years.

If a candidate treatment county has multiple candidate control counties satisfying all these requirements, we use the synthetic method to calculate the average income of these candidate control counties to construct the control group.Footnote 12 If a candidate treatment county does not have any candidate control county satisfying all these requirements, this candidate treatment county is excluded from the treatment group. This can occur for several reasons: (1) all neighboring counties fall into a different metropolitan or non-metropolitan category; (2) all neighboring counties receive a treatment event in the same treatment year; or (3) all neighboring counties receive at least one treatment event in the 15 years before the treatment year, or between the treatment year (yr) and the post-event period (j years after the treatment year: yr + j).

The above data generation process is illustrated in Fig. 3. Suppose the treatment event is defined as a relative damage exceeding some cutoff value and the impact duration is 9 years and County A is a non-metropolitan county that receives a treatment event in year 2000. In this case, the treatment year is 2000, the pre-event period is year 1999 and the post-event period is year 2009. Because no other treatment event occurred in this county from 1985 to 1999 (15 years before the treatment year) or from 2001 to 2009 (between the treatment year and the post-event period), this county is a candidate treatment county. We then proceed to construct its counterfactual control group. We start with neighboring counties that share the common border with County A. This allows us to exclude County B1 at the lower-left corner of Fig. 3. We then exclude County B2 and B3 because they are metropolitan counties. We exclude County B4 because it has a treatment event that occurred in the treatment year 2000. We then exclude County B5 because it has a treatment event occurred in 1990 which is within the 15 years prior to the treatment year. We also exclude County B6 because it has a treatment event in 2005, which is between the treatment year (2000) and the post-event period (2009). Consequently, only Counties B7, B8 and B9 are left. For this treatment event in County A in 2000, we construct one counterfactual control county using the average per capita income of these three counties. In this case, we have County A as a county in the treatment group and the counterfactual county in the control group. In cases where we cannot construct a counterfactual county, County A is excluded from the treatment group.

Fig. 3
figure 3

Construction of the counterfactual county in the control group

To illustrate how the selection procedure affects the spatial distribution of the treatment group, we plot all selected treatment counties in Fig. 4. It shows the spatial distribution of the treatment counties with cutoff value k = 250 and the impact duration j = 1. Some candidate treatment counties in Fig. 2 disappear because they do not satisfy some of the requirements.

Fig. 4
figure 4

Distribution of the treatment counties 1990–2012 (cutoff = 250, impact duration = 1)

There are several potential concerns in this selection of the treatment and control group that could lead to underestimate of the significance. Firstly, in cases where we cannot construct a counterfactual county, a candidate treatment county is excluded from the treatment group. Taking hurricanes as an example, counties at the center of hurricanes tend to suffer large damages. However, they are typically excluded from the treatment group because all their neighboring counties are probably stricken by the same hurricane.Footnote 13 The exclusion of these severely affected counties can potentially make the estimates insignificant. Secondly, for a disaster that generates damages in multiple counties, the total damage of all the counties are equally divided among the counties (HVRI 2013). We refer to this procedure as spatial average. This spatial average may overestimate the damage on counties that are marginally affected by the disaster. Comparing a marginally affected county with its neighboring counties not receiving the treatment event tends to generate an insignificant estimate for the impact. In short, in the construction of the treatment group, we tend to exclude counties with severe damages and include counties with marginal damages. In either case, the dataset we generate tends to underestimate the impacts of large natural disasters and leads to a statistically insignificant outcome. Given this tendancy to underestimate the impact, a significant estimated impact is a strong evidence suggesting some real impact of large disasters on income.

Finally, while the use of neighboring counties improves the similarity between the control and the treatment counties, it also introduces the potential spatial spillover effects. Consequently, the estimated difference-in-differences model is not the treatment effect in the common sense. It measures the relative economic performance of disaster-stricken counties as compared to their neighbors and allows the spatial spillover effects to unfold over time. This is an important practical question in itself because the relative economic performance is a key factor determining household/firm location decisions and jurisdictionary competition.

3.3 Data characteristics

To investigate how the severity of damage affects income in both the short and long run, we explore various cutoff levels (k) of the relative damage. Here, we present three cutoff values (k) 125, 250 and 500, which roughly correspond to the 92.5th, 95th and 97.5th percentile of the relative damage distribution, respectively.Footnote 14

As an example, Table 1 shows the summary statistics of economic variables and the distribution of disaster types for treatment counties across three cutoff values and one impact duration (j = 1). Per capita income, average transfer and employment rate in the selected sample are similar to those in the raw data, which include 4249 treated counties and span the years from 1990 to 2012. The top six disaster types by frequency in the selected sample are the same as in the raw data. However, the detailed composition is different. The selected sample contains relatively more incidences of tornado and flooding, and relatively fewer incidences of severe storm/thunder storm and wind.

Table 1 Summary statistics for treated counties from 1990 to 2012 (impact duration = 1)

3.4 Comparison between treatment and control groups before the treatment year

Because we normalize the property damage by average county income in the pre-event period, other things equal, a poor county maybe more likely to be included in the treatment group while a rich county maybe more likely to be included in the control group. To address this concern, we conduct the mean-comparison tests in the pre-event period and report the t-statistics in Table 2. For all cutoff values and impact durations, the mean incomes are not statistically different between the treatment and control groups.

Table 2 T-statistics of the mean-comparison test of the pre-event income

As shown in Fig. 5, the treatment and control groups have almost identical income distribution, and exhibit similar trend of change before the treatment event (see Appendix A in supplementary materials for a more detailed discussion).

Fig. 5
figure 5

Distributions of income in the treatment and control groups (impact duration = 1)

4 Results and discussions

4.1 Estimated results

As an example, Table 3 reports the estimation results from Eq. (1) for impact duration j = 6. It shows when the cutoff value is either 250 or 500, income of the treatment counties in the post-event period is significantly lower than that in the control counties. In addition, income is affected by extreme temperature, mostly the fat tail effects from temperature: lower income is associated with colder or hotter temperature.

Table 3 Estimation results of Eq. (1) with impact duration j = 6

We are interested in the relative income difference between the treatment group and the neighboring control group, as well as how such a difference depends on the cutoff values and impact durations. So we report estimates of β 1 in Eq. (1) for each of the three cutoffs (\(k = 125, \, 250, \, 500\)) and each of the ten impact durations (j = 1, 2, …, 10) in Table 4.

Table 4 Estimated treatment effects on income for all counties

Table 4 presents the main result of this paper: large natural disasters have significantly negative long-run impacts (impact duration \(j \, \ge \, 5\)) on income. Depending on the cutoff value and the impact duration, the difference of the estimated long-run effect between the treatment and control group ranges from −0.008 to −0.02, corresponding to an average income loss of about $112 to $267 in treatment counties as compared to the control counties.

The significant long-run impacts appear to be counterintuitive. However, it reiterates an important point made by Kousky: “different [spatial and temporal] boundaries of analysis can generate different results” (Kousky 2014). There could be several mutually non-exclusive explanations for this statistic pattern. Firstly, the short-run increase in the construction demand and capital investment due to property damages helps to smooth the short-run losses in capital stock. Households and firms might have planned for remodeling, reconstruction or capital investment even without the disaster. The occurrence of natural disasters only changes the timing of these investments. Therefore, the short-run increase in rebuilding and reconstruction is achieved at the cost of a decrease in the long-run investment. Secondly, the increase in disaster aids in the short-run diminish over time, as shown in the case of hurricanes (Coffman and Noy 2012; Deryugina 2013), thus the negative impacts become more evident in long run. Thirdly, natural disasters can lead to changes in risk perceptions and induce household/firm relocation among neighboring counties. If the occurrence of large natural disasters makes the treatment counties less desirable as compared to their neighboring counties, the income in the treatment counties will be lower than the control counties after the disaster (Banzhaf and Walsh 2008). Because migration is a slow process, this consequence may only be evident in long run. Finally, the insignificance of the short-run impacts may reflect the limitation of this research: the construction of the treatment and control counties in this research tend to generate insignificant results.

To examine the first two possibilities, we conduct a simple panel analysis using county-level employment and transfer paymentFootnote 15 data between 1990 and 2012. The panel regression model is written as:

$$\ln q_{{i{\text{yr}}}} = \alpha_{1} {\text{Short}}_{{i{\text{yr}}}} + \alpha_{2} {\text{Middle}}_{{i{\text{yr}}}} + \alpha_{3} {\text{Long}}_{{i{\text{yr}}}} + \alpha_{4} {\text{Metro}}_{{i{\text{yr}}}} + \alpha_{5} Z_{{i{\text{yr}}}} + c_{i} + \gamma_{\text{yr}} + \varepsilon_{{i{\text{yr}}}}$$
(2)

where q iyr is the log employment or log income transfer of county i in year yr. Dummy variables short iyr, middle iyr and long iyr are used to capture the disaster impacts 1–3, 4–7 and 8–10 years after the treatment event. Dummy variable metro iyr equals one if county i is metropolitan and zero otherwise. Z iyr is a vector of weather variables including mean temperature and annual precipitation. c i and γ yr are the county and year fixed effects, respectively. ɛ iyr is the error term.

We find both the employment and transfer payment increase in the short-run and decrease in the long-run (see Table 5). These results are consistent to the recent studies on hurricanes (Deryugina 2013), wildfires (Nielsen-Pincus et al. 2014) and tornados(Boustan et al. 2012), suggesting the first two explanations may contribute to the long-run significance in Table 4.

Table 5 Disaster impacts on employment and transfer income in affected counties

To investigate the last possibility,Footnote 16 we apply a panel regression using the raw data:

$$\ln y_{{i{\text{yr}}}} = \sum\limits_{j=10} {\theta_{1j} {\text{ Treat}}_{{i{\text{yr}} - j}} } + \theta_{2} {\text{Metro}}_{{i{\text{yr}}}} + \theta_{3} Z_{{i{\text{yr}}}} + c_{i} + \gamma_{\text{yr}} + e_{{i{\text{yr}}}}$$
(3)

where treat iyr is the dummy variable to show whether a disaster occurs in county i in year yr and j is the impact duration from one to ten. e iyr is the error term. The other variables are the same as in Eq. (2).

Equation (3) is run for four different cases as shown in Table 6. In the first case, dummy variable treat = 1 if any type of disasters occurs in the county. Otherwise, Treat = 0. This is equivalent to the case of cutoff value k = 0. The results are reported in the second column. Columns three to five report results of setting cutoff levels k = 125, 250 and 500, respectively. The only difference with the baseline regression in Table 4 is that the raw data are used in the regression and the assignment of treatment and control groups discussed in Sect. 3.2 is not applied. Results are summarized in Table 6. For all four cases, the long-run impacts of incidences of large disasters are significantly negative. Compared with Table 4, the results are statistically more significant, which is probably due to the serial correlation in the data (see Bertrand et al. 2004). As expected, the insignificant short-run impacts in the baseline model seem to be sensitive to the way the treatment and control groups are constructed.

Table 6 Impacts of disasters on income using the raw data panel sample

4.2 Robustness checks

We first test the sensitivity of our results to the disaster-free assumption before the treatment event. In the selection of the treatment and control groups, we impose a constraint that both the treatment and control counties should be free from large disasters in the 15 years before the treatment event. This assumption seems strong but is necessary to control for the potential long-run impacts from past incidences as shown in Table 4 and Table 6. However, it raises concerns about the sensitivity of the results to this particular assumption.

As a robustness check, we relax this assumption and require that both the treatment and control counties should be free of large disasters in the ten, five and two  yearsFootnote 17 before the treatment event. Three datasets are constructed based on these three specifications, and results are reported in Fig. 6. The estimates from the baseline case with the restriction of 15 years are plotted as the solid line in black. The corresponding dotted lines in black represent the 90 % confidence interval. The estimates and the corresponding confidence intervals for the ten-, five and two-year cases are plotted in red, green and blue, respectively. It is evident that the long-run significance is not sensitive to this specific assumption.

Fig. 6
figure 6

Relationship between estimated treatment effects and impact durations for three cutoff levels. The x-axis is the impact duration from one to ten, and the y-axis is the estimated treatment effect, together with 90 % confidence interval (CI)

As an extension of the baseline cases with three cutoff values of relative damage, we include another eight cutoff values ranging from 75th to 97.5th quantiles of the relative damage distribution.Footnote 18 As shown in Fig. 7, for cutoff values ranging from 125 to 500, significant income differences between the treatment and control groups are observed in long run but not in short run. When the cutoff values are too small or too big, neither the long-run nor the short-run impacts are significant. If the cutoff value is too small, the impact may not be strong enough to generate statistically significant differences. When the cutoff value is too high, the rapid reduction in the number of observations results in the statistical insignificance.

Fig. 7
figure 7

Relationships of treatment effects and cutoff levels for each impact duration. The cutoff levels  are ploted on the x-axis. The estimated treatment effect, together with 90 % confidence interval (CI) are ploted on the y-axis

As an alternative specification, we use the detrended income as the dependent variable in Eq. (1). As shown in Table 7, results are consistent with the baseline results in Table 4. We also split out sample by metropolitan and non-metropolitan counties and find similar results (see Appendix B in supplementary materials).

Table 7 Estimated treatment effects on detrended income for all counties

5 Conclusion

In this paper, we examine the impact of large natural disasters occurred between 1990 and 2012 on county per capita income using the SHELDUS dataset that covers 18 types of natural hazards. In contrast to existing regional studies that typically focus on one specific type of disasters, we investigate the regional economic impacts of multiple types of natural disasters.

Using a difference-in-differences model with fixed effects and controlling for serial correlation in the county per capita income data, we find that comparing with neighboring counties, large natural disasters have significant long-run impact, and the difference of the real per capita income falls in a range between $112 and $267. Recognizing these differences and developing location-based emergency plans are important for effective disaster assistance programs. Results from this paper also suggest that we probably need to reform the existing disaster relief programs in order to mitigate the long-run impacts.