1 Introduction

Although the house price bubble appears obvious in retrospect–all bubbles appear obvious in retrospect–in its earlier stages, economists differed considerably about whether the increase in house prices was sustainable… whether the bubble was national or confined to a few local markets…

~ Ben S. Bernanke, former Chairman of the Board of Governors of the Federal Reserve, at the Annual Meeting of the American Economic Association in January 2010

This study examines the nature of the recent bubble-like boom-bust cycle in the US from a forward-looking perspective, particularly for the housing crisis since 2007. The answer to this question means much to households and real estate investors since a nationwide housing crisis implies few risk diversification opportunities. Otherwise, if the US housing crisis is only localized, it is possible to substitute housing assets in non-crisis regions for those suffering a price collapse. In addition, it provides important implications for policy-making effectiveness. On the one hand, a localized housing crisis is indicative of the limited power of monetary policies in stabilizing housing market dynamics since they could be influenced by different shocks. On the other hand, a nationwide housing crisis suggests an active role of the government in mitigating unfavorable fluctuations in local housing markets.

This paper utilizes a dynamic factor model, which makes factor-based forecasts for the US housing market in the spirit of Banerjee et al. (2008), Hendry and Clements (2004), and Stock and Watson (2002b, 2007) who all propose well-estimated factors under instability. The framework is capable of incorporating a large number of housing prices and housing volumes,Footnote 1 extracting housing factors to represent their co-movements, and taking a variety of structural instability into consideration. The three scenarios of structural instability consist of a structural break in the housing-factor loadings, a break in housing factor dynamics, or a break in the idiosyncratic dynamics of housing markets. The research differs from the housing literature as it investigates both housing prices and volumes at disaggregate levels from a forward-looking perspective, motivated by Leamer (2007) and Moench and Ng (2011). As a result, it estimates the housing factors that govern the cross-state co-movements in terms of prices and volumes.

The sample period spans from 1988 to 2012, covering two boom-bust housing cycles: the previous one in the 1990s and the recent one in the 2000s. This study chooses 1999 to serve as the single break, which is suggested by many empirical studies. In order to examine the nature of the housing boom-bust cycle after 1999, this study compares predictive performances of three scenarios: first, the forecast using housing factors estimated from the full sample and full-sample estimates of the factor loadings (“full–full”); second, using housing factors estimated from the full sample and split-sample estimates of the factor loadings (“full-split”); third, using split-sample estimates of housing factors and factor loadings (“split–split”).

The main contributions of this study lie in the implications for the nature of the recent housing boom-bust cycle through comparison analyses among the three scenarios of factor-based forecasts. If full–full forecasts are the best ones, it suggests that forecasts of disaggregate housing market dynamics are not improved as instability across the two subsamples is considered. If split–split forecasts are the best ones, it implies that the estimated housing factors significantly differ across the pre-1999 and post-1999 subperiods. If full-split forecasts are the best ones, it indicates that instability of housing market dynamics matters from a forward-looking perspective, but extracted housing factors over subsamples are qualitatively the same compared to those estimated over the whole sample period from the forecast viewpoint. Thus, the housing factors over the entire period are sufficient to reflect various common shocks to local housing markets.

Overall, the results suggest that state-level housing price cycles are more unstable than housing volume cycles. The probability of rejecting stability for the Northeast is the highest among 4 regional housing markets. This study provides confirmative evidence for difficulties in predicting disaggregate housing markets dynamics owing to instability. The results show that we gain more as we make forecasts by using common factors estimated based on the housing time series spanning over a long period (full sample) and time-varying coefficients across subperiods. In particular, in the post-1999 subperiod, full-split forecasts yield better performances than the other two methods for all state-level housing markets. The findings suggest that no new shock emerges to influence cross-state co-movements after 1999, aligning with the argument made by Stock and Watson (2012) who attempt to quantify the uniqueness of the 2007–2009 recession. If there is a dynamic distinction between crisis states with dramatic housing boom-bust cycles and non-crisis states with milder housing market fluctuations in the post-break period, a new common shock occurs after 1999. The results of forecast comparisons give support to the absence of new factors which uniquely belong to the post-break subperiod. Thus, the co-movements across state-level housing markets remain after the structural break, and the post-1999 boom-bust cycle is nationwide. Important, the ways state-level housing markets respond to the old common shocks in the post-1999 are different from those in the pre-1999 since the factor loadings alter across subsamples in full-split forecasts. Based on the superiority of full-split forecasts over alternatives, the study provides supportive evidence on the nationwide nature of the recent housing boom-bust cycle, and the result reconciles with the argument by Del Negro and Otrok (2007).

The remainder of this paper is organized as follows. Section 2 reviews the literature which motivates this study. Section 3 presents the data and methods. Section 4 discusses the main findings regarding housing market instability and factor-based forecasts. Finally, Sect. 5 makes concluding remarks.

2 Motivation

This study is primarily motivated by and builds upon four strands of literature: studies using dynamic factor models to investigate macroeconomic aggregates, financial and local housing markets, analyses about forecasts of housing market dynamics, papers addressing structural instability of housing markets, and the literature emphasizing the housing volume. The following paragraphs cite a few highly relevant papers that support this research or whose results can be contrasted with our own.

Numerous empirical studies use dynamic factor models to forecast macroeconomic aggregates. For instance, Ng (2012) adopts probit forecasting models, which incorporate various risk factors, to predict the duration and turning points of the US recession. Poncela et al. (2011) compare forecasting performances of dynamic factor models with other three competing frameworks for the US macroeconomy. Stock and Watson (2012) quantify the extent to which the 2007–2009 recession differs from previous ones; they use a high-dimensional dynamic factor model which incorporates 200 macroeconomic variables and 6 housing factors. Some authors observe macro-economies of countries other than the US. For example, Boysen-Hogrefe (2013) analyzes the European debt crisis under a dynamic factor model with time-varying loadings and 2 factors. Koop and Korobills (2011) improve forecasts of the UK macroeconomy by developing dynamic model averaging and model selection in order to solve the problem of over-parameterization in block-factor models. Lee (2012) addresses business cycle co-movements in Europe by using a dynamic factor model with time-varying parameters. Zaher (2007) considers 167 monthly data of UK economic and financial variables, and argues that dynamic factors models outperform alternatives. Recently, some authors have applied factor-based models to study financial and commodity markets. For instance, Akay et al. (2013) use a dynamic factor model with Markov regime-switching components to discuss contagion and time-varying risk-adjusted return for the hedge fund. Berger and Pozzi (2013) use a latent factor model to investigate time-varying linkages of stock markets across five developed countries. Cordis and Kirby (2011) analyze stock markets by establishing regime-switching factor models. Garcia-Martos et al. (2011) extract common factors from electricity prices to forecast electricity markets.

This study is particularly motivated by the empirical literature which applies factor-based models to investigate local housing markets. For instance, Fadiga and Wang (2009) specify the common cyclical and trend components in the four regional housing markets in the US, and point out that they share three common cyclical and two common trend components under a multivariate state-space model. As Holly et al. (2010) address, there are unobserved common factors and spatial effects across state-level housing markets, which are governed by error terms of the housing price regression on some economic fundamentals. Extending the framework of Holly et al. (2010), Baltagi and Li (2014) suggest that spatial dependence at city levels is larger than that at state levels, and show the interesting fact that factor loadings alter over time horizons: estimated factor loadings of the bubble housing marketsFootnote 2 are negative in 1975–2003, but many of them turn out positive when the estimation period covering 1975–2011. Moench and Ng (2011) extend the dynamic hierarchical factor model to analyze associations between housing and consumption, emphasizing the distinction between national and regional common factors extracted from disaggregate housing markets in the US. Del Negro and Otrok (2007) adopt a dynamic factor model to describe how changes in 48 state-level house price indexes are governed by two factors: a latent national factor and a state-level factor. Their research suggests that house price dynamic differences can be largely explained in terms of the states’ exposures to the national factor during the recent housing boom. Stock and Watson (2009) utilize a dynamic model with time-varying volatility to observe co-movements across state-level housing permits.

The literature discussing co-movements across state- and MSA-level housing markets also provides light on policy-making implications: strong cross-market interplay is indicative of relative effectiveness of monetary policies in stabilizing housing cycles. In addition, the nature of the recent housing bubble is intensely debated by empirical studies. For instance, Del Negro and Otrok (2007) suggest the occurrence of a national bubble: the housing boom in 2001–2005 was a “national” phenomenon. More recently, Flor and Klarl (2017) advocate that the decline in the mortgage rate enhanced similarity between the MSAs and the national housing cycle in the pre-2006 period and thus monetary policy played a role during the housing boom. Otherwise, Miles (2015) suggests that regional housing markets were driven by “local” bubbles in the early years of the housing bubble 2001–2005, followed by a “national” housing bust in 2005–2012.

Regarding forecasts of housing markets, Crawford and Fratantoni (2003) claim the difficulty in predicting the prices of individual houses, but they argue the possibility of forecasting an aggregate housing price index. Campbell and Cocco (2007) and Miller et al. (2011a, b) show that the predictable component of the housing price influences macroeconomic aggregates more strongly than does the unpredictable one. Hassani et al. (2017) compare forecasting performances for national and regional house sales and suggest the superiority of the nonparametric approach of singular spectrum analyses. Huang (2013b) examines housing price predictability at nationwide and state levels, and finds that households’ expectation has a stronger predictive power than interest rates. Rapach and Strauss (2009) propose that housing price forecastability differs across states, and suggest that the presence of local housing bubbles is associated with low price predictability.

However, as stated by Stock and Watson (2007), factor-based forecasts with structural breaks have hardly been studied. Among few examples, Breitung and Eickmeier (2011) develop a generalized LM test of structural breaks in dynamic factor models and observe whether there are structural breaks in the US and European business cycles. To take structural instability into consideration, this study uses the 1st quarter of 1999 (1999Q1) as the breakpoint to divide the entire sample into two sub-periods, 1988Q2–1998Q4 and 1999Q1–2012Q2, in an attempt to incorporate a structural break in forecasts of housing price dynamics. The breakpoint helps us examine whether the recent housing boom-bust cycle is national or localized. The papers that focus on methodological specifications for structural breaks of housing markets are rare: most related studies only approximate the breakpoint of the recent housing dynamics. Among few examples, Goodman and Thibodeau (2008) point out that homeownership increased substantially from 1999 to 2005; Huang (2013a) chooses 1999 as the permanent structural break in the US housing price dynamics; Lai and Van Order (2010) suggest that the momentum behavior of housing prices increased after 1999; Shiller (2006) finds that US second home mortgages doubled since 1999; Shiller (2008) advocates that the US housing boom showed a nationwide spillover since 1999. Although none of these studies propose a method to specify structural breaks in the housing market, they implicitly suggest that the US housing market displays noticeably different patterns in the post-1999 compared to the pre-1999 subperiod.

This study extracts housing factors from local housing prices and housing starts. The importance of housing volumes is advocated in the recent literature on housing market dynamics. For instance, using US data, Akkoyun et al. (2013) discuss correlation differences between housing prices and transaction volumes at different frequencies. Clayton et al. (2010) propose a positive correlation between housing prices and volumes as they characterize the housing market cycle from 1990 to 2002. Croce and Haurin (2009) suggest performances of consumer sentiment in forecasting dynamics of three housing volumes: housing permits, housing starts and sales. Leamer (2007) documents that instead of the price cycle, the housing volume cycle matters to the US business cycle. For European countries, Oikarinen (2012) shows positive linkages between housing price changes and housing sales and negative relations between housing price levels and transactions; Wit et al. (2013) find a positive correlation between housing price growth and housing transactions in the Dutch housing market.

In particular, housing starts are commonly used to represent housing boom-bust cycles (Baffoe-Bonnie (1998), Edelstein and Tsang (2007), Hao and Ng (2011), Moench and Ng (2011), Silos and Vilan (2009), Stock and Watson (2012), Topel and Rosen (1988, etc.). One of the pioneering papers is Topel and Rosen (1988). They point out that housing starts are used to characterize booms and busts of the US housing markets. In the spirit of Topel and Rosen (1988), Edelstein and Tsang (2007) define housing investments as the units of new housing starts. Moench and Ng (2011) emphasize that the housing starts and permits are informative for the future market conditions as they incorporate housing volumes into their analyses about US local housing markets. Thus, the paper chooses housing starts to proxy for housing volume cycles, motivated by the existing literature.

3 Dynamic Factor Model

The static version of the factor model takes the following form:

$$ X_{t} = \varLambda_{t} F_{t} + e_{t} , $$
(1)

where \( X_{t} \)=(\( X_{1t} \),…, \( X_{nt} \)), Ft is the r-vector of static factors; \( \Lambda _{t} \) is the n × r matrix of factor loadings; \( e_{t} \)=(\( e_{1t} \),…, \( e_{nt} \))′ is the n-vector of idiosyncratic disturbances. Noticeably, Eq. (1) differs from standard formulations of the DFM by allowing for possibly time-varying factor loadings.

The price data used in this study are the state and regional housing prices from Freddie Mac. These Freddie Mac house price indexes are deflated by the core CPI (Consumer Price Index for all urban consumers: all items less food and energy) which comes from U.S. Department of Labor: Bureau of Labor Statistics. The state and regional volume data chosen for the study are Privately Owned Housing Starts Authorized by Building Permits: 1-Unit from U.S. Department of Commerce: Census Bureau. Because the volume data are available from 1988, the analyzed period spans from 1988 to 2012Q2.Footnote 3 All time series are seasonally-adjusted by the Census X-12 approach. All 51 states are selected and the four regions are the Northeast, the Midwest, the South, and the West. There are 55 house price series and 54 housing volume series (excluding Washington, DCFootnote 4). For each state and region, the real housing price (volume) growth is computed as the difference in the logs of the real housing price (volume). This study incorporates 109 housing market variables, state-level and regional housing price and volume growths, into the estimation of housing factors.

Following the literature on dynamic factor models, this study assumes finite-order autoregressive dynamics for the housing factors and the idiosyncratic term:

$$ F_{t} = \varPhi_{t} F_{t - 1} + \eta_{t} $$
(2)
$$ e_{it} = a_{it} \left( L \right) e_{it - 1} + \varepsilon_{it} ,\quad i = \, 1, \ldots ,n $$
(3)

where \( \eta_{t} \) is the r-vector of factor innovations with E(\( \eta_{t} \)|\( F_{t - 1} \),\( F_{t - 2} \),…, \( X_{it - 1} \),\( X_{it - 2} \),…) = 0. Equation (1)–(3) describe the static factor model derived from the dynamic factor framework if finite lag lengths and VAR factor dynamics are assumed. In the dynamic version, \( F_{t} \) contains lags of the dynamic factors and \( \varPhi_{t} \) is a companion matrix so that the static factor follows the 1st-order dynamics.

Suppose that E(\( \varepsilon_{is} \)|\( F_{t} \),\( F_{t - 1} \),…,\( X_{it} \),\( X_{it - 1} \),…) = 0 and E(\( \eta_{s} \)|\( F_{t} \),\( F_{t - 1} \),…, \( X_{it} \),\( X_{it - 1} \),…) = 0 for s > t, and that the idiosyncratic errors {\( \upvarepsilon_{it} \)} are uncorrelated with the factor disturbances {\( \eta_{t} \)} at all leads and lags. Then, given the data and factors over time t, and the assumption that the potentially time-varying parameters are available, the h-step ahead conditional expectation of \( X_{it + h} \) is as follows:

$$ \begin{aligned} X_{it + h|t} & = E(X_{it + h} |F_{t} ,F_{t - 1} , \ldots ,X_{it} ,X_{it - 1} , \ldots ) \\ & = E(\varLambda_{t + h} F_{t + h} + e_{t + h} |F_{t} ,F_{t - 1} , \ldots ,X_{it} ,X_{it - 1} , \ldots ) \\ & = \beta_{it}^{h\prime } F_{t} + a_{it}^{h} (L) e_{it} \\ \end{aligned} $$
(4)

where \( \beta_{it}^{h\prime } \)= \( \Lambda _{it + h} \prod\nolimits_{s = t + 1}^{t + h} {\varPhi_{s} } \) and \( a_{it}^{h} \)(L)\( e_{it} \) = E[\( a_{it + h} \)(L)\( e_{t + h - 1} \)|\( F_{t} \),\( F_{t - 1} \),…,\( X_{it} \), \( X_{it - 1} \),…] = E[\( e_{it + h} \)|\( e_{it} \),\( e_{it - 1} \),…], where the latter is obtained by making the assumption that {\( e_{it} \)} and {\( \eta_{t} \)} are independent and that expectations are linear.

Motivated by the literature on structural breaks in housing market dynamics, this study assumes a single break at time t = τ and considers three scenarios, which incorporate a break in Λ, Φ, or \( a_{it} \)(L), respectively. The three scenarios are presented as follows:

Scenario 1: A single break in factor loadings, \( \varLambda\), of the forecast regression

Specifically, \( \varLambda_{it} \)= \( \varLambda_{i1} \), t < τ, and \( \varLambda_{it} \)= \( \varLambda_{i2} \), t ≥ τ. In the scenario, Eq. (4) is modified to be the following:

$$ X_{it + h|t} = \left\{ {\begin{array}{*{20}l} {\beta_{i1}^{h\prime } F_{t} + a_{i}^{h} \left( L \right)e_{it} , t < \tau } \hfill & \quad {where \qquad \beta_{i1}^{h\prime } = \varLambda_{i1} \varPhi^{h} } \hfill \\ {\beta_{i2}^{h\prime } F_{t} + a_{i}^{h} \left( L \right)e_{it} , t \ge \tau ,} \hfill & \quad {where \qquad \beta_{i2}^{h\prime } = \varLambda_{i2} \varPhi^{h} } \hfill \\ \end{array} } \right. $$
(5)

Scenario 2: A single break in the factor autoregressive process, Φ, of the forecast regression

Specifically, \( \varPhi_{t} \)= \( \varPhi_{1} \), t < τ, and \( \varPhi_{t} \)= \( \varPhi_{2} \), t ≥ τ. In the scenario, Eq. (4) is modified as follows:

$$ X_{it + h|t} = \left\{ \begin{array}{l}\beta_{i1}^{h\prime} F_{t} + a_{i}^{h}(L)e_{it} , t < \tau \quad where \qquad \beta_{i1}^{h\prime } = \varLambda_{i} \varPhi_{1}^{h} \\ \beta_{i2}^{h\prime } F_{t} + a_{i}^{h}(L)e_{it}, t \ge \tau , \quad where \qquad \beta_{i2}^{h\prime} = \varLambda_{i} \varPhi_{2}^{h} \hfill \end{array} \right. $$
(6)

Scenario 3: A single break in innovations’ autoregressive dynamics, a it ( L ), of the forecast regression

Specifically, \( a_{it} \)(L)=\( a_{i1} \)(L), t < τ, and \( a_{it} \)(L)=\( a_{i2} \)(L), t ≥ τ. In the scenario, Eq. (4) is modified to be the following:

$$ X_{it + h|t} = \left\{ {\begin{array}{*{20}l} {\beta_{i1}^{h\prime } F_{t} + a_{i1}^{h} \left( L \right)e_{it} , } \hfill & {t < \tau } \hfill \\ {\beta_{i2}^{h\prime } F_{t} + a_{i2}^{h} \left( L \right)e_{it} ,} \hfill & {t \ge \tau } \hfill \\ \end{array} } \right. $$
(7)

where \( \beta_{i}^{h\prime } \) = \( \varLambda_{i} \varPhi_{h} \).

Although the scenario of instability is not known a priori, we are able to identify the feature of observed structural instability by working backwards (Stock and Watson (2008). This study conducts 4-period ahead predictions (i.e., p = 4) for the US housing market. The 4-period ahead forecast, \( X_{it + 4}^{\left( 4 \right)} \), corresponds to housing price (volume) growths over the next 4 periods. All forecasts are obtained by the following forecasting regression:

$$ X_{it + 4}^{\left( 4 \right)} = \mu_{i} + \beta_{i}^{\prime } \hat{F}_{t} + \mathop \sum \limits_{j = 0}^{p - 1} \alpha_{ij}^{4} \widehat{{e_{it - j} }} + error $$
(8)

4 Empirical Findings

The paper has three main tasks. First, it determines the number of housing factors for (sub)samples. Second, it conducts instability tests for in-sample factor-loadings and the three scenarios of out-of-sample forecasts for price and volume cycles. The three scenarios are: the forecast using housing factors estimated from the full sample and full-sample estimates of the factor loadings (“full–full”), that using housing factors estimated from the full sample and split-sample estimates of the factor loadings (“full-split”), and that using split-sample estimates of housing factors and factor loadings (“split–split”). Third, it compares the forecasting performances of the three scenarios in the pre- and post-break subsamples.

4.1 How Many Housing Factors?

This study adopts two approaches to estimate the number of housing factors. Table 1 shows the first estimation: Panel information criteria (ICp) suggested by Bai and Ng (2002), and the results point out that there are four factors for the full sample and 2–4 factors for subsamples. Table 2 exhibits the second approach proposed by Stock and Watson (2002a, b), and close-to-unity canonical correlations suggest that the full-sample and subsample factors are spanned over the same factor space. The result indicates that the squared canonical correlations decline substantially as four factors are considered for both the full and subsamples, particularly for the post-1999 subsample (drop from the 3rd canonical correlation of 0.74 to the 4th one of only 0.26), corresponding to the estimates based on Bai and Ng (2002). Based on the results of the two approaches, this study chooses four factors for the full sample and three factors for the two subsamples.

Table 1 Number of factors estimated using Bai and Ng (2002) criteria
Table 2 Canonical correlations between subsample and full-sample estimates of the factors

4.2 Structural Instability of Housing Price Cycles and Volume Cycles

Table 3 shows Chi squared Chow statistics based on Newey–West (1987) to test stability for three scenarios. The results suggest that in general housing prices are more unstable than housing starts. With respect to factor loadings (in-sample), there are 32 states with instable prices, but only 17 states with instable housing-start dynamics at the 5% significance level. Regarding 4-period ahead forecasts, all states display instability of factor-loading forecasts (except for Connecticut, Maine, and New York for housing prices; North Dakota for housing volumes). Also, most states display unstable forecasts of factor autoregressive process across subperiods, except for California, Connecticut, Georgia, Maine, New York, Oklahoma, and Washington DC for house prices; Connecticut, Hawaii, and North Dakota for housing starts. Otherwise, the instability test results of the third scenario, instability in idiosyncratic dynamics, are mixed. There are in total 19 states showing stable forecasts of idiosyncratic housing price dynamics, and only 14 states displaying stable forecasts of idiosyncratic volume movements. Still, state-level housing markets showing instable idiosyncratic dynamics dwarf those lacking this instability feature. The results provide evidence that it is challenging to make forecasts of housing price and volume cycles for most states primarily due to instability in factor loadings and factor autoregressive process.

Table 3 Chow statistics testing the stability of the factor loadings and three scenarios instability in 4-step-ahead forecasting equations

Table 4 reveals instability patterns across different regional housing markets. Not surprisingly, the probability of rejecting the null hypothesis of stability for the Northeast (80%), which contains many states with high fluctuations in housing market dynamics, is the highest among the 4 regions. The West has the second highest probability (61%), and the South has the lowest one (51%). Instability in the Northeastern housing market occurs in the first and second scenarios, factor loadings and housing factor dynamics, but is not the case in the idiosyncratic dynamics (the probability is only 45% for the Northeast). The results echo the ideas in the real estate literature, which points out that the Northeast and the West have nonlinear housing market dynamics and are more subject to housing bubbles than the Midwest and the South (e.g., Huang 2014; Kim and Bhattacharya 2009; Moench and Ng 2011; etc.).

Table 4 Summary of Chow tests by 4 housing regions: null-hypothesis rejection proportion at the 5% significance level

4.3 A Nationwide or Localized Housing Bubble? Implications from Three-Scenario Forecast Comparisons

Table 5 displays how much forecasts of housing cycles can be improved as structural instability is considered by showing relative mean square errors (MSEs), the ratios of two residuals from two different in-sample regressions. Specifically, this study computes the ratios for MSEs of full-split forecasts (computed by the residuals from split-sample regressions onto full-sample factors) over those of full–full forecasts (computed by the residuals from full-sample regressions onto full-sample factors). Less-than-unity ratios indicate that full-split forecasts have lower MSEs and thus outperform full–full forecasts. Similarly, as the MSEs of the split–split predictions are smaller than those of the full-split methods, split–split forecasts outperform full-split forecasts.

Table 5 Relative MSEs of 4-step ahead Forecasts: Housing Prices and Volumes

Regarding housing price cycles, the MSE ratios of all states show that full-split forecasts outperform full–full forecasts because all the ratios are smaller than unity before 1999. In particular, price full-split forecasts of Nevada, Florida and Arizona are improved the most significantly compared with full–full scenarios because their ratios are quite low (0.13, 0.14, and 0.22, respectively). On the other hand, price full-split forecasts of Alaska, Connecticut, Maine, Oklahoma, and South Dakota show little improvement (by less than 20%) compared with full–full scenarios. Also, in the post-1999 subperiod, price full-split predictions are better than full–full scenarios for all states, even though improvement in full-split forecasts of some states (Illinois, Massachusetts, New York, Pennsylvania, Washington DC, and Wisconsin) is marginal (by less than 10%). Nevertheless, the comparison results between full-split and split–split forecasts are mixed. In the pre-1999 subperiod, among the 19 states and 2 regions whose full-split forecasts are better than split–split forecasts, split–split forecasts are improved marginally (by less than 10% in MSEs) compared with full-split scenarios for 7 states and 2 regions. In the post-1999 subperiod, full-split forecasts serve as the best method among all scenarios for all states.Footnote 5

Similar to housing price cycles, the relative MSEs of all states show that full-split forecasts of housing volume cycles outperform full–full forecasts (except for Rhode Island and Vermont in the post-1999 subsample). In general, the forecast improvement in the pre-1999 subperiod is more substantial than in the post-1999 subperiod given that MSE ratios are lower for the former than the latter. In addition, full-split forecasts are dominated by split–split forecasts in more states in the pre-1999 (17 states and 1 region) than the post-1999 (5 states). However, for a majority of states, full-split forecasts are superior to full–full and split–split scenarios in housing volume cycles. The superiority of full-split forecasts of local housing market dynamics suggests a nationwide nature of the recent housing boom-bust cycle. In general, superior forecast performances based on full-sample factors suggest that subsample factors do not capture new common shocks to state-level housing markets after 1999. In other words, there is no peculiar shock occurring during the recent dramatic housing boom-bust cycles after 1999. Since factor loadings differ across subsamples in the full-split scenario, the findings lend support to the idea that local housing markets respond to old common shocks in a different manner during the post-1999 subperiod.

If compared to the pre-break subperiod, cross-state linkages did not change and the dynamic distinction between crisis states and non-crisis states was trivial in the post-break period, there was no new common shock to state-level housing markets over the recent boom-bust cycle. Thus, the 2007 housing crisis prevailed in a nationwide scope. The logic of this argument is supported by Stock and Watson (2012).Footnote 6 Moreover, this study’ findings are consistent with those in Del Negro and Otrok (2007), who advocate that the house price swing from 2001 to 2005 is a national phenomenon. Overall, this study provides new evidence for a nationwide housing crisis and thus suggests an active role of the government in mitigating unfavorable fluctuations in US housing markets during the recent decade.

5 Conclusions

This study provides insights in regard to the question, raised by former Chairman of the Board of Governors of the Federal Reserve in 2010: “whether the bubble was national or confined to a few local markets”. A dynamic factor model, which incorporates house prices and volumes in all states and the 4 Census regions in the US, is utilized for investigations into local housing markets from 1988 to 2012. Three varieties of structural instability in forecasts of housing markets are examined: a structural break in housing factor loadings, a break in housing factor dynamics, or a break in idiosyncratic dynamics of housing markets.

The findings suggest that state-level housing price cycles are generally more unstable than housing volume cycles. The evidence of dynamic instability for the Northeastern housing market is the strongest among 4 regional housing markets. The results underscore the difficulty in predicting disaggregate housing markets dynamics owing to instability. We can obtain superior forecasts by using housing factors estimated by housing prices and volumes over a long period (full sample) and time-varying coefficients across subperiods before and after 1999. In the post-1999 subperiod, the full-split forecast has the best performance among the three methods. To sum up, this study provides new evidence for the nationwide nature of the recent housing boom-bust cycle, and thus suggests few risk diversification opportunities in real estate investment and an active role of the Fed in stabilizing the nationwide housing market dynamics.

The paper yields risk management and policy-making implications from the nationwide housing cycle, and it extends the literature along several directions. First, it would be of interest for future research to explore underlying drivers of housing boom-bust cycles. In particular, adding some influential predictors, such as interest rates and housing-market expectation indexes, into the factor-based frameworks could enhance forecast performances of housing market dynamics. Second, the issue concerning effectiveness of monetary-policies in stabilizing housing cycles is worthy of discussions from other perspectives. Third, more of well-developed methods can be applied to examine instability and identify the breakpoints of housing markets. Finally, more efforts can be taken to analyze dynamic linkages across local housing markets. All these lines of future research are exciting.