1 Introduction

Prior studies in housing market have shown the significant role of housing assets in the capital allocation (i.e., Mill 1989; Cannon et al 2006). Understanding the house price behavior is undoubtedly crucial for successful modeling of asset prices. Furthermore, the correct identification of true nature of data-generating process for house prices will enable policymakers to design the appropriate forecasting model for housing market which signals information about future movements in economics activity. Such appropriate identification process could lead to a better policy control. The literature in non-linear dynamics of house price is not to great extent compared to nonlinearity in security prices (Hsieh 1991; Lim and Brooks 2011; and Caraiani 2014). While existing studies on house pricing predominantly focused on using linear framework, the non-linear dynamics property in housing markets is of critical concern because it is indicative of market inefficiency and possible presence of housing bubbles. If the housing markets were indeed characterized by nonlinear property, using a linear model to forecast housing price may result in an incorrect house pricing and poor forecasting performance. Recent attempts that address the issue of nonlinearity in housing markets do not provide a clear understanding of the nature of non-linear dependence driving the home prices. In general, the nonlinearity can stem from either nonlinearity in functional form or non-Gaussian innovations. Understanding the source of asymmetry has become increasingly more important in designing a successful forecasting model. The non-Gaussian model may become a correct approach if the nonlinearity in housing markets was caused by non-Gaussian innovations as opposed to a true nonlinearity in the model.

There are many reasons that lead us to believe that U.S. housing markets may exhibit non-linear price pattern that might be consistent with bubbles. First, unlike other financial markets, housing markets possess unique characteristics such as a lack of quality information disclosure, high transaction costs, low liquidity, limited house pricing transparency, very rigid supply side and short-trading limitation, making this housing market different and more prone to exhibit non-linear price behavior and bubbles (Herring and Wachter 2002). Second, housing prices in the U.S. adjust more quickly to positive market events leading to an increase in equilibrium price rather than negative market events causing a price decline particularly during the Great Recession (Kim and Bhattacharya 2009). Third, a huge lump sum of transaction costs and the lack of short trading in the housing markets can cause a few arbitrage opportunities arising from large deviations between house prices and fundamentals. Such departure of house prices from fundamental values could create a nonlinear behavior and a bubble (Rosenthal 1999; Muellbauer and Murphy 1997). Fourth, housing market is unique because it does not clear instantly after economic shocks. It takes some time for buyers and sellers to locate each other and for the suppliers to construct new buildings to meet the demand, which can lead to prices discontinuity and non-linearity.

Motivated by the importance of U.S. housing market as an investible asset and its unique characteristics, this study comprehensively analyzes the nonlinear behavior and its underlying source of nonlinearity for aggregate U.S. house prices along with plausible explanations for chaos and bubble-like characteristics during 1987 to 2019 period. We extend our analysis of nonlinearity to housing markets in five regions including Northeast, Midwest, Southeast, Southwest, and Western using housing data across 20 different cities. Typically, house prices react more to the regional economic and demographic shocks rather than nationwide shocks. The focus on regional housing markets allows us to compare the housing dynamics across regions.

This study analyzes the following four research questions systematically. First, do the U.S. housing markets exhibit some forms of dependency? Second, if the dependency persists, is it due to linear, non-linear, or chaotic behavior? In addition, is the nonlinearity in housing markets caused by nonlinearity in the return series or by movement in macroeconomic variables? Third, is the nonlinear dependency consistent with the characteristic of a bubble? Lastly, is the nonlinearity in housing markets caused by asymmetric nonlinear behavior in the functional form or asymmetric innovations? To derive the consensus view of nonlinearity in housing markets, this paper utilizes a variety of techniques, including the BDS test, the chaos test, the Markov chain test, the duration dependence test for bubbles, and the time reversibility test for the source of nonlinearity.

The contribution of this study in discovering true underlying nature of data-generating process for house prices will improve the forecasting performance. If the house price movement indeed reflects non-linear adjustment, employing a linear model to forecast house price movement will generate an inaccurate and inefficient forecast of house prices, which in turn resulting in wrong prediction of economy as house prices typically signal the stage of real economic activities. The finding of true cause of asymmetric price pattern helps policymakers and investors to better design a forecasting model that accounts for nonlinearity possibly induced by non-Gaussian innovations as opposed to nonlinearity in a functional form. The most notable contribution of our research provides insights as to housing markets in which regional areas are more likely to exhibit asymmetry and prone to experience bubbles. The housing price cycles would help the policymakers to recognize housing imbalances in each region as the developments in housing markets have the major impact on the local economy.

Our results based on a BDS test report evidence of nonlinear dependence in the aggregate market along with housing markets in 20 cities. The K-map Z-map analysis shows that nonlinear dependency in housing markets in all cities is consistent with the chaotic behavior. Furthermore, Markov Chain test reports the persistence of either positive or negative returns in housing markets. Applying the duration dependence test on positive runs of housing returns confirms that housing markets in all cities experience some episodes of bubbles, except Detroit, MI, and Minneapolis, MN in Midwest region. A time reversibility test further reveals that the mechanism generating nonlinear dependence in the housing markets in all four cities in Midwest region comes from the non-Gaussian innovations. Similar finding is also reported for housing markets in Atlanta, GA, Charlotte, NC, Dallas, TX, and San Diego and San Francisco, CA, suggesting that a linear function with non-Gaussian error terms is appropriate for modelling these housing markets.

The remainder of the paper is organized as follows. Section 2 provides the literature review for nonlinear dependency and bubbles in housing markets. Section 3 describes our data and summarizes the descriptive statistics of the U.S. Case-Shiller housing index for national and 20 cities. Section 4 presents the descriptions and empirical results of the BDS, K-map and Z-map, Markov Chain, Duration Dependence, and Time Reversibility methodologies in details. Section 5 offers some concluding remarks.

2 Literature review

The literature in non-linear price behavior and bubbles in housing markets is relatively scant. Genesove and Mayer (2001) attributed the nonlinear behavior in housing prices in Boston to the sellers’ loss aversion. Seslen (2004) showed that households behave rational and react differently in response to the upturn and downturn in housing markets causing nonlinear behavior. Similar findings are reported by Kim and Bhattacharya (2009) where nonlinear in housing market is due to the asymmetric response to different phases of expansion and contraction. Muellbauer and Murphy (1997) also suggested that a huge lump sum transaction cost can cause nonlinearity and the appreciation of house prices. Study by Enders and Siklos (2001) showed that nonlinearity in house prices is due to asymmetric adjustment in the underlying determinants of housing value such as GDP and interest rates. This is confirmed by Skalin and Teräsvirta (2002) who showed that nonlinearity in housing price is driven by the nonlinearities in underlying macroeconomic variables. Nonlinearity in housing price is also substantiated by the superior forecasting performance of nonlinear techniques (Miles 2008; Trindade et al. 2010).

There is ample research in nonlinear price behavior that supports characteristics of housing market bubbles in the U.S. and other countries (Himmelberg et al. 2005; and McCarthy and Peach 2004). Shiller (2005) showed that U.S. housing market experienced bubbles. Belke and Marcel (2005) found that housing markets in 15 states showed the largest price increase consistent with bubble characteristics. Mikhed and Zemcik (2007) revealed that U.S. house prices and rents are not cointegrated, indicating the presence of a housing bubble. Huang (2013) showed possible existence of a housing bubble during 2001 to 2004 due to violation of relationship between housing volatility and realized returns. Hott (2012) further showed that investor’s herding behavior causes house prices in European and non-European OECD countries to fluctuate more than fundamentally justified, suggesting evidence of a bubble. Engsted et al. (2016) reported that 18 OECD countries except Germany and Italy experienced explosive housing market bubbles. Shi et al. (2016) found the housing bubbles in Australia before the 2008 global financial crisis. Besarria et al. (2018) implemented cointegration method between the house rental prices and selling prices and reported housing bubbles in Brazil. Asal (2019) compared long-run price with actual price of housing in Sweden and reported evidence of housing bubble in the early 2004.

In summary, the extant literature provides several pieces of evidence supporting the nonlinearity and bubbles in housing markets in U.S. and around the world. However, to the best of our knowledge, none of previous studies provides a comprehensive analysis of asymmetric price behavior that could lead to the possible chaos and price bubble in the U.S. housing market. The next section will describe our data and empirical approaches.

3 Data

This study uses monthly seasonally adjusted data of S&P Case-Shiller U.S. Home Price Index for national and 20 cities to form five regional housing markets of Midwest, Northeast, Southeast, Southwest, and Western regions. The time period studied is from January 1978 to March 2019 except the home price index for Detroit (1991:01), Atlanta (1991:01), Dallas (2000:02), Denver (2001:01), and Seattle (1990:01). The data are obtained from the Federal Reserve Bank of Saint Louis: https://fred.stlouisfed.org/. The continuously compounded monthly return of each housing index series is computed as, \({{\varvec{R}}}_{{\varvec{t}}}=100\boldsymbol{*}{\varvec{ln}}({{\varvec{P}}}_{{\varvec{t}}}/{{\varvec{P}}}_{{\varvec{t}}-1})\), where \({P}_{t}\) represents the housing index value at the end of the month t and \({P}_{t-1}\) is the prior month closing value. We also used four macroeconomic variables including economic conditions index (EC), interest rate (INT), inflation rate (INF), and U.S. population growth (PopGrowth) in the K-map and Z-map analysis. The casual relationship of these macroeconomic variables and housing markets behavior has been well documented by many studies (ie., Hepsen and Kalfa 2009; Leung and Ng 2019; Gallagher 2019; and Maynou et al. 2021).

The summary statistics for monthly U.S. aggregate housing returns and 20 cities are reported in Table 1. The housing market in Portland, OR generates the highest average returns of 0.6544%, while the housing market in Cleveland, OH has the lowest return of 0.2170%. The most volatile housing market is Las Vegas, NV which shows the highest variance of 1.2614%, while the house price changes of Charlotte, NC exhibit the least volatility of 0.4431%. All the housing markets exhibit a departure from normality as shown by significant skewness, kurtosis, and Jarque–Bera test statistics at the 1% significant level, except for New York. This is the first descriptive indication of the serial dependence in the higher moments in the housing returns in most of the cities and regions. Next section will examine this serial dependence in empirical settings.

Table 1 Summary statistics of monthly returns of S&P Case-Shiller U.S. national and city home price index, seasonally adjusted

4 Methodology and empirical results

We empirically test whether there exists some form of dependency in the return behavior of U.S. housing markets and such dependency can be explained by nonlinear, chaotic dynamics or bubble-like structures. First, the Brock-Dechert-Scheinkman (BDS) test (Brock et al. (1996)) is implemented to check for any dependency in the house prices for each city. To further uncover the nature of dependency, the housing return is first modelled as a linear autoregressive moving average (ARIMA) and then non-linear Generalized Autoregressive Conditional Heteroscedasticity (GARCH) and the residuals are subjected to BDS test to identify whether dependence is in linear or non-linear form. The application of the K-map and Z-map model developed by Larrain (1991) is subsequently conducted to test whether the nonlinear stochastic dependence in housing markets is consistent with chaos form, or it is driven by underlying macroeconomic variables. In complement with the BDS test, a Markov Chain test developed by McQueen and Thorley (1991) is also implemented to test for departure from random walk and predictable patterns in housing markets. As nonlinear dependency in time series is known to be one of characteristics of bubbles, a duration dependence developed by McQueen and Thorley (1991) test is exploited to test for bubbles in housing prices. To further identify the main driver of underlying nonlinearity in housing markets, a time reversibility developed by Ramsey and Rothman (1996) is employed to uncover whether the non-linear dependence is due to the functional form or non-Gaussian error terms. Such information is useful in delivering a superior fit model for forecasting pricing of U.S. housing markets.

4.1 BDS test

The BDS test is a commonly known test in detecting dependency in many financial time series. If a time series is generated by an IID process, the probability that the distance between any pair of observation is smaller than an arbitrary number should be the same for all pairs.

$$p\left(\left|X_t-X_{t+k}\right|<\varepsilon\right)=p\left(\left|X_t-X_{t+l}\right|<\varepsilon\right)=p_1(\varepsilon)$$
(1)

where k and Ɩ are arbitrary numbers. Therefore, for any IID financial time series, the joint probabilities that each pair of a sample will satisfy the condition can be expressed as:

$$\prod_{d=1}^mp\left(\left|X_{t+d}-X_{t+k+d}\right|<\varepsilon\right)=p_m\left(\varepsilon\right)=p_1^m(\varepsilon)$$
(2)

The BDS test statistic is calculated as follows:

$$W_m\left(l\right)=\frac{\sqrt T\left[p_m\left(\varepsilon\right)-p_1^m\left(\varepsilon\right)\right]}{\sigma_m\left(\varepsilon\right)}$$
(3)

The results of the BDS test for dependency in aggregate and cities housing index are reported in Table 2. The null hypothesis of no dependence is consistently rejected for housing returns in all cities at the 1% significant level, suggesting that U.S. housing prices do not follow a random walk. To further investigate whether the form of dependency could be attributable to linear or non-linear structures, the residuals from the fitted linear ARIMA model and fitted nonlinear GARCH are subjected to the BDS tests. If dependency is due to the linearity or nonlinearity in the model, fitting ARIMA or GARCH models to the housing returns should remove linear or nonlinear dependency and therefore, residuals should be IID. However, the BDS test on the residuals from ARIMA model continues to reject the null hypothesis of no dependence across all cities and regions at the 1% significance level, suggesting that the use of autoregressive moving model does not help to remove any potential linear dependence. To further explore the structure of dependency, the residuals from fitted nonlinear GARCH model for each city are tested for possible nonlinear dependency via the BDS test.

Table 2 The BDS test statistics on S&P Case-Shiller U.S. national and city home price index, seasonally adjusted

Table 2 reveals the estimated values of the BDS test on the GARCH residuals from each series for embedding dimensions (m) from 2 to 6. The null hypothesis of IID in residuals is still unequivocally rejected for overall U.S. housing markets and 20 cities at all dimensions at the 1% significance level.

The findings of non-IID in the residuals from nonlinear GARCH filtered series virtually suggest that the pricing behavior of overall U.S. housing markets and 20 cities could possibly be generated by a nonlinear model. Our results are in line with current literature of nonlinear dependence in housing markets (Engelhardt 2003; Seslen 2004; Kim and Bhattacharya 2009).

4.2 K-map and Z-map analysis

Although the BDS test is a popular test for detecting dependency in a time series, the pitfall of the test is that it will reject IID if the time series is chaotic and does not converge. However, Hsieh (1991) pointed out that a chaotic time series can be generated by nonlinear deterministic process which may look random, but not all non-linear dynamics exhibit a chaotic behavior. According to May (1976), a time series \({\mathrm{X}}_{\mathrm{t}+1}\), \({\mathrm{where\;X}}_{\mathrm{t}+1}=a{\mathrm{X}}_{\mathrm{t}}\left(1-{\mathrm{X}}_{\mathrm{t}}\right)\) will be a general nonlinear process for most values of \(a\). However, for the \(\mathrm{values\;of}\;a\) between 3.57 and 3.8, the process will behave like a chaos. This illustrates the point that chaos is only a small subset of nonlinear process.Footnote 1

The K-map and Z-map analysis is further conducted to determine whether the dependency in the housing markets is specifically driven by a chaotic structure. Following the study by Larrain (1991), non-linear components in the housing returns are modelled as K-map to characterize chaotic behaviors. The linear behavior components in the returns of housing are modelled as Z-map to include four macro-economic factors such as economic condition index (EC), inflation (CPI), interest rate (INT), and population growth (PopGrowth).Footnote 2 It is more logical to use more state-level heterogeneity controls in the K-map and Z-map analysis. We, therefore, include economic condition indices (EC) which measure average economic growth in each metropolitan statistical area (MSAs). These indices are first constructed by Arias et al. (2016) and include 15 variables to gauge various aspects of economic activities in the MSAs.Footnote 3 The inclusion of broader variables in economic condition indices in the MSAs makes it a viable option as it defines Gross Metropolitan Product Growth potentially affecting home price across regions. This allows for a unified comparison across metro areas.

These four macroeconomic variables are used in this chaos test to examine whether past nonlinear housing returns and macroeconomic variables exhibit stable or chaotic effects on future behaviors of overall and each city U.S. housing prices. Studies by Peng (2016) showed that unemployment rate has a negative effect on housing markets. Similarly, interest rate also negatively affects the returns on housing markets (Peng and Tsai 2019; Stevenson 2008). Lee et al. (2017) showed that the causal relationship between inflation and housing price behavior is positive. Lastly, the increase in population growth positively affects the returns on housing markets (Stevenson and Young 2014; Otto 2007).

The K-map and Z-map of the housing returns can be modelled as follows:

$$R_t=c+\beta_1R_{t-1}^1+\beta_2R_{t-1}^2+\beta_3R_{t-1}^3+\beta_4R_{t-1}^4+\beta_5\left(EC\right)+\beta_6\left(Interest\right)+\beta_7\left(Inflation\right)+\beta_8\left(PopGrowth\right)+\varepsilon_t$$
(4)

If the dynamics of housing market is driven mainly by erratic and chaotic behavior, this will result in the more significant and powerful of the estimated coefficients on the K-map nonlinear return variables (c, \({\beta }_{1},{\beta }_{2},{\beta }_{3},\) and \({\beta }_{4})\) when compared with the estimated coefficients of the Z-map linear macroeconomic variables (\({\beta }_{5,}{\beta }_{6},{\beta }_{7,}\) and \({\beta }_{8})\).

The first step in testing for chaos in housing markets is to determine the best degree of non-linearity in housing returns in the K-map model using a stepwise regression. The ordinary least square regression is then performed on the full model that is composed of significant non-linear K-map components and linear fundamental economic Z-map components. Table 3 showed that the nonlinear K-map coefficients are dominant for the U.S. national housing index and for all 20 cities. They are more significant than coefficients of Z-map with larger magnitude. This result indicates that home price index in all cities across five regions are driven by chaos. Surprisingly, economics condition index is the dominant and significant factor in the Z-map analysis in most of the cities. Population growth does not have any explanatory power in pricing behavior of housing in any cities. The home price index in Illinois, Dallas, San Francisco, and Seattle cannot be explained by the movement of any of four macroeconomic variables. These findings validate that nonlinear stochastic dependence in housing markets in most of the cities are predominantly driven by chaotic behavior but not the underlying macroeconomic variables.

Table 3 K-map and Z-map anlaysis on S&P Case-Shiller U.S. national and city home price index, seasonally adjusted

4.3 Second-order Markov chain

The second order Markov chain test developed by McQueen and Thorley (1991) is further conducted to detect for nonlinear predictable components in the U.S. house price index. This test examines for nonlinear dependence based on price behavior occurred in the past two periods. The structure of the Markov chain test is to test the null hypothesis that house prices follow a random walk. If the housing markets follow a random walk, the probability of observing a positive or negative return in the current period should be invariant to what occurred in previous states.

A two-state second-order Markov process can be constructed by first defining It process as follows:

$${{\varvec{I}}}_{{\varvec{t}}}=\left\{\begin{array}{c}1, {{\varvec{R}}}_{{\varvec{t}}}\ge 0\\ 0, {{\varvec{R}}}_{{\varvec{t}}}<0\end{array}\right.$$

The It process is then translated into a two-order transition probability ij) as shown follows:

$$\lambda_{ij}=Prob\left(\left.I_t=0\right|I_{t-2}=i,I_{t-1}=j\right)$$
(5)

where i and j can take the value of either 1 or 0. To illustrate, λ00 is the probability that a negative return will continue to persist in the current period given two preceding negative returns. Consequently, (1-λ00) is the probability that a sequence of two negative returns in prior states will revert to a positive return in the current period.

The random walk hypothesis postulates that the chance of either state 0 (It = 0) or state 1 (It = 1) occurrence in the current period should be invariant to any prior two-state sequence. Thus, a rejection of the null hypothesis is an indication of the presence of nonlinear dependence in the housing markets. Seven null hypotheses of equal transition probabilities for different states can be formed. Let \(\lambda\) is the set of all possible probabilities of having a negative return given the past two periods,\(\lambda =\left\{{\lambda }_{00}, {\lambda }_{01 }, {\lambda }_{10}, {\lambda }_{11}\right\}\). Define H as the set of null hypotheses to be tested \(H=\left(\genfrac{}{}{0pt}{}{\lambda }{2}\right)\) which is the combination of \(\lambda\) by two. \({H}_{1-6 null}=\left\{{\lambda }_{00}={\lambda }_{11 }, {\lambda }_{00}={\lambda }_{10 },\dots ,{\lambda }_{10}={\lambda }_{11}\right\}\) versus \({H}_{1-6 alternative}=\left\{{\lambda }_{00}\ne {\lambda }_{11 }, {\lambda }_{00}\ne {\lambda }_{10 },\dots ,{\lambda }_{10}\ne {\lambda }_{11 }\right\}.\) The more restrictive null hypothesis to test the equality of four probabilities is \({H}_{7,null}:{\lambda }_{00}={\lambda }_{01 }={\lambda }_{10}={\lambda }_{11 }versus {H}_{7,alt}:{{\lambda }_{00}\ne {\lambda }_{01 }\ne \lambda }_{10}\ne {\lambda }_{11 }.\)

The less restrictive null hypotheses 1 to 6 posit that the probability of observing negative returns in the current period should be the same regardless of what happened in the prior two periods. For more restrictive null hypothesis 7, the probability of a negative or positive occurrence in the current period should be same independent of pattern happened in prior two periods. To test each null hypothesis, the log likelihood ratio test is then calculated based on restricted transition probability vs. unrestricted transition probability.

As reported in Table 4, the LRT consistently rejects seven null hypotheses of randomness in the U.S. housing markets and 19 cities at the traditional significant level. This implies that U.S. housing markets exhibit a non-random walk pattern due to an inherent nonlinear dependence in housing price. Only housing market in Phoenix, AZ exhibits a random price pattern during 1987 to 2019.

Table 4 Markov chain tests for monthy returns on S&P Case-Shiller national and city home price index, seasonally adjusted

The estimated values of \({\widehat{\lambda }}_{00}\) are larger than those of \(\left(1-{\widehat{\lambda }}_{00}\right)\) for all 20 cities, pointing to the persistence of negative returns. For example, 83.30% \(\left({\widehat{\lambda }}_{00}\right)\) of the time the overall U.S. housing market exhibits three consecutive negative returns, while only 16.70% \(\left(1-{\widehat{\lambda }}_{00}\right)\) of the time the housing market will revert to positive returns after two negative returns. In other words, the U.S. housing markets in each city tend to show continued price depreciation or persistence of negative returns. Similarly, the smaller estimated values of \({\widehat{\lambda }}_{11}\) than \(\left(1-{\widehat{\lambda }}_{11}\right)\) for home markets in all cities suggest the likelihood of a house price appreciation for three consecutive periods. For overall U.S. housing markets, there is a 93.40% \(\left(1-{\widehat{\lambda }}_{11}\right)\) chance that a positive house price change continues to occur in the current period given prior two positive prices change. This is significantly higher than the probability of observing negative price change in current period after two positive price changes of 6.60% (\({\widehat{\lambda }}_{11}\)). Overall, these transition probabilities point to an existence of predictable components in the U.S. housing markets across five regions where negative (positive) return is more likely to occur after two consecutive sequences of negative (positive) returns. The empirical findings of a persistent pattern in positive returns indicate the possibility of a bubble in the U.S. housing markets.

4.4 Duration dependence test

A housing bubble is a phenomenon where there is a continued rise in demand driving up property price to an unsustainable level. One of the characteristics of a bubble is the tendency of a positive run up in prices causing positive returns to persist. This bubble feature can be captured by the application of a duration dependence test.

The implementation of duration dependence developed by McQueen and Thorley (1994) to detect for housing bubbles is unique to this study because it does not take into account the fundamental variables such as rental incomes as it commonly used in cointegration test for bubbles. The main premise of the cointegration test is that an existence of long-run relationship between assets value and their underlying fundamental variables would be evidence against the presence of bubbles. The problem of cointegration test is that it requires the correct identification of the fundamental variables that explain the movement of underlying asset values.Footnote 4 The duration dependence tests for a bubble by examining the relationship between positive returns and its length. This test differs from cointegration tests in that it does not require a prior correct identification of underlying fundamental factors, therefore, it provides a superior advantage of not testing the joint null hypothesis of no bubbles and no model misspecification.Footnote 5

The duration dependence test simply examines duration of the house price increases and the likelihood that it will revert to a price decline when a bubble bursts. Putting it differently, the presence of a rational bubble would imply that the probability of obtaining a negative return (a bubble bursts) should decrease as the length of positive return increases (a bubble grows). If such rule breaks, a bubble cannot thrive in the market, which implies that the hazard rate should be negative.

The first step for a duration dependence test is to divide the housing returns into a group of positive and negative returns. A run in this case is a sequence of positive or negative returns which is defined as follows.

$$R_k\in\left\{X_i<0\vert X_{i-1}>0,\dots,X_{i-k}>0,X_{i-k-1}<0\right\}\forall i\leq\;n\;and\;k\leq i$$
(6)

where Xt is a time series with n abnormal returns and Rk is a positive run with length k. The numbers of positive or negative runs at a particular length k are counted. Theoretically, the hazard rate (hi) or probability that a bubble will burst should decline as the length of the positive run increases to support the survival of a rational bubble. The hazard rate is then expressed as a function of log of the lag length:

$$h_i=\frac1{1+e^{-d_i}}$$
(7)
$$d_i=\alpha+\beta Lni,$$
(8)

The relationship between the likelihood that a run will end \({(d}_{i})\) and lag length (i) is investigated through the estimated \(\beta\) parameter obtained via the logit regression. A likelihood ratio test is then carried out to test the null hypothesis of \(\beta\) = 0 (no bubble). An existence of a bubble would result in \(\beta\) < 0 or a negative hazard rate.

Table 5 shows that the LRT consistently rejects the null hypothesis of no bubble or a constant hazard rate \((\beta\) = 0) in favor of \(\beta\) < 0 for all housing markets across four regions at the 1% level. Only housing markets in Detroit, MI and Minneapolis, MN in Midwest region do not experience the bubbles. For illustration purpose, the logit regression yields \(\beta =\)-0.6520 for Boston, MA, -0.6009 for New York, NY, and -0.5441 for Washington, DC, which indicate a negative relationship between the duration of the run and the probability of a run to end. As the housing price continues to appreciate in value, the probability that it will revert to a depreciation in value will diminish in support of a growing bubble. It is interesting to note that when a duration dependence test was conducted on the S&P Case-Shiller U.S. national home price index, we fail to report evidence of housing bubbles. One plausible explanation where we failed to find evidence of bubbles in U.S. national home price but reported strong evidence of bubbles in 18 cities could be attributable to the loss of information resulting in less variation when the data is aggregated.Footnote 6 Acemoglu et al. (2007) explained how the volatility from individual data series can pass to the aggregate data and lot of details could be masked.Footnote 7

Table 5 The duration dependence test on positive runs on monthy returns on S&P Case-Shiller national and city home price index, seasonally adjusted

4.5 Time reversibility test

We further investigate the underlying source of nonlinearity that characterizes each housing market. Discovering the true cause of nonlinearity is crucial as it provides a blueprint for policymakers in developing an appropriate forecasting model for each housing market that could lead to a better policy control. In doing so, the non-linear time reversibility test (TR) developed by Ramsey and Rothman (1996) is conducted to enhance our understanding of mechanism driving housing dynamics. The truth about time reversibility test is that nonlinear dynamics in financial times may be stirred by either nonlinear asymmetry in the original functional form or the asymmetry in the innovation process.

Conducting time reversibility test requires an initial understanding of symmetric behavior of time series with respect to the time. The time reversibility test is built on the notion that for any times series that is symmetric, reversing the time axis will not alter the behaviors of the symmetric series. In such case, a symmetric time series is also time reversible. A rejection of time reversibility in the housing returns would indicate the presence of asymmetric patterns. Such pattern is consistent with the non-random walk behaviors.

According to Ramsey and Rothman (1996), a time series {Xt} is time reversible if \(E\left[{X}_{t}^{i}*{X}_{t-k}^{j}\right]=E\left[{X}_{t}^{j}*{X}_{t-k}^{i}\right]\) for all i, j, k ϵ N, and k is the lag periods. Once the time reversible series is formed, a bicovariance function \({(\gamma }_{\mathrm{2,1}}\left(k\right))\) for i = 1 and j = 2, is calculated as follows:

$${\gamma }_{\mathrm{2,1}}\left(k\right)=\left\{E\left[{X}_{t}^{2}*{X}_{t-k}\right]-E\left[{X}_{t}*{X}_{t-k}^{2}\right]\right\}$$
(9)

Under the null hypothesis (H0) of {Xt} is time reversible, the expected values of sample estimate of a bicovariance function of \({\widehat{\gamma }}_{\mathrm{2,1}}\left(k\right)\) are zero for all lags k. The sample estimate of this bicovariance function is then calculated based on the following formula:

$${\widehat{\upgamma }}_{\mathrm{2,1}}\left(\mathrm{k}\right)=\frac{\sum_{\mathrm{t}=\mathrm{k}+1}^{\mathrm{T}}{\mathrm{X}}_{\mathrm{t}}^{2}*{\mathrm{X}}_{\mathrm{t}-\mathrm{k}}-\sum_{\mathrm{t}=\mathrm{k}+1}^{\mathrm{T}}{\mathrm{X}}_{\mathrm{t}}*{\mathrm{X}}_{\mathrm{t}-\mathrm{k}}^{2}}{\mathrm{T}-\mathrm{k}}$$
(10)

The standardized TR test statistic, \(\frac{{\widehat{\gamma }}_{2.1}(\mathrm{k})}{\mathrm{VAR}[{\widehat{\gamma }}_{2.1}(\mathrm{k}){]}^{1/2}}\), which is the ratio of sample estimate of a bicovariance function to its standard deviation is next computed.

To further identify the critical values for the standardized TR test statistics, the Monte Carlo simulation is implemented. The first step in Monte Carlo simulation is to identify fitted ARIMA model for each housing market series. Based upon the fitted ARMA model, a Monte Carlo simulation is conducted to generate 1,000 values of \({\widehat{\gamma }}_{2.1}\left(k\right)\) along with its corresponding standard deviations for each lag k. The null hypothesis of time reversibility for each housing market is jointly tested for all lags 1 to 10 based on the following TR portmanteau statistic, which is distributed as \({\chi }^{2}\) with n-m + 1 degrees of freedom.

$${P}_{m,n}=\sum_{k=m}^{n}{\left(\frac{{\widehat{\gamma }}_{\mathrm{2,1}}\left(k\right)}{{\widehat{\sigma }}_{\gamma }}\right)}^{2}$$
(11)

If the null hypothesis of time reversible or symmetry in housing price is rejected, we can further identify whether asymmetry is inherent in functional form or in the innovation of data generating process. Based on Ramsey and Rothman (1996), the sources of time irreversibility come in two different forms. The first form of irreversibility is induced by non-linear functional form as opposed to non-Gaussian innovations. This is referred to “Type I time irreversibility” in which a nonlinear model with Gaussian innovations is the appropriate model. The second form of irreversibility is caused by non-Gaussian innovations as opposed to non-linearity in functional form. This is called “Type II time irreversibility.” In such case, the suitable model for housing markets will be a linear model with non-Gaussian innovations. Differentiating between two types of asymmetries is often overlooked yet remains utmost important for developing a correct forecasting model for any time series. These two types of time irreversibility can be differentiated by performing TR test on the standardized TR test statistics on the ARIMA residuals. The TR test statistics, \({\gamma }_{\mathrm{2,1}}\left(k\right)\) are calculated using residuals from fitted model and standardized by their standard deviation which is calculated as follows:

$$var\left[{\widehat\gamma}_{2,1}\left(k\right)\right]^{1/2}=2\left(\mu_4\mu_2-\mu_3^2\right)/\left(T-k\right)-2\mu_2^3\left(T-2k\right)/\left(T-k\right)^2$$
(12)

where \({\mu }_{3}=E\left[{X}_{t}^{2}\right],{\mu }_{3}=E\left[{X}_{t}^{3}\right],\) and \({\mu }_{4}=E\left[{X}_{t}^{4}\right].\) When the null hypothesis of time reversibility is rejected under both the raw housing returns and ARMA residuals, one can conclude that asymmetric behavior is caused by the non-linearity in the functional form of the model. The logic is that fitting ARIMA to the housing model should remove the linear dependency in the housing return dynamics. On contrary, if pattern of housing returns is time irreversible, but the ARIMA residuals fail to do so, one can conclude that such asymmetric behavior is caused by asymmetry in non-Gaussian innovations.

The empirical TR test results reported in Table 6 reveal that housing markets in nine out of twenty cities exhibit a time irreversible or asymmetric pattern. The TR portmanteau test statistics reject the null hypothesis of time reversible for these housing markets at the 1% significance level. It is interesting to note that all housing markets in the Midwest region show strong evidence of time irreversible pattern. These cities are Cleveland, OH, Detroit, MI, Illinois, CH, and Minneapolis, MN. This is also the case for housing markets in Miami and Tampa, FL in Midwest region, Atlanta, GA and Charlotte, NC in Southeast region, Dallas, TX in Southwest region, and San Diego and San Francisco in Western region. However, when the TR test is performed on the residuals from the ARIMA model, the null hypothesis of time reversible cannot be rejected for the entire U.S. housing market and any city as reported in Table 7. The finding of asymmetric or time irreversible behavior in housing markets in nine cities, but not in the residuals suggests that such asymmetry is driven by non-Gaussian innovations of the housing returns, which is consistent with Type II time irreversibility. Therefore, the appropriate-designed housing market model for these nine cities will be a linear model with non-Gaussian error terms.

Table 6 Standardized time reversibility test statistics for S&P Case-Shiller U.S. national and city home price index, seasonally adjusted
Table 7 Standardized time reversibility test statistics for ARMA residuals for S&P Case-Shiller national and city home price index

5 Conclusions

This study provides a comprehensive analysis of overall U.S. housing market dynamics in twenty cities across five census regions during 1987 and 2019. A battery of tests is employed to discover the nonlinear nature in U.S. housing markets and whether such nonlinearity is in the chaos form and consistent with a characteristic of bubbles. The source of nonlinearity is also identified in response to the need of designing an appropriate model for forecasting housing markets.

The overall U.S. housing markets including home markets in twenty cities exhibit some nonlinear serial dependence. The nonlinear dependence is driven by erratic and chaotic behavior, not by the movement of underlying macroeconomic variables. Persistence of positive or negative returns tends to characterize U.S. house price pattern in various cities except for Phoenix, AZ. Nonlinear behavior in housing markets in eighteen cities is consistent with bubbles, except for home markets in Detroit, MI, Minneapolis, MN as well as U.S. national home market. Housing price patterns in nine cities are time irreversible or asymmetric. Such asymmetric behavior is caused by asymmetry in innovations, not in the functional form. These housing markets are Cleveland, OH, Detroit, MI, Illinois, CH, Minneapolis, MN, Atlanta, GA Charlotte, NC, Dallas, TX, and San Diego and San Francisco, CA. The finding here suggests that the appropriate model for these markets is a linear model with non-Gaussian innovations.

With the overwhelming evidence of nonlinear chaotic process in the U.S. housing markets coupled with the predictable components and bubble episodes, the overall findings here imply that U.S. housing market is relatively inefficient. In addition, macroeconomic variables have little or no explanatory power in predicting house price movements. Similar nonlinear chaotic bubble-like characteristics are shown to explain the behavior of housing markets in most of the cities in five census regions suggesting that U.S. housing prices seem to behave uniformly across regions. As for researchers and policymakers, the consistent finding of nonlinear behavior warrants the use of nonlinear time series model to accurately forecast the house prices at the aggregate as well as city levels. The most notable finding here provides some warning for forecasters and policymakers against relying on the empirical finding solely on the aggregate index. While city home prices are driven by a bubble-like process, the national home index does not have a bubble due to diversification effect across cities. The next logical extension is to develop a model that has the features determined in this study to help successfully predict house prices and potential bubble occurrence.