Introduction

Real estate is a popular investment vehicle for many individual investors. One 2012 national survey estimates that one out of eight American adults own real estate investment property.Footnote 1 Home Mortgage Disclosure Act (HMDA) data suggests that about 10% of home mortgages taken out in 2010 were used to purchase investment property.Footnote 2 Despite the importance of real estate as an investment vehicle, little is known about the performance of real estate as an investment for individuals. This paper presents a new perspective of real estate investments as viewed through the lens of “retail” investors in the residential market. Specifically, we characterize the risk and returns of individual investors who hold multiple residential properties in their investment portfolios. The size of the residential market was $31.8 trillion in 2017, making it an important asset class.Footnote 3 Our paper sheds light on the role of investors in the residential market.

Past work primarily focuses on the role of housing/real estate investments in an individual’s portfolio among other asset classes. For example, Cocco (2005) shows that housing investments crowds out stock market participation. We diverge and track property investments besides an individual’s primary home. This distinction is crucial as consumption related housing decisions may distort optimal investment decisions. While we ignore investors’ other assets such as bonds, stocks, etc., we focus on the entire housing portfolio and relate the returns on such investments to underlying economic factors. Thus, our advantage is to track entire real estate investment portfolios and study factors influencing returns.

In this paper, we examine the performance of real estate investments by individuals. If the market for real estate is efficient, we would expect the price appreciation experienced by real estate investors to be similar to the appreciation of broad-based real estate indices. On the other hand, there are reasons to believe that real estate investors may outperform real estate indices. First, people buy most houses and condominiums to live in them. Likely returns on the property may take a backseat to the amenities it offers and corresponding consumption related decisions. In contrast, investors are less likely to have a sentimental attachment to a property or to value anything other than the return the property generates. Second, many house buyers have little control over the timing of their transactions. The start of a new job or the start of a school year may dictate the timing of purchases and sales. Investors, on the other hand, can time their purchases to provide liquidity and their sales to take advantage of liquidity. Third, real estate investors may have knowledge of local market conditions and negotiating skills that will allow them to outperform indices.

Another view of real estate investors is that they may be prone to behavioral biases that lead to poor investment performance. Bayer et al. (2016) examine the role of contagion in real estate investment. The likelihood that an individual enters the real estate market as an investor increases by about 20% if a nearby house is flipped or if a neighbor starts investing in real estate. Individuals who begin investing in real estate as a result of contagion pay more for property relative to the expected price, earn lower returns, and are more likely to default on a mortgage. Bayer et al. (2011) find that real estate investors in Los Angeles failed to anticipate the market’s peak and “may have contributed to local housing price bubbles.”

In this paper, we examine returns (price appreciation) for real estate investments by individuals in 14 states over 1999–2015.Footnote 4 As a whole, real estate investors outperform real estate indices. Regression estimates indicate that they earn positive returns when indices are unchanged, and that investors’ returns are more sensitive to positive index returns than to negative index returns. It seems likely that real estate investors earn higher returns than other homebuyers in part because investors are motivated to buy solely by a property’s investment potential and not its amenities. That might also explain why investors earn significantly smaller returns on their own homes than on their investment properties.

Different real estate investors should differ in their ability to earn abnormal returns in predictable ways. An experienced investor is likely to be better at selecting properties and negotiating prices. A local investor may have knowledge about the local real estate market that distant investors do not possess. Local investors may be able to see properties more quickly and therefore be better able to take advantage of opportunities when they arise. Hence an investor who lives in an area is likely to do better on investments than an investor who lives farther away. Finally, investors who can pay cash may be in a better negotiating position than those who need to wait for a mortgage to be approved. Cash buyers may be able to move more quickly than other investors.

We show that investors earn larger returns if they live in the same three-digit zip code as the property that they buy. The more properties an investor owned before buying a new property, the greater the return on the new property. Buyers who are able to pay cash earn greater returns than those that take out a mortgage to buy property. These results are very robust. They hold across states and over time.

One reason that these investor characteristics are associated with greater returns is that they are associated with lower purchase prices for property. For each purchase by an investor, we calculate the ratio of the purchase price to the assessed value. This ratio is significantly lower, meaning that the investor pays less relative to the assessed value if the investor pays cash, lives in the same three-digit zip code as the property and already owns investment properties.

We present a series of robustness tests that confirm that our results are robust to alternate explanations. First, we measure price appreciation as a proxy for returns. This implicitly does not account for any property improvements and rents potentially earned. To alleviate concerns relating to this, we consider only land and mobile home transactions and re-do the analyses. Land and mobile homes arguably do not involve improvements and rental streams of income. Our results and interpretations persist for such transactions, thereby alleviating concerns that lack of improvement expenditure and rental data may be skewing our findings. In addition, we find similar results for newer properties where renovations are likely to play a minor role. Secondly, we measure returns using repeat sales. This raises the concern that investors may hold winners and divest losers. We test for differential rates of assessment related appreciation and note that properties that were sold off have similar appreciation rates as those that were held in portfolio.

We contribute to the body of literature on real estate as an asset class by focusing on investment in residential real estate by retail investors. Given the depth in our sample, we are able to characterize entire portfolios of residential real estate investments and identify the role of non-consumption related objectives, experience, equity positions and information asymmetry in generating excess returns. To the extent that our results are generalizable, our results imply that investors generate superior returns.

The rest of the paper is organized as follows. Section 2 provides a review of the literature. Section 3 discusses our data. Section 4 examines investor returns on real estate investments. Section 5 shows that investors who live near properties, pay cash, and have purchased property before pay less for real estate. Section 6 provides robustness tests. Section 7 shows that leading up to peak real estate prices in 2006–2007, real estate investors were less likely to pay cash and less likely to live near the property they bought. We draw conclusions and summarize our findings in Section 8.

Literature Review

Typically, past work focuses on either real estate as a trade-off between consumption and investment or characterizes the effect of housing investment within an individual’s overall portfolio. For example, Flavin and Yamashita (2002) study optimal portfolio allocation across various assets when individuals invest in housing to obtain consumption related benefits. Following a mean-variance efficiency framework, the authors theorize that the level of stock holdings in portfolios is contingent on the ratio of housing to overall net worth. Thus, varying levels of investment in stocks across younger and older households can be explained by the ratio of housing to net worth and the corresponding risk. More recently, Chetty et al. (2017) characterize the varying effects of equity and mortgage debt positions on an individual’s overall portfolio. Through a portfolio choice framework, the primary prediction is that increases in property value (expressed as a sum of mortgage debt and home equity) reduce stock holdings, whereas increases in home equity increases stock holdings. While we ignore an investor’s other assets such as bonds, stocks, etc., we focus on the entire housing portfolio and relate the returns on such investments to underlying factors. In contrast to the literature on housing as a trade-off between investment and consumption, our primary focus is on acquisition of residential properties for investments. In addition, within investor portfolios, we do differentiate the returns of properties that have a consumption objective.

Closest to our paper is a stream of literature that examines purchases beyond primary homes. Additional homes can be identified among individuals’ portfolios either from debt records or by examining property transactions. Using debt related data, Li (2015) depicts an increase in the share of new mortgages by households with more than one first lien from 20% in 2000 to 35% in 2006–07. Furthermore, the author documents that the rise was even more prominent in the sun-belt states, from 20% in 2000 to 45% in 2006–07. Additionally, Gao and Li (2012) study mortgage applications and originations from the Home Mortgage Disclosure Act (HMDA) to examine the prevalence of real estate investors. Prior to 2000, about 5% of mortgage applications were for purchase of investment property. The proportion of mortgage applications for investment property increased steadily until 2005–2006 when it peaked at around 14%. The percentage was higher in the bubble states, particularly Florida, where it reached 25% in 2005.

Next, similar to our paper, a contemporaneous stream of literature uses data on residential property transactions. For example, Giacoletti and Westrupp (2017) examine the returns to real estate flippers in four California cities, Los Angeles, Sacramento, San Diego and San Francisco over the period of 1998 to 2013. In this study, a house is considered to be flipped if it is bought and sold within 12 months, whereas a flipper is considered a professional if he or she bought and sold at least two houses in a two-year period. Besides real estate transactions and recording sale characteristics, the authors use a proprietary dataset of home remodeling permits from Buildzoom to compute excess returns. The results imply that flippers, on average, earn positive abnormal returns and there are large differences in returns across individual houses. In addition, most flippers flip just a few properties per year and one loss can negate any gains. We contribute by looking at the entire portfolios of longer term investors across 14 states and relate returns to underlying characteristics, thereby testing for skill of longer term investors in contrast to flippers in Giacoletti and Westrupp (2017).

Our work is also related to recent literature identifying the role of information asymmetry in housing markets. Kurlat and Stroebel (2015) study the differences in information about neighborhood characteristics among buyers and sellers. The authors contrast transactions of real estate professionals with other “less informed” market participants and show that the presence of informed sellers in a market predicts subsequent house price declines. Additionally, this effect is shown to be larger for areas that have a greater exposure to neighborhood characteristics as informed sellers capitalize and liquidate assets. Chinco and Mayer (2016) examine the impact of out-of-town second home buyers who have relatively less information on housing prices than local buyers. Second home buyers are identified by comparing the address used for tax bills with the property address. Here, distance is a friction. The authors present evidence that out-of-town homebuyers act as misinformed speculators and their presence predicts house price appreciation rates during the financial crisis period. We add to this information based literature and identify the role of distance on investor returns.

Mills et al. (2017) present empirical evidence on buy-to-rent institutional investors in the U.S. residential market. The authors identify buy-to-rent investors from anecdotal sources and present evidence suggesting that presence of such institutional investors is related to an increase in house prices. Additionally, Allen et al. (2018) examine residential purchases in Miami-Dade County and summarize that institutional investors purchase at a discount and their presence is associated with improvement in neighborhood property prices. While examining institutional participants is beyond the scope of this paper, we focus on retail counterparts that have a similar investment objective.

Recent work also identifies the contagion effect of investors. Bayer et al. (2016) study the effect of a set of investors on other nearby individuals. The authors find that novice/inexperienced investors entered housing markets after observing other nearby investors. Furthermore, the study presents evidence that such novice investors underperform relative to others, and thus document a negative externality in housing markets. In contrast, DeFusco et al. (2017) study the spatial spillover of housing market booms and do not find evidence of boom related contagion across markets. However, the authors find evidence of price spillovers in markets neighboring areas that are experiencing a boom. While our work focuses on the returns and characteristics of investors’ specific housing portfolios, questions on externalities arise.

Finally, our work complements a recent study by Peng and Zhang (2019) that relates property type and houses’ stock market betas. The authors analyze the systematic risk in individual houses and highlight that stock market betas vary for different house types, i.e. for differently priced houses. In contrast, our paper adds to the literature on micro level housing dynamics by relating investor characteristics and returns.

Data

Our primary dataset comprises residential transactions across 14 U.S. states obtained from Zillow who secure such records from county offices. We use transactions from 1999 through 2015. For each real estate transaction, we have the date of the purchase, the name of the buyer and seller, the mailing address the buyer uses to receive tax bills, the price paid for the property, the amount of the mortgage and some mortgage characteristics. We identify investors’ portfolios by matching their name, address and track properties’ purchase and subsequent divestment. For each property, we have the street address, county, city, zip code, state, latitude and longitude and a brief description that we use to classify the property as a house (single-family dwelling), condominium, apartment, mobile home, recreational property, or land. We also know the year an apartment or house was built and house characteristics like the number of bedrooms and bathrooms and the square footage. For most properties, we have annual total assessed values and separate annual assessed values for land and buildings. Assessed values are most complete for 2002–2014, so tests involving assessed values use those years.Footnote 5

We identify investors as individuals who own two or more properties in the same state. We avoid including individuals who have moved to a new house while waiting to sell the old one and are potentially using the sale proceeds for purchase. We require that the ownership of the property overlap for at least six months and both the name and the home address of the investor have to match exactly for different properties. By matching on both the name and home address we avoid incorrectly assuming two different people with the same name are one investor. As an example, a perusal of the Los Angeles white pages reveals that there are well over 100 Jose Garcias in the metropolitan area. In this case, matching on names alone could result in counting real estate purchases as investments when they were purchases of primary residences by individuals with the same name. Our methodology does miss some investors though. For instance, if names on deeds do not match exactly, we do not include the investor. So, if the same person is listed as Elvis Presley on one deed and Elvis A. Presley on a second, we would assume they are two different people. Likewise, addresses must match exactly. Our requirement that the individual own two or more properties in the same state means that we will miss an investor who owns, for example, one property in New York and another in New Jersey. Overall, our methodology undercounts the number of retail investors and investments.

We restrict our attention to fourteen states. We include the “bubble” or “sand” states of Arizona, California, Florida, and Nevada. These states include the metropolitan areas that had the largest run-up of prices in 2003–2006 and suffered the largest decline in prices afterwards. The other states include Georgia, Illinois, Michigan, Minnesota, North Carolina, New Jersey, New York, Ohio, Pennsylvania, and Texas. These states are selected so that the sample includes the ten most populous states and also a number of Midwestern states that avoided the large price run-ups and declines of the bubble states. In total, the 14 states in our sample accounted for over 60% of the U.S. population in 2010. Zillow assembles data on a county-by-county basis, and appears to have begun collecting data for heavily populated counties first. The omitted states include many with small or mainly rural populations. Hence, data for these states is sparse in the early years of our sample period. Most studies that have looked at real estate investors have restricted their attention to one or a handful of large metropolitan areas. By focusing on states, we include smaller cities like Reno, Nevada and Fresno, California.

We obtain monthly Zillow index values for each zip code. Zillow calculates indices for all homes, for single-family houses, for condos and for various house sizes and prices. The Zillow index reflects the median home value in an area. Each month Zillow estimates prices for every home in an area. In a given month, most houses are not sold, of course. Zillow uses hedonic models that incorporate characteristics of the property and land, tax assessment information, and prior sale information to assign a value. The index value is then adjusted for residual systematic error. Seasonal adjustments are also made.

Figure 1 depicts price indices for bubble and non-bubble states from 1999 through 2015. We calculate the average percentage price change for bubble states and other states by taking a simple average of the Zillow zip code price change across all zip codes. The starting value of each index is set to 100 and then changed by the average percentage price change each month. For both the bubble and non-bubble states, prices rise steadily from 1999 to 2007, drop until 2012 and then rise again through 2015. The magnitude of the changes, however, are very different. Prices rise about 160% between 1999 and the peak of the housing bubble for bubble states. For the other states, the total increase is only about 60%. The decline from house price peaks to their trough in 2012 is also much steeper for the bubble states. Because real estate prices behaved so differently in bubble and non-bubble states, we analyze returns to investors in these states separately.

Fig. 1
figure 1

Price indices for residential property in bubble and non-bubble statesBubble states are Arizona, California, Florida, and Nevada. Non-bubble states are Georgia, Illinois, Michigan, Minnesota, North Carolina, New Jersey, New York, Ohio, Pennsylvania, and Texas. For each month, percentage changes in real estate prices for bubble (non-bubble) states are a simple average of the changes for each zip code in the bubble (non-bubble) states.

We calculate returns for each three-digit zip code index by taking an equal-weighted average of the Zillow price changes across all zip codes with the same first three digits. The three-digit indices correspond closely with cities, but have the advantage of being available for smaller cities that do not have Zillow real estate indices.

Table 1 provides summary statistics on the properties purchased by investors. For the bubble states, there are 420,914 observations. These purchases include houses, condominiums and apartment buildings. Land and mobile homes are not included, nor are properties with missing or ambiguous descriptions. Most of the properties are not sold during our sample period.

Table 1 Summary statistics on property purchasers by state

For bubble states as a whole, the mean buy price is $253,916 while the median price is $162,000. Purchasers live in the same zip code 41.5% of the time, in the same 3-digit zip code 65.6% of the time, and in the same state 87.6% of the time. An investor is counted as a resident of the property if the mailing address is the same as the property address. Investors live in 27.6% of the properties that they purchase. Foreclosed properties make up 11.5% of their purchases.

The next four rows provide results separately for each of the four bubble states. A few things stand out. First, with an average price of $358,231, properties are much more expensive in California than elsewhere. Second, most of the purchases, 219,438 out of 420,914 are in Florida. California, despite its larger population, accounts for only 162,478. Third, a smaller proportion of purchasers live near the properties in Nevada than elsewhere. Only 52.2%, live in the same 3-digit zip code and only 66.0% live in the same state.

There are 285,551 property purchases in the other ten states. Prices are lower than in the bubble states. The mean purchase price is $135,749 and the median is just $75,000. Purchasers are more likely to live near their property in these states than in the bubble states. More than 42.1% live in the same zip code, 71.1% live in the same 3-digit zip code and 93.4% live in the same state. New Jersey and Texas stand out for having few purchases. This is because property descriptions were more often ambiguous or missing there than elsewhere. Property prices are particularly low in Michigan with a median value of only $23,750. Many of these properties are in the inner city of Detroit or other depressed areas.

We separate properties at the time of purchases into three categories: houses, condominiums, and apartments. Table 2 shows the proportion of properties of each type purchased in each state. Apartments include all multi-family rental properties, from duplexes to buildings with more than 50 units.Footnote 6 Single family houses are 68.7% of bubble state properties and 72.5% of properties in other states. Condominiums are 22.4% of properties in bubble states and 9.0% of properties in other states, while apartments represent 8.8% of bubble state purchases and 18.5% of purchases in non-bubble states. Not surprisingly, we find that there is significant variation across states in property types. Houses make up almost all of the properties in Georgia, Michigan, North Carolina and Texas. Condominiums and apartments are the majority of properties in Arizona, Illinois, and New Jersey. The second to the last column of Table 2 provides the median year built for properties in each state, Bubble states feature more recently built properties. The median year built in bubble states is 1982, while the median year built for properties in the other states is 1950.

Table 2 Characteristics of properties purchased by state

The last column of Table 2 provides the proportions of properties that were purchased for cash in each state. The proportions are surprisingly high. For all bubble states combined, the proportion is 58.7%, while the proportion is even larger, 66.1%, for other states. Keep in mind though that these are purchases by investors, and not typical homeowners. In addition, in some states the median price of homes purchased by investors is quite low. In Michigan, it is only $23,750. According to CoreLogic, nationwide cash sales peaked at 46.6% of house transactions in January, 2011. In October 2015, near the end of our sample period, cash sales still accounted for 46.7% of transactions in Florida and 46.3% of transactions in New York.Footnote 7

Most properties purchased by investors are not sold during the sample period. Table 3 provides some summary statistics for properties that were both bought and sold during the sample period. For bubble state repeat sale transactions, 61.6% are by investors who live in the same 3-digit zip code and 29.6% are purchased without a mortgage. On average, properties are held for 5.34 years and the mean annualized price appreciation is 4.72%. For other states, 66.8% of investors are in the same 3-digit zip code as the property and 54.4% purchase without a mortgage. On average, in these other states, properties that are both bought and sold during the sample period are held for an average of 5.08 years and produce an annualized price appreciation of 12.54%.

Table 3 Characteristics of properties that were bought and sold

Table 4 shows the percentage of properties that were both bought and sold during the sample period that were purchased or sold in each year. For the bubble states, 53.58% of the properties were bought between 2003 and 2006. For the others, purchases were more evenly distributed across years, but 46.26% were still purchased between 2003 and 2006. Sales are more evenly distributed, but concentrated toward the end of the sample period. We saw in Table 3 that investors earned lower returns in the bubble states than elsewhere. Table 4 shows one reason why – they were more likely to be purchasing before the 2006–2007 peak in real estate prices.

Table 4 Percentage of property purchases and sales that take place each year

Price Appreciation on Properties that Were Bought and Sold in the Sample Period

Do Real Estate Investors Outperform the Real Estate Market?

There is reason to believe that real estate investors may outperform real estate indices as a whole. Most transactions consist of people buying or selling their principal residence. They do not buy or sell often, and do not develop expertise or skill in real estate transactions. They may be moving from outside the area and with little knowledge of the local real estate market. They may need to buy a home quickly and therefore have limited time to search for a house or negotiate a favorable price. Finally, and perhaps most importantly, personal tastes may lead them to buy property with characteristics that make it unlikely to be a good investment. So, with most transactions coming from homeowners it would not be surprising if real estate investors outperformed real estate markets.

Identifiable characteristics of investors are likely to be associated with higher real estate returns. An investor who lives near a property is likely to know more about the local market. In addition, living in an area makes it likely that he will be able to take advantage of opportunities quickly. If an investor can buy a property with cash rather than taking out a mortgage, he will be in a better negotiating position with sellers and may be able to buy at a lower price. Paying cash may also allow an investor to move quickly when an opportunity arises. Experience in real estate investing may also be associated with higher returns. An investor who has already bought properties may know more about the real estate market and which properties are likely to appreciate in value.

To see whether investors outperform real estate indices, we estimate the following regression:

$$ {Ret}_i={\alpha}_0+{\alpha}_1{Ret}_{3 Digit\ Zip}+{\alpha}_2{Ret}_{3 Digit\ Zip}>0+{\varepsilon}_i $$
(1)

Here, Reti is the annualized price appreciation earned by the investor who owned property i, Ret3Digit Zip is the annualized return from the average of Zillow zip codes indices in the 3-digit area over the same period that investor owned property i, Ret3Digit Zip > 0 is the annualized return for the Zillow indices in the 3-digit zip code if positive and zero if negative.

Some caveats are in order. The dependent variable in (1) is referred to as the return, but it is really the annualized price appreciation of the property over the time the investor owns it, calculated from the purchase and sale prices. We do not have any information on rents, and this is an important part of the return of any investment property. In calculating excess returns, we implicitly assume that the rent earned by investors is the same as the rent earned by all real estate in the area. We also ignore leverage. We are measuring the price appreciation of the property, not the return to the investor. Consistent with this, if a property is foreclosed, the return is measured using the next sale price of the property, even if it is sold by the lender.

Another issue is that we only observe returns when investors sell property. If investors sell their real estate winners and hang on to losers, like they do with equities, the return to real estate investment will be overstated. Genesove and Mayer (2001) examine the condo market in Boston over 1990–1997 and find that condo owners are reluctant to realize losses. They also find, though, that investors in condos display far less loss aversion, especially with respect to large losses. We will show that for property bought by investors, growth in assessed values of real estate from purchase to sale is very similar to the growth of assessed value of property that is not sold over the same period.

Panel A of Table 5 reports regression results. To minimize the effect of outliers, we require annualized property returns to be between −99% and 2000%, and holding periods to be greater than six months. To eliminate repeat sales that do not come from arm’s length transactions we require buy and sell prices to be at least $5000.Footnote 8 We require that the property characteristics include the year built, and to ensure that a property is not built during the investor’s holding period, we include only properties that were built in years before the purchase year. To make returns from different holding periods comparable, we annualize all returns.

Table 5 Regressions of annualized holding period returns to real estate investments on Zillow indices

The first two columns provide coefficient estimates and robust t-statistics for regressions that include all properties not identified as land, recreational properties, or mobile homes. We also require the property characteristics to include a year built. The first regression includes returns on properties in the four bubble states. The intercept for this regression is 0.0156, indicating that investors earn an annual return of 1.56% on their real estate investments when the Zillow index return is zero. The coefficient on the Zillow index return is 1.4309. Of more interest is that the coefficient on the Zillow index returns when the return is positive is 0.1701, with a robust t-statistic of 3.86. Real estate investors’ returns on their holdings are sensitive to index returns, but are more sensitive on the upside. A potential explanation for this is that real estate investors have timing ability. They buy property that is particularly sensitive to local real estate returns before area real estate appreciates, but buy property that is less sensitive to local real estate returns before property values fall.

The second regression includes properties from the ten other or non-bubble states. The intercept coefficient for this regression is 0.0632. In other states, investors earn an average annual return of 6.32% if the Zillow index is unchanged. The coefficient on the index return is 0.7135, indicating that real estate investors’ returns in these states fall by less than the Zillow index when the index declines. The coefficient on positive index returns is 1.5013, so real estate investors earn 6.32% plus 0.7135 + 1.5013 = 2.2148 times as much as the index when the Zillow index return is positive.

The next two columns provide regression estimates using just properties that are identified as houses, condos, or apartments. Properties that are not described, or that have ambiguous descriptions are omitted. As in the first two regressions, intercepts are positive and highly significant for both the bubble state and other states regressions. When the Zillow indices are flat, investors still earn positive returns. The coefficients on positive Zillow index returns are positive and significant. Real estate investor returns move more with the Zillow indices when the indices are rising than when they are falling. This again could be interpreted as market timing ability. Real estate investors buy properties that move with the local market before real estate prices increase, and buy properties that do not move much with the market before real estate price declines.

As a whole, the results in Panel A indicate that real estate investors purchase properties that outperform real estate indices. This is not a surprising result. Most real estate purchases are made by people who intend to live in the house or condo. They have little experience in real estate and may be new to the area. In addition, these buyers purchase real estate for consumption as well as investment. They may be willing to pay for amenities that the market will not value highly.

Real Estate Investor, Property Characteristics and Investment Returns

Even on properties bought for investment purposes, we would expect real estate investor returns to be a function of their experience, skill, knowledge, and ability to negotiate. To test this, we include variables for investor characteristics in the regressions. We also include interactions between characteristics and the index return and between characteristics and a variable that equals index returns when the index returns are positive and zero when negative. This allows us test whether the characteristic helps investors time the market and take advantage of market-wide price appreciation. Specifically, we estimate

$$ {Ret}_i={\alpha}_0+{\alpha}_1{Ret}_{3 Digit\ Zip}+{\alpha}_2{Ret}_{3 Digit\ Zip}>0+{\sum}_{n=1}^N{\beta}_n{Characteristic}_{n,i}+{\sum}_{n=1}^N{\gamma}_n{Characteristic}_{n,i}\bullet {Ret}_{3 Digit\ Zip}+{\sum}_{n=1}^N{\delta}_n{Characteristic}_{n,i}\bullet {Ret}_{3 Digit\ Zip}>0+{\varepsilon}_i $$
(2)

In the remainder of Table 5 we estimate Eq. (2) using one characteristic at a time. In Table 6, we estimate (2) using all investor and property characteristics together.

Table 6 Regressions of real estate investment returns on investor characteristics

Returns on Properties that Investors Occupy

If real estate investors’ focusing on the investment characteristics of property rather than its consumption value is a significant reason they outperform indices, we would expect their returns to be lower on properties that they occupy. To test this, we estimate Eq. (2) with a dummy that takes a value of one if the investor lives in the property as the characteristic. These regressions are reported in Panel B of Table 5. The first four regressions do not contain interactions between index returns and the dummy variable for the investor living in the house. The four regressions represent bubble and non-bubble states and the two different criteria for including properties, those explicitly defined as houses, condos or apartments, and all properties not defined as land or mobile homes. In each case, the coefficient on the dummy variable for living in the house is negative, significant, and larger in magnitude than the intercept coefficient. Properties owned by real estate investors that they do not occupy earn positive returns when the local Zillow index is flat, while the houses they live in earn negative returns. Real estate investors earn abnormal returns, but not on their own homes.

We include interactions between the Zillow index return and the Zillow index return when positive and the dummy for living in the house in the last four regressions of Panel B. With these additional terms, the coefficients on the dummy variable for living in the house remain negative and significant in each regression. Houses purchased by investors to live in underperform. In the bubble states, the coefficient on the interaction between the return on the index and the dummy for living in the house is negative. Investors choose properties that are less sensitive to local real estate values when they live in a house than when they buy a property as an investment. In the other states, the coefficient on the interaction between living in the house and the index return is positive and significant, while the coefficient on the interaction between a positive index return and the dummy for living in a house is negative and significant. This is the opposite of what we would expect with successful market timing. Houses that investors live in are sensitive to the index during downturns but insensitive to the index when it performs well. This is very different from what we see with properties that investors buy but do not live in.

Investor Experience and Returns

We expect investor experience to affect returns both because we expect experienced investors to do a better job of selecting properties and because we expect them to do a better job of negotiating purchases. We estimate (2) using a variable that equals the total number of properties that the investor has purchased, up to and including the current property. The coefficient on this variable can be interpreted as the extra annual return earned on later purchase. In considering these regressions, bear in mind that we may miss some purchases. Some investors may have bought property before the start of our data. In the instances where this occurs, investors will have more experience than indicated by the experience variables. We will also miss purchases if they occur in a different state or if the investor name or address are recorded slightly differently on different deeds. These problems may add noise to the regressions and possibly weaken results.

We report the regressions in Panel C of Table 5. The first four regressions in Panel C do not include interactions between Zillow Index returns and the number of properties owned by the investor. In each case, the coefficient on the number of properties is positive and highly significant. These regressions suggest that investors learn from their real estate experiences and become better investors. This is not surprising. We would expect them to learn about desirable and undesirable characteristics of properties. We might also expect experience to make them skilled negotiators. The last four regressions include interactions between the number of properties owned by the investor and Zillow index returns and positive Zillow index returns. Even with these interaction terms, the coefficient on the number of properties is positive and significant in each case. For bubble states, the coefficient on the interaction between the index return and the number of properties is positive and significant. Experienced investors pick properties that are more sensitive to index returns than inexperienced investors. For the other states, the coefficient on the interaction between number of properties and the index return is negative and significant, while the coefficient on the interaction between the number of properties and a positive index return is positive and significant. This is consistent with experienced investors being better able to time their real estate investments than other investors.

Mortgage Use and Returns

A large proportion of real estate investors pay cash when buying properties. A big advantage of paying cash is that it puts the investor in a better negotiating position when buying the property. The seller knows that the transaction will be completed. There is no concern that the assessed value of the property will be too low to support the mortgage. There is no uncertainty about whether the buyer will obtain financing. In addition, investors who purchase properties at auction are typically required to pay cash. Finally, an investor who can pay cash can move quickly to take advantage of a favorable price for a property.

We rerun regression (2) with a dummy variable that takes a value one when the purchase is made without a mortgage. Results are shown in Panel D. The first four regressions do not include interactions between the no mortgage dummy and index returns. In each of these regressions, the coefficient on the variable for no mortgage is positive and highly significant. The coefficients are large, ranging from 0.1678 for properties other than land or mobile homes in bubble states, to 0.2257 for the properties designated as houses, condos, or apartments in non-bubble states. All else equal, an investor earns about 20% per year more on properties that are purchased for cash. When the no mortgage dummy is included, the intercept coefficient is negative in each regression. When the local real estate index return is zero, investors who buy with a mortgage lose money while those who buy without a mortgage earn positive returns. The t-statistics on the no mortgage dummies are very large, ranging from 39.99 to 48.23. Adjusted R2s are larger for these regressions than for any others in Table 5. Paying cash seems to be a very important component of investor success.

The regressions in the next four columns include interactions between the dummy for no mortgage and both index returns and positive index returns. Interactions between the no mortgage dummy and positive index returns are particularly interesting because the ability to buy with cash is helpful for investors who are trying to time the market. In each of the regressions, the coefficient on the interaction between the no mortgage dummy and index returns is negative and highly significant while the coefficient on the interaction between the no mortgage dummy and positive index returns is positive, large, and highly significant. When local house prices fall, houses bought without mortgages experience much smaller price declines. When house prices rise, the prices of houses bought without mortgages rise much more. These results suggest that paying cash for properties helps investors time the market. They can quickly purchase property that is sensitive to local market conditions before an upturn.

Buyer Location and Return

Real estate investors have traditionally lived near their investments, but many in our dataset live in different cities within the same state, or even in different states from their property. A local investor is likely to know more about the area and the local real estate market. In addition, local real estate investors may learn of opportunities more quickly than investors located farther away. They may be able to act more quickly to take advantage of opportunities. To test this, we include a dummy variable that equals one if the investor lives in the same 3-digit zip code as the property in Eq. (2). These regressions are reported in Panel E of Table 5. In the first four regressions, we do not include interactions between the dummy for living in the same three-digit zip code and index returns. In each of these regressions, the coefficient is positive and highly significant. Investors earn returns that are higher by about 2% per year when they live close to properties they own. For the bubble states, the coefficients on the interaction between the dummy for living in the three-digit zip and the index return are negative and significant, while the coefficients on the interaction between the dummy for living in the three-digit zip and the positive index return are positive and significant. This suggest that local investors in the bubble states were able to time the market. Another way to interpret this is that in the bubble states, the investors who were buying into high risk areas before the downturn were not local investors.

Returns on Purchases of Foreclosed Properties

Foreclosed properties account for a significant proportion of the properties that investors purchase in our sample. There were many foreclosures in the bubble states after prices began to decline. Depressed areas like Detroit, Michigan also had numerous foreclosures. Lenders that acquire properties through foreclosure typically have no interest in owning real estate and make for “motivated” sellers. Investors who purchase foreclosed properties may therefore be able to negotiate favorable prices and may be able to earn large returns.

Estimates of Eq. (2) with a dummy variable that is one if the house had been foreclosed when purchased are shown in Panel F. The coefficients on the foreclosure dummy are 0.1523 and 0.1539 in the bubble states, and are highly significant. Investors who buy foreclosed properties in the bubble states earn annual returns that are about 15% greater than returns on indices. The coefficients on the foreclosure dummy for the non-bubble states are very large: 0.3928 and 0.3687. These coefficients are also highly statistically significant. The last four regressions include interactions between foreclosed properties and index returns and positive index returns. For the bubble states, the interactions between foreclosure and index returns are negative while the interactions between foreclosure and positive index returns are positive. For the other states, these interactions are insignificant.

Some caution is needed in drawing conclusions from the returns to foreclosed properties. If investors spend more money to renovate properties than is spent on the typical house or condo in the Zillow index, excess returns may be overstated. Foreclosed properties may require more repairs and renovations than typical properties. Homeowners who expect to default on mortgages are likely to skimp on maintenance and owners of property that needs major repairs may be more likely to default on loans, all else equal. This could help explain the very large abnormal returns earned on foreclosed property in the non-bubble states.

Real Estate Returns when all Investor Characteristics Are Considered

Now, rather examining the impact of investor characteristics separately, we combine them into one regression. We include dummy variables for the investor living in the same 3-digit zip code as the property, for living in the same zip code, for living in the property, for buying the property without a mortgage and for foreclosed property. We also include the number of properties owned by the investor. In additional regressions, we also include interactions between each characteristic and the index return and positive index return variables. Table 6 presents regression estimates.

As in Table 5, we estimate separate regressions for bubble and non-bubble states. For both locations, we estimate one regression using all properties that are not designated as land, mobile homes, or recreational properties, and one that includes only properties explicitly categorized as houses, apartments or condos. In these regressions, the coefficients on investor characteristics generally have the same sign and significance level that they do in the regressions that include only one characteristic at a time.

In each regression, after including all the other variables, the coefficients on the dummy variables for living in the same three-digit zip code as the property, for purchasing the property without a mortgage, and for purchasing foreclosed property are positive and highly significant. Investors earn higher returns on property if they live in the same city, can pay cash, and buy when the house has been foreclosed. The coefficient on the number of properties owned is also positive and significant. Even after adjusting for all of the other variables, experienced investors earn higher returns than inexperienced ones. The coefficient on the dummy variable for living in a house is negative and highly significant for the non-bubble states, but not for the bubble states.

It is particularly interesting that in each of these regressions, even after including all other variables, the interaction between buying without a mortgage and the index return is negative and significant, while the interaction between buying without a mortgage and a positive index return is positive and significant. More than any other variable, paying cash for a property is associated with ability to time the local real estate market.

Investor Characteristics and the Purchase Prices of Properties

Investors can earn abnormal returns on real estate in several ways. First, they can use their knowledge of local real estate, or their negotiating ability or their ability to provide liquidity to purchase properties at favorable prices. Or, they can use knowledge of local economies to forecast areas and property types that will provide better returns. Finally, it is possible that they earn superior returns by providing property improvements. We examine that issue in the next section of the paper.

In this section, we test whether the characteristics that are associated with large real estate returns are also associated with purchasing real estate at low prices. If specific real estate investor characteristics allow them to earn abnormal returns on their investments, it is likely that these characteristics are associated with the investors paying less for properties. The ability to buy without a mortgage may allow investors to negotiate better prices, and to move more quickly when good buying opportunities arise. Foreclosed properties may have “motivated sellers” and be available at better prices. People living near properties may be more aware of when they can be purchased at a favorable price. Finally, investors who have purchased numerous properties may be better at negotiating purchase prices and assessing property values.

To test whether investor characteristics affect purchase prices, we run the following regression,

$$ {\left(\frac{Purchase\ Price}{Assessed\ Value}\right)}_i={\alpha}_0+{\alpha}_1{D}_{No\ Mortgage,i}+{\alpha}_2{D}_{Foreclosed,i}+{\alpha}_3{D}_{Same\ 3- digit\ zip,i}+{\alpha}_4{Number\ Prop}_i+{\alpha}_5{D}_{Live\ in\ House}+{\varepsilon}_i $$
(3)

The correspondence between purchase prices and assessed values is likely to differ significantly across locations, so we include fixed effects for the five-digit zip code of each property and cluster standard errors on the purchase year. We present results in Table 7.

Table 7 Regressions of the ratio of purchase price on assessed value

Panel A shows results for bubble states. We do not need sale prices for properties, just purchase prices. Hence, these regressions have more observations than the return regressions. We first run separate regressions for each investor characteristic that is associated with real estate returns. The first regression contains only the dummy variable for no mortgage as an explanatory variable. With fixed effects, the intercept is determined arbitrarily. Stata chooses it so that the predicted value calculated at the mean of the independent variables equals the mean of the dependent variable. In the first regression, it is 1.5855, indicating that investors pay about 59% over the assessed value. The coefficient of −0.3178 means that investors without a mortgage paid almost 32% of the assessed value less than other investors. The ability to purchase without a mortgage appears to allow investors to negotiate better prices or to move quickly to buy promising properties. In the second regression, the coefficient on the dummy variable for a foreclosed property is −0.4795. Investors bought foreclosed property for about 48% of the assessed value less than other properties. In the third regression, we estimate the impact of the investor living in the same three-digit zip code as the property on purchase prices. Living nearby is associated with a purchase price that is only 1.7% lower, as a percentage of the assessed value, than is paid by an investor living in another city. Unlike the coefficients on all the other characteristics, this one is statistically insignificant with a t-statistic of −1.31. The fourth regression has just the number of properties held by the investor as an independent variable. The coefficient is −0.0208, indicating that more properties are associated with lower purchase prices relative to assessed values. The coefficient on the dummy variable for living in the house, shown in regression (5) is 0.0870. Investors who intend to live in a house pay more, relative to the assessed value, than investors who buy property solely for an investment. When all the variables are included together, in the sixth regression, all retain the same signs. The coefficient on the dummy variable for living in the same 3-digit zip code shift from being insignificant to having a t-statistic of −8.50. The number of properties and the dummy variable for living in the house are now statistically insignificant.

Panel B shows results for other states. Results are similar to the bubble state results but stronger. All coefficients have the expected sign, are larger in absolute value than the bubble state coefficients, and are more statistically significant. For example, in the regression for non-bubble states with all variables, the coefficient on the dummy for the investor living in the same 3-digit zip code is −0.0849 with a t-statistic of −9.33, while the same variable has a coefficient of −0.0335 with a t-statistic of −8.50 in the bubble state regression.

The results in Table 7 are consistent with real estate investors earning abnormal returns by buying at low prices relative to assessed values. The same characteristics of investors that allow them to earn large returns on real estate investments also allow them to buy at low prices.

Robustness Checks

Robustness Tests: Are the Results Dependent on the Zillow Indices?

Our results thus far indicate that individual investors outperform Zillow indices on their investments, particularly if they live in the same city as the investment property, buy it without a mortgage, and if the property has been foreclosed. The Zillow indices are popular and widely followed, but the details of their construction is relatively opaque. Before concluding that individual real estate investors earn abnormal returns, we examine the sensitivity of our results to the real estate index.

We construct monthly repeat sales indices for four three-digit zip codes areas in Florida, and three three-digit zip codes areas in California. The Florida areas are 331 (Miami), 334 (Boca Raton, West Palm Beach), 339 (Fort Myers) and 342 (Sarasota). The three California areas are 922 (Palm Springs, Riverside County), 923 (San Bernardino County), and 945 (Napa, Fremont, Valejo). Estimation of repeat sales indices requires a lot of observations, and these were the three-digit zip codes in California and Florida with the largest number of repeat sales over 2000–2015. For each three-digit zip code, we generate a repeat sales index.Footnote 9

We regress annualized investor returns on annualized index returns, investor and property characteristics, and interactions between positive index returns and property characteristics using our repeat sales indices and the Zillow indices used throughout the paper. Results are presented in Table 8. The first column provides coefficients and t-statistics for the regression that uses our repeat sales indices and does not have interactions between index returns and characteristics. The second column reports results of a similar regression that uses the Zillow index.

Table 8 Robustness Tests: Comparing Results Using Zillow Indices and Repeat Sale Indices

For the most part, results are insensitive to index choice. In both regressions, the coefficients on the dummy variables for buying the house without a mortgage and buying foreclosed property are positive and statistically significant. The coefficient on the number of properties owned is also positive and significant in both regressions. The one important difference is that the coefficient on a positive index return for the three-digit zip code is positive and significant when the Zillow indices are used, but negative when the repeat sale indices are included in the regression.

The last two columns of Table 8 show regression results when index returns and positive index returns are interacted with investor characteristics. Now, in both regressions the coefficient on positive index returns is negative and significant. This is not what we find in the full sample, but appears in this subsample regardless of which index is used. In both cases, the coefficient on the interaction between index returns and the no mortgage dummy is negative and significant, while the interaction between positive index returns and buying with no mortgage is positive and highly significant. Investors are better able to time the real estate market if they buy with cash. The interaction between index returns and the foreclosure dummy is negative and significant for both regressions while the interaction between positive index returns and the foreclosure dummy is positive and of marginal significance in both cases.

The main lesson of Table 8 is that our results are not sensitive to the real estate index that we use to measure abnormal returns. Regressions that use our repeat sales indices produce similar results to regressions that use the Zillow indices. All-in-all, the Zillow indices appear to be somewhat better than the simple repeat sale indices. R2’s are higher for regressions that use the Zillow indices than similar regressions that use the repeat sales indices. In addition, the repeat sales indices used in Table 8 were estimated using areas with large numbers of repeat sales. It is likely that repeat sales indices estimated for areas with fewer transactions would not work as well.

Do Real Estate Investors Sell Winners and Keep Losers?

Based on properties that they both buy and sell, it appears that real estate investors outperform local real estate indices. This is particularly true for certain types of investors. Those who live in the same three-digit zip code as the property and hence are likely to have better information about the local area than outsiders, earn positive abnormal returns. Investors who do not need to take a mortgage to buy property are in a better position to negotiate a purchase and can move quickly when a desirable property is available at a good price. These investors also outperform local indices on properties that they buy and sell. Investors with more experience as property owners are also likely to make fewer mistakes in buying, selling and managing property. We would expect them to earn higher returns, and our evidence suggests that they do.

These estimates are based on properties that are both bought and sold in our sample period – a minority of all investor held properties. Investors in equities have been shown to sell winners while hanging on to losers. If the same is true for real estate investors, their returns could be overstated.

To see if real estate investors are selling winners, we compare the change in assessed value between a property’s purchase and sale years with the change in assessed values over the same time period for properties that were held through that year. We do that by estimating the following regression,

$$ {Ret}_{t,t+k}^{Assessed,i}={\alpha}_0+{\alpha}_1{D}_{Sold\ t+k}^{\mathrm{B} ought\ t}+{\varepsilon}_i $$
(4)

Where the return is based on assessed values for years t and t + k and the dummy variable takes a value of one if the property was bought in year t and sold in year t + k. Other observations are properties that were bought in year t and sold at a later date, or not at all during our sample period. Fixed effects for the three-digit zip code of the property capture the average annualized assessed value return for properties in that zip code bought in year t and held at least until year t + k. To ensure that we do not include properties that were built while the property was held, we only include observations if the building was built at least a year before the purchase date. A positive value for α1 indicates that properties that were sold outperformed, at least in assessed value, properties that the investor continued to hold. Or, put another way, a positive α1 estimate suggests that investors are selling winners.

Table 9 presents the α1 coefficients, or the difference between annualized returns for properties that were sold and returns for properties that were held. Panel A provides results for the bubble states. Each coefficient is the coefficient from the regression with a specific purchase and sale year. Recall that each regression contains all assessed value returns for properties that were bought in year t and sold in year t + k or later. Therefore, the coefficient estimates are not independent. Many of the properties that were bought in, say, 2002 and not sold in 2008 were also not sold in 2009, and the assessed returns for these properties from 2002 to 2009 may be strongly correlated with the assessed returns from 2002 to 2008.

Table 9 The difference between assessment value returns for properties that were sold and properties that were held

If investors were more likely to sell winners, the coefficients would be positive. That is not what we see. In Panel A, coefficients are more likely to be negative, particularly if investors purchased the property in 2006 or 2007. T-statistics, in parentheses under the coefficients, indicate that several of these coefficients are significantly less than zero. If anything, investors in the bubble states were selling losers and hence returns could be understated. Panel B provides results for non-bubble states. Coefficients are about half positive and half negative. These is little to suggest that investors are selling winners. This evidence suggests that it is unlikely that calculating returns using completed repeat sales imparts a bias to the return estimates.

Do Real Estate Investors Outperform Indices or Do they Spend More to Improve Properties?

It appears that real estate investors, particularly those who live near their properties and buy without a mortgage, earn returns that exceed local real estate indices. One reason for this is that they appear to buy these properties at attractive prices. It is also possible though that investors are spending more to renovate and improve properties than others. Note that individuals who purchase properties to live in also spend money on improvements and renovation, and the value of the local real estate indices reflect these improvements. The issue then is whether real estate investors spend more on renovations than others, and whether the extra returns earned by them are eliminated when the extra amount that they spend on improvements is included.

There are investors who purchase houses, with the aim of renovating them and “flipping” them quickly. Note, however, that we omit flippers who buy and sell in six months or less from our return calculations. Instead, the average holding period of properties used to calculate returns is over five years. Nevertheless, to get at this issue, we estimate the return regression using returns on land and mobile homes. It is unlikely that returns from buying and selling either of these forms of property can be attributed to improvements made by the investor. Abnormal returns must instead come from superior knowledge of property values, better negotiating of prices, or fewer behavioral biases than the typical property buyer.

Table 10 presents the regression results separately for land and mobile homes along with the comparable regressions for houses, apartments, and condos. There are far fewer observations of returns on land and mobile homes than there are for houses, condos and apartments. This is particularly true for the other states. Hence we omit interactions between index returns and investor characteristics in these regressions. The first two regressions are for bubble state properties. For land and mobile homes the coefficient on Ret3-digit Zip > 0 is large, 6.2296 with a t-statistic of 30.80. This is not surprising. Building material and construction labor prices do not fluctuate much, so most changes in real estate prices should come from changes in the value of a location. Land values will be particularly sensitive to those changes.

Table 10 Investor returns on land and mobile homes versus houses, condominiums, and apartments

Of more interest is how investor characteristics affect returns on land and mobile homes. For land and mobile homes, like houses, condos, and apartments, investors earn larger returns if they live in the same state and same 3-digit zip code as the property. This suggests that value of living close to the property isn’t that it makes it easier to improve the property. That would not apply to land. Likewise, investors earn larger returns on land or a mobile home if the property is bought without a mortgage. This suggests that the extra return from paying cash doesn’t arise because the investor has more resources to put into renovation. It is likely that there is no renovation for land or mobile homes. Also like houses, condos and apartments, land investors who have already invested in properties earn larger returns than those who are new to real estate investing. This can be explained by experienced investors knowing more about the local real estate market, but not by them being better at renovating property.

The next two columns show results by property type for non-bubble states. The land and mobile home regressions again show that returns are affected in the same way by the same investor characteristics that affect returns on houses, condos, and apartments. Returns are larger for land investors if they live close to the property, pay cash, and already own property. Returns are larger on foreclosed properties than others.

Recently built houses are also unlikely to be improved or renovated by real estate investors. Abnormal returns from investing in these houses (or condos or apartments) are likely to come from better negotiating skills, better knowledge of local economies and real estate markets, or an ability to supply liquidity to property sellers. To compare investor returns on new properties that are unlikely to need renovation with returns on older properties, we divide repeat sale transactions into those in which the property was bought five or fewer years after it was built, and those in which the property was bought between six and one hundred years after its construction. We then estimate the price appreciation regression separately for new and old buildings. Regression estimates are presented in Table 11.

Table 11 Investor Returns on New and Old Properties

The first two columns show bubble state regressions for properties that are one to five years old, and properties that are six to one hundred years old. For the new properties, the coefficient on the no mortgage variable is 0.1141. An investor who buys a recently built house and pays cash earns an extra 11.41% annually on her investment. This again suggests that the value of paying cash is that investors can negotiate a better deal, not that they have money to pay for renovations. Finally, each extra property owned before purchasing the recently built house means an extra 1.01% annual return. The number of properties owned is statistically significant with a t-statistic of 3.44.

The intercept coefficient for the regression is −0.0753. So, an investor who lived in the same 3-digit zip code but not the same 5-digit zip code as the property, had no mortgage, and owned one property before this purchase could be expected to earn returns of −0.0753 + 0.0167 + 0.0108 + 0.1141+ 2 × 0.0101 = 8.65% per year when local real estate prices were unchanged. Given that these are recently built properties, it is highly unlikely that these large returns relative to local real estate indices come from investors spending more on improvements than the typical homeowner spends on improvements for properties in the index.

The next column contains the regression results for older properties in bubble states. Returns are somewhat more sensitive to the investor living near the property and not having a mortgage, but total excess returns are similar. An investor who lived in the same 3-digit zip code but not the same 5-digit zip code as the property, had no mortgage, and owned one property before this purchase could be expected to earn returns of 14.25% per year.

Regressions for the non-bubble states are reported in the next two columns. Returns are more sensitive to the investor’s location in these states, and returns are much lower if the investor lives in the house. For recently constructed properties in the non-bubble states, investors who lived in the same 3-digit zip code but not the same 5-digit zip code as the property, had no mortgage, owned one real estate investment before the purchase and did not live in the property, earned returns of −0.0628+ 0.0315 + 0.0149 + 0.0565 + 2 × 0.0087 = 5.75% if the local Zillow index was flat. Again, given how recently these properties were constructed, it seems highly unlikely that property renovations could account for the appreciation that investors earn.

The price appreciation regression for older properties in non-bubble states is shown in the last column. Like returns on newer properties, returns to investors in older properties are greater for nearby properties, if the investor pays cash, and if the investor already owns real estate. A difference here is that the return on older properties is more sensitive to positive index returns than to negative index returns.

The findings in Tables 10 and 11 do not, of course, indicate that remodeling and renovations play no part in the returns earned by real estate investors. But, the fact that similar returns are earned on investments in land and in new properties suggests that renovations play a minor role in investors’ real estate returns.

Real Estate Investors and the Housing Bubble

Our results suggest that real estate investors are, as a whole, rational investors who outperform real estate indices with their investments. This raises the question of whether real estate investors were different in the bubble states and during the housing bubble. We have seen consistently that investors who paid cash and investors who lived near properties earned larger returns than investors who bought with mortgages or lived farther from properties. We next examine investors who bought in bubble states and during the bubble period differed from other investors in those characteristics.

Fig. 2 plots the proportion of all investor property purchases each year from 1999 to 2014 that were made by investors that lived in the same 3-digit zip code as the property. The solid line is the proportion of bubble state purchases by local investors while the dashed line is the proportion of purchases in other states from local investors. The proportion of local investor purchases in bubble states declines steadily from over 70% in 1999 to 50% in 2006 and 2007, the peak of housing prices. It then increases to about 60% by 2014. The proportion of purchases by local investors is higher in non-bubble states than bubble states every year. Nevertheless, it also decreases from almost 80% in 1999 to about 60% in 2006 and 2007. It then increases back to 60% by the end of our sample period.

Fig. 2
figure 2

The proportion of real estate purchases by investors living in the same 3-digit zip code by year

Paying cash is also a robust predictor of investment success. Figure 3 shows the proportion of investor purchases each year that were made without a mortgage. For bubble states, the proportion is around 35% during 2002–2004 before dropping to under 30% in 2005–2007. For non-bubble states, the proportion is around 50% in the early part of the period and does not change noticeably during the bubble period. After 2007, the proportion of cash purchases increases sharply, especially in the bubble states.

Fig. 3
figure 3

The proportion of real estate purchases by investors that do not use a mortgage by year

So, the investors who perform poorly make up a larger proportion of purchases in bubble states and a larger proportion during the bubble period. It appears that the run-up in real estate prices attracted less qualified investors into the market in bubble states.

Conclusions

We examine the returns earned by individuals who invest in real estate. As a whole, investors outperform real estate indices. This is not surprising. The marginal buyer of a house is someone who is buying a place to live. The investment prospects for a house or condo are likely to take a backseat to its amenities.

We identify several specific characteristics of individual investors that are associated with significantly higher returns on investments. One is living close to the investment property. Investors who live near a property are likely to be more knowledgeable about local economic conditions and the local real estate market. They may learn about investment opportunities before outsiders and may be able to take advantage of them quickly. Another is paying cash for the property. An investor who is able to pay cash may be able to move more quickly when opportunities arise. She may be better able to negotiate a favorable price. Finally, we show that experienced real estate investors who had previously purchased property earned higher returns than inexperienced ones. Investors with these characteristics outperform real estate property indices.

One reason that investors with these characteristics outperform indices is that they buy at low prices relative to assessed values. This can occur if investors are good negotiators, or if they are better able to provide liquidity to the real estate market, or if they are better equipped to take advantage of opportunities when they arise. It is possible that real estate investors appear to outperform indices, but only because they spend more to renovate properties. We find, however, that real estate investors (with the right characteristics) also outperform indices when they invest in land, mobile homes, or recently constructed property. It is unlikely that they spend more money to improve these properties than the average homeowner represented in the Zillow indices.

Most of the properties that investors buy are not sold before the end of our sample period. It is well known that individual stock market investors are reluctant to realize losses by selling stocks and much more likely to sell stocks with capital gains. If real estate investors have the same biases to the same degree, our estimates of real estate returns may be too high. We find though, that properties that were bought but not sold had just as much appreciation in assessed values as properties that were bought and sold.