Mortgage Lending Discrimination Across the U.S.: New Methodology and New Evidence

Delis, Manthos D.; Papadopoulos, Panagiotis

doi:10.1007/s10693-018-0290-0

Mortgage Lending Discrimination Across the U.S.: New Methodology and New Evidence

Published: 17 March 2018

Volume 56, pages 341–368, (2019)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Journal of Financial Services Research Aims and scope Submit manuscript

Mortgage Lending Discrimination Across the U.S.: New Methodology and New Evidence

Download PDF

Manthos D. Delis¹ &
Panagiotis Papadopoulos²

1770 Accesses
18 Citations
Explore all metrics

Abstract

Is there discrimination in mortgage-loan origination and pricing? If so, does the level of discrimination differ before and after the eruption of the subprime crisis? Using data from 6.5 million loan applications from 2004 through 2013, we propose a novel approach aiming to substantially lower the notorious omitted-variable bias of the Home Mortgage Disclosure Act (HMDA) database and identify the level of racial, ethnic, and gender discrimination in mortgage lending across the United States. In stark contrast with previous studies, we find, on average, very little discrimination in loan origination. Although discrimination increases somewhat after 2007, its probability remains well below 1%. In contrast, we find that white (non-Hispanic) applicants pay a lower spread on the originated loans by 0.37 (0.11) basis points, a result that almost entirely comes from the pre-crisis period.

Are Minorities Still Paying Higher Mortgage Interest Rates?

Article 10 January 2023

Refinance and Mortgage Default: A Regression Discontinuity Analysis of HARP’s Impact on Default Rates

Article 28 June 2016

An Unintended Consequence of Mortgage Financing Regulation – a Racial Disparity

Article 12 November 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

This paper revisits the issue of mortgage-lending discrimination, which has vital implications for egalitarianism in general and individuals’ social and economic welfare in particular. We specifically address two old questions about whether race, gender, and ethnicity affect (1) the probability of banks granting a new mortgage loan and/or (2) the spread that banks charge on newly originated loans. We also ask a new question: What is the role of the subprime crisis with respect to these issues? The main novelty of our work, relative to the extant literature, rests in improvement of identifying discrimination practices when using countrywide U.S. loan-level data, along with placing the role of the subprime crisis at the heart of our analysis. Answering these questions in the most precise way possible is especially relevant for policy-makers and the general public, because housing is, inter alia, a core part of human well-being and a well-known part of the American dream (Pager and Shepherd 2008).

Existing literature on identifying discriminatory practices in the mortgage loan market considers both discrimination in a bank’s decision to originate the loan or not (henceforth called loan origination discrimination) and discriminatory practices in the pricing of loans (henceforth called loan pricing discrimination). The premise is that minority applicants, mostly African-Americans or Hispanics, are potentially discriminated against relative to non-minority applicants (i.e., whites and non-Hispanics) with respect to loan origination and/or pricing. The most prominent branch of the existing literature augments information from the Home Mortgage Disclosure Act (HMDA) with information on specific loan applicants and property values, etc., and it shows that white applicants have at least an 8% higher chance of receiving a loan relative to African-Americans, ceteris paribus (e.g., Munnell et al. 1996; Hubbard et al. 2012; Wheeler and Olson 2015).

Identifying discriminatory practices was one important reason for the creation of the HMDA database. Unfortunately, the HMDA does not provide crucial information on several dimensions of the bank’s decision-making process regarding loan origination and pricing. The missing information includes, but is not limited to, the loan-to-value ratio as well as the applicant’s wealth and creditworthiness. Thus, inference on discriminatory practices from research that uses only the HMDA database will very likely overstate the level of discrimination significantly and thus be invalid. For this reason, researchers and policy-makers tend to examine specific subsets of the HMDA—usually in local markets, where information on these basic omitted variables is available.

This approach can also be criticized, however, along two dimensions. First, the list of important omitted variables can be very idiosyncratic at the individual level. In fact, when discriminatory practices are alleged the involved financial institutions defend their operating methods based on this argument (see the numerous lawsuit cases filed by the National Association for the Advancement of Colored People 2007^{Footnote 1}). Essentially, this idiosyncratic omitted-variable problem means that any augmentation of an HDMA subset might still lack the information required to clear the endogeneity of the variables characterizing, for instance, race or ethnicity. Second, even in the unlikely event that researchers and policy-makers had access to augmented local data, and these data were sufficient to overcome the omitted-variable bias for these local markets, inference across the full gamut of mortgage loans and the entire United States cannot be made. As a result, regulators find discriminatory practices very hard to identify, let alone penalize and reduce. In addition, new legislation on discrimination might be hampered by the lack of specific evidence.

This paper’s main goal is to introduce a new empirical strategy that significantly reduces the omitted-variable bias in identifying mortgage-loan discrimination (both in loan origination and in loan pricing) using information solely from the HMDA, and thus across the entire United States, for a very large number of loan applications (more than 6 million from 2004 through 2013).

Our method runs in two stages. In the first stage, we estimate separate equations for the requested (i.e., the amount in the loan application) and granted (i.e., the amount in the loan application if the loan is originated; zero if the loan is not originated) loan amounts, with regional (census-tract) fixed effects, year fixed effects and, in some sensitivity tests, individuals’ income as the explanatory variables. Subsequently, we take the difference of the residuals between the two equations, which includes the unobserved reasons behind banks’ willingness to originate a loan or not relative to the applicant’s reasons in applying for the loan. Among these unobserved reasons is also the potential effect of race and gender. In the second stage, we estimate a binary model on the bank’s decision whether or not to grant the loan, using observable characteristics such as race, ethnicity, and gender as independent variables along with the difference in the residuals from the first stage. Essentially, this means that we extract the effects of race, ethnicity, and gender from the remaining unobserved component that is also included in the binary model.

Our results indicate that white applicants had approximately a 0.39% higher probability of being granted a loan than African-Americans from 2004 through 2013, and non-Hispanic applicants had a 0.25% higher probability than Hispanic applicants. These estimates, although statistically significant, are quite low and show limited discrimination on average. Further, our estimates are significantly lower compared with, for example, most previous studies’ estimates of 8% or higher loan approval for whites and non-Hispanics. Our findings support the suggestion that omitted-variable bias is very idiosyncratic and cannot be easily captured, even in data sets augmented with additional applicant characteristics. The equivalent estimates for gender are even lower and economically unimportant.

In forming our two-stage model, we discuss the potential bias from imprecise estimation of the first stage, as well as bias from applicants’ income misreporting. We carry out a number of sensitivity tests to remedy these sources of measurement error, including the use of the model of Erickson et al. (2014) and of data only from regions with low probability of income misreporting (Piskorski et al. 2015).

Moreover, by splitting our sample to the periods before and after 2007, we find an economically small, albeit notable, increase in the estimated level of racial and ethnic discrimination after 2007. Specifically, before the crisis, white (non-Hispanic) applicants have a 0.23% (0.14%) higher probability of being granted a loan compared with African American (Hispanic) applicants, whereas after the crisis, the foregoing probability increases to 0.66% with respect to African Americans and 0.54% with respect to Hispanics. Therefore, we find that the subprime crisis increased discrimination, even though these estimates remain far lower than those of previous studies.

Subsequently, we turn to discrimination in loan pricing. We show that, when all other factors are equal, white applicants pay 0.37 basis points lower spreads on their mortgage loans compared with African Americans and non-Hispanic applicants pay 0.11 basis points lower spreads compared with Hispanics. The difference between genders is negligible. The significant difference in spreads between minority and non-minority applicants is observed almost entirely during the pre-crisis period. Together with the findings on discrimination in loan origination, we suggest that before the financial crisis, discrimination in loan origination was very limited but some pricing discrimination did occur. After 2007, discrimination in loan pricing disappeared, most likely as a result of the credit crunch.

In a nutshell, we find that using as a regressor the difference between the unobserved reasons behind banks’ willingness to provide a specific loan amount or not and the applicant’s reasons in applying for that loan amount, controls for substantial information in a simple model of the bank’s decision to grant the loan or not. Using this control, lowers the probability of discrimination of minority groups from about 7–8% to less than 1%. Thus, we posit that, even though not panacea, our approach lowers the omitted-variable problem to “more acceptable” levels, highlights that discrimination is much more limited than previously thought, and allows drawing better conclusions from the HDMA data and the entire gamut of U.S. loan applications. These are the contributions of research.

The following section discusses the theoretical considerations on mortgage-lending discrimination, the identification methods and empirical estimates of the most prominent existing literature, and the subprime crisis’ potential role in banks’ lending practices. Along these dimensions, Section 2 specifies our paper’s three research questions. Section 3 presents our data set and discusses in detail our empirical approach. Section 4 presents the empirical results, and Section 5 provides policy implications based on our findings and concludes the paper.

2 Theoretical considerations and empirical facts

Almost 50 years after the 1968 enactment of the Fair Housing Act,^{Footnote 2} and more than a generation after the Equal Credit Opportunity Act (ECOA) of 1974, the Home Mortgage Disclosure Act (HMDA) of 1975, and the Community Reinvestment Act (CRA)^{Footnote 3} of 1977, lending discrimination remains a key issue in the social, political, and research agenda. Several early studies posit that, in the past, mortgage lenders discriminated against disadvantaged groups of borrowers and that much of the discrimination was actually part of their lending policy (Munnell et al. 1996; Ladd 1998).

The antidiscrimination laws imply that the lenders should use only objective information on a loan’s expected return to decide whether to originate the loan, and they should avoid using the applicant’s membership in a protected group as part of their decision. ECOA clarifies that the term “protected group” encompasses a neighborhood’s racial composition.

Turner and Skidmore (1999) summarize the possible key forces that lead to discrimination.^{Footnote 4} Assuming that lenders are profit-maximizers, lending discrimination in its purest form implies the unlikely event that lenders are willing to forgo profits. In more plausible forms of lending discrimination, however, economic interest occurs as a possible explanation when lenders believe that personal characteristics, such as race or ethnicity, are reliable proxies for factors that affect credit risk and cannot be easily observed. In this case, lenders have the economic motive to deny credit to minority applicants based on an average approximation of that minority’s creditworthiness and not based on the economic characteristics of individual applications. Such discrimination is usually termed “statistical” or “profit-motivated”.

A more direct form of discrimination relates to prejudice (Quillian 2006) which occurs when lenders consider minority borrowers as inferior and do not want to interact with them. Such prejudice is usually referred to as “taste-based discrimination.” Notably, if local home owners and depositors of local banks prefer not to have minorities as neighbors, the local banks might be negatively inclined against minority applicants. Yinger (1986) and Ondrich et al. (2003) provide evidence for such behavior on the part of real estate agents. Cultural affinity is another possible reason for taste-based discrimination, which occurs when white loan officers exert less effort to help minority borrowers meet underwriting criteria compared with non-minority borrowers.

The literature on lending discrimination is very large and can be distinguished between loan origination discrimination and loan pricing discrimination.^{Footnote 5} We do not review this literature here because detailed reviews are already available.^{Footnote 6} We do briefly note, however, some of the relevant history behind the academic interest in lending discrimination (mostly racial discrimination) and the most influential findings as a means to guide our own research.

2.1 Discrimination in loan origination

In the 1990s, academic interest in mortgage lending discrimination exploded, mainly as a result of the HMDA database’s amendment in 1989. The HMDA database provides information on individual mortgage loans to assist, inter alia, in determining whether financial institutions are serving their communities’ housing needs and in identifying possible discriminatory lending practices. The 1989 amendment made extra information (on the outcome of loan applications, the location of the property, the applicant’s race and gender, etc.) available to the public and allowed more-coherent studies of discriminatory practices in loan origination.

However, the lack of data on crucial information considered during the underwriting process, such as credit scores, loan-to-value (LTV) ratios, debt-to-income (DTI) ratios, etc., still hampered empirical identification of a causal effect running from racial and gender characteristics to the loan origination decision. Munnell et al. (1996) took advantage of unique data collected by the Federal Reserve Bank of Boston in 1990 (38 variables, including risk of default, loan characteristics, personal characteristics, etc.) and amended the HMDA to circumvent the omitted-variable bias. Their findings showed that mortgage loan applicants who belong to a minority group have approximately an 8% higher probability of rejection than white borrowers. This effect is, of course, economically quite potent, despite the fact that the addition of the new variables significantly reduces the estimated level of discrimination.

Munnell et al. (1996), however, did not clear the picture on the actual level of discrimination. Specifically, a number of studies show that even the augmented list of variables used by Munnell et al. is insufficient (Horne 1997) and that racial discrimination is either lower (e.g., the separate studies by Day and Liebowitz 1998, and Faber 2013, estimate it to be about 2.8%) or non-existent (e.g., Harrison 1998). More-recent studies (most notably Hubbard et al. 2012 and Wheeler and Olson 2015) find that Munnell et al.’s estimates are more or less accurate.

The debate on the existence of discrimination in loan origination leads us to revisit an old empirical question, which we formulate as follows:

Question 1: Is there racial, ethnic, and/or gender discrimination in the origination of loans in the United States?

2.2 Discrimination in loan pricing

The second type of discrimination concerns loan pricing. The post-2000 literature argues that loan-pricing discrimination became increasingly important mainly because of the subprime boom that took place in the early to mid-2000s (e.g., Williams et al. 2005; Faber 2013; Ghent et al. 2014). The premise is that instead of excluding minority applicants, lending institutions opened up credit to African-Americans and Hispanics, but minorities had to pay higher loan spreads (Rugh and Massey 2010).

In 2004, the HMDA amended its database with partial loan pricing information. Specifically, lenders must report the spread (difference) between a loan’s annual percentage rate (APR) and the rate on Treasury securities of comparable maturity—but only for loans with spreads above designated thresholds. Thus, rate spreads are reported for some (relatively high-risk) mortgage loans. Consequently, the last 15 years have seen a boom in research on loan-pricing discrimination, using either the HMDA database alone (e.g., Avery et al. 2005) or the HMDA enhanced with additional information (e.g., Courchane 2007; Bocian et al. 2008; DeLoughy 2012; Ghent et al. 2014; Bayer et al. 2014). Most of this literature highlights the existence of significant loan-pricing discrimination against African American and Hispanic applicants.

A small part of the literature on loan-pricing discrimination considers the role of gender. Sen (2012) suggests that female applicants who visit the same lenders as identically situated male applicants receive approximately the same amount of high-cost loans. In a similar vein, Haughwout et al. (2009), and Cheng et al. (2011, 2014) downplay the importance of gender in mortgage lending discrimination, suggesting that any differences in mortgage rates either are attributable to the different “shopping behavior” of female applicants (they are more likely to choose lenders by recommendation instead of searching for the lowest rates) or exist only among African-American female applicants.

Based on the foregoing discussion, we formulate our second research question as follows:

Question 2: Is there racial, ethnic, and/or gender discrimination in the pricing of loans in the United States?

2.3 The role of the subprime crisis

Almost a decade after the global financial crisis began, a third and novel question emerges: does discrimination in mortgage lending, either in loan origination or loan pricing, differ before and after the subprime crisis? This question is based on the theoretical assumption that discriminatory policies might be exacerbated during crisis periods, which see an overall increase in economic and social uncertainty, decrease in trust, and increase in anti-social behavior.

More specifically, historical experience and academic literature shows that a financial crisis constitutes a serious challenge to protecting fundamental rights for specific groups of people. This challenge is particularly serious for the most marginalized people, who are also the most vulnerable because they probably already suffer from discrimination. For example, the International Labour Office (2011), the Agency for Fundamental Rights of the European Union (2010), and UNESCO (2009) note that a crisis feeds many types of direct or indirect discrimination because of rising economic or social inequality and insecurity. Further, the same organizations argue that during periods of turmoil, policymakers tend to de-prioritize policies targeting discrimination.

In that line, lending institutions might increasingly decide on granting mortgage loans during crisis periods by using racial and ethnical stereotypes. As in the broader social spectrum, crisis periods create severe insecurity and uncertainty in the banking sector. Given that informational asymmetries are higher during crises, banks might be less willing to take risks and be more suspicious that minority applicants lack the ability to repay a loan. Consequently, banks would reject these applicants to minimize the risk of loan default.

Correspondingly, lenders might embrace elements that reflect the role of race or ethnicity in other related markets (Reskin 2012), which affects minority borrowers’ capacity to repay loans. For instance, it has been documented that minorities face higher unemployment rates and greater decreases in wealth during economic downturns.^{Footnote 7} Some lenders might thus perceive that minority applicants are more likely to be unemployed or see a substantial decrease in their wealth during (or shortly after) economic downturns, and these lenders might place a higher probability of default on loan applications by minority applicants. If this perception does not follow solely from the economics of the decision to supply the loan, such lender behavior implies increased statistical discrimination during economic downturns. In fact, according to Avery et al. (2010), mortgage lending to African-Americans and Hispanics fell more quickly than the U.S. average from 2007 to 2008.

The role of the subprime crisis in discrimination, especially with respect to loan origination, has not yet been given due consideration. Based on the theoretical considerations analyzed in this subsection, we expect that, if anything, discrimination in loan origination increased after the subprime crisis erupted. The subprime crisis’ role in loan-pricing discrimination is much more nebulous. On one hand, following the same reasoning with loan origination, the subprime crisis might increase loan-pricing discrimination. On the other hand, if discrimination in loan origination increases, then loan-pricing discrimination might fall because minority applicants are not granted a loan at all.

Considering our theoretical arguments, we reiterate our third research question about the role of the subprime crisis in mortgage lending discrimination as follows:

Question 3: Has the level of racial, ethnic, and/or gender discrimination in loan origination or loan pricing intensified since the subprime financial crisis?

3 Data and econometric methodology

Our study aims to make progress on the identification of lending discrimination using the full gamut of mortgage loan applications across the United States, not only for specific regions or subsamples with superior data availability. In doing so, we provide a method that can serve as a policy tool for researchers, regulators, and policymakers. To fulfill this aim, we use applicant-level information solely from the HMDA for the period 2004 through 2013. Data before 2004 do not include information on ethnicity and thus limit the study on ethnicity-based discrimination. Also, the specific time period encompasses the years associated with the subprime crisis of 2007 and allows us to answer Question 3.

The HMDA provides information on a number of financial and demographic characteristics of mortgage loans. Specifically, it includes information on the requested loan amount (in thousands of dollars), loan type (conventional, insured by the Federal Housing Administration, guaranteed by Veterans’ Administration or provided by the Farm Service Agency or Rural Housing Service, etc.), loan purpose (home purchase, home improvement, or refinancing), whether or not a preapproval was requested, the owner’s occupancy status (whether or not the owner plans to occupy the home as a principal dwelling), property type (whether the application was for a one- to four-family dwelling, manufactured housing, or multifamily dwelling), loan decision details (whether the mortgage was originated, the application was denied by the financial institution or withdrawn by the applicant, etc.), lien status (whether the loan is securitized or not), and the application’s denial reason(s) if the loan was not approved.

Further, the HMDA has information on the lender’s identification number and supervisory/regulatory agency code, as well as information on the applicant’s and co-applicant’s race (American Indian or Alaska Native, Asian, Black or African American, Native Hawaiian or Other Pacific Islander, White), ethnicity (Hispanic/Latino or not), gender, and income (gross annual income in thousands of dollars). Finally, the property’s location information includes the identification number of the Metropolitan Statistical Area (MSA), the state and county codes, and the census tract number.

To ensure comparability across the loans, we homogenize our data set by using only applications for conventional loans for home purchases. Further, we limit the sample to securitized (by a first or a subordinate lien) loans on owner-occupied (as a principal dwelling), one- to four-family properties. Our goal is to examine whether minorities have equal housing opportunities as borrowers who belong to non-minority groups. We thus are interested only in home-purchase loans (rather than refinancing or home improvement loans), because they increase individuals’ access to homeownership. For the same reason, we focus on loans for owner-occupied as a principal dwelling properties and not for second or vacation homes.

We also remove from the analysis observations with missing data for any of the foregoing variables. The final restriction we make is to exclude applications that did not result in either originations or denials. Thus, we rule out any other type of action taken (e.g., applications withdrawn by the applicant or approved but not accepted) because the decisions made for these applications are not under the lender’s control. Following our cleansing procedure, we have a sample of 6,452,279 loan applications for the 2004–2013 period. For descriptive statistics (along with a summary of variable definitions) see Table 1.

Table 1 Variables description and summary statistics

Mortgage Lending Discrimination Across the U.S.: New Methodology and New Evidence

Abstract

Similar content being viewed by others

Are Minorities Still Paying Higher Mortgage Interest Rates?

Refinance and Mortgage Default: A Regression Discontinuity Analysis of HARP’s Impact on Default Rates

An Unintended Consequence of Mortgage Financing Regulation – a Racial Disparity

1 Introduction

2 Theoretical considerations and empirical facts

2.1 Discrimination in loan origination

2.2 Discrimination in loan pricing

2.3 The role of the subprime crisis

3 Data and econometric methodology

4 Empirical results

4.1 Discrimination in loan origination

4.2 Discrimination in loan pricing

5 Conclusions and policy implications

Notes

References

Author information

Authors and Affiliations

Corresponding author

Appendix

Appendix

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL classification

Search

Navigation