Introduction

This study focuses on the loan pricing behavior of lenders under racially neutral lending policies and further investigates if mortgage financing regulation may unintentionally lead to minorities paying more for a mortgage loan. The issue of racial discrimination in mortgage lending has consistently drawn the attention of researchers since the enactment of the Equal Credit Opportunity Act (ECOA) in 1974. Ross and Yinger (2002) and Turner and Skidmore (1999) note that racial discrimination in mortgage markets manifests itself in several forms. Differential treatment discrimination occurs when equally qualified individuals are treated differently due to explicitly considering prohibited factors such as a borrower’s race or ethnicity. For example, lenders may deny loan applications of minority applicants with a higher frequency, steer minority borrowers into the less favorable instruments (e.g., subprime mortgage market) or price loans differently given the borrower’s level of risk. Simply put, differential treatment discrimination means that applicants with equivalent credit-related characteristics are treated differently. Disparate-impact discrimination occurs when lenders apply a racially neutral policy to all borrowers, however the policy itself disproportionately excludes or burdens borrowers in a certain group. An extensive literature exists that has examined these forms of discrimination and specifically whether fair-lending laws and regulations have successfully eliminated race-based disparity in credit allocations and in loan pricing.Footnote 1 Overall, the results are mixed. The focus of this study on the loan pricing behavior of lenders is consistent with the lending industry’s shift towards a risk-based pricing system. Under this system, borrowers with different risk levels have access to credit, but at rates relative to their risk (Turner and Skidmore 1999; Ghent et al. 2014). This implies borrowers with equivalent levels of risk should pay equivalent prices.

Using a sample of 30-year first-lien fixed-rate subprime mortgage loans for home purchase, this study evaluates the existence of contract rate disparity between racial/ethnic groups within a framework which mimics lenders’ loan pricing behavior. We first employ a competing-risks hazard model to simultaneously assess the default and prepayment risks of each loan in the sample based on the loan performance data. In the loan hazard model, the effects of a borrower’s race as well as the effects of the racial composition of a borrower’s neighborhood are assessed. For each loan in the sample, the probability of a borrower defaulting upon or prepaying a loan is predicted based on the loan hazard model. The loan-level predicted default and prepayment probabilities are then incorporated into a loan contract rate determination model to investigate whether race-based disparity in mortgage loan pricing exists after explicitly controlling for termination risk.

The results of the competing-risks loan hazard model indicate African American borrowers and Hispanic borrowers tend to be less likely to exercise their prepayment option than similar non-Hispanic white borrowers. However, we fail to reject the null of no impact on default risk of the race and ethnicity variables. In a completely competitive market, lower prepayment risk of loans reduces borrowing cost and enhances returns to lenders, thus it should be reflected in loan pricing and result in more favorable loan terms. However, the current fair-lending laws and regulations prohibit lenders from using a borrower’s race and ethnicity to assess a borrower’s risk profile, leading lenders to over-assess the risk level of a loan to a minority borrower. Overstating a borrower’s termination risk leads to a higher contract rate being paid. The estimation results indicate African American borrowers pay an additional 15–45 basis points and Hispanic borrowers pay roughly 10–24 extra basis points for their contract rate.

The methods applied in this study make it noticeably different from prior studies. In most of the prior racial studies in mortgage lending, the existence of racial disparity in the contract rate is evaluated through a reduced-form contract rate determination model. In a reduced-form model, a loan’s contract rate is regressed against a borrower’s race or the racial composition of a borrower’s neighborhood, as well as a set of covariates that are believed to be associated with the risk level of a loan. One potential problem with this approach is that a reduced-form model does not enable us to explicitly explore the effects of race and ethnicity on loan termination patterns and to separately derive mortgage pricing implications of these termination patterns. By contrast, the analysis utilized in this study allows us to mimic lenders’ pricing behavior by predicting the probability of a borrower defaulting upon or prepaying a loan and explicitly account for loan termination risk on the contract rate by including predicted probabilities of termination. More importantly, while prior studies using the reduced-form model normally focus on default risk factors while ignoring the impact of prepayment risk on the loan’s contract rate, this study simultaneously assesses the competing risks of mortgage default and prepayment. Kau et al. (1992) note that default and prepayment are substitutes for one another and one cannot accurately value either option without the other. In addition, numerous authors have emphasized the importance of incorporating prepayment risk in the valuation of mortgage-backed security (e.g., see Dunn and McConnell 1981a & 1981b; Schwartz and Torous 1989, Schwartz and Torous 1992; Chernov et al. 2017). Firestone et al. (2007) note that in practice default is a relatively rare event relative to mortgage prepayment (0.6% vs 92% in their study). Using this framework, we are able to explore whether borrowers of different racial and ethnic groups have different prepayment risk, and thus enabling us to determine whether fair-lending laws and regulations lead to minority borrowers paying a significantly greater contract rate.

Literature Review

Discrimination against minority borrowers in loan pricing may be directed at the individual borrower or based on the racial and ethnic composition of a borrower’s neighborhood (redlining).Footnote 2

The Survey of Consumer Finance (SCF) data and the American Housing Survey (AHS) data are widely used in prior studies to examine mortgage discrimination in loan pricing as both data sets have information on a borrower’s race. Using the 1983 SCF data and focusing on conventional fixed-rate mortgage loans, Duca and Rosenthal (1994) did not find evidence of racial discrimination in the conventional mortgage market. Getter (2006), using more recent 1998 and 2001 SCF data which better differentiates households with severe delinquency problems from households with minor repayment problems, arrived at the same conclusion - a borrower’s race does not affect a loan’s contract rate. However, Cheng et al. (2015) using the 2001, 2004, and 2007 SCF data consisting of various types of mortgage products, found that racial disparity in the mortgage contract rate is both statistically and economically significant. The newer SCF data used by Cheng et al. (2015) contain detailed information on a borrower’s race as well as the borrower’s characteristics including income, wealth, debts, credit quality, age, and education. Based on the 1989–2001 AHS data, Boehm et al. (2006) employed the Blinder’s decomposition approach commonly used in empirical studies on labor discrimination and found empirical evidence of racial discrimination in the mortgage market.

Although both the SCF data and the AHS data appear to provide rich data sources for analysis of racial disparity in mortgage lending, they have some drawbacks. The SCF data include detailed information on the borrower, but do not contain detailed information on the attributes of the loan or the collateral. The AHS data, which is a household-level data set, fails to provide information on a household’s financial condition (e.g., a borrower’s credit score), but includes detailed information on a borrower’s personal traits, the loan terms, and the collateral. These limitations have led to the use of proprietary data sets to investigate the issue of racial disparity in mortgage lending, with the hope these data will contain information to better account for risk.

Based on a proprietary data set of conventional loans and FHA/VA loans originated by a national home mortgage lender from 1988 to 1989, Crawford and Rosenblatt (1999) focused on differences in yield premiums across racial groups. They conclude that conventional loan interest rates are race-neutral, and although African American borrowers are shown to pay a smaller premium on average, the difference is not economically significant. Black et al. (2003) utilize another proprietary data set from a major mortgage lending institution to investigate whether racial differences in overage pricing exist in either the home purchase mortgage market or refinancing market.Footnote 3 Their results indicate minority borrowers are significantly more likely to pay a positive overage than similarly situated white borrowers for home purchase mortgage loans. However, they did not find any significant difference in overage pricing across racial groups in the refinancing mortgage market. Courchane (2007), using a sample of conventional home purchase loans and refinancing loans, fail to provide any evidence of racial discrimination in either of the two mortgage markets. By contrast, Zhang (2013), by matching proprietary loan data from a national bank to HMDA data to obtain information on a borrower’s race, finds non-Hispanic Asians and non-Hispanic African Americans tend to receive a lower price for their first-lien conventional loans for home purchase.

Other studies have investigated the issue of racial discrimination in mortgage lending by emphasizing the issue of redlining. Nothaft and Perry (2002) match their loan origination data to census survey data to obtain the racial composition of a borrower’s neighborhood. They find that borrowers in predominately Hispanic or Asian neighborhoods pay slightly higher interest rates, while borrowers in predominately African American neighborhoods occasionally pay slightly lower interest rates. By contrast, also using census survey data to measure the racial composition of a borrower’s neighborhood, Kau et al. (2012) reveal that borrowers in predominantly African American neighborhoods or Hispanic neighborhoods tend to prepay less frequently, and after the predicted loan default and prepayment probabilities are controlled for, only borrowers in predominately African American neighborhoods pay significantly higher contract rates.

A recent study by Ghent et al. (2014), utilizing loan level data combined with Home Mortgage Disclosure Act (HMDA) data to obtain a borrower’s race and ethnicity as well as the racial and ethnic composition of a borrower’s neighborhood, simultaneously investigate the impact of both a borrower’s race and the neighborhood’s traits on loan contract rate after accounting for the predicted loan default and prepayment probabilities.Footnote 4 Their results provide empirical evidence of adverse pricing against African Americans and Hispanics. In addition, they also find that the evidence of adverse pricing is strongest for home purchase mortgage loans and loans originated by non-depository institutions. Bayer et al. (2017) find that minority borrowers tend to be more likely to obtain a high cost loan, and racial and ethnic disparities can be largely explained by the lender’s foreclosure risk.

The current study utilizes similar technique of Kau et al. (2012), but is able to focus on the impact of the individual borrower’s race and ethnicity on loan pricing. The technique employed allows the current study to relax the orthogonality assumption of Ghent et al. (2014) so that the contract rate of a loan is allowed to impact loan performance. Additionally, a semi-parametric matching approach is utilized in the current study as a robust test. The results show minorities may pay higher mortgage rates despite regulation that prohibits discrimination, but the results are weak in that the discrimination result is not present in a matched sample; however, the study unambiguously shows minorities pay higher rates than would be the case if regulation allowed lenders to compensate minorities that prepay less frequently by offering those minorities lower mortgage interest rates.

Model

The framework of the modeling is established from a lender’s perspective, as a loan’s contract rate is determined at the time of loan origination. In setting the contract rate of a loan, the risk level of the loan at origination is assessed, including the likelihood of a borrower to default upon and to prepay a loan over the loan’s life. It is assumed lenders have an accurate model to evaluate loan termination risk. In our model, the borrower’s loan termination behavior is derived from the borrower’s default and prepayment behavior observed in the sample.

Borrower’s Loan Termination Behavior Model

The borrower’s loan termination behavior is modeled using the Cox discrete-time competing-risks loan hazard model (Deng et al. 2000). The model takes into consideration that default and prepayment are substitutes (Kau et al. 1992). The borrower’s loan termination behavior is observed and modeled at monthly intervals (in other words, for each loan, the information is restructured to have one observation for each month in which the loan is current). For each month of each loan subsequent to loan origination, loan performance data is used to determine whether a borrower continued, prepaid, or defaulted upon a loan. This discrete-time model solves the issue of left truncation and right censoring which are common issues in the mortgage literature, and allows the loan default and prepayment probabilities to be estimated appropriately.Footnote 5 Specifically, a multinomial logit model is utilized to model a borrower’s loan termination behavior asFootnote 6:

$$ \ln \left({p}_{jt}/{p}_{0t}\right)={\delta}_j{\alpha}_{jt}+{\varphi_j}^{\hbox{'}}{\mathbf{q}}_{jt}+{\beta_j}^{\hbox{'}}{\mathbf{x}}_{jt}+{\varepsilon}_{jt}\kern1.5em j=1,2 $$
(1)

where p0t is the probability of a loan being continued in period t; pjt is the probability of a loan being terminated in period t given this loan had been continued by the beginning of period t, where j = 1 is the probability of default at time t, and j = 2 is the probability of prepayment at time t. In this study, t refers to mortgage time. The variable αjt is the baseline hazard rate for default and prepayment respectively, and is allowed to vary by mortgage time. The vector qjt includes a set of variables indicating a borrower’s race and ethnic group, as well as variables measuring the racial and ethnic composition of a borrower’s neighborhood at mortgage time t. The vector of non-racial covariates (xjt) describes the observed characteristics of the loan, the borrower, the collateral, the neighborhood, and the economic conditions. These covariates may or may not be time varying.

It is assumed in the model that the possible future mortgage interest rate path is of particular interest to the lender when predicting loan termination probabilities and is among the time-varying covariates in xjt.Footnote 7 In this study, the 10-year treasury constant maturity yield is adopted as the benchmark for the mortgage interest rate of 30-year fixed-rate residential mortgage loans.Footnote 8 In order to predict the possible future 10-year yield path seen from loan origination yt(y0), the commonly used Cox, Ingersoll and Ross (CIR) term structure model is employed.Footnote 9 In this term structure model, the whole term structure is assumed to be driven by a spot interest rate (r(t)). This spot interest rate is assumed to follow a mean-reverting stochastic process with volatility affected by the level of the spot rate as shown in Eq. (2):

$$ dr(t)=\gamma \left(\theta -r(t)\right) dt+\sigma \sqrt{r(t)} dz(t) $$
(2)

where the first term on the right side of the equation is the deterministic part with θ as the long-term mean of the spot interest rate and γ as the reversion rate, whereas the second part describes the stochastic movements.

Using the estimated parameters in Equation (2),Footnote 10 the density of future spot interest rate for any forecast interval conditional on the spot interest rate at origination dF(r(t)| r(0)) is forecasted, based on the transition density of the spot interest rate implied by the CIR term structure model.Footnote 11 As a change in the spot interest rate in the future (r(t)) is believed to result in a change in the 10-year yield in the future (yt(r(t))), and the latter impacts the probability of a borrower defaulting upon or prepaying a loan, the forecasted conditional density of future spot interest rate dF(r(t)| r(0)) is employed to predict default probability and prepayment probability for each loan in each period t as in Equations (3) and (4)Footnote 12:

$$ {\widehat{p}}_{1t}\left({y}_0\right)=\int {p}_{1t}\left[{y}_t\left(r(t)\right)\right]\ dF\left(r(t)|r(0)\right) $$
(3)
$$ {\widehat{p}}_{2t}\left({y}_0\right)=\int {p}_{2t}\left[{y}_t\left(r(t)\right)\right]\kern0.5em dF\left(r(t)|r(0)\right) $$
(4)

where \( {\widehat{p}}_{1t}\left({y}_0\right) \) is the predicted default probability in period t seen from loan origination given that this loan had been continued by the beginning of period t and \( {\widehat{p}}_{2t}\left({y}_0\right) \) is the predicted prepayment probability in period t. Here, as the forecasted conditional future spot interest rate (r(t)| r(0)) is a continuous stochastic variable, in Equations (3) and (4) the integrated expectations are numerically approximated through a discretization approach in which the spot interest rate domain was divided into numerous but finite intervals.

The predicted probability of each termination event in any particular period t, calculated based on Equations (3) and (4), is aggregated over a 10-year span to generate a total predicted probability of each event \( {\widehat{P}}_k \) (k = 1, 2) seen from origination as in Equation (5).Footnote 13 This reflects the lenders’ concern on the total predicted probability of each event over the life of a loan rather than that of a time-specific predicted probability for a particular period t.

$$ {\widehat{P}}_k=\sum \limits_{t=1}^T{\left(1+\frac{y_0}{12}\right)}^{-t}{\widehat{p}}_{kt}\prod \limits_{s=1}^{t-1}\left(1-\sum \limits_{k=1}^2{\widehat{p}}_{ks}\right)\kern5.5em k=1,2 $$
(5)

Here, \( {\widehat{p}}_{kt} \) is the predicted probability of event k in period t given that the loan had been continued by the beginning of period t with the probability as \( \prod \limits_{s=1}^{t-1}\left(1-\sum \limits_{k=1}^2{\widehat{p}}_{ks}\right) \). Hence, \( {\widehat{p}}_{kt}\prod \limits_{s=1}^{t-1}\left(1-\sum \limits_{k=1}^2{\widehat{p}}_{ks}\right) \) is the unconditional predicted probability of event k in period t. These probabilities are discounted by the 10-year yield at origination (y0) with the assumption that loan termination at earlier stages of a loan is more severe to the lenders. Summation of discounted unconditional time-specific predicted probability of event k over a 10-year window results in a total predicted probability of each event (\( {\widehat{P}}_k \)).

Loan Contract Rate Determination Model

Risk-based pricing by lenders depends on the assessment of a loan’s risk level. To capture termination risk, the total predicted probabilities (\( {\widehat{P}}_k \)) based on borrower’s termination model are included in a linear loan contract rate determination model, Eq. (6). Such a model can be written as:

$$ {C}_0={\alpha}_0{y}_0+{\beta}_1{\widehat{P}}_1+{\beta}_2{\widehat{P}}_2+{\lambda}^{\hbox{'}}\mathbf{b}+{\xi}^{\hbox{'}}\mathbf{z}+\varepsilon $$
(6)

In addition to the predicted termination probabilities, the model includes: 1) the 10-year yield at origination (y0) which is believed to be the benchmark used to set the interest rate for 30-year fixed-rate mortgage loans; 2) a set of race and ethnicity variables, b, including variables indicating a borrower’s race and ethnicity group as well as measures of the racial and ethnic composition of a borrower’s neighborhood at origination; and 3) a set of non-racial variables at origination (z) describing the traits of the loan, the borrower, the collateral, and the collateral’s neighborhood. The estimation of this model allows one to test whether a borrower’s race and ethnicity have an effect on the contract rate after controlling for termination risk as well as the benchmark interest rate.

Data and Specifications

Data

The data used in this study contain 30-year first-lien fixed-rate home-purchase residential mortgage loans serviced by GMAC Residential Capital Company, LLC (GMAC ResCap). GMAC ResCap was a finance firm that provided home financing, loan servicing, and mortgage-back securities (MBS) issuance in the U.S. before the recent financial crisis. Loans in this data were all packaged into private-label mortgage-backed securities and traded in the secondary mortgage market. As the loan servicer, GMAC ResCap collected detailed information on the loan, borrower, and collateral at the time of loan origination. In addition, they also tracked the performance of each loan they serviced on a monthly basis. The monthly loan performance data provides information on the current interest rate and balance of a loan, as well as prepayment and delinquency status.

GMAC ResCap data do not provide information on a borrower’s race or ethnicity. This information is available in the Home Mortgage Disclosure Act Loan/Application Register (HMDA-LAR) data. The Home Mortgage Disclosure Act (HMDA) is a federal law that was enacted by the Congress in 1975 and implemented by the Federal Financial Institutions Examination Council (FFIEC). Starting in 1989, amendments to HDMA, resulting from the Financial Institutions Reform, Recovery, and Enforcement Act (FIRREA), required financial institutions to collect and disclose data on race, gender, and income of the applicant and the co-applicant (if applicable) for each loan application (the LAR data). It is estimated that in 2007, more than 8800 lending institutions reported their data to FFIEC, accounting for approximately 80% of all home lending nationwide (Avery et al. 2007).Footnote 14

To identify a borrower’s race and ethnicity, the loan data from GMAC ResCap were matched with the HMDA-LAR data. One difficulty in matching the data is the two data sets define the location of the property underlying the loan differently. Loans in the GMAC ResCap data are reported based on the zip code, while loan applications in the HMDA-LAR data are reported on the basis of the census tract. In addition, the definition of a census tract adopted in HMDA-LAR data varies across time.Footnote 15 To address this issue, we first matched the GMAC ResCap loan data to a source of readily available property transactions data from Miami-Dade County, FL, in order to identify the geographic coordinates of the property securing each loan. To make this matching feasible, the GMAC ResCap loan data were restricted to 30-year first-lien fixed-rate home-purchase mortgage loans with underlying properties located in Miami. This loan-property-sales matching is conducted based on the value of the underlying property, property sale month (loan origination month), property type, and zip code, leading to a matched loan-property-sales sample.Footnote 16 Using geographic coordinates from the property sales, the location of the loans within the 1980, 1990, and 2000 census tracts are identified. Each matched loan-property-sale combination was then matched to approved conventional loan applications in the HMDA-LAR data based on the loan amount (in thousand dollars), loan origination year (loan action year), property type, property occupancy status, loan purpose, lien status (if applicable), and census tract.Footnote 17 Only loans with a unique HMDA-LAR match are included for analysis in this study.Footnote 18

One issue with the use of a borrower’s race and ethnic information in the HMDA-LAR data is the reporting rules changed significantly in 2004 with regard to the classification of race and ethnicity. Prior to 2004, an applicant (co-applicant) was required to self-identify his/her race from among the five racial classifications: American Indian, Asian, African American, Hispanic, and White. However, since 2004, applicants were asked to separately report their ethnicity (Hispanic or non-Hispanic) and race (American Indian, Asian, African American, Native Hawaiian, and White). In order to align the race and ethnicity data pre- and post-2004, a hierarchy among all of the race and ethnic classifications is adopted.Footnote 19 Specifically, if an applicant post-2004 self-identified as an African American/American Indian/Asian/Native Hawaiian, no matter whether this applicant self-identified as a Hispanic or not, this applicant is treated as an African American/American Indian/Asian/Native Hawaiian. If an applicant reported Hispanic for ethnicity and White for race, he/she is treated as a Hispanic. Under this hierarchy, white borrowers are non-Hispanic whites. If the applicant and the co-applicant chose different racial and ethnic groups, a hierarchy ordering similar to the one described above is used. If either of the two applicants self-identified as an African American, that loan is considered to be taken by an African American borrower. If one applicant self-identified as a Hispanic and the other self-reported as a non-Hispanic white, that loan is treated as one to a Hispanic. If both of the two borrowers self-reported as non-Hispanic whites, the mortgage loan is considered to be given to a non-Hispanic white.

The matched loan-property-HMDA data were augmented with other data sources used to describe the characteristics of the underlying property’s neighborhood, based on the property’s location. The decennial census survey data, normalized to the 1990 census tract boundaries, was used to generate time-varying variables describing a neighborhood’s traits (including housing occupancy rate and poverty rate) on a monthly basis by using a linear time-trend between the decennial data from 1990 to 2000 and from 2000 to 2010. In addition, the yearly HMDA data was aggregated on the census-tract level to create measures of the demographic characteristics of each census tract, specifically, measures of median applicant income as well as the racial and ethnic composition of a neighborhood.Footnote 20 These aggregated HMDA neighborhood variables are based on a three-year window (the previous year, the current year, and the next year). To generate these variables, we use all of the loan applications including applications that were approved or denied for 1-to-4 family dwelling purchases in a census tract across all lenders. The resulting yearly census-tract level measures were further linearly time-trended to generate monthly measures.

Characteristics of the local housing market are measured by the change of and variation in housing prices within a neighborhood (census tract) using the property transactions data.Footnote 21 A median housing price index was generated to measure the changes in house price over the life of a loan relative to the time of origination. The inflation-adjusted median house price index was calculated for each census tract based on the 1990 census tract definitions, and for each month of analysis by creating a three-year window of sale, eighteen months before and eighteen months after that month.Footnote 22 The resulting index is a unique median house price index for each census tract on a monthly basis. The standard deviation of the housing sale prices is also calculated monthly for each census tract using the same three-year window to measure the heterogeneity in housing sale price in a neighborhood. Recent house price appreciation rate is calculated for each loan at origination using the change in the median house price on a census tract in the two 3-year periods prior to origination.Footnote 23

There were initially 1994 loans originated in Miami-Dade County, FL that could be matched to the property transaction and HMDA-LAR data sets. Loans with missing values on the loan, the borrower, the collateral or the neighborhood characteristics, or loans without completed loan performance records were deleted. The sample is restricted to loans originated from Jan. 1997 to Dec. 2006. The window could not be extended beyond 2006, due to very few loans being originated beyond this point. In addition, since only a few loans were originated to American Indian, or Asian, or Native Hawaiian borrowers, they were removed from the data. The final sample consists of 1404 observations of 30-year first-lien, fixed-rate subprime residential mortgage loans for home purchase. Their performance was tracked from Jan. 2000 to Oct. 2010, a period covering the recent mortgage crisis.

Model Specifications and Endogeneity Issue

On the basis of the option-theoretic model of mortgage loan “financial” termination (Kau et al. 1992), default and prepayment are driven by different financial incentives. Therefore, different baseline hazard rates (αjt) are applied to the default and prepayment hazards. A scaled Standard Default Assumption schedule (SDA) is used for default hazard and mortgage year fixed effects are employed to measure the prepayment baseline hazard rate.Footnote 24 Kau et al. (1992) note that changes in market interest rates and changes in the value of the collateral are the two most prominent time-varying factors affecting a borrower’s decision to default or prepay. In this discrete-time model, the market interest rate change at time t is measured by the gap between the 10-year Treasury constant maturity yield at loan origination and the 10-year yield at time t lagged by 2 periods (y0 − yt − 2).Footnote 25 To measure the change in the value of the underlying house, a monthly census-tract-level median housing price index is employed with the assumption that the value of a house changes at the same pace as the median housing price index in its census tract. For each mortgage time t, in order to reflect the changes in the house price since loan origination, the ratio of the median house price index level at time t to the one at origination is calculated, and this ratio is named the relative house price at time t (RHPt). Additionally, an interaction term of the market interest rate change (y0 − yt − 2) and relative house price at time t (RHPt) is also included in this competing-risks model to account for any correlation between interest rate change and house price change.

Variables describing the traits of the loan, borrower, property, and neighborhood are also included in the hazard models. Loan traits include the loan contract rate spread at origination (C0 − y0), original LTV, FICO score of the borrower, original loan amount, loan origination season fixed effects, and dichotomous indicators for whether a borrower provided full income documentation at origination, whether the loan is encumbered by a prepayment penalty at time t, whether the underlying property is owner occupied, and whether the property is a single family detached house or a condo.Footnote 26 Also, several time-varying variables are included to measure the characteristics and evolution of an underlying property’s neighborhood, including the heterogeneity in housing sale price, housing occupancy rate, poverty rate, and the median income of loan applicants at time t.Footnote 27

The specification of the default and prepayment models differs. In the default equation, the prepayment penalty variable at time t is excluded, whereas in the prepayment equation, the original LTV ratio, the income documentation status variable, and the variable of neighborhood-level heterogeneity in housing price at time t are excluded, because these variables are not believed to directly affect the corresponding hazard, and also because of identification purpose.Footnote 28

The contract rate C0 appears in the multinomial logit Equation (1) as part of the loan contract rate spread and will be endogenous if there are omitted variables in the estimation of the contract rate equation that are correlated with the errors in the estimation of Equation (1). In other words, if a lender believes a borrower is more likely to default or prepay, the lender would charge a higher contract rate (spread), however being charged a higher contract rate (spread) makes the borrower more likely to terminate a loan. Since the usual two-stage least square procedure is inappropriate of a non-linear logit model, we follow Petrin and Train (2003) and use a control function method to address this endogeneity issue. To implement the CF method, a reduced-form contract rate model is estimated in which the contract rate is regressed against all of the exogenous variables in the system. The residual from the reduced-form contract rate model is then included in the estimation of the multinomial logit model to solve the potential issue of endogeneity. Recall that the termination probabilities (pjt) are estimated based on the monthly observations of each loan and then combined with the estimated CIR term structure model according to Eqs. (3) and (4), and the results aggregated over time, according to Eq. (5), to obtain the expected probabilities (\( {\widehat{P}}_k \)) at origination.

Endogeneity is also an issue in the contract rate model (Eq. (6)) due to each of the predicted probabilities (\( {\widehat{P}}_k \)) in the final contract rate equation being a function of the contract rate (spread). To address this issue, a set of generated IVs (\( {\overset{\sim }{P}}_k \)) are used in the estimation of the final contract rate in place of the corresponding predicted probabilities (\( {\widehat{P}}_k \)). Each generated IV (\( {\overset{\sim }{P}}_k \)) is calculated as its corresponding generated variable (\( {\widehat{P}}_k \)), but with the actual contract rate spread (C0 − y0) being replaced with a predicted contract rate spread (\( {\widehat{C}}_0-{y}_0 \)). The predicted contract rate (\( {\widehat{C}}_0 \)) is obtained from the estimation of a reduced-form contract rate equation (the same reduced-form contract rate model used in the CF method, results not reported in the paper). The predicted contract rate (\( {\widehat{C}}_0 \)) is a function of all of the exogenous variables. Thus, each generated IV (\( {\overset{\sim }{P}}_k \)) is exogenous and serves as a valid IV for its corresponding generated variable (\( {\widehat{P}}_k \)) in the final loan contract rate equation. With those generated IVs, 2SLS is used to estimate the contract rate model, since it is a linear in the parameters.

Other covariates included in the loan contract rate determination model are the 10-year yield at origination (y0), as well as a set of covariates (z) that theory suggests should affect loan pricing.Footnote 29 These additional covariates include the original LTV, FICO score of the borrower, original loan amount, loan origination season fixed effects, prepayment penalty fixed effects, borrower’s income documentation status fixed effects, underlying property type fixed effects, property occupancy status fixed effects, and a list of neighborhood characteristics at loan origination including recent house price appreciation rate, heterogeneity in housing sale price, housing occupancy rate, poverty rate, and median income of loan applicants. Individual- and neighborhood-level race and ethnicity variables are included to test whether race and ethnicity affect loan pricing. Additionally, a trend term in calendar time is also included. Table 11 in Appendix A lists all of the variables used in this study and provides a detailed description of each variable.

In both the loan hazard model and loan contract rate model, some variables take a nonlinear function form based on prior studies and/or theoretical reasons. The original LTV ratio is transformed into categorical variables: loans with LTV ratio less than or equal to 80%, loans with LTV ratio greater than 80% but less than or equal to 90%, loans with LTV ratio greater than 90% but less than or equal to 100%, and loans with LTV ratio exceeding 100%. Furthermore, FICO score is entered as a continuous linear spline function with a knot point at 700 based on the assumption that once the FICO score is above a threshold (e.g., 700), an additional increase in FICO score would have relatively small marginal effects on loan termination probabilities/loan contract rate.Footnote 30 Finally, following prior studies, a quadratic function form of the original loan amount is utilized to allow for a non-linear relationship.

Statistical Descriptions

Table 1 presents a brief description of the characteristics of the loans at the time of loan origination in the pooled sample, including the traits of the loan, borrower, collateral, and neighborhood. The average original LTV ratio in the sample is approximately 85%, and 88 loans (6.27%) have a LTV ratio greater than 100%. More than half of the borrowers (52.35%) in the sample failed to provide full income documentation and 18.52% of the borrowers (260 loans) have a FICO score below 650. Approximately 33% of the loans are encumbered by a prepayment penalty. The average contract rate of 7.96% is relatively high, and is approximately 304 basis points higher than the 10-year treasury yield at origination. During the observation period of loan performance in this study, 122 loans (8.69%) of the 1404 loans were defaulted upon and ended up with a foreclosure/short sale/deed in lieu of foreclosure, while 1129 loans (80.41%) were prepaid. In this study, default is defined as the occurrence of a borrower being 90-days delinquent, and that occurrence eventually results in a foreclosure, short sale, or deed in lieu of foreclosure.

Table 1 Descriptive statistics of mortgage loans at loan origination

Table 2 provides descriptive statistics of the loans by the racial and ethnic group of the primary borrower. Around 71% of the loans (995 loans) in the sample were extended to Hispanic borrowers, about 22% of the loans (309 loans) were taken by non-Hispanic white borrowers, and approximately 7% of the loans (100 loans) were to African American borrowers. These proportions are close to the proportions from the census survey describing the racial and ethnic composition of all the residents in Miami-Dade County, FL. This indicates the working sample represents the overall Miami-Dade population.Footnote 31

Table 2 Descriptive statistics of mortgage loans at loan origination by race

Based on the information in Table 2, borrowers of different racial and ethnic groups appear to have different loan termination patterns. The observed average default rate within the study period by African American borrowers is around 13.00%, approximately twice as high as the observed default rate by non-Hispanic white borrowers (6.47%). Meanwhile, the observed prepayment rate by African Americas (77.00%) or by Hispanics (78.39%) is lower than that by non-Hispanic whites (88.03%). The observed differences in loan termination patterns among the three groups might be explained by the differences in the credit risk of a borrower(s). African Americans appear to have the highest credit risk simply based on the average original LTV and the FICO score, while non-Hispanic whites appear to have the lowest risk based on these measures. Additionally, African Americans seem to be more likely to be encumbered by a prepayment penalty. African American borrowers also tend to live in a neighborhood with a relatively lower recent housing price appreciation rate, a higher poverty rate, and lower median income. The observed differences in the characteristics of the loan, the borrower, and the neighborhood among the three groups indicate the importance of accounting for all of those traits when examining the relationship between a borrower’s race and loan hazard probability/ loan contract rate.

Results

Modeling Discrimination

The individual- and neighborhood-level race and ethnicity variables at time t are included in the estimation of the loan hazard probability to test if they are associated with the default and prepayment hazards after a set of loan risk factors in xjt are accounted for.Footnote 32 Ross and Yinger (2002) note that under fair-lending laws, it is illegal for lenders to use a borrower’s race and ethnicity as a proxy for unobserved risk factors by including race and ethnicity in the calculation of the predicted termination probabilities. They further note that all of the race and ethnicity variables should be included in the estimation of the loan hazard model, otherwise parameters of other variables in this hazard model would be biased if they are correlated with those race and ethnicity variables. Our base case strictly follows this guidance and thus fair-lending laws and regulations by including race and ethnicity variables in the estimation of the loan hazard model, but excluding (setting their coefficients to zero) them from predicting loan termination probabilities. This avoids what Ross and Yinger (2002) call statistical discrimination. We refer to these predicted probabilities as color-neutral. Separately, we build a regulation-free model where the termination probabilities are forecasted including the impact of race and ethnicity variables. This approach allows one to measure the impact of the over-assessment of termination risk in the contract rate.

Do Minorities Have Different Loan Termination Patterns?

The loan default hazard estimates, reported in Table 3, do not provide any empirical evidence that a borrower’s race and ethnicity are associated with loan default probability, as the estimates on the individual-level race and ethnicity variables are not statistically significant. In addition, the neighborhood-level race and ethnicity variables are not shown to affect loan default probability. These findings indicate that the differences in the observed average default rate across the three racial and ethnic groups reported in Table 2 are fully explained by the differences in the covariates incorporated in the default hazard model.

Table 3 Default hazard model estimates

Overall, the results indicate a borrower’s decision to default is mainly driven by financial incentives. The change in the value of the collateral affects the default probability. The coefficient on the relative house price at time t (RHPt) variable is statistically significant and negative. This is consistent with the option theory of mortgage loan termination (Kau et al. 1992) – as house price decreases, borrowers lose equity on their houses and tend to be more likely to default. In addition, estimation results on the variables describing the traits of the loan, the borrower, and the collateral, demonstrate that borrowers with an original LTV ratio above 100%, a lower FICO score, a larger original loan size, limited income documentation, or borrowers using a loan to purchase a house as investment rather than as their primary residence tend to be more likely to default on the loan. Note that the estimation results on the two FICO score variables indicate that when a borrower’s FICO score is above a threshold, 700, an additional increase in the FICO score does not have significant marginal effects on default likelihood. Meanwhile, the original loan size has a positive impact on default probability but at a decreasing rate as the original loan size increases. Among the neighborhood-level variables, only the estimate on the median applicant income variable is significant and negative. The results indicate borrowers living in a relatively poorer neighborhood tend to be more likely to default. These results are consistent with most of the prior studies.

Table 4 demonstrates the estimation results on prepayment hazard. The estimation results on the individual-level race and ethnicity variables indicate both African American borrowers and Hispanic borrowers are less likely to prepay than non-Hispanic white borrowers after other covariates are controlled for. This result is consistent with prior studies focusing on prepayment patterns of minority borrowers (Kelly 1995; Clapp et al. 2001; Deng and Gabriel 2006; Firestone et al. 2007). It is worth noting that all of those prior studies have found that minority borrowers are less likely to prepay. Possible explanations for this finding are those minority borrowers might face more obstacles gaining access to credit in the refinance market, have limited knowledge or information of mortgage refinance opportunities, or are less mobile. Because of the limitation of the data, we leave this question to future research. However, the results clearly demonstrate minority borrowers prepay less frequently and therefore have a relatively lower lending cost than similar non-Hispanic white counterparts.

Table 4 Prepay hazard model estimates

Overall, the results on other covariates are in accordance with theoretical expectations and prior studies. The results indicate changes in market interest rates play an important role in a borrower’s decision to prepay. As the market interest rate drops, borrowers are more likely to prepay, as rational borrowers always have financial incentives to take advantage of a lower interest rate through prepayment (Kau et al. 1992). In addition, the likelihood of a borrower prepaying a loan is shown to also be affected by the change in the value of the collateral. In a market seeing great house price appreciation, borrowers would have more incentives to prepay their loans to cash out the equity on their house.

In addition to market interest rate change and house price change, other factors are shown to affect the probability of loan prepayment. The contract rate spread at origination (C0 − y0) appears to be positively associated with loan prepayment probability. The contract rate spread (C0 − y0) measures the extent to which the contract rate at closing (C0) deviates from the baseline interest rate (y0), a measure of risk premium. The results indicate that a borrower with a higher risk premium tends to be more likely to prepay. This result may also reflect the impact of points used to buy down a contract rate. As theory suggests, a borrower anticipating a lower likelihood of prepaying a loan in the near future self-selects more points at loan closing in exchange for a reduced contract rate. Thus, the likelihood of a loan being prepaid is positively associated with loan contract rate (spread).Footnote 33 Estimation results on other covariates demonstrate borrowers with a higher FICO score or a larger loan size appear to be more likely to prepay. Meanwhile, borrowers encumbered by a prepayment penalty, or borrowers using a loan to buy a condo versus a single-family detached house tend to be less likely to prepay, ceteris paribus.

Do Minorities Pay Higher Contract Rates?

The results in Table 5 indicate that when a lender sets a loan’s contract rate, the lender takes into account how likely a borrower is to default or prepay in the future, as the coefficients on the predicted loan termination probability variables are significant and positive. Specifically, a 10-percentage-point increase in the default probability leads to a 15 basis point increase in the contract rate. A 10-percentage-point increase in the prepayment probability would lead to an increase in the contract rate by 26 basis points.

Table 5 Loan contract rate estimates (Color neutral) -2SLS a

The estimation results on other covariates in the loan contract rate determination equation are consistent with either theoretical expectations and/or prior studies. The estimate on the 10-year yield at origination is significantly positive and close to 1. The estimates on the three category variables of original LTV ratio are all significant and positive. The results show as the original LTV ratio increases, the loan contract rate rises monotonically. A borrower’s FICO score also plays an important role in determining a loan’s contract rate. Lenders are shown to charge a higher contract rate for borrowers with a lower FICO score. However, when a borrower’s FICO score is above 700, we fail to reject that an additional increase in FICO score does not marginally impact loan contract rate. In addition, lenders tend to charge a lower contract rate to borrowers who provide full income documentation at origination, as well as borrowers who use a loan to purchase a house as the primary residence rather than as an investment. Borrowers using a loan to purchase a condo are charged a higher contract rate on average than borrowers using a loan to purchase a single-family detached house. Among the neighborhood traits variables, the recent house price appreciation rate and the housing occupancy rate at origination are shown to affect loan contract rate in a negative way.

The estimation results on the original loan size variables and two prepayment penalty category variables require additional explanations. The results on the original loan size indicate as the original loan size rises, the contract rate decreases, but at a decreasing rate. This finding is consistent with most of the previous related studies. One possible explanation for this finding is that the original loan size is believed to be positively associated with a borrower’s income and wealth, and the latter is anticipated to be negatively correlated with loan contract rate. The estimates on the two category prepayment penalty variables are shown to be significant and positive. This finding may initially be somewhat puzzling given that the prepayment penalty transfers a portion of the prepayment risk from the lender to the borrower, and thus should reduce the contract rate. The results show that as the length of the penalty period increases, the contract rate decreases, indicating the penalty itself reduces the loan contract rate. The observed positive relationship between the prepayment penalty and loan contract rate reflects that borrowers selecting a prepayment penalty tend to be riskier, other things equal.Footnote 34

Turning our attention to the estimates on the individual- and neighborhood-level race and ethnicity variables. The results indicate an African American borrower pays a contract rate approximately 20 basis points higher than a non-Hispanic white borrower, ceteris paribus. Meanwhile, we fail to reject the null that the racial and ethnic composition of a borrower’s neighborhood, or being a Hispanic borrower, does not have a significant impact on a loan’s contract rate. Overall, we find empirical evidence of adverse pricing against African American borrowers in the subprime mortgage market over this study period, after controlling for the predicted loan termination probabilities as well as a set of covariates in this contract rate determination model. This result may be driven by aggressive lenders or high-risk lenders operating in minority markets (Ross et al. 2008) or members of minority groups may be less likely to comparison shop for mortgage products. Whatever the reason, African American borrowers are treated unfavorably in the mortgage market even with the presence of fair-lending laws and regulations.

Do Fair Lending Laws and Regulations Help or Hurt Minorities?

Recall that the results on the prepayment hazard model (Table 4) show that after controlling for a full set of covariates, minority borrowers including African American borrowers and Hispanic borrowers are less likely to exercise the prepayment option than non-Hispanic white borrowers. Therefore, those minorities have a relatively lower lending cost than their non-Hispanic white counterparts, ceteris paribus. In a competitive lending market, loans with a relatively lower lending cost should be originated at a relatively lower contract rate.Footnote 35 However, fair lending laws and regulations prohibit lenders from taking a borrower’s race and ethnicity into account in assessing a loan’s risk level. In other words, because lenders are unable to take a borrower’s race and ethnicity into account in assessing a loan’s risk level, lenders are overcompensated for loan termination risk when dealing with loans to minorities and thus disadvantaging minority borrowers.

In order to explore the extent to which minority borrowers are disadvantaged by those fair lending regulations, we estimate the contract rate determination model (Equation (6)) after allowing the termination probabilities to vary by a borrower’s race and ethnicity. Specifically, we re-calculate the termination probabilities based on the estimates from Eq. (1) allowing the probabilities to vary by race and ethnicity. We name these predicted probabilities regulation-free to distinguish them from the color-neutral probabilities used in the prior analysis. The regulation-free scenario allows lenders to recognize the lower termination risk of minority borrowers. The primary hypothesis of the regulation-free model is that the coefficients on the individual race and ethnicity variables will increase to reflect the lower rate these borrowers should be charged due to their lower level of prepayment risk. The results of the estimation of the contract rate model with regulation-free probabilities are reported in Table 6. The results show that both of the individual-level race and ethnicity fixed effects variables are significant and positive. Specifically, the coefficient estimate on the variable of African American is approximately 35 basis points, while the coefficient estimate on the variable of Hispanic is nearly 10 basis points. Recall that in the color-neutral base case with fair-lending laws and regulations, only one individual-level race and ethnicity variable, African Americans, is significant and its coefficient estimate is around 20 basis points, while the other one, Hispanics, is not (Table 5). Combining these results indicates that both African Americans and Hispanics are disadvantaged by fair-lending laws and regulations. African American borrowers pay an additional 15 basis points and Hispanic borrowers pay an additional 10 basis points for their mortgage loans. This unintended consequence of regulation is equivalent to an increase in monthly mortgage payment of approximately $19 for African American borrowers or of roughly $13 for Hispanic borrowers for a 30-year fixed-rate mortgage loan of $180,000 with an annual interest rate of 8%.Footnote 36 Note that the coefficient estimates on other covariates are similar between color-neutral and regulation-free scenarios.

Table 6 Loan contract rate estimates (Regulation free) -2SLS a

Robustness Test - a Matching Approach

To explore the robustness of the main result of this study, a matching approach is employed. The matching approach mimics risk-based pricing; borrowers with the same level of loan termination risk should be charged the same contract rate regardless of their race and ethnicity. The matching approach applies a quasi-experimental design to construct a control (racial) group of loans that are as similar as possible to a target (racial) group of loans in terms of loan termination risk. This matching technique should mitigate systematic differences in loan termination risk between the two racial and ethnic groups and allow for systematic differences in loan contract rate between the two groups to be examined. Note that in this study, the risk level of a loan is represented by the predicted probability of loan termination (\( {\widehat{P}}_k \)).Footnote 37

To generate a balanced matched sample, we use a nearest 1-to-1 matching process with replacement based on Mahalanobis distance along with calipers restrictions. Details on this matching approach have been provided in Appendix B. The resulting matched sample is examined to ensure the target and control group borrowers have the same distributions of loan termination risk, as well as the matching variables, to determine if a balanced sample was found. A standard measure used to determine balance in the matching literature (Rubin and Thomas 2000; Austin 2011) is a mean difference t-test. However, the mean difference t test has been criticized by many scholars (Imai et al. 2008; Austin 2008 & 2009) because the significance levels are confounded with sample size. Therefore, in addition to the mean difference t test, the standardized difference of the mean and the ratio of the variance of each variable is used to check if a matched sample is well balanced.Footnote 38 Following Rubin (2001), a variable is well balanced if and only if the standardized difference of the mean falls in the range of (−0.25, 0.25) and the ratio of the variance falls in the range of (0.5, 2). If the matched sample is determined not to be balanced for a specific caliper radius, the sampling process described in Appendix B is repeated with other values of caliper radius until a balanced matching sample is arrived at and in which there are sufficient loans for comparison across the two groups.Footnote 39 Following prior studies analyzing the optimal caliper radius ratio (ρ) for matching (Cochran and Rubin 1973; Rosenbaum and Rubin 1985; Austin 2011), an appropriate caliper radius ratio is adopted if the caliper radius ratio could significantly reduce the mean difference between matched pairs without losing many unsuccessful matching pairs. For the matching between African American borrowers and non-Hispanic white borrowers and the matching between Hispanic borrowers and non-Hispanic white borrowers, caliper radius ratios of 0.8 and1.0 turn out to meet this criterion. A caliper radius ratio (ρ) of 0.8 requires that the difference in a matching criterion variable between loans in two racial groups should not exceed 80% of the pooled standard deviation of that variable.

Using this matching process, we generate a matched sample for African American borrowers (target group) and non-Hispanic white borrowers (control group) and a second sample for Hispanic borrowers (target group) to non-Hispanic white borrowers (control group). These samples are generated separately based on the color-neutral probabilities and the regulation-free probabilities and reported for two caliper radii (ρ = 0.8 and ρ = 1.0) resulting in 8 samples. In the appendix, Tables 12,13,14,15 report the matching balance diagnosis for the color-neutral and the regulation-free scenarios for each of the matched samples. All of the matching criterion variables are balanced in those matching samples except for the original loan amount variable. The original loan amount variable is unbalanced in the matched sample for African American borrowers (target group) and non-Hispanic white borrowers (control group) when the caliper is equal to 1. The matched balanced samples allow us to compare the mean contract rate between the control and target groups to measure whether there is a systematic difference in the contract rate between the two groups. We also estimate the loan contract rate determination model (Eq. (6)) over each of the matched balanced samples to account for the potential locational/neighborhood variation in the matched loans.Footnote 40

Table 7 reports the results on nearest 1-to-1 matching between loans of African American borrowers and loans of non-Hispanic white borrowers based on two matching samples (ρ = 0.8 and ρ = 1.0), while Table 8 demonstrates the results on two matching samples between Hispanics and non-Hispanic whites. The samples in both Tables 7 and 8 are generated using the color-neutral probabilities. Panel A of the Tables report the mean difference in loan contract rate between the control and target groups and a t-value to test if the mean difference is zero. Panel B of these tables contains the regression adjustment results for each matching sample. Based on the matching of African American borrowers to non-Hispanic white borrowers, the results show little evidence of a significant difference in the contract rates between the two groups. While the mean difference in the ρ = 1.0 matched sample is statistically significant, it is not found to be significant once neighborhood characteristics are controlled for in the contract rate equation based on the same matched sample. The results based on the matching of Hispanic borrowers to non-Hispanic white borrowers show no evidence of a significant difference in the contract rates between the two groups.

Table 7 Results on nearest 1-to-1 Matching between African Americans and non-hispanic whites - color-neutral a
Table 8 Results on nearest 1-to-1 matching between hispanics and non-hispanic whites - color-neutral a

Tables 9 and 10 contain the matched sample results based on the results of the regulation-free scenario. The results based on the matched sample of African American borrowers to non-Hispanic white borrowers indicate African American borrowers pay a significantly higher contract rate. This result is consistent across the mean comparison and the contract rate estimation. In Table 10, the results based on the matching of Hispanic borrowers to non-Hispanic white borrowers indicate a significant difference in the contract rates between the two groups once neighborhood characteristics are controlled for in the contract rate equation.

Table 9 Results on nearest 1-to-1 matching between African Americans and non-hispanic whites - regulation-free a
Table 10 Results on nearest 1-to-1 matching between hispanics and non-hispanic whites - regulation-free a

The matching results provide evidence that minority borrowers are disadvantaged by fair-lending laws and regulations in the subprime mortgage market over this study period. Based on the regression results of the color-neutral matching samples, we fail to find any evidence of adverse pricing against minority borrowers. However, the results in Tables 9 and 10 demonstrate both African American borrowers and Hispanic borrowers pay a significantly higher contract rate, given their prepayment habits, than their non-Hispanic white counterparts. In other words, because lenders are not allowed to use race and ethnicity to evaluate risk, minority borrowers are charged a higher interest rate. The magnitudes of the race-based contract rate disparities shown in Tables 9 and 10 are slightly larger than those reported in Table 6.

Discussion

A limitation of the data used in this study is that the number of points paid by a borrower is not observed. As points are usually used by borrowers to buy down a loan contract rate, one might argue that a possible explanation for the higher contract rates paid by African American borrowers and Hispanic borrowers is they tend to choose less points on average at loan closing. However, in this study both African American borrowers and Hispanic borrowers are shown to be less likely to prepay, other things equal. Rational borrowers, anticipating a lower likelihood of prepaying a loan in the near future, would choose to pay more points, ceteris paribus.Footnote 41

The study is also limited by its focus on the Miami area. However, it is important to note that we are not alone in this prepayment finding, several prior studies have found empirical evidence that a borrower’s race and ethnicity are associated with loan prepayment patterns. Kelly (1995), using loan-level data from the VA’s mortgage program from 1971 to 1989, found that African American borrowers and Hispanic borrowers prepaid less frequently than whites. Based on a pool of fixed-rate residential mortgage loans originated from 1993 to 1994, with their performance tracked through 1998, Clapp et al. (2001) concluded that a borrower’s minority status is negatively associated with prepayment rate. The same conclusions could also be found in Deng and Gabriel (2006) and Firestone et al. (2007). While some of these studies are also based on city level data, several are based on national data (e.g., Kelly (1995), Deng and Gabriel (2006), and Firestone et al. (2007)). The result of minority borrowers prepaying with less frequency does not seem unique to the Miami metropolitan area.

Conclusion

Overall, this study provides empirical evidence that fair-lending laws and regulations may actually have unintended consequences with respect to mortgage pricing. What should not be overlooked is this consequence is a direct result of prepayment risk, a phenomenon more generally observed than default. Further, given the consideration that a borrower’s race and ethnicity may be correlated with loan termination behavior, we explicitly model and control for loan termination risk (default and prepayment) in the contract rate estimation rather than using a reduced-form approach. The results demonstrate that African American and Hispanic borrowers are less likely to prepay than non-Hispanic white borrowers. This loan performance finding provides us with mortgage pricing implications. In a completely competitive mortgage market, the lower prepayment risk would be reflected in the pricing of those loans.

Fair-lending laws and regulations prohibit lenders from using a borrower’s race or ethnicity in assessing a loan’s risk. In this color-neutral world, minority borrowers with a relatively lower prepayment risk would not be offered a lower contract rate given their prepayment habits. The empirical results provide little support for a contract rate disparity in this color-neutral world; the rates paid by borrowers, charged by lenders, are consistent with the goals of equal contract rates across racial and ethnic groups.Footnote 42

However, in this color-neutral world, the termination risk of minority borrowers does not reflect the true termination risk, minority borrower’s termination risk is overestimated given they are less likely to prepay. If the prepayment risk of minority borrowers is considered in determining termination risk, what we term as a regulation-free scenario, the estimation results on the loan contract rate model indicates African American borrowers pay an additional 15 basis points and Hispanic borrowers pay roughly 10 extra basis points for their contract rate as a result of regulation. Such an increase in contract rate raises the monthly payment by $19 for African Americans and $13 for Hispanics based on the average size and contract rate of the loans in the sample. These results are robust to a matching approach.