Introduction

Prepayment hazard modelling is of great importance in forecasting the expected future cash flows from a mortgage loan, and hence is essential to the valuation of mortgage loans and mortgage-backed securities (MBS). Mortgage lenders, firms engaged in mortgage securitization, and investors may suffer losses if the actual prepayment rate deviates from the expected one. While default has garnered more attention because of the loss of initial capital, mortgage prepayment occurs much more frequently.Footnote 1 Thus, understanding a borrower’s prepayment propensity is an important issue to mortgage pricing.

The standard way to model a borrower’s propensity to prepay a mortgage loan is to adopt the basic principle of option theory that a mortgage loan is embedded with both default and prepayment options. In terms of prepayment, as the interest rate drops, when the current value of a mortgage loan exceeds its par value, the call option is “in-the-money,” and a rational borrower should prepay.Footnote 2 Accordingly, there has been extensive literature deriving the optimal time for a borrower to prepay (Dunn and McConnell, 1981a & 1981b; Shoven 1986; Hendershott and Van Order 1987; Kau et al. 1992; Follain et al. 1992; McConnell and Singh 1994; Agarwal et al. 2013; among others). However, prior research suggests that the empirically observed mortgage prepayment rates deviate from those predicted by theory (e.g., Giliberto and Thibodeau 1989; Chang and Yavas 2009; Agarwal et al. 2015).Footnote 3 In some cases, borrowers fail to exercise the “in-the-money” option, whereas in other cases borrowers prepay when the interest rates had risen.

Various motivations behind prepayment may help explain the differences in prepayment rates, including: 1) sale of the house caused by relocation, divorce, unexpected negative income shocks (e.g., unemployment, unexpected medical bills), and so forth; 2) refinance to obtain a lower interest rate (rate refinance); 3) refinance to extract equity from collateral for consumptions, other debt repayments, or investments (cash-out refinance); 4) refinance to reduce monthly mortgage payments due to improvements in a borrower’s creditworthinessFootnote 4; 5) fully repay the mortgage debt due to positive income shocks; and so on. Empirically, it is difficult to disentangle prepayments due to one motivation from those arising from others.Footnote 5

Following some prior theoretical studies on mortgage terminations and pricing of mortgage loans and MBSs (Dunn and McConnell 1981a, b; Brennan and Schwartz 1985; Kau et al. 1992; Archer and Ling 1993), the current study attempts to model a borrower’s prepayment behavior by making a distinction between pecuniary and non-pecuniary prepayments and to incorporate both of them in mortgage pricing. Pecuniary prepayment occurs when the market interest rate at the time of prepayment has dropped below that at the time of origination; while non-pecuniary prepayment occurs when the market interest rate at prepayment is greater than or equal to the rate at origination.Footnote 6 Both types of prepayment disturb the timing and amount of the expected cash flows of a mortgage loan, and prevent lenders/MBS investors from receiving the anticipated future interest rate payments. Additionally, both of them affect the effective maturity and duration of MBSs. MBS investors typically forecast the prepayment rate of a mortgage pool based on which they also anticipate effective maturity and duration of their MBSs, calculate the MBS yield, and determine the strategy to hedge against risks associated with their MBSs in their portfolio (Hancock and Passmore 2011).Footnote 7 As hedging MBS is costly, a failure to accurately account for pecuniary or non-pecuniary prepayment probability, which results in an inaccurate estimate of the effective maturity and duration of the MBS, would incur extra hedging costs. Besides, given the competing-risks nature of various forms of loan termination, the occurrence of non-pecuniary prepayment would limit the value of either default or prepayment option, and thus impacts mortgage/MBS price (Kau et al. 1992; Archer and Ling 1993). Overall, as stated in the prior theoretical studies (e.g. Dunn and McConnell, 1981), in an efficient market, the possible occurrences of both pecuniary and non-pecuniary prepayment will be reflected in the price of a mortgage loan/MBS, and a mortgage pricing model should incorporate not only pecuniary prepayment but also non-pecuniary prepayment.

Also indicated by those prior theoretical studies, these two types of prepayment may be determined by different factors. While pecuniary prepayment is believed to be mainly driven by a decline of market interest rate, non-pecuniary prepayment occurs due to some non-interest-rate reasons. In addition, these two types of prepayment may result in different consequences. When pecuniary prepayment occurs, lenders/investors can only reinvest the proceeds from a prepaid loan at a relatively lower interest rate. By contrast, upon the occurrence of non-pecuniary prepayment, lenders/investors can reinvest the proceeds at a relatively higher interest rate. Additionally, as pointed out by Archer and Ling (1993), in a pool of mortgage loans with heterogeneous prepayment transaction costs, due to the “burnout” phenomenon,Footnote 8 the occurrence of pecuniary prepayment truncates the distribution of the call price after the consideration of the transaction costs, while non-pecuniary prepayment would not have an impact on it.Footnote 9 Thus they conclude that a prepayment model must distinguish between pecuniary and non-pecuniary prepayments.

This study utilizes a competing-risks hazard model which accounts for pecuniary and non-pecuniary prepayments as well as default hazards. The paper examines whether market conditions (interest rate change, local house price change and volatility), loan characteristics, borrower traits, underlying property characteristics, and/or neighborhood traits have different impacts on pecuniary vs. non-pecuniary prepayment hazards. In addition, a loan pricing behavior is examined when cumulative probabilities of default, pecuniary prepayment, and non-pecuniary prepayment for each loan, as well as simulated future interest rate paths based on a term structure model, are included in a contract rate determination model to test if pecuniary and non-pecuniary prepayment risks separately affect the contract rate of a loan.

The analysis is based on a sample of 30-year first-lien fixed-rate nonconforming conventional mortgage loans for home purchases in Miami-Dade County, FL originated from 1997 to 2006, with loan performance being observed between 2000 and 2010. The results clearly show that pecuniary and non-pecuniary prepayments are induced by different factors and affected differently by certain overlapping factors. Not surprisingly, the market interest rate change has a significant impact on the likelihood of pecuniary prepayment, but not on the probability of non-pecuniary prepayment. The results indicate that borrowers living in an affluent neighborhood tend to be more likely to pecuniarily prepay and less likely to non-pecuniarily prepay compared to borrowers living in a relatively poorer neighborhood. The results also show combining these two distinct types of prepayment into a single measure shifts some coefficients towards zero and hence yield inaccurate predictions of loan termination probabilities. The contract rate determination model reveals that both pecuniary and non-pecuniary prepayment risks contribute when pricing mortgages.

The empirical results of the current study hence demonstrate that separating non-pecuniary and pecuniary prepayments would help mortgage lenders, financial intermediates, and MBS investors make some improvements on mortgage termination predictions and loan pricing, especially given the recent launch of the Uniform Mortgage-Backed Security (UMBS) program. In June, 2019, under the direction of the Federal Housing Finance Agency (FHFA), Fannie Mae and Freddie Mac started issuing a new common MBS, called the UMBS, in replacement of their current offerings of To-Be-Announced (TBA)-eligible MBSs.Footnote 10 The major goal of this program is to improve the liquidity of the overall TBA MBSs market, and thus the liquidity of the housing finance market. A major concern on this UMBS launch is the loan quality in terms of default and prepayment risks should be very similar for Fannie and Freddie mortgage loans, otherwise investors may see a “race to the bottom” in loan quality.Footnote 11 The major finding of this study indicates a mortgage pricing model which distinguishes pecuniary and non-pecuniary prepayments and incorporates both of them in loan pricing should be considered by Government-Sponsored Enterprises (GSEs) and regulators to ensure that prepayment and default risks are truly similar.

The paper is organized as follows. The second section offers an overview and a discussion of previous studies on mortgage prepayment hazard modelling. The third section explains the model in details. The fourth section describes the data and explains model specifications. The fifth section presents the results. The final section provides the conclusion.

Literature Review

Prepayment has long been of interest to both scholars and practitioners interested in the valuation of mortgage loans and mortgage-backed securities. Earlier studies, both theoretical and empirical, emphasize the basic principle of option theory that prepayments are simply assumed to be mainly induced by interest rate fluctuations. However, the observed borrowers’ prepayment behavior deviated significantly from the predications of the option-based model (e.g., Giliberto and Thibodeau 1989).

Recent studies on prepayment have attempted to mimic the complexity of the option-based model by incorporating additional explanatory variables, especially variables describing the characteristics of the borrower and the loan (e.g., LaCour-Little 1999; Bennett et al. 2000, 2001). A number of prior studies also emphasize the role of transaction costs, which may vary across borrowers, in a borrower’s decision to prepay (e.g., Follain et al. 1992; Stanton 1995; Archer et al. 1997; Dunn and Spatt 2005). Institutional constraints may also play an important role in affecting a borrower’s decision to prepay a loan, as they affect a borrower’s ability to qualify for a new mortgage loan. Overall, previous studies indicate that the probability of a borrower exercising the prepayment option is positively affected by their post-origination income (Archer et al. 1996; Deng and Gabriel 2006), equity in housing (Archer et al. 1996; Peristiani et al. 1997; Green and LaCour-Little 1999), and credit ratings (Peristiani et al. 1997; Bennett et al. 2001). Additionally, the role of the broker (LaCour-Little and Chun 1999), the impact of prepayment penalties (Mayer et al. 2013; Beltratti et al. 2017; etc.), the effects of points (Chang and Yavas 2009), and the consideration of a household’s life cycle (Quigley 1987) may also affect a borrower’s propensity to prepay.

Other recent literature attempts to distinguish among various motivations for mortgage prepayments. Using unique loan-level data, LaCour-Little (1999) focuses on the “pure” refinancing behavior after eliminating borrower mobility and liquidity demand factors, while explicitly controlling for borrower and loan characteristics.Footnote 12 He finds that borrowers and loan traits do affect “pure” refinancing risk, though primarily in the region where the prepayment option is “at-the-money.” By matching mortgage termination data to housing transaction data, Pavlov (2001), Clapp et al. (2000, 2001) disentangle prepayments arising from mobility from those due to in-place refinancing, and investigate whether these two types of prepayments are driven by different factors within a competing-risk model framework. Pavlov (2000) finds that in-place refinance risk is highly sensitive to interest rate change and the value of a mortgage, while mobility risk is sensitive to local economic conditions and independent of the value of the mortgage. Clapp et al. (2000) show that borrowers with high expected mobility/refinance risk are not more likely to refinance/move, indicating the necessity of differentiating movers from refinancers when evaluating the prepayment risk of a loan. Clapp et al. (2001) demonstrate that refinances are driven more by financial incentives like the market value of a loan, while a borrower’s characteristics including age, income, and minority status tend to have more influence on the move decision. They also show that combining these two distinct choices into a single prepayment measure may yield an inaccurate prediction of a loan’s termination risk. In addition, some studies (e.g., Pennington-Cross and Chomsisengphet 2007; LaCour-Little et al. 2010; Bhutta and Keys 2016) focus on factors affecting a borrower’s cash-out refinancing decision as well as refinancing amount. For example, Pennington-Cross and Chomsisengphet (2007) find that cash-out refinancing is mainly driven by house price appreciation and interest rate drops, and affected negatively by neighborhood-level median income.

Although some prior studies are able to distinguish among various types of (motivations for) prepayments due to their unique datasets, a majority of the mortgage performance studies are not able to and thus employ a single prepayment model. The division of prepayment by the expected interest rate change, as in the current study, represents a common market environment in which to view prepayment. This division is also supported by some prior theoretical studies on mortgage/MBS pricing which highlighted the importance and necessity of distinguishing pecuniary and non-pecuniary prepayments from each other. For example, Dunn and McConnell (1981a, b) and Brennan and Schwartz (1985) use a frictionless optimal call model to explain a borrower’s pecuniary prepayment behavior, to which a Poisson process is added in order to model non-pecuniary prepayment. The simulation results by Dunn and McConnell (1981a, b) show that non-pecuniary prepayment has a positive effect on Ginnie Mae (GNMA) MBS price and reduces its interest rate risk and expected return. Brennan and Schwartz (1985) arrived at similar conclusions. Both by specifying a conditional probability of non-pecuniary prepayment, Kau et al. (1992) and Archer and Ling (1993) allow pecuniary and non-pecuniary prepayments to interact with each other, and to disentangle the separate effects of interest rate call and non-pecuniary prepayment on mortgage/MBS price. Kau et al. (1992) concluded that non-pecuniary prepayment in the presence of pecuniary prepayment should increase the value of a mortgage loan to a lender. The simulation results by Archer and Ling (1993) support this conclusion, however, they also found that the impact of non-pecuniary prepayment on the value of call depends on the shape of the yield curve. In addition, empirically, a distinction between pecuniary and non-pecuniary prepayments allows for the examination of important factors and differences in impacts on loan pricing of these factors. Further, this study explores the loan pricing implications of separating pecuniary and non-pecuniary prepayments. Thus, the current study is able to examine the pricing impact of default as well as pecuniary prepayment, and non-pecuniary prepayment on loan pricing.

Model

Mortgage Loan Performance Analysis

The borrower’s loan termination behavior is modelled, on a monthly basis, using the Cox discrete-time competing-risks loan hazard model (Deng et al. 2000).Footnote 13 The loan performance data is restructured to have one observation for each loan and each month until the loan is terminated or until the end of the loan performance observation window. A distinction is made between pecuniary and non-pecuniary prepayments depending on whether the market interest rate at prepayment is below or above the one at loan closing. Therefore, each month, the borrower of a loan is observed to either continue, or default upon, or pecuniarily prepay, or non-pecunairily prepay a loan. Specifically, a multinomial logit model with four outcomes is utilized to model a borrower’s loan termination behavior as in Eq. (1)Footnote 14:

$$ \ln \left({p}_{jt}/{p}_{0t}\right)={\delta}_j{\alpha}_{jt}+{\beta}_j\hbox{'}{\mathbf{x}}_{jt}+{\varepsilon}_{jt}\kern1.5em j=\mathrm{1,2,3} $$
(1)

where p0t is the probability of a loan being current in period t, while pjt is the probability of a loan being terminated either through default (j=1), pecuniary prepayment (j=2), or non-pecuniary prepayment (j=3) in period t given that this loan is still current at the beginning of period t. In Eq. (1), t represents mortgage time. αjt refers to the baseline hazard rate of each event and is a function of mortgage time. The vector xjt includes a set of covariates describing the market conditions (interest rate change and house price change), the characteristics of the loan, the borrower, the underlying property, as well as the neighborhood. Some covariates are time varying while others are time invariant. This multinomial logit model thus allows one to examine whether pecuniary and non-pecuniary prepayments are driven by different factors and whether those covariates have different impacts on them.

Mortgage Loan Pricing

On the basis of this discrete-time competing-risks loan hazard model estimation results, this study further mimics mortgage lenders’ loan pricing behavior at the time of loan origination, predicting the probabilities of default, pecuniary prepayment, and non-pecuniary prepayment for each loan in the sample. Evaluating the probability of each event separately for each loan allows the investigation of whether these risks are taken into consideration as a part of mortgage pricing.

When predicting loan termination probabilities, the forecast of future market interest rate change is of particular interest to the lenders. Following prior studies, and the fact that most of the loans in the sample of this study were terminated within the first 10 years, this study adopted the 10-year treasury constant maturity yield (y) as the benchmark market interest rate for 30-year fixed-rate fully-amortizing residential mortgage loans.Footnote 15 The one-factor Cox, Ingersoll and Ross (CIR) term structure model is utilized to predict the possible paths (density) of future market interest rate (10-year treasury yield) seen from loan origination yt(y0).Footnote 16 Specifically, in this CIR term structure model, the whole yield curve is assumed to be driven by a spot interest rate (r(t)), and this spot interest rate is assumed to follow a mean-reverting stochastic process with its volatility impacted by the level of the spot interest rate as below:

$$ dr(t)=\gamma \left(\theta -r(t)\right) dt+\sigma \sqrt{r(t)} dz(t) $$
(2)

where θ is the long-term mean of the spot interest rate, γ is the reversion rate, σ is the volatility, and z(t) is a Wiener process. Hence, γ(θ − r(t)) is the deterministic part and ensures mean reversion of the spot interest rate towards its long-term mean, and \( \sigma \sqrt{r(t)} dz(t) \) describes the stochastic movements with \( \sigma \sqrt{r(t)} \) as its volatility.

Based on the estimated parameters in Equation (2),Footnote 17 the density of future spot interest rate for any forecast interval conditional on the spot interest rate at the time of loan origination dF(r(t)| r(0)) is forecasted.Footnote 18 As the whole yield curve is assumed to be driven by the spot interest rate, a change in the future spot interest rate (r(t)) would lead to a change in the future 10-year treasury yield, and would further affect the probability of loan termination. Therefore, the forecasted conditional density of future spot interest rate dF(r(t)| r(0)) is used to predict the probabilities of default, pecuniary prepayment, and non-pecuniary prepayment for each loan at each mortgage time t as in Equations (3)–(5).Footnote 19

$$ {\hat{p}}_{1t}\left({y}_0\right)=\int {p}_{1t}\left[{y}_t\left(r(t)\right)\right]\ dF\left(r(t)|r(0)\right) $$
(3)
$$ {\hat{p}}_{2t}\left({y}_0\right)=\underset{y_t\left(r(t)\right)<{y}_0}{\int }{p}_{2t}\left[{y}_t\left(r(t)\right)\right]\ dF\left(r(t)|r(0)\right) $$
(4)
$$ {\hat{p}}_{3t}\left({y}_0\right)=\underset{y_t\left(r(t)\right)\ge {y}_0}{\int }{p}_{3t}\left[{y}_t\left(r(t)\right)\right]\ dF\left(r(t)|r(0)\right) $$
(5)

Where \( {\hat{p}}_{1t}\left({y}_0\right) \) is the predicted default probability at mortgage time t seen from loan origination given that this loan has been current till the beginning of period t, \( {\hat{p}}_{2t}\left({y}_0\right) \) and \( {\hat{p}}_{3t}\left({y}_0\right) \) are time-specific predicted conditional pecuniary and non-pecuniary prepayment probabilities seen from loan closing, respectively. Recall that, pecuniary prepayment is defined as the one when the market mortgage interest rate at prepayment (10-year treasury yield, yt(r(t))) drops below the rate at origination (y0); while non-pecuniary prepayment is defined as the one when the market interest rate at prepayment is equal to or above the rate at loan closing.

The predicted time-specific conditional probability of each event from Equation (3)–(5) is aggregated over a 5-year window to calculate a predicted cumulative probability of each event \( {\hat{P}}_k \) (k = 1, 2, 3) seen from origination as in Equation (6).Footnote 20

$$ {\hat{P}}_k=\sum \limits_{t=1}^T{\left(1+\frac{y_0}{12}\right)}^{-t}{\hat{p}}_{kt}\prod \limits_{s=1}^{t-1}\left(1-\sum \limits_{k=1}^3{\hat{p}}_{ks}\right)\kern5.5em k=1,2,3, $$
(6)

where \( {\hat{P}}_{kt} \)is the predicted probability of event k in period t given that the loan has been active by the beginning of period t with the corresponding continuation probability as \( \prod \limits_{s=1}^{t-1}\left(1-\sum \limits_{k=1}^3{\hat{p}}_{ks}\right) \). The time-specific unconditional predicted probability of event k in period t, \( {\hat{p}}_{kt}\prod \limits_{s=1}^{t-1}\left(1-\sum \limits_{k=1}^3{\hat{p}}_{ks}\right) \), is discounted by the 10-year yield at origination (y0) with the assumption that loan terminations at earlier stages of a loan would matter more to lenders when assessing loan termination risk. Thus, those time-specific discounted unconditional predicted probabilities over a 5-year window are summed up to arrive at a cumulative predicted probability of each event (\( {\hat{P}}_k \)).

In order to investigate whether lenders have incorporated each risk into mortgage pricing, the forecasted 5-year cumulative probability of each event (\( {\hat{P}}_k \)) is included in a loan contract rate determination model as in Equation (7):

$$ {C}_0={\alpha}_0{y}_0+{\beta}_1{\hat{P}}_1+{\beta}_2{\hat{P}}_2+{\beta}_3{\hat{P}}_3+\gamma \hbox{'}\mathbf{z}+\varepsilon $$
(7)

where C0 is the contract rate of a loan, y0 is the 10-year treasury yield (the benchmark mortgage interest rate) at the time of loan origination, and z represents a full set of covariates at loan origination depicting the traits of the loan, the borrower, the collateral, and the neighborhood.Footnote 21

Data and Specifications

Data

A sample of 30-year first-lien fixed-rate residential mortgage loans for home purchases is employed for empirical analysis. All of the mortgage loans in the sample were serviced by GMAC Residential Capital Company, LLC (GMAC ResCap) which was a finance company that specialized in servicing nonconforming conventional mortgage loans and issuing non-agency mortgage-backed securities (MBSs) before the recent financial crisis. Those loans were originated by different underwriters, sold in the secondary mortgage market, and packaged into MBSs. For each loan in the records, in addition to detailed information on the loan, the borrower, and underlying property at the time of loan origination, GMAC ResCap also reported its delinquency and prepayment status on a monthly basis, allowing the performance of loans to be tracked over time. Similar to prior studies, default is defined as the occurrence of a borrower being 90-days delinquent which eventually leads to a foreclosure or foreclosure alternative (e.g., deed in lieu of foreclosure or a short sale, etc.).

In addition to financial incentives/constraints factors that are believed to affect the probability of a borrower prepaying a mortgage loan based on theories and prior studies (e.g., interest rate change, a borrower’s FICO score and income, and so forth), some prior studies also found that a borrower’s personal characteristics, especially a borrower’s race and ethnicity group, have an impact on loan prepayment rate (Kelly 1995; Clapp et al. 2001; Deng and Gabriel 2006; Firestone et al. 2007; Kau et al. 2018).Footnote 22 To incorporate these factors, this study matches the GMAC ResCap loan data with the Home Mortgage Disclosure Act Loan/Application Register (HMDA-LAR) data to identify the race and ethnicity of the borrower of each loan in the sample.Footnote 23 One challenge to match these two data sets is the location of the property securing each loan is defined differently in these two data sets. Specifically, GMAC ResCap loan data records the location of each loan by its zip code, while loan applications in the HMDA-LAR data are reported based on the census tract. In addition, the definition of a census tract in HMDA-LAR data varies across time.Footnote 24 To solve this issue, the GMAC ResCap loan data were first matched with a data set of property transactions from Miami-Dade County, FL, which is readily available. This first-step match identifies the geographic location of the property securing each loan, and thus its 1980, 1990, and 2000 census tract identifiers. Specifically, the GMAC ResCap loan data were first restricted to 30-year fixed-rate home purchase mortgage loans from Miami-Dade, and the matching was conducted based on a series of property transaction characteristics, including property sale price (value), sale month (loan origination month), property type, and zip code.Footnote 25 The resulting loan-property-sale matched sample, with each loan’s location by its census tract identified, was then matched to HMDA-LAR data on the basis of the loan amount (in $1000), loan origination year, property type, property occupancy status, loan purpose, lien status (if applicable), and census tract.Footnote 26 Only loans with a unique HMDA-LAR match were included in the final sample for empirical analysis.Footnote 27

To obtain the neighborhood characteristics of a loan, the matched loan-property-HMDA data were further augmented with the census survey data based on the 1990 census tract boundaries of a loan’s underlying property. The 1990, 2000, and 2010 census survey data, normalized to the 1990 census tract boundaries, were employed to measure the traits of a loan’s neighborhood including housing occupancy rate, poverty rate, and so forth. In order to generate time-varying neighborhood trait variables and also to track the change of a neighborhood over the life of a loan, the decennial census survey data from 1990 to 2000 and from 2000 to 2010 were linearly time-trended on a monthly basis.Footnote 28 Additionally, the HMDA data are aggregated annually to the census tract level to proxy a borrower’s income.Footnote 29 Specifically, for each census tract and each year, all loan applications (approved, denied, or withdrawn) for 1-to-4 family dwelling purchases from all lenders were pooled together to calculate the median applicant income based on a three-year window (the prior year, the current year, and the next year). To generate monthly HMDA median applicant income measures, the resulting yearly HMDA measures were time-trended.

The whole pool of property transactions from Miami-Dade County, FL were utilized to generate measures that describe the characteristics and the evolution of each local housing market.Footnote 30 An inflation-adjusted median house price index was created for each 1990 census tract and each month of analysis. This monthly tract-level house price index was employed to track the value change of the underlying property over a loan’s life.Footnote 31 Specifically, for each 1990 census tract and each month, all arm’s length transactions of residential properties within a three-year window of that month (18 months before and 18 months after that month), were pooled together to find the median house price. Similarly, in order to measure the time-varying housing heterogeneity in a neighborhood, for each census tract and each month, the standard deviation of house sale prices based on the whole pool of transactions in that tract was calculated using a three-year window around that month.

After matching the loan-property-HMDA data with neighborhood data, there are 1404 observations of 30-year first-lien fixed-rate fully-amortizing nonconforming mortgage loans for home purchases with completed loan performance records and no missing values on the characteristics of the loan, the borrower, the underlying property, and the neighborhood. The sample is restricted to loans originated from Jan. 1997 to Dec. 2006.Footnote 32 Their loan performance is observed from Jan. 2000 to Oct. 2010, a period covering the recent financial crisis.

Specifications

As default and different forms of prepayment may be induced by different financial incentives and encumbered by various financial constraints, the multinomial logit model as represented by Equation (1) is estimated through three separate logit models, one for each event, with different baseline hazard rates (αjt) and different specifications.Footnote 33 In the default hazard model, a scaled Standard Default Assumption schedule (SDA) is used as the baseline hazard rates αjt, while in prepayments hazard models, mortgage year (age) fixed effects are adopted for αjt which allow for more flexibility in baseline hazard rates.

Indicated by the option-based model of “financial” mortgage terminations (Kau et al. 1992), the two most prominent factors determining whether the default or prepayment option is “in-the-money,” and thus a borrower’s tendency to terminate a loan, are the change in the value of the collateral and market interest rate change. As discussed above, the 10-year treasury constant maturity yield (y) is used as the benchmark market interest rate for 30-year fixed-rate residential mortgage loans. Therefore, the gap between the 10-year yield at the time of loan origination and the 10-year yield at time t lagged by 2 periods (y0 − yt − 2) is utilized to measure the market interest rate change at time t.Footnote 34 The value change of the collateral is measured by the change of the generated monthly census-tract-level median house price index. Specifically, the ratio of the median house price index at time t to the index at the time of loan origination (time 0) is calculated and employed to measure how the value of the collateral at time t changes since loan origination. This ratio is named as the relative house price at time t (RHPt). In order to account for any correlation between interest rate change and house price change at time t, an interaction term of market interest rate change (y0 − yt − 2) and relative house price (RHPt) is also included. All of the three market characteristics variables (interest rate change, relative house price, and the interaction term) are included in the default and prepayment hazard models.

A set of covariates depicting the characteristics of the loan, the borrower, the underlying property, and the neighborhood are also included in this competing-risks hazard model. Variables describing the characteristics of the loan include the contract rate spread at origination (C0 − y0),Footnote 35 the original loan-to-value (LTV) ratio, the original loan amount, a borrower’s credit score (FICO), a dichotomous variable indicating whether a borrower provided full income documentation at loan closing, another dichotomous variable indicating whether a loan is encumbered by a prepayment penalty at time t, and loan origination season fixed effects. Property characteristics variables include property type fixed effects and property owner occupancy status fixed effects. A set of time-varying neighborhood traits variables are also incorporated in the hazard model including tract-level house price heterogeneity, housing occupancy rate, poverty rate, and relative median applicant income at time t.Footnote 36 In addition, a borrower’s race and ethnicity are also included given the findings by prior studies that borrowers of different racial or/and ethnical groups may have different loan default/prepayment patterns.Footnote 37

Following prior studies, considering possible nonlinear relationships between a covariate and loan termination hazard, a few covariates take a nonlinear form. The original LTV ratio (in percentage) is transformed into categorical variables with the following cutoffs: 80%, 90%, and 100%. A quadratic function form of the original loan amount is employed. With the assumption that once a borrower’s FICO score is above a threshold (e.g., 700), an increase in FICO score might have a(n) less/insignificant marginal impact on loan termination rate, a continuous linear spline function form of the FICO score with a knot point at 700 is adopted.Footnote 38 In addition, please note the dichotomous variable indicating whether a loan is encumbered by a prepayment penalty at time t is excluded in the default hazard model, as this variable is not believed to directly affect loan default probability.Footnote 39

Turning our attention to the contract rate determination model represented by Equation (7), in addition to the three predicted cumulative loan termination probabilities, the current study has included the 10-year treasury yield at origination (y0) as well as a full set of covariates indicated by theories to affect loan pricing. They are the original LTV ratio, a borrower’s FICO score, the original loan amount,Footnote 40 a borrower’s income documentation status fixed effects, prepayment penalty fixed effects,Footnote 41 loan origination season fixed effects, property type fixed effects, property owner occupancy status fixed effects, a borrower’s race and ethnicity group fixed effects, and several variables depicting neighborhood traits at the time of loan origination, including tract-level past housing price appreciation rate,Footnote 42 heterogeneity in housing price, housing occupancy rate, poverty rate, and relative median applicant income. Additionally, a calendar time trend variable is also included.Footnote 43

Endogeneity Issue

The issue of endogeneity arises in both the discrete-time competing-risks loan hazard model (Equation (1)) and the contract rate determination model (Equation (7)). In the competing-risks loan hazard model, one covariate – the contract rate spread (C0 − y0) might be endogenous, as this variable might be correlated with the error term in Equation (1). The reason for this concern is in empirical mortgage loan performance analysis, it is normally very challenging to measure and thus incorporate every loan termination risk affecting factor. If a loan termination risk affecting factor is not observed in the data, but is observed by lenders/underwriters, it would be captured in the error term in Equation (1), while lenders/underwriters would take it into consideration when setting up the contract rate. In this scenario, the contract rate spread would be correlated with the error term. Given the non-linear nature of this competing-risks loan hazard model, following Petrin and Train (2003), a standard control function (CF) method is employed to address this potential endogeneity issue. Specifically, a linear reduced-form contract rate model is estimated first in which the contract rate of a loan is regressed against all exogenous variables in the system. The residual from this reduced-form contract rate model is obtained and then included in the competing-risks loan hazard model together with other covariates.

The final contract rate determination model (Equation (7)) also suffers from endogeneity issue. Recall that the contract rate spread is included in the competing-risks loan hazard model (Equation (1)), and the three predicted cumulative loan termination probabilities (\( {\hat{P}}_k \)) are generated based on the estimation results of this competing-risks loan hazard model. Hence, each of the three predicted cumulative termination probabilities is a function of the contract rate (spread), while the dependent variable in Equation (7) is the contract rate of a loan. Given that this contract rate determination model is a linear one, a set of generated IVs (\( {\overset{\sim }{P}}_k \)) are calculated and included in the estimation of the final contract rate model in place of the three original predicted probabilities (generated variables, \( {\hat{P}}_k \)). To generate each IV (\( {\overset{\sim }{P}}_k \)), the same procedures to calculate its corresponding generated variable (\( {\hat{P}}_k \)) is utilized, but replacing the actual contract rate spread (C0 − y0) with a predicted rate spread (\( {\hat{C}}_0-{y}_0 \)). This predicted contract rate spread is simply the difference between a predicted contract rate and the 10-year yield at the time of loan origination. This predicted contract rate is obtained from the estimation of the linear reduced-form contract rate equation used in the CF method. As this predicted contract rate (\( {\hat{C}}_0 \)) is a linear function of all exogenous variables in the system, the three generated IVs (\( {\overset{\sim }{P}}_k \)) would also be exogenous, with each as a valid IV for its corresponding generated variable (\( {\hat{C}}_k \)). Using those generated IVs, 2SLS is employed to estimate the final contract rate model represented by Equation (7). Additionally, given that the three predicted probabilities (\( {\hat{P}}_k \)) are generated variables, in order to correct the standard errors of the covariates estimates in this contract rate model, the method outlined in Appendix 6A of Wooldridge (2010) is employed.

Results

Statistical Descriptions

A brief description of the traits of the loans in the pooled sample at the time of loan origination is presented in Table 1, including the characteristics of the loan, the borrower, the underlying property, and the neighborhood.Footnote 44 Roughly 9% of loans in the sample were defaulted upon, 56% of the loans were prepaid when market interest rates dropped (pecuniarily prepaid), and 24% of the loans were prepaid when the rates had risen (non-pecuniarily prepaid). As all of loans in this sample are nonconforming loans, their high-risk nature is reflected by the statistics of the following variables. The average contract rate is high (around 8% annually), approximately 300 basis points higher than the prevailing 10-year treasury constant maturity yield. The original LTV ratio is 85% on average, and the mean FICO score of the borrowers in the sample is about 700. Less than half of the borrowers (48%) were able to provide full income documentation while applying for a loan, and a great portion of the borrowers (33%) were encumbered by a prepayment penalty for the first few years. For most of the variables describing the neighborhood characteristics, both the standard deviation and the range are large relative to the mean value, indicating a substantial level of spatial variation in the performance of the local housing markets and in the socioeconomic status of the local residents/homeowners. In this pooled sample, approximately 7% of loans were extended to African American borrowers, and 22% of them were originated to non-Hispanic white borrowers. The majority of the borrowers (71%) were Hispanics.Footnote 45

Table 1 Descriptive Statistics of Mortgage Loans at Loan Origination

Table 2 provides descriptive statistics of the pooled sample by the eventual termination event by Oct., 2010 (censored, defaulted, pecuniarily prepaid, non-pecuniarily prepaid). Among the four event groups, the average contract rate at origination (C0) is the highest for loans that were pecuniarily prepaid. This finding is somewhat puzzling, however comparison on the average benchmark mortgage interest rate (10-year yield at origination, y0) across these four event groups reveals that it is because those loans that were eventually pecuniarily prepaid were originated in a period with relatively higher interest rates. The descriptive statistics seem to imply that loans that were pecuniarily prepaid differ from those that were non-pecuniarily prepaid, indicating the importance of making a distinction between these two types of prepayment. Loans in these two types of prepayment groups appear to be different from each other indicated by the following variables, including the average original LTV ratio (83% vs. 89%), the average FICO score at origination (700 vs. 691), the proportion of the borrowers with full income documentation (54% vs. 44%), the proportion of the borrowers without prepayment penalties (75% vs. 52%), the mean tract-level poverty rate (13% vs. 16%), and the average tract-level relative median applicant income (1.32 vs. 1.05).

Table 2 Descriptive Statistics of Mortgage Loans at Loan Origination by Event

What Impacts Pecuniary Vs. Non-pecuniary Prepayments?

The prepayment hazard model estimates are presented in Table 3. Model (1) is for a prepayment hazard model in which pecuniary and non-pecuniary prepayments are combined; while model (2) and (3) are for pecuniary and non-pecuniary prepayment hazards, respectively. Comparisons on the estimation results between this combined prepayment model and the separated ones underscore the importance of separating these two types of prepayment from each other. In the combined prepayment model, the estimated coefficient of the market interest rate change variable (y0 − yt − 2) is not significantly different from zero. However, in the separated ones, this coefficient is positive and significant in the pecuniary prepayment model, while it is negative but insignificant in the non-pecuniary prepayment model. This finding is consistent with the anticipation that observing a declining market interest rate, a borrower would have more financial incentives to refinance the mortgage loan to lower the overall borrowing costs. Although the value change of the collateral (relative house price at time t, RHPt) is found to have a significant and positive impact in each of the three models, the estimated coefficient in the pecuniary/non-pecuniary prepayment hazard model is slightly smaller than that in the combined model. Overall, this finding indicates that regardless of the interest rate environment (rising or declining), as the value of the collateral increases, a borrower tends to be more likely to prepay a loan.

Table 3 Prepayment Hazard Model Estimates

Some additional differences between the models appear when comparing the estimation results of covariates describing the traits of the loan, the property, and the neighborhood. Although the estimated coefficient of the contract rate spread variable is positive and significant in the combined and non-pecuniary models, the coefficient in the combined model is only approximately one-third of that in the non-pecuniary prepayment hazard model. The contract rate spread variable captures the impacts of points and hence a borrower’s expectation on how long the property would be held on non-pecuniary prepayment probability. Borrowers that plan to move soon in the near future would rationally self-select less points and thus a higher contract rate at origination to lower the effective borrowing cost; while borrowers expecting to stay longer would choose the opposite combination. Thus, a positive association between contact rate spread and non-pecuniary prepayment probability would be expected, with which the empirical results here are consistent.

The original loan amount is normally used to capture the scale effects of loan size (Clapp et al. 2001). Given the assumption that loan prepayment transaction costs tend to be fixed, a larger mortgage loan would provide a larger dollar incentive to prepay, keeping other variables the same. The results indicate that the original loan size appears to have a positive impact on non-pecuniary prepayment probability but at a decreasing rate, while its impact on pecuniary prepayment probability, and in the combined model, is not significant.

A borrower providing full income documentation upon loan application tends to be less likely to prepay when interest rates rise than an equivalent borrower with limited/no income documentation. The income documentation status variable may capture the impact of job stability on a person’s residential mobility and hence non-pecuniary prepayment rate. The results also indicate that prepayment penalties deter prepayment, a result that holds across all three models.

Results on the two property traits variables indicate that a borrower purchasing a house as the primary residence appears to be less likely to prepay in an environment of rising interest rates than a similar borrower buying the house as an investment. The results also indicate that a borrower living in a condo tends to be less likely to pecuniarily prepay than a borrower living at a single-family house. One possible explanation for this finding is the financial benefits from pecuniary prepayments (lower monthly payments) might be less for a loan used to purchase a condo given its smaller original loan amount and a shorter expected holding period on average.Footnote 46 Please note here the property owner occupancy status variable does not have a significant impact on pecuniary prepayment probability, and the property type variable does not significantly affect non-pecuniary prepayment rate.

The estimation results on neighborhood characteristics variables indicate that pecuniary and non-pecuniary prepayments tend to occur in neighborhoods with different traits. These traits are shown to lack significance in the combined model. The probability of non-pecuniary prepayment tends to be higher in a census tract with a higher level of house price heterogeneity and a lower level of median applicant income. By contrast, borrowers living in a more affluent neighborhood appear to be more likely to prepay when interest rates drop, ceteris paribus. In addition, results of a borrower’s race and ethnicity group indicate that minority borrowers are less likely to take advantage of a lower interest rate through prepayment than equivalent non-Hispanic white borrowers. However, there is no evidence of significant differences in non-pecuniary prepayment rates across racial and ethnic groups.Footnote 47

The comparisons indicate that separating pecuniary from non-pecuniary prepayments allows one to identify variations in the impacts of the covariates on each termination decision. The separation by prepayment types also helps to avoid the pitfalls of the combined prepayment model which fails to capture the significance of some coefficients.

Given the substitute nature of default and prepayment, the default hazard model is also estimated within the framework of a competing-risks loan hazard model. Table 4 reports the default hazard estimation results. Overall, the results are very consistent with both theories and prior studies. Default is mainly induced by a drop in the value of the collateral, as the variable of relative house price at time t (RHPt) is shown to have a significantly negative impact on default probability. The original loan amount variable impacts the default rate in a positive way, but at a decreasing rate. Borrowers with a relatively higher FICO score, with full income documentation upon loan application, with the underlying property as the primary residence, or borrowers living at a neighborhood of higher income tend to be less likely to default. However, the marginal impact of a higher FICO score on default rate becomes insignificant when the FICO score is above 700. In addition, a borrower’s race and ethnicity is not shown to significantly impact default probability.

Table 4 Default Hazard Model Estimates

Loan Prepayment Probabilities Simulation Results

To further illustrate how identified factors impact the overall pecuniary vs. non-pecuniary prepayment probabilities, this study simulated 5-year cumulative probability of each event (default, pecuniary prepayment, and non-pecuniary prepayment) by the factors that are shown to impact pecuniary and non-pecuniary prepayment rates differently.Footnote 48 The simulations are conducted in two interest rate change environments: declining interest rates and rising interest rates.Footnote 49 Specifically, the 5-year predicted cumulative probability of each event for a sample base loan is calculated first. Here, a base loan is assumed to be taken by a non-Hispanic white borrower with limited income documentation to purchase a single-family house for investment purposes. The values of the time-invariant covariates are set at the full sample median, while the values of the time-varying covariates are set at the sample median of each period. Each factor is then varied to see how the 5-year predicted cumulative probabilities vary.Footnote 50

Table 5 demonstrates the predicted 5-year cumulative probabilities of pecuniary and non-pecuniary prepayments by those covariates. In a declining interest rate environment, the probability of a baseline borrower pecuniarily prepaying a loan is 54%, and the predicted cumulative non-pecuniary prepayment probability by year 5 is around 17%. By contrast, in a rising interest rate environment, for the base loan, the predicted cumulative probabilities of pecuniary and non-pecuniary prepayments by year 5 are approximately 5% and 56% respectively.

Table 5 Predicted 5-year Cumulative Prepayment Probabilities a

As would be expected, the 5-year cumulative probability of non-pecuniary prepayment rises substantially with the contract rate spread at origination in both an environment of declining interest rates and that of rising interest rates, and the increase is more substantial in a rising interest rate environment. Specifically, in a rising interest rate environment, the simulated non-pecuniary prepayment probability with a 75% percentile contract rate spread is about 32 percentage points higher than that with a 25% percentile contract rate spread; while in a declining interest rate environment, this gap is only around 16 percentage points. The original loan amount has a similar impact on the 5-year cumulative non-pecuniary prepayment probability. In a rising interest rate environment, the simulated non-pecuniary prepayment probability rises from 48% to 64% as the original loan amount increases from the 25% percentile to the 75% percentile. In a declining interest rate environment, that increase is only 8 percentage points (14% to 22%). By contrast, the impacts of the original contract rate spread and loan amount on the 5-year cumulative pecuniary prepayment probability are relatively much smaller, especially in a rising interest rate environment.Footnote 51

A borrower’s income documentation status (0,1), the underlying property owner occupancy status (0,1), property type (0,1), and a borrower’s race and ethnicity group are all shown to affect the predicted cumulative probabilities of pecuniary or/and non-pecuniary prepayments. Recall that the predicted 5-year cumulative probabilities of pecuniary and non-pecuniary prepayments of a base loan are 54% and 17% respectively in a declining interest rate environment, and are 5% and 56% in a rising interest rate environment. The base loan is assumed to be taken by a non-Hispanic white borrower with limited income documentation to purchase a single-family house for investment purposes. Keeping others the same, the 5-year cumulative probability of non-pecuniary prepayment among borrowers with full income documentation drops from 17% to 12% in a declining interest rate environment, and from 56% to 45% in a rising interest rate environment relative to borrowers with limited income documentation. The underlying property owner occupancy status is shown to have a very similar impact on the simulated 5-year non-pecuniary prepayment probability as the factor of the income documentation status. Turning our attention to the impact of the underlying property type on the predicted cumulative pecuniary prepayment probability, this simulated probability among loans with the underlying property as a condo drops from 54% to 47% and from 5% to 4% in a declining/rising interest rate environment respectively, in contrast to that among loans with the underlying property of a single-family house. Additionally, the results in Table 5 clearly demonstrate the lower pecuniary prepayment probability by either Hispanics or African Americans relative to their non-Hispanic white counterparts. For example, in a declining interest rate environment, the cumulative pecuniary prepayment probability is 54% for non-Hispanic whites, 50% for Hispanics, and 47% for African Americans.Footnote 52

A higher level of housing price heterogeneity is shown to elevate non-pecuniary prepayment likelihood. Five years after loan origination, the non-pecuniary prepayment likelihood with a 75% percentile housing price heterogeneity is about 7.5(14) percentage points higher than that with a 25% percentile housing price heterogeneity in a declining(rising) interest rate environment. The tract-level relative median applicant income serves to elevate pecuniary prepayment likelihood and to damp non-pecuniary prepayment propensity. For example, in a declining interest rate environment, as the relative median applicant income increases from the 25% percentile to the 75% percentile, the simulated 5-year pecuniary prepayment probability increases from 50% to 60%, while the predicted 5-year non-pecuniary prepayment likelihood decreases from 19% to 13%. Similarly, in a rising interest rate environment, this gap is +2% for pecuniary prepayment probability and − 11% for non-pecuniary prepayment probability. Overall the results are consistent with the findings in Table 3, which underscores the importance of distinguishing pecuniary prepayments from non-pecuniary ones for mortgage loan performance analysis and loan pricing.

Implications for Loan Pricing

In order to further investigate the impacts of the three risks on the pricing of mortgage loans, the 5-year cumulative probability of each event (default, pecuniary prepayment, and non-pecuniary prepayment) is predicted for each loan in the sample and included in the loan contract rate determination model as in Equation (7).Footnote 53 The results in Table 6 show that all of the three risks have positive and significant impacts on the contract rate. Specifically, a 10-percentage point increase in default probability would lead to an increase in the original contract rate by 10 basis points. The increase in the contract rate due to a 10-percentage-point rise in pecuniary prepayment probability is 13 basis points, while that from a 10-percentage-point increase in non-pecuniary prepayment probability is 8 basis points (5 basis points smaller than that of the pecuniary one). This finding indicates that pecuniary and non-pecuniary prepayment risks do not appear to have the same impact on the contract rate, and lenders appear to charge a marginally higher risk premium for pecuniary prepayment risk than for a non-pecuniary one. Although it would generally be hypothesized that non-pecuniary prepayment would lower mortgage contract rates,Footnote 54 the results here indicate positive risk premiums for both prepayment types. The positive risk premium of non-pecuniary prepayment needs to be explained through the lens of the subprime market and non-agency MBS. Mayer et al. (2013) find that in a rising interest rate environment, one of the major reasons for a subprime borrower to non-pecuniarily prepay a mortgage loan is an improvement in the borrower’s creditworthiness or a positive wealth shock, and the propensity of non-pecuniary prepayment would be higher for risker borrowers with initial lower credit scores.Footnote 55 Thus, when a borrower non-pecuniarily prepays his mortgage loan with the same lender due to an increase in his creditworthiness (or in property equity), the contract rate on the new loan would probably be lower than that on the existing one. This occurrence would bring a negative impact on a lender’s net cash inflows. Accordingly, a rational lender who anticipated this type of risk upon loan origination would charge a risk premium for it.Footnote 56 In addition, as previously mentioned, the hedging strategy adopted by MBS holders depends on their estimates of the prepayment rates and thus the effective maturity and duration of the MBSs. As non-pecuniary prepayment is not driven by the call option but rather some “exogenous” reasons (e.g., relocation, improvement in borrowers’ creditworthiness, etc.) and seems more random, it is relatively more difficult to accurately predict non-pecuniary prepayment rate. Therefore, the risk of non-pecuniary prepayment may make it more challenging to effectively hedge the portfolio or may incur extra hedging costs to MBS investors, which may have been incorporated into MBS/mortgage pricing, especially given the fact that approximately 25% of the loans in the sample of this study were non-pecuniarily prepaid. Based on those considerations, non-pecuniary prepayment risk may raise the contract rate of a loan. Overall, the empirical results indicate that the potential positive impact of non-pecuniary prepayment risk on loan pricing outweighs its negative effect based on this sample of the subprime loans.

Table 6 Loan Contract Rate Estimates - 2SLS a

The estimation results on all other covariates in this contract rate model are consistent with expectations. A borrower would face a higher contract rate with a higher original LTV ratio, a lower FICO score, without full income documentation, with a prepayment penalty, and with the underlying property as an investment. As the original loan amount rises, the contract rate decreases at a decreasing rate. Also, the results show that lenders rely on information on the local recent housing price appreciation rate to set up the contract rate. In addition, the results show that African American borrowers in this study pay a significantly higher contract rate than their non-Hispanic white counterparts.

Conclusion

This study explores mortgage pricing behavior and prepayment when a distinction is made between pecuniary and non-pecuniary prepayments. These two types of prepayments, which occur in different interest rate environments (falling & rising), are separated from each other, as they may result in different effects on the future reinvestment rate of return of a mortgage lender/investor. Within the analysis framework of a competing-risks hazard model, the factors driving pecuniary and non-pecuniary prepayment rates are investigated.

Based on a sample of 30-year first-lien fixed-rate nonconforming mortgage loans for home purchase from Miami-Dade County, FL, the results show that different factors drive these two prepayment decisions, and certain variables describing the characteristics of the loan, the borrower, the collateral, and the neighborhood have different impacts on pecuniary vs. non-pecuniary prepayment likelihoods. Overall, both pecuniary and non-pecuniary prepayments are induced by house price appreciation and deterred by prepayment penalties. However, pecuniary prepayment is mainly driven by the change (decline) of market interest rates, and affected by a borrower’s income and race group, as well as the type of the underlying property. By contrast, non-pecuniary prepayment propensity is more affected by the traits of the loan (original contract rate spread and original loan amount), the characteristics of the borrower (income documentation status and income), the features of the underlying property (owner occupancy status), as well as the local housing market conditions (house price heterogeneity). A comparison of the estimation results between the hazard models where pecuniary and non-pecuniary prepayments are separated, and the combined model, demonstrates that the separated prepayment hazard models would be preferred, as the combined model does not reflect both interest rate environments. This finding is also supported by the simulation results.

Those findings have important implications for mortgage pricing as the market value of a mortgage loan is affected by various risks. By including the three predicted cumulative termination probabilities (default, pecuniary prepayment, and non-pecuniary prepayment) into a contract rate determination model, this study finds that all of the three termination risks impact the contract rate of a loan positively. This finding underscores the importance of taking into account the three termination risks separately, but simultaneously in mortgage pricing.