1 Introduction

This paper studies crowding-out across savings, home ownership, and retirement investment portfolio choices in the Chilean defined contribution (DC) retirement system.Footnote 1,Footnote 2 The literature has extensively demonstrated the benefits of default rules, showing that tehy increase participation in retirement savings plans (e.g., Madrian and Shea (2001), Thaler and Benartzi (2004), Gelber (2011), Chetty et al. (2014) and Thaler (2016)).

However, it is not clear whether policies for increasing retirement savings increase total savings or just savings within retirement systems (Chetty et al. 2014).Footnote 3 It is expected that individuals behave differently regarding different savings choices. For example, retirement wealth is not liquid, and the rate of returns over retirement savings are different than that over other savings (Attanasio and Brugiavini 2003). The same can occur with other investments. Individuals may be willing to quit on retirement savings to focus on down or mortgage payments. Previous evidence shows that each dollar of pension wealth is associated with a 37 to 50 cent decline in nonpension wealth, although most of the effect is concentrated in the upper tail of the distribution (Blau 2016; Engelhardt and Kumar 2011). Evidence shows that crowd-out results are very sensitive to the empirical specifications (Blau 2016).

One of the difficulties in studying crowding-out is that preferences are unobserved (Beshears and Choi 2012). Workers may have different unobserved preferences for savings, which might be correlated with other choices (Gelber 2011).Footnote 4 For example, an individual could select into the labor market and commit to contributing to her retirement account while holding voluntary savings because of high unobserved tastes for savings. Many papers in the literature use reduced-form approaches that do not consider the non-linearity of the individual’s maximization problem or made oversimplified assumptions (Blau 2016; Card and Ransom 2011; Engelhardt and Kumar 2007).

There are other limitations that have affected the generalization of results. Many studies rely on policy reforms for identification where counterfactuals for alternative reforms are not observed (Blau 2016). Others, consider non-representative experiments (e.g., Madrian and Shea (2001), Choi et al. (2004), Thaler and Benartzi (2004) and Carroll et al. (2009)). Many articles consider only the immediate impact of the reforms. Finally, papers that use survey data typically have selection bias and measurement error (Engelhardt and Kumar 2007). Some exceptions are Chetty et al. (2014), who use a 41 million observation administrative dataset from the Danish retirement system, and Engelhardt and Kumar (2007), who use survey data merged with administrative records while controlling for selection using non-standard econometric approaches.

To consider these issues, I jointly estimate a multiple equation, reduced-form dynamic model. The equations capture simultaneous decisions such as labor market participation, formality and contribution status, default adoption in investment portfolios, participation in voluntary savings within and outside the retirement system, and housing asset choices. All equations are correlated through a permanent and a time-varying individual-level unobserved heterogeneity component. Unobserved heterogeneity captures differences in preferences, risk tolerance, tastes, etc. The distribution of unobserved characteristics is jointly estimated with the coefficients of the model using semi-parametric full information likelihood methods.Footnote 5 The approach allows to address several sources of estimation bias: selection, endogeneity, and measurement error.Footnote 6

Unbiased estimates arise from many corrections. First, because non-random selection into behaviors is jointly modeled. This correction is achieved by considering several choices, all forming the estimated set of equations. Second, because unobserved heterogeneity that is correlated across decisions and with endogenous determinants of choices is flexibly modeled. Thus, estimated reduced-form parameters are not a function of biases coming from the omission of relevant variables. Third, the estimated parameters are cleaned of classical measurement error bias. This source of bias is accounted by modeling the error term of variables that might be reported with error (e.g., portfolio adoption, savings decisions). Measurement error bias is also reduced by using administrative records on accumulated wealth.Footnote 7 The estimation also allows me to incorporate the non-linearities that come from the individual’s decision-making process without making any assumptions about preferences and expectation processes.

Since I model the decision-making process over time, I analyze crowding-out of simulated policies rather than depending on observed policy reforms. I use the estimates of the model to simulate contemporaneous (short-run) and life-cycle (long-run) effects of increasing participation in voluntary retirement accounts, increasing housing assets, and extending compulsory participation on retirement programs to self-employed workers.

The paper is built upon Parada-Contzen (2019). Specific differences are: (1) I disaggregate savings based on their liquidity. (2) I study housing choices to incorporate other sources of wealth. (3) An important focus is put on the employment, formality, and contribution status so that policy experiments concerning them can be simulated. (4) I consider active investment decisions, meaning that I classify individuals according to their decision to be on the default or to opt-out. (5) I do not consider changes in financial risks faced by individuals, thus, I am able to estimate a simpler model.

I use the first four waves (2002–2009) of the Chilean Survey of Social Protection (EPS) merged with administrative records of the Chilean Bureau of Pensions. The EPS is a validated survey used for retirement policies (Behrman et al. 2011; Joubert 2015). Using this dataset has several advantages: (1) the Chilean model has served as a prototype for the implementation of DC systems in several countries.Footnote 8 (2) In the Chilean model, there are two levels of default. On the one hand, there are no participation conditions once the individual has selected into the labor market. On the other hand, once enrolled, workers can choose their own investment portfolio or follow a default scheme provided by the system (opt-out default). (3) It includes rich individual data and administrative records on retirement wealth. Kristjanpoller and Olson (2015) use a cross-section of the EPS to study default adoption. Differently, I consider a dynamic setting and focus on a larger number of choices.

The remainder of the paper is as follows. The empirical model and estimation strategy are presented in Section 2. Details about the data are presented in Section 3. Results are presented in Section 5, and finally, Section 6 concludes.

2 Empirical model

2.1 Institutional background

In the Chilean system, formal workers are required to contribute 10% of their pre-tax earnings. Any individual eligible to contribute to the system at least once, is enrolled. Contributions are automatically deducted from the worker’s paycheck and transferred to her mandatory retirement account. I refer to retirement wealth (Ait) to denote accumulated assets in the mandatory contribution account. Informal employees, self-employed workers and the non-employed may voluntary contribute or not contribute at all. Contribution choices are denoted by cit. Any individual may hold voluntary savings within the retirement system regardless of her employment and contribution status. Participation choices in retirement voluntary programs are denoted by \({s}_{it}^{r}\).

Enrollees may select one or two voluntary retirement accounts out of five available accounts to invest retirement savings. These accounts are offered by the system, and vary in their level of financial risk. The riskiest fund is Account A, which invests 40–80% in equities, whereas the range for Account B is 25–60%, that of Account C is 15–40%, that of Account D is 5–20%, and that of Account E is less than 5%.

If an individual does not explicitly choose an account, she is defaulted into a predetermined investing scheme according to age and gender. In the default, individuals under the age of 35 are assigned to Account B; women between the ages of 35 and 50 years old are assigned to Account C, and those aged 50+ years old are assigned to Account D. Men between the ages of 35 and 55 are assigned to Account C, whereas those aged 55+ years are assigned to Account D.Footnote 9 In the model, portfolio default status is denoted by dit. Note that an individual can choose her default accounts, which means she is not defaulted into that account but she still is observed to be in the default.Footnote 10

2.2 Timing and notation

Individual i begins period t with an information set Ωt. The individual observes a wage offer \({w}_{it}^{* }\) (unobserved for the econometrician) drawn from the wage distribution and jointly decides her employment status (eit), contribution status (cit) and portfolio status (dit). Additionally, the individual decides her optional savings status within (\({s}_{it}^{r}\)) and outside the retirement system (\({s}_{it}^{o}\)), and housing category (hit): an owner with no loan or an owner but paying, renting, or leasing from a relative. These decisions are the endogenous choice variables that are jointly estimated.

Informality is defined through pension coverage (Joubert 2015). A worker can be formal (covered by the pension system) or informal (uncovered by the pension system). Formal workers are dependent employees who sign a contract, while informal workers might be informal employees (salaried workers with no contract) or self-employed (at the survey was run, self-employed workers were not required to contribute).

Because these decisions are jointly made, they are all a function of the same observable characteristics (Ωit). The information set contain stock variables up to period t − 1 and lagged decisions. Lagged decisions include previous contribution status (ci,t−1), default path adoption (di,t−1), housing status (hi,t−1), and saving choices (\({s}_{i,t-1}^{r}\) and \({s}_{i,t-1}^{o}\)). Stock variables include accumulated assets (Ai,t−1), accumulated work experience (Ei,t−1), marriage history (Mi,t−1) and history of children (Ni,t−1). These variables are endogenous control variables, also referred to as predetermined. Endogenous choice variables are different from the endogenous control variables. Endogenous choices are behaviors chosen at period t, while endogenous control variables are lagged decisions and stock variables that are not decisions itself but are affected by lagged choices (e.g, work experience is affected by previous employment choices).

The information set also includes a vector of exogenous individual characteristics (Xit, such as age, gender, and education) and a vector of exogenous market-level characteristics (Zit), such as unemployment rates, interests rates, and prices from related markets, among others. \(\widetilde{{\Omega }_{it}}\) is the set of endogenous control variables in the information set that also behave as explanatory variables, and Ωit refers to \(\{\widetilde{{\Omega }_{it}},{X}_{it},{Z}_{it}\}\).

After the decisions are made, the econometrician observes the wage draw wit if the individual is working. The individual observes the realization of two endogenous characteristics: her marital status (mit) and number of children (nit), which are a function of choices in period t, \({\tilde{\Omega }}_{it}\), Xit, and supply-side market-level characteristics (\({Z}_{t}^{M}\) for the marriage market and \({Z}_{t}^{N}\) for the children market).

The individual observes the realization of financial returns and updates her information set. The individual moves to the next period with information set Ωi,t + 1. The timing of the model is presented in Fig. 1 in Appendix A.

Endogenous variables in t are predetermined in t + 1, serving as instruments. There are other sources of identification (see Section 4.4). Market-level characteristics capture relevant factors that might affect decisions. For example, deposits per capita at a regional level. An area with fewer deposits might pay higher interest rates. A complete model should include both, individual and market characteristics to describe the problem appropriately. Market-level characteristics serve as the typical exclusion restrictions.

2.3 Estimable model

2.3.1 Wages

The wage equation follows the specification proposed by Mincer (1974) and includes demographic characteristics (Xit), endogenous predetermined variables, such as work experience (Eit), and job characteristics. Productivity may also vary among individuals with the same human capital characteristics.Footnote 11 Since productivity is unobserved, I assume that work productivity also depends on family characteristics, such as the number of children and marriage duration. I also include demand-side factors (\({Z}_{it}^{E}\)) at a regional level. Finally, I include permanent (\({\mu }_{i}^{1}\)) and time-varying unobserved wage determinants (\({\nu }_{it}^{1}\)).

$$ln\ {w}_{it}=w({e}_{it},{\tilde{\Omega }}_{it},{X}_{it},{Z}_{it}^{E},{\mu }_{i}^{1},{\nu }_{it}^{1})$$
(1)

2.3.2 Endogenous choice variables

Recall that upon observing a wage offer \({w}_{it}^{* }\), individuals make choices. These wage offers are unobserved to the econometrician: she observes wages if the individual decides to be employed, wit. Thus, if there are individual unobserved characteristics that simultaneously affect selection into employment, contribution status and wages, there is a risk of potential selection bias. To account for this source of bias, I jointly estimate employment choices with contribution status and wages in period t by allowing for correlation in the error terms that affect each equation.

If unobserved characteristics are correlated across savings choices, default status, and home ownership, even after correcting for selection into employment, potential biases arise. It is expected that saving choices, portfolio choices, and home ownership are endogenous to earnings. Higher earnings may generate higher availability of resources allocated to savings and investments. Similarly, higher earning levels generate higher levels of retirement wealth, which might affect portfolio default adoption.

Consequently, all decisions that impact wealth accumulation are jointly modeled and estimated with wages. I refer to these variables by endogenous choice variables (a subset of the dependent variables of the model). Because behaviors are jointly chosen, they are all specified as a function of the variables in the information set Ωit. Behaviors are also specified as a function of equation-specific permanent and time-varying unobserved characteristics. The distributions of unobserved characteristics are jointly estimated with the parameters of the model. Demand equations representing choices are presented from eqs (2) to (7).

$$ln\left[\frac{P({e}_{it}=j)}{P({e}_{it}=0)}\right]={e}^{j}({\tilde{\Omega }}_{it},{X}_{it},{Z}_{it},{\mu }_{i}^{2},{\nu }_{it}^{2});\quad j=\{1,2\}$$
(2)
$$ln\left[\frac{P({c}_{it}=j)}{P({c}_{it}=0)}\right]={c}^{j}({\tilde{\Omega }}_{it},{X}_{it},{Z}_{it},{\mu }_{i}^{3},{\nu }_{it}^{3});\quad j=\{1,2\}$$
(3)
$$ln\left[\frac{P({d}_{it}=j)}{P({d}_{it}=0)}\right]={d}^{j}({\tilde{\Omega }}_{it},{X}_{it},{Z}_{it},{\mu }_{i}^{4},{\nu }_{it}^{4});\quad j=\{1,2\}$$
(4)
$$ln\left[\frac{P({s}_{it}^{r}=1)}{P({s}_{it}^{r}=0)}\right]={s}^{r}({\tilde{\Omega }}_{it},{X}_{it},{Z}_{it},{\mu }_{i}^{5},{\nu }_{it}^{5})$$
(5)
$$ln\left[\frac{P({s}_{it}^{o}=1)}{P({s}_{it}^{o}=0)}\right]={s}^{o}({\tilde{\Omega }}_{it},{X}_{it},{Z}_{it},{\mu }_{i}^{6},{\nu }_{it}^{6})$$
(6)
$$ln\left[\frac{P({h}_{it}=j)}{P({h}_{it}=0)}\right]={h}^{j}({\tilde{\Omega }}_{it},{X}_{it},{Z}_{it},{\mu }_{i}^{7},{\nu }_{it}^{7});\quad j=\{1,2,3\}$$
(7)

where \({\tilde{\Omega }}_{t}=\{{c}_{i,t-1},{d}_{i,t-1},{s}_{i,t-1}^{r},{s}_{i,t-1}^{o},{h}_{i,t-1},{E}_{i,t-1},{A}_{i,t-1},{M}_{i,t-1},{N}_{i,t-1}\}\). Vector Xit includes individual characteristics. Vector Zt includes exogenous market-level characteristics at a regional level from the labor market (such as the unemployment rate, total employment, local minimum wage), the credit market (share over national deposits and credits, deposits and credits per capita, and banks per capita), the marriage market (gender ratio, marriages per capita) and the family market (college tuition). The equation-specific components μi and νit represent non-linear unobserved heterogeneity that is correlated across equations (from now on, correlated unobserved heterogeneity).

To model this correlation, total unobserved heterogeneity is decomposed into three parts. First, there is an idiosyncratic shock that is independent and identically distributed and is assumed to be a type I Extreme Value distributed error (ϵit), giving the functional form to each choice equation. Second, there is a permanent component representing permanent unobserved heterogeneity (μi). Third, there is a time-varying component representing time-varying unobserved heterogeneity (νit).

Per-period alternatives are constructed based on the survey questions and are the following: eit = {0, 1, 2}, indicating formal workers, self-employed or informal workers, and non-employed workers, respectively; cit = {0, 1, 2}, indicating non-contributors, mandatory contributors, and voluntary contributors, respectively; dit = {0, 1, 2}, indicating defaulted into the system’s default scheme, opting into the default scheme, and opting out of the default, respectively; \({s}_{it}^{r}=\{0,1\}\), indicating no or some optional savings within the retirement system; \({s}_{it}^{o}=\{0,1\}\), indicating no or some other savings outside the retirement system; hit = {0, 1, 2, 3}, indicating a home owner with no loans and a home owner but paying, renting, or leasing from a relative, respectively.

Once choices are made, wealth accumulates according to exogenous market returns Rit, which is portfolio-specific, and to new contributions. Thus, wealth at the end of t is updated following Ait = Ai,t−1 × Rit(dit) + 0.1wit.

2.3.3 Endogenous individual characteristics

While I do not explicitly model changes in marital status and the number of children as choices, I endogenize them into the problem by allowing realizations in period t to depend on current choices (see Fig. 1 for the timing in Appendix A).Footnote 12 Therefore, the transitioning of family outcomes are not exogenous processes. Because it is expected that unobserved characteristic affect both choices and family outcomes (e.g., an individual may jointly decide her employment status and whether to have children in that period), I allow for correlation between family outcomes and the rest of the equations in the model through an equation-specific permanent and time-varying unobserved component.

The probability of being married (mit = 1) relative to not being married (mit = 0) is given in eq. (8) and depends on period t choices, such as employment status (eit), predetermined state variables (\({\tilde{\Omega }}_{it}\)), and exogenous individual characteristics (Xit). Although not modeled explicitly, I assume that there is a marriage market such that supply-side factors (\({Z}_{it}^{M}\)) also impact marriage probability. As before, there is a permanent (μi) and time-varying (νit) correlated error term and an idiosyncratic shock (εit) assumed to be type-I extreme value distributed.

$$ln\left[\frac{P({m}_{it}=1)}{P({m}_{it}=0)}\right]=m({e}_{it},{\tilde{\Omega }}_{it},{X}_{it},{Z}_{it}^{M},{\mu }_{i}^{8},{\nu }_{it}^{8})$$
(8)

Children in the household may increase or decrease over time (due to pregnancies, age of the child, changes in marital status, mortality, etc). These transitions depend on that period’s employment decisions (eit), predetermined endogenous choices (\({\tilde{\Omega }}_{it}\)), individual exogenous characteristics (Xit), supply-side factors specific to the child market (\({Z}_{it}^{N}\)), such as education prices, permanent (μi) and time-varying (νit) correlated unobserved heterogeneity, and a random, type-I extreme value distributed shock (\({\varepsilon }_{it}^{N}\)). The probability of increasing the number of children (nit = 1) and of decreasing the number of children (nit = −1) in period t relative to no change (nit = 0) is given by:

$$ln\left[\frac{P({n}_{it}=j)}{P({n}_{it}=0)}\right]={n}^{j}({e}_{it},{\tilde{\Omega }}_{it},{X}_{it},{Z}_{it}^{N},{\mu }_{i}^{9},{\nu }_{it}^{9});\quad j=\{-1,1\}$$
(9)

3 Data and research sample

I use four waves (2002, 2004, 2006, 2009) of the EPS merged with administrative records of the Chilean Bureau of Pensions. The EPS is administered by the Ministry of Labor and Social Security in Chile jointly with the University of Chile and the University of Michigan. Its design was based on the Health and Retirement Study. Its implementation was sponsored by the University of Pennsylvania.Footnote 13

3.1 Research sample

The estimation sample contains 7179 individuals observed 4 times. Because the objective is to model life-cycle wealth accumulation, individuals aged from 25 to 59 years old in 2002 are included.Footnote 14 Due to the dynamic nature of the model, only individuals observed in all four waves are considered. Individuals included have no missing information in work experience, optional savings, marital status, and region of residence.

No statistical differences between average age and gender shares for the whole sample and the reference sample (aged between 25 and 59 years in 2002) are detected. The only difference is in the education as the reference sample has considerably less missing information since most of those individuals attrited or contain missing information in other variables. The share of individuals with less than high school education is higher for the research sample than for the reference sample (53% and 41%, respectively).

Summary statistics are presented in Appendix B. Most individuals in the sample are formal workers. Informality represents around 20% of the sample. Voluntary contributors and participation in voluntary retirement savings accounts represent 4.5% and 5.6%, respectively. Individuals who participate in other voluntary savings programs represents around 14% of the sample. Most individuals are home owners, married, and with no changes in the number of children in the household.

4 Estimation

4.1 Error structure

Unobserved characteristics may be correlated across the k = {1, 2, . . . , 9} equations of the model. The total error (εit) in each equation is decomposed in three parts:

$${\varepsilon }_{it}^{k}={\mu }_{i}^{k}+{\nu }_{it}^{k}+{\epsilon }_{it}^{k}$$
(10)

μi and νit represent permanent and time-variant unobserved individual characteristics. ϵit is an idiosyncratic independent and identically (iid) distributed shock that is assumed to be a type-1 extreme value for discrete variables and normal for continuous variables.

The decomposition allows for nonlinear unobserved individual-level heterogeneity. To estimate the parameters of the unobserved heterogeneity distribution, I use a semi-parametric discrete factor random effect (DFRE) estimation method. This methodology is a generalization of Heckman and Singer (1984), by Mroz and Guilkey (1992) and Mroz (1999), which allows the econometrician to not impose any functional or distributional form for the correlated unobserved heterogeneity. Instead, its cumulative distribution is approximated by a step function where mass points weights are jointly estimated with the other parameters of the model (Guilkey and Lance 2014). The assumption on the iid component for the total error, allows one to semi-parametrically estimate the distribution on unobserved heterogeneity.

Evidence from simulations shows that the DFRE performs better than other estimation approaches (Guilkey and Lance 2014). This evidence is important when the error distribution is not jointly normal and when there is a high correlation across equations; which is expected to happen in microeconomics applications.Footnote 15,Footnote 16,Footnote 17

One could follow a single equation approach and estimate each behavioral equation using fixed or random effects. Nevertheless, single equation estimation methods generate biased results. There are endogenous variables (e.g., selection into employment and simultaneity with savings decisions), omitted information (e.g., savings amounts, risk preferences, tastes for alternatives), and measurement error (e.g., self-reported survey measure) in the set of equations. Recall that reduced-form parameters are a function of the primitives of the structural model. Therefore, each estimated parameter is a function of unobserved characteristics. To correct for these biases, it is necessary to model all of these processes (i.e., the correlated error terms).

The DFRE differs from standard random effects estimation as it jointly estimates the distribution of unobserved characteristics together with the reduced-form parameters. Regarding fixed effects, the DFRE method does not present a substantial loss of degrees of freedom and uses both variation over individuals and over time to account for unobserved heterogeneity (Fout and Gilleskie 2015; Gilleskie et al. 2017).

4.2 Initial condition equations

In the first wave of the EPS, some of the endogenous variables are non-zero. Because I do not observe the history of decisions before this wave, the dynamic specification cannot be used to explain this variation. Thus, I specify static reduced-form equations that depends on exogenous individual characteristics. These initial condition equations are correlated with the other equation through a permanent unobserved component.

Initial conditions consider all endogenous variables entering the first period. These include: employment status (polychotomus: full- and part-time, self-employed and informal, not employed), work experience (continuous variable), contribution status (polychotomus: mandatory, voluntary, not contributing), savings inside and outside the retirement system (both dichotomous variables), housing status (polychotomus: owner, owner with loan, renting, lending), marital status (dichotomous) and number of children (continuous). Each initial equation is denoted by k where k = {10, . . . , 17}.

The error structure for initial conditions is as follows:

$${\varepsilon }_{it}^{k}={\mu }_{i}^{k}+{\epsilon }_{it}^{k},\ {\rm{where}}\ k=\{10,...,17\}$$
(11)

The initial condition equations are identified with exogenous market-level characteristics and individual exogenous characteristics that do not enter subsequent equations (e.g., father’s and mother’s schooling).

4.3 Estimated likelihood function

The likelihood function conditional on correlated unobserved heterogeneity is:

$${{\mathcal{L}}}_{ct}(\Theta ,\mu ,{\nu }_{t})=\mathop {\prod\limits_{i=1}^N}\left\{{f}_{w}(\mu ,{\nu }_{t})\mathop {\prod\limits_{j=1}^J}{\left[Pr(I({a}_{t}^{j}={a}^{j})| \mu ,{\nu }_{t})\cdot {f}_{j}({\epsilon }_{t}^{j}| \mu ,{\nu }_{t})\right]}^{I({a}_{t}^{j}={a}^{j})}\right\}$$
(12)

where Θ represents the vector of all parameters to be estimated, \({a}_{t}^{j}\) is a choice j = {E, C, D, Sr, So, H, M, N}, f(⋅) represents the density function of the error term of each equation, Pr(⋅) is the cumulative distribution function for each choice, and \(I({a}_{t}^{j}={a}^{j})\) is an indicator of a particular choice.Footnote 18

The unconditional likelihood function for the joint estimation of the system is:

$${{\mathcal{L}}}_{t}(\Theta )={\sum \limits_{q=1}^{Q}}{W}_{\mu q}{\sum \limits_{r=1}^{R}}{W}_{\nu r}{\prod \limits_{t=1}^{T}}{{\mathcal{L}}}_{ct}(\Theta ,\mu ,{\nu }_{t})$$
(13)

where Wμq is the probability of observing q mass points for the permanent component μ and Wνr is the probability of observing r mass points for the time-varying component νt. These approximate the true distributions of μ and νt.Footnote 19

4.4 Identification

The reduced-form life-cycle model consists of 17 jointly estimated equations. The total number of estimated coefficients correspond to the number of coefficients on observed characteristics (886) plus the coefficients that capture the distribution of unobserved heterogeneity, whose mass points are empirically determined.Footnote 20

The empirical specification is presented in Table C.1 in Appendix C. Explanatory variables can be classified as predetermined (or endogenous control variables), exogenous (individual and market-level characteristics), or unobserved heterogeneity. Predetermined variables capture the dynamic nature of the decision-making process. While predetermined variables represented endogenous choices in the previous period, they are predetermined to individual choices in t.Footnote 21 The number of explanatory variables in each equation is presented in Table C.2 in Appendix C. Observed explanatory variables may enter linearly as higher-order moments and interacted (e.g., a variable and its squared).

The system is completely identified and therefore, causal effects can be distinguished. Identification comes from several sources. First, identification comes from the dynamic nature of the model and timing assumptions on the decision-making process, following the standard requirements for dynamic-panel estimation, even with general patterns of correlation across equations (Arellano and Bond 1991; Bhargava 1991). As instruments there are two types of variables: (1) predetermined (lagged) choices and (2) market-level characteristics.Footnote 22 Because endogenous variables in t are predetermined in t + 1, I am able to achieve identification by incorporating lagged endogenous behaviors as instruments. For example, employment decisions in t depend on predetermined variables such as work experience, which is known for the individual at the beginning of t and depends on past choices (i.e., employment decisions in the previous period). Thus, work employment is a predetermined variable each period that can be used as an explanatory variable. There is sufficient variation considering the four waves of data. This set of instruments is valid if there is no auto-correlation steaming from the idiosyncratic component which is an assumption commonly used with the DFRE (Yang et al. 2009). Observed values of endogenous variables are included in the specification instead of predicted values.

Some of the predetermined variables are excluded from some equations. Most lagged choices are excluded from the wage equation and family equations (only employment status shows up). All lagged choices are excluded from initial condition equations. Some of these market-level characteristics that explain choices in t are excluded from the outcome equations in the same period and therefore serve as exclusion restrictions. Table C.1 in Appendix C also shows sources of identification based on the theoretical exclusion restrictions.

Regarding exclusion restrictions, all equations depend on some market-level characteristics denoted by Z. Exclusion restrictions summarize characteristics of the market where the individuals are participating. The individual’s choices do not affect the market-level information. Thus, these variables are exogenous to the individual decision-making problem. All the exclusion restrictions are theoretically justified.

For example, wages in period t depend on current characteristics of the labor market (\({Z}_{it}^{E}\)) but not on characteristics of other markets, conditional on choices made at period t. Similarly, conditional on choices at period t, only characteristics of the marriage market (\({Z}_{it}^{M}\)) and of the family market (\({Z}_{it}^{N}\)) affect marital status and children variation, respectively. Moreover, the dynamic specification of the model includes lagged endogenous variables (pre-determined at period t) that are a function of market-level characteristics (i.e., Zi,t−1 explains choices in period t − 1). This argument follows the standard argument to identify dynamic models (Arellano and Bond 1991).

Second, the functional form assumption for the distribution of the idiosyncratic error term in each equation (ϵit) serves to identify the system as it gives the functional form to equations. It is impossible to identify the distribution of the correlated unobserved heterogeneity if there is no inclusion of an idiosyncratic component into total unobserved heterogeneity and if no assumptions regarding its distribution are made. Thus, the method is a semi-parametric estimation method that non-parametrically estimates the distribution of permanent and time-variant unobserved characteristics.

Third, identification comes from the number of factors allowed for the step approximation for the correlated unobserved heterogeneity. Ideally, one would like to identify a distribution of permanent and time-varying unobserved heterogeneity affecting each behavioral equation and each outcome of the model. Nevertheless, this cannot be identified. Instead, I estimate two distributions (one permanent and one time-variant) affecting all choices. Lastly, identification comes from the nonlinear nature of the system of equations (Guilkey and Lance 2014; Morales et al. 2016).

5 Results

5.1 Specification and model fit

There are 1082 estimated parameters. The model that best captures the distribution of unobserved characteristics and has the best model fit has five mass points for the permanent unobserved heterogeneity and four mass points for the time-varying unobserved heterogeneity. Before presenting the simulations, I compare the observed data with the simulated outcomes from the data generating process. Evaluating the model fit is important since the model is used to simulate counterfactual policies. Simulated values are obtained using observed explanatory variables without updating and with 100 replications for the types’ probabilities. The comparison shows the model fits the data well (see Table C.3 in Appendix C).

5.2 Estimation results

5.2.1 Preferred model

I now refer to the results for equations that capture individual choices. All results tables are presented in Appendix C in Tables C.4–C.10.Footnote 23 Because I model behaviors simultaneously, estimates effects are causal.

Earnings

There is a statistically significant wage gap of 18.9%. This gap goes in line with other findings in the literature. Using the first three waves of the EPS (2002–2006), Perticará and Bueno (2009) find a wage gap between 12.7 and 18.7%. Using administrative records on unemployment insurance between the years 2004 and 2009, Cruz and Rau (2019) find a wage gap of 24.5%. It is found that informal workers earn 32.3% less than their formal counterparts. No significant effect of children on wages is found. The results show a significant but small effect of marital status.

Employment status (relative to formal workers)

Individuals are significantly less likely to be informal workers or not work if they were mandatory contributors in the previous period. Individuals with more wealth and work experience are more likely to be formal workers. As an individual ages, she is more likely to be informally employed or not employed. There is a significant decrease in the probability of being informally employed or not employed as education increases. I find no gender effects, except for the marriage-female interaction, meaning that married women are more likely to be informally employed or not employed. I find no significant effect of previous default status or savings holdings.

Contribution status (relative to not contributing)

There is significant inertia in contribution status: individuals are significantly more likely to be mandatory (voluntary) contributors if they were mandatory (voluntary) contributors in the previous period. Being a voluntary contributor in t − 1 increases the probability of being a mandatory contributor in t, while the opposite is not true (no significant effect of previous mandatory contribution status on voluntary savings).

Individuals with previous voluntary savings within the system are more likely to contribute, while individuals with previous savings outside the system are significantly more likely to be voluntary contributors. This suggests that individuals who are not forced to contribute use other available savings tools. The likelihood of being a mandatory contributor increases with wealth and work experience (both with significantly diminishing rates). The wealth effect is barely significant for voluntary contributors, and no work experience effect is found. In both cases, no gender effects are found, except when interacting female with family characteristics. For females, marriage and more children increase the probability of not contributing. The probability of not contributing decreases as education increases.

Default adoption (relative to defaulting with no action)

Previous contribution status (relative to not contributing in the previous period) increases the probability of choosing or opting out of the portfolio default, relative to following the default. There is also a significant inertia in default status, meaning that an individual is more likely to choose (opt out) the default if chosen (to opt out) previously. Individuals with voluntary savings within the retirement system are significantly more likely to choose the default and to opt out.

Interestingly, individuals with voluntary savings outside the retirement system are more likely to opt out, but no significant effect is found for choosing the default. This suggests that individuals who use savings tools outside the retirement system are more sophisticated in their investment strategies. As retirement wealth increases, individuals are less likely to be assigned to the default. No significant experience effects are found. Similarly to other research, younger individuals are more likely to be defaulted and as wealth increases, the probability of opting out increases (Kristjanpoller and Olson 2015). Women are less likely to choose the default than men, while no significant differences are found with respect to opting-out from the default. These results complement others in the literature (Kristjanpoller and Olson 2015).

Voluntary savings

There is significant inertia in savings status. As retirement wealth increases, the likelihood of holding voluntary savings within the retirement system increases, but the same cannot be accepted for savings outside the system. No significant effect of previous contribution status is found, and little effect with respect to previous default adoptions is evidenced. Contrary to our expectations, individuals paying for a home and renting are more likely to hold savings within the system.

5.2.2 Single equation models and alternative DFRE specifications

I now compare the estimation results with fixed-effects and alternative DFRE specifications. Results are presented in Appendix D in Tables D.1–D.6. The comparison is done in log odd terms. Generally, the estimates obtained from single equation fixed-effects are statistically different from the ones obtained after correcting for endogeneity biases.

Let’s take for example the multinomial logit on employment status (Table D.1 in Appendix D). Most predetermined behaviors (e.g., lagged choices such as contribution status, housing status) and stock variables (e.g., retirement wealth, work experience) are statistically different from DFRE estimates. The difference is the bias that one is not able to pull out when applying methods that do not correct for correlated errors. As a second example, let’s take the multinomial logit on contribution status (Table D.2 in Appendix D). The fixed-effects estimation show that coefficients on predetermined variables, both lagged choices and stock variables, are statistically different than the DFRE results, evidencing the estimation bias that arises when assuming that errors are uncorrelated and only capture permanent unobserved characteristics. In both cases, exogenous individual characteristics (e.g., age and education) also present statistical differences.Footnote 24

Additionally, I compare the results with: DFRE with one equation (multinomial logit), DFRE with two correlated equations, full DFRE without exclusion restrictions, and fixed-effects with no market-level exclusion restrictions (see Table D.6 in Appendix D).

The one-equation DFRE (column (2)) evidences the bias when one does not control for correlated heterogeneity. Results indicate that differences with respect to the full model (column (1)) are substantial and statistically significant. Results from the two-equations DFRE model allows controlling for correlation across two equations (column (3)). In particular, employment decisions are jointly modeled with wages. Note that the results are very similar to the ones obtained from a one-equation model, suggesting that modeling only two outcomes is not enough to control for estimation biases.

The omission of supply-side relevant factors result in biased estimates toward zero (column (4)). Fixed-effect models (with and without exclusion restrictions, columns (5) and (6), respectively) do not show substantial differences for key variables. Thus, log odd terms on predetermined endogenous variables are not sensitive to the inclusion of market-level characteristics for the single equation panel estimation. However, note that when controlling only for uncorrelated permanent unobserved heterogeneity the bias does not disappear and the estimates change in the opposite direction.

5.3 Contemporaneous marginal effects

To calculate marginal effects I use the standard procedure in DFRE papers.Footnote 25 Marginal effects are computed without updates in response to previous choices (i.e., short-run effects). Standard errors are calculated using predictions based on 100 draws of the estimated coefficients from the estimated variance-covariance matrix. I focus on effects of stock variables (work experience, age and wealth) and the following behaviors: lagged optional savings choices and lagged default behavior. All results are expressed in percentage change with respect to the baseline scenario.

Work experience partially captures experience in the system; while age captures the aging evolution of choices. Retirement wealth captures the effect of accumulating assets (Table 1). A marginal increase in wealth significantly increases the probability of being a formal worker by 0.12% (column 3 in Table 1). I argue that the 0.12% is the causal effect only of retirement wealth. The probability of not contributing and of being assigned to the default significantly decreases by 0.25% and 0.30%, respectively. An increase in retirement wealth also generates a small but significant (at 10% level) increase in savings outside the system and in home ownership with a loan (at 5% level). This finding suggests that as retirement wealth needs are covered, individuals switch to other forms of investment. The evolution of investments through the life cycle is not attributable to aging, as the short-run effect of age does not significantly explain portfolio default adoption, savings choices and home ownership (column 2).

Table 1 Contemporaneous marginal effects of stock variables (%)

These results are consistent with other findings in the literature, such as Attanasio and Brugiavini (2003) for Italy and Attanasio and Rohwedder (2003) for the U.K.Footnote 26 Unlike Attanasio and Brugiavini (2003), I attribute the switching to retirement wealth rather than to aging. The insignificance of age effects is also consistent with other studies, where for the U.S., using a sample of older individuals, Gustman and Steinmeier (1999) find little and insignificant effects of retirement wealth on other savings. Because individuals are transitioning to other forms of savings, this finding might explain why participation in voluntary savings retirement programs are low.

An additional year of work experience (column 1) mostly explains employment status (e.g., an additional year of work experience significantly increases the probability of being a formal worker by 0.90% and decreases the probability of not working by 1.95%. No effects are found for savings or investment behavior.

Contemporaneous marginal effects of lagged saving choices and default behavior are presented in Table 2. Optional savings within the system (column 1) significantly decreases the probability of not contributing by 2.30% and increases the probability of mandatory or voluntary contribution by 1.13% and 1.17%, respectively. Optional savings within the system also significantly decreases the probability of being defaulted into the system’s designed portfolio by 4.26%. This latter result suggests learning with respect to portfolio choices, as it shows that individuals with higher exposure to the system pursue more sophisticated investment strategies. These results follow the same pattern of results documented by Chetty et al. (2014), who find that active savers are more financially sophisticated than passive savers, among other findings.

Table 2 Contemporaneous marginal effects of previous investment behaviors (%)

An investment action in the previous period (i.e., choosing or opting out of the default) significantly decreases the probability of being defaulted by ~15% (see columns 3 and 4). Once an individual opts out, the probability of maintaining that decision in the next period significantly increases by 14.67%. Generally, individuals are free to choose any investment account, but there are restrictions to older individuals. For example, men over 55 years old and women over 50 years old cannot choose account A. If they are in that investment plan before, they can be defaulted if they have not transfer at the time they achieve the limit age. Even if individuals made an active choice in the past, that does not necessarily mean that they will make another active choice in the future. This result might suggest that sophisticated individuals regarding their portfolios, tend to follow their own choices rather than following the system’s designed investment path. Unlike other papers that evidence inertia in retirement investment decisions once individuals are defaulted into plan characteristics, I find a substantial inertia when individuals decide to opt out of the default (Carroll et al. 2009; Choi et al. 2004; Madrian and Shea 2001).

There is substantial inertia in savings outside the system (column 2). Holding savings outside the system in t − 1 significantly increases the probability of holding savings in t by 11.49%. Previous choices regarding outside savings significantly decreases the probability of not contributing by 0.59%. Although this effect follows the same pattern as the effect generated by previous choices with respect to voluntary savings within the system, the effect is significantly smaller for savings outside the system. Both previous choices significantly increase the probability of voluntary contribution, but we cannot reject the null that the effects are significantly different from each other.

Some key issues regarding participation in the system arise. Even though there are no significant impacts on formal market participation, any action taken, significantly decreases the probability of not contributing by at least 2% and increases the probability of mandatory contribution by almost 2%. Opting out of the default increases the probability of voluntarily contributing by almost 1%. However, this is not the case for individuals who take action but follow the suggested investment path, where no significant effect is found. This finding is an important one, as it suggests that as individuals become more sophisticated in investment strategies, they are also more likely to go beyond compulsory savings (see Chetty et al. (2014)).

5.4 Counterfactual simulations

Simulations quantify long-run effects by incorporating the dynamic effects of behavior on future outcomes. Because the estimated dynamic decision model allows me to model the sequential decision-making process, I use the estimated model as the data-generating process.Footnote 27 Simulating counterfactual scenarios is ultimately the main goal of papers that applied the DFRE method. The simulations are perform in the standard way.Footnote 28

I obtain the matrix conformed by the estimated coefficients for each equation, and use it as input to predict behaviors based on observed characteristics.Footnote 29 Simulated outcomes are used to update next period’s endogenous explanatory variables. Each individual is replicated 100 times, from draws from the unobserved heterogeneity distribution. Individuals enter the first period with their initial characteristics, and the treatment is permanently applied starting in period t = 2. Standard errors are calculated using predictions based on 100 draws of the estimated coefficients from the estimated variance-covariance matrix. All results are expressed in percentage change with respect to the baseline scenario.

5.4.1 Effect of participating in voluntary savings programs

I find no evidence of long-run crowding-out across savings. Table 3 shows the response per period. Note that no effect is statistically significant from zero, meaning that individuals do not treat these two type of accounts as substitute goods. Outside savings do not significantly affect behaviors within the system, such as the share of individuals with inside voluntary accounts, contribution status and default adoption.

Table 3 Crowd-out effects across voluntary savings accounts in the treated group (%)

Table 4 presents long-run effect of voluntary savings on default adoption. All enrolled individuals are treated. Savings outside the system have no effect on portfolio (column 2). There is evidence of an important long-run effect of voluntary retirement savings on portfolio adoption (column 1). For the first two years after the treatment, individuals who choose the default significantly increased by 20 and 26%. There is no effect on choosing more sophisticated portfolio actions (i.e., opt out). Individuals who chose the default or opt out increase by 31% and 41%, respectively at t = 4. In the last 4 years, individuals who follow the most sophisticated strategy increase from 41 to 48%. This suggests that long-run effects of participating in voluntary programs are important for increasing active participation in portfolio decisions.

Table 4 Effect of holding voluntary savings on default adoption in the treated group (%)

I find no crowd-out effect between outside savings and home ownership (Table 5, column 2). Home owners with loans significantly decreases between 7 and 10% for the first four years after treatment. This suggests that home ownership and participation in voluntary retirement programs are substitute goods. Policy makers should be careful in the design of policies that seek to increase savings within the system, as there might be important welfare implications once the retirement age is reached.

Table 5 Effect of holding voluntary savings on default adoption in the treated group (%)

5.4.2 Home ownership effects

I now simulate all enrolled individuals to be homeowners (with no loan) and then evaluate how it impacts participation in contributions and in voluntary savings. No significant home ownership effect is found on savings outside the system (Table 6). Corroborating the substitution between retirement savings and home ownership, results indicate that once all enrolled individuals are homeowners (with no loan), their participation in voluntary retirement programs significantly decreases. Four years after the treatment, there is a decrease in such participation between ~16 and 21%. Homeowners are also less likely to contribute by between ~3 and 5% for three years after the treatment and permanently less likely to be mandatory contributors by between ~3 and 7%.

Table 6 Effect of home ownership on the treated group (%)

5.4.3 Effect of extending mandatory contributions to informal workers

I evaluate the effect of extending mandatory contributions to informal workers. A policy reform in Chile in 2008 implemented in 2019 anticipates that self-employed workers will be required to contribute. Because I cannot separately identify both categories of informal workers, the results are upper bounds of the effects of such a policy.

The treated are informal workers. I find no significant long-run effects on voluntary savings participation (columns 1 and 2 in Table 7). The likelihood of opting out increases by 18 and 28% between the first year after the treatment (t = 3) and the last year (t = 8). In periods 5 and 6, treated individuals significantly increase the probability choosing the default. These results indicate that forcing contribution does not result in a reduction in other savings. They also suggest that the treated group behave as sophisticated investors. Simulations indicate that home ownership and mandatory contributions work as substitute goods (see Table 8). When forcing individuals to contribute, home ownership significantly decreases, between 11 and 12%.

Table 7 Effect of extending mandatory contributions to informal workers (%)
Table 8 Effect on housing status of extending mandatory contributions to informal workers (%)

6 Conclusions

This paper investigates crowding-out across savings choices and assets. Results indicate no short- or long-run crowd-out across savings. In the short run, once retirement wealth needs are covered, individuals switch to other forms of investments. These transitions are not attributable to aging or work experience. The transition in savings might prevent participation in voluntary retirement savings programs.

Long-run simulations suggest that home ownership and retirement voluntary savings are substitute goods. The same is observed with mandatory savings and home ownership. Individuals with greater exposure to the system are more sophisticated in investment strategies. Individuals that are more sophisticated in investment strategies are also more likely to participate in voluntary retirement savings programs.

The long-run effects of participating in voluntary savings programs are substantial for active portfolio decisions. This key finding could be important for the design of default rules. For example, we could consider policies that force individuals to make investment choices rather than assigning them to a default portfolio. This could have the increased benefit of increasing participation in voluntary savings programs.

Future research should analyze how crowding-out between voluntary savings and home ownership affect resources available after retirement. A hypothesis is that retirees might be better off by investing in home ownership. This hypothesis is conditional on the amounts of such investments, and characteristics of the housing market. Future research could explore the possibility of modeling amounts invested. It would also be of interest to consider intra-household decisions.