1 Introduction and background

The decades of the 1990s and the 2000s have provided startlingly different experiences for American families. While the 1990s were largely characterized by the continuation of the economic expansion that started in the 1980s, the experience of the “great recession” dominated the decade of the 2000s. Additionally, the shrinking wage gap between men and women that started in the 1980s continued over the following decades (for example, see Rios-Avila and Hotchkiss 2014). And further, there has been a fairly dramatic rise in the share of families in which wives earn more than husbands. Wang et al. (2013) report that the share of married mothers that out-earn their husbands rose from 4 % in 1960 to 15 % in 2011. Winkler (1998) calculates that married families in which wives (not necessarily just mothers) earn more than husbands rose from 16 % in 1981 to 23 % in 1996. While the incidence of higher earning wives is overall lower for families with children (as women in these families, ceteris paribus, are less likely to participate in the labor market in the first place), the growth over time is clear. The purpose of this paper is to assess the impact of economic changes in the 1990s and 2000s on the welfare of married households, taking into account the earnings of the husband and wife relative to one another.

There is some evidence in the literature to suggest that the relative earnings structure within the household affects the family’s welfare level, or general satisfaction, and their fragility (e.g., Bertrand et al. 2015).Footnote 1 There is also evidence provided as to the reasons for the growing share of women that out earn their husbands (e.g., Winkler 1997). As far as we can tell, however, there are no analyses of how families with different earnings structures have fared in face of very different economic pressures over time. The research question here is this: Given the changes in wages and non-labor income observed across time, how has welfare changed in families where the husband and wife have different relative earnings potential? We find that over two very different decades, families in which the wife is the higher wage earner increased their total welfare by as much during the 1990s and by more during the 2000s than families in which the husband earned the higher wage.

Our results are consistent with earlier work that found that, due to the relative wage changes during the 1980s, families in which the wife was in a higher earning category than her husband were materially better off (Hotchkiss et al. 1997). Taken together, the implication of the results in this paper, along with that of earlier analysis, is that over three decades of the shrinking wage gap and other economic influences, families in which the wife is the higher wage earner or is in a higher earnings category than her husband experienced at least as much welfare gain as families with a different relative earnings structure.

2 Methodology

To answer the research question stated above, we estimate utility function parameters that allow us to calculate the dollar equivalent change in utility between two time periods for families in which the husband earns the higher wage, in which the wife earns the higher wage, and in which the husband and wife earn similar wages. The parameters are estimated in the context of a joint labor supply model within a family utility maximization framework that recognizes the importance of labor supply decisions of each family member being made with consideration of market opportunities of the other member. The structural utility maximization framework that we employ in this paper is preferred to alternative labor supply specifications, as found, for example, in Hausman and Ruud (1984), Hoynes (1996), and van Soest et al. (2002), because it allows us to deconstruct welfare changes based on the welfare valuation by the family of consumption and non-market time of each member.

We model family utility in a neoclassical joint utility framework, which can be thought of as a reduced-form specification of family decision-making. This has the advantage of giving us a clear-cut expression of family welfare that allows for cross wage effects on each member’s labor supply decisions, hence capturing some of the relativity in market wages that seems to be driving preferences. In order to allow for heterogeneity in preferences across family types, utility function parameters, hence labor supply elasticities, are allowed to differ for families of different relative earnings types. Of course, we are still assuming homogeneity of preferences within family type, but this assumption is not unique to our analysis or the application of a structural utility maximization framework.

It is important to remember in this exercise that it is not only changes in income that will affect family welfare, but also how the family values that income, especially relative to how the family values the non-market time of each member as that also changes in response to wage changes. For example, in an environment in which the wage gap between men and women is shrinking, it might be expected that welfare will increase more in families that depend more on the wife’s earnings. However, changes in wages (and non-labor income) not only affect total income/consumption potential, but also affect working hours of both members, which will also impact welfare. Comparing welfare changes across families with different relative earnings structures, it becomes clear that it is not only the relative changes in wages that matter, but also the way each family type values income and non-market time.

Because we are interested in the change in utility for different family types (based on relative wages) and because preferences may vary by family type, we estimate a different set of parameters for each family type. In addition, we consider that preferences (even within family type) are likely to have changed over time, so we estimate utility function parameters at two different time periods (for each family type)—in 2012 in order to evaluate change in utility during the 2000s, and in 2003 to evaluate change in utility during the 1990s (roughly speaking).

While some of the effects observed in the data might be driven by changes in marriage sorting across time, we do not model selection of individuals into marriages of different relative earnings capacities. The evidence provided by the literature of significant discomfort associated with one particular type of relative earnings structure (i.e., wife earning more than her husband) suggests that individuals, on average, do not necessarily sort into these types of families on purpose. In addition, because of strong evidence for assortative mating, especially in dual-earner families (see Bloemen and Stancanelli 2015), which spouse earns more may be more likely attributable to random or institutional (e.g., discrimination) factors.Footnote 2 In addition, results in Shore (2015) suggest that marital sorting may be based more on the volatility of spousal earnings, rather than on their relative value. However, if couples do sort based on their preference for a particular relative earnings structure, we are likely to over-estimate the welfare gain among any family type in which wage changes have up-ended that earnings structure. Given that the 2007 recession disproportionately affected men (Wall 2009), it may be the case that families in which the wife earns more than her husband in 2012 are more likely than other earnings types to have had their preferred relative earnings structure reversed.

2.1 The family utility framework

The assumption of joint family utility (or, “collective” utility) is often rejected in favor of a bargaining structure to household decision making (for example, see McElroy 1990 and Apps and Rees 2009). However, there is evidence that the choice of structure for household decision making has very little implication for conclusions in micro simulation exercises (see Bargain and Moreau 2003). In addition, Blundell et al. (2007) find that both collective and bargaining models are consistent with their household labor supply model estimated in the U.K. Also, using U.S. data, Winkler (1997) finds that income pooling (implied by the collective utility model) is not rejected when cohabitors are in a long-term relationship, especially when children are present.Footnote 3 The joint utility framework is used here in order to evaluate welfare changes of the family (as opposed to independent welfare changes of the individual family members).

Within the framework of the neoclassical family labor supply model, a family maximizes a utility function that represents the household welfare. Assuming, for simplicity, that there are only two members in the household (husband and wife) and the family chooses levels of non-market time for each member and a joint consumption level in order to solve the following problem:

$${\max_{({L_1},{L_2},C)}} U = U({L_1},{L_2},C) \\ {\rm subject}\ {\rm to}\;C = {w_1}{h_1} + {w_2}{h_2} + Y.$$
(1)

Define T as total time available for an individual; L 1 = Th 1 will be referred to as the husband’s non-market time, and L 2 = Th 2 will be referred to as the wife’s non-market time; h 1 is the labor supply of the husband; h 2 is the labor supply of the wife; C is total money income (or consumption with price equal to one); w 1 is the husband’s after-tax market wage; w 2 is the wife’s after-tax market wage; and Y is after-tax non-labor income.Footnote 4 Note that the terms L 1 and L 2 correspond to all uses of non-market time, including home production activities and leisure.Footnote 5

The solution to the maximization problem in Eq. (1) can be expressed in terms of the indirect utility function, which is solely a function of the wages of the husband and wife and non-labor income of the family:

$$\begin{array}{ccccc}\ V\left( {{w_1},{w_2},Y} \right) = U\left\{ {\left[ {T - h_1^{\rm{*}}\left( {{w_1},{w_2},Y} \right)} \right],\left[ {T - h_2^{\rm{*}}\left( {{w_1},{w_2},Y} \right)} \right]} \right.,\\ \\ \left. {\left[ {{w_1}h_1^*\left( {{w_1},{w_2},Y} \right) + {w_2}h_2^*\left( {{w_1},{w_2},Y} \right) + Y} \right]} \right\},\\ \end{array}$$
(2)

where \(h_1^*\left( {{w_1},{w_2},Y} \right)\) and \(h_2^*\left( {{w_1},{w_2},Y} \right)\) correspond to the optimal labor supply equations (desired hours) for the husband and wife, respectively.Footnote 6

By totally differentiating the indirect utility function, we can simulate the change in welfare that results from changes in optimal hours of work and consumption in response to changes in wages and non-labor income (also see Apps and Rees 2009, p. 263):

$$dV = - {U_1}dh_1^* - {U_2}dh_2^* + {U_3}d{C^*},$$
(3)

where U 1 is the family’s marginal utility of the husband’s non-market time, U 2 is the family’s marginal utility of the wife’s non-market time, and U 3 is the family’s marginal utility of consumption. Equation (3) makes it clear that the change in welfare not only depends on the individual labor supply responses, but also on the family’s marginal evaluation of a change in non-market time and non-labor income.

Expressed in terms of changes in wages and non-labor income, and re-arranging terms to illuminate the contribution of those changes to family welfare through their impact on husband’s labor supply, wife’s labor supply, and total family income, the total derivative in Eq. (3) becomes:

$$\begin{array}{ccccc}\ dV = - {U_1}\left( {\frac{{\partial {h_1}}}{{\partial {w_1}}}d{w_1} + \frac{{\partial {h_1}}}{{\partial {w_2}}}d{w_2} + \frac{{\partial {h_1}}}{{\partial Y}}dY} \right)\\ \\ - {U_2}\left( {\frac{{\partial {h_2}}}{{\partial {w_1}}}d{w_1} + \frac{{\partial {h_2}}}{{\partial {w_2}}}d{w_2} + \frac{{\partial {h_2}}}{{\partial Y}}dY} \right)\\ \\ + {U_3}\left[ {\left( {{w_1}\frac{{\partial {h_1}}}{{\partial {w_1}}} + {h_1} + {w_2}\frac{{\partial {h_2}}}{{\partial {w_1}}}} \right)d{w_1}} \right.\\ \\ + \left( {{w_1}\frac{{\partial {h_1}}}{{\partial {w_2}}} + {h_2} + {w_2}\frac{{\partial {h_2}}}{{\partial {w_2}}}} \right)d{w_2}\\ \\ \left. { + \left( {{w_1}\frac{{\partial {h_1}}}{{\partial Y}} + 1 + {w_2}\frac{{\partial {h_2}}}{{\partial Y}}} \right)dY} \right].\\ \end{array}$$
(4)

The multiple steps involved in calculating this change in family welfare are described below along with a discussion of the estimation issues faced when undertaking this sort of simulation exercise. Remember, we are after an assessment of the progress of welfare in families of different earnings structures, rather than merely a snapshot of their relative welfare levels.

2.2 Estimation issues

The direction and magnitude of the change in utility that result from changes in the husband’s and wife’s wages and family non-labor income cannot be determined analytically; they depend on the direction of the wage changes and the size of the labor supply responses of the husband and wife to own and to spouse wage changes, as well as the relative size of the additional utility the family attains from consumptions of non-market time (i.e., additional leisure or home production) by the husband and wife and from changes in non-labor income.

There are many divergent empirical issues raised in the literature in relation to estimating labor supply responses to wage changes, i.e., estimates of labor supply elasticities. The goal here is to produce reasonable labor supply elasticities that are consistent with the literature. Toward that end, the methodology adopted takes the simplest approach possible while maintaining basic theoretical and empirical integrity.Footnote 7

2.2.1 Functional form

In order to obtain estimates of the pieces of the change in utility in Eq. (4) a specific functional form of utility must be specified. Following others (e.g., Heim 2009; Hotchkiss et al. 1997, 2012; Ransom 1987), we estimate a quadratic form of the utility function:

$$U\left( Z \right) = \alpha \left( Z \right) - \left( {\frac{1}{2}} \right)Z\prime {\rm BZ,}$$
(5)

where Z is a vector with elements Z 1 = Th 1, Z 2 = Th 2, and Z 3 = w 1 h 1 + w 2 h 2 + Y; α is a vector of parameters and Β is a symmetric matrix of parameters.Footnote 8 This functional form has the advantage of belonging to the class of flexible functional forms in the sense that it can be thought of as a second order approximation to an arbitrary utility function (when the second order conditions with respect to non-market time comply with \({U_{11}} < 0,{U_{22}} < 0\,\& \,{U_{11}}*{U_{22}} >U_{12}^2\)). In addition, it is possible to produce analytical closed-form solutions for both the husband’s and wife’s labor supply functions. Obtaining the first order conditions of this unconstrained maximization problem results in a system of equations linear in h:Footnote 9

$$\frac{{\partial U}}{{\partial {h_1}}} = {{\rm{\Omega }}_1}{h_1} + {{\rm{\Omega }}_2}{h_2} + {{\rm{\Omega }}_3} = 0$$
(6)
$$\frac{{\partial U}}{{\partial {h_2}}} = {{\rm{\Omega }}_2}{h_1} + {{\rm{\Omega }}_4}{h_2} + {{\rm{\Omega }}_5} = 0.$$
(7)

This system can be solved simultaneously, and the desired hours become \(h_1^* = f\left( {{w_1},{w_2},Y} \right)\) and \(h_2^* = g\left( {{w_1},{w_2},Y} \right)\), which represent the number of hours the members of a household would like to work, given the parameters that define their household utility function, given after-tax wages and non-labor income.

Observed hours (\(\tilde h\)), however, might differ from the optimum hours due to stochastic errors, such that:

$${\tilde{h}_1} = \left\{\begin{array}{ll}{{h_1^*} + {e_1}} & {{\rm if}\,{h_1^*} + {e_1} >0} \\ 0 & {\rm otherwise}\end{array}\right.$$
(8)
$${\tilde{h}_2} = \left\{ \begin{array}{ll}{{h_2^*} + {e_2}} & {{\rm if}\,{h_2^*} + {e_2} >0} \\ {0} & {\rm otherwise}\end{array}\right.$$

where we assume that (e 1, e 2) follows a bivariate Normal distribution with mean 0 and covariance matrix ∑.

2.2.2 Empirical specification

Estimation of this model can be thought of as a simultaneous Tobit model, where we have four kinds of families: those where both spouses work, those where only one of the spouses works (2 cases), and those where neither of them work. Allowing for hours adjustments along the extensive margin for the wife when assessing labor supply responses to wage changes has been found to make a significant difference when assessing total labor supply response (for example, see Eissa et al. 2008; Heim 2009). However, extensive margin hours adjustments appear to be unimportant for men (Blundell et al. 1988; Heim 2009). We opt for the most flexible specification, which allows for extensive margin hours adjustments for both the husband and wife (as in Hotchkiss et al. 2012).

The presence of non-working wives and husbands raises one empirical issue identified by Keane (2011) that must be addressed: market wages are not observed for family members who do not work. To obtain estimates of those wages, we take the standard approach in the literature of estimating a selectivity-corrected wage equation on the sample of working men and women, using regressors observable for both working and nonworking individuals (Heckman 1974).Footnote 10 The resulting parameter estimates are then used to predict wages for nonworking men and women based on their observable characteristics.

The maximum likelihood function corresponding to the joint labor supply optimization problem can be written as follows:

$$\begin{array}{ccccc}\ L = \mathop {\prod }\limits_{i = 1}^N {\left[ {\left( {\frac{1}{{{\sigma _1}{\sigma _2}}}} \right)\psi \left( {\frac{{{{\tilde h}_1} - h_1^*}}{{{\sigma _1}}},\frac{{{{\tilde h}_2} - h_2^*}}{{{\sigma _2}}},\rho } \right)} \right]^{\left( {H = 1,W = 1} \right)}}\\ \\ *{\left[ {\frac{1}{{{\sigma _1}}}\varphi \left( {\frac{{{{\tilde h}_1} - h_1^*}}{{{\sigma _1}}}} \right)\left\{ {1 - {\rm{\Phi }}\left( {\frac{{{\sigma _1}h_2^* - \rho {\sigma _2}\left( {{{\tilde h}_1} - h_1^*} \right)}}{{{\sigma _2}{\sigma _1}\sqrt {1 - {\rho ^2}} }}} \right)} \right\}} \right]^{\left( {H = 1,W = 0} \right)}}\\ \\ *{\left[ {\frac{1}{{{\sigma _2}}}\varphi \left( {\frac{{{{\tilde h}_2} - h_2^*}}{{{\sigma _2}}}} \right)\left\{ {1 - {\rm{\Phi }}\left( {\frac{{{\sigma _2}h_1^* - \rho {\sigma _1}\left( {{{\tilde h}_2} - h_2^*} \right)}}{{{\sigma _2}{\sigma _1}\sqrt {1 - {\rho ^2}} }}} \right)} \right\}} \right]^{\left( {H = 0,W = 1} \right)}}\\ \\ *{\rm{\Psi }}{\left( {\frac{{ - h_1^*}}{{{\sigma _1}}},\frac{{ - h_2^*}}{{{\sigma _2}}},\rho } \right)^{\left( {H = 0,W = 0} \right)}},\\ \end{array}$$
(9)

where φ and Φ correspond to the probability density and cumulative distribution functions of a univariate normal, and ψ and Ψ represent the probability density and cumulative distribution functions of the bivariate normal. Also, H = 1 if the husband is working and W = 1 if the wife is working (0 otherwise), σ i (i = 1, 2) represents the standard deviations of (e 1, e 2), and ρ is the correlation between the stochastic errors.

The stochastic errors accounted for in Eq. (8) represent errors in optimization—observed hours do not exactly reflect desired hours. Keane (2011) points out that there may exist measurement error in observed wages and non-labor income. This classical measurement error may bias elasticity estimates toward zero. Heim (2009), using a methodology most similar to the one used here, presents results showing that accounting for measurement error produces elasticities practically identical to when it is not accounted for. A typical strategy to mitigate the introduction of measurement error on wages per hour has been to restrict the sample to hourly paid workers. Unfortunately, the estimation strategy detailed here requires a lot of data and is not estimable with the restricted sample size of hourly workers only. Instead, if the person is not paid by the hour, we use information available about usual weekly earnings and usual hours worked per week. This means our wage estimate might suffer from what Keane refers to as “denominator bias,” which will have the tendency of biasing labor supply elasticities downward. If this were the case, the estimated labor supply response to own wage changes, for example, would be larger than reported here.

2.2.3 Endogeneity

Keane (2011) also identifies two potential sources of endogeneity. First, it is reasonable to expect that observed wages and non-labor income are correlated with a person’s taste for work (reflected through hours of work). Both fixed effects and instrumental variables have been used to resolve this issue but are simply not possible in this case since we do not have panel data and because of the non-linear nature of the labor supply functions to be estimated. In addition to the inclusion of variables expected to affect the taste for work (e.g., children), we expect that the inclusion of spousal variables (through the estimation of joint labor supply) will help to remove additional sources of correlation from the error term (i.e., because of positive assortative mating, people with similar taste for work will be married to each other; see Lam 1988 and Herrnstein and Murray 1994).

Second, in light of the kinked budget constraint created by the progressivity of the tax system, the after-tax wage rate and after tax non-labor income depends on which tax bracket the worker falls, which depends on the number of hours worked. The simplest approach to addressing this issue was first proposed by Hall (1973) and basically “linearizes” the budget constraint segment on which each person is observed by simply recalculating unearned income to find its “virtual” intercept as if it were extended beyond the specific segment. This strategy amounts to assuming that preferences are strictly convex, which means that family members would make the same hours choice facing this linearized budget constraint that they would have made facing the non-linear budget constraint.Footnote 11 If this assumption is binding, Keane points out that labor supply elasticities will be biased in a negative direction, underestimating the labor supply response to wage changes. Also, this assumption will only have implications for those few families for which the changes mean a movement across tax brackets, and we are focusing on the impact of relatively small changes in real wages and real non-labor income.

An additional concern Keane (2011) identifies in the literature is making sure the hours/wage combinations observed in the data are coming off workers’ labor supply curve, rather than off employers’ labor demand curve. Identification of the labor supply relationship boils down to including regressors (determinants of hours) that reflect the demand for a person’s skills (thus determine the observed wage) that are not reflective of that person’s taste for work. Toward that end, we include an indicator for race that could affect observed wage through employer discrimination, but, ceteris paribus, should not affect taste for work.

Further, we only marginally control for the presence of fixed costs of working raised by Apps and Rees (2009) by including the presence of children in the determination of hours. Heim (2009) presents results showing that once demographics are controlled for, additional consideration of fixed costs only very slightly impacts estimates of the parameters of the utility function (Heim 2009, p. Table 3).

3 Data

The Current Population Survey (CPS) is administered by the U.S. Bureau of Labor Statistics each month to roughly 60,000 households. The survey has a limited longitudinal aspect in that households are interviewed for four consecutive months, not interviewed for 8 months, then interviewed again for 4 months. Households, families, and individuals can be matched across these survey months if they remain in the same physical location. In survey months four and eight, the household is said to be in the “outgoing rotation” group, and members of the household are asked more detailed questions about their labor market experience, such as wages and hours of work.

We make use of the CPS outgoing rotation groups in March, April, May, and June from 2012 and 2003 in order to construct the samples for which the family labor supply model is estimated at each time period’s end point. Detailed non-labor income is obtained by matching each family to the March supplement, which is the month in which this information is collected. Multiple months of outgoing rotation groups are used in order to expand the sample size. We restrict the samples in the following ways:

Structural restrictions

  • include only households with husband and wife present and between18–64 years of age, in order to focus on adult family members prior to retirement;

  • exclude households with unmarried same- or opposite sex adults/partners or children older then 18 years old in order not to confound the spousal labor supply decision with that of other adult household members;

  • exclude households in which the main activity of either spouse is self-employment, since it is difficult to estimate market hourly earnings (wage) for someone who is self-employed.Footnote 12

Homogeneity restrictions

  • exclude households with non-positive total household income, negative non-labor income, after-tax hourly wages greater than $250 or less than $0.50, or after-tax weekly total income greater than $10,000

  • exclude households for which the calculated marginal tax rate is 75 % or higher

The estimation of this joint labor supply model is tricky. These were the most minimal “trimming” that we needed to undertake to generate convergence on theoretically consistent utility function parameter estimates (e.g., positive marginal utilities). Less than 1 % of our sample is lost due to homogeneity restrictions.

Information on the detailed sources of family income, number of children, and earnings available from the CPS is used to calculate the marginal tax rate on earnings (wages) and the total tax liability (in any year of interest) using the National Bureau of Economic Research (NBER) TAXSIM tax calculator (http://www.nber.org/~taxsim/; see also Feenberg and Coutts 1993).Footnote 13 The Tax calculator requires more information, such as number of itemized deductions, than is available from the CPS, so we made assumptions for the missing values as recommended by the managers of the tax calculator. For example, there is no information in the CPS that would allow one to calculate itemized deductions (mortgage payments, charitable contributions, etc.), so values of zero are entered for the missing information. Although this means we do not have as accurate a calculation of the family’s actual tax liability as we would like, it is important to remember that the assumptions for each family do not change across years of comparison.

3.1 Family classification

Based on husbands’ and wives’ after-tax hourly wages, both effective and predicted hourly wages, families are placed in one of three groups: (1) husband and wife have similar wages (within 0.2 log points, or 20 %, of one another), (2) the husband’s wage is greater (0.2 log points higher) than his wife’s wage, and (3) the wife’s wage is greater (0.2 log points higher) than her husband’s wage.Footnote 14 Creating a band for “similar” wages is done for two primary reasons. First, the band of similarity is to allow for measurement error in the reporting and calculating of hourly wages (also see Winkler 1998, who describes the implausible number of families with members who earn exactly the same amount). Secondly, the 20 % band accommodates family members who switch high-wage status from time to time. In other words, we want to be sure that the relative wage classification that we observe at one point in time is reflective of this family’s typical relative wage situation. Winkler et al. (2005) find that the relative earnings position over a 3-year period of time switched for up to 40 % of married couples. This is also one reason we use relative hourly wages to classify families, rather than relative earnings. We expect wages to reflect a more consistent valuation of relative earning potential within the family than earnings at any point in time, which is also dependent on hours worked. We expect much greater variation in a person’s hours than in their earning potential. In addition, the exercise to obtain utility function parameters is the estimation of optimal labor supply (hours), which figures into the construction of earnings—in other words, we don’t want to use our ex post outcome to classify families ex ante.

Separate utility function parameters are estimated for families in each of these groups since we would expect, based on the literature cited earlier, that preferences differ depending on which family member is observed earning a higher wage. For those cases in which the husband or wife is not working, their imputed after-tax wages are used to classify them with respect to the three family types. After-tax wages and non-labor income are in real values, reflecting the end-point of each decade.

The trends reflected in our data in the shares of families in which the wife earns a higher wage than her husband are consistent with trends shown by others in relative earnings (e.g., Bertrand et al. 2015; Winkler et al. 2005). The share of families in which the wife earns a higher wage than her husband increases from 16 to 18 to 21 % from 1994 to 2003 to 2012, respectively. Figure 1 reproduces, using our data, the charts used by Bertrand et al. to illustrate the distribution of families based on the wife’s relative contribution to family income. Rather than plot the distribution by the relative contribution to total income (which would include hours of work), Fig. 1 plots the distribution of families based on the wife’s contribution of relative hourly earning potential (wife’s wage divided by the sum of wife’s and husband’s wage) for 3 years, 1994, 2003, and 2012. The data in Fig. 1 reflect the “cliff” at 50 % illustrated in Bertrand et al.’s data. They point to this distinct break as evidence that families find being in the right tail (where the wife earns more than her husband) distasteful.

Fig. 1
figure 1

Distribution of families based on the wife’s relative hourly earning potential, 1994, 2003, and 2012. Note: Authors’ calculations, U.S. Bureau of Labor Statistics, Current Population Survey (http://www.bls.gov/cps/earnings.htm#demographics). The wife’s relative hourly earning potential is calculated as the ratio of the wife’s wage divided by the sum of the wife’s and husband’s wage. Calculated using only families with observed wages for both members. Density points to the left of the 50 % demarcation reflect families in which the husband’s wage is greater than the wife’s wage; points to the right are where wife’s wage is greater than the Husband wage

3.2 Sample means

Table 1 contains sample averages across family types for the 2 years used for estimation. As expected, labor supply is lower for both husbands and wives across all family types in 2012 than in 2003.Footnote 15 Families in which the wife earns the higher wage have the fewest young children; the highest non-labor income; and the highest education, wages, and age among wives. Generally, after-tax real wages and non-labor income were rising in the 1990s and increasing less in the 2000s. Additionally, consistent with Wang et al. (2013), we also see that total consumption (after-tax earnings plus non-labor income) is greatest, on average, among families in which the wife earns more than her husband.

Table 1 Sample means for families in each relative earnings structure

The periods of time used for analysis were chosen for a number of reasons. Earlier work (Hotchkiss et al. 1997; Hotchkiss and Moore 2002) analyzed welfare changes among two-earner couples during the 1980s, so this analysis can be seen roughly as an extension of that work, with a different focus.Footnote 16 In addition, there was a major redesign of the CPS in 1994, so that year provides an obvious starting point. Also, both time periods are the same length; contain a recession, albeit the 2007–8 recession was considerably deeper; and end at least 2 years after the end of a recession.

3.3 Simulating changes in wages and non-labor income

An important component of simulating the change in family welfare (detailed in Eq. 4) is to obtain an appropriate counterfactual of the average wage and non-labor income, in both cases after taxes, that households in, say, 2012, would have faced in 2003. Since we don’t have access to a panel data set (i.e., we can’t see the actual wages families in 2012 earned in 2003), we opt to use an inverse probability weighting methodology, akin to the one used in DiNardo et al. (1996), in order to create a counterfactual distribution of 2003 wages with the household characteristics observed in 2012.

This amounts to estimating the probability of an observation (in the combined 2003 and 2012 samples) being observed in year 2012 using as explanatory variables age of the husband, age of the wife, their squares and cross multiples, education of husband and wife, interaction between education and age, race of husband and wife, and the number of children of different ages:

$$P({\rm{year}} = 2012\left| X \right.) = {\rm{\Lambda }}(X\prime b).$$
(10)

The parameter estimates from this logit model are then used to construct the inverse probability ratio, \(\frac{{{\rm{\Lambda }}\left( {X\prime \hat b} \right)}}{{1 - {\rm{\Lambda }}\left( {X\prime \hat b} \right)}}\), for each household in 2003. This is used as a weight for calculating what the average husband and wife wages (and household non-labor income) would be for the 2012 families, based on the 2003 wage (and non-labor income) distribution:

$$\tilde w_{t = 2003}^{cf} = \frac{{{\rm{\Lambda }}\left( {X\prime \hat b} \right)}}{{1 - {\rm{\Lambda }}\left( {X\prime \hat b} \right)}} * {w_{2003}}.$$
(11)

These 2003 counterfactual wages (cf) are then averaged within family type (i.e., husband/wife earns more or the same) in order to compare with observed wages in 2012 for the same family type. The expected wage change of, for example, the husband between 2003 and 2012 (dw 1 in Eq. 4) for family type k is then given by

$$d{w_1} = \left[ {\frac{1}{{N_k^{2012}}}\mathop {\sum }\limits_{i{\it{\epsilon }}k}^{N_k^{2012}} {w_{1,t = 2012}}} \right] - \left[ {\frac{1}{{{T_k}}}\mathop {\sum }\limits_{i{\it{\epsilon }}k}^{N_k^{2003}} \tilde w_{1,t = 2003}^{cf}} \right],$$
(12)

where \({T_k} = \mathop {\sum }\limits_{i{\it{\epsilon }}k}^{N_k^{2003}} \left[ {\frac{{{\rm{\Lambda }}\left( {X\prime \hat b} \right)}}{{1 - {\rm{\Lambda }}\left( {X\prime \hat b} \right)}}} \right]\). This calculation is analogously performed for dw 2 and dY in Eq. (4). This procedure is repeated to obtain 1994 counterfactual wages and non-labor income for families observed in 2003.

Changes in wages (dw 1 and dw 2) and non-labor income (dY) are also reported in Table 1. These are the actual stylized level changes in wages and non-labor income that are plugged into the change in utility equation (Eq. 4). Both husbands and wives experienced positive wage changes across all family types in the decade ending in 2003. On a percentage basis, wives experienced greater gains during the decade, with the largest percentage gains going to wives in families in which the husband is the higher earner (note that a 9.1 % change over the decade amounts to a little less than 1 % per year). The greatest percentage gains in (virtual) non-labor income during this early period accrued to families in which the wife earns more. Among these families, the greatest gains in non-labor income (not shown) were in the form of educational benefits, disability, and retirement, and other/unspecified income. The largest contributors to the loss in non-labor income experienced by families with similar wages were declines in welfare and survivor’s benefits.

In the decade ending in 2012, all family types, except where the husband earns more, experienced gains in non-labor income. The largest gains for families in which the wife earns more were from welfare, supplemental security income, and rental income. Wage gains for wives were very similar across all family types during the later decade, with the largest wage gains for men appearing in families with similar wages.

As will be seen later, these larger wage and non-labor income gains among some family types suggest where we are likely to see the greatest gains in welfare across the decades, but the mapping is not one-to-one. Welfare changes are not determined simply by change in wage and income but also the value families of different types place on that income and on the non-market time that will adjust to changes in wages.

4 Results

4.1 Estimated labor supply elasticities

The maximum likelihood labor supply estimates are reported in Appendix 2 and are as expected, for the most part. For example, across all family types, hours increase at a decreasing rate with age; the presence of children has a mixed impact on the labor supply of both husbands and wives across different family types; black (Hispanic) husbands and wives work fewer (more) hours than their white counter-parts; and husbands with higher levels of education work more hours than those with less than high school, whereas only wives with a high school degree work more hours than their less-than-high-school educated counter-parts.

The estimated marginal utilities and labor supply elasticities of interest are shown in Table 2. As expected, income elasticities on the intensive (hours) margin and extensive (participation) margin are negative for both husbands and wives; and both intensive and extensive cross-wage elasticities (when non-zero) are negative, except when husband and wife earn similar wages. We also observe, as expected, that wives’ hours and participation decisions are more sensitive to wages than their husbands’.

Table 2 Elasticity and marginal utility estimates for the average family of each earning structure type

The estimated own wage hours elasticities for husbands are consistent with estimates reported by Kaiser et al. (1992) for Germany; and Ransom (1987), Pencavel (2002), and Heim (2009) using U.S. data.Footnote 17 In addition, the estimates for wives’ labor supply elasticities are on the low side, but within the ballpark of those reported in the literature using U.S. data. For example, the range of estimates found in Blau and Kahn (2007), Cogan (1981), Hausman (1981), Heim (2009), Hotchkiss et al. (1997), Ransom (1987), and Triest (1990) is 0.12–0.97.Footnote 18 Furthermore, the estimated negative cross-wage elasticities (except where the wife and husband earn similar wages) indicate that husbands and wives view their non-market time as substitutes; this is consistent with cross-elasticities estimated in Heim (2009), Hotchkiss et al. (2012, 2014), and Ransom (1987). The bottom line here is that the estimated labor supply elasticities (both extensive and intensive) are reasonable.

4.2 Changes in family welfare

Figure 2 illustrates the estimated dollar equivalent utility change for the average family in each relative wage classification.Footnote 19 Dollar equivalent utility change is calculated by dividing total utility change (dV) by the family’s marginal utility with respect to income (U 3). Considering the changes in both real after-tax wages and non-labor family income across two decades, and potential changes in preferences, families in all relative wage categories experienced much larger welfare gains during the 1990s than in the 2000s (the decade of the Great Recession). The average 1990s family in which the husband earned a higher wage experienced a marginally bigger (although not statistically significantly larger) gain in welfare across the decade than the average 1990s family in which the wife was the higher wage earner ($91.30 per week vs. $84.81 per week, respectively). Both of those family types gained more welfare than the average family in which the spouses earned about the same wage; at $41.14/week, this average family gained only half as much welfare as the other types of families.

Fig. 2
figure 2

Dollar equivalent changes (per week) in family welfare across two time periods

Note: Husband and wife have similar wages means they are within 0.2 log points of one another; the husband’s wage being greater (less than) the wife’s wage means that his wage is more (less) than 0.2 log points of his wife’s wage. Bootstrapped (250 iterations) standard errors in parentheses

The story is different in the 2000s. During the decade of the Great Recession, families in which the wife was the higher wage earner gained the greatest amount of welfare—an average of $21.55/week—higher (but not statistically) than the marginally lower gain of $20.51/week for the average family in which the husband and wife earned similar wages and statistically significantly more than the $3.88/week gain for the average family in which the husband was the higher wage earner. This is not too surprising as the 2007–2009 recession was known for it’s disproportionate impact on industries in which men are over-represented—a fact that led to the dubbing by some of the recession as a “man-cession” (see, for example Rampell 2009). Consequently, a smaller welfare gain among families in which the husband is the higher wage earner (or, rather, higher potential wage earner) could derive from him not actually realizing that potential in the labor market or from suffering larger declines in wages than his wife.

Referring back to Table 1, we can see one contributor for these different welfare change outcomes across the decades.Footnote 20 In the 2000s, in families in which she was the higher wage earner, the wife experienced an average real after-tax wage gain of $0.38 but only $0.33 in families in which the husband earned the higher wage. At the same time, husbands experienced only a $0.09 average real after-tax wage increase in families in which the wife earned the higher wage and actually faced an average wage decline of $0.03 if he earned the higher wage. In addition, virtual non-labor income rose more—by an average $5.09/week—in families in which the wife earned the higher wage, whereas families in which the husband earned the higher wage experienced an average loss in weekly virtual income of $1.75/week. In other words, in the 2000s, wage gains by both the husband and wife, as well as gains in non-labor income were smaller among families in which the husband was the high wage earner.

During the 1990s, wages rose for both the husband and wife in all types, although by different relative amounts depending on who was the higher wage earner. This suggests why the welfare gains comparing families in which the wife or husband earned the higher wage were not statistically different from one another in the 1990s. By contrast, wage gains were smaller for both the husband and wife in families in which they earned about the same, and virtual non-labor income declined in these families—making the welfare gains for this family type in the 1990s significantly less than the other family types.

4.3 Decomposing changes in family welfare

The change in family utility across the two time periods can be decomposed into contributions from the direct influence from wages and non-labor income and the indirect influence (through labor supply changes) of changes in behavior in response to those wage and non-labor income changes. Table 3 illustrates how Eq. (4) can be decomposed in this way, using the full sample change for the decade ending in 2012 for illustration. Between 2003 and 2012, the average family experienced a welfare gain of nearly six utils (a dollar equivalent $10.48 as seen in Fig. 2). Positive gains in wages and non-labor income contributed the largest component (6.6017) to this welfare gain; the increase in wife wages exceeded the decline in husband wages, producing an overall positive direct impact.

Table 3 Decomposition of change in family utility (in utils) for full sample, 2012

The net labor market response resulted in a decline in non-market time for both family members (an increase in non-market time from own wage elasticity and small decline in non-market time from cross wage elasticity), hence a negative contribution to the welfare gain (−0.6136). In addition, the total contribution of wage changes (5.3572) exceeded the contribution of the change in non-labor income (0.6309). Table 4 presents the decomposition of change in utility for all family types across both decades.

Table 4 Decomposition of welfare change across both time periods for the average family in each family type

For all family types, the direct effect of changes in non-labor income and wages dominates the indirect labor market effect in their respective contributions to change in welfare. However, ignoring the indirect effect would result in mis-calculating the total welfare change by as much as a negative 21 % to a positive 9 %. In addition, we cannot just look at the changes in wages and non-labor income to discern which family type experiences the greatest welfare gain. In order to make comparisons across family types, we need to divide the welfare change in utils by the family type’s marginal utility of income to translate it into the dollar equivalent welfare change.

Between 1994 and 2003, although families in which the husband earns a higher wage than his wife experienced greater percentage gains in wages and a greater gain in “utils,” the different valuation those family types place on utils produces dollar equivalent welfare gains that are statistically insignificantly different from welfare gains of families in which the wife earns the higher wage. Similarly, across the 2003–2012 time period, while households in which the husband and wife earn similar wages faced higher percentage increases in wages, their dollar equivalent welfare gain is lower, but not statistically different, than the dollar equivalent welfare gain in families where the wife earns a higher wage.

The point of this exercise is to illustrate that estimating labor supply responses to changes in wages and non-labor income over time in the context of a structural utility model allows a much more nuanced and accurate assessment of the relative welfare gains from those changes across different types of families.

5 Conclusions

The purpose of this paper was to assess the impact of economic changes in the 1990s and 2000s on the welfare of married households, taking into account the earnings of the husband and wife relative to one another. The share of families in which the wife earns a higher wage than her husband has been growing over the past several decades (e.g., see Bertrand et al. 2015), and this paper shows that changes in wages and non-labor income over the past two decades impacted family welfare differently across relative earnings structures of the husband and wife.

We find that total welfare of the average family in which the wife earns a higher wage than her husband rose just as much during the prosperous 1990s as it did for the average family in which the husband was the high wage earner, despite the latter experiencing larger earnings increases. In addition, the highest welfare gain in the dismal 2000s was among families in which the wife was the high wage earner (although the welfare gains were only marginally better than for families in which the spouses earned about the same wage). For all families, of course, the decade of the Great Recession meant smaller welfare gains all around than were experienced by families in the 1990s.

By decomposing those welfare gains across family type, we were also able to illustrate that while direct changes in income (non-labor and earned income) dominated the observed changes in welfare, the indirect effect of welfare caused by the labor supply responses was negative for all families but those where wives earned higher wages than their husbands. We also see that a ranking based merely on the level or percentage changes in wages and non-labor income by family type does not necessarily determine the ranking of the impact of those changes on welfare.

Earlier work by Hotchkiss et al. (1997) found that families in which the wife was in a higher earning category than her husband had greater increases in material well-being across the decade from 1983 to 1993.Footnote 21 This means that over three decades of analysis, families in which the wife is the higher wage earner or is in a higher earnings category than her husband experienced at least as much welfare gain as families with a different relative earnings structure. This doesn’t necessarily mean, however, that families in which the wife is the higher wage earner are happier (i.e., have higher utility levels)—in fact, Bertrand et al. (2015) would suggest otherwise. However, the results in this paper suggest that the growing incidence of wives earning more than her husband is, at the least, not putting welfare growth of families in jeopardy. In fact, since men have been hit hardest in every recession since at least the 1970s (Wall 2009), the non-traditional earnings structure may serve as a kind of insurance for welfare growth during economic downturns, as was seen in the results here for the decade ending in 2012.