Introduction

The retirement of the male partner in older Italian couples does not seem to lead to a more equitable distribution of housework between partners. Social norms and/or differences in bargaining power may explain the marked differences found in Italy with respect to other European countries (Caltabiano et al. 2016; Ciani 2016; Mills et al. 2008).Footnote 1 One possible explanation is that Italian married men have strong bargaining power and leave most of the housework to their wives, even after retirement. However, the influence of the bargaining process is difficult to specify and evaluate, considering that bargaining between partners strongly depends on their latent cultural and psychological characteristics.

Researchers have generally used proxy variables to analyse the bargaining effect in order to explain working time allocation in the household. Eckstein and Wolpin (1989) emphasized the relevance of the earning profile of each partner, while Brines (1994) and Gupta (2007) considered the “inverse” of the measure of the partner’s economic dependency given by the contribution of each partner to the household labour income. However, these approaches to evaluating the influence of bargaining did not lead to robust findings (see Baxter and Hewitt 2013; García-Mainar et al. 2011; Gupta and Ash 2008).

The aim of this study was to provide a convincing specification of the influence of bargaining in the relationship between a man’s retirement and the partner’s commitment in both paid and unpaid working time. In doing this, we started with the assumption that factors strongly related to women’s bargaining power are represented by the degree of economic dependency and by the perception of fairness in a woman's division of labour with her male partner (fairness perception) (e.g., Blair and Johnson 1992; Caltabiano et al. 2016; Kawamura and Brown 2010).

However, women’s perception of fairness regarding the division of housework time may depend on how working time is distributed among the partners. This mechanism of reciprocal dependence may involve a "reverse-causality" effect in the relationship between the man’s retirement and the division of labour. Namely, if a woman with higher bargaining power wants a more equitable distribution of housework time with her partner, this can induce the male partner to retire earlier to have more time to devote to work in the home and family. The opposite occurs if the woman has lower bargaining power. This implies a selection effect leading to an overstatement of the effect of a man’s retirement on the time devoted to housework by a woman with a higher bargaining power and, conversely, to an understatement of the effect of a man’s retirement on the housework time of a woman with lower bargaining power.

In this study, we tried to solve the problem of the selectivity effect of bargaining by taking into account the extent to which the endogenous component of bargaining influences the commitment in housework and paid work of both partners, and by correcting the estimation results accordingly.

As for the empirical analysis and specification of the model, for Italy there have been no longitudinal time-use surveys aimed at evaluating the change over time in partners’ division of labour. As a consequence, it has not been possible to evaluate the impact of the man's retirement by considering the intra-household allocation of time before and after this event. Longitudinal studies on the effect of retirement on intra-household work allocation in Italy have used data provided by the Generations and Gender Survey (GGS) (cf. Caltabiano et al. 2016). However, the GGS Survey does not provide detailed information on the allocation of time such as a specific survey on time use based on the diary method. The difficulty of performing a detailed longitudinal analysis did not discourage us from pursuing the aim of evaluating how the allocation of working time in the family changes if the male partner retires. We decided to analyse the effect of retirement in a cross-sectional context, along the lines followed by previous studies (see Battistin et al. 2009 and Ciani 2016, for Italy; Stancanelli and Van Soest 2012, for France).

For this reason, we performed our empirical analysis by using cross-sectional time use data on older Italian couples provided by the Italian National Institute of Statistics (Istat) with reference to the years 2008–2009.Footnote 2 The age of the subjects examined in this survey, in particular, was compatible with the eligibility rules imposed by Italian law for early retirement on a voluntary basis. This circumstance allowed us to empirically evaluate the effects of the voluntary decision to retire.Footnote 3

With respect to the studies regarding the Italian case, we would like to emphasize the role of bargaining process between partners as a result of the influence of cultural factors, for example, gender attitudes and aspects of women’s personality. The influence of these factors has been considered particularly relevant in Italy (e.g., Anxo et al. 2011). However, although the Italian case may be of particular interest for an analyst, the influence of bargaining on the relationship between retirement and labour division within the couple may be further analysed. In doing this, the problem of the selectivity effect of bargaining on retirement decision should be investigated further, adopting more specific methods to identify the effect of bargaining in a cross-sectional framework.

In order to identify the bargaining effect in a cross-sectional framework, in this study we decided to compare the working time of couples where the man had retired with the working time of couples in which the man had not retired (counterfactual situation). To this end, a matching procedure using a propensity score was adopted to link the couples belonging to each state (whether or not the male partner had retired).

By performing a matching procedure corrected for the endogeneity of bargaining, we introduced a correction term obtained by a preliminary estimate of a proxy of the woman’s bargaining power as a covariate in the equation of the male partner’s propensity to retire. Specifically, woman’s “predicted” bargaining power was obtained by combining behavioural and cultural factors as well as the extent of woman’s weight in household’s income production and woman’s availability of time. In doing this, two alternative estimates of woman’s bargaining power were performed: (a) a latent construct obtained by a Generalized Structural Equations Model (GSEM) and (b) a Linear Probability Model (LPM) regression using women’s satisfaction with labour division as a dependent variable.

We estimated the most common treatment parameters in order to evaluate the impact of the man’s retirement on the time devoted to paid and domestic work by both partners. In addition, we stratified the analysis for different couple profiles, each characterized by a different level of the female partner’s bargaining power. In this way, a measure of the influence of bargaining was given by the difference in the estimated housework time of households in which the woman had “more bargaining power” and households in which the woman had “less bargaining power”.

As a result of this analysis, the man's retirement did not lead to a significant reallocation of the woman’s working hours between paid work and domestic work, although a man belonging to a family where the woman had high bargaining power in household decisions was generally encouraged to leave work in order to devote more time to family commitments.

In the next section we present a brief survey on the findings of the most relevant studies on the relationship between partners’ retirement and their bargaining in labour division. In the following paragraphs we discuss, in sequence: the rationale of our methods, the characteristics of the dataset and the sample composition, and the estimation results. Finally, we conclude with final observations and remarks.

Latent Bargaining Process in Labour Division and Decision to Retire

In this section we briefly discuss three relevant aspects of decision-making within the household that influence the bargaining process between partners and the man’s decision to retire. We consider in particular personality aspects, the woman’s economic dependency, and her perceived fairness of the division of labour with her partner as factors related to her bargaining power. In addition, we discuss the potential endogeneity of the male partner’s decision to retire with respect to the partners’ division of labour as a consequence of the influence of bargaining.

Bargaining and Personality Aspects

Early models of time allocation among a household’s members were specified in terms of a unitary model that assumed that the household’s components maximized a single utility function (e.g., Becker 1981). In this approach there was no notion that the partner who earned more increased his/her bargaining power, as in bargaining or exchange theory. The alternative collective approach (Apps and Rees 1997; Chiappori 1988; Donni and Molina 2018; Matteazzi et al. 2017; Menon et al. 2018) considered family members, and not the household as a whole, as core decision makers; this implies that decisions relating to production, consumption, and labour participation should be taken simultaneously by each member in an equilibrium model. The collective model of labour supply was extended to household production in the studies of Apps and Rees (1996) and Chiappori (1997). Estimations of this model were proposed by Bourguignon and Chiuri (2005) and, more recently, by Mangiavacchi and Rapallini (2014) who, to estimate the sharing rule within the household, used self-reported economic condition of the household members obtained as responses to a questionnaire related to a Time Use Survey (TUS) in Italy.

More recently, several approaches took into account the behaviour of each household member resulting from individual preferences and supported by his/her bargaining power compared to that of the other household members (e.g., McElroy and Horney 1981). This is the case, for example, of the cooperative approach that allows differences between spouses to affect household decision-making by specifying a sharing rule given by a household welfare function (e.g., Manser and Brown 1980; McElroy and Horney 1981).

An alternative approach assumes that household members may act non-cooperatively. This approach is based on a model with individual preferences, but assumes that realized outcomes are determined by finding a Nash equilibrium in the behaviour of the household members (Lundberg and Pollak 1993).

In general, it is difficult to specify a model according to which family members adopt specific (cooperative or non-cooperative) behaviour in decision making. In reality, it is likely that different households behave in different ways, and that the household components are influenced differently by their own personality aspects, attitudes, and beliefs. Few studies have tried to combine different modelling approaches into one paradigm. One that has is the study by Del Boca and Flinn (2012), where a model of household time allocation, allowing for both efficient and inefficient household modes of interaction, was specified and estimated. This model incorporated compatibility constraints that required the utility of each household member to be no lower that it would be in the (non-cooperative) Nash equilibrium.

A study in which a cooperative/non-cooperative modelling framework similar to that of Del Boca and Flinn (2012) also included the impact of personality traits (Flinn et al. 2018). This study analysed the influence of personality traits in household time-allocation decisions and labour market outcomes. A personality traits’ measurement aims to capture “patterns of thought, feelings and behaviour” that correspond to “individual differences in how people actually think, feel and act” (Borghans et al. 2008). In practice, personality traits measure individual openness to experience, conscientiousness, extraversion, agreeableness, and neuroticism (the opposite of emotional stability).Footnote 4 In the above-cited study (Flinn et al. 2018), personality traits operated as potential determinants of the bargaining power of each partner jointly with education and cognitive ability. More specifically, personality traits gave a relevant contribution to the determination of the so-called Pareto’s Weights in order to quantify to what extent cooperative behaviour prevails between the partners.

Economic Dependency

In several time allocation studies, researchers have assumed that the bargaining power of the household members also depends on their own level of economic dependency (e.g., Brines 1994; Gupta 2007). In particular, the dependency model of the division of domestic work is based on the assumption that household labour is provided by a household member in return for economic support. According to this view, for example, since most married women earn less than their husbands and depend on them to some extent for support, wives continue to perform most of the domestic tasks (see Brines 1994). According to this model, an exchange relation between (dominant) breadwinners and (dependent) partners occurs: money is exchanged for domestic labour under an implicit contract stipulating the rights and obligations of parties.Footnote 5

In other studies, analysts found a positive impact of the woman’s availability of time on her bargaining power (see Bertocchi et al. 2014). This implies the presence of a pattern of specialization within the household that explains why an inverse relationship occurs between the woman’s commitment to paid work and her bargaining power in decision making. On the other hand, the woman’s weight in the household decisions is positively correlated to her ability in terms of income production, as well as in the above cited economic dependency models. In sum, according to this approach, women that wield high weight in household decision making show, jointly, low commitment in paid-work, and high available income.

In our model we specified a bargaining equation that took into account a measure of women’s economic dependency and a proxy variable of their availability of time (leisure time).

Satisfaction with Household Labour Division and Woman’s Perceived Fairness

Perception of equity is another factor considered by researchers in explaining division of labour within the family. Kawamura and Brown (2010), assuming that a higher perception of the equitable treatment of women in the division of labour implies a balanced division of housework tasks in the family, modelled the woman’s perception of fairness as a dichotomous response variable depending on both the wife’s perception of how much she mattered to her husband and the partner’s share of household work. Providing estimation results, they showed that “mattering” was significantly positively related to perceived fairness.

Perception of equity in partners’ division of labour is strictly related to the influence of gender attitudes and social norms. Several socioeconomic analyses have investigated how social norms affect the division of labour between partners. Especially in Southern European countries, gendered social norms influence the structure of the welfare state by leading to a division of labour between state, market, and family, in which family and domestic work are carried out predominantly by women (Jurado Guerrero and Naldini 1996; Saraceno 1994). In particular, Chiappero-Martinetti (2005), analysing Italian microdata on the use of time and on individual well-being, observed a stronger relationship between gendered division of labour and several specific indicators of well-being, mainly relating to health, education and knowledge, and psychological state.

Similarly, in our analysis we took into account the woman’s perceived fairness of treatment, related to her weight in household decisions. In particular, we assumed as a proxy of the woman’s perceived fairness of treatment the degree of her satisfaction with housework division, here measured by a dichotomous dummy variable, equal to zero if the woman was “dissatisfied” with housework division in the family, and equal to one if she was “satisfied”.

Endogeneity of the Male Partner’s Retirement Decision

Along these lines, in our analysis we assumed personality aspects, economic dependency, and perceived fairness in labour division to be relevant components of the woman’s bargaining power, as to influence on the man’s decision to retire.

However, because the bargaining process in labour division affects the decision of partners to retire, the estimation of the effect of retirement on the intra-household's allocation of time is contaminated by a selectivity effect, given the circumstance that a man’s propensity to retire may increase in couples in which the woman has higher bargaining power.

In order to solve the problem of endogeneity of the retirement decision in labour allocation within the couple, analysts have adopted several remedies based on the use of control variables. For instance, Blau (1998) specified potentially endogenous explanatory variables as time-lagged regressors. Blau and Riphahn (1999) used covariates measuring financial incentives, health, and preferences to control for unobserved couple-specific heterogeneity. Stancanelli and Van Soest (2012), using cross-sectional time-use data, specified a recursive model of retirement and time allocation of the two partners, endogenizing both time allocation between partners and the man’s retirement. Caltabiano et al. (2016), using longitudinal data drawn from the Italian Gender and Generation Survey (GGS), adopted a residual-based correction of the influence of omitted variables, as in a Seemingly Unrelated Regression model (SUR), to obtain correct estimates of the simultaneous paid and unpaid work equations of both partners.

In our analysis, we tried to specify an equation explaining the bargaining power of the woman taking into account the factors discussed previously related to psychological condition and personality, the degree of economic dependency, and the perception of her weight in household decision making. As a result of the estimation of the bargaining equation, we obtained predicted values of bargaining that we introduced as explanatory variables in estimating the retirement propensities of the male partners. In doing this, we adopted two distinct methods to identify woman’s bargaining power, and compared the results. The rationale of the strategy in our empirical analysis and a detailed description of the adopted estimation methods will be extensively discussed in the following section.

Model Specification and Estimation

The aim of this empirical analysis was to evaluate the extent to which the time spent on housework tasks differed between couples in which the male partner was retired and couples in which he was not. This comparison was based on a propensity-score matching method that allowed us to link couples characterized by a different status of the man (whether or not he is retired). In doing this, each man’s propensity to retire was obtained as a predicted value of the estimation of a discrete choice equation (using Probit), whose dependent variable was a dichotomous dummy equal to one if the male partner was retired (retirement equation).

In order to correct the estimation results of the retirement equation for the selectivity effect of bargaining, we simultaneously estimated the retirement equation and a further bargaining equation, implementing a two-stage procedure. The bargaining equation allowed us to identify and estimate, at the initial stage, the bargaining weight of the woman in household decision-making. Predicted values of the bargaining equation were used, at the second stage, as an explanatory variable in the male-retirement Probit equation.

At the first stage, the bargaining equation was specified as follows:

$${B}_{i}^{*}={{z}^{\prime}}_{bi}{{{\beta}}}_{b}+{u}_{bi}$$
(1)

In Eq. (1), \({{z}}{^{\prime}}_{bi}\) is a row vector, showing, for each observation, the values of the variables influencing bargaining. \({{{\beta}}}_{b}\) is a vector of coefficients, \({u}_{bi}\) an independent error term with zero mean. Regarding the specification of the dependent variable, we proposed, alternatively, two different proxies of a woman’s bargaining power. In particular, we indicated the dependent variable as a dummy, \({B}_{i}^{S}\), signalling whether the woman was satisfied with the division of labour with her partner (perception of equity); it was equal to 1 if the woman was satisfied, and to 0 otherwise.

As an alternative specification of a woman’s bargaining power, we indicated as \({B}_{i}^{L}\) a variable obtained as a result of a latent construct based on the contribution of structural variables, such as the woman’s economic dependency and perception of equity, and of measurement variables given by proxies of a woman’s personality aspects, education, health and leisure time. The latent variable \({B}_{i}^{L}\) was obtained by performing a Generalized Structural Equations Model (GSEM) (see Skrondal and Rabe-Hesketh 2004). In particular, we performed one single-factor measurement model and then calculated the predicted value of the latent construct. Our measurement model allowed us to understand how a certain set of indicators fitted together in order to create variables that represented a latent construct. In doing this, both endogenous (measurement) and exogenous (observed) variables were considered. In this case, the indicators or manifest variables of the latent construct were caused by the latent variable. For the structure of our model, the observed exogenous determined latent variable (here identified as Bargaining), and Bargaining in turn determined the observed endogenous indicators, as in a Multiple Indicators and Multiple Causes Model (MIMIC) (Jöreskog and Goldberger 1975). Moreover, the measurement variables were not continuous, but codified as 0 and 1, and we used the Probit estimations for these dummy variables.

The aim of the specification of the GSEM model was to obtain, as a predicted latent variable, a continuous variable positively correlated to the bargaining power of the woman.

At the second stage, the retirement equation was specified as a Probit model, as follows:

$${R}_{i}^{*}=\alpha {\widehat{B}}_{i}^{*}+{{z}}{^{\prime}}_{ri}{{{\beta}}}_{r}+{u}_{ri}$$
(2)

with = \({R}_{i}\)1 (partner retired) if > \({R}_{i}^{*}\)0, and = \({R}_{i}\)0 if the partner had not retired. \({\widehat{B}}_{i}^{*}\) is a proxy of the woman’s bargaining power obtained at the first stage by estimating, alternatively, three different models whose predicted values were, respectively \({\widehat{B}}_{i}^{S}\) and \({\widehat{B}}_{i}^{L}\). In addition, we assumed that the man’s decision to retire, \({R}_{i}\), depended on the row vector of covariates \({{z}}{^{\prime}}_{ri}\); the error term \({u}_{ri}\) was normally distributed N(0, 1).

The stochastic specification of the bargaining Eq. (1) changes in function of the choice of the dependent variable. If we use the proxy \({\widehat{B}}_{i}^{S}\) indicating the woman’s satisfaction with labour division, the equation (Eq. 1) is specified as a Linear Probability Model (LPM). As an alternative, we can obtain \({\widehat{B}}_{i}^{L}\) as a latent factor proxy of the woman's bargaining power by performing a GSEM-MIMIC procedure (see below, the Section on the “Estimation Results”).

In a further step of our estimation strategy, we used the predicted propensities of the man to retire, obtained by estimating the retirement equation (Eq. 2), to link partners with a different retirement status. Therefore, we linked “treated” and “untreated” women (or men) belonging to couples characterized by the same estimated propensity of the man to retire. In this way, we could compare the time devoted to housework and paid work, respectively, by each subject with the time devoted by his/her counterfactual.

Because in a propensity-score matching procedure the identification of the selection rule (the decision of the male partner to retire) depends on the specification of the selection equation (e.g., Rosenbaum and Rubin 1983), we are confident that the correction for selectivity, provided by the first-stage bargaining estimation, helps us to control the estimation of the propensity scores for the influence of latent factors related to the bargaining process.

In the next section we explain in detail the procedure used to estimate the propensity scores and discuss the properties of the estimated parameters in evaluating the effect of retirement on partners’ working activity. Moreover, we discuss the rationale of a specific approach to evaluate the extent to which selectivity influences the matching results. In particular, we evaluate the sensitivity of matching results to different specifications of the bargaining equation (Eq. 1).

Estimating the Effects of Retirement on Household Members' Paid and Unpaid Work

In order to perform the model estimation, we used a sample of married (or cohabiting) couples composed of n = n0 + n1 observations, where a group of n1 (“treated”) couples had experienced the retirement of the male partner, while n0 couples belonging to a comparison group had not experienced this event (“untreated” couples).

The parameters considered to evaluate the effect of the man’s retirement on both domestic work and paid work were the Average Treatment Effect (ATE), the Average Treatment Effect on Treated (ATT), and the Average Treatment Effect on Untreated (ATU). ATE = E(y1i − y0i) can be defined as the expected variation of the time devoted to housework, y1i, as an effect of the man’s retirement for a subject belonging to a couple randomly chosen from one of the two regimes (male partner retired or not). ATT = E(y1i-y0i) | R = 1 and ATU = E(y1i − y0i) | R = 0 can be defined as the expected variations of the housework time, respectively, for subjects belonging to couples who have experienced the partner’s retirement and subjects belonging to couples who have not.

In applying a matching estimator, we checked whether the condition of “ignorability” (robustness of the estimates with respect to the influence of unobservable factors) held. To this end, we assumed the relative difference between ATT and ATU as a measure of the influence of unobserved heterogeneity due to hidden factors (cf., among others, Xie et al. 2012). Following this line, a Heterogeneity Indicator given by the ratio HI = (ATT–ATU)/SE(ATT–ATU) was adopted. Higher values of this indicator signal that the estimation procedure has not provided a satisfactory reduction of the hidden influence of omitted variables between treated and untreated units (Heterogeneity bias).Footnote 6

The matching algorithm here applied is given by the “Nearest-Neighbor Within a Caliper, Without Replacement” (Guo and Fraser 2015), which combines two traditional matching criteria. The first criterion is given by the application of the Nearest Neighbor Matching, in which a control unit is matched to a treatment case if the difference in the propensity score is the smallest among all possible pairs of propensity scores between treated and control units. The second criterion, Caliper Matching, imposes a further restriction as a tolerance level, τ (the caliper to each matching method), regarding the distance between treated and control units. In particular, we matched treatment and control cases within the caliper or the tolerance level equal to τ = 0.05. The caliper size, as suggested by Rosenbaum and Rubin (1985), must not exceed a value equal to a quarter of the standard deviation of the sample estimated propensity scores \({\upsigma }_{\text{p}}\)(τ  ≤  0.25 σp). In our case, the standard deviation of the sample estimated propensity scores σp was equal to 0.2948. Hence we chose a caliper (0.05) lower than 0.25 σp  = 0. 0737.

In addition, the Balancing Score test on the difference in covariates between treated and untreated subjects after matching also checked the misspecification of the propensity score model. If there remain systematic differences in baseline covariates between treated and untreated subjects after matching, this may be an indication that the propensity score model has not been correctly specified (cf. Austin 2011, among others). The results of the balancing score test are presented in the “Appendix”.

The Data

Data came from the Time Use Survey 2008–2009 provided by Istat (Italian National Institute of Statistics). This data, surveyed using the “diary method”, allowed us to analyse how people allocated their time during a day.Footnote 7 In our analysis, the sample was comprised of 3126 women, married or cohabiting, equitably distributed by area of residence. The age of their male partner ranged from 50 to 65 years (age compatible with eligibility for retirement). The men were either employed or retired. There were 1030 women with a retired partner (33%), while 414 of the women were retired (13%). There were also 281 couples in which both the man and woman were retired (9%).

Explanatory variables used in the model provided demographic and socioeconomic information on the subjects such as the woman’s age (Woman’s age), the man’s age (Man’s age), the woman’s education (Woman’s edu) and the man’s education (Man’s edu) and the man’s health status (Man’s health: 1 = if man suffered from long-term health problems or chronic disease, 0 = otherwise), whether the couple lived in Northern-central Italy or in Southern Italy (Area), whether any children still lived at home (Children: 1 = yes; 0 = otherwise). Other variables addressed the woman’s working status (Woman retired: 1 = yes; 0 = otherwise).

In order to explain the woman’s personality aspects, we used two variables that could be considered as proxies of the influence of the relational context. The first was a dummy variable signalling the frequency of church or place of worship (Religiosity: 1 = if the woman goes to a church or another place of worship every day, various times a week, once a week; 0 = otherwise).Footnote 8 The second was a dummy variable that helped us to understand the subject’s level of physical and mental health (Worried: 1 = if, thinking back over the past three years, the woman had trouble in managing her time; 0 = otherwise). We also included a dummy variable as a measure of the woman’s economic dependency, that assumed the value of 1 if the woman provided for family maintenance to a greater extent (Economic dependency). We also introduced a dummy variable that assumed the value of 1 if the family received paid help in domestic work (Help received: 1 = yes; 0 = otherwise). Another variable related to the bargaining power of the woman was given by her leisure (Woman’s leisure), measured by the time (in minutes) devoted to leisure activities daily.

Finally, as suggested by Stancanelli and Van Soest (2012), we introduced a dummy variable indicating the man’s eligibility for retirement (dummy R_eligibility that took the value of 1 if the man’s age was more or equal to the age of eligibility, namely age 58 in the years 2008–2009).Footnote 9 In addition, we constructed a series of “interaction-variables” given by the difference between the man’s age (if higher than 58) and the “eligible” age 58, multiplied by the dummy R_eligibility.Footnote 10

In Table 1 we present the description of variables used in our analysis. In Table 2, various descriptive statistics regarding these variables are provided.

Table 1 List of variables
Table 2 Summary statistics on variables

Estimation results

In this section we discuss the estimation results of our procedure. In the first step, we provided two alternative estimates of woman’s bargaining power by adopting two different models (see above, Eq. 1). In the second step, we estimated, by Probit, the propensity scores of retirement as specified above in the equations (Eq. 2), including, alternatively, as a correction term for selectivity one of the two proxies of the woman’s bargaining power, obtained as predicted values by the first-step estimates (see Fig. 1 and Table 6). In the third step, we estimated the matching parameters (ATT and ATU, in particular) measuring the effects of the man’s retirement on both the housework and paid-work time of both partners. In doing this, we evaluated the extent to which matching parameters changed between the two groups given, respectively, by the households in which the woman had a higher bargaining power and the household in which the woman had a lower bargaining power. A criterion for classifying the families according to the woman’s bargaining power was also provided.

Fig. 1
figure 1

Path diagram of the GSEM-MIMIC estimation procedure of latent woman’s bargaining power

We report the estimation results of the Bargaining Equation (Eq. 1) obtained by applying a GSEM model in the following path diagram (Fig. 1).

The estimation results of GSEM-MIMIC (Fig. 1) procedure showed that a latent factor (assumed as a proxy of woman’s bargaining power) was inversely related to the dummy signalling if the woman contributed to the production of income to a greater extent than her partner (variable: Economic dependency. The corresponding coefficient’s value was equal to -0.74). On the other hand, the woman’s bargaining power was positively related to the woman’s time availability (variable: leisure). In addition, the GSEM-MIMIC estimation showed that the woman’s latent bargaining power inversely depended on exogenous variables such as education and religiosity, and positively on health status and presence of children in the family. Factors such as worry (concern in managing time spent at work) and religiosity can be considered as related to psychological condition and to the personality of the woman. We observed that bargaining power was inversely related to the woman’s education and that it was higher if the woman’s ability in income production prevailed. This is due to the fact that education and ability in income production were higher for women who worked outside the home. In this case, the reduction of availability of time due to the commitment in paid work may negatively influence the woman’s responsibility in household’s decision making (see above, Bertocchi et al. 2014).

We show in Fig. 2 a histogram chart of the distribution of the woman’s latent bargaining power estimated by GSEM-MIMIC.

Fig. 2
figure 2

Distribution of woman’s latent bargaining power estimated by GSEM-MIMIC

As shown in Fig. 2, the distribution of the latent variable, estimated by GSEM-MIMIC, identifies a mixture of two distributions, characterized by two different central values (Table 3). This led us to think that our sample could be partitioned into two groups characterized by a different average level in the woman’s predicted bargaining power. We then performed a clustering algorithm (Centroid-Linkage) in order to obtain a partition of the sample into two homogeneous groups.

Table 3 Classification of variables by level of woman’s bargaining power

The two clusters, obtained from the partition of the latent variable assumed as a proxy of the weight of woman in household’s decision making, identify the group of women with lower power (Cluster 1) and the group including women with higher power (Cluster 2) respectively. An indirect confirmation of this interpretation was obtained by the classification of the amount of the woman’s leisure time, shown in Table 3: Women belonging to the group denoted as Cluster 1 had less leisure time than women belonging to the group denoted as Cluster 2. Note that, as reported in Table 3, the predicted propensity of the man to retire, as resulting from the estimation of retirement equation (Table 6), was higher for men belonging to families included in Cluster 2, namely, the group in which women had greater weight in decision making.

Tables 4 and 5 show the estimation results of the bargaining equation using the dummy signalling the satisfaction of the woman with intra-household labour division as dependent variable. We adopted the LPM model as an estimator in order to simplify interpretation of the estimates. As a result, we found that the woman’s bargaining power was significantly impacted by the difficulty in managing work time, which in turn influenced the woman’s satisfaction with labour division (dummy: Worried). As previously reported in the GSEM-MIMIC results, the variable Worried was particularly relevant in explaining the weight of women in the decision-making process.

Table 4 Estimation results of woman's bargaining power using linear probability model
Table 5 Estimation results of Woman's Bargaining power using GSEM-MIMIC construct

Estimation of the Effects of Retirement on Partners’ Working Time

In the following Table 6 we report the Probit estimation results of the man’s decision to retire. We proposed two different specifications of the Probit equation, alternatively including in the set of covariates as a correction term for selectivity, the predicted values, \(\hat{B}_{i}^{L}\), of the woman’s latent bargaining power (estimated by GSEM-MIMIC) and the predicted values, \(\hat{B}_{i}^{S}\), of the woman’s satisfaction with the division of the housework with her partner.

Table 6 Estimation results of retirement equation

Note that, when performing the two alternative Retirement-Probit regressions, no relevant differences occurred in the estimated coefficients except for the coefficient of the correction term of \(\hat{B}_{i}^{L}\) and \(\hat{B}_{i}^{S}\), which was found to be higher when using \(\hat{B}_{i}^{S}\) as a correction variable. In general, the influence of covariates such as the retirement of the woman, the man’s eligibility to retire, or the geographical area of residence provided the same results in both models. The man’s propensity to retire was higher in the North-Centre, and was positively related to retirement of the female partner, while it was negatively related to the help received in terms of domestic work. AIC and BIC statistics indicated a preference for the model adopting latent bargaining, \(\hat{B}_{i}^{L}\), as correction term.

After discussing the results of the propensity scores, we focused on the estimation of the effects of the man’s retirement on the intra-household allocation of work, using a matching procedure. In Table 7, we show the estimates of the matching parameters of the full sample. In addition, we evaluated the effect of retirement separately for the two subsamples obtained based on the different level of the woman’s bargaining power (Table 8).

Table 7 Estimated effects of the man’s retirement on the partners' division of labour (minutes in a day)—Full sample, comparison between different correction methods
Table 8 Estimated effects (GSEM-MIMIC corrected) of the man’s retirement on the partners' division of labour (minutes in a day)—comparison between full sample and subsamples of women with low bargaining power and high bargaining power, respectively

Estimation results suggested that the effect of the man’s retirement involved a partial reallocation of the intra-household housework time in the woman’s favour, even if the overall commitment of partners in domestic work remained unbalanced and unfavourable to the woman.

In more detail, the estimation results for the full sample showed a modest reduction in the woman’s housework time as an effect of the male partner’s retirement. In Table 7 we show a comparison between the parameters measuring the effect of retirement obtained by adopting the two different correction methods suggested here. The ATT parameter measuring the effect on the woman’s domestic work was equal to − 22.6 min a day adopting the GSEM-MIMIC procedure to correct the estimates for selectivity of bargaining. Similar results were obtained by applying a correction method based on the woman’s satisfaction LPM estimate (-17.95). Conversely, the man’s domestic work increased by 96.75 min using GSEM-MIMIC, and by 97.35 min applying as a correction term the LPM estimate of Satisfaction. The standardized differences between ATT and ATU (HI indicator) showed values inside the traditional critical values of the normal standard distribution. In general, the HI indicator performed well when we applied the GSEM-MIMIC correction, as the results reported in Table 7 show.

In particular, Table 8 reports the estimated matching parameters from the subsamples of women with low and high bargaining power, respectively, as previously obtained by applying a clustering partition based on GSEM-predicted bargaining power. We observeed a major reduction in domestic commitment for women with high bargaining power compared to women with low bargaining power (a reduction of 21 min and 16 min, respectively, as an estimated ATT). Contextually, the man’s housework time increased by an average of 97 min per day (103 min for women with high bargaining power).

As a confirmation of the permanence of a marked gendered gap in housework division, we found that the average time devoted daily to domestic work by a woman after the man’s retirement was approximately 345 min, while the mean score of time dedicated by the man was approximately 182 min (Table 9).

Table 9 Estimates of the average housework and paid-work time of the woman and man (minutes in a day) after the man’s retirement

However, the gender gap is shown differently if we stratify the analysis as a function of the woman’s level of bargaining power. The commitment to domestic work of women whose partner was retired (AT) and who had less bargaining power (342 min in a day) was lower than the commitment of women with more bargaining power (349 min). A superficial evaluation of this result could lead to inconsistent conclusions about the influence of bargaining on intra-household labour division. Namely, when we considered the total amount of domestic and paid work engaged in by women whose partner was retired, we obtained a total of 433.87 min (341.98 + 100.66) for women with low bargaining power, and a total of 400.40 min (349.83 + 50.57) for women with high bargaining power. Similarly, the total amount of domestic and market work for women whose partner was not retired stands at 460.86 min (357.92 + 102.94) for women with low bargaining power, and at 430.33 min (370.55 + 59.79) for women with high bargaining power.

These results clarify that women with higher bargaining power were advantaged in terms of total work time compared to other women by about 36 min a day; this differential showed few variations related to whether the male partner was retired (− 42 min a day) or not (− 31 min a day). We observed that this gap in total amount of work time depended specifically on the differential in woman’s bargaining power, rather than on the effect of the man’s retirement.

Table 10 also shows the results of the computation of a relative measure of reallocation of domestic work, paid work, and total work (domestic plus paid work) between the woman and man in the household as an effect of the man’s retirement. For this purpose we applied the formula of Sørensen and McLanahan (1987), measuring, in this analysis, the relative contribution of each partner to housework, (man’s hours–woman’s hours)/(man’s hours + woman’s hours)%.Footnote 11 The resulting scores can range from − 1 to + 1, with a score of zero indicating an equal division of work. A negative value of this ratio signalled that the woman’s contribution in terms of work time was higher than the man’s. The opposite occurred if this index showed a positive value. We can see how the Sørensen–McLanahan index generally indicated a higher relative contribution in paid-work time by the women with lower bargaining power for couples in which the male partner was retired (− 93.9% vs. − 87.9%). Conversely, the contribution in paid-work time was higher for women with higher bargaining power, if the man was not retired (43.8% vs. 63.3%). The man’s retirement produced a decrease in the woman’s contribution to domestic work which was more marked for women with higher bargaining power (65.5–32.0 = 33.5) than for the woman with low bargaining power (54.2 − 27.0 = 27.2).

Table 10 Relative contribution of the woman to domestic, paid and total work in the household

As seen in the computation of the Sørensen-McLanahan indexes (Table 10), the gap in the woman’s relative contribution to total work between women whose partner was retired, and women whose partner was not, stood at 26–27%, with no difference for women with high bargaining power and women with low bargaining power. This implies that the reallocation of work between partners, evaluated in relative terms, penalizes the woman, whatever the level of her bargaining power.

Final Observations and Remarks

Our estimation results confirm what other studies have found, namely that a man’s retirement does not imply a significant reduction in a woman’s domestic work. These studies explain the gendered unequal division of housework between partners as a consequence of the influence of psychological and cultural factors, such as the gender attitude of partners and the higher bargaining power of the man within the couple (e.g., Ciani 2016). However, cultural factors and aspects of personality may involve an endogenous relationship between retirement decision and work time allocation in the household as a result of a bargaining process between partners. The contribution of our study is to provide a solution to the problem of selectivity effect due to the endogeneity of bargaining.

To this aim, in our analysis we jointly estimated the man’s retirement equation and the woman’s bargaining power equation. This approach allowed us to obtain a proxy of woman’s decision power and to evaluate its impact on the man’s retirement decision. Two different methods were adopted to estimate the bargaining equation (LPM and GSEM-MIMIC). Adopting a LPM model, we assumed the woman’s satisfaction with the division of labour as a dependent variable proxy of bargaining power. Using GSEM-MIMIC we obtained a latent construct, based on behavioural and economic factors, as a proxy of bargaining power. In both cases, the predicted woman’s bargaining power was found to be a valid predictor, positively correlated, with her male partner’s propensity to retire, as it seen in the estimation of a Probit-Retirement equation at the second stage.

The comparison of the results of two different estimation methods shows that the GSEM-MIMIC procedure seemingly performed better than LPM. By applying GSEM-MIMIC, the identification of significant relationships was provided, such as the positive influence of leisure time on the woman’s bargaining power. In addition, the result of the latent construct allowed us to classify women in two different groups according to their estimated bargaining power. The comparison between families in which the woman had high decision power and families in which the woman had low decision power allowed us to evaluate the effect of bargaining in labour division after the man’s retirement.

The empirical relation, here analysed, between bargaining and retirement supported the hypothesis that a man from a couple where the woman has high bargaining power in household decisions is generally encouraged to leave work in order to devote more time to family commitments.

We also found that women with higher bargaining power are generally “less stressed” (they have no problems with work time management), they have better health and more leisure time than women with low bargaining power. In addition, the time devoted to paid work by women with high bargaining power is less than the paid-work time of women with low bargaining power (Table 9). We interpret these findings to mean that women with higher bargaining power generally live in families characterized by a higher socio-economic status. In this context, it is possible that the woman, less constrained by the need to contribute to the production of income by working on the market, aims to increase her weight in terms of the responsibility of managing the home and family affairs. In short, a sort of spontaneous specialization of roles within the household occurs (such as in Bertocchi et al. 2014). However, the bargaining force of each partner is not necessarily associated with the traditional role of breadwinner, whose prevalent commitment is produced in the labour market. We think that our identification strategy of women's bargaining power is, in part, compatible with the Flinn, Todd and Zhang model (2017), in which economic dependency and personality aspects play a significant role, and, in part, is compatible with the collective approach, which uses leisure time to identify the weight of woman in decision making.

As a consequence of the higher weight of the woman in decision making, in a family characterized by a medium–high standard of living, the male partner may decide to retire early if this decision is allowed by his eligibility status, without this leading to a significant decrease in the family’s standard of living due a reduction in family income.Footnote 12

The matching procedure, adopted to compare the allocation of time between families in which the male partner is retired and families in which he is not, is here controlled for selectivity bias due to unobserved heterogeneity. We hypothesize that a relevant component of heterogeneity may be given by the misspecification of the effect of bargaining on the retirement decision. This conviction is supported by the estimation results of the man’s Retirement Equation, where the estimated coefficient of bargaining is relatively high and significant (Table 6). The identification and estimation of the woman’s bargaining power at the first stage allows us to control the estimation of the propensity score to retire for selectivity, and to stratify the analysis for two different levels of woman’s bargaining power. Unlike the other studies regarding the identification of the general effect of retirement (see Ciani 2016), the specific aim of our study is the evaluation of the influence of bargaining on the effect of retirement. This implies that the identification of the bargaining process is a necessary step for us.

As a result of this analysis, the man's retirement did not lead to a significant reallocation of the woman’s working hours between paid work and domestic work. The latter reduced slightly, but the gap with the man in the total time dedicated to work (penalizing for women) increased after the male partner’s retirement, even though the time devoted to domestic work by the man increased.

In particular, the increase in the time devoted to domestic activities by the man was between an hour and an hour and a half a day (depending on the correction method adopted). We should take into account, however, that the time devoted to housework, according to the criterion used in our analysis, also includes gardening activities and other home-production activities, classified by analysts as "semi-leisure activities" (e.g. Stancanelli and Van Soest 2012). In a further extension of our analysis it would be interesting to separate "semi-leisure" activities from "core-housework" activities, and investigate the extent to which the male partner increases his commitment to specific core-domestic activities after his retirement.