1 Introduction

Statistics show that unadjusted male-female earnings differentials still remain significant, between 15 and 23%, and have remained surprisingly stable in many countries over recent decades (OECD 2017). In order to understand remaining gender wage gaps, one of the important questions is whether having children affects earnings. A large body of empirical evidence exists on women showing a negative association between earnings and children, and suggesting that women are falling behind because of children (Adda et al. 2017; Bertrand et al. 2010; Ejrnæs and Kunze 2013; Anderson et al. 2002; Joshi et al. 1999; Waldfogel 1998; Datta Gupta and Smith 2002). Existing studies on men suggest that the presence of children is positively associated with men’s earnings. However, evidence on more direct explanations of this finding is still scarce. There exists no consensus on the reasons for the “fatherhood wage premium” and interpretations diverge on whether this reflects gendered household specialization, positive discrimination or something else. Alternatively, factors that positively affect the likelihood to become a father may also be positively related to earnings, suggesting that fathers may be a (positively) selected group. Evidence so far however suggests negative selection which is not consistent with a fatherhood premium (Lundberg and Rose 2000, 2002).Footnote 1

Part of the controversy regarding the fatherhood premium is that it is complicated to interpret correlations of children and earnings as a (direct) effect of children. This is because parenthood might be endogenous with respect to earnings and correlated with unobserved factors. The most common approach in the literature has been to estimate the mean effect of the arrival of children from Mincerian earnings equations exploiting cross-sectional variation, or the comparison of individual outcomes post versus pre-birth (fixed effects estimation). Instrumental variables have been used to estimate the earnings effect of the increase in number of children from two to three (Angrist and Evans 1998), teenage pregnancy (Hotz et al. 2005), and delayed motherhood (Miller 2011). A caveat of these estimates is that effects at particular parity may not be generalizable to other parities. Nor does it allow us to test for non-linearity in the effect of children.

The goal of this study is to estimate the effect of children on male earnings and earnings inequality over the male life cycle. We exploit data on brothers and twins, in order to explore another channel of non-random selection on unobserved family fixed factors. Such unobserved family fixed factors may capture several characteristics that are common to brothers such as neighborhood and socioeconomic status, or parental investment into cognitive development during childhood.Footnote 2 Our data offer several advantages compared to the literature to investigate this channel. First, we use panel register data on earnings and fertility for several cohorts of men which is rarely available to researchers and facilitate a clean study of the male earnings effects of children.Footnote 3 Second, we observe individual earnings trajectories during the period 1975 to 2005, and link male siblings and twin brothers. Since we have population data this results in large samples.Footnote 4 Third, our study does not suffer from attenuation bias problems arising from measurement error in survey data (Bound and Solon 1999) on earnings and fertility. Fourth, we use data on Norway that allow us to investigate whether family background affects outcomes in a country with a high degree of equality.Footnote 5 We use these data to estimate flexible Mincerian earnings regressions that control for family fixed effects.

We find that OLS estimates which use the cross-sectional comparison of men who have a child and men who remain childless is plagued by selection effects into fatherhood and overestimates the (positive) effect of having children on male earnings, which has been found in existing studies. We show that the earnings profiles of men who have a child over the lifetime and childless men differ already before the arrival of the first child. Thus, not accounting for these differences in trends makes (all) childless men not a suitable control group. Our estimation approach accounts for these differences in a flexible way, and in addition through family fixed effects. Our coefficient estimations exploit the variation in differences across siblings within the same family throughout the lifecycle, and essentially exploit the variation from sibling couples where one brother has children and the other does not. When we compare earnings profiles of twin brothers we find that they are on equal earnings growth paths and there is no significant effect of children on earnings. Robustness tests show that the general results still hold when we change the control group to childless men who will eventually get married. We also show that the post-birth effect only to a minor extent captures a marriage effect over and above the arrival of a child effect. Controls for being married explain only a relatively small part of the effect of the arrival of a child, 28%. We also show that the shift in earnings post births is entirely driven by the event of a first child. Regarding inequality in earnings among men, this result highlights that it is not the fact of having children that is driving inequality over the life cycle, but selection into the group of childless and never married men, versus fathers or fathers-to-be. These findings for Norway, a country with a high degree of income equality may suggest further interest in investigating variation across countries.

Our results on men and children add to the policy debate on family-work balance, which is no longer limited to women and work. Politicians in some countries set incentives for fathers to stay home through paternity leave policies. As survey data shows, men have increased the weight they place on family values (see e.g. Goldin 2006) and family time. A new and growing academic literature sets the focus on understanding various aspects of the interaction between fathers and children such as effects on peer behavior (Dahl et al. 2014) and child outcomes (Cools et al. 2015).Footnote 6 The group of men and boys has also gained interest since it seems that more recently it is men who are falling behind (see Autor and Wasserman 2013). We also add to recent more descriptive studies on Scandinavia comparing relative earnings trajectories of fathers and mothers, and couples, around the event of childbirth (Angelov et al. 2016; Kleven et al. 2018). Our extensions are that we focus on sibling brothers that are followed over time in the labor market, and that we broaden the view to inequality over the lifecycle among men.

The remainder of the paper is organized as follows. Section 2 provides an overview of the economic explanations as to why having children affects male earnings. Section 3 presents the data and summary statistics. Section 4 derives the earnings equation. Section 5 presents the empirical results, followed by a number of robustness tests and the discussion. Section 6 concludes.

2 Men’s earnings and having children

Given that husbands’ and wives’ labor market outcomes are interdependent, we would expect the reallocation of mothers’ time and effort after childbirth from market to home to be accompanied by some labor market response among fathers. Hence, two explanations would motivate a causal effect of children on male earnings. If mothers specialize more in home production then this can lead to an increased specialization of fathers in market production (Becker 1985). This is particularly likely if mothers also take over other household activities previously conducted by the father in order to gain economies of scale. The positive earnings effect of fathers can then be driven by increased effort, or by accumulation of human capital over time. For the U.S., for example, studies have shown that part of the child premium is related to increased hours of work (Lundberg and Rose 2002; Pencavel 1986). An earnings increase can also be caused by preferential treatment by employers of fathers, or positive discrimination. Employers may view having children for men as a signal of more conservative values, reliability and higher productivity and are therefore willing to pay a premium (Correll et al. 2007).

Another potential explanation is that earnings advantages of fathers compared to childless men may capture decisions made earlier in life related to the plan to become a father, or in other words that the group of those who become fathers is a selected group. This explanation suggests that the correlation between children and earnings is due to omitted variable bias. If men expect to make gains in the labor market after child birth, then it may be optimal for them to invest more into their career even before they become fathers. This may, for example, lead to self-selection into higher-track occupations (Gould 2008).

A related, but different, question is whether cohabitation or marriage explains the relatively higher earnings growth of fathers-at-some-point, even before childbirth (Peters and Siow 2002). It is related because in many countries marriage and childbirth occur in close proximity and hence the effects are difficult to distinguish. However, studies of the marriage premium for men provide little insight into the effect of having children; either the effect of having children is not separately reported (Korenman and Neumark 1991; Gray 1997), or is reported to be insignificant (Loh 1996).

One hypothesis is that marriage itself leads to gender-specific household specialization, whereby men specialize more in market work and women in home production. An alternative hypothesis is that men with relatively high productivity-related skills are more likely to marry. A large group of international studies has shown that married men earn between 10 and 40 per cent more than comparable single men (Korenman and Neumark 1991; Ginther et al. 2001). However, the precise nature of the effects remains unclear. Time-use data offers little support for the specialization hypothesis (Hersch and Stratton 2000). Time-use data suggest that gender-specific household specialization is not related to cohabitation or marriage, but rather to the presence of children and particularly to spending more time on child care (Dribe and Stanfors 2009). In a recent study, Ludwig and Brüderl (2018) show that selection is driving the marriage premium.

Evidence on the effect of having children on men’s earnings, and on earnings inequality among men, is scarce in the economics literature but there is wider body of literature in sociology. Studies rely either on cross-sectional data and ordinary least squares estimation, or panel data and individual fixed effects estimates. Panel data studies exploit the before-after comparison and, hence, account for individual heterogeneity conditional on a sample of fathers-at-some-point. These studies find estimates of the effect of children on male earnings in the range between 3 and 10% per year, which varies somewhat depending on the country and model specification (see Lundberg and Rose 2000, 2002; Pencavel 1986; Waldfogel 1998; Killewald 2012; Glauber 2008; Hodges and Budig 2010 for the U.S., Blomquist and Hansson-Brusewitz 1990 for Sweden, and van Soest et al. 1990 for the Netherlands).Footnote 7 Datta Gupta et al. (2007) reported fixed-effects estimates of the effect of children ranging between 0.3 and 1.2% depending on age for Denmark. Simonsen and Skipper (2010) exploit Danish data on a cross-sectional sample of twins in 2006 and report a significant wage premium for men. They can compare fathers only to the group of never and not yet fathers, which induces likely a downward bias. For Norway, Petersen et al. (2014) estimate a 1% wage premium per child, controlling for occupation fixed effects in a sample restricted to white collar workers in the private sector. Only a limited number of studies have looked at both the effect of having children and the effect of marriage (Hodges and Budig 2010; Hundley 2000; Loughran and Zissimopoulos 2009; Lundberg and Rose 2002; Petersen et al. 2011, 2014).

2.1 The fatherhood premium and policy context

It has been a long-standing policy goal in Norway to achieve high gender equality and help families to combine work and having children. The main policies to achieve these goals have been anti-discrimination laws introduced during the 1970s, parental leave and child care. Parental leave was first introduced in the 1970s, and a major reform took place in 1993 when leave was extended to 42 weeks at full compensation but capped, while four weeks were reserved to the father (paternity leave). Prior to 1993, not more than 3% of fathers took leave, but almost 80% of mothers took the maximum amount.Footnote 8 Since 1993, the proportion of fathers taking up leave has steadily increased from an initial 30% to almost 60% in 1998. If maternity leave is longer this may increase the opportunity for fathers to invest more in market production. On the other hand, paternity leave may reduce time in market work after childbirth. During the 1970s, publicly funded child care programs were expanded for 3 to 6 year old. Between 2002 and 2008, child care programs were also expanded to full coverage for 1 to 2 year old children. Overall, we expect to find a relatively smaller fatherhood premiums in countries like Norway, where the institutional environment promotes gender equality and equality overall and attempts to shift cultural understandings of fatherhood (Boeckmann and Budig 2013). Furthermore, we may also expect to much less degree non-random selection into fatherhood.

Compared to other countries, we would expect that the fatherhood effect in Norway is relatively small because of, for example, the relatively high wage compression and high female labor force participation.Footnote 9 Related to this hypothesis, previous research (Blau and Kahn 1996) has shown that the gender wage gap increases with wage inequality, and vice versa. However, the fatherhood effect may be relatively increased through factors that increase (gender specific) household specialization, or gender attitudes towards traditional role models (Fortin 2005). One such factor could be part-time work of women. Overtime work of fathers when the children are very young could be another factor. Overtime work however seems to play a minor role according to National statistics showing that only 20% work overtime. Overtime is usually unpaid and in many sectors restricted. According to time use data gender specialization in the household among Norwegian couples is low.Footnote 10 Following the previous literature, we take an individual approach to study earnings of men, and neglect potentially endogenous household choices. Likewise most data sets, we do not have information on overtime hours of work.

3 Data description and summary statistics

Our empirical analysis exploits panel data on the population of male siblings and male twins born in the period 1955 to 1965 and followed from 1975 until 2005. They are extracted from Norwegian registry data. We focus on selected birth cohorts to ensure that we can observe the complete individual earnings and employment histories from first entry into the labor market, as well as complete fertility histories. The Norwegian multi-generational birth registry was used to match sibling and twin brothers to each other and their offspring. We include only the first- and second-born son within a family with the same mother and father.Footnote 11 Fraternal and monozygotic twins are included but cannot be distinguished in the data.Footnote 12

Drawing on a data set starting in 1967, we generate work and earnings histories from first entry into the labor market. This ensures that we measure entry earnings accurately for every individual in our sample. The main outcome variable is the logarithm of real annual earnings that we use to measure earnings from work.Footnote 13 We deflate earnings by the Norwegian consumer price index (1998 = 100). Earnings are excluded for workers younger than 20 years of age, as they may still be in education. We also exclude observations with very low earnings (earnings less than the annually adjusted basic income according to the social security system). Years of experience are measured as the cumulative number of years with earnings above the yearly basic income. We generate two variables for years of experience. One that counts overall years of employment since first entry into the labor market (work experience), and one that counts years of employment from the year of having a child (work experience post birth). We start with the event of first childbirth. We merge the variables age and years of education to the data. We generate a variable to control in the earnings regressions for the birth order rank of each of the brothers that we compare.Footnote 14

From the birth registry, we obtain the complete record of the timing of offspring and the complete number of offspring for every man, counted up to 2005. The birth registry contains the information from the birth certificates where the mother and father are reported. In the estimations, we first focus on the year of the first childbirth and earnings before and after this year, the latter being referred to as the ‘post-birth period’. For supplementary results, we use also the birth year of the second and third child and count the corresponding years of work experience post-second and post-third birth. Hence, we can test for any non-linearity of the post-birth earnings effects of the number of children.

The main treatment group is the group fathers-at-some-point, which includes all men for whom we observe at least one child in the birth registry at some point in the observation period.Footnote 15 The group of men without any children in the birth registry are denoted as childless men. In fact, we include in the control group only men in this group who never have children across the entire observation window (life cycle). Approximately 20% remain childless by the year 2005, according to the data. Among those born in 1955 the proportion is slightly lower, 16.6%. To a small extent this is because only the oldest cohort is followed until completion of fertility, or here age 50.Footnote 16 Given the very large and long panel, the bias of our estimation results due to relatively few men that are included in the group of childless men, even though they became fathers for first time rather late in life after 2005, is negligible. Looking at the distribution of number of children with our data, we note that 19 to 21% never have children, and most men, approximately a third, have two children.

We use information on marital status to restrict the comparison group of childless men to men who are childless but married-at-some-point.Footnote 17 Childless men may be a very heterogeneous group, and childless men married-at-some-point may be more similar to fathers-at-some-point at the beginning of their working career. Some of those who married may have planned to become fathers but for some reason did not realize such a plan.

Data on marital status is available from the Norwegian registry for the period 1986 to 2005. We use this information to construct an indicator for being married. We define the indicator variable married as equal to one if a man is reported as married, divorced or separated, and zero otherwise. The variable married-at-some-point is then the indicator variable equal to 1 if in any period in our observation window a man has ever been married, and zero otherwise. Information on being married and the timing relative to first childbirth is useful in order to disentangle whether earnings increases are related to children or marriage. For the group fathers-at-some-point, we can control for whether the man is married and has a child. For childless married-at-some-point men, we can control for potential changes in earnings after the time of marriage. One should note that it is typical in Norway to marry after becoming a father. This is shown in Fig. 1. Approximately 3 out of 4 couples get married after childbirth.

Fig. 1
figure 1

Timing of marriage and first childbirth, own calculations using the sample of all brothers followed from 1986 to 2005

Table 1 reports the sample means and standard deviations for the main variables separately for fathers-at-some-point, the broad comparison group of childless men and the restricted comparison group of childless men married-at-some-point. We pool all observations across the entire observation period and report statistics for the sample of brothers and twins separately. The unconditional difference in the mean of logarithmic earnings between fathers and childless men is 17% for the sample of brothers and 15% for the sample of twins separately. Compared to childless men, men with children acquire slightly more years of education, and work less. Differences become smaller when we compare fathers to childless men married-at-some-point. Men entered fatherhood on average at age 28 in 1988. The findings are similar for the samples of brothers and twins. Reported means do suggest greater similarity of the more restricted group of childless men to the group of fathers-at-some-point.

Table 1 Descriptive statistics: Means and standard deviations, by groups

Figure 2 describes the earnings paths for fathers-at-some-point around childbirth in comparison to childless men. We use the samples from Table 1 columns 1 and 2. For illustration, earnings in this figure are predicted for a man who continuously works and becomes a father in year 10 (Father at-some-point OLS). For comparison, the earnings path for a continuously working childless man is plotted across work experience (Childless men OLS). The graph is based on the coefficients from a flexible log earnings regression estimated by simple OLS to which we will return in more detail (results reported in Table 3, column 1). The figure highlights two descriptive findings. First, childless men and fathers-at-some-point differ in the earnings paths from early on in their working career and the difference increases with work experience starting from close to zero at entry. Hence, we observe earnings differences even before men actually ‘reveal’ having children, or not. Second, for men in the father-at-some-point group, earnings increase at the time of having their first child. This pattern is also revealed when we plot earnings against age before and after first entry into fatherhood selecting on specific ages, e.g. the mean age of 27, when entering fatherhood (graphs available on request).

Fig. 2
figure 2

Predicted earnings profiles for a man who is continuously working and is entering fatherhood in year 10. Childless men are continuously working from first entry. Predictions use the estimates reported in Table 2 column 1. Years of education is set equal to zero

4 Empirical framework

4.1 The selection problem and the model

We can derive the flexible earnings equation that we are estimating in a treatment framework.Footnote 18 In our application, we observe that fathers are on different earnings paths pre-treatment (i.e. from the beginning of their career before entry into fatherhood) in comparison to childless (across the entire life cycle) men. We allow for selection on unobservables through a component that is common to brothers in the earnings equation. More generally, we estimate a more flexible Mincerian earnings equation (Mincer 1974) and allow for non-linear post-childbirth earnings effects.

Let \({ln}\,{y}_{ift}\) be observed logarithmic earnings of individual i in period t, and \({ln}\,{y}_{ift}^{* }\) the logarithmic earnings in the absence of children. (We add a subscript for family \(f\) to which we return later.) The indicator variable \({a}_{it}\) equals one from the time a person \(i\) first enters fatherhood which is in calendar year \(t\) (treated), and zero otherwise (untreated). The parameter \(\gamma\) is the effect of fatherhood, which is an overall mean.Footnote 19 The period of childbirth is denoted as k. Then we can write:

$$\begin{array}{l}ln{y}_{ift}=ln{y}_{ift}^{* }+{a}_{it}\gamma ,\qquad t> k\\ ln{y}_{ift}=ln{y}_{ift}^{* },\qquad t<=k\end{array}$$
(1)

The difference in mean post-birth earnings of fathers and non-fathers can be written as:

$$\begin{array}{ccc}&E[ln{y}_{ift}| {a}_{it}=1]-E[ln{y}_{ift}| {a}_{it}=0]&\\ &=E[\gamma ]+\{E[ln{y}_{ift}^{* }| {a}_{it}=1]-E[ln{y}_{ift}^{* }| {a}_{it}=0]\},&\end{array}$$
(2)

The expression in parenthesis is the selection bias, which is present if the assignment to fatherhood is not random.Footnote 20

Suppose \(ln{y}_{it}^{* }\) is a linear function of a set of observed characteristics \({X}_{it}\), weighted by the parameter vector \({\beta }^{* }\), and unobserved characteristics \({\epsilon }_{ift}\).

$$ln{y}_{ift}^{* }={X}_{it}{\beta }^{* }+{\epsilon }_{ift}$$
(3)

Then observed earnings can be written as

$$ln{y}_{ift}={X}_{it}{\beta }^{* }+{a}_{it}\gamma +{\epsilon }_{ift}$$
(4)

In the empirical application the vector X contains a constant and controls for years of education and experience (squared) counted since entry into the labor market. We assume that \(E({\epsilon }_{ift}{X}_{it})=0\) for all \(i\) and \(t\).

The decision to become a father can be quite generally written in terms of an index-function framework, where the index, \(fathe{r}_{it}\), is a function of both observed, \({Z}_{it}\), and unobserved, \({u}_{it}\), characteristics:

$$fathe{r}_{it}={Z}_{it}\alpha +{u}_{it}$$
(5)

Then, the ith individual’s fatherhood status is

$${a}_{it}=1\quad{iff}\quad fathe{r}_{it}\,>\,0$$
(6)
$$\qquad = 0\qquad otherwise$$
(7)

Note that entry into fatherhood can take place at different times and therefore the variable is indexed by a subscript \(t\). We assume \({u}_{it}\) is iid across individuals and distributed independently of \({Z}_{it}\). This means that the dependence between \({\epsilon }_{ift}\) and \({a}_{it}\) can arise because of dependence between \({Z}_{it}\) and \({\epsilon }_{ift}\), i.e. selection on observables, or dependence between \({\epsilon }_{ift}\) and \({u}_{it}\), selection on unobservables. Men who become fathers at some point may have already invested previously more into their careers. In this case, omitted variable bias may arise.Footnote 21

We present an estimate using a linear control function to address selection bias on observable characteristics. We assume \(E[{\gamma }_{it}| {a}_{it}=1,{X}_{it},{Z}_{it}]=\gamma\). Inserting a linear version of \(E(\epsilon | X,Z)\)Footnote 22 in Eq. (4) yields

$$ln{y}_{ift}={a}_{it}\gamma +{C}_{it}{\delta }^{* }+{\tilde{\epsilon }}_{ift}$$
(8)

where \({C}_{it}\) denotes the vector of all variables included in either \({X}_{it}\) or the vector of instruments \({Z}_{it}\) and \({\tilde{\epsilon }}_{it}={\epsilon }_{ift}-E(\epsilon | {a}_{i},{C}_{i})={\epsilon }_{ift}-E({\epsilon }_{ift}| {C}_{i})\). \({\delta }^{* }\) is a parameter vector. In our application \({Z}_{it}\) will be the vector:

$${Z}_{it}=Z(father-as{p}_{i},e{x}_{it}* father-as{p}_{i},e{x}_{it}^{2}* father-as{p}_{i}).$$

where \(father-as{p}_{i}\) is an indicator variable equal to 1 for individual \(i\) if becoming a father ever during the life time, and zero if remaining childless. We also include the indicator variable interacted with years of work experience, and work experience squared. If the indicator is equal to 1 then all of the instrumental variables in Z are switched on. This is used to instrument whether the father has a child, and is to be independent of the father’s wage conditional on a family fixed effect. Crucial for employing this strategy is that we observe complete earnings histories from first entry into the labor market, and we can distinguish fathers-at-some point from childless men.Footnote 23

Replacing \(C\) leads to:

$$ln{y}_{ift}={a}_{it}\gamma +{X}_{it}\beta +{Z}_{it}\delta +{\tilde{\epsilon }}_{ift}$$
(9)

To allow the post-childbirth effect to be non-linear, we use a non-linear function \(\gamma ()\) in the indicator \(1(a=1)\)—which is equal to 1 after childbirth and zero otherwise—and the years of post-birth work experience, \(e{x}_{it}^{post}\).

$$\gamma ({a}_{it},e{x}_{it}^{post})={\gamma }_{1}1{(a=1)}_{it}+{\gamma }_{2}1{(a=1)}_{it}* {(e{x}^{post})}_{it}+{\gamma }_{3}1{(a=1)}_{it}* {({e{x}^{post}}^{2})}_{it}.$$

The first term can be interpreted as a pure shift parameter that is an estimate of the change in earnings from the time of first entry into fatherhood, whilst the second and third term capture the curvature of earnings post entry into fatherhood. The non-linearity may capture time varying costs in the age of the child, or effects through further children. Later we test whether the second or third childbirth has a significantly different effect on earnings profiles than the first childbirth. If the estimates of \({\gamma }_{2}\) and \({\gamma }_{3}\) are not significant then there is only a constant shift of earnings post entry into fatherhood.

Substitution of the function for the flexible post-childbirth effect into the earnings equation gives then the earnings regression that we are going to estimate:

$$ln{y}_{ift}=\gamma ({a}_{it},e{x}_{it}^{post})+{X}_{it}\beta +{Z}_{it}\delta +{\tilde{\epsilon }}_{ift}.$$
(10)

The error term contains three components:

$${\tilde{\epsilon }}_{ift}={\nu }_{f}+{\mu }_{if}+{w}_{ift},$$
(11)

these are an unobserved family fixed component, \({\nu }_{f}\), capturing genetically inherited abilityFootnote 24; an individual varying and family-varying unobserved component, \({\mu }_{if}\), capturing unobserved ability and genetic traits that vary across individuals and families; and \({w}_{ift}\) capturing other idiosyncratic variation (or luck). In the empirical estimation, we control for birth order effects, capturing the fact that first or second born brothers within a family differ in birth order rank, as well as time fixed effects capturing macroeconomic shocks. (These are not shown in Eq. (10)). We assume that \(E({w}_{ift}{X}_{it})=0\) and \(E({w}_{ift}{Z}_{it})=0\) for all \(i\), \(f\) and \(t\).Footnote 25

4.2 The covariance estimator

The key parameter is the mean effect of having children, \(\gamma\), which is a vector of three parameters: the shift right after childbirth, and the squared polynomial term in years of work experience post childbirth. Most studies on the effect of fatherhood have estimated a more restrictive log earnings equation by simple OLS or fixed effects (FE), where \({Z}_{it}\) is excluded, no unobserved family fixed effects are considered and the effect of children is only a shift parameter in earnings after first childbirth. In addition, these studies only use the group of fathers-at-some-point.

We employ the covariance estimator (CV) (Bound and Solon 1999) which applies OLS to the regression of the between-siblings differences in log earnings on the between-siblings differences in the children variables, holding other between-sibling differences constant. This is an alternative to controlling for family fixed effects in the main earnings regression in levels, which is taking into account that we have only two siblings within each family. This approach allows us to use variation from the entire sample of men including childless men, and the conditional cross-sectional variation for identification.

This approach attempts to address selection problems due to unobservables, in addition to selection on observables as discussed before. If we write down the full regression (Eq. 10) for the first born brother in family \({f}{^{\prime} }\) and the second brother in family \({f}{^{\prime} }\) and then subtract the latter from the former we can derive the regression in differences between brothers. Note that we form the between-sibling difference always by subtracting the variable of the second-born brother (indicated by the subscript 2) in family \({f}{^{\prime} }\) from the variable of the first-born brother (indicated by the subscript 1) in family \({f}{^{\prime} }\). If we assume \({\mu }_{1{f}{^{\prime} }}\)\(=\)\({\mu }_{2{f}{^{\prime} }}\)Footnote 26, then the transformed regression can be written as:

$$\begin{array}{ccc}(ln{y}_{1{f}{^{\prime} }t}-ln{y}_{2{f}{^{\prime} }t}) = {\gamma }_{1}(1{(a=1)}_{1t}-1{(a=1)}_{2t})\\ + {\gamma }_{2}(1{(a=1)}_{1t}* {(e{x}^{post})}_{1t}-1{(a=1)}_{2t}* {(e{x}^{post})}_{2t})\\ + {\gamma }_{3}(1{(a=1)}_{1t}* {({e{x}^{post}}^{2})}_{1t}-1{(a=1)}_{2t}* {({e{x}^{post}}^{2})}_{2t})\\ + ({X}_{1t}-{X}_{2t})\beta +({Z}_{1t}-{Z}_{2t})\delta +({w}_{1{f}{^{\prime} }t}-{w}_{2{f}{^{\prime} }t})\end{array}$$

Under the set of assumptions that we made, this specification can be estimated by OLS to identify the key parameters. Variation used to identify the parameters \({\gamma }_{1}\), \({\gamma }_{2}\), \({\gamma }_{3}\) and \(\delta\) comes from sibling pairs where one sibling has children and the other does not. To identify \(\delta\), we need to observe men in the group of father-at-some-point in employment before they actually become a father; that is, we need variation in \({a}_{it}\), which is independent of \({Z}_{it}\).Footnote 27 This highlights the value of our data, in which we observe the complete employment and earnings history before and after becoming a father, as well as the complete fertility histories. We also require that \({a}_{it}\) and \({Z}_{it}\) are uncorrelated with \({w}_{ift}\).

Our approach has some advantages over previous approaches. First, the identification of the non-linear post-childbirth effect relies on cross-sectional variation since we take differences between brothers in the same period. Since we can distinguish father-at-some-point compared to childless men, it is not the timing of births that generates the variation. Second, the CV estimator does not rely on the strict exogeneity assumption unlike fixed effects estimators that introduce dynamics into the error term and therefore potential feedback effects. Also gaps in individual panels do not cause problems as in FE (see Loughran and Zissimopoulos (2009), Ludwig and Brüderl (2018)). Third, since we compare two men from the same family (same mother and same father), they are more similar in terms of the unobserved component than two randomly selected men from the population. This is likely to reduce the bias. We show compelling evidence that this is the case and taking out heterogeneity related to family fixed effects does reduce the bias.

In order to reduce further bias—because strict equality of \({\mu }_{1if}\) and \({\mu }_{2if}\) within each couple of brothers may not hold—we also compare siblings who are more similar in age. By reducing within sibling differences in age, we also reduce differences in family environment. That is, we are then comparing outcomes for brothers whose parents are in more similar career phases, may have more similar time and monetary resources or have more similar parenting experience.Footnote 28 Differences in these characteristics decrease between brothers who are more similar in age, and related heterogeneity can be further reduced. Note that, due to the large data sample, we are able to run separate regressions on sub-groups of siblings with a specific age difference. In the sample of first-and second-born brothers, the mean age difference is 3.5 years and it is reduced to zero when we use only twins. Using twins offers the advantage that we directly control for family fixed factors, individual fixed factors and time fixed factors.

5 Empirical results

5.1 Does having children affect male earnings?

Table 2 reports the estimation results of the main earnings regression in Eq. (10). We report the coefficients together with standard errors that are clustered at the sibling pair level. We report estimation results from ordinary least squares that exploits only the cross-sectional variation and from the covariance estimator, that uses the within family or sibling couple variation.Footnote 29 The top three rows (upper panel) report the parameter estimates of the auxiliary variables; the common return to education, and the return to experience (squared). The next panel of coefficients reports the differential effect in the intercept, that captures differences at first entry into the labor market, and the differential return to experience (squared) between the fathers-at-some-point and childless men. The three coefficients reported in the lower panel are the key coefficients of interest; the differential post-childbirth effects on earnings. OLS gives a parametric description of the earnings profiles as we showed them in Fig. 2. The results reveal a positive average return to education. The return to years of work experience for the comparison group of childless men is 6.9% in the first year and then declines. The differential effects on entry earnings (i.e. the intercept) and work experience since first entry into the labor market between fathers-at-some-point and the group of childless men are significant and positive (middle panel of Table 2). Hence the descriptive finding is that fathers-at-some-point start on higher and steeper earnings paths than childless men. After the first childbirth, the immediate shift in earnings is 0.073 which gives a 7.3% increase in earnings compared to the period before fatherhood. The differential effect in experience post birth is slightly non-linear.

Table 2, column 2 reports the results from the CV estimator. Compared to the sample used for OLS, we now have half the number of observations because CV uses variables in differences between brothers. The CV estimates of the coefficient for the post-birth variable shows that earnings significantly shift upward in the birth year of the first child. The point estimate of the effect after childbirth is 6.4%. The curvature in experience post birth reveals that the marginal effect of experience post birth is almost zero. In comparison to OLS, the CV estimate gives a relatively smaller increase in earnings post birth at any level of experience post birth. The estimates using CV suggest that positive selection on unobserved factors biases OLS upwards. The CV estimates show that 13%, that is \(((1-0.064)/0.072)\,\times\, 100\), of the simple OLS estimated shift effect post birth, that is immediately or less than one year after childbirth, is due to positive selection on fixed family-specific characteristics.

Table 2 The effect of children on log earnings: OLS and the family fixed effect (CV) estimation results

Siblings are genetically more similar than randomly selected men, which is why CV controls for more heterogeneity than OLS. Still, siblings might be quite heterogeneous in terms of family background, which may introduce bias and thus make family fixed factors appear less important. We explore remaining biases through potential time-varying differences between siblings—such as differences in parenting experience, parent characteristics and age by using more homogeneous sub-samples of brothers in terms of age. The new estimation results are reported in Table 2, where we gradually decrease the age differences in the sample of brothers to two years or less (column 3), one year or less (column 4), to zero (where we use the sample of twins (column 6)). Column 5 is reported in order to show that we can replicate the main results from OLS for the sample of all brothers with the sample of twins.

When we select brothers who are less than three years apart in age (column 3), the shift effect on earnings after childbirth diminishes to 4%. The differential effect in experience post birth is economically negligible and only the coefficient of the squared experience post-birth variable is significant. The estimate is substantially smaller than the post-birth estimate from CV when we used all siblings. Recall that we estimate the effect post birth conditional on actual work experience so that age differences between brothers capture additional factors, which could be time varying family background factors.

When we reduce the age differences between brothers further to less than two years (column 4) then the results on the post-childbirth effect remain unchanged. We now see even more clearly that the effect post birth is only an upward shift at birth in earnings of 4%. Hence, there is no further adjustment in terms of earnings during the years after first child-birth. This may be surprising given that most have a second or further child which may induce further adjustments.

When we use the sample of twin brothers (see column 6), the effect post birth declines even further. The coefficients of all of the post-birth variables become statistically and economically insignificant. Note that even though the coefficient of experience post-birth squared is statistically significant, the estimate coefficient is economically negligible (0.00043). The point estimate of the shift in earnings post birth is small, and not significant. With a F-test (not reported in the table) we can also show that estimates from brothers one year different in age, or less, and twins are jointly significantly different (Table 2, column 4 compared to column 6).

When we look at the estimation results of the other coefficients, we find that the differential effect on entry earnings (i.e. the intercept) and experience remain significant between men in the group father-at-some-point and in the group childless men. There is one noticeable difference between the estimates from CV on the sample of brothers and CV on the sample of twins. When we use brothers of different ages, we find differences in the earnings trajectories from early on in the careers. When we use twins, differences in earnings are only significant at first entry into the labor market. Hence, men who remain childless all their lives, and fathers-at-some-point, are on the same earnings profile—except that those who become fathers at some point in life start on relatively higher pay. This implies that we do not find any evidence that twin brothers invest differently into their careers over time. This makes these estimates more convincing than those from brothers since essentially we can compare two men with parallel trends (until childbirth). This gives an unbiased estimate of the effect of childbirth. The main results for twins are illustrated in Fig. 3 where we present predicted earnings profiles from OLS and CV estimates.

Fig. 3
figure 3

Results for twins: Predicted earnings profiles for a man who is continuously working and enters fatherhood in year 10 (blue line). Childless men (red line) are continuously working from first entry. The predictions use the estimates reported in Table 2 column 5 (OLS)—left figure—and Table 2 column 6 (CV)—right figure

The estimate using differences between twin brothers accounts for time fixed effects, family fixed effects, and individual fixed effects. Note also that the mean pair is now genetically more similar, because around 30% of twins in the sample are monozygotic and hence genetically identical at birth, which motivates the assumption that we can control for individual fixed effects.Footnote 30

5.1.1 Testing who to compare to

We explore further factors that may explain differences in the earnings profiles from first entry into the labor market between the fathers-at-some-point and others, and which may also be positively correlated with the post-birth effect of children. First, we change the control group and now use childless men married-at-some-point as an alternative comparison rather than all childless men. We may argue that those who marry at some point but remain childless, may have also had plans to have children at some point, but did not realize those plans for reasons uncorrelated with labor market behavior. Since the group of all childless men that we have used so far may be quite heterogeneous, taking the sub-group of childless men who also married-at-some-point makes the comparison group more homogeneous and more similar to the group of fathers-at-some-point. This contention is supported in Table 1, where we show that men in the group fathers-at-some-point are more similar with respect to mean years of education and work experience to childless men married-at-some-point than to all childless men.

In Table 3, we report the corresponding estimation results of the earnings regression only for the brother samples.Footnote 31 Columns 1 and 3 report selected coefficients showing the differential effects of childbirth. The differential effect on entry earnings (i.e. the intercept) between fathers-at-some-point and the restricted group of childless men married-at-some-point is now much smaller than those reported before in Table 2, columns 2 and 4. In column 1 of Table 3, the estimated difference between mean entry earnings is not significant, and the differences in slope coefficients are very small. When we restrict the age difference between brothers to less than two years, then the differential effects since entry into the labor market become statistically insignificant and economically close to zero (Table 3, column 3). This suggests that the large differences between those who are not yet fathers and all childless men are, to a large extent, driven by childless men who never marry. The result shows that this group performs relatively worse in the labor market. The result is robust to restricting the treatment group to ever married fathers-at-some-point. Regarding our main results, however, the size of the post-birth coefficient (that is, the coefficient of the shift and the experience post-birth variables) do not change significantly compared to our main results in Table 2.

Table 3 The effect of children on log earnings estimated by CV: comparison group is childless men who are married-at-some-point

5.1.2 Testing whether the post-first birth effect is also capturing marriage

Returning to the results for brothers (Table 3, columns 1 and 3), a question remains as to whether the remaining effect of entry into fatherhood is driven by childbirth or by marital status. As a simple test we add as a control variable an indicator variable switching to one when a man is actually married. We re-estimate these regressions using the samples of brothers (excluding twins) where the comparison group is restricted.

In Table 3, columns 2 and 4, adding a control for being married to our previous specifications reduces the size of the post-birth earnings effect. For brothers who are only one year apart in age (Table 3, column 4), the effect of children now is 3.2% and economically constant. Hence, the effect is reduced by 28% (=1 − (0.032/0.044) × 100)).Footnote 32 We interpret this as a simple test showing that the estimated effect is capturing mainly variation related to having children, rather than marriage.

The post-birth effect could still be the joint effect of entry into fatherhood and cohabitation, since men not married at childbirth may be cohabiting. The results show that even if some men in the father-at-some-point group already cohabit before having children, it is noteworthy that we do not observe differential effects from first entry into the labor market—or before childbirth. So household specialization does not explain pre-birth differences.Footnote 33

5.1.3 Testing whether the post-birth effect captures second or further births

It is also possible that the earnings effect at first childbirth captures the effects of second or subsequent births. To test this hypothesis we extend our specification in column 3 and add dummy variables for second and further births up to 5 or more. As can be seen from column 5 in Table 3 the differential effect at entry into the labor market does not vary significantly across parity compared to childless men married-at-some-point. The shift in post-first childbirth earnings remains significant and robust in size. Estimates that are not reported here would also show that the shift and the return to post-childbirth years of experience does not change after the second childbirth or further births. This result highlights that the event of (first) fatherhood is important for earnings and not the number of children.

5.2 Testing robustness across time and the paternity leave reform

We exploit the exogenous variation arising from the 1993 parental leave policy reform, in order to test the robustness of our results over time within Norway. The reform introduced four weeks of paid paternity leave. The proportion of fathers taking leave increased from 4% in 1993 to 33% in 1994 (see www.SSB.no). The reform applied to parents of children born after 1 April 1993. Since we rely on yearly data, we assume that post-1993 births are eligible.Footnote 34

In order to test whether our results hold over time, we re-estimate the main regressions from Table 2 on a restricted sample. We drop earnings from the individual time series if the earnings are post-childbirth and the childbirth was in or after 1993. Hence, we keep all individuals in the data panel, and the restriction is primarily on the individual time series of earnings. This strategy should remove any potential effects occurring through the reform, or through the post-reform take up of paternal leave. The replication of Table 2 on the restricted sample is reported in Table 4.

Table 4 Robustness Checks: Regression results on a restricted sample to exclude effects through paternity leave reform

The descriptive estimate (OLS) in Table 4, column 1 replicates the pattern that we have found before. Men who become fathers at some point are on higher earnings paths already, even before entering fatherhood. Post-childbirth earnings increase non-linearly. If we estimate the model using the CV estimator, and use brothers excluding twins, then we see that the post-childbirth effect is smaller than from using OLS. As we decrease the age difference between brothers the post-childbirth increase in earnings tends to decrease. For the sample of twins the estimate of the post-birth earnings effect is 3.2% and economically constant across years of post-birth work experience. We interpret these estimates as upper and lower bound estimates, although they are not statistically different from the estimates on the full period taking the standard errors into account. As before for the twin sample estimates, we see that earnings paths from first entry into the labor market are not significantly different, except that entry wages are higher for those in the father-at-some-point group than for childless men.

A more careful comparison of the estimation results for the post-birth period (lower panel) reported in Table 4 and Table 2 reveals that Table 4 shows systematically larger post-child birth effects when we exclude effects through the paternity reform than Table 2. Hence, it seems that the decline in the marginal effect post birth could be a reform effect and could be interpreted as an intention to treat effect. Alternatively, it may capture other trends, such as a decrease in the gender-specific household specialization related to having children. Note, in order to explain the pattern plausibly, the change would have to happen on a more permanent basis rather than simply being an adjustment during the period when the child is very young.

5.3 Testing for responses in labor supply

In Table 5 we provide more evidence on the question of whether temporary paternal labor supply adjustments are one of the mechanisms explaining the positive post-childbirth point estimates in the sibling data (excluding twins). We re-estimate our empirical model on the sample of siblings but now replace the outcome variable by a measure of labor supply. From our data we can construct an indicator variable for whether an individual is employed or not, and whether working hours are larger than 30. We find that childless men are less likely to be employed and work fewer hours, which is consistent with the findings on earnings. When we restrict the control group to childless men married-at-some-point (columns 3 and 4) most of the coefficients turn insignificant. An exception is the economically small increase, 0.004 percentage points, in the probability of working more than 30 h. Calculations show, however, an economically zero effect during the 3 to 4 years following entry into fatherhood (column 4). Overall, these results present little support for the hypothesis that the remaining effect captures labor supply responses.Footnote 35 Regarding inequality between all childless men and fathers-at-some-point we learn however that childless men who are never married have more spells of non-employment and shorter hours of work.

Table 5 Linear Probability Model results for employment and hours of work

5.4 Discussion

The main result of this study is that the effect of children on earnings is not significant when we estimate flexible Mincerian earnings regressions, controlling for a differential effect on entry earnings and experience since first entry into the labor market and family fixed effects. When we compare twin brothers, there is no significant effect of children on earnings. Twins have similar earnings growth paths all through their careers independent of whether they have children or not. These results refute explanations related to gendered household specialization and employment discrimination.

Given the Norwegian context, we may expect that the fatherhood effect in Norway is relatively small. This might be due to family policy, but not only, and perhaps even more so due to high wage compression and high female labor force participation.Footnote 36 Our findings confirm this. It may seem surprising that we find strong selection effects through family fixed effects indicating that the variation between families is relatively important. Family fixed effects may capture socioeconomic background, investment by parents into children, or related factors that some typically regard of relatively less importance in high equality countries.

We also show that most of the variation in earnings comes from children, and only a minor part through marriage. This is in line with new studies that also find that after controlling for wage growth, the marriage premium becomes smaller and not significant (Killewald and Lundberg 2017; Ludwig and Brüderl 2018). Other studies on Scandinavia also find small marriage premiums in fixed effects models (for Denmark, see Datta Gupta et al. 2007, for Norway, see Petersen et al. 2011). Our results contribute to the small literature that has investigated both the marriage premium and the effect of having children on men’s earnings. For Denmark, Datta Gupta et al. (2007) estimated a marriage premium of 1.2% and a positive effect of 0.9% per year for having children younger than 3 years. Hence, the Danish study suggests temporary post-childbirth earnings effects which we do not find. Temporary adjustments after child birth could be related to gender specific household specialization during the child care intensive infant period.

Our results add to the understanding of sources of male earnings inequality. We show that differences in observed earnings profiles between childless men and fathers-at-some point are increasing from the beginning of their working careers. This can be interpreted as effects through differential investment into human capital, which is a key driver of inequality. We find that men who remain unmarried and childless are a negatively selected group, contributing to the observed earnings inequality among men. This finding is consistent with findings for Sweden showing that single men earn less than fathers (Boschini et al. 2011). We also find that earnings inequality is affected only by the arrival of the first child. This contrasts with the literature on women showing that the shift effect of children on post-birth earnings is negative and increasing in the number of children (Waldfogel 1998).

Our results show strong evidence that family background is an important cause of the observed child earnings premium for men. This supplements the few previous studies that have used models with family fixed effects in the earnings equation.Footnote 37 We find that OLS regressions which use the cross-sectional comparison of men who have a child and who remain childless are plagued by selection effects into fatherhood. OLS regressions overestimate the (positive) effect of having children on male earnings. This implies that individuals with relatively high values of the (unobserved) family-specific factor in our model are more likely to become fathers.

This research suggests that selection on family background into fatherhood is important. We conclude that there is positive selection although this could differ in other countries and settings. Our findings also raise questions as to why family background is important. In order to answer this question we would need more research with richer data on potential mechanisms. For example, explanations could be directly related to the social environment created within the family, or the neighborhood, cognitive development or the values engendered within the family.

6 Conclusions

This study investigates whether having children has a causal effect on earnings for men with an additional focus on the effects on earnings inequality among men. The empirical analysis employs novel data on the population of siblings drawn from longitudinal Norwegian registry data that contains the complete employment, earnings and fertility histories for all brothers and, more specifically, twins. We estimate flexible earnings regressions where the unobserved component is common to brothers and non-random selection into fatherhood is taken into account. From descriptive analysis, we find that during the early career and before entry into fatherhood earnings inequality between fathers-at-some-point and childless men gradually increases. Men who remain childless and never marry contribute considerably to the increase in earnings inequality. The effect of fatherhood is capturing observed variation at first childbirth. Our results show that the conditional effect of first entry into fatherhood is declining the more effectively we control for family fixed effects. The large data set allows us to control for family fixed factors using differences between first- and second-born siblings, and by restricting the sibling age differences up to zero by using data on twins. Novel to the literature, we show that non-random selection into fatherhood is captured through family fixed factors and higher earnings growth even before entry into fatherhood. Most compelling are our results for twins, where we find that the conditional earnings effect of children for men post birth is insignificant. For other samples, we also find a relatively small yet still significant effect of having children. In summary, we conclude that it is not primarily the effect of children that makes fathers earn higher incomes, but that higher earners are more likely to become fathers.

The evidence in this paper adds to the debate about the sources of inequality among men and the gender wage gap. This research highlights that men who remain childless, and unmarried, are a negatively selected group on relatively low earnings profiles. This makes them a potentially higher risk group in the labor market more generally. Our results suggest that this group is less likely to be employed and less likely to be working long hours. One interesting question is whether these men are more likely to be unemployed, or on sickness leave. Regarding the gender wage gap, the conventional view is that having children has a negative effect on mothers’ earnings and a positive effect on fathers’ earnings. This suggests that, other things being equal, the redistribution of household time and time spent with children would potentially reduce the gender wage gap through a decrease in male work time. But our results reveal that the observed child premium for men is an upward biased estimate of the direct effect of fatherhood on men’s earnings. Hence, redistributive policies at the household level are potentially less effective than would be expected from observed gender gaps.