1 Introduction

One of the most important demographic trends in recent decades has been the incredible decline in the fertility rate in developed countries. From 1960 to 2006, the fertility rate decline by 49% in OECD countries; since 1980, the rate has declined by 22% (OECD 2009). This downward trend in fertility, as well as subsequent attempts to raise fertility through financial incentives (see Milligan 2005), has fuelled research on the impact of fertility on other outcomes. One of the areas of particular concern has been child development. According to Gary Becker’s ‘Quantity–Quality’ theory of children, the household faces a trade-off between the number of children they have and the average ‘quality’ of children (Becker 1960; Becker and Lewis 1973; Becker and Tomes 1976). Researchers then began investigating the empirical link between fertility and child quality. Using clever instruments to correct for the possible endogeneity of fertility to child quality (e.g. the incidence of twins, the sex composition of children, or China’s one child policy), they looked at measures such as educational attainment (Rosenzweig and Wolpin 1980; Angrist et al. 2005; Black et al. 2005, 2010), grade retention (Cáceras-Delpiano 2006), labour market outcomes (Angrist et al. 2005), marriage and fertility (Angrist et al. 2005), and school enrolment (Qian 2009). What has emerged from this literature is that there is no consistent negative link between fertility and child outputs. In fact, the balance of the literature points to no relationship between the two.Footnote 1

As noted by Cáceras-Delpiano (2006), however, the true spirit of Becker’s model is that households will reduce their investments in children as family size grows. The resulting level of child quality is beyond complete control of the parents. Using the 1980 US Census, he demonstrates that increased fertility (instrumented with a twin birth on a second or later birth) exerts a negative impact on private school enrolment. The question that remained was simple, “Why does fertility reduce parental investments in children, but has no consistent impact on various measures of child output?” Cáceras-Delpiano points to one possible factor, namely the reduction in maternal labour supply suggested by theories of household allocation of labour. When faced with more children, men and women further specialize in household activities, which for men is typically paid work and for women is typically unpaid housework (Becker 1985). In addition, a new child will increase the demand for parental involvement in general (Lundberg and Rose 1999). In combination, these two complementary theories suggest an ambiguous impact of fertility on men’s labour supply, and an unambiguous decline in women’s labour supply. Cáceras-Delpiano showed that maternal labour supply does indeed tend to decline with fertility. Unfortunately, the data available only contained information on hours of paid work, so it is not clear what mothers are actually doing in the home (i.e. taking care of children or not).

This study extends the work of Cáceras-Delpiano (2006) in two important ways. First, using the same twin birth strategy and a unique Canadian data set (the Youth in Transition Survey—YITS, linked to the Programme for International Student Assessment—PISA), I explore the link between fertility and a broader range of investments that are arguably linked to parental intentions for producing higher quality children (overall and non-sectarian private school enrolment, the number of computers in the home per child, and saving for the child’s education). I find a strong negative link between fertility and each of these measures of parental investments. I also demonstrate that fertility is not negatively associated with child cognitive ability (a measure of child output).

Second, I consider three possible reasons for the ‘puzzle’ presented in the literature (i.e. fertility is negatively related to parental investments, but is not always related to child outputs). The first is that the literature has not definitively linked parental investments in children to their cognitive ability. In fact, quasi-causal and experimental studies suggest no link with certain measures of parental investments. The second is that there may be economies of scale associated with larger families, with respect to rearing children and selecting effective sibling interactions. I present evidence suggesting that economies of scale explain some, but not all, of the puzzle. The third possible reason follows up on Cáceras-Delpiano’s work. Specifically, recent work suggests that mothers (and fathers!) do in fact respond to increased fertility by spending more time taking care of children without pay. However, the literature offers mixed evidence on the role of parental childcare in determining cognitive ability (ranging from no effect to a positive effect).

The remainder of this study is as follows. In the next section, I describe the methodology, including the identification strategy and the data used in the study. I then present the descriptive and econometric results in the following two sections. Next, I identify and discuss three possible reasons why fertility is negatively associated with parental investments in children, but generally does not lead to reduced child outputs. Finally, the study is summarized in the last section.

2 Methodology

The identification strategy is based on instrumenting family size with a variable indicating the presence of a multiple (twin, triplet, etc.) birth on the second or later birth, similar to Black et al. (Black et al. 2005, 2010) and Cáceras-Delpiano (2006). I begin with the Wald estimate, which is simply the ratio of the difference in average child input or output (\( \bar{Y} \)) between youth with (1) and without (0) multiple birth siblings, to the difference in the average number of children (\( \bar{N} \)) generated by the incidence of multiple birth siblings, as shown in Eq. 1:

$$ {\text{Wald}} = {\frac{{(\bar{Y}_{1} - \bar{Y}_{0} )}}{{(\bar{N}_{1} - \bar{N}_{0} )}}}. $$
(1)

The standard error of the Wald is (Angrist 1990):

$$ {\text{SE}}({\text{Wald}}) = {\frac{{\sqrt {\left( {{{\sigma {\frac{2}{{Y_{1} }}}} \mathord{\left/ {\vphantom {{\sigma {\frac{2}{{Y_{1} }}}} {n_{1} }}} \right. \kern-\nulldelimiterspace} {n_{1} }}} \right) + \left( {{{\sigma {\frac{2}{{Y_{0} }}}} \mathord{\left/ {\vphantom {{\sigma {\frac{2}{{Y_{0} }}}} {n_{0} }}} \right. \kern-\nulldelimiterspace} {n_{0} }}} \right)} }}{{(\bar{N}_{1} - \bar{N}_{0} )}}}, $$
(2)

where ‘n’ refers to the sample size. For significance testing, note that the Wald follows χ2(1). If the instrument is truly exogenous, then the Wald estimate has a causal interpretation, and there is no need to provide econometric evidence. In reality, however, no instrument is perfect. For this reason, I also estimate instrumental variable (two-stage least squares) regression, shown in Eqs. 3 and 4 (where Z is the multiple birth instrument, \( (\hat{N}) \) is the predicted value of N from the first stage, and X is a series of control variables):

$$ {\text{First {\text -} stage}}:\,N_{i} = \alpha_{0} Z_{i} + \alpha_{1} X_{i} + \varepsilon_{i} $$
(3)
$$ {\text{Second {\text -} stage}}:\,Y_{i} = \beta_{0} \hat{N}_{i} + \beta_{1} X_{i} + \mu_{i} . $$
(4)

In the case of categorical IVs (used here), the estimated effects of IV regression should be interpreted as Local Average Treatment Effects, or LATE (Imbens and Angrist 1994). In other words, the results only pertain to compliers (those induced to change their treatment status based on the instrument value). Unfortunately, it is impossible to identify compliers precisely since we do not know their counterfactual treatment status.

The data are drawn from the Canadian portion of the Programme for International Student Assessment (PISA), a project of the Organisation for Economic Co-operation and Development. In Canada only, the PISA is matched to the Youth in Transition Survey (YITS), Cohort A, Cycle 1. The YITS was developed in conjunction with PISA by Statistics Canada in order to add more contextual information than is available in PISA. For the purposes of this study, the YITS data set is unique in that it contains several measures of parental investments, child standardized test scores, family size, sufficient information to proxy multiple births with very high accuracy, and a large enough sample to conduct meaningful analyses. The YITS contains about 28,000 youth who were 15 years old on December 31, 1999 (i.e. they were born in 1984).Footnote 2,Footnote 3,Footnote 4

The measure of child output is the PISA reading score. Conducted in April and May of 2000, the PISA assessment consisted of a reading test delivered to all students, and a mathematics and science test (each delivered to half of the students). I focus on the reading score since the sample size is crucial (I use a multiple birth IV). The assessment was administered in the language of instruction of the school, which was either English or French. The reading test consisted of having students perform a range of tasks with different kinds of text that included retrieving specific information, interpreting text, and reflecting on the content and features of the text. The texts included standard prose passages and various types of documents such as lists, forms, graphs and diagrams. The test score was standardized to have a mean of 0 and a standard deviation of 1, so that regression coefficients are expressed in standard deviation units. In principle, PISA tests the ability of students to apply the knowledge they have learned. In this sense, it is a hybrid between an IQ test and a scholastic test.Footnote 5

Background questionnaires were administered to students through both PISA and YITS. Parents and schools were also administered questionnaires through YITS and PISA, respectively. The parent most knowledgeable about the child answered the parent questionnaire, while the principal of the school answered the school questionnaire. From these, three measures of child inputs were derived. The first is a binary indicator of enrolment in a private school, which is derived from the principal questionnaire. As we shall see, the majority of Canadian private schools are sectarian. Since parents may be motivated to enrol their children in sectarian schools for spiritual, as opposed to academic reasons, I also look at enrolment in private, non-sectarian schools.Footnote 6 The second measure of parental investment is the number of computers in the home per child, derived from the PISA student questionnaire.Footnote 7 The third measure of parental investment is a binary indicator of the presence of parental savings that are earmarked for the child’s postsecondary studies (i.e. started a savings account, started a Registered Education Savings Plan (RESP), set up a trust fund for this child, or made investments).

The instrumental variable (a multiple birth on a second or later birth) is set to 1 if the following three conditions hold: there are multiple birth siblings in the household, the multiple birth siblings must have occurred on the second or later birth (multiple birth siblings on the first birth are unlikely to lead to additional children), and the multiple birth siblings must have occurred on the last birth (otherwise, they could not possibly lead to additional children since the couple chose to have more children following the multiple birth).

I also create variables indicating the number of children in the household (the fertility measure) and the birth order. The age of household members is measured in discrete years on the YITS parent questionnaire, which poses two challenges. First, two children who were the same age in years can not be distinguished in terms of birth order. Second, multiple birth siblings cannot be precisely identified. However, the incidence of siblings of the same age in years is likely more common in blended families where the husband and wife each have children from a previous marriage. I thus restrict the sample to youth in families where all siblings in the household are living with their biological mother. Using an extraneous data source containing household member relationships and exact birth dates, I demonstrate that under these conditions, multiple birth siblings can be correctly identified in about 98% of the cases.Footnote 8

The other variables used in the analysis include the sampled child’s age (reported in months, but rescaled to years in the analysis), the child’s sex, whether the child’s mother tongue is the same as the test language, the age at arrival to Canada (in categories), the mother and father’s age (reported in discrete years) and their highest level of education, and finally, the household’s province of residence.Footnote 9 Note that the parental age variables are in quadratic form since their range is potentially quite high. For the child’s age, a linear specification is used since the age range is quite narrow, as all children in the sample were born in 1984.

The initial sample consists of all youth with two opposite sex parents in the home, including the biological mother of the child.Footnote 10 As shown in the left side of Table 1, this includes 18,756 youth. The mean and standard error of each variable used are also shown. The process of selecting the final sample involved three more steps. First, I drop youth who were part of a multiple birth. This is important since multiple birth siblings may be different than singleton births for biological or environmental reasons. The impact of applying this criterion is minimal. The sample declines moderately by 351. The average values of the child input and output variables are virtually unchanged as a result. The average number of children is marginally lower, but this is expected since multiple birth siblings obviously come from larger families. The proportion of youth with a multiple birth sibling is cut in half, from 0.012 to 0.006, which is expected for the same reason. All of the other variables in the analysis have similar average values after the criterion is applied. The second step consists of dropping youth in families with only one birth. This measure is adopted since multiple birth siblings are more likely to occur in larger families. Moreover, a multiple birth on a second or later birth is obviously not possible when there is only one birth. The impact on the sample size is a bit larger this time: from 18,405 to 15,471. The number of computers per child is somewhat lower, which is an early indication of the relationship between family size and this variable. Not surprisingly, the average family size and birth order increase. The proportion of youth with a multiple birth in the family also increases, but only slightly. All of the other variables are largely unaffected. The third step consists of dropping youth in families with a multiple birth on the first birth. This measure is also necessary to implement the multiple birth strategy since many families may want to stop at two children (and thus, a multiple birth on the second may have an optimal impact on family size if it follows a singleton). However, this resulted in a small decline of 42 and virtually no change in the sample statistics.

Table 1 Means and standard errors of variables used in the analysis by sample selection criteria

In the final sample, a total of 119 youth have multiple birth siblings. Although this is far less than the studies using the US Census (e.g. Cáceras-Delpiano 2006) or Norwegian administrative data (Black et al. 2005, 2010), it is still greater than the 25 available to Rosenzweig and Wolpin (1980), who looked at fertility and schooling among Indian households.

3 Descriptive results

The means of the outcome variables by the number of children in the family are shown in Table 2. There is little to no relationship between the reading score and the number of children. In terms of private school attendance, the rates are fairly consistent across different family sizes (except when we go from 5 to 6 children, which is a rare event). This may simply reflect preferences for larger families among religious households. In contrast, private, non-sectarian school attendance rates fall as families become larger. Family size and the number of computers per child are also negatively related.Footnote 11 Finally, the proportion of parents saving for postsecondary education (PSE) falls substantially as fertility rises.

Table 2 Means of outcome variables by the number of children in the family

The evidence presented in Table 2 is very preliminary since the number of children is largely a choice given the widespread availability of contraceptives. In Table 3, I show the Wald estimates. Altogether, the results suggest that one additional child generated from a multiple birth is associated with about one quarter of a standard deviation increase in the reading score (albeit not statistically significant), but a 4.2 (1.1) percentage point decrease in the probability of attending a private (private, non-sectarian) school, a 0.142 decline in the number of computers per child, and an 8.8 percentage point decline in the probability of having parents who saved money for the child’s postsecondary studies.

Table 3 Wald estimates

An even more convincing analysis should account for differences in other covariates. First, multiple births may not be fully exogenous, as they have been associated with fertility drugs in recent years. Even though all youth in the sample were born in 1984 (well before the widespread use of fertility drugs), it is possible that some multiple births result from the drugs. Second, sampling variability in small samples may generate differences in socioeconomic characteristics between youth with and without a multiple birth in the family. In Table 4, I show the means and standard errors of the relevant covariates by multiple birth status.

Table 4 Means of explanatory variables by presence of a multiple birth in the family

In general, the differences in means are quite small, but some are worth noting. For instance, the average birth order is lower among youth with multiple birth siblings. This is tautological: I code multiple births to 1 only if it is on the last birth (i.e. higher birth orders) and exclude youth who are themselves part of a multiple birth. Therefore, youth with a multiple birth in the family can not be last born children by definition. The average age of the parents is also lower among youth with a multiple birth sibling. Mothers are about 1 year younger on average, while fathers are about 2 years younger. Again, this is related to the definition of a multiple birth, which implies that youth with a multiple birth in the family can not be last born children. Moreover, all youth in the sample were born in 1984. Most of the remaining covariates are similar in mean values, with some exceptions. Given these non-negligible differences, it is imperative to verify if the results reported so far hold when they are taken into account. This is precisely what I do in the next section, which considers econometric evidence.

4 Econometric results

In this section, I use ordinary least squares (OLS) and instrumental variable (two-stage least squares) regressions to estimate the relationship between child input and output variables and the number of children and other covariates described earlier in Table 1. The coefficients on the fertility variable appear below in Table 5.

Table 5 OLS and IV regressions of child quality measures on the number of children and other controls

The OLS results suggest that one additional child is associated with higher reading scores (significant at 1%), but is negatively associated with the number of computers per child and parental savings for PSE (both significant at 1%). In terms of private school attendance, there is a slight positive relationship (significant at 10%), and no significant relationship with private, non-sectarian school enrolment.

The first-stage IV regression results (not shown) suggest that a multiple birth is associated with 1.3 additional children, which is significant at 1%.Footnote 12 The F-statistic (i.e. the square of the t-statistic on the multiple birth coefficient) is 175.033, which is well above the thresholds for strong IVs established by Stock and Yogo (2005). I show the second stage results of the IV regressions in Table 5. The results suggest that one additional child is associated with an improvement in the reading test score equivalent to 15% of a standard deviation, although the coefficient is not quite statistically significant. The key point is that the coefficient is not negative. In contrast, fertility is negatively associated with parental investments in child quality. Specifically, the additional child reduces the probability of the youth attending a private (private, non-sectarian) school by 4 (0.9) percentage points, which is significant at 1% (5%). The number of computers per child declines by 14.1 percentage points (significant at 1%). So too does the probability that the parents save money for the youth’s postsecondary education (a decline of 11 percentage points, which is significant at 5%).

5 Attempting to reconcile the findings

What factors may explain why family size is negatively associated with investments in child quality, yet is not negatively associated with child quality per se? In this section, I discuss three potential candidates.

The first possibility is that, despite the best intentions of the parents, their investments in children may simply exert little or no influence on the measured child output. For example, Neal (2009) reviews the literature on the effects of private schools on academic achievement and attainment (including experimental evidence based on voucher programs), and concludes that there is no evidence suggesting that private schools are generally superior to public schools. The one exception is with private schools that serve minority students in the United States, although this finding is likely the result of the poor funding for urban public schools. In terms of computer use, perhaps the most credible study comes from Angrist and Lavy (2002), who examine the randomized introduction of computers in Israeli elementary and middle schools in the 1990s. They conclude that although the introduction of computers raised the use of computer-aided instruction among teachers, it did not have any effect on student performance. Computer use in the home may even be less beneficial if children use them to play video games.Footnote 13 With regards to parental savings for the child’s postsecondary schooling, I am not aware of any studies devoted to credibly estimating its relationship with test scores. However, it is hardly a stretch of the imagination to assume that parents who save for their child’s education do so at least partly in response to their child’s abilities.Footnote 14

A second possibility is that there may be economies of scale in rearing children and/or in selecting effective sibling interactions in larger families. In terms of rearing children, siblings in larger families may be more likely to share the same toys or clothes, leaving more resources available for other household goods, some of which may be related to learning activities. Also, there may be economies of scale regarding the time allocation of parents. For example, parents may read to two siblings close in age at the same time. With regards to sibling interactions, the likelihood of finding a sibling who may be beneficial to interact with might be greater in a larger family. A child may benefit from an older sibling by acquiring information or aspiring to be like them. Alternatively, a child may also benefit from a younger sibling by feeling the pressure to serve as a role model, or by reinforcing knowledge through teaching the younger sibling. It is plausible that economies of scale are more likely to occur when the siblings are close in age. One way to reduce the potential impact of economies of scale is by re-estimating the child output IV model on a sample of youth who have no siblings who are close in age. Going back to the YITS data, I focus on youth with no siblings who are within 2 years in age. The impact of fertility on the standardized reading scores falls to 0.025, which is not statistically significant. In the main IV results (Table 5), the coefficient was 0.149. Thus, part (but not all) of the reason why academic performance does not decline with fertility (despite falling parental investments in children) may be due to economies of scale.

A third possibility is that, following the birth of a child, parents (especially the mother) reduce their paid labour supply in favour of unpaid work.Footnote 15 The resulting increased maternal contact with the child may foster child quality. In the literature, the findings range from a positive association between maternal contact and child development (e.g. Bernal 2008; Bernal and Keane 2010; Waldfogel 2006) to no association (e.g. Baker and Milligan 2008; Ermisch and Francesconi 2005).

6 Conclusion

The longstanding trend of declining fertility rates among industrialized countries has fuelled government initiatives aimed at raising family size through financial incentives. Although studies have shown that fertility is indeed amenable to policy intervention, it is less clear how children are affected by being raised in larger families.

Becker’s Quantity–Quality model suggests a trade-off between family size and parental investments per child. Only one previous study has investigated this relationship empirically and finds evidence to support the theory by focusing on one particular form of parental investment – private school enrolment (Cáceras-Delpiano 2006). The current study extends this work by examining a broader range of parental investments (overall and non-sectarian private school enrolment, the number of computers in the home per child, and saving for the child’s education). The evidence suggests a strong negative relationship between fertility and each measure.

The study also explored a ‘puzzle’ in the literature. Specifically, fertility has been found to reduce parental investments in children, but the balance of the evidence points to no link between fertility and child outputs. The current study adds to this literature by demonstrating no association between fertility and academic test scores. In terms of attempting to reconcile this puzzle, several explanations were offered. First, parental investments in child quality are not necessarily associated with improved child quality. In fact, the best empirical literature (based on credible identification strategies) has not reached a consensus on this issue when looking at cognitive ability (the measure used here). Second, there may be economies of scale associated with rearing more children and/or in selecting effective sibling interactions. Empirical investigation suggests that these factors contribute towards reconciling the findings, but not entirely so. The third possible explanation was raised by (Cáceras-Delpiano 2006), who suggested that larger families may increase maternal contact with the child. However, this argument critically rests on the availability of the mother in the home. Cáceras-Delpiano (2006) only had data on paid labour supply. Recent work suggests that mothers (and fathers!) spend more time in unpaid childcare when the family grows. However, the literature offers mixed evidence on the role of parental childcare in determining cognitive ability (ranging from no effect to a positive effect).