Introduction

Little is known about the etiology of the heterogeneous group of childhood cancers, but both genetic and environmental factors have been suggested to play a role [13]. Many studies have investigated the possible association between birth order and risk of childhood cancer, although the evidence to date has been inconsistent [1, 412]. As birth order is relatively straightforward to record, either through routine data sources such as birth registries or through questionnaire-based studies, where it is generally acknowledged to be well reported [13], it has often been used as a surrogate marker for in utero and/or childhood exposures.

Most notably, birth order has been used as a proxy for examining the role of infectious exposures early in life and the subsequent development of acute lymphoblastic leukemia (ALL), a topic about which there has been much debate, particularly with respect to the “delayed infection” hypothesis [1416]. According to this hypothesis, ALL results from an abnormal reaction to delayed exposure to common infections [17, 18]. It would then be expected that firstborn children would have less contact with infectious agents than children with older siblings and as such have an increased risk of ALL. However, given that data from medical records suggest that children who develop ALL between the ages of 2–5 years have, on average, more infectious illness episodes in the first year of life than those who do not [14, 1921], it would also be plausible for children with increasing birth order to be at increased risk of ALL.

With respect to in utero exposures, birth order acts as a surrogate for hormone levels, as a mother’s first pregnancy differs endocrinologically from later pregnancies [22] with both estrogen and progesterone levels shown to be higher during first pregnancies [23]. Indeed, epidemiological studies suggest a decreased risk of testicular cancer with increasing birth order [24]. There is also a well-established positive relationship between maternal parity and birth weight [25], with high birth weight associated with several different childhood cancer types, including ALL [5, 9, 26, 27]. Furthermore, there is recent evidence to suggest that maternal immune response may also vary with parity [28], and taken together, these two observations may be important for ALL development. One study observed an elevated risk with high birth weight in ALL patients who were firstborn rather than later-born, which may or may not reflect the combination of larger fetal size and later exposure to infectious pathogens incurred more frequently in the firstborn child [29]. While birth weight is regarded as a causal factor for several childhood cancers, parental age shows inconsistent evidence [1], but both factors are related to birth order.

In addition to causal mechanisms, alternative explanations cannot be ruled out. One may speculate that having a child with cancer would impact on family planning, for example by not having further children or by delaying having further children. Hence, sampling in case–control studies might increase the chance for controls to be of higher birth order due to larger average numbers of siblings. Lastly, in the case–control studies requiring active participation, family size may be related to willingness to participate introducing selection bias into a study, as it was observed for other family characteristics [30].

The objective of the present study was to investigate the association between birth order and childhood cancer in a nationwide birth cohort over a long time period. For this, we obtained data on all children born in Denmark between 1973 and 2010, involving almost 2.5 million births and about 5,700 newly diagnosed childhood cancers before the age of 20 years.

Materials and methods

From the Central Population Register (CPR), we obtained information on all children born in Denmark between 1 January 1973 (start of the Danish Medical Birth Registry, see below) and 31 December 2010. Since 1968, all Danish residents receive a unique CPR number, which includes date of birth and sex of the child, and permits accurate record linkage between the different national registries in Denmark [31]. The CPR also includes up-to-date information on vital status, migration, and first-degree relatives. Through the mother of the index child, all siblings, including their date of birth, were identified in the CPR and all stillbirths in the Danish Medical Birth Register, i.e., information used as the basis for counting of birth order and pregnancy order. The Danish Medical Birth Register was established in 1973 [33] and includes information on parental age at child birth as well as the birth weight of the child.

From the nationwide Danish Cancer Registry established in 1943 [32], we identified all children diagnosed with cancer below the age of 20 years within the defined birth cohort. The cancers were grouped according to the International Childhood Cancer Classification (ICCC; ICCC-1 (Birch Marsden Code; [34]) until 2003 and ICCC-3 [35] thereafter). We used the 12 main groups of ICCC, but combined groups XI and XII with group X as “others (X–XII)” because of small numbers and very heterogeneous subtypes. In addition, we subdivided group I into acute lymphoblastic leukemia (ALL), acute myeloid leukemia (AML), and other leukemias. Furthermore, we specifically looked at non-Hodgkin lymphoma and Hodgkin lymphoma in the lymphoma group, and astrocytoma, ependymoma, and the group of intracranial and intraspinal embryonal tumors (mainly medulloblastoma and therefore called like this) in the group of CNS tumors.

Our key exposure variable, birth order, was defined in two ways. First, we defined birth order counting all live births of the same mother, in line with the hypothesis that the number of older siblings would matter (see above mentioned delayed infection hypothesis). The group of firstborn children was subdivided into those without siblings (only children) and those with further siblings with the same mother, to obtain a surrogate measure of even lesser infectious contacts for only children compared with other firstborns, while acknowledging the distinction was only to arise in the future. Second, we defined birth order including stillborn children of the same mother, in line with the hypotheses that the pregnancy order would matter. In both definitions, multiple births were treated by assigning the same birth order to multiples and then continuing the counting while accounting for the real number of siblings; for example, for a mother having twins and one further child, the twins would both have a birth order of one while the last child was counted as the third child. In Denmark, the definition of stillbirth was changed from ≥28 to ≥22 gestational weeks during our study period, namely in the year 2004. Thus, the birth of a child with no signs of life in gestational weeks 22–27 would have been registered as a spontaneous abortion in the period from 1973 to 2003.

When including birth weight and parental age at the child’s birth as other explanatory variables to adjust for potential confounding, maternal age was dichotomized at age 35 years, paternal age at age 40 years, and birth weight was categorized into three categories of <2.5, 2.5–4, and >4 kg. Alternatively, we have also modeled paternal and maternal age using finer categorizations, specifically by categorizing them into 5-year age groups starting with <25 years and ending with >40 years for mothers and >45 years for fathers, respectively. Further sensitivity analyses looked separately at children born between 1973 and 1990, and born between 1991 and 2010, respectively, as day care patterns and birth rates may have changed over time.

Statistical analyses

We used Poisson regression models to evaluate associations between birth order and childhood cancer, estimating the rate ratios (RRs) and corresponding 95 % confidence intervals (CIs), with and without controlling for maternal age, paternal age, and birth weight. All children were followed up from date of birth until the age of 20 years, date of death, date of emigration out of Denmark, date of first cancer diagnosis, or end of study period (31 October 2013), whichever occurred first. The firstborn children served as the reference group for the comparisons. For comparability with results from previous studies, RRs for linear trend (with increasing birth order) and corresponding 95 % CIs were also calculated using Poisson regression models. The main analysis included all cancers diagnosed up to 20 years of age, but additional analyses were performed restricting to those cases aged 0–14 years at diagnosis for quantitative comparison with previous studies using this age range for their definition of childhood cancer. For the main analyses, birth order excluded stillbirths and RRs were adjusted for maternal age, paternal age, and birth weight. Alternatively logistic regression models and Cox proportional hazards regression models were applied to investigate whether the choice of statistical model would change any of the results.

All analyses were performed using SAS/STAT software [36].

Results

The birth cohort comprised 2,461,283 children, of which 1,262,979 (51.3 %) were boys, born between 1973 and 2010 inclusively. Annual numbers of births varied between 52,716 in 1985 and 73,327 in 1975. Among the total cohort, 1,099,058 children were firstborn (44.7 %), of which 227,913 (9.3 % of total and 20.7 % of firstborn) remained only children, with 906,852 (36.8 %) second-born, 336,017 (13.7 %) third-born, and 119,356 (4.8 %) with a birth order of four or higher. When stillbirths were taken into account, there were only slight changes to the birth order proportions: 1,094,468 (44.5 %) firstborn, 905,362 (36.8 %) second-born, 339,069 (13.8 %) third-born, and 122,384 (5.0 %) with a birth order of four or higher.

In the study population accruing a total of 38.6 million person-years of follow-up, 5,699 childhood cancers were observed, with leukemias and CNS tumors each representing approximately one quarter of cases. While 96.6 % of children had complete follow-up [because of either reaching age 20 years (51.4 %) or the end of the observation period (31 October 2013)] or had developed cancer (0.2 %), 1.0 % had died of other reasons and 2.2 % were lost to follow-up (emigrated from Denmark). Table 1 shows demographic characteristics of cancer subtypes with respect to age, sex, and birth order. Distributions by age and sex were in accordance with those reported by childhood cancer registries in developed countries, with higher proportions of boys being diagnosed with lymphomas, and cancers such as retinoblastoma, neuroblastoma, and hepatic tumors occurring between the ages 1–4 years, and lymphomas occurring in adolescents (15–19 years). Regarding birth order, 45.6 % of cancer cases were firstborn, similar to the proportion in the overall cohort.

Table 1 Demographic characteristics of childhood cancer cases at ages 0–19 years observed in a cohort of all live-born children born in Denmark between 1973 and 2010, followed up until 31 October 2013, by sex, age group at diagnosis, birth order, and by cancer type

Table 2 shows the associations between birth order and childhood cancers at ages 0–19 years. No significant associations were observed between birth order and any of the cancer subtypes when using all firstborn children as the reference group; for the majority of cancer types, rate ratios (RRs) were all around one. For acute lymphoblastic leukemia (ALL), all RRs were slightly below one. None of the RRs of linear trends were statistically significant, including for ALL.

Table 2 Adjusteda associations between birth order and childhood cancers at ages 0–19 and in a subcohort of those diagnosed at ages 0–14 years, by cancer type, in a cohort of all children live-born in Denmark between 1973 and 2010, followed up until 31 October 2013

RRs for Hodgkin lymphoma and non-Hodgkin lymphoma were not different. For Hodgkin lymphoma, for birth order 2, 3, and 4 and higher compared with firstborn, the RRs were 0.90 (CI 0.69–1.17), 1.16 (CI 0.82–1.64), and 1.29 (CI 0.76–2.22). The respective RRs for non-Hodgkin lymphoma were 0.92 (CI 0.67–1.26), 0.68 (CI 0.41–1.12), and 1.17 (CI 0.61–2.26). Among CNS tumors, RRs for ependymoma were 0.83 (CI 0.56–1.22), 0.94 (CI 0.55–1.61), and 0.31 (CI 0.08–1.31), for astrocytoma were 0.94 (CI 0.77–1.15), 0.93 (CI 0.69–1.24), and 0.98 (CI 0.61–1.57), and for medulloblastoma were 1.00 (CI 0.75–1.33), 0.64 (CI 0.39–1.04), and 0.57 (CI 0.24–1.32).

Table 2 also shows the same set of results between birth order and different types of cancers among cases aged 0–14 years. Results were similar to the broader age range. None of the RRs for linear trend were statistically significant.

Sensitivity analyses taking into account stillbirths of the same mother in the counting of pregnancy order had no notable effect as RRs only marginally changed compared with the main analysis (data not shown). Adjustment for parental age and birth weight had little effect, except marginally for all cancers combined and CNS tumors, where non-adjusted RRs were slightly lower than one (but statistically nonsignificant) and varied around one after adjustment (data not shown), and also the way how paternal and maternal age were modeled (dichotomous or using a finer categorization, see Methods) had no impact. Sensitivity analysis by time period (born between 1973 and 1990, and born between 1991 and 2010, respectively) did not show any consistent patterns, with small deviations most likely due to chance (data not shown).

Discussion

In this large study which included almost 2.5 million children born over a 37-year period and 5,699 cases of childhood cancer, we failed to observe associations between birth order and risk of any of the childhood cancer subtypes. Considering stillbirths and/or controlling for birth weight or parental age in the analyses had no effect on the results. Hence, we did not find support for the hypothesis that either the number of older siblings or the pregnancy order would matter.

Our study has a number of strengths over those previously published. Firstly, we adopted a nationwide approach with complete follow-up and virtually no missing data and thus provide a factual reflection of the situation in Denmark. Furthermore, we had the opportunity to incorporate accurate data on stillbirths into our analyses, as well as on some potential confounding factors, in particular birth weight and parental age—although neither variable impacted the overall results. However, one of the limitations was the lack of socioeconomic characteristics, such as parental education or income, given that previous research has shown that individual social position was not related to the risk of childhood leukemia in Denmark, but children born in low-income municipalities had an increased risk [37].

For ALL, results from epidemiological studies are not very consistent. In a five-state register-based study in the USA with 4,699 ALL cases diagnosed between 1980 and 2004, odds ratios were nonsignificantly lower than one, namely 0.97, 0.96, and 0.94, for children born second, third, or fourth or higher, respectively, compared with firstborn children [4]. In a Californian register-based case–control study of 4,721 ALL cases diagnosed between 1988 and 2008 (overlapping with [4]), the observed odds ratio was 0.97 (CI 0.87–1.08) for higher birth order versus first [6]. Similarly, a Californian record-based case–control study with 3,402 ALL cases aged 0–5 years from 1988 to 2007 (overlapping with [4, 6]) showed nonsignificant odds ratios of 1.00, 0.95, and 0.91 for birth orders second, third, and fourth or higher, respectively, compared with firstborn children, but indicated some stronger decrease in non-Hispanic Whites compared with Hispanic Whites [7]. A pooled analysis from the Childhood Leukemia International Consortium (CLIC), using data from 11 questionnaire-based case–control studies from eight countries with a total of 7,399 ALL cases diagnosed between 1979 and 2001, showed a pooled odds ratio of 0.94 (CI 0.88–1.00) for later-born versus firstborn with no monotonic trend of decrease with increasing birth order, but with substantial heterogeneity across studies (I 2 of 71 %) [8]. Individual study odds ratios ranged from 0.69 (CI 0.55–0.86; France) to 1.44 (CI 1.15–1.79; Quebec, Canada). In addition, a previous Danish cohort study involving cases diagnosed between 1968 and 1992 did show a nonsignificant trend of decreasing relative risks in the subgroup of ALL cases aged 0–4 years at diagnosis, but not in older children with ALL, but the majority of those cases are included in our larger case sample [38]. However, a large register-based study combining 1,905 ALL cases from Denmark, Iceland, Norway, and Sweden diagnosed between 1984 and 1999 did show a significant monotonic trend with decrease in risk of 0.90 (CI 0.84–0.96) per one unit increase in birth order, specifically for B-precursor ALL [5]. This is surprising as there is some overlap of the study with the present study in relation to their Danish cases; design features cannot explain the differences with both studies using an identical setup of registries and it cannot be explained further by the restriction to the subtype, as 86 % of ALL were B-precursor ALL in the other study. Reasons are therefore either a stronger effect in the other Scandinavian countries compared with Denmark or that the previous study included cases from an earlier time period; however, the differences between the two studies were also not marked with clearly overlapping confidence intervals. The most recent study from Sweden, however, using 3.57 million children in a national cohort, showed results that were compatible with ours, namely a small but non-statistically significant inverse association with birth order, with an RR for trend by increasing birth order of 0.98 (CI 0.93–1.04) [39].

Taken together and including our results for ALL, there is perhaps a suggestion of an overall, slight 10 % decrease in risk in children of second or higher birth order compared with firstborn children. However, statistical significance depends on the size of study, and there is also some heterogeneity across studies. This could be due to random variation or because the predictive power of birth order, as a surrogate for a particular exposure, may depend on the source population of the study. Overall, these observations do not lend support to the delayed infection hypothesis, but neither do they contradict it.

In Denmark, social contacts through day care are very common, and the proportion of 0- to 6-year-old children attending day care increased from just over 40 % in 1980 to 75 % in 1999 [40]; this is higher than in many other countries, and hence, in Denmark birth order may be a less predictive proxy of social contacts than elsewhere [8]. Given the UK findings of more frequent infections during infancy in children with ALL, it is perhaps surprising that the estimated rate ratio does not point in the other direction [19].

However, while birth order has been used a surrogate for exposure to infections in childhood, it is also an important determinant of other conditions both in the child (e.g., allergy) and the mother (e.g., preeclampsia). With respect to the mother, birth order has also been shown to impact on maternal immune response, including levels of IgE and IFN-γ as well as maternal/fetal HLA mismatch, which given the well-documented in utero origins of ALL may well be important [41, 42]. Indeed, it is possible that it is the changing maternal immune response with birth order, alongside other prenatal events, that impacts on the development of ALL.

A recent review suggests some evidence of an increased risk of childhood AML with increasing birth order, although the authors suggested this could in part be due to a maternal age effect [43]. We did find a slightly increased risk even after adjustment for maternal age, although stronger for those born second or third than fourth or higher. An earlier Danish cohort study with cases diagnosed between 1968 and 1992 found a nonsignificantly increased risk of Hodgkin lymphoma with increasing birth order [relative risk of 1.26 (CI 0.92–1.73)] [44], while in our study the respective relative risk of increase in birth order by 1 was 1.06 (CI 0.92–1.22).

Our lack of observation of an association between any of the solid cancers and birth order contrasts with the large US study described above, where reduced risks restricted to birth order of four and higher compared with firstborn children were observed for CNS tumors (OR 0.77, CI 0.68–0.89), neuroblastomas (OR 0.68, CI 0.55–0.84), and Wilms tumors (OR 0.67, CI 0.54–0.84), leading to an overall decreased odds ratio for their combined 17,672 cancer cases combined of 0.87 (CI 0.81–0.93) [4]. Our respective rate ratio for all cancers combined was 1.00, and no association was seen with any of the three diagnostic subgroups. For later-born children compared with firstborn children, a recent review of neuroblastoma found no clear evidence of an association, although the majority of studies found slightly decreased risks [45]. A large registry-based study in the Nordic countries of 3,983 CNS tumor cases confirmed our finding of no association with birth order, acknowledging some overlap in the Danish cases with our birth cohort [10]. In a similar study in the Nordic countries involving 3,298 cases of Wilms tumor, the odds ratio of later-born children compared with firstborn children was 0.98, again consistent with our findings [11]. Overall, there appears to be little evidence of an association between birth order and childhood solid tumor risk. The findings in the US study for high birth order needs attention regarding which underlying exposure may be reflected in the very high birth order in this setting since that might explain the findings being different to the Nordic countries; the US proportion of those born fourth or higher was twice as high as in Denmark (approximately 10 % [4] compared with 5 %), but we do not know whether this plays any role.

In conclusion, we did not observe an association between birth order and the risk of childhood cancer. Taking the evidence from all published studies together, there may be a weak protective effect for ALL, although this is still not clear and there is little insight into any causal mechanisms. As these results are based on a large nationwide cohort covering a long time span with all Danish children being included, they have some weight in the overall interpretation while results from smaller studies or case–control studies should be interpreted with caution.