Introduction

Maternal reproductive history (miscarriage, infertility, long intervals between births) and certain birth characteristics (birth weight, birth order) have been previously examined in relation to childhood cancer risk with most studies focusing on leukemia [1]. Increased risk with birth weight, being first-born or with maternal history of miscarriage have been reported. However, these factors have rarely been investigated simultaneously in the same study and mechanisms for these associations are largely unknown although various possibilities have been proposed.

The association with birth weight has been observed in many [212] but not all studies; most notably a recent population-based Californian study has reported no association [13]. Several biologic explanations have been proposed for the association with birth weight [3, 1417] but have not been tested yet. Birth weight is correlated with sex, maternal height and body mass index, gestational age, maternal age, parity and maternal smoking [18, 19], but most studies have been unable to take these additional factors into account. Some previous studies have reported a risk association also with low birth weight in childhood leukemias [10, 20].

Despite the well-established difference in childhood cancer risk between sexes [11, 2123], the reasons for the higher risk in boys is unclear. Few studies have considered sex-specific associations. However, several genetic associations have been reported exclusively in boys [2427] including one regarding prenatal loss [28]. Another sex-specific finding is the decreased cancer risk in male twins [29, 30]. This consistent finding is generally attributed to selective early mortality of male twin fetuses or neonates who would otherwise have developed a clinical cancer. The cumulative data suggest that the miscarriage association in childhood cancers should influence boys more than girls. Despite growing recognition of sex differences in disease pathogenesis, response to treatment and prognosis [31], most studies only report their results adjusted for sex rather than doing sex-specific analysis. In light of the generally increased risk for childhood cancers in boys, we hypothesized that the birth weight and miscarriage associations should be stronger in boys. In the present study, we investigated these associations in childhood cancers to test their sex-specificity.

Subjects and methods

Cases

The cases were identified from the ‘Northern Region Young Persons’ Malignant Disease Registry’ (NRYPMDR), a population-based cancer registry [32]. The cases analyzed in this study represent 732 of childhood (≤14 years) cancer cases born and diagnosed in the North of England between 1968 and 1992, and who were born in a hospital. There were a further 701 eligible cases in the registry diagnosed with childhood cancer during the same period but their hospital records were unavailable for this study or they were born at home. The reasons for this were either the closure of the maternity unit or failure to keep earlier birth records in some units. The excluded and included cases were compared for their sex, diagnosis and age-at-diagnosis and no significant differences were found (results not shown). Detailed information on case selection can be found elsewhere [33]. Cases were grouped into leukemias and non-leukemia cancers as shown in Table 1. Within the leukemia group, 225 were acute lymphoblastic leukemia (ALL) and 28 were non-ALL.

Table 1 Descriptive data from all cases with childhood cancers and their matched controls

Controls

Controls were selected from the same date and place of birth of cases by taking the 4th, 8th and 12th birth before and after the index birth by using the birth or admission registers. Out of 3,742 population-based controls randomly selected from the same birthplace and around the same date as the index cases, 19 were excluded because of the ambiguity in the information obtained (mainly the lack of sex information). Thus, the number of controls was 3,723 representing an average control-to-case ratio of 5.1 to 1. Further details of control selection are given elsewhere [33]. For the 225 cases with ALL, there were 1,163 matched controls.

Data

The data included variables other than birth weight and they were included in the analysis as covariates. Estimated duration of gestation was based on both the date of the last menstrual period reported by the mother or the assessment of an obstetrician as extracted from maternity unit records. Gestational age data were missing in 87 cases (12%) and 522 controls (14.0%). Gestational age was categorized as term (37 week or more) and preterm (less than 37 weeks). Maternal age data were missing in an even greater percentage of cases and controls (26.5% of cases and 22.8% of controls). Maternal age association was assessed using five quintiles corresponding to 16–21 years; 22–24 years; 25 and 26 years; 27–30 years; and 31–45 years. Birth weight was recorded in either pounds and ounces or grams (g) but all data were converted into g. Birth weight-for-gestational-age z-scores (standard deviations) were calculated by standardizing the raw birth weights for gestational age and sex, according to sex-specific British reference standards [34]. Because of missing gestational age data, z-scores could not be calculated in all cases and controls. Birth weight associations with childhood cancer risk were sought using both weight in g and the standardized (relative) birth weights (z-scores) to distinguish associations with size at birth and size for gestational age. We used categorized birth weight variable to test the linearity of the birth weight association as well as fractional polynomial regression (see below). Birth weight quintiles were used to estimate the strength of the association. The quintiles were assigned using the birth weight distribution in the controls only. A second set of sex-specific quintiles was defined using the birth weight distribution for each sex (controls only). All reported sex-specific results were obtained using sex-specific birth weight quintiles. The age-specificity of the birth weight association was explored for age groups 0–5 years and 5–15 years.

The association with birth-order was examined as an association with the number of previous live-births (0, 1, 2 or more). This variable was derived from the data on the number of pregnancies and previous pregnancy outcomes obtained from the obstetric records of the mothers. The number of previous pregnancies variable was categorized the same way as the live-births variable (0, 1, 2 or more). Miscarriage and stillbirth data were obtained from obstetric records. These two variables were recorded as presence or absence of miscarriage or stillbirth in the past history because of the small numbers of more than one event.

Statistical analysis

Correlations between variables were assessed using Spearman’s rank correlation. We used conditional logistic regression to evaluate the association between the variables of interest and risk of childhood cancer. Univariate odds ratios (OR) and their corresponding 95% confidence intervals (95% CI) were calculated initially for each of the following variables: sex, gestational age, birth weight of the index child, maternal age, presence of previous miscarriage or stillbirth in maternal history, prior pregnancy outcome, and the total number of previous pregnancies and live-births. Multivariable models were constructed to obtain standardized risk estimates. To examine trend for nonlinear association with birth weight, we used fractional polynomial regression modeling which is based on transformation and fractional polynomials and retains the continuous scale of the data without using arbitrary categories [35]. All statistical tests were two-sided.

Analyses were performed for all cancers and also leukemias (ALL and non-ALL) separately. The number of cases for non-ALL leukemias was too small (n = 28) and the results for this group were not presented separately. As the sex-specificity of any association with childhood cancer was of interest, analyses stratified by sex were done to determine sex-specificity of associations. The interaction between sex and variables of interest was also tested using conditional logistic regression. Positive associations were also checked by using the whole control group in non-conditional logistic regression analysis. All statistical analyses were performed using the statistical software package Stata v9.0 (StataCorp, Texas, USA).

Results

Table 1 presents some characteristics of the participants and descriptive data. In the whole group there were 732 cases and 3,723 controls (average case-to-control ratio: 1/5.1). The male-to-female ratio was 1.19 in cases and 1.09 in controls. In the case group, the most common cancer was leukemia (35%) followed by central nervous system tumors (22%) and neuroblastic tumors (9%).

Birth weight

We first examined the linearity of birth weight association using (conditional) fractional polynomial modeling which showed a nonlinear relationship in boys with ALL (P = 0.03). When quintiles based on all controls were used, there was no association in the overall group. In ALL, however, there was an association in boys only (Table 2). Interaction between birth weight and sex was found by both conditional fractional polynomial modeling of continuous birth weight data (P = 0.006) and by conditional logistic regression using the raw birth weight data as a continuous measure and sex being a two-level indicator variable (P = 0.003). Because of the nonlinearity of the association, we present quintile-specific odds ratios using the lowest risk group (second quintile) as reference. In boys with ALL and matched controls, the lowest sex-specific quintile showed a statistically nonsignificant association in the risk direction and the increased risk was statistically significant in each of the higher quintiles with ORs = 2.53, 2.16 and 3.05, respectively (Table 2). The same analysis was repeated using birth weight quintiles that was based on overall birth weight distribution regardless of the sex and a similar result was obtained in boys (not shown). To check whether exclusion of cases due to unavailability of birth records may have introduced any bias, the associations were examined for time periods 1968–1985 and 1986–1992. No difference was found in the results in this split analysis (data not shown).

Table 2 Unadjusted birth weight associations in childhood ALL (OR, 95% CI, P)

We examined the age-specificity of the birth weight association as has been reported in some earlier studies [2, 4, 8, 36]. Similar to previous studies, the risk was higher for children who were younger than 5 years of age at diagnosis for boys with ALL (OR = 5.56 for the highest sex-specific birth weight quintile compared with the second quintile, 95% CI = 1.82–17.04; P = 0.003). In older boys the risk was much lower (OR = 1.38 for the highest quintile) and statistically non-significant (P = 0.60). The weak protective effect of higher birth weight in girls was also stronger in younger children (OR = 0.33 for the fourth sex-specific birth weight quintile compared with the third quintile, 95% CI = 0.12–0.03; P = 0.03) and there was no statistically significant association in girls 6–15 years old (ORs = 0.93–1.25).

Most previous studies estimated the risk for high birth weight using a cutoff value of 4,000 g and low birth weight for the cutoff value of 2,500 g. In our group, there were few children with birth weight above 4,000 g (18 boys with ALL and 72 matched controls). We did not confirm the birth weight association using these cutoff values (three birth weight categories) in boys with childhood ALL (OR = 1.36, 95% CI = 0.80–2.33, P = 0.26). We provide the results using five usual birth weight categories in Table 2 for comparisons with other studies.

When adjusted for gestational age in a multivariable conditional logistic regression model (resulting in a slightly smaller sample due to missing gestational ages), the birth weight association in boys with ALL remained unaffected (OR = 3.02 for the highest sex-specific birth weight quintile compared with the second quintile, 95% CI = 1.36–6.67; P = 0.006).

In the smaller maternal age data-positive subgroup, adjustment for maternal age did not change the magnitude of birth weight association (not shown). The birth weight (quintile) association was also adjusted for miscarriage history. Conditional logistic regression analysis confirmed the independence of these associations: the association with the highest quintile remained unchanged (OR = 3.09, 95% CI = 1.40–6.81, P = 0.005) and miscarriage association was also unaffected (OR = 1.97, 95% CI = 1.07–3.63; P = 0.03).

Logistic regression analysis

In an unconditional analysis, all controls (n = 1,943) were used for comparisons with cases (n = 118), and the birth weight association in boys with ALL remained statistically significant (OR = 2.08 for the highest quintile, 95% CI = 1.11–3.90; P = 0.02). The same approach did not yield a statistically significant (protective) association in girls despite the larger sample size.

Finally, the re-examination of the multivariable model by unconditional analysis confirmed the independence of birth weight (quintile) and miscarriage associations in boys with childhood ALL. In the final model, associations with birth weight quintiles and miscarriage retained statistical significance (data not shown).

Other maternal and reproductive variables

Maternal age

In the most relevant group, boys with ALL and their matched controls had maternal age data in 74.7% and 78.5%, respectively. Despite the smaller numbers, maternal age showed a U-shaped risk association in childhood ALL and again only for boys. When the middle quintile (25 and 26 years) was used as reference, all other quintiles yielded odds ratios greater than 2.0 (2.18–3.50) and all but the lowest quintile (16–21 years) associations were statistically significant (P < 0.05).

Gestational age

Despite the expected strong correlation between birth weight and gestational age (Spearman’s r = 0.37; P < 0.0001), there was no association between gestational age and any groups nor any interaction with sex (Table 3). There were, however, very few preterm babies in the sample.

Table 3 Univariate associations with maternal reproductive history in childhood ALL

Number of previous pregnancies and live-births

There was no association with the number of previous pregnancies or live-births either overall or within the ALL group (Table 3). When we investigated specifically the association with being first-born (no previous live-birth), the ORs were greater than unity but were not statistically significant (data not shown).

Previous pregnancy outcome

As a measure of the relationship between reproductive success and ALL risk, the effect of previous live birth was examined. About a third of boys with ALL (43 of 118, 36%) were preceded by a live birth and 287 of 600 matched controls had the same history (48%). Conditional logistic regression analysis showed a significant protective effect of a live birth in the immediately preceding pregnancy in boys (OR = 0.60, 95% CI = 0.38–0.96; P = 0.03). The same analysis in girls yielded a nonsignificant finding (OR = 0.82, 95% CI = 0.51–1.32; P = 0.41).

Miscarriages

The presence of spontaneous miscarriage in maternal history was associated with increased risk for all cancers combined (OR = 1.29; 95% CI = 1.05–1.62, P = 0.02). For the individual cancer groups, this association was significant only for ALL (OR = 1.56; 95% CI = 1.07–2.27, P = 0.02) and osteosarcoma (OR = 4.68; 95% CI = 1.06–20.7, P = 0.04). Stratification by sex showed higher risk in boys (Table 3) although the difference in risk between the sexes did not reach statistical significance (P for interaction for sex and miscarriage = 0.48). This association did not vary between the age groups in either sex. In other cancers, only osteosarcoma showed a risk association with maternal history of miscarriages but this finding was based on a comparison of 12 cases and 54 matched controls. Logistic regression analysis using the whole control group showed an association with miscarriage in the whole ALL group (OR = 1.54, 95% CI = 1.09–2.18; P = 0.01), which was again statistically significant in boys only (OR = 1.67, 95% CI = 1.06–2.65; P = 0.03) with P = 0.21 in girls.

Stillbirths

Maternal history of previous stillbirth was positive for 10 cases (3 in the ALL group) and 48 controls (19 in the ALL group). These numbers were too small for a meaningful analysis.

Discussion

Using data from medical records, in this population-based case–control study of all childhood cancers, we confirmed the birth weight risk association in childhood ALL and also unraveled its nonlinear nature and sex-specificity. The use of standardized birth weight did not alter the results. This approach and the results it brought up will have an impact on future studies of etiologic mechanisms. We also found that the risk of childhood ALL conferred by a history of miscarriage in subsequent offspring was somewhat greater in boys.

Miscarriage association in childhood ALL is not new. Three major studies, Oxford Childhood Cancer Study [37], Tristate Study [38] and Childrens Oncology Group Studies [36, 39] have reported associations between maternal history of miscarriage and childhood cancer. In the present study, miscarriage history showed an association which was stronger in boys both in matched analysis and unmatched analysis, which used the whole control group. The sex effect in miscarriage association is biologically plausible. The extreme male predominance at the time of fertilization with estimates ranging between 120 and 160 males per 100 females [40] suggests a disproportionate loss of males during pregnancy to result in the sex ratio of around 1.05 at birth. We interpret these data as suggesting that the excess male frequency in childhood ALL may be due to boys that are destined to be miscarried but survive prenatal selection only to remain at high risk for ALL. Earlier studies identified parental HLA sharing as a risk factor for both recurrent miscarriages and childhood leukemia [reviewed in Ref. 41] and molecular studies revealed a homozygous HLA class II genotype as a male-specific risk marker for childhood ALL [25, 27] while overall HLA class II homozygosity was decreased in newborn boys compared with girls [28]. Thus, increased risks for miscarriage and childhood ALL may be related components of perinatal male disadvantage and may share a similar genetic background. The protective effect of a live birth history in the pregnancy prior to the index case in boys only also supports this hypothesis. The sex effect in the miscarriage association requires independent replication and its possible connection with HLA-mediated risk modification of childhood ALL warrants further investigation.

High birth weight has been associated with a generally increased risk for childhood leukemias, neuroblastoma and Wilms’ tumor in previous studies [211]. Despite the sexual dimorphism of birth weight and the sex effect in childhood cancer susceptibility, birth weight association studies generally overlooked sex-specificity of this association. In the sex-matched case–control study of the Nordic countries, sex did not modify the birth weight association [10]. The only other study, which considered sex as an effect modifier in birth weight association in childhood cancer is the UK Childhood Cancer Study Group (UKCCSG) [11]. This multicenter study reported a stronger association (with being heavier than 4,000 g) in girls aged less than 2-years with acute myeloblastic leukemia (AML). We did not have sufficient numbers for AML to examine this association but the mean birth weight in girls with AML was slightly higher (3,467 g, n = 7) than in boys (3,324 g, n = 12). The UKCCSG study, however, did not find the male-specific association that we have noted in ALL. The single-center nature of the present study in an ethnically homogeneous part of England and our demonstration of the nonlinearity of the birth weight association, rather than using cutoff values, may have caused this discrepancy.

Male fetuses grow at a faster rate than female fetuses and subsequently, on average, male infants are heavier than females for the same gestational age [19]. With males being overrepresented in most childhood cancers, a spurious association may arise if sex is not taken into account and if the control group has disproportionate number of males. To control for such differences, in addition to the stratified analysis, we also used birth weights standardized for UK national reference for gestational age. The birth weight associations in ALL did not get stronger when standardized birth weights were used. This observation suggested that size at birth rather than in utero growth trajectory is of etiologic importance in childhood ALL but only in boys. Our dataset had a high proportion of term babies and therefore birth weight was a good indicator of fetal growth.

There may be a genetic component in birth weight determination that also affects susceptibility to leukemia. One of the strongest predictors of birth weight is previous sibling’s birth weight [42]. This finding has been confirmed in leukemic families [10, 11]. One hypothesis proposes that increased growth factor production may increase both birth weight and leukemic cell proliferation [14, 43]. Increasing availability of large biologic sample banks should enable testing of this hypothesis soon. Because body mass and weight also increase adult cancer risk, it has been postulated that cancer risk is proportional to the number of proliferating cells [16, 17]. For all cancers, increased cell division is a risk factor whatever the reason [44]. In childhood leukemia, greater birth weight is thought to be associated with a higher rate of cell proliferation and/or a larger number of precursor cells being at risk of malignant transformation [3]. At present, this is the most parsimonious explanation and may have a genetic as well as an environmental basis. These two hypotheses are not mutually exclusive and a common genetic basis for increased growth factor production and high birth weight is plausible. One other growth factor for both fetal [45, 46] and cancer cell growth [47] is iron. The HFE gene shows a replicated association with childhood ALL in boys only [26], which is likely to be due to the effect of this mutation on body iron content [48]. Iron excess is also linked with gestational diabetes [49] and recently, an HFE association in gestational diabetes has been reported [50]. As gestational diabetes has been proposed one of the possible biological mechanisms of birth weight association with childhood cancer [2], and given the parallels in the epidemiology of diabetes and childhood leukemia, including the sex effect [51], the iron and diabetes connection is also noteworthy. These observations suggest that just like IGF-1, iron may also be connected with both birth weight and leukemia risk.

Our results should be considered in light of the study strengths and limitations. While it has several major strengths such as its population-based nature, unbiased source of information, inclusion of all childhood cancers, sample size and ethnically homogeneous study base, the limitations include our inability to include all of the cases identified for the study period, insufficient power for analysis by type for some cancers, and lack of environmental exposure data. We were also unable to examine the recently reported effect modification of the birth weight association by maternal pregnancy weight [12]. The main reason for not including a case was inability to access the birth records. Cases diagnosed earlier in the study period would be more likely to be excluded because of the lack of birth records and temporal trend in birth weight could distort the results. Our use of controls matched for birthplace and birth date, however, was a safeguard against this. Another possibility is that a bias may have been introduced because of the exclusion of births at home. This is deemed to be an infrequent occurrence and the same exclusion criteria also applied to controls. We are confident that the inclusion of 732 of eligible cases diagnosed in the defined time period and region did not distort the results obtained. In any case, however, the findings of this study, namely the sex-specificity of the birth weight association and the differences in the strength of the miscarriage association in childhood ALL need to be replicated in an independent study.

In conclusion, our findings have provided further insight to the previously known birth weight association in childhood ALL by showing that it concerns boys more strongly. This aspect of these associations has not been widely explored before. When sex is taken into account, genetic association studies also reveal stronger results for boys [2427]. These results should encourage further investigations on sex specificity of risk associations in childhood ALL. The findings are useful in that mechanisms of these associations may be easier to study in the light of the sex effect.