Introduction

There are limited numbers of studies evaluating the association between well-established risk factors for breast cancer by morphological subtype. These studies suggest that the subtypes do not tend to share a common risk profile although some of the results are inconsistent [14]. Some well-known risk factors for breast cancer include family history [58], nulliparity [9], late age at first child birth [9], early menarche [10], late menopause [10], use of hormone replacement therapy [11] and high socioeconomic status [12]. Among women whose first child birth occurred after age 35 years, the risk is higher than those of nullipara [10]. There is evidence that an increasing number of births is a protective factor for ductal carcinoma, but the findings on the lobular subtype are inconsistent [14]. Late age at first birth seems to be associated with lobular carcinoma more strongly than with ductal carcinoma [14]. In one study early age at menarche and late age at menopause showed a higher risk for lobular than for ductal carcinoma [1], but in another study the findings were reversed [13]. Long duration of breastfeeding is observed to have a protective role [12], but this protective effect seems to be limited mostly to ductal carcinoma [1, 2, 13]. Current oral contraceptive use is linked to lobular carcinoma, but the association with ductal carcinoma is unclear [13, 14]. In a meta-analysis, the risk of breast cancer due to hormonal replacement therapy varied according to morphological types whereby the largest relative risks were seen for lobular and tubular carcinomas [15]. Additional risk factors include alcohol use and obesity in postmenopausal women [12]. A recent review found evidence for a reduced risk for physical activity, especially for postmenopausal breast cancer [16].

Data on risk factors for histology-specific breast carcinoma in situ are sparse. In one study, ductal carcinoma in situ was positively associated with a family history of breast cancer, mammographic examination, clinical breast examination, low gravidity, late age at first birth and late menopause [17]. Similar associations were observed for lobular carcinoma in situ although some of these did not reach statistical significance.

We use here the year 2006 update of the nation-wide Swedish Family-Cancer Database, covering 11.5 million individuals and some 1.2 million tumors retrieved from the Swedish Cancer Registry. The Database offers unique possibilities for many types of family studies, because complete data are available on family members and their cancers, supplemented by socio-economic and other background data from the national censuses. Our aim was to determine relative risks (RRs) and population attributable fractions (PAFs) of invasive and in situ breast cancer for reproductive, familial, socioeconomic and residential factors according to morphological subtype.

Materials and methods

Statistics Sweden created a family database, “Second Generation Register” in 1995. After a few expansions, it covered offspring born after 1931 with their parents, renamed to “Multigeneration Register” to indicate that the number of generations was more than two. We have linked this Register to the Swedish Cancer Registry (1958–2004) to make the Family-Cancer Database (MigMed2) in year 2006 for the seventh time. In the Database all data are organized in child–mother–father triplets; the parents have been registered at the time of birth of the child, allowing tracking of biological parents. The Database includes all persons resident in Sweden after 1931 with their biological parents, totalling over 11.5 million individuals. The present study included women born between years 1932 and 1953, i.e., those whose minimal age at the beginning of the follow-up ranged from 40 to 61 years.

The completeness of cancer registration in the 1970s has been estimated to be over 95%, and is now considered to be close to 100%. The percentage of cytologically or histologically verified cases of breast cancer has been close to 100% [18]. The Swedish Cancer Registry is based on compulsory notification of cases [18]. A 4-digit diagnostic code according to the International Classification of Diseases, 7th revision was combined with morphology codes according to the International Classification of Diseases for Oncology, World Health Organization (WHO) and the Systematized Nomenclature of Medicine (SNOMED) used since 1993. The ICD-7 code 170 was used to identify breast cancer cases and a Swedish selection of SNOMED was used to classify the morphological subtypes.

Follow-up was started at immigration or January 1, 1993, whichever came latest. Follow-up was terminated on diagnosis of first cancer, death, emigration or the closing date of the study, December 31, 2004. A Poisson regression analysis was performed to model overall invasive and in situ breast cancer incidence using following variables: age at diagnosis (5-year bands), family history of invasive breast cancer (mother, sister, no history), parity (0, 1, 2, 3+), age at first child birth (13–20, 21–24, 25–29, 30+ years), socioeconomic status (manual worker, blue collar, professional, other) and residential area (big city, south, north) [19, 20]. Similar separate Poisson regression analyses were also performed for each of the morphologic-specific breast cancers: ductal, lobular and comedo invasive and in situ carcinomas. RRs and confidence intervals (95% CI) were calculated using the result of the Poisson regression. For simplicity we do not show results of age at diagnosis, although this variable was always included in the model.

The PAF is the proportion of disease cases in a population that is attributable to a particular exposure. Using the relevant variables from the Poisson regression models, individual PAFs due to each of the variables were calculated on the basis of RRs generated by new Poisson regression models with the variables classified dichotomously (exposed/unexposed). The population belonging to the lowest level of the variables in the initial Poisson regression models was defined as unexposed and the other levels were aggregated into the exposed group. PAF was calculated according to the formula ((RR − 1)/RR) × the proportion of cases in the exposed population, where RR was the risk in the exposed population [21]. Joint PAFs were calculated by defining the unexposed population as those being unexposed to all risk factors simultaneously. For joint PAF we used the formula ((incall − incref)/incall) × 100, where incall denotes the overall population incidence and incref denotes the incidence in the unexposed population. The incidence was age-standardized according to the Swedish census 2000. CIs for PAFs were estimated by bootstrapping with 1,000 simulations [22].

Results

Our study covered 1,028,455 women who were followed from 1993 through 2004 and who accumulated 11,066,977 person-years at risk. Invasive and in situ breast cancer was identified in 27,243 and 3,496 women, respectively, with data on family history, reproductive variables, socioeconomic status and residential region. Table 1 shows the distribution of cases and person-years by all categories in each of the morphologies. The majority of the invasive morphology-specific cases were ductal (62%) and lobular (14%), whereas only 7% were tubular and 3% comedo.

Table 1 Cases and person-years in breast cancer

The variables included in the Poisson regression models were age at diagnosis (data not shown), family history, parity, age at first child birth, socioeconomic status and residential area, all of which had a significant or borderline significant effect on the risk of breast cancer (Table 2). A family history through a mother proband increased the risk of overall invasive breast cancer to 1.64 and a through a sister proband to 1.77 relative to women with no familial history. There was a clear trend for increasing RRs with increasing age at first child birth and low-gravidity showed significant elevated risks with uniparity showing the highest RR of 1.20. High socioeconomic status was associated with increased risk, as was living in big cities.

Table 2 Relative risks in invasive breast cancer according to the Poisson model

As for the overall invasive breast carcinoma, the morphology-specific types (Table 2) showed stronger positive associations with a family history through a sister proband than through a mother proband (ductal: 1.70 and 1.67, respectively, lobular: 2.09 and 1.66, respectively, tubular: 2.09 and 1.68, respectively); the exception was the comedo morphology, for which the risk was lower through a sister proband (1.23) than through a mother proband (1.89). There was a clear trend for increasing RRs with increasing age at first child birth for all the morphologies, with the strongest effect observed for lobular carcinoma. On the other hand, low parity had little or no effect on lobular and comedo carcinomas as opposed to the ductal and tubular subtypes, where the most elevated risks were observed for uniparity. Nulliparity was inversely associated with lobular and comedo carcinoma, but the reversed result was observed in the ductal and tubular subtypes with risks lower than for uniparity. Residential region had very small or moderately elevated risks with the reference group ‘North’ in all morphologies and with the lowest RRs found in the ductal morphology. There was also a moderately positive association with high socioeconomic status in all morphologies, where the strongest effect was observed for lobular and tubular carcinoma and the weakest for the comedo subtype.

The corresponding results for in situ carcinoma are shown in Table 3 with the ductal morphology constituting 70% of all cases. Here, family history refers to the family history of invasive breast cancer. The association with a mother history was stronger in comparison with the corresponding invasive analyses, with the exception for the comedo morphology; by contrast, the sister history association was weaker for overall and ductal carcinoma in situ. The strongest associations with family history were observed for lobular carcinoma in situ, for which the RRs were 2.24 (mother) and 2.43 (sister). A clear trend for increasing RRs with increasing age at first child birth in all the in situ models were observed, with equal effects on ductal and comedo carcinomas in situ. Unlike the invasive models, the effect on lobular carcinoma in situ was the smallest and it was less pronounced than for the invasive lobular subtype. On the other hand, the effect of age at first birth on ductal carcinoma in situ was found to be more pronounced than the corresponding invasive model. Clear trends for decreasing RRs with increasing parity were observed for the overall and ductal in situ carcinoma. As for the invasive models, low parity had little or no effect on the lobular and comedo in situ subtypes, but nulliparity was associated with insignificant elevated risks. Residential region had small to moderately elevated risks with the lowest RRs in the ductal in situ morphology and the highest RRs in comedo carcinoma in situ. Lobular and especially comedo carcinoma in situ was more strongly associated with living in the more densely populated south and big cities compared with the corresponding invasive subtypes. There was a positive association with high socioeconomic status in all the in situ models and the associations for the lobular and comedo subtypes were stronger than the corresponding invasive analyses. The most pronounced effect of high socioeconomic status was observed for lobular carcinoma in situ.

Table 3 Relative risks in in situ breast cancer according to the Poisson model

For calculation of PAF, the individual variables in Tables 2 and 3 were classified dichotomously into two groups, the exposed group and the reference or unexposed group (Table 4). As nulliparous women have no age at first birth, parity and age at first birth were merged into one variable. In order to calculate a joint PAF, all the unexposed groups of the individual variables, were combined simultaneously to form the joint reference group. Socioeconomic status was excluded in the in situ analyses due to the limited number of cases in the joint reference groups, but this did not alter the remaining individual PAFs. The joint PAFs in all invasive breast carcinoma and ductal carcinoma were equal, 26.9% and 26.6% respectively, which was higher than the individual PAFs due to reproductive factors (17.9% and 16.6% respectively). The lobular and comedo morphologies had notably large joint PAFs of 56.5% and 62.2% respectively, explaining more than half of all cases. These PAFs were, however, based on very few cases in the joint reference groups. In the all in situ and the ductal in situ models, the sum of the individual PAFs were equal to the joint PAF whereas in all the other models the sum of the individual PAFs exceeded the joint PAF. The reason for the latter could be that some of the individual risk factors may cancel out. With a few exceptions, the joint PAFs and individual PAFs due to family history and reproductive factors in the in situ models exceeded those in the corresponding invasive models.

Table 4 Population attributable fractions (PAFs) in breast cancer

Discussion

The major strength of this study is the large number of women from the nation-wide Swedish Family-Cancer Database and the registered and unbiased information on all variables. One limitation of the data source is the lack of information on known or potential risk/protective factors such as oral contraceptive use, hormone replacement therapy, breastfeeding, menstrual history, alcohol use, obesity and mammographic screening.

In this study family history, low parity, late age at first birth and high socioeconomic status increased the risk of both invasive and in situ overall breast cancer. As the ductal morphology constitutes the majority of all cases, it is not surprising that this subtype generated similar results as the overall model. The role of low parity or nulliparity was, however, different in the invasive lobular and comedo morphologies. For these morphologies, nulliparity had RRs below unity and there were no or small insignificant elevated risks for low parous women. These findings were also observed for lobular and comedo carcinoma in situ, but the result for nulliparity was reversed. Lobular carcinoma deviated from the other morphologies regarding family history and age at first birth; having a family history was most strongly associated with in situ tumors while age at first birth had the largest impact on the invasive type.

High socioeconomic status is a well-known risk factor [12], in which education is an important component. One recent study, based on the Family-Cancer Database, showed that university graduates were more likely to be diagnosed with in situ and invasive ductal cancer as well as in situ and invasive lobular cancer compared with women completing less than 9 years of education [23]. Furthermore, the association with invasive lobular cancer was stronger than with ductal cancer [23]. These findings are in line with our present study, although the correlation between education and socioeconomic status in Sweden is moderate [24]. High socioeconomic status was, in our study, found to have the largest impact on lobular carcinoma in situ.

The PAF is a summary measure for characterizing the public health impact of an exposure on population patterns of disease. PAF is the proportion of disease cases in a population that is attributable to a particular exposure or “the fraction of all cases (exposed and unexposed) that would not have occurred if exposure had not occurred [21, 25, 26]“. Assuming exposure to be causal and removable, PAF could be used to estimate the potential impact of public health interventions. In this study however, exposures are not removable, and PAF is used as an etiological measure. The joint PAF of more than one exposure can analogously be defined as the fraction of cases that would not have occurred if all the exposures were avoided. Large PAFs indicate that a large proportion of the etiology is understood at the level of the defined variable. Since PAF here is defined as the product of the expression ((RR − 1)/RR) and the proportion of cases in the exposed population, large PAFs can only result from a high RR and relatively common exposure or from a moderate RR and common exposure.

A number of previous studies have been conducted to examine the impact of modifiable risk factors on breast cancer. One study estimated a PAF of 9% due to hormone replacement therapy for Australian women aged 40–79 years [27] and in a meta-analysis PAF due to alcohol use in the UK was 6% [28]. Another study estimated that low β-carotene intake, high alcohol intake and low levels of physical activity together explained 33% of the cases of breast cancer in Italian women [29]. Regarding unmodifiable risk factors, previous studies have estimated general invasive PAFs due to family history in different populations with varying results from 2.5% to 17% [3034]. To our knowledge only one study has previously estimated morphology-specific invasive and in situ PAFs due to family history [35]. However this study, which was based on an older version of the Swedish Family-Cancer Database including women 0–66 years, did not include any other risk factors. Our findings showed similar PAFs throughout all invasive morphologies, regarding family history and reproductive factors. In these morphologies, family history accounted for the smallest PAFs (between 5.2% and 6.2%) due to the small proportion (13%) of exposed women although these women had the highest relative risks. These findings are in line with our previous study although the women belonged to another age group [35]. The broad definition of exposure due to parity and age at first child birth resulted in very large (more than 90%) exposed groups, with resulting PAFs between 16.6% and 25.9% although the corresponding RRs were moderate. In the study of Tavani et al. [31], PAF due to nulliparity and late age at first birth was estimated to 38.2% for 23–74-year old women compared with 29.5% for equally old women in the study of Madigan et al. [32]. These results are, however, not fully comparable with ours as the women belonged to another age group and the definition of late age at first birth included women of all parities. High socioeconomic status held similar RRs as the reproductive factors, with the exception of comedo carcinoma, but the exposed groups were smaller (70%) giving smaller PAFs between 12.3% to 18.0% for the major morphologies and 6.9% for comedo carcinoma. These estimations are a possible result of potential risk factors correlating with socioeconomic status, such as low level of education, alcohol consumption, hormone replacement therapy, obesity, dietary habits and physical inactivity [2729]. Higher PAFs due to high education (20.3%) and high income (18.9%) were reported in the studies of Tavani et al. [31] and Madigan et al. [32] respectively. The small to moderate PAFs due to being resident in big cities or in the more densely populated south are likewise a possible result of the potential risk factors correlating with socioeconomic status stated above. These results could also be explained by different pathology classification criteria across medical regions and different access to hospital care. In the lobular morphology, the joint PAF of 56.5% suggests that the four variables explain more than a half of all cases. This figure should however, be interpreted with care as it was based on few cases in the joint reference group.

Also in the in situ models, family history accounted for the smallest individual PAFs and reproductive factors for the highest PAFs, except for comedo carcinoma in situ. The in situ PAFs were larger than or equal to the corresponding invasive ones due to family history and reproductive factors, with the exception of comedo carcinoma in situ. These results show some discrepancy with the PAFs due to family history in our previous study [35], but we would like to point out again that the women belong to another age group. Moderate and high PAFs due to residential area were observed in the specific lobular and comedo in situ morphologies with larger results in the in situ models. These two in situ morphologies also had the largest joint PAFs, but as they were based on very few cases, a meaningful interpretation is hard to make.