Introduction

According to the Cancer Registry of Norway, a cumulative risk of developing breast cancer by age of 75 is 7.9, and ranks highest among all cancer diagnoses in women [1]. Among all diagnoses of cancer in women, breast cancer is diagnosed in 34 % and 28 % of women in age groups 25–49 and 50–69 years, respectively. Considering the age span, many of these women are likely to be employed when malignancy determined. Moreover, since most women suffer an early stage of breast cancer, which considerably increases the survival odds, the issue of income development for female breast cancer survivors is highly relevant.

There is a substantial international literature on income development for breast cancer survivors, but since income is highly contingent upon local labour markets, we have restricted our attention to the Scandinavian context. An increased risk of experiencing income reduction was shown in one Danish [2] and three Norwegian studies [35]. At the same time, a Swedish study [6] found no impact of cancer on the incomes of breast cancer patients compared to cancer-free controls. This study thus aimed to assess the differences in average annual income development of breast cancer patients and matched cancer-free controls in Norway. We hypothesised that breast cancer patients would show persistently significant reduction in annual income development compared to controls.

Though a number of other studies find significant income differences, this paper adds new insight to related research in several ways. First, this study was based on high-quality merged data from two nationwide registries. Annual income development of breast cancer patients was tracked from 1 year prior to diagnosis until death or censoring nine to 13 years later, which constitutes a considerably longer follow-up period than most studies in the field. Secondly, our study resembles studies [4] and [5] but differs in key aspects. To assess income differences between cancer survivors and cancer-free controls, the difference-in-differences approach was employed in [4]. Statistically significant cancer-associated declines in earnings, not including compensatory measures, were reported for up to 9 years after diagnosis. Unlike the present study’s analysis, the analysis in [4] was restricted to women who were employed in the follow-up year, thus excluding the financial disadvantage of being completely unemployed. Meanwhile, the analysis in [5] used a subset of the same case–control dataset that we analyse in the present study, only the analysis in [5] was restricted to women who did not receive either disability- or retirement pension while the follow-up time was limited to 9 years. The study [5] analysed the probability of an income reduction of more than 10 % and found a statistically significant effect of breast cancer for only 2 to 4 years following diagnosis. The present study is more sophisticated statistically, for it utilises the longitudinal structure of the data. Furthermore, since income is modelled as a continuous variable, our study avoids the information loss caused by dichotomisation. The statistical methods developed for this study can thus be considered to be an independent contribution to the field.

Material and methods

Data sources

This study was based on data from two nationwide Norwegian registries whose merging was possible due to the unique personal identity numbers given to all Norwegian citizens. The Cancer Registry of Norway (CRN, established 1953) has collected data on all new cases of cancer as per statutory registration requirements for all hospitals, laboratories, and general practitioners. Diagnostic data from the CRN are nearly complete and of high quality [7]. FD-Trygd, Statistics Norway’s event database (established 1992) contains information regarding demographics, social security benefits from the National Insurance Scheme, education level, employment status, and income for the entire Norwegian population.

Sampling of patients and cancer-free controls

Patient sampling for the study started in 1992, the first year when dynamic data on national insurance contributions became available. Eligibility criteria for identification of breast cancer patients in the CRN were women aged 45 to 54 who had been diagnosed with breast cancer as their first life-time malignancy between 1992 and 1996. This 5-year recruitment period was determined in order to obtain a reasonable follow-up time, while the age span assured a high proportion of employed women, since the formal retirement age in Norway is 67 years. The CRN identified 2,052 women who fulfilled these criteria.

Cancer-free controls at inclusion and during follow-up matched for age, marital status, and municipality of residence were drawn randomly from the Norwegian population registry by Statistics Norway, one control per case. The controls were cancer-free, based on checks in the CRN. Throughout this article, the term ‘year of diagnosis’ also refers to controls, as well as primarily referring to the year of breast cancer diagnosis for their matched case.

Among the 4,104 women in the sample, 14 did not have a registered income and 22 had missing income data for some years during the study period. These 36 women were excluded from the analyses along with their matched controls. Six controls died before the time of cancer diagnosis and were also excluded along with their matched cases. Thus, our sample consisted of 4,020 women: 2,010 breast cancer patients and 2,010 matched controls. By comparison, the cohorts of 1992 to 1996 contained 342 (17.0 %), 362 (18.0), 373 (18.6), 436 (21.7), and 497 (24.7) pairs, respectively.

Time to events

For each case and control, observation time began the pre-diagnosis year. All women were followed until death or censoring at the end of the observation period, 31 December 2005. Hence, the maximum follow-up time varied between nine and 13 years depending on the year of diagnosis (i.e. 1992–1996).

Income

The income variable ‘pension qualifying income’, in this study, is referred to simply as income. It is defined as personal income excluding transfers of social security benefits, such as sick-leave pay, unemployment benefits, work assessment allowance, and disability pension. The income variable was adjusted for the consumer price index (CPI) for 2012.

Socio-demographic variables and stages

Age was used as a continuous variable. Level of education was coded as ‘elementary school’, ‘secondary school’ or ‘higher education’. Immigration history was divided into either ‘no immigration history’ or ‘immigration history’. Family type was coded into three categories: ‘married or in-live companion with children’, ‘married or in-live companion without children’ and ‘other’, while number of children aged less than 18 at the time of inclusion was categorised into ‘zero’, ‘one’, ‘two’, and ‘three or more’. The socio-demographic variables at year of diagnosis were employed throughout the analyses.

Staging of breast cancer has been described previously in [5]. The Classification of Malignant Tumours (TNM) system classifies cancer into four stages (i.e. I to IV), each characterised by tumour size (T), involvement of regional lymph nodes (N), and distant metastasis (M).

Constructing the income variable

To study how cancer affects income, the following computational steps were performed for each woman in the dataset. To align income data, year of diagnosis was set at year 0 for all included women. The income year before diagnosis (year −1) was used as reference income. For years 1 to 13, income development was defined as that year’s income minus the reference income. Hence, a woman’s income development for a certain year represents the change (i.e. increase or decrease) in income from 1 year pre-diagnosis to the specific post-diagnosis year. Women who died within the given follow-up year were excluded from the analysis of that and sub-sequent years. To preserve the comparability achieved by the matching procedure, data for the pair were excluded during the year when one of the women died, as well as all subsequent years.

Statistical analyses

Descriptive statistics were used to present the socio-demographic variables; a t test for independent samples and a χ 2 test were used to compare cases and controls at inclusion.

Differences in income development between cases and controls were assessed by t test for independent samples, separately for each year during follow-up. Such an unadjusted comparison of income development suggests a clear interpretation and shows how the income of two groups develops over time.

A linear regression model was estimated to assess the impact of cancer diagnosis on income, adjusted for the available socio-demographic variables that the matching procedure had not utilised (level of education, immigration history, and number of children <18 years). Each data record was based on both a case–control pair and a given number of follow-up years during which both members of each pair were alive. The dependent variable was difference in income development between a case and her control. The main predictors were dummy variables encoding the number of years since diagnosis, which offered 14 indicator variables associated with years 0 to 13 after diagnosis by using the pre-diagnosis year (−1) as a reference category. To adjust for a possibly non-linear effect of age on income, the pair’s age was encoded using dummy variables representing all possible age values in the study period by using the youngest women (aged 44 in the pre-diagnosis year) as reference. The socio-demographic variables at baseline for each case and control were included as confounding variables.

Income data demonstrated strong clustering, in the sense that differences in income development over time tend to be similar for a given pair. Thus, the standard linear regression assumption of independent observations was violated. Therefore, a linear mixed model accounting for cluster effect in data was developed with a random effect for pairs (see Appendix). Regression coefficients associated with each year during follow-up were presented with the corresponding 95 % confidence intervals (CI). These coefficients should be interpreted as the effect that different numbers of follow-up years has on the difference between average income development for cases and controls. Results were presented graphically by plotting the regression coefficients for each dummy variable with the corresponding 95 % CI.

Identical analyses stratified by stage of breast cancer were also executed. Also estimating the cumulative loss in income at different follow-up years is relevant. This cannot be reliably estimated by adding the income loss estimates for the different follow-up years, as these estimates are related to different subsamples. For example, the income development in the first follow-up year might be more favourable among women that were alive eight years after diagnosis compared to all women alive one year after diagnosis. A different version of the regression model was therefore estimated, where the dependent variable was the difference in cumulative income development between alive cases and controls. The main predictors of this model were interpreted as the average total loss in income since cancer diagnosis, for the different number of years survived.

All analyses were performed with SPSS version 20 and SAS version 9.2. The level of significance was set at 5 %. All tests were two-sided.

Ethics

Data produced by national administrative agencies registering the entire population are transferred to Statistics Norway according to the Statistics Act in relation to official statistics. Individuals cannot refuse to have their personal data included. Privacy concerns are addressed by the Personal Data Act that sets forth procedures for handling and processing of such data. No personal informed consent from the individuals constituting the sample of this study is thus needed according to Norwegian legislation. Data were delivered in a de-identified form from Statistics Norway. The study was approved by the Regional Committee for Health Research of South-eastern Norway.

Results

Of a total 4,020 women, 638 (592 patients and 46 controls) had died at the time of censoring (i.e. 31 December 2005). Table 1 presents descriptive statistics. There were statistically significantly more controls than cases with education at the lowest level (p = 0.001), with immigration history (p < 0.001), and with three or more children (p = 0.025). There were no significant differences in age or marital status. As expected, due to cancer-related mortality, the proportions alive for 1, 5, 10, and 13 years after diagnosis were higher among controls. Most cases (91.9 %) were diagnosed at stage I or II (see Table 1).

Table 1 Socio-demographic characteristics of women with breast cancer and matched controls

At inclusion, the mean CPI adjusted income was significantly higher in cases than controls (14,563 € with standard deviation (SD) 9,993 € versus 13,344 € with SD 10,123 €; p < 0.001). The differences in average CPI adjusted income for cases and controls (see Table 2 columns 3 and 4) were small and remained quite stable throughout the follow-up period. Table 2 presents the average income development for cases and controls as well as differences between the two (columns 5–7) for each year during follow-up. Figure 1a illustrates the income development for cases and controls. The income for cases was rapidly reduced immediately after diagnosis, and 1 year after diagnosis the income between cases and controls became significantly different (p < 0.001) according to the independent samples t test (see Table 2, column 8). This pattern remained persistent for the rest of the follow-up period. In Fig. 1b, the results of the regression analysis are plotted along follow-up years together with the corresponding 95 % CI. The regression coefficients associated to the indicator variable are to be interpreted as the effect that the different number of follow-up years has on the difference between average income development for both cases and controls. Clearly, the income development adjusted for the socio-demographic characteristics showed the same tendency (see Fig. 1a). Table 2 presents numerical results from adjusted regression analysis (columns 9 and 10).

Table 2 Means and standard deviations (SD) of CPI adjusted income by the year after diagnosis, and the impact that breast cancer has on a survivors’ income (and cumulative income) development, adjusted for level of education, immigration history, and number of children <18 years
Fig. 1
figure 1

Development of income for cases (solid black line) and controls (solid grey line) after diagnosis with the corresponding 95 % CI (a) and differences in development of income adjusted for level of education, immigration history, and number of children presented as regression coefficients with the corresponding 95 % CI (b); −1 corresponds to the year before diagnosis

As shown in Table 2 (columns 11 and 12), at the fourth year after diagnosis, the difference in cumulative income development became significantly different between cases and controls and preserved this tendency throughout the follow-up period.

Analysis of income development by stage demonstrated marginal differences between stage I cases and their matched controls. With increasing stage, differences became more pronounced though not always statistically significant. There were few stage III and IV cases in the sample, and though large, the differences were not significant (see Table 3 and Fig. 2 for results).

Table 3 The analysis of the impact breast cancer has on a survivors’ income development, adjusted for level of education, immigration history, and number of children <18 years stratified by stage
Fig. 2
figure 2

Differences in development of income adjusted for level of education, immigration history, and number of children presented as regression coefficients with the corresponding 95 % CI (B); −1 corresponds to the year before diagnosis

Discussion

The hypothesis that cases with breast cancer would show a persistent and significant reduction of annual income compared to controls was supported throughout the follow-up period of 13 years. At 1 year after diagnosis, the income level for cases had already become significantly lower than for controls, and this difference persisted for the reminder of the observation period. Even for those relatively few case–control pairs (n = 211) alive 13 years after diagnosis, a statistically significant effect of cancer diagnosis on their income development occurred. The effect of cancer on the cumulative income showed a stronger effect for longer follow-up periods, with an estimated total income loss of 39,403 € after 13 years (p < 0.001). Analyses of income development stratified by stage showed a similar tendency, with stronger effects for more advanced stages. Income development among stage I cases was similar to controls, and no statistically significant differences were found. For stage II to IV cases, differences were large but not always statistically significant due to few cases with more advanced stages of disease.

It is important to note that the income variable used in this study does not account for transfers of social security benefits, such as sick-leave pay, unemployment benefits, work assessment allowance, or disability pension. In Norway, one can be on sick leave for up to 1 year after diagnosis and still receive up to 100 % of the salary. As a basis to calculate basic benefits, supplementary pension, and supplement for spouse and children, six times the basic amount of the National Insurance Scheme is used. Only persons with income below such an amount are fully compensated by sickness benefit. Consequently, the economic hardship for cancer patients usually begins the year after the 1 year of receiving sick-leave pay when their opportunities for economic compensation are considerably reduced. The analysis in this paper therefore might overestimate the effect that breast cancer has on the affected women’s economic situation, particularly the first year after diagnosis. Nevertheless, findings are mostly in accordance with comparable studies [4, 5]. Even with simpler statistical analyses, income differences between cancer survivors and cancer-free controls for up to 9 years after diagnosis were detected in [4]. In study [5], while assessing the probability of an income reduction of more than 10 %, statistically significant effects of breast cancer on income were shown only 2 to 4 years after diagnosis. The discrepancy between [5] and the present study, however, is not surprising since there is considerable information loss due to dichotomisation in [5].

The content of the income variable was changed from 1 January 2002, by also including work-rehabilitation benefits. However, since this study assessed income development between two comparable groups, modifying the income variable for the last 3 years of the study most likely had a negligible effect on results. Moreover, analysis of yearly age-adjusted income development in controls showed no substantive change between 2001 and 2002.

Level of education and number of children has been reported to be associated with breast cancer [3], and these variables are statistically different for cases and controls in the dataset. Regression modelling is therefore warranted to control for these effects. It should be noted, however, that these effects are relatively small, thus a crude comparison of income development in breast cancer–control cohorts matched on age, marital status, and municipality of residence may be justified in future studies when further socio-demographic variables are unavailable.

In conclusion, breast cancer has a significant impact on survivors’ incomes, even 13 years after diagnosis. The completeness of data from the CRN, which provides the opportunity to find matched cancer-free controls in registries of Statistics Norway, is this study’s chief strength. Moreover, an extended follow-up period of 9 to 13 years of income data contributes substantially to the field. Finally, the statistical method applied in this study fully explores the longitudinal nature of the data.

Our findings suggest that breast cancer survivors may require financial compensation in order to maintain the same standard of living as their cancer-free controls. Such compensation could take the form of a government-financed program or private insurance. In either case, this study offers a starting point for estimating the associated costs in the Norwegian context.