Introduction

As for many other diseases, social inequalities have been reported also for malignancies. Lung cancer incidence was reported to be higher in lower socio-economic groups [1]. This finding emerged in surveys [2] and also in registry-based studies [3], and similar results had been reported for case-fatalities [4]. Besides individual-based indicators, area-based measures were also used for classifying target populations by socio-economic status [5]. In the geographical area that was assigned the lowest category, the risk of dying due to lung cancer was about 1.6 in men between 25 and 64 years and 1.4 in men aged at least 65 years. The comparison group was made up out of individuals living in areas that were assigned the highest classification. Increased risks of contracting lung cancer had been reported in a number of studies [610], and variations in incidence were consistently following corresponding patterns of smoking behavior.

Gastro-intestinal cancers had less often been considered. In a Dutch study using data from 204 communities embracing 58,279 males, 162 cases of gastric cancer were identified [11]. In men with the lowest qualification the highest incidences were found. Their relative risk was 54% in individuals with the highest educational level, for stomach cancer the respective relative risk was RR = 0.37. These figures were somewhat reduced after confounders had been controlled for, but the basic findings remained unchanged. Social differences had also been found in other studies [1214]. The evidence is, however, not consistent since in come cases incidences increased with socio-economic position [12, 15].

Breast cancer is deviating from this general pattern of health inequalities as it was reported to occur more frequently in women from higher socio-economic background [16, 17], and this holds for wealthy as well as for poor or developing countries [18]. In a study where patient records and census information were combined, breast cancer risks and social position showed a linear increase [19]. This was confirmed in an analysis embracing 97,227 women [20]. The gradients were also varying with respect to ethnic origin. Pukkala and Weiderpass analyzed all cancer cases in Finnish women with onsets between 1971 and 1995 and reported breast cancer risks to increase with social class. [21] In an age-cohort study no associations between breast cancer occurrence and socio-economic position had been reported [20].

The social distribution of breast cancer incidence differs from corresponding patterns of case fatalities. While in most studies on the relationship between breast cancer and socio-economic status positive correlations were reported this is different for death cases, although the findings are not homogeneous. Most breast cancer studies have reported higher rates in women in disadvantaged social positions [1924], but in an Italian study higher death rates in women from higher socio-economic background were reported [25]. The socio-economic differences between incidence and deaths are remarkable and may be explained by variations in illness behavior and medical treatment, but the effects of socioeconomic position remained after these factors had been taken into account [23, 24].

In Germany, research on social inequalities in cancer is rare, and to our knowledge only a few studies are available. One of them is dealing with survival of colorectal cancer [26], but socio-economic position had been determined on the basis of patients’ residence, and the second study dealt with cancer in general [27].

Cancer registration in Germany is practiced at a regional basis [2830], and the data are not always comparable. In contrast to other countries [23, 24], the registry data do not include information on socio-economic position, thus survival rates can only be examined on the basis of age, gender, or medical variables.

The following analyses are conducted with data from a German statutory health insurance. For Germany this is one of the few approaches to study social differences in malignant diseases on an individual basis. The dataset contains information on the frequency of treatment including dates of hospital admissions and discharges. This permits to study disease courses as far as they are documented as treatment records, and finally also the date of death is available. The disadvantage of health insurance data is the absence of information on tumor type, stage, and therapy, and the records do not include causes of death. On the other hand, individual-based data on education and on occupational position are available. Most earlier studies had been based on one indicator only, e.g., last occupation [24, 31], or education [32], and only one was based on education and occupational position [4]. It had been shown that the most frequently used indicators of socio-economic position (occupational position, education, and income) were not highly correlated, thus they must not be considered as interchangeable [33, 34]. If no association between one of these indicators and disease outcomes emerge, it may be different if another indicator is chosen. In earlier analyses it had also been shown that the relative strength of their effects were varying according to outcome [34].

In the following analyses occupational position and income will be considered together thus making it possible to compare their relative effects. As such analyses have not yet been conducted for Germany. They will contribute to closing the gap of knowledge on social inequalities of cancers.

The following questions will be dealt with:

  • Do health inequalities with respect to incidence of cancers of the lung, the stomach, the intestine, and the breast exist in this insurance population?

  • Do case fatalities of lung cancer, stomach cancer, cancer of the intestine, and breast cancer differ by social position?

  • Are health inequalities in malignant diseases more strongly associated with income or with occupational position?

Materials and methods

Study population

The following analyses are based on records of a large statutory health insurance (Allgemeine Ortskrankenkasse) covering the years 1987–1996. Actually the data were collected for accounting purposes. The dataset also includes retired and co-insured subjects, the latter mostly being wives. Due to peculiarities of the German health insurance system, lower socio-economic groups are overrepresented [35]. Retired individuals were assigned their highest occupational position attained, and jointly insured individuals were classified according to their spouses’ occupational position and income.

The study population consists of 170,764 men (56.1%) and women (43.9%) at the age of 35 up to 70 years and insured for at least 365 days. For analyses of case- fatality the age limit was extended to 75 years in order to take a five-years period [36] after disease onset into account.

Taking a population of >80,000,000 residents into account, the number of uninsured persons in Germany is low. In 1991, 232,000 were without health coverage and 105,000 in 1995. Reasons for not being covered were long-term unemployment, homelessness, and being self-employed. In the latter case income was only loosely related to the lack of health coverage [37]. In addition, the upper 10% of the income distribution are not represented in the data since the majority of them were privately insured.

The dataset is left- and right-censorized. For cancer incidence an insured person starts contributing person years at risk if exceeding the age of 35. In case of joining the insurance at a later age, periods of insurance were summed up until reaching the age of 70, until cancer was documented, or until leaving the health insurance. For case fatality persons with cancer as described above were considered from the date of documented onset to the following five years, until the date of death or until leaving the health insurance.

In Table 1 the basic description of the study population is displayed. For 57.5% of the study population income information was not available. This is due to retirement, unemployment, or to exclusions due to irregularities in the records [38].

Table 1 Distribution of occupational positions by gender

Numbers of cancer cases were sufficient for performing regression analyses (Table 2). The survival rates of the time periods under observation are 35% (women) and 31.7% (men) for lung cancer, for stomach cancer they are 60% (women) and 55.6% (men), for intestinal carcinoma they are 70.8% (women) and 71.1% (men) and finally 70.7% for breast cancer. The survival rates in this dataset are higher than those reported from German registries [36]. This is due to cancer patients not only leaving their insurance due to death, but also by changing the insurance. Thus Table 2 may not depict all deaths due to the malignancies considered; nevertheless the results are valid, because the analyses have taken lengths of observation into account.

Table 2 Number of incident cases and number of deaths in incident cases for different types of malignant diseases for women and men

Occupational group membership was determined by using a three-digit classification issued by the German Labor Authority [39]. These categories were collapsed into five groups: “unskilled and semi-skilled positions,” “skilled manuals,” “skilled non-manuals,” “intermediates,” and “professionals.” Due to the small number of professionals they had to be counted together with intermediate positions, and in the regression analyses they will serve as the reference category.

Data on occupational changes are routinely transferred from employer to health insurance. In the analyses below the highest level attained was used. For a considerable proportion of the insured this information was not available (missing data, longstanding unemployment, early retirement). Despite their heterogeneity, unclassified subjects were treated as a separate group.

Income (due to employment) is also transmitted from employer to health insurance as it is the basis for calculating insurance fees. Not all subjects were insured throughout the observation period. Some had been insured before electronic data storage began (i.e., before 1987), for some individuals coverage began and ended between 1987 and 1996, others entered the insurance after 1987 and had continuous coverage. The numerical amount of wages during the observation period increased without a parallel rise of purchasing power necessarily having taken place, thus numeric monthly revenues in 1987 and 1996 are not comparable. This made it necessary to standardize individual incomes. If individuals were insured for more than one year, the amounts were converted into a one-year reference period. In order to obtain comparability, for each individual the deviation from the mean per year was calculated. The means of these deviations were computed, and served as indicator of income. In the statistical analyses, the income continuum is transformed into categories of five groups of equal size. Actually for all insured with an occupation, income information is available as this is the basis for the calculations of insurance premiums. Missing values were assigned if unusually high or unusually low payments were recorded. This concerned single payments, transitional periods with formal employment, but without payment, rehabilitation periods, etc. In case of constant insurance this did not cause classification problems, but if individuals were insured shorter, a regular income could not be calculated and missing values had to be assigned. Again subjects with missing data were classified into an own category.

Statistical analyses

The following analyses are based on Cox-regression [40, 41]. The proportional hazards model is appropriate here since it takes time (in the following analyses: age) into account. It can handle insurance periods of differing lengths, i.e., individuals leaving the population after short periods will not cause biased results. Cox regression depicts a time process, whereas it is assumed that an event (in the present case: date of diagnosis or date of death) will occur as a function of time having elapsed. For every covariate (in the present case indicators of socio-economic status and gender) it is estimated to what extent the time process is altered, i.e., whether the respective risks for defined groups decrease or increase. For incidence the time interval refers to the age at diagnosis, for deaths it is the time interval between diagnosis and death by taking age into account. All analyses were performed with STATA 10SE [42].

Results

The rank order correlation between occupational position and income is r = 0.16, thus regression effects should not be impaired by multicollinearity problems, and the two indicators should be considered as having different latent content.

In the first step, relative morbidity risks are computed. In the second step, dealing with case fatality only, patients with one of the four diseases of interest are considered with respect to differences due to socio-economic status.

For lung cancer incidence (Table 3) monotonously increasing social gradients appear for both indicators. For the second highest 20% of the income distribution, the relative risks are not significant, while for the remaining income categories the effects are robust and range up to RR = 7.0 for the lowest 20% of the income distribution. The relative risk of unclassified subjects is falling close to the second lowest fifth.

Table 3 Relative risks and 95% CI for incidence and case fatality of lung cancer by income, occupational position and gender

In the analysis for occupational position, the comparison with the highest occupational category (professionals and intermediate positions combined) yields effects ranging from RR = 2.7 to 6.9, and again the relative risks are increasing monotonously with decreasing occupational position. The RR for unclassified subjects is falling between individuals holding skilled manual and unskilled/semi-skilled positions. The confidence intervals of occupational position are wide, indicating that the estimations may be affected by small case numbers.

There is a strong gender difference since in women as compared to men the age-standardized risk for a lung cancer diagnosis is only 15%.

For lung cancer deaths the results are different. For occupational position the relative risks are not significant, but this is different for income. Again compared with the highest 20% of the income distribution in the lowest 40% robust relative risks appear, suggesting considerable social inequalities additional to inequalities in incidence. The analyses also indicate that, once having contracted the disease, there is no significant gender difference for deaths.

Incidence due to stomach cancer (Table 4) could be analyzed only after having added up skilled non-manuals and the intermediate/professional positions as the rates of patients were too small in the upper categories, while for income this was not necessary. For both indicators social gradients emerged, but for occupational position they are more pronounced than for income. The relative risks for unclassified subjects are high (RR = 5.3 and 5.7). For both indicators the confidence intervals are again wide; low cell frequencies should have caused the problem.

Table 4 Relative risks and 95% CI for incidence and case-fatality of stomach cancer by income, occupational position and gender

As reported for lung cancer, there are gender differences; again in women the risk of stomach cancer is more than 50% lower than in men.

Social differentials in terms of case fatalities appeared neither for occupational position nor for income, and also no gender differences were found.

Social differentials in incidence due to cancer of the intestine (Table 5) emerged only for income. The relative risks are similar to those found for stomach cancer. For occupational position no social differences are present. Only for unclassified insured the relative risk is significant. No social differentials in case fatality are present.

Table 5 Relative risks and 95% CI for incidence and case-fatality of intestinal carcinoma by income, occupational position and gender

For breast cancer incidence (Table 6) a significant relative risk appears only in women that could not be classified for income. For the remaining categories as well as for occupational position no social differences can be reported. No social differences for breast cancer deaths emerged.

Table 6 Relative risks and 95% CI for incidence and case-fatality of breast cancer in women by income and occupational position

Discussion

In the present study social gradients in incidence and case fatality of four types of malignant disease were examined. For Germany, social differentials of malignant diseases at individual level had not yet been examined, although some studies on the topic had been published [26, 27].

The most pronounced effects were found for lung cancer, and they emerged both for income and for occupational position. A substantial share of lung cancer risk is due to smoking. The social patterns of tobacco consumption were reported to be similar to lung cancer. This holds not only for Germany [43], but also for other European countries [9]. Smoking is also correlated with the main indicators of socioeconomic position, i.e., in all cases the lowest positions were associated with the highest tobacco consumption [44]. Thus, although the relative contribution of smoking to social gradients of lung cancer may vary by country, it always explains a substantial part of social inequalities of lung cancer risk [9]. To a lesser degree this applies to risk-taking behavior and to expositions to environmental hazards [45, 46]. After having considered the risks of contracting lung cancer, the second line of analysis was carried out only with patients and death as outcome. Now only income was associated with increased risks of lung cancer death. Taken together, these results point to a double social inequality in health, because disadvantaged individuals have higher risks of lung cancer, and once affected, they are also at higher risk of dying.

For stomach cancer social gradients were reported for both indicators, but no interpretable differences in case fatalities emerged. This finding is not in accordance with other European studies where excess mortalities were reported for lower socioeconomic groups [47], although group differences were sometimes small [48]. Incidences and case fatalities of stomach cancer were declining over the past decades [49]. The incidence of intestinal cancers was increasing, but case fatalities were declining slowly [30, 49]. In the analysis presented above, the social gradients were different from stomach cancer since only income was associated with increased incidence, but not with occupational position. This finding is in line with the inconsistent evidence on the associations with occupational hazards [50, 51]. Again the social gradients found in our data are consistent with some earlier studies, but this does not hold for case fatalities [5, 11]. As established for lung cancer, living conditions and lifestyle patterns are risk factors for cancer of the stomach. Besides smoking, a low consumption of fruit and vegetables as well as a preference for red meat turned out to be associated with incidence [52, 53]. Social gradients were demonstrated for all risk factors, and the unfavorable combinations were reported to occur more frequently in lower socio-economic groups. This also applies to intestinal cancers, where increased body weight, lack of physical activity, the consumption of red meat, and animal fat were suspected to increase the risk of colon cancer. For small intestine cancer the knowledge about risk factors is limited. In the literature the consumption of alcohol, smoking, and dietary factors had been reported, but the results are far from conclusive [12].

For breast cancer incidence no social gradients were found for occupational position, and for income unclassified women had elevated risks. These findings are in contrast to studies where breast cancer was found to occur more often in middle class women [19, 21, 54]. The higher relative risk in unclassified women is difficult to explain since this is a heterogeneous group consisting of single and/or unemployed women or mothers and a large subgroup that cannot be described precisely. In contrast to genetic predispositions that are unrelated to social position, a number of risk factors (older age at first pregnancy, low parity, nutrition habits) were reported to occur more frequently in higher socioeconomic groups [55], but in contrast to studies from other European countries incidences do not show social patterns.

Similar to the nearly total absence of inequalities in incidence, no social differences were found for case fatality. This differs from international studies where breast cancer deaths were related to social class or socioeconomic position [24, 56], but in a recent publication from France it was reported that social differences of breast cancer deaths disappeared after a period of decline [13]. This may indicate the absence of social gradients with respect to the utilization of health care facilities or the quality of care. However, in some studies it was found that in women with lower socio-economic background breast cancers were in a more advanced stage upon detection [57, 58]. This may be due to delay in seeking medical care in good time, which may again result in increasing risks of dying prematurely.

In the scientific discussion on health inequalities the dominating role of material conditions had been emphasized [59], but this underestimates the effects of other factors since the role of income (as the best indicator of material conditions) was not consistent. In the preceding incidence, analyses on cancers of the lung and the stomach occupational position turned out as important, but this was not the case for intestinal and breast cancer. This emphasizes that in health inequality studies more than one indicator should be included and their relative contributions have to be considered in comparison; a restriction of the scientific discussion to material conditions [60] is not appropriate. Consequently, the role of every dimension of social differentiation also has to be established for every type of cancer separately.

In the scientific literature, the role of health care systems and how medical care is delivered is discussed. This applies to cancers in general [61, 62], and for lung [63] as well as for breast cancer [64] in particular. The authors maintain that social differentials in cancer deaths may indicate barriers to appropriate treatment, and this is backed up with social differences in case fatalities [62, 63]. If the quality of health care and access to treatment would explain differences under the conditions of the German health care system, it should have effects on all types of cancers considered. With one exception this had not been the case, thus we conclude that medical treatment may not differ by socio-economic factors. A precise test of this assumption is, however, not possible since the data were actually collected for accounting purposes and do not contain information on subjective measures of health status or on the perception of symptoms.

Unequal access to treatment may be a tempting explanation for differences in survival, but the German health care system is not the only one to offer equal access. This is the case in Switzerland where social position is a prognostic factor [24], but equal access is also provided in France where social differences of survival disappeared in the 1990s [13].

Until the 1990s the average duration of stay in hospital in Germany was higher than in most neighboring countries. By that reason medical control of patients was longer, and this may have led to a social equalization of survival. However, this is an unproven argument. Under the present condition of shortening lengths of stay this may change, but the set-up of treatment centers and integrated care may again act toward better quality of treatment.

Finally, some critical issues have to be considered. The large proportion of unclassified subjects on occupational position and especially on income raises the question how to interpret the results obtained for this group. The knowledge on their socio-economic structure and information about their social environment is incomplete.

Another shortcoming of the health insurance data may affect the size of the social gradient. The data do not depict the upper 10% of the population, the same holds for subjects from the lowest positions of the social scale, e.g., the majority of individuals being on social security or homeless people. Omitting them should lead to an underestimation of social gradients with respect to incidences and case-fatalities. In the analyses above the highest occupational position obtained during the observation period had been assigned for classifying individuals for occupational position. Thus the consequences of social upward and downward mobility had not been taken into account, but this may affect disease incidence [2]. As far as the ups and downs in the study population level each other out the net effect on the results may be zero. However, at that time the data were recorded, the study region underwent economic changes which may have led to more downward than upward social mobility. Again the social gradients with respect to disease incidence may have been underestimated.

Another problem of health insurance data is the absence of information on tumor size and receptor status. Although this limits their range of application, it may not question the substantive conclusions on social differentials with respect to incidence. In earlier studies it had been shown that social inequalities in case fatalities remained also after tumor stage, treatment and patient characteristics had been taken into account [23, 24]. In a Danish study, it was found that the staging of tumors upon detection did not differ by patients’ socio-economic background [65]. However, there were differences in treatment since women from higher socio-economic background were more likely to receive lumpectomy, but how this may have affected survival is not clear.

A last uncertainty was caused by the lack of information on causes of death since only hospital diagnoses and date of death had been available. The bias resulting out of this shortcoming should be reduced by having restricted the time period after disease manifestation to five years. Eventual biases should be more effective in the age groups over 65 years, but the largest share of them is falling into the groups that could not be classified for income and occupational position.

Comparing survival rates of cancer cases of the insurance population with registry data from Germany, we may conclude that mortalities in the cancer cases from the health insurance population are likely to be due to these diseases. According to registry information [36], the mean five-year survival rates for lung cancer are 17% (women) and 9% (men), for stomach cancer the respective figures are 28% (women) and 27% (men), for intestinal carcinoma 51% (women) and 48% (men), and finally 73% for breast cancer. As the survival rates in the health insurance data are higher, a number of individuals with cancer might have left the insurance without having died and others were under observation for less than five years.