Introduction

Systemic lupus erythematosus (SLE) is a systemic autoimmune disease characterized by chronic inflammation and the production of autoantibodies directed against numerous antigens [1]. Because SLE patients are now living longer due to recent advances in treatment, the incidence of chronic comorbid conditions has been rising [25]. Rates of several types of cancer, particularly hematopoietic malignancies, appear to be increasing in the SLE population [615]. One of the proposed biologic mechanisms for this association is a defect in the body’s normal immunosurveillance process. In a healthy immune system, aberrant cells produced during cell replication are removed to prevent them from becoming malignant and progressing to clinical cancer [16]. Among individuals with SLE, this regulation process is likely impaired, contributing to increased cancer risk [16]. In addition to increased baseline cancer risk in SLE patients relative to the general population, other factors which are thought to play a role include exogenous exposures to medication [17, 18] and viral agents known to be associated with cancer [12].

Several epidemiologic studies have attempted to characterize cancer incidence among SLE patients. Existing literature on this subject includes results from small case series [1922], population-based studies using administrative data sets [811], and studies using clinical cohorts of patients [6, 14, 15, 2326]. Although older studies suffered from small sample sizes and limited generalizability, results of recent studies are based on larger population-based cohorts and multicenter clinical cohorts. Increased incidence of non-Hodgkin’s lymphoma (NHL) and other hematologic malignancies appears to be a consistent finding [6, 811, 23, 24, 26], although there is considerable variation in the magnitude of excess risk and precision of the estimates across studies. The results for solid tumors, however, have not been as consistent. Some studies have reported excess risks of lung [6, 9, 15, 24], liver [6, 9], cervix [24], and vagina/vulva cancer [9] among SLE patients; in the case of the latter two cancers, the results were based on very small numbers of cases.

In this paper, we present the results of a retrospective cohort study designed to examine cancer incidence in a cohort of SLE patients in California who required hospitalization during the period 1991–2002. The cohort was defined and cancer outcomes were measured using a data set created via electronic linkage of cancer registry and patient discharge data. The aims of the project were to describe patterns of cancer development among patients with SLE in this large and racially diverse population, compare the observed cancer incidence in the cohort to the expected incidence in the general California population, and to examine cancer risk across age, sex, and race/ethnic strata. To the best of our knowledge, this is the largest cohort of SLE patients in which this question has been addressed.

Methods

Description of data sets

The cohort for this study was identified from California patient discharge data. The Patient Discharge Dataset is produced annually by the California Office of Statewide Health Planning and Development (OSHPD). This dataset contains a record for each inpatient discharged from all non-federal, licensed, acute care hospitals. Information on basic demographic characteristics, diagnostic codes, and procedure codes related to the hospitalization are included in the data set. The diagnoses and procedures utilize codes specified by the International Classification of Diseases, 9th Revision, Clinical Modification (ICD-9-CM) [27].

Information on cancer outcomes was obtained through electronic linkage of the patient discharge data set to the California Cancer Registry (CCR) data set for the period 1991–2002. The CCR is the largest, population-based cancer registry for a geographically contiguous area in the world, collecting incidence reports on over 140,000 new cases of cancer diagnosed annually in California. Standards for data abstracting, collection, and reporting are specified by the CCR [2831]. The CCR has consistently met the highest standards for data quality and completeness [32], and participates in the National Cancer Institute’s Surveillance Epidemiology and End Results (SEER) program [33]. The CCR database contains information on basic demographic factors, tumor characteristics, and cancer directed surgeries and treatment. Follow-up for vital status on patients in the database is conducted through routine linkages with several administrative databases, the primary one being the California statewide mortality file. Based on our current definition of loss to follow-up (no identification of patient through passive follow-up methods for 22 months from data of dataset creation), we estimate that approximately 3–5% of our cases are lost to follow-up.

Linkage procedures

The two data sets were electronically linked using Integrity software [34] and a combination of deterministic and probabilistic linkage strategies. The primary variables used to link the two data sets were social security number, date of birth, sex, and residential zip code. Approximately 90% of matches were identified through the deterministic linkage method. A probabilistic linkage method was used to identify the remaining matches. A probabilistic linkage utilizes a set of identifiers contained in both data sets to calculate the probability that records from different data sets are matches, allowing for errors such as transpositions of digits and spelling. Matches with weights higher than a predetermined cutoff are accepted, whereas those that are much lower than the cutoff are rejected. Matches that fall in between these limits are manually reviewed. Approximately 5% of matches were visually confirmed. Based on prior linkages of these data sets, we estimate that our algorithm identified 95%–99% of the matches [35].

Definition of study cohort and measurement of variables

All individuals with an ICD-9-CM code of 710.0 in any of the 25 diagnostic fields (principal diagnosis and up to 24 other diagnoses) hospitalized during the period 1991–2002 were included in the study. Only information on the first relevant hospitalization was included for each individual. International Classification of Diseases for Oncology, 2nd edition (ICD-O2) site and histology codes were used to identify specific cancer outcomes in the cohort [36].

Other demographic variables included in the analysis were classified as follows: age (<30, 30–54, 55–59, 60+), race/ethnic group (non-Hispanic white, non-Hispanic black, Hispanic, non-Hispanic Asian/Pacific Islander and Other/Unknown), and sex (male and female). The race/ethnic categories were mutually exclusive. Individuals identified as Hispanic were included only in the Hispanic category, and could be of any race. Individuals classified as any of the following were lumped into the Other/Unknown category: non-Hispanic American Indian/Alaskan Native, other race, or unknown race.

Statistical analysis

Descriptive statistics were generated for the entire cohort as well as for members of the cohort diagnosed with cancer during the study period. Person-years of follow-up were calculated for each individual. Time from the first hospitalization with a diagnosis of SLE to one of the following three events (whichever occurred first) was calculated: date of cancer diagnosis, date of death, or the end of the calendar year 2002. The expected numbers of cancers were calculated by sex, race/ethnicity, and five-year age group using rates from the general California population for the same time period. Standardized incidence ratios (SIRs), or the ratio of observed to expected cancers, and their 95% CI were calculated for all cancers combined and for all major cancer types. Race-specific and age group-specific estimates for specific cancer types (those for which we had adequate power) were calculated. Estimates were generated assuming that the number of observed cancers follows a Poisson distribution [37]. All estimates were adjusted by sex, age, and race/ethnicity.

The first six months of follow-up were excluded from the analysis, to minimize the possibility that the presenting symptoms of cancer overlapped with those of the SLE. When this period was expanded to one year, we observed minimal differences in the results, and therefore we chose to exclude only the first six months of follow-up to increase statistical power.

We also chose to present SIRs for each cancer type for males and females combined. When we calculated sex-specific SIRs, we did not see large differences between the sexes with respect to the magnitude of the effect measures, only the confidence intervals. Again, to increase our power, we chose to combine males and females for the presentation of data by cancer type.

We also assessed the internal consistency of our SLE diagnoses across multiple hospitalizations by examining the number of hospitalizations for each individual that recorded a diagnosis of SLE after the initial hospitalization for SLE, relative to the total number of times an individual was hospitalized.

Results

The study cohort consisted of 30,478 individuals who contributed 157,969 person-years of follow-up. The average length of follow-up was 5.1 years. Over 99% of individuals in our cohort who were hospitalized more than once had an SLE diagnosis recorded for every hospitalization (data not shown). Approximately 70% of individuals in the cohort had SLE listed as the primary or secondary diagnosis for their hospitalization. A total of 1,273 patients (4.2%) were diagnosed with at least one cancer during the study period. The most common cancer types were breast, lung, colon/rectum, and NHL. Among the NHL cases, large B-cell lymphoma was the most common subtype. The vast majority of the cohort (89%) consisted of female patients, although a higher proportion of males were diagnosed with cancer (17%), compared to the overall proportion of males in the cohort (11%) (Table 1). Half of the SLE patients diagnosed with cancer were over the age of 60 at the time of hospitalization, compared to 27% of the whole cohort. Approximately 42% of the SLE cohort members were identified as races other than non-Hispanic white. A larger proportion of SLE patients with cancer were non-Hispanic white (69%), compared to the overall proportion in the entire cohort (58%).

Table 1 Demographic characteristics of a cohort of patients with systemic lupus erythematosus (SLE), California, 1991–2002 (n = 30,478)

The overall cancer risk in the cohort was significantly elevated (SIR = 1.14, 95% CI = 1.07–1.20) (Table 2). Approximately threefold increases in both vagina/vulva cancer (SIR = 3.27, 95% C.I. 2.41–4.31) and liver cancer (SIR = 2.70, 95% CI = 1.54–4.24) risk were observed in the cohort. In addition, cohort members had roughly double the risk of lung (SIR = 1.66, 95% CI = 1.45–1.90), kidney (SIR = 2.15, 95% CI = 1.52–2.94), and thyroid (SIR = 1.83, 95% CI = 1.24–2.62) cancers relative to the general California population. SLE patients also had elevated risks of several hematopoietic malignancies including NHL (both large-B cell and follicular types), Hodgkin’s Disease (HD), and myeloid leukemia. Cohort members had significantly lower risks of breast, uterus, cervix, and prostate cancers. Given the small number of HD cases, we were not able to calculate subtype-specific SIRs. Since HD has been found to be at least three distinct diseases, which can be crudely classified by age, we calculated SIRs for individuals under 50 and those 50 years of age and older (data not shown). We did not find significant differences in the magnitude of the SIRs for these age groups and therefore chose to present the aggregate SIR for all HD cases.

Table 2 Age and race adjusted standardized incidence ratios (SIRs) for selected cancers among a cohort of patients with systemic lupus erythematosus (SLE), California, 1991–2002 (n = 30,478)a

Table 3 presents race-specific SIRs for selected cancer types. The risk of developing breast cancer was lower for non-Hispanic white (SIR = 0.72, 95% CI = 0.61–0.83), black, and Asian/Pacific Islander SLE patients compared to their counterparts in the general population, although the risk estimates for the latter two groups were not statistically significant. Lung cancer rates were significantly elevated among non-Hispanic whites and Hispanics, with the Hispanics having a twofold increase in lung cancer risk relative to the general population. Both black (SIR = 4.29, 95% CI = 1.93–8.04) and Hispanic patients (SIR = 3.48, 95% CI = 1.53–7.00) had higher risks of kidney cancer than the general population. The risks of NHL and vagina/vulva cancers were higher among all race/ethnic groups, with the highest risks among Hispanics and Asian/Pacific Islanders for both cancers.

Table 3 Race-specific standardized incidence ratios for selected cancers among a cohort of patients with systemic lupus erythematosus (SLE), California, 1991–2002 (n = 30,478)a

Table 4 presents the risks of selected cancers by age group. SLE patients under 30 years of age had a fivefold increase in breast cancer risk compared to the general population. Those 60 years of age or older had a 40% decreased risk of breast cancer. Lung cancer risk in the study cohort was significantly elevated in the two older age groups. Risks of kidney cancer were significantly higher in all age groups except those 60 years of age or older, with the risk decreasing with increasing age. NHL risk was elevated in all age groups with the risk exhibiting an inverse relationship to age. Cohort members under 45 years of age were at significantly increased risk of vagina/vulva cancer, with individuals under 30 at highest risk.

Table 4 Age-specific standardized incidence ratios for selected cancers among a hospitalized cohort of patients with systemic lupus erythematosus (SLE), California, 1991–2002 (n = 30,478)a

Discussion

Our findings confirm the results of previous research indicating both an overall increased risk of cancer and increased risk of hematologic malignancies among individuals with SLE [6, 9, 10]. Both the magnitude and direction of these effect measures are comparable to results from a large multicenter international cohort reported by Bernatsky et al. [6]. New findings include the increased risks of kidney and thyroid cancer among SLE patients, which highlight the power of the present study to detect differences in rarer cancers. The findings regarding significantly decreased risks of prostate and cervix cancers have not been previously reported.

This study also confirms the increased risks of lung and liver cancer among SLE patients reported previously [6, 9, 14]. We did not have information on tobacco use in our cohort, but other cohort studies of SLE patients reported current smoking prevalences similar to the general population [3842], although one study reported heavier use of tobacco by smokers with SLE [38]. Another possible explanation for the observed increase in lung cancer in our cohort is that lung involvement and fibrosis in particular is common in SLE patients [9]. One hypothesis for the association between SLE and liver cancer postulated by other researchers is an increased prevalence of hepatitis C infection among SLE patients [9, 43].

The SIR for vagina/vulva cancer in our study was very high and is supported by the results of only one other study that was based on only three cases [9]. Increased prevalence of human papillomavirus (HPV) in women with SLE has been observed in previous work [44, 45] and may be a possible explanation; although, similar to the results of Mellemkjaer et al. [9], we did not observe an increase in cervical cancer risk, which would be expected if HPV rates were high in our cohort. Vulvar cancer has also been associated with an autoimmune condition called lichen sclerosus et atrophicus, and thus may share a common inflammatory mechanism with SLE, or may be a secondary condition as it has occasionally been reported to occur in SLE [46].

Although increased cancer rates among SLE patients (particularly with respect to lymphomas and leukemias) have been linked to treatment with immunosuppressive agents such as cyclophosphamide, azathioprine, and methotrexate [4750], we were not able to examine this association, as we did not have treatment information on the patients. Several case series have supported the contention that there is an increased risk of bladder cancer among SLE patients because many of them are treated with cyclophosphamide [5153]. Surprisingly, the SIR for bladder cancer in our cohort was not significantly increased.

Patients in our study were at decreased risk of several screenable cancers including breast, prostate, and cervix. The results of studies that examined breast cancer risk among SLE patients are variable. Some studies reported increased risks [14] and other lower risks of breast cancer among people with SLE [6]. Since breast cancer is strongly associated with hormonal and reproductive factors, differences in these risk factors across cohorts may explain the variation in results. Decreased risks of prostate and cervix cancer have not been reported by previous studies.

Our analysis of specific cancer types by race/ethnicity resulted in several new findings, including significantly increased risks among Hispanics for four out of the five cancer types we examined. In addition, blacks had elevated risks of vagina/vulva cancer and Asian/Pacific Islanders were at increased risk of both NHL and vagina/vulva cancer. Only one other study has examined cancer risk by race in SLE patients and did not report increased overall cancer risk in any race/ethnic group other than non-Hispanic whites [54]. It should be noted that the proportion of non-white individuals in that cohort was relatively small, and that less than seven cancers total were observed among Hispanics and Asians in the cohort. More information on risk factors among these race/ethnic groups is needed to investigate the relationship between race/ethnicity and risk of various cancers among individuals with SLE.

The high relative risk for developing several cancer types in younger SLE patients compared to the general population was striking. The majority of severe cases of SLE occur in younger women, particularly women other than non-Hispanic whites [55]. If the severity of the immunological deficits associated with SLE are also associated with an increased risk of developing cancer, the finding of a higher relative risk of cancer in younger SLE patients appear plausible. In addition, younger patients with severe disease may be treated more aggressively with immunosuppressive agents, which might also lead to an increased risk of cancer.

There are several limitations that should be considered in the interpretation of results from the present study. First, our cohort consisted of hospitalized individuals, which raises the possibility that our results might not be generalizable to all SLE patients. It is possible that our study population represents a more severely diseased portion of the population. It is worth noting, however, that our estimates of overall cancer risk and the risk of several cancers such as hematologic malignancies, lung, and liver cancer fall within the range of other published reports with both hospitalized and non-hospitalized individuals.

The use of hospitalized individuals also introduces the possibility of detection bias, in which hospitalized individuals might be more likely to be screened for cancer than other people with SLE, thus leading to an overestimate of risk. If there were a strong detection bias in effect, we would expect the SIRs for screenable cancers to be high. Instead we observed decreased risks of breast, prostate, and cervix cancer in our patient population. Moreover, the only published study to examine screening rates among SLE patients concluded that appropriate cancer screening might actually be overlooked in patients with SLE [56]. In addition, many researchers have hypothesized that the use of hospitalized individuals in cohort studies may introduce surveillance bias, based on the notion that they may be more likely to be screened and tested for various diseases. Our results do not support this hypothesis.

Although we were not able to verify the SLE diagnoses in the hospital discharge data, the diagnosis codes had very high levels of internal consistency among repeat hospitalizations for the same individuals. Validation studies using Medicare physician claims data reported 85% sensitivity for SLE claims using the medical record as the gold standard [57, 58]. A recent study conducted in Sweden found that 23 of 42 cases identified by administrative data did not have SLE according to their criteria [58]. One major limitation of the Swedish study, however, was that the researchers included patients who were diagnosed over a 30-year period, but used current, more rigorous ACR classification criteria as the gold standard diagnostic criteria [59].

Other limitations of this study include the fact that this was not a closed cohort, thus we may have had some loss to follow-up, and the fact that we cannot definitively establish temporality. Both cancer and SLE are diseases with long latency periods and since we do not have definitive dates of diagnosis for SLE, we cannot be sure how long after diagnosis with SLE the cancer occurred. We attempted to address this issue by excluding cases that developed cancer during the first six months of the follow-up period. Finally, we did not have any information regarding treatment and other risk factors, such as smoking history, which affect both SLE and cancer risk.

Despite these limitations, our study has several strengths. To the best of our knowledge, this is by far the largest study of cancer and SLE conducted to date, providing enough statistical power and precision to confirm previous findings and to detect previously unreported increased risks of rarer cancers such as kidney, thyroid, and vagina/vulva cancer. In addition, we were able to examine risks of selected cancers by race/ethnic group with sufficient power to detect increased risks among blacks, Hispanics and Asians, which is also a novel contribution to the literature. Another advantage is the use of high quality cancer registry data to assess cancer outcomes.

The results of this exploratory study point to several interesting directions for future research, many of which we are currently pursuing. More detailed studies of the endogenous and exogenous factors that affect cancer risk among SLE patients are needed. Future studies should be directed toward examining the effects of treatment with immunosuppressive agents as well as the contribution of lifestyle factors such as diet and smoking on cancer risk in this population. In addition, further analysis of both cancer and SLE risk factors among various race/ethnic groups is needed to investigate the mechanisms that may underlie the increased risks. Finally, closer examination of the factors that influence cancer risk in younger populations of SLE patients is needed, as this is a group in which early intervention can potentially lead to better outcomes.