Introduction

Polycyclic aromatic hydrocarbons (PAHs) are a class of chemicals formed during incomplete combustion of organic materials. Major sources of PAH exposure for the general population include vehicle exhaust, electric power generation, waste incineration, wood smoke, tobacco smoke, and ingestion of grilled/charbroiled meats [1]. Occupational PAH exposure is high in the aluminum smelting, coal gasification, coal-tar distillation, and coke production industries. Several PAHs are known or probable human carcinogens [2, 3] based on associations between occupational exposures and cancers of the lung, skin, and bladder [24]. PAHs have also been shown to induce lymphomas in animals [58]; however, the relationship between PAH exposures and risk of non-Hodgkin lymphoma (NHL) in humans remains unclear.

NHL is the fifth most common cancer in men and women in the USA [9]. Immunodeficiency, such as HIV/AIDS, is a known NHL risk factor; however, the etiologies of most lymphomas have yet to be identified [10, 11]. Epidemiologic studies of the association between PAH exposure and NHL risk have primarily focused on occupational exposures and mainly with work in the aluminum industry. A positive relationship between occupational benzo(a)pyrene exposure and NHL risk was observed in a Canadian cohort [12]; however, most studies of occupational PAH exposure have not observed significant associations with NHL risk [1316]. Reports of associations between NHL and cigarette smoking, a major source of PAH in the general population [17], have been conflicting, with the majority of studies not showing an association with NHL overall [11, 1820].

Although to date no clear association between PAH exposure and NHL risk has been identified, there can be substantial heterogeneity in the etiology of lymphoma subtypes [11, 21, 22]. In large pooled analyses in the International Lymphoma Epidemiology Consortium (InterLymph), significant positive associations were found between smoking duration and follicular lymphoma [23] as well as lymphoplasmacytic lymphoma/Walderstrom macroglobulinemia, marginal zone, and peripheral T cell lymphomas [22]. In addition, an increased risk of T cell lymphoma was observed among men who had ever smoked in a Danish and Swedish population-based case–control study [19]. However, other studies of smoking and T cell lymphomas found no association [18, 20].

In this analysis, we evaluated the association between PAH concentrations in residential carpet dust and risk of NHL overall and by histologic subtypes. Because PAHs can accumulate in carpets and rugs over years or decades [24], dust PAH concentrations may represent long-term residential PAH exposure. To further understand sources of residential PAH exposure, we also explored whether home characteristics, demographic variables, geographic location, and environmental factors were determinants of PAH concentrations in the carpet dust samples.

Methods

Study population

The study population, which has been described in detail previously [2527], included 1,321 first primary NHL cases aged 20–74 diagnosed between 1998 and 2000 from four Surveillance Epidemiology and End Results (SEER) registries: Detroit, Michigan metropolitan area (Macomb, Oakland, and Wayne counties); state of Iowa; Los Angeles County, California; and Seattle, Washington metropolitan area (King and Snohomish counties). HIV-positive cases were excluded. The participation rate among eligible cases was 76 %. Population controls were selected using random digit dialing (<65 years of age) or from Center for Medicare and Medicaid Services files (≥65 years) and were frequency-matched to cases by age (within 5 years), sex, race, and study center. Of 2,046 eligible controls, 1,057 (52 %) participated. The study was approved by the Human Subjects Review Boards at the National Cancer Institute and each SEER center. Written informed consent was obtained from each participant.

Exposure assessment

A computer-assisted personal interview was conducted in participants’ homes and included questions about demographics, lifestyle, housing characteristics, and other factors. As previously described, the study used a split-sample questionnaire design, in which participants were mailed one of two versions of a self-administered questionnaire [25]. Due to this design, some variables, such as smoking, were only asked for a subset of participants [25].

Dust samples were collected at the time of the interviews (February 1999–May 2001) from home vacuum cleaner bags for 695 cases and 521 controls that consented and met eligibility criteria [25, 26, 28]. Eligible cases and controls had used their vacuum cleaner within the prior year and had owned at least half of their carpets or rugs for 5 or more years. Dust samples were shipped overnight to Southwest Research Institute (San Antonio, TX) and stored in freezers at −20 °C. The dust was sieved (<150 um), and a 2-g sample of the fine fraction was Soxhlet extracted for 16 h with 200 ml diethyl ether–n-hexane (6:94). The extracts were cleaned through a Florisil column and analyzed for 10 PAHs, seven of which are classified by the US Environmental Protection Agency as probable human carcinogens: benz(a)anthracene (BaA), benzo(a)pyrene (BaP), benzo(b)fluoranthene (BbF), benzo(k)fluoranthene (BkF), chrysene (Chr), dibenz(a,h)anthracene (DBaA), and indeno(1,2,3-c,d)pyrene (IdP). The other three PAHs, benzo(ghi)perylene, coronene, and dibenzo(a,e)pyrene, were excluded from analyses due to the small number of samples with detectable concentrations. Chemical analyses were performed with gas chromatography (GC)/mass spectrometry (MS) (Danvers, MA) in selected ion monitoring mode using a DB-5.625 30 m × 0.25 mm ID column (J&W Scientific, Folsom, CA). Confirmation analysis for selected samples was done using the same column under full-scan mass spectral analysis on a second GC/MS instrument. Quantitation was based on five-point calibration curves. Dust samples from 682 cases (98 %) and 513 controls (98 %) were successfully analyzed between September 1999 and September 2001. Extraction from laboratory spiked dust samples (n = 25–27 depending on the specific PAH) was between 87 % (DBaA) and 103 % (BaA). Coefficients of variation calculated for duplicate samples within batches (n = 27 pairs) ranged from 4 (BbF) to 8 % (DBaA).

Geocoding and spatial variables

Geographic coordinates for the majority of residences (89.9 %) were assigned from Garmin GPS12 Personal Navigator (Garmin International, Inc., Olathe, KS) global positioning system readings taken outside the participant’s home at time of interview. Geocoding of addresses using a modified Microsoft Visual Basic version 6.0 program (TeleAtlas, Lebanon, NH) to match input addresses to the TeleAtlas MatchMaker SDK Professional version 4.3 street database was done for remaining residences (9.5 %). Eight homes (six cases, two controls) were excluded because they could not be accurately located.

The distance from each residence to the nearest major road and freight route was determined using TeleAtlas Dynamap Transportation version 5.2 (2003). Distance to the nearest railroad was calculated from the National Atlas of the United States database (2005). Locations of industrial combustion facilities that release dioxins and may also release PAHs were obtained from an US Environmental Protection Agency (USEPA) national database [29]. The types of facilities included municipal solid waste incinerators, medical waste incinerators, sewage sludge incinerators, hazardous waste incinerators, cement kilns, and coal-fired electricity-generating plants. We determined the number of facilities within 5 km of each home. The percentage of developed land (≥20 % impervious surface) within 2 km of each residence was calculated using the US Geological Survey 2001 National Land Cover Database, a 30-m resolution land cover database created from Landsat 5 and Landsat 7 satellite imagery [30]. Buffer sizes were selected based on the findings of a previous analysis of polychlorinated biphenyl dust concentrations in this case–control study [31]. Full details of these variables have been described previously [31].

Estimated annual average ambient PAH concentrations at the census tract level, based on US Census 2000 tract assignments, were obtained from the EPA’s 1999 National Air Toxics Assessment program (http://www.epa.gov/ttn/atw/nata1999/tables.html). Ambient PAH concentrations were estimated from the Assessment System for Population Exposure Nationwide (ASPEN) model (http://www.epa.gov/ttnatw01/nata/aspen.html). ASPEN is a computer simulation model used to estimate toxic air pollutant concentrations, which takes into account the rate, location, and height of release, wind speeds and directions, breakdown of the pollutants in the atmosphere, deposition rate, and photochemical transformation into secondary pollutants.

Statistical analysis

Final analyses included 676 cases and 511 controls with known residential location and dust samples analyzed for PAHs. A multiple imputation procedure was applied for instances in which laboratory measurements contained missing data due to concentrations below the limit of detection or when the sample contained other compounds that may have coeluted with the target analyte. Full details of the imputation procedure have been described [26, 32]. Briefly, upper and lower bounds (“intervals”) were assigned for each missing datum. Values within each interval were then imputed using Tobit regression assuming a log-normal distribution, which was consistent with the observed distribution of quantified measurements. The imputation procedure was repeated 10 times.

Unconditional logistic regression was used to compute odds ratios (OR) and 95 % confidence intervals (CI) for the associations between NHL and each PAH, summed (total) PAHs, and summed PAHs weighted by their toxic equivalency factors (TEQ). We conducted separate and multinomial logistic regression analyses for NHL subtypes for which we had sufficient cases (n ≥ 25), grouped according to the World Health Organization classification [33, 34] using the InterLymph consortium guidelines [35, 36]. The NHL subtypes analyzed were diffuse large B cell lymphoma (DLBCL), follicular lymphoma, marginal zone B cell lymphoma, chronic lymphocytic leukemia/small lymphocytic lymphoma (CLL/SLL), all T cell lymphomas, peripheral T cell lymphoma, and lymphoma not otherwise specified (Table 1).

Table 1 Characteristics of NHL cases (n = 676) and frequency-matched controls (n = 511)a

PAH concentrations were categorized into quartiles (≥100 cases) or tertiles (<100 cases) based on the distribution in controls. The linearity of the relationship between PAH concentrations and NHL risk was evaluated by modeling natural log-transformed concentrations as continuous variables. We also evaluated the relationship by modeling risk per 100 ng/g increase in PAH concentration.

All analyses were adjusted for age, sex, and study center. Education (<12, 12–15, >15 years), race (black, white, other), and smoking pack-years (p-y) [non-smoker, <7 p-y, 7–16 p-y, 17–35 p-y, >35 p-y, missing (n = 714)] were examined, but did not change OR estimates >10 %, when included individually or together, and were not included in final models. Separate models were fit using the 10 different imputation datasets. The results were combined using the MIANALYZE procedure in SAS version 9.2 (SAS Institute, Inc., Cary, North Carolina) to create a single OR and CI accounting for the variability between the imputed values. In addition, we created models stratified by study center because the measured PAH concentrations in house dust varied greatly between study centers. PAH concentrations in stratified models were categorized based on the distribution among controls within each study center.

As secondary analyses, we evaluated whether home characteristics, demographic variables, and environmental factors were predictive of individual and total PAH concentrations in carpet dust samples of cases and controls. Associations between natural log-transformed PAH concentrations, based on the first imputation, and possible determinants of household PAH concentrations were calculated using multivariate least-squares regression. Results were reported as the percent change in geometric mean (GM) within each level of the individual covariates. Self-reported demographic and home characteristic covariates included: sex, race (black, white, other), education (<12, 12–15, >15), age, smoking pack-years (p-y) (non-smoker, <7 p-y, 7–16 p-y, 17–35 p-y, >35 p-y, missing), case–control status, type of home (single family, other), and the year home was built (<1940, 1940–1969, 1970–1989, >1989). Environmental factors included US Census 2000 block population density (1,000 persons/mile2), distance to nearest major road and freight route (<100, ≥100 m), distance to nearest railroad (<400, ≥400 m), separate counts of combustion facilities (municipal solid waste incinerators, medical waste incinerators, sewage sludge incinerators, hazardous waste incinerators, cement kilns burning non-hazardous waste, and coal-fired electric generating plants) within 5 km of each home, percentage of developed land (≥20 % impervious surface) within 2 km of each home, and census tract-level estimated ambient PAH concentrations (ng/m3) from the ASPEN ambient PAH model. All statistical analyses were performed using SAS statistical software version 9.2 (SAS Institute, Inc., Cary, North Carolina). Results were considered statistical significance when two-sided p values were <0.05.

Results

NHL cases and controls in our analyses were similar with respect to study center, gender, race, education level, and smoking status (Table 1). Cases with dust samples were on average slightly younger than controls, with a mean age of 58.9 and 60.3 years (p = 0.04), respectively. As described previously [26], participants with dust samples were similar to the study population overall except that they were slightly older and had lived in the interview home longer than participants without dust samples. There was a high degree of correlation between concentrations of individual PAH compounds in the dust samples, with Spearman correlation coefficients among controls ranging from 0.86 to 0.96 (results not shown). The median concentration of total PAHs measured in carpet dust was slightly higher among cases [1,089.0 ng/g, interquartile range (IQR) (603.2–2,624.2)] than controls [1,021.0 ng/g, IQR (530.7–2,950.1)] (p = 0.04). Detection rates for the individual PAHs were similar among cases (range 72.9–99.4 %) and controls (range 67.1–100 %) (Supplemental Table 1).

Compared to the lowest quartile of PAH concentration, risks of NHL overall and of the DLBCL and follicular lymphoma subtypes were elevated in the second and third quartiles for many of the PAHs, but not in the highest quartile, indicating a nonlinear relationship (Table 2). The patterns of elevated risk were fairly consistent across the individual PAH compounds, but only a few associations were statistically significant. The associations between tertiles of PAH concentrations and marginal zone lymphoma were not monotonic; many of the ORs were significantly increased for the second tertile of PAH concentration, but there was no association for the highest tertile. Tertiles of PAH concentrations were not associated with CLL/SLL (Table 2). For NHL overall and all the B cell subtypes, there were no significant associations between PAH concentrations and NHL risk when the natural log of the PAH concentration was analyzed as a continuous variable (Table 2) or for risk per 100 ng/g increase in PAH concentration (not shown).

Table 2 Associations between total and individual PAHs and NHL overall and by major B cell subtypes

We observed an increased risk of T cell lymphomas associated with increasing tertiles of PAH exposure (Table 3). Specifically, risk of T cell lymphoma overall was significantly increased among those with the highest tertile of total PAH (OR = 3.04; 95 % CI, 1.09–8.47) and BkF (OR = 3.20; 95 % CI, 1.13–9.11) compared to those with the lowest tertile of the respective residential PAH concentration. Risk was nonsignificantly elevated associated with the highest tertile of concentration of the other PAHs, generally demonstrating a monotonic positive trend with increasing tertiles, although the ORs for a natural log increase in concentrations were not significant. Similar relationships were observed for the subset of peripheral T cell lymphoma cases. Associations between PAHs and NHLs with unspecified subtype followed a pattern similar to NHL overall, but were not significant.

Table 3 Associations between total and individual PAHs and T cell lymphomas and NHL not otherwise specified

Results for total PAH concentrations and risk of NHL overall were similar when we evaluated risk separately by study center (Supplemental Table 2). Although we had limited power to estimate study center-specific risks by subtype, the patterns were generally similar to those for all centers combined, including for the T cell lymphomas (results not shown).

Determinants of carpet dust PAH

Results for multivariate analyses of the association between total PAH carpet dust concentrations and selected demographic variables, home characteristics, and environmental factors are shown in Table 4. We present results for cases and controls combined because analyses of controls only yielded similar effect estimates (results not shown). Study center was a statistically significant determinant of total PAH concentrations, with the highest adjusted levels found in Detroit (GM = 4,354.7; 95 % CI, 3,269.5–5,799.9), followed by Iowa (GM = 1,374.6; 95 % CI, 995.6–1,898.1), Seattle (GM = 838.0; 95 % CI, 666.2–1,054.0), and Los Angeles (GM = 354.1; 95 % CI, 271.4–462.1). PAH concentrations in homes of white participants were, on average, higher than levels in homes of other races. Participant’s age was positively associated with total PAH concentrations. We observed a 24.2 % lower average total PAH concentration in the homes where the participants had less than 12 years of formal education compared to homes where the participant had 16 or more years of formal education. Adjusted GM measured PAH concentrations did not significantly differ by smoking status or by housing age. Both average annual ambient PAH concentrations, as estimated at the census tract level by the ASPEN model, and the proportion of developed land within 2 km of a residence were positively associated with total PAH concentrations in carpet dust. No significant associations were observed for proximity to major roadways, freight routes, railroads, or industrial combustion facilities (results not shown). Results for center-specific and chemical-specific models were similar (results not shown).

Table 4 Multivariate analysis of determinants of natural log-transformed total PAH levels in residential carpet dust (n = 1,187)

Discussion

In this case–control study, we observed significant positive associations between total PAH and BkF concentrations in residential carpet dust and risk of T cell lymphomas. Although not statistically significant, risk of T cell lymphomas was also elevated with increasing exposures to each of the six other PAHs analyzed. No significant monotonic exposure–response associations were observed for risk of NHL overall or for B cell lymphomas including DLBCL, follicular lymphoma, marginal zone B cell lymphomas, and CLL/SLL.

Evidence of associations between PAH exposure and NHL risk in human studies is currently limited. In a Canadian cohort study of aluminum reduction plant workers, an industry with potentially high PAH exposures, there was evidence of a positive exposure–response relationship between BaP exposure, assessed via a job exposure matrix (JEM), and NHL risk (p trend <0.01) [12]. Investigations of aluminum reduction plant workers in Quebec observed nonsignificant excesses of NHL incidence and mortality among workers compared to the general population [16]. In regard to PAH exposure, specifically, the authors reported no associations between BaP and risk of NHL incidence or mortality. In addition, no associations between JEM-assessed occupational PAH exposure and NHL risk were observed in a case–control study conducted in Sweden [13] or among a retrospective cohort of workers employed at a California aerospace company [14].

NHL is comprised of a group of related, yet distinct, lymphoid diseases. Studies of cigarette smoking, a major source of PAH exposure in the general population, and NHL have found associations to vary by lymphoma subtype [23, 3742]. A pooled analysis of 20 case–control studies participating in the InterLymph consortium observed significant positive associations between smoking duration and risks of peripheral T cell, follicular, lymphoplasmacytic lymphoma/Walderstrom macroglobulinemia, and marginal zone lymphomas, but not other subtypes [22]. Morton et al. [21, 22] further describe some additional risk factors that show heterogeneity among major NHL subtypes, although the biological mechanisms are not fully understood. In addition to increased risk of T cell lymphomas overall, we observed an elevated risk of peripheral T cell lymphoma specifically, although there was limited power to detect an association due to a small number of cases. Although no human data are available, T cell lymphomas have been produced in the offspring of mice exposed to dibenzo[a,l]pyrene (DbP) during pregnancy [5, 6]. The potential biological mechanisms for the association of PAH exposure and risk of T cell lymphomas are not fully understood. One possible pathway for carcinogenesis is the bioactivation of DbP via the CYP1B1 enzyme, which is highly expressed in the thymus where T cells develop [43].

In general, carpet dust PAH concentrations measured in our study were lower than previously reported levels in Durham, NC, USA [44, 45] and Ottawa, Canada [46]. However, measured concentrations in this multicenter study were considerably higher than those measured in the California Childhood Leukemia Study (CCLS), a recent case–control study conducted in the San Francisco Bay area and the Central Valley, USA [47]. The large degree of variability in PAH concentrations reported in these four North American studies is consistent with the substantial geographic variability we observed between our study centers.

Smoking status, which has been associated with household PAH concentrations in other studies [48, 49], was not a significant determinant of PAH concentrations in our study. However, we had information on the smoking status of the study participant only and not for all household members. Carpet dust in this study may also reflect many sources of PAHs including estimated outdoor air concentrations, which, consistent with our findings, was found to be a significant predictor of house dust PAH concentrations in the CCLS [47].

In the CCLS [47], households in which the mother was Hispanic had lower house dust PAH concentrations than households where the mother was not Hispanic. The authors noted that Hispanic mothers vacuumed carpets more frequently and were less likely to live in an urban area than non-Hispanic mothers. In this study, we did not find an association between PAH concentrations and population density. However, the percent developed land near homes and air concentrations of PAHs were significant predictors indicating that population density alone was not a good proxy for residential exposure. The associations with respondent race and education were unexpected. It is possible that the higher PAH concentrations measured in the households of white participants in our study and the association we observed with education are due to behavioral differences that were not measured.

The large number of participants with dust samples and the use of an exposure measure that is not affected by recall bias are strengths of this study. However, there are limitations to the use of household carpet dust to assess PAH exposures. House dust is not a direct measure of exposure and post-diagnostic samples may not be fully representative of pre-diagnostic dust levels, which were not available due to the case–control design. However, carpet dust can act as a sink for PAHs [24] and therefore may be representative of long-term indoor exposure. In analyses of PAH concentrations in household dust from the CCLS, analytic and within-household temporal variability were approximately 90 % as large as between-household variability [48]. Although the authors observed greater between-household variability for higher molecular weight PAHs, such as those analyzed in our study, these findings point to the potential for misclassification of exposure when using a single carpet dust PAH measurements in epidemiologic studies. Other limitations include the limited statistical power to investigate less common NHL subtypes due to small numbers of cases, incomplete information on smoking status, and the possibility of chance findings due to analyses of multiple PAHs and lymphoma subtypes.

We were unable to explain the non-monotonic risk patterns we observed for NHL overall and for B cell lymphoma subtypes. Although the determinants we investigated did not substantially differ by quartile of measured PAH dust concentration, it is possible dust PAH is a proxy for an unidentified risk factor. However, total PAHs were not significantly correlated with polychlorinated biphenyls or with dioxins and furans measured in a subset of participants with dust (not shown). In addition, the pattern of ORs we observed were consistent across study sites and thus independent of PAH concentration. Our analyses indicate outdoor PAH concentrations are important determinants of indoor levels. The large difference in measured PAH concentrations we observed among our study sites may therefore be due to variability in PAH sources and regional differences in physical and biological degradation patterns. As a result, our findings should be interpreted with caution because dust PAH may be reflective of relative (rank) exposure more than absolute PAH exposure.

The results of our investigation suggest an increased risk of T cell lymphomas associated with residential carpet dust PAH concentrations, including BkF, specifically. Our results are novel and provide additional support for the importance of epidemiologic research on specific histologic subtypes of NHL in order to distinguish potential etiologic heterogeneity. Further evaluation of the associations between PAH exposures and NHL risk by subtype using validated direct exposure metrics is warranted.