Introduction

Tobacco smoking and alcohol consumption are well established major risk factors for head and neck cancers (i.e. neoplasms of the oral cavity, pharynx, and larynx) [1, 2]. A synergistic effect of these two factors has been also consistently reported [2, 3].

Most of the current evidence on the effect of the interaction of tobacco smoking with alcohol drinking on the risk of head and neck cancers derives from case–control investigations [2, 3]. This interaction was mainly evaluated through step-functions, using cut-points to categorize the two exposures and providing risk estimates for the cross-product of the categorical exposures. Step-function analysis, however, assumes that the risk is constant within each category, which may lead to efficiency loss and interpretation difficulties [47].

The dose–response relations between alcohol consumption and tobacco smoking and head and neck cancers risk have been already investigated by this research group through spline models in two previous studies [8, 9]. In this paper, we propose a bi-dimensional logistic spline model to investigate the interaction between tobacco smoking and alcohol consumption on the risk of head and neck cancers. This approach allows the estimation of a surface, which represents the risk associated to the concomitant exposure to ethanol and tobacco, and could be easily generalized to other continuous exposures.

Materials and methods

Data were obtained from a series of hospital-based case–control studies on head and neck cancers conducted in Italy and in the Swiss Canton of Vaud between, 1982 and 2000 [1014]. These studies had similar designs and inclusion criteria. Briefly, cases were patients below the age of 80 years with incident, histologically confirmed cancer. Cases with a previous history of neoplastic diseases (meta-chronous) were excluded. The control groups included patients younger than 80 years who were admitted to the same network of hospitals where cases had been identified for a wide spectrum of acute, non-neoplastic conditions (e.g., traumas, other orthopaedic disorders, acute surgical conditions, disorders of nose, ear, eye, skin, or teeth). Patients admitted to hospital for any condition related to tobacco smoking or alcohol drinking, or with any disorder that might have induced long-term modification of diet were excluded from the control group. Trained nurses carried out all interviews in hospital settings, thus keeping the refusal rate of eligible cases and controls below 5 % in Italy and about 15 % in Switzerland. Controls were frequency-matched to cases according to area of residence, age, and year of interview. Patients could have served as controls for more than one study. The original control group was considered for site-specific analyses whereas, for pool estimates, the control groups of each study were collapsed and patients enrolled in more than one study were considered only once.

Similar questionnaires, including information on age, education, other socio-economic factors, a problem-oriented medical history, usual diet, alcohol drinking, and smoking habits were used in each study. Information on smoking included lifetime status (never, former, or current smoker), daily number of cigarettes/cigars and grams of tobacco pipe smoked, age at starting, and duration of the habit. Smokers were subjects who had smoked at least one cigarette/day for at least 1 year. In our computations, one gram of pipe-smoked tobacco corresponded to one cigarette, and one cigar to three cigarettes. Former smokers were defined as those who abstained from any type of smoking since at least 12 months before cancer diagnosis or interview (for controls). In addition, the weekly numbers of drinks for several alcoholic beverages were elicited from the subjects. Taking into account the different ethanol concentration, one drink corresponded to approximately 125 ml of wine, 330 ml of beer and 30 ml of hard liquor (i.e., about 12 g of ethanol). Total alcohol intake, expressed in grams of ethanol per day (g/day), was computed as the sum of all reported alcoholic beverages. “Never drinkers” were patients who abstained from drinking lifelong; “Ex-drinkers” were individuals who had abstained from drinking for at least 12 months at the time of cancer diagnosis or interview (for controls). Satisfactory reproducibility of questions on self-reported smoking and drinking habits in our study populations has been previously reported [15, 16].

The main aim of this paper was to evaluate the interaction of alcohol consumption with tobacco smoking using the two exposures as continuous predictors, focusing on low-to medium exposures. Therefore, the analyses were restricted to people who had smoked ≤40 cigarettes/day and to those who had drunk ≤180 g/day of ethanol. This restriction also avoided odd behaviours of risk estimates due to sparse data. Thus, 1569 head and neck cancers cases (491 oral cavity cancers, 491 pharynx, and 587 of larynx) and 3147 controls were included in the present analysis (Table 1).

Table 1 Distribution of 1569 cancer cases and 3147 controls according to gender, age, tobacco smoking and alcohol drinking

The dose–response relationship between the risk of cancer and the combined exposure to daily ethanol intake and cigarettes smoking was investigated through bi-dimensional regression spline models [17]. Briefly, the logit was estimated through a generalized semi-parametric model where the two exposures were included as a smoothly joined piecewise polynomial of defined degree, with constrains for continuity at each join point. Odds ratios (OR) and corresponding 95 % confidence intervals (CI) were also calculated according to the model presented in detail in “Appendix”. An ad hoc SAS macro was developed.

The optimal number of segments was selected putting an increasing number of knots and selecting the model that minimizes the Akaike Information Criterion (AIC) [18]. This selection criterion compromises between a good fit and a simple model by taking into account the deviance and the number of parameters in the model. As a sensitivity analysis, risks estimated for models with higher AIC values were compared: the risk surface was quite stable in relation to modification of the number of knots and knots’ location, reassuring on the robustness of the method.

In the present model, the number of estimated parameters increases exponentially with the degree of polynomial due to an increasing number of interactions, thus requiring an elevated sample size. Therefore, linear splines (i.e. polynomial degree = 1) were considered. Moreover, up to a total of 4 knots were considered for all head and neck cancers; in addition, different locations for the knots were evaluated. When analyses were conducted separately for each anatomical site, up to three knots were considered. The reference category was defined as “Never smokers and Never drinkers”; study area, sex, age at diagnosis, education, smoking and drinking habits (former vs. current) were included as covariates in the models’ equations.

For the sake of comparison, risk estimates for all head and neck cancers were further calculated according to the step-function logistic model. Smoking intensity and ethanol intake were firstly categorized and a term for the cross-product of the categorical exposures was included in the regression equation. Considering the low number of abstainers from alcohol drinking in our population, the reference category for the step-function model was set to never smokers who drunk <25 g/day ethanol (i.e., 2 drinks/day). ORs were adjusted for the same covariates as the spline models.

Results

Cases and controls were predominantly men and reported similar distribution according to age (Table 1). Former smokers represented 25 % of all cancers and 29 % of controls whereas quitting from alcohol drink was less frequent in this study population (l0 % in cancer cases and 4 % in controls). As expected, the frequency of heavy smoker (i.e., 20–40 cigarettes/day) and heavy drinkers (i.e., ≥100 g/day of ethanol) was much higher among cancer cases than among controls.

Risk estimates for head and neck cancers for the best-fitting model were represented through a three-dimensional graph (mesh plot—Fig. 1a). Four knots, represented by thicker grid lines, were identified: two for ethanol intake (48 and 89 g/day) and two for tobacco smoking (5 and 10 cigarettes/day). Among never drinkers and for all levels of ethanol intake, the risk of head and neck cancer steeply increased with increasing smoking intensity, starting from 1 cigarettes/day. Conversely, the increased in risk associated to alcohol intake was less marked: the dose–response relationship between ethanol intake and cancer risk among never smokers showed a threshold effect up to approximately 50 g/day, but this effect was no longer appreciable among smokers (Fig. 1a). The combined exposure to ethanol and/or cigarettes intake led to a steep increase of cancer risk up to a 35-fold higher risk (95 % CI 27.30–43.61) among those consuming 89 g/day of ethanol and 10 cigarettes/day, compared to abstainers from both tobacco and alcohol (Fig. 1a). The highest risk was observed at the highest levels of alcohol and tobacco consumption (OR 172.25; 95 % CI 78.21–379.40).

Fig. 1
figure 1

Mesh plot (a) and contour plot (b) of risk for head and neck cancers by ethanol consumption (g/day) and tobacco smoking (cigarettes/day) estimated through a bi-dimensional regression spline. The reference category was defined as “never drinkers and never smokers”. Equation model includes terms for sex, age, centre, drinking and smoking habits (former vs. current). Surface is shown for best-fitting splines according to Akaike Information Criterion; thicker grid lines show knot locations. Thicker lines on graphic floor represent knots projection. Black lines in contour plots indicate iso-risk curves at defined levels of risk (odds ratios 2, 5, 10, 20, 50, 100)

Risk estimates were also displayed through a contour plot (Fig. 1b), i.e. a two-dimensional graph showing lines that identify combinations of tobacco smoking and ethanol intakes for which the risk of head and neck cancers is constant. A steeper risk increase with tobacco smoking rather than with ethanol intake is outlined.

Risk estimates were reported separately for the three anatomical sites in Fig. 2. A similar pattern emerged for all the considered sites, but the effect of smoking seemed stronger for laryngeal cancer. For pharyngeal cancer, a threshold effect for ethanol intake up to 40 g/day emerged, similarly to that reported for all head and neck cancers; however, this threshold effect disappeared as the number of cigarettes increased.

Fig. 2
figure 2

Mesh plot and contour plot of risk for cancer of the oral cavity (n = 491), pharynx (n = 491), and larynx (n = 587), by ethanol consumption (g/day) and tobacco smoking (cigarettes/day). Risks were estimated through a bi-dimensional regression spline, taking “never drinkers and never smokers” as reference category. Equation model includes terms for sex, age, centre, drinking and smoking habits (former vs. current). Surfaces are shown for best-fitting splines according to Akaike Information Criterion; thicker grid lines show knot locations. Thicker lines on graphic floor represent knots projection. Black lines in contour plots indicate iso-risk curves at defined levels of risk (odds ratios 2, 5, 10, 20, 50, 100)

Table 2 reports the ORs for the cross-products of ethanol intake and cigarettes/day smoked according to the step-function model with lowest and highest ORs estimated through spline models within the same cross-product category. The risk estimates of the two models were generally in agreement, except for the highest levels of the exposures, with ORs from the step-function model between the lowest and the highest ORs from the spline models. Nonetheless, ORs from the step-function model were averages of risks, which may vary considerably within the given cross-product. For instance, the OR for people smoking 1–14 cigarettes/day and drinking 100–180 g/day of ethanol was 21.61 (95 % CI 14.90–31.36) according to step-function model, whereas, according to spline model, the risk varied within that exposure categories from 3.89 to 61.41.

Table 2 Odds ratio (OR) and 95 % confidence intervals (CI)a for head and neck cancers for tobacco smoking and alcohol drinking according to step-function regression model and bi-dimensional regression spline

Discussion

Our results show that bi-dimensional regression spline models are a feasible and flexible method to estimate the interacting dose–response relationship between two continuous exposure variables, with higher power to detect the variation in risk estimates than traditional step-function models. Two- and three-dimensional graphical representations were provided (i.e., mesh plot and contour plot) to display the risk of head and neck cancers associated to the combined exposure to alcohol drinking and tobacco smoking.

Results from the present analysis were consistent with previous findings from the same data [8, 9]. In particular, the threshold effect of alcohol drinking observed in all head and neck cancers and in pharyngeal cancer for low consumption holds true only among never smokers. Conversely, among tobacco smokers, cancer risks increased across the whole range of ethanol intake starting from a very low number of cigarettes/day [9]. The risk found among heavy smokers and heavy drinkers was very high compared to never smokers/never drinkers, in agreement with what reported in a previous study from the same study areas [19]. Of course, the results for all head and neck cancers taken together should be considered with caution since they represent the risk pattern of a population with the same distribution for each cancer site.

Different dose–response relationship emerged for the three anatomical sites considered. The number of knots varied according to cancer site, as well as their location, ensuring the power of the methods to capture different relationship between the two exposures. In particular, the threshold effect for low ethanol consumption in never smokers emerged only for pharyngeal cancer and the effect of tobacco smoking was more relevant for laryngeal cancer, as previously reported by several studies [2, 9].

The standard approach to investigate interacting risk in case–control study is through logistic step-function. The two exposures were firstly categorized and risk estimates were calculated for the cross-product of the two categorical exposures, assuming that the risk was constant within each cross-product category. In this paper, we have compared risk estimates from step-function and from spline models, highlighting that the assumption of risk steadiness within the cross-product category was rarely respected. Bi-dimensional spline allowed to overcome this limitation, since the risk may vary within each cross-product category according to the polynomial degree. Moreover, the selection of the number of knots and their location may increase model flexibility. In addition, step-function did not allow the estimation of risk using never smokers who never drunk as reference category, as the sample size was inadequate for the model (i.e. 24 cases, 1.53 %). For this analysis, the inclusion in the reference category of never smokers who drunk <25 g/day of ethanol did not actually impact risk estimates, as they are not at increased risk of head and neck cancers. For the sake of caution and parsimony, only linear splines were considered in the present analysis. However, both the theoretical model and the developed SAS macro can be generalized to higher polynomial degree and to any other continuous exposure.

This method is affected by most of the limitations of univariate spline models. Firstly, the choice of some model setting parameters (e.g., maximum number of knots, highest polynomial degree, criteria for knots location) is arbitrary. Indeed, better estimates could be obtained forcing the allocation of knots, but this required an a priori knowledge of where the knots should be located. However, the selection of boundaries seems to be of less importance as compared to step-function analysis, since the flexibility of the model could partly attenuate a bad selection of knots [6]. Secondly, the number of model parameters greatly increases with the power of the polynomial and with the number of knots, thus limiting the potentiality of the methods in absence of very large databases. The present pool analysis seemed to have an adequate sample size (at least for all head and neck cancers combined), which should be accounted for among the strengths of the study. Moreover, although confidence intervals were estimated, they could not be represented through the proposed plots. Finally, we did not formally investigate whether the two exposures interacted in an additive or multiplicative way. Although this would have been possible through standard approaches (e.g., synergy index S), the great variability of risk estimates in the highest levels of exposures suggested caution.

Results of the present analysis may suffer from limitations common of a case–control design such as recall bias, selection bias, and use of hospital controls [20]. However, the reproducibility of alcohol intake and tobacco smoking was good in our study [15, 16]; in addition, the exclusion of very high exposures (i.e., >180 g/day of ethanol and >40 cigarettes/day) allowed us to rule out most of the possible misclassifications and the effect of confounding factors connected to severe alcoholism and/or heavy tobacco smoking [21, 22]. The almost complete participation of cases and controls and a direct interview through trained nurses contributed to the strength of the study.

Results from the present study strengthened the previous findings on the effect of tobacco smoking and alcohol drinking on the risk of head and neck cancers. This study also provided evidence that flexible models, such as bi-dimensional spline ones, may help to detect interactions between two continuous exposures without strong a priori hypothesis on the shape of such relationship.