Introduction

Alcohol use by adolescents and young adults is one of the most costly and largest risk factors of mortality and morbidity in established market economies (Rehm et al. 2006). In developing countries, other risk factors such as undernutrition and water hygiene are more important (Ezzati et al. 2004). ‘Risky single occasion drinking’ (RSOD, also called binge drinking or alcohol use leading to intoxication) is a main factor associated with many consequences such as injuries, unprotected sexual activities, and unfavorable academic development among young people (Hingson et al. 2005).

Brief intervention is a cost-effective, individual-centered preventive strategy with 1–3 short sessions (Babor et al. 2010), often based on techniques of motivational interviewing (MI, Miller and Rollnick 2002). Avoiding confrontation and argumentation, this approach accepts young people as self-determined individuals without lecturing them or declaring ultimatums, and provides an atmosphere for self-directed behavior change (Tevyaw and Monti 2004).

Research on brief alcohol interventions (BAI, Babor and Higgins-Biddle 2001) has shown some promising (Larimer et al. 2004; Tevyaw and Monti 2004) though equivocal (Handmaker et al. 1999; Murphy et al. 2004) findings among adolescents and young adults. However, few BAI studies have focused on RSOD (Kaner et al. 2007), and findings are inconclusive (Whitlock et al. 2004). The present study looks at the effectiveness of BAI delivered in two 45-min sessions focusing on RSOD, commonly defined as drinking 5 or more drinks on an occasion for men, and 4 or more drinks for women (Gmel et al. 2011).

Despite its potential, individual-centered BAI with one-on-one counseling is costly and time-consuming. The present study describes a group approach where each counselor conducts BAI simultaneously with several individuals. Advantages of group interventions are direct feedback through peers (LaBrie et al. 2007), strengthening of behavior change through reinforcement by group members (Foote et al. 1999), and bolstering autonomous motivation resulting in stronger individual change than is found in personal consultations (Ryan et al. 1995). However, a meta-analysis of MI-based group interventions suggests less efficacy in group interventions compared with personal consultations (Lundahl and Burke 2009), and McCambridge et al. (2011) recently found no significant effects from MI classroom sessions.

Group BAI sessions have been used, but were commonly not MI-oriented, not designed for younger individuals, and not very brief (often around 10 sessions, e.g. Bradley et al. 2007; Engle et al. 2010), making them impractical in most school settings. There were some MI-based group approaches designed for younger audiences and consisting of fewer (4–6) sessions (Fields 2006; D’Amico et al. 2010), but results were mixed. A few of the reviewed brief (3 or less sessions) group interventions on alcohol use by adolescents and young adults that showed small beneficial effects (Schmiege et al. 2009) commonly used no control group designs. One study (LaChance et al. 2009) found alcohol use reductions after 3 and 6 months, but other research showed short-term effects that disappeared after 6 months (LaBrie et al. 2009; Bachmann 1999).

An objective of the present study was to test a brief group intervention that can be directly implemented in the naturalistic setting of schools, providing information on effectiveness. It was important to conduct the intervention without using additional resources than those provided by the schools. The aim was to increase external validity for the intervention and not the internal validity as in many efficacy trials, where additional resources such as space outside of schools, personnel for screening, assessment and data collection are used.

Methods

Intervention design

The intervention was designed for groups of 8–10 students, grades 10–13, in secondary schools (i.e. vocational schools and gymnasiums preparing for high schools and universities) within the Zürich canton. It consisted of two 45-min intervention sessions (see Table 1 for details) based on MI techniques (Miller and Rollnick 2002) and BAI strategies suggested by Babor and Higgins-Biddle (2001).

Table 1 Content, aims, and description of intervention

Counselors were 12 experienced collaborators from the alcohol and drug help information centers of Winterthur, Zürich City and Zürich Oberland. All counselors have conducted brief interventions before, and they were familiar with MI principles, which were rehearsed during a half-day course prior to intervention.

Baseline assessment took place between August 2008 and March 2009. Intervention sessions were held 2–3 weeks later, with 6-month follow-ups occurring between February 2009 und November 2009.

Intervention objectives

The three objectives were to reduce an individual’s RSOD frequency, maximum number of drinks on a single occasion, and overall weekly consumption at 6 months.

Intervention outcomes

The primary outcome measure was the frequency of RSOD occasions, where one occasion was defined as 4+ drinks for females and 5+ drinks for males. Frequencies were measured on an 8-point scale ranging from never to 12 or more occasions. Midpoints of categories were used (e.g. 3.5 for 3–4 occasions, and 13 for 12 or more occasions).

Secondary outcome measures referring to the past 30 days were the total number of standard drinks in a typical week (Monday–Sunday) and the maximum number of drinks on a single occasion. Examples of standard drinks containing 10–12 g of ethanol were provided for beer, wine, spirits, alcopops and cocktails (e.g., Whiskey-Cola, Caipirinha), along with conversion values (e.g. 3 cans of 0.5 l beer = 6 standard drinks).

Sample size and power considerations

International school surveys revealed design effects of about 1.2 (Hibell et al. 2003; Roberts et al. 2004), representing an approximate Intra Class Coefficient (ICC) of 0.01. Generally, ICCs are smaller than 0.10 (Bland 2000). We based the a priori power calculations on ICCs of 0.05. As no information on effect sizes for secondary and vocational school students could be found in the literature, we drew on a pilot study pointing to approximately 50 % of students at risk, i.e. having more than two RSOD in the past 30 days (Nelson et al. 2005). To achieve power at 0.90 with a significance level of 0.05 in our study, a sample size of 49 intervention classes and 49 control classes was needed (i.e. a total of 1,940 students at 20 per class) to show a 20 % difference between intervention and control groups at follow-up (Bertholet et al. 2005; Kaner et al. 2007). Power calculations including cluster sampling were also conducted for t tests on RSOD and mean number of drinks, which resulted in a need for <49 clusters described above. These computations were done with Pass 2008 (Hintze 2008) software, but no program was available to perform power estimations for regression analysis including cluster sampling directly. For standard logistic regression assuming a conservative OR of 1.5 and 3 independent variables (intervention, age and sex) with an assumed R 2 of 0.2, 380 students would be needed. With an assumed design effect of 1.2, this meant 456 students. Fewer individuals would be needed for regressions and analysis of variance for continuous measures. Using a conservative estimate, we planned to include 60 classes each for intervention and control conditions.

Study population, sampling and quasi-randomization

Since obtaining permission from teachers and school principals is time consuming, the largest of the 43 schools in the canton were contacted first. Selection stopped after the intended number of 120 classes was approximated. Three schools in the city of Zürich and six in rural areas provided 118 classes, but five classes withdrew their initial consent because of other priorities such as urgent exams, i.e. 113 (95.7 %) participated. Schools generally offered full participation (i.e. primary sampling units were schools), but only classes with students still in school at follow-up were included.

Four of the nine schools participated with full research protocol compliance, providing 78 classes for randomization to intervention and control conditions. Three schools agreed to the condition that all their classes receive an intervention; two others agreed only to the condition that all their classes were controls. Forty classes from these five schools were included in the study.

Only classes of the first four schools could be used for random allocation, and a random number between zero and one was generated for each class. Those with the highest numbers were assigned to the intervention condition and those with the lowest numbers to the control condition. The number of intervention and control classes was chosen to match the total number of intervention and control classes across all nine schools, including the five schools furnishing only control or intervention classes. Randomization was also stratified to match the three school types (high schools and vocational schools with either mercantile or industrial–commercial apprenticeship specializations). Across all nine schools, the three types had nearly equal (±1 class) representation under the intervention and control conditions.

According to baseline drinking patterns, homogeneous groups of 8–10 students were formed for receiving the intervention, i.e. “medium risk”, with 3 or 4 RSOD occasions during the last 30 days, and “high risk”, with 5 or more RSOD occasions. Intervention students came from the same or parallel classes within schools. Those not at risk (<3 RSOD) received a lecture on general health issues. Controls were students from control classes with the same definition of risks (see Fig. 1).

Fig. 1
figure 1

Flow diagram

There were no exclusion criteria. Questionnaires were distributed with blank envelopes that students could return sealed even if the entire questionnaire was unanswered. Ethical approval was obtained from the responsible department in the Mittelschul- und Berufsbildungsamt des Kantons Zürich (Cantonal Office for Secondary Education Zürich, COSEZ). At the end of the consultation, students were encouraged to continue talking about alcohol use, and addresses of help and information centers were provided, as were business cards of counselors in the event that continued one-to-one counseling was wanted. Since baseline, only seven intervention students and six controls initiated any helpdesk or psychological counseling services. Given the small and nearly equal numbers, this should not have affected the results.

Implementation and blinding

Throughout the study, there was a strict separation of the trial evaluators (GG, FL) and those providing the intervention. Whereas the enrollment of schools was done by COSEZ, the quasi-randomization and allocation of students to groups was completed by the evaluators, without any contact with counselors or school officials.

At baseline, a 5–10 min questionnaire of seven items was distributed in classes. One-to-one codes (a combination of own birthday and first name of a grandmother) were created by each student and placed with the completed questionnaire in sealed envelopes which were opened later at the trial evaluation site. There, lists of the codes with allocations to classrooms were provided to school officials, showing where the interventions or the general health lectures took place. Officials were blinded to the content of the corresponding classroom sessions. Similarly, schools with random allocation were blinded to randomization of classes, and schools and counselors were blinded to allocation of students into groups.

Statistical methods

To test for equivalence at baseline between intervention and control individuals, Chi-squared tests for categorical variables (sex, and at-risk status) and t tests for continuous variables (frequency of RSOD, maximum number of drinks on an occasion and total number of drinks in a typical week) were used. For the analysis of attritors (those lost to follow-up), logistic regressions for categorical, and analysis of variance for continuous measures were used. Main effects in the attrition analysis show whether attrition is associated with sex (logistic regression analysis) or with age (analysis of variance), and the interaction effects show whether attrition was differential among intervention and control groups (attrition bias).

Analysis of baseline equivalence between intervention and control individuals or between attritors and non-attritors conservatively did not account for cluster sampling. This strategy increased the probability of finding non-existent differences (Type 1 error). Baseline equivalence and lack of attrition bias was assumed for tests with p > 0.10 (Petitti 2000).

Multiple linear regressions were used in the main analyses to show outcome differences between intervention and control groups; of interest were frequency of RSOD, total number of drinks in a typical week and maximum number of drinks on a single occasion. Intention to treat analysis with the last value carried forward was used. Models accounted for cluster effects (STATA version 11 software with the “robust cluster” option) and were adjusted for age and sex. No variables were transformed, since sample sizes were sufficiently large to justify analysis of skewed variables (Lumley et al. 2002), and less skewed change scores were used. Main outcomes were considered significant at p < 0.05.

Multiple regressions were used in ancillary intention to treat analyses on the same alcohol outcomes as in the main analyses, separately for the medium and high risk groups and for both sexes. Significance was set at p < 0.10 for these subgroup analyses, which were underpowered to detect differences at the conventional p < 0.05 level.

Results

Participant flow and baseline data

Quasi-randomization of classes did not create baseline equivalence on the individual level (Table 2). Students in control classes were significantly older and more often male than those in intervention classes (age and gender were subsequently adjusted). There was no significant difference for risk status (χ2 = 1.31, df = 2, p = 0.52) or alcohol outcomes (frequency of RSOD, number of drinks in a typical week or maximum number of drinks on a single occasion).

Table 2 Baseline characteristics at cluster (class) level, including students not at risk

Seventy-nine individuals from intervention classes and 103 individuals from control classes were lost to follow-up. Since intention to treat was used with the last value carried forward, the analysis included 338 at-risk students of 973 (34.7 %) students in intervention classes and 330 (37.3 %) of the 885 students in control classes.

Attrition analysis (Table 3) showed that individuals dropping out were significantly more likely to be at risk, but there was no differential dropout between intervention and control groups (Wald test for interaction = 0.01, p = 0.945). Among at-risk students, attritors were more likely to be older (F = 4.89, p = 0.027) and to have more frequent RSOD (F = 5.67, p = 0.018), but attrition overall was not differential (p > 0.10) regarding sex, number of drinks in a typical week, or maximum drinks on a single occasion.

Table 3 Differences on baseline measures: attritors versus non-attritors and intervention versus control

Main and secondary outcomes

Although the intervention and the control groups reduced their drinking (Table 4), no significant change was observed for the main outcome of RSOD frequency (b = −0.210, robust SE = 0.263, p = 0.427). Similarly, no significant changes were obtained for the secondary outcomes of number of drinks in a typical week (b = 0.534, robust SE = 1.134, p = 0.639) and maximum number of drinks on one occasion (b = 0.190, robust SE = 0.236, p = 0.423). The same was true for the analysis stratified by sex (not tabulated); results were highly non-significant (p > 0.4) for both men and women.

Table 4 Drinking outcome difference from baseline to follow-up by intervention groups (ITT)

Ancillary analysis

Analyses stratified by risk groups revealed differential results (Table 5). Beneficial effects of intervention were found for the medium risk group; however, they reached only borderline significance for frequency of RSOD (b = −0.553, robust SE = 0.295, p = 0.064) and number of drinks in a typical week in the past 30 days (b = −2.824, robust SE = 1.480, p = 0.059). This represents a 14 and 17 % reduction (respectively) over that of the control group. In gender-specific analyses (not tabulated), effects were in the same direction for males and females. Reduction of RSOD was significant only for females (b = −0.753, SE = 0.307, p = 0.017), whereas only males showed reductions in number of drinks in a typical week (b = −4.403, SE = 2.178, p = 0.047). Gender differences were non-significant (p > 0.4) for maximum number of drinks. After Bonferroni corrections for multiple testing, none of these effects were even of borderline significance, except for RSOD among women.

Table 5 Drinking outcomes differences between baseline and follow-up time by intervention groups and at risk subgroups (ITT)

Changes among the high-risk group were in the opposite direction; controls had greater reductions in consumption compared with the intervention group. Effects were significant for number of drinks in a typical week (b = 3.488, robust SE = 1.698, p = 0.042), and reached borderline significance for maximum number of drinks on one occasion (b = 0.537, robust SE = 0.306, p = 0.083). Gender-specific analyses (not tabulated) were in the same direction for both genders but revealed a significant increase in number of drinks in a typical week only for males (b = 4.980, SE = 2.410, p = 0.042) and a borderline increase in maximum number of drinks only for males (b = 0.592, SE = 0.342, p = 0.087). RSOD frequency remained virtually the same for control and intervention groups, and effects were non-significant (p > 0.4) in analyses stratified by gender. None of these results were significant after Bonferroni corrections.

Discussion

The present study is one of the few using brief alcohol interventions delivered to groups of students. The research aimed to provide evidence for the usefulness of this type of intervention. In Switzerland, evidence on feasibility and effectiveness is required before implementation of large-scale interventions. To demonstrate feasibility, the intervention was set up in a way that can be repeated in all schools. It was not designed as a “laboratory” trial to increase internal validity, e.g. setting up the intervention site outside of regular class times and rooms, or using additional research staff or resources such as laptops for data collection. Unfortunately, the unsolicited participation of schools was necessary to provide feasibility evidence in the current trial, and certain standards for state-of-the-art randomized clinical efficacy trials could not be met (e.g. random selection of schools or perfect random allocation of classes to control and intervention conditions). We believe that these shortcomings were counteracted by showing that these interventions can be accomplished within the standard curriculum. Therefore, external validity is high, but can only be generalized to secondary schools in the German-speaking Zürich Canton.

The findings of the present study went in different directions. Strictly speaking, there were no significant results for at risk-drinkers taken as a whole. However, there were divergent findings for ‘medium risk’ (3–4 RSOD) and ‘high risk’ drinkers (5 or more RSOD). The intervention yielded consistent reductions in alcohol use for medium risk group, reaching borderline significance (p < 0.10) for the main outcome of RSOD frequency and the secondary outcome of number of drinks in a typical week. Reductions were about 15 % greater than in the control group and are consistent with effect sizes described in BAI meta-analyses (Bertholet et al. 2005) and other school-based prevention programs (Faggiano et al. 2010). However, it should be noted that after Bonferroni corrections for multiple testing nothing was significant at the nominal p < 0.05 alpha level.

Alcohol use within the high risk group seemed to increase following BAI compared with controls and was significant for number of drinks in a typical week (p < 0.05), and borderline significant for maximum number of drinks on an occasion (p < 0.10). Virtually no differences were found for frequency of RSOD.

Generally, a conclusion of the BAI literature is that there are either beneficial or null effects, but none that are detrimental (Larimer et al. 2004; Tevyaw and Monti 2004; Handmaker et al. 1999; Murphy et al. 2004). Although it is unlikely that group BAI will produce detrimental effects, the present study suggests using caution regarding high-risk subgroups.

An explanation for some apparent detrimental effects on high-risk drinkers has been given by McCambridge et al. (2011). Responses to alcohol use questionnaires are subject to different biases, e.g. social desirability (e.g. Davis et al. 2010), faulty recall (Gmel and Daeppen 2007), or assessment reactivity (Walters et al. 2009). It may be that heavy drinkers “learn” from feedback during the intervention that others also have equally high (or higher) consumption levels. At follow-up, they may be more prone to revealing their “true” consumption (reduction of social desirability bias), to remember more frequent or heavier drinking (reduction in recall bias) or to assess their consumption levels more accurately (assessment reactivity). Two observations support this argument. First, negative effects were found for number of drinks during a typical week and maximum drinks on an occasion, but not for frequency of RSOD, which occurs mostly on weekends with regularity and is probably easier to remember. RSOD frequency may also be less socially undesirable to report (e.g. “everybody drinks on weekends”) than the actual amounts consumed during these occasions. Second, counselors often alluded to this phenomenon by reporting that during the first intervention session, self-reported alcohol levels were lower. By the second session, students had more confidence and were more open in their discussions with others, and admitted drinking more heavily on some occasions. If these mechanisms were actually in play, then more accurate self-reporting at follow-up may have masked any real reductions in alcohol use due to the intervention.

Drawing conclusions from this trial depends on the interpretation of negative effects obtained in the “high risk” group. Babor and Higgins-Biddle (2001) have noted that BAI is not designed for individuals with very heavy drinking, and many BAI trials either have excluded the heaviest drinkers or (if included) have found lower effect sizes (Kaner et al. 2007). It may be that BAI does not work well on heavy drinking secondary school students. There is also some discussion in the literature regarding the potential for iatrogenic effects of group interventions (Dishion et al. 2002). However, a meta-analysis was not supportive of iatrogenic or deviancy training effects (Weiss et al. 2005). If there is truly a case for iatrogenic effects, then group BAI should not be recommended without incorporating dedicated measures for heavy drinking groups into the intervention design. Model programs have been advanced by organizations such as the U.S. Substance Abuse and Mental Health Services Administration (SAMHSA), e.g. Active Parenting of Teens, Motivational Enhancement Therapy, Adolescent Community Reinforcement Approach, Multidimensional Family Therapy or Twelve Step Facilitation Therapy. These programs go far beyond the BAI of the present study and require a lot more effort and additional resources. Although BAI should not be seen as an alternative to such programs or a replacement for alcohol education in early age, it may add to the spectrum of possible interventions at relatively low expense.

One-on-one interventions have the advantage of providing a personalized and confidential setting in which counselors can focus on an individual’s change talk to stimulate behavioral change (Miller and Rollnick 2002). In contrast, group interventions have greater cost-effectiveness and the ability for implementation in secondary schools. A larger majority of younger students can be reached without introducing undesirable “side effects”, such as stigmatization by classmates or teachers. Several theoretical arguments can also be made for group interventions, such as direct peer feedback (LaBrie et al. 2007), reinforcement by peers that strengthens autonomy within the group (Foote et al. 1999), and enhancement of autonomous motivation (Ryan et al. 1995). On the other hand, social dynamics in the classroom might repress openness among shy students. Feedback of counselors, however, confirmed some very lively discussions among participants, and shy students may nevertheless have benefitted from these discussions even though they seldom joined in. Our study showed that group BAI may be beneficial for some heavy drinkers, but may be detrimental for the heaviest of them. To scientifically establish that benefits without harm exist, designs are needed that can test for reduction of social desirability and recall biases or assessment reactivity in group sessions affecting self-reports of consumption. However, if the apparent increase in alcohol use within high-risk students shown here is confirmed, then group BAI either should be discontinued or at least modified for heavy drinkers.