Three common and debilitating mental health conditions frequently emerge during adolescence: anxiety, depression and eating disorders (Fairweather-Schmidt & Wade, 2014, Teesson et al., 2014). One promising approach for prevention is mindfulness-based programmes (MBPs) that explicitly teach skills to counteract shared risk factors underpinning these mental disorders: emotional dysregulation (Aldao et al., 2010), rumination (McEvoy et al., 2013) and self-critical perfectionism (Egan et al., 2011). With their capacity to reach large numbers of youth, enthusiasm is therefore growing for widespread dissemination of mindfulness via universal prevention programmes in schools. There is accruing evidence for universal prevention programmes such that creating a small shift in risk factors or disease status has a larger population effect on wellbeing than targeting high-risk individuals (Cuijpers et al., 2008; Huppert, 2009).

However, the application of MBP as a prevention approach in youth remains nascent, with investigations rarely moving beyond early stage evaluations with low methodological rigour (Dunning et al., 2018; Shute, 2019; Tan, 2016; Volanen et al., 2020). While a series of meta-analyses (Kallapiran et al., 2015, N = 11 studies; Klingbeil et al., 2017, N = 76 studies; Maynard et al., 2017, N = 35 studies; Zenner et al., 2014; N = 24 studies; Zoogman et al., 2015, N = 20 studies) collectively support small to moderate benefits in youth across a range of outcomes such as psychological symptoms, externalising behaviour, social competence and measures of cognition, many of these analyses include early stage, small, unpublished, uncontrolled or non-randomised studies (Dunning et al., 2018). Only one meta-analysis has exclusively examined randomised controlled trials (RCTs; Dunning et al., 2018, N = 33 studies), reporting small to moderate positive effects on mindfulness, cognitive function, negative affect and negative behaviours (Cohen’s d = 0.16–0.47). The studies, however, included at-risk and clinical populations together with general education settings, where effects can be expected to differ due to floor effects. Another meta-analysis examined MBPs exclusively in mainstream school settings (Carsley et al., 2018; N = 24 studies) but included non-randomised studies. Results showed small to moderate effects on mental health outcomes (Hedges’ g = 0.24).

In the aforementioned analysis, interventions for older adolescents (15–18 years) had greater effects compared to middle childhood (6–10 years) with an absence of effects during early adolescence. The meta-analysis by Dunning et al. (2018) also reported an age effect, with larger effect sizes for older students on executive function, but larger effect sizes in younger students on negative behaviour. Also suggesting greater effectiveness for older than younger youth are the five investigations that have so far tested MBP approaches in large, universal, RCT designs with follow-up in mainstream secondary schools (Supplementary Table S1). Reasons for a differential age effect could include the following: (1) newly emergent conceptual thought processes that are not yet sufficiently advanced in young adolescents to benefit from adult-derived MBPs; (2) the dip in emotional regulation that occurs at this age with puberty (Cracco et al., 2017; Hagler et al., 2016; Zimmermann & Iwanski, 2014) lessening effectiveness and (3) lower stressors at this younger age that may reduce perceived relevance and therefore engagement.

It may also be that younger adolescents would benefit from increased classroom dosage rather than the brief “taster” format of the Dot be UK curriculum (Johnson et al., 2016, 2017a; Volanen et al., 2020). Reducing the intensity of adult MBPs (2.5 h per week, 30 min daily practice and day-long silent retreat) is appropriate and necessary in terms of adolescent developmental capacity (Tan, 2016). However, in the absence of dismantling research and developmental models of mindfulness, it is unknown which active ingredients should be retained in these diluted programmes (Felver et al., 2016). Thus, dose and content of at least some adolescent programmes may be insufficient.

In order to test both increased dose and compare age bands, Johnson and Wade (2019) conducted a feasibility trial of a more intensive 8-week Belgian curriculum: Mindful Teens (Dewulf, 2013). With longer weekly lessons, more in-class meditation and inquiry, and frequent repetition of key practices, the intervention was delivered to early- (Mage 13.5) and mid-adolescents (Mage 15.5 years) by an experienced external facilitator. Compared to 10-min meditations (seated and lying body scan) in earlier studies testing the Dot be curriculum (Johnson et al., 2016, 2017a), meditations in the Mindful Teens feasibility study ranged in length from 20 (e.g. sitting with guided awareness of breath, sounds, thoughts and emotions, including short periods of silence) to 30 min (e.g. yoga, body scan). The longer lessons were difficult to accommodate within school timetables, but lesson content was rated as agreeable by students and developmentally appropriate by attending school staff, across both age bands. No adverse effects were reported. Results from this small pilot sample (N = 90) showed moderate reductions in depression and anxiety at 4-month follow-up (Supplementary Table S1). Although underpowered, moderation analysis suggested that the improvements were only in the older students.

The aim of the current study was to examine whether the promising effects of this more intensive 8-lesson mindfulness curriculum in a universal approach could be replicated in a larger adolescent sample. We hypothesised that (1) the programme would improve mindfulness together with a range of psychological outcomes (anxiety, depression, eating disorder risk, wellbeing) and (2) improvements, and acceptability of the programme, would be greater for mid- than early-adolescent students.

Method

Participants

Three Adelaide schools participated; two (school A, B) had participated in our pilot study but were not teaching mindfulness programmes; school C was approached to broaden our sample to include a public school. Schools were classified into socioeconomic (SES) bands using the Index of Community Socio-Educational Advantage (ICSEA) whereby 1000 represents the mean, with a SD of 100 (Australian Curriculum Assessment and Reporting Authority, 2012). Schools A (Private; ICSEA 1153) and C (Public; ICSEA 1125) were urban and classified as high SES (greater than one SD above the mean; Johnson et al., 2016), while school B (ICSEA 1063) was rural and classified as medium SES (within one SD above the mean). Power analysis showed that to detect a between group Cohen’s d effect size of 0.30, with a power level of 0.80, 91 participants per group were required (Hedeker et al., 1999), which to account for our age moderation analyses required a total sample of 364.

Figure 1 shows the flow of participants (N = 476) through the study. Schools A and B provided three and two pairs of classes at each year level, respectively. School C provided one pair of year 8 classes but due to curricular pressure was unable to supply any year 10 classes. Mean age in the year 8 and 10 classes was 13.67 years (SD 0.42; 49.1% female) and 15.52 years (SD 0.37; 46.7% female), respectively. Levels of non-consent were greater in the medium SES school N = 24 (17.1%) than the high SES schools N = 18 (6.1%). A subset of students (N = 161) from schools A and B who had taken part in the first two rounds of the intervention were available for 9-month follow-up data within the trial timeframe. For this subset, mean age in the year 8 classes was 13.61 years (SD 0.32; 48.5% female) and for year 10 students 15.50 years (SD 0.33; 46.7% female).

Fig. 1
figure 1

Flow of participants through study

Procedure

A cluster (class-based) randomised controlled design was used, with assignment to mindfulness or control (usual curriculum) groups performed by the first author using the randomisation function in Excel. Outcome measures were administered on three occasions, 1–3 weeks pre-intervention (T1), 1–7 days post-intervention (T2) and at 3 months post-intervention (mean 12.3 weeks, range 9–17; T3). For a small subset available for 9-month follow-up (T4), mean time post-intervention was 41 weeks (range 39–43).

Research approval was granted by each school principal, the Department of Education (South Australia) and the Social and Behavioural Research Ethics Committee of Flinders University. A dual consent procedure was approved as follows: opt-out for parent and student on receipt of information pack and consent form, and student assent on the day of data collection. Testing was performed in a classroom setting with the teacher and principal researcher present. It was not possible for the students or the researcher to be blind to the allocated treatment group.

Intervention

The 8-week manualised programme Mindfulness Training for Teens (Dewulf, 2013) is designed for adolescents aged 13–18 years and was also used in the school study conducted by Raes et al. (2014) and our earlier feasibility trial (Johnson & Wade, 2019). The once-weekly curriculum is closely modelled on the adult programmes Mindfulness-Based Stress Reduction (MBSR; Kabat-Zinn, 1990) and Mindfulness-Based Cognitive Therapy (MBCT; Segal et al., 2013). Each session includes interactive introductions to mindfulness concepts, informal mindfulness practices (e.g. raisin exercise, 3-min STOP practice), guided seated or lying meditations and facilitator-guided group discussion of experiences. Given the longer formal meditations and more extensive inquiry compared to our earlier trials, a range of safety measures were included. These options were discussed in the first lesson and repeated before meditations: (1) tune out during meditations and take a rest; (2) raise hand and leave the room with the school counsellor who attended all sessions together with the class teacher; (3) see school counsellor following the session and (4) seek external support via parents, kids’ helpline or general practitioner.

During the earlier feasibility trial (Johnson & Wade, 2019), lesson length was 90 min inclusive of a 10-min mid-lesson break (i.e. 80-min content time). School timetabling difficulties for the current trial necessitated further shortening of the lessons to 65–75 min. This was considered acceptable as small volunteer senior groups had been successfully conducted with shorter duration (75 min; no break), and this was also felt to reflect real world curricular constraints for dissemination of this programme at scale. Modifications for the current trial involved minor shortening across all sections of the lesson (formal meditations, discussions, introduction of concepts, home practice explanations). Mindfulness courses were conducted over four consecutive school terms in 2018–2019. The control groups undertook normal lessons (i.e. English, Christian Studies, Health/Physical Education or home class activities. The latter included peer mentoring, time to catch up on personal projects and planning for school or service activities). At the conclusion of the trial, control group students and their parents were offered a free evening mindfulness workshop at their school.

All mindfulness lessons were conducted by the first author (CJ), an adult mindfulness facilitator with over 15 years of personal practice and 4-years experience teaching mindfulness in secondary schools. Given that lesson delivery was by a single instructor (CJ) due to the constraints of a small research team, her fidelity and competence in teaching this curriculum were rigorously assessed during the feasibility and acceptability pilot (Johnson & Wade, 2019) as follows. The authors used a marking rubric based on the adult Mindfulness-Based Interventions Teaching Assessment Criteria (MBI-TAC; Crane et al., 2012) which assesses a combination of adherence and competence across six domains, modified slightly to reflect the classroom environment and assess age appropriate delivery (Supplementary Material). Audio recordings of all lessons were conducted with ethics approval; from this pool, lessons were randomly selected and reviewed by the programme developer (i.e. independent of the research team and blind to outcomes during this process), such that each of the eight lessons was reviewed once. For each lesson, a score out of six was given for each MBI-TAC domain, which were averaged into an overall score. Across the six domains, average lesson rating for this study’s instructor was in the “proficient” band (5.2/6, range 4.7–5.5). A comment was given that meditations were sometimes shorter with less silence than in the original programme. The progression to a fully powered trial (albeit with a new sample) was considered an extension of the pilot, so fidelity was not assessed again for the current trial, given that the same presenter delivered all lessons.

Measures

Course Acceptability

Students

During the last mindfulness lesson, students were invited to undertake an anonymous pen and paper survey, rating the following questions on a 0–10 Likert scale with higher scores indicating greater satisfaction/likelihood: “How would you rate the course in terms of being enjoyable/interesting?”, “How much do you think you have learnt during the course?”, “In the future, how likely are you to use any of the techniques you have learnt?” and “How would you rate the Instructor?”.

Staff

During the last lesson, teachers and school counsellors rated these questions anonymously: “How would you rate the course in terms of being enjoyable/interesting for students?” and “How much do you think students have learnt during the course?”.

Internal reliability for the following scales in the current study appears in Table 1.

Table 1 Descriptive statistics for mindfulness and control groups (N = 434) at baseline, post-intervention and 3-month follow-up

Mindfulness

This construct was measured using the 25-item Comprehensive Inventory of Mindfulness Experiences—Adolescents (CHIME-A; Johnson et al., 2017b) that contains 8 individual factors: Awareness of Internal Experiences (awareness of emotions, e.g. “when my mood changes, I notice it straight away”), Awareness of External Experiences (awareness of environment), Acting with Awareness (awareness of present moment as opposed to being caught up in thinking about past/future), Accepting and Nonjudgemental Orientation (self-kindness with mistakes and perceived weaknesses), Decentering and Nonreactivity (ability to step back from difficult thoughts and emotions and not react immediately), Openness to Experiences (willingness to expose oneself to pleasant/unpleasant thoughts, feelings and emotions), Relativity of Thoughts (recognition of thoughts as transient and subjective) and Insightful Understanding (recognition that subjective interpretation of situation can create or compound difficulty). The scale has been validated for adolescents from age 13 and assesses the last 2 weeks. Items use a 5-point scale ranging from 1 “never true” to 5 “always true”, with higher scores reflecting greater mindfulness.

Depression

The 7-item depression subscale of the Depression, Anxiety and Stress Scale—Short Form (DASS-21; Lovibond & Lovibond, 1995) was used. It shows good fit in non-clinical adolescents (Szabo, 2010; Tully et al., 2009), and items include “I couldn’t seem to experience any positive feeling at all”. Each item is scored on a 4-point scale from 0 “never” to 3 “almost always”, with higher scores reflecting greater depression over the past week.

Anxiety

This was measured using the 7-item Generalised Anxiety Disorder Scale (GAD-7) which demonstrated good psychometric properties during its development and validation (Spitzer et al., 2006) and is recommended for use from age 12 (Quittner et al., 2014). Items include “How often have you been bothered by feeling nervous, anxious, or on edge?”. Items are scored on a 4-point scale from 0 “not at all” to 3 “nearly every day”, with higher scores reflecting greater anxiety over the past fortnight.

Weight and Shape Concerns

The weight and shape subscales of the Eating Disorder Examination-Questionnaire (EDE-Q; Fairburn & Beglin, 1994) were used, which correlate well with the interview format which itself has excellent psychometric properties (Berg et al., 2012). This measure has been used in early adolescent research previously (Wilksch & Wade, 2009). These 12 items (e.g. “How dissatisfied have you felt about your weight?”) use a 7-point rating scale ranging from 0 “not at all” to 6 “markedly”. Questions relate to the last 28 days and higher scores indicate greater concerns.

Wellbeing

This was measured with the 14-item Warwick-Edinburgh Mental Wellbeing Scale (WEMWBS), validated in both university and community adult populations (Tennant et al., 2007) and used in adolescent samples (Kuyken et al., 2013). It surveys the last 2 weeks (e.g. “I’ve been feeling optimistic about the future”). Items are rated on a 5-point scale (“none of the time” to “all of the time”); higher scores signify higher wellbeing.

Data Analyses

All analyses were undertaken using IBM Statistical Package for the Social Sciences, version 23. Data were assessed for normality, and logistic regression analyses conducted for the post-intervention, 3- and 9-month follow-up data to test if any baseline characteristics predicted absence. Logistic regression analyses were also used to check baseline differences between treatment groups. Repeated measures analyses were conducted using Linear Mixed Modelling (LMM), enabling inclusion of cases with missing data via maximum likelihood estimation. Baseline measures were entered as covariates to reduce the impact of variation of baseline scores on between-group effect sizes. This analysis tested the effect of the intervention on mindfulness and other psychological outcomes (Hypothesis 1); year level was then entered into LMM as a moderator to test Hypothesis 2. Acceptability levels for the intervention were compared between the mid- and early-adolescents using independent t-tests, also for Hypothesis 2. For group comparisons, we report the effect size, Cohen’s d, with suggested interpretation as small (d = 0.2), medium (d = 0.5) and large (d = 0.8; Cohen, 1988).

Results

Course Acceptability

At school A, two year 10 students stepped out of lessons with the counsellor, who reported external issues as responsible for the concerns, and five year 8 students asked to withdraw due to perceived religious concerns (i.e. meditation and yoga being contrary to their Christian beliefs) but chose to continue after discussion with their class teacher. One year 10 student at school B asked to withdraw due to disinterest but continued after discussion with the class teacher. At school C, three year 8 students spoke to the counsellor or their class teacher. One was bored and two were experiencing anxiety unrelated to the course. The former student was encouraged by their home class teacher to persist, and the latter two students chose to continue alongside pre-existing external counselling.

Anonymous feedback forms were obtained from students at the final session (N = 228). Mean instructor rating was 8.01 (median = 8; range 4–10). Mean ratings of the course were as follows: enjoyment and interest 5.61 (median = 6; range 0–10), amount learnt 5.98 (median = 6; range 0–10) and likelihood of using mindfulness practices in the future 5.30 (median = 5; range 0–10). These ratings did not differ between year 8 and year 10 students (p > 0.10).

Counsellors and class teachers also provided anonymous feedback (N = 11), with the following mean ratings: perceived student enjoyment and interest 8.36 (median 9, range 6–10) and perceived amount learnt by students 8.12 (median 8, range 5–10). Three staff suggested that the lesson length, concepts and sitting still were challenging for year 8 s and one noted that year 10 groups conducted in term 1 appeared more self-conscious during meditation practices and less engaged in class discussions, perhaps due to lack of familiarity with classmates at the start of the school year.

Quantitative Results

Data for mindfulness (Openness to Experiences facet), depression, anxiety and weight/shape concerns were positively skewed; square root transformations attained acceptable normality (Supplementary Table S2), following recommendations by Tabachnick and Fiddell (2013, pp. 82). Three mindfulness facets were negatively skewed and required reflect/square root transformations. There were no differences at baseline between groups randomised to treatment and control arms (Supplementary Table S3). Comparable percentages of students in the control and intervention groups were missing at each time point (Fig. 1). Presence at T2, T3 and T4 data collection was not predicted by gender or year level, nor baseline levels of depression, anxiety, wellbeing or weight/shape concerns (Supplementary Table S4). One aspect of mindfulness (Awareness of External Experiences) predicted missingness at T2 (3-month follow-up) such that those who were higher in this facet were more likely to be absent (odds ratio 0.38, 95% CI 0.15, 0.93).

For the combined age groups, descriptive statistics for the mindfulness intervention and the control groups at post-intervention and 3-month follow-up are shown in Table 1. Table 2 presents effect sizes from the mixed model analysis after adjusting for baseline scores. There were no differences between the mindfulness and control groups on any outcome measure at post-intervention, and effect sizes were small (Cohen’s d < 0.17). At 3-month follow-up, one of eight facets of mindfulness (Decentering and Nonreactivity) was lower (worse) in the mindfulness group (d =  − 0.20; 95% CI − 0.39, − 0.01) but there were no between-group differences in other psychological outcome variables (d < 0.13).

Table 2 Mixed model analyses with between-group effect sizes and 95% confidence intervals (N = 434)

Age (year 8 versus year 10) was investigated as a moderator (Table 3). Year 8 students who had undertaken the mindfulness course showed no improvement at post-intervention compared to control (d < 0.22). At 3-month follow-up, those who received the mindfulness intervention were worse in two aspects of mindfulness (Awareness of External Environment, d =  − 0.30; 95% CI − 0.55, − 0.06; Decentering and Nonreactivity, d =  − 0.39; 95% CI − 0.63, − 0.15) and in wellbeing (d =  − 0.25; 95% CI − 0.49, − 0.01) compared to control. There were no significant differences in outcomes for the older students (year 10) who had undertaken the mindfulness course compared to control at either time point (d < 0.22).

Table 3 Mixed model analyses: estimated marginal means for moderator (age) by treatment group (2) by time (2)

Amount of home practice outside of sessions was examined as a potential moderator for all outcomes but no significant results emerged, nor did amount of home practice vary by age group. These analyses are available from the first author upon request.

Extended Follow-up

For the groups involved in the first two rounds of the intervention (N = 161), we were able to collect 9-month follow-up data. Descriptive statistics for this subsample of participants appear in Table 4. Analysis of repeated measures (Table 5) showed all significant differences favoured the control group over the students who received the mindfulness intervention. At 3-month follow-up, Decentering and Nonreactivity was worse (Cohen’s d =  − 0.43; 95% CI − 0.74, − 0.12) as was wellbeing (d =  − 0.41; − 0.72, − 0.09). Wellbeing was also worse at 9-month follow-up (d =  − 0.40; − 0.70, − 0.08) as were weight/shape concerns (d =  − 0.35; − 0.67, − 0.04). Moderation by age group was not investigated due to the small sample size.

Table 4 Descriptive statistics for subset of mindfulness and control groups (N = 161) at baseline, post-intervention, 3- and 9-month follow-up
Table 5 Mixed model analyses with between-group effect sizes for subset with 9-month follow-up data (N = 161)

Discussion

This study aimed to replicate the promising improvements and acceptability ratings from a smaller feasibility study (Johnson & Wade, 2019) testing a more intensive 8-week mindfulness programme that had suggested greater improvements in mid versus early adolescence. Across our whole sample, there were no differences between the intervention and control groups with the exception of one aspect of mindfulness (Decentering and Nonreactivity), which was worse at 3-month follow-up in the mindfulness group compared to controls. With age as a moderator, the younger students (Mage 13.7 years) who received mindfulness lessons were worse in wellbeing and two aspects of mindfulness (Awareness of External Environment, Decentering and Nonreactivity) compared to controls at 3-month follow-up; effects were small. There were no significant differences between the intervention and control groups in the older students (Mage 15.5 years) at either post-intervention or 3-month follow-up. There were no differences between younger and older students on course acceptability. For a small subsample (combined age groups) with 9-month follow-up, deterioration in the mindfulness group was noted for weight and shape concerns and wellbeing compared to controls.

Our prediction that a more intensive 8-lesson curriculum would improve outcomes in adolescents compared to lower “dosage” programmes (Johnson et al., 2016, 2017a) was not supported: neither curriculum yielded benefits across our series of Australian studies. We also hypothesised that improvements would be greater for mid- than early-adolescent students, and a differential effect was found: while there was no improvement in mid-adolescent students, only the younger students showed mild deterioration in wellbeing and aspects of mindfulness.

Mean ratings of acceptability for this study averaged 20% lower than ratings for the feasibility trial with the same course and instructor (Johnson & Wade, 2019). One explanation might be that we conducted courses for the first time during all four school terms. Staff feedback suggested that term 1 (unfamiliarity with fellow students in the class) and term 4 (fatigue with the end of the school year) may be sub-optimal times for this type of intervention delivery. However, this remains to be tested. Another difference in the current trial was shorter lessons and removal of a break. In the earlier feasibility trial, the 10-min break involved playing a fun game, unrelated to mindfulness and different each week, for a small chocolate reward. Perhaps this inclusion fostered valuable group cohesiveness, light-hearted familiarity with the instructor and a helpful pause in concentration. A range of students in the current trial also reported concerns to their teacher or the counsellor regarding ongoing participation. For two students, concerns related to perceiving the non-academic content as irrelevant. For one group within the same class (N = 5), perceived religious conflict was ameliorated after discussing concerns with their teacher, and these students chose to continue. This suggests that more explanation might be needed in terms of the secular nature of meditation, yoga and the use of bells as an anchoring practice. However, taking part in lessons prompted four students to speak to the counsellor or their class teacher regarding anxiety, albeit related to external causes. One of our two earlier studies, using body-based meditations limited to 10 min (Dot be curriculum; Johnson et al., 2016), also found a mild worsening of anxiety in some early adolescents. Given these effects, we postulate that the use of formal meditation in unscreened conscript school students may be unwise for early adolescents, even where preceded by safety instructions. Restricting formal meditation to voluntary programmes for adolescents in the final years of school (e.g. Bennett & Dorjee, 2016) may be safer and more beneficial, with later adolescence suggested as a particular stage when brain networks are more likely to be modified by mindfulness training (Carsley et al., 2018).

It is probable that there is more than one key entry point for mindfulness, together with some resistant periods. Within MBP research, there have now been a growing number of RCTs in primary schools (see, e.g. Black, 2015), although this body of research faces similar methodological limitations together with generally smaller sample sizes. Early adolescence had been suggested as a key window for MBPs due to the capacity for abstract thought before the escalating academic and social stresses of mid–late adolescence (Broderick & Metz, 2009; Kuyken et al., 2013); however, many new barriers emerge at this age compared to the primary years: increasing self-awareness, self-doubt and peer distraction, less eagerness to please and more desire for autonomy, together with rotating class schedules that add implementation challenges and reduce opportunities for reinforcement of practices (Cook-Cottone, 2017, pp. 93–94). Mindfulness training has been conceived as a “neural training regime” (Shapiro et al., 2015) that repeatedly pairs activation between the prefrontal cortex and the limbic system (Zelazo & Lyons, 2012). Perhaps the benefits of strengthening this link may be even greater if this occurs before the developmental mismatch between affective processing and cognitive control that manifests in adolescence (Riediger & Klipker, 2014, p. 197). Evidence suggests that primary aged children enjoy simple MBPs that include learning about the neurobiology of the brain and brief practices to anchor themselves when dysregulated (Vickery & Dorjee, 2016). Further, these programmes may be easier to disseminate at scale using classroom teachers given the relative simplicity of concepts and practices (i.e. not requiring extensive personal meditation practice).

Limitations and Future Research

Limitations of our study largely arose due to the small size of the research team, with one investigator primarily responsible for design, intervention delivery, data collection and analysis. A single (external) facilitator delivered all mindfulness classes which produced consistency across intervention groups, but the skill of the facilitator in this delivery model potentially confounds the effectiveness of the curriculum. During our feasibility study, we therefore undertook the most rigorous fidelity and competency testing in any youth MBP to date, to our knowledge. Indeed, the use of an experienced and proficient instructor—with more mindfulness experience than is suggested for classroom teacher delivery—might have been expected to overinflate improvements in the current trial. The use of a single facilitator also necessitated staggered intervention delivery over a 12-month period, which impacted on the length of follow-up available for the whole sample (3 months) during the timeframe of this trial. While universal interventions may yield both a prevention and treatment effect across the spectrum present within a classroom, the former may take at least 6 months to emerge (e.g. Nehmy & Wade, 2015; Raes et al., 2014).

Difficulty with school recruitment, particularly for older students, was also reflected in our study. This impacted on eventual sample size for moderator analyses, with our year 10 mid-adolescent groups dipping below our a priori power calculations (81–86 students versus desired 91 students in each of the four cells), resulting in underpowered moderation analyses. Our study design involved clustering at the class level within schools to allow matching of demographic variables and control for school contextual differences; however, there is a risk of contamination within schools with this design. We considered this risk low given (1) the experiential nature of the course; (2) delivery by an external facilitator and (3) occurring in schools without existing mindfulness curricula and widespread knowledge of this approach; however, the potential effects of this cannot be excluded. We did not account for clustering in our power calculation, given that the same presenter delivered all lessons, and each school provided demographically matched pairs of intervention and control groups. Due to school timetabling challenges and therefore lower final participation rates, we were powered to detect an effect size of 0.30 rather than 0.25; the latter is typical for universal school-based studies (e.g. Kuyken et al., 2013; Raes et al., 2014).

Further limitations include the design employing passive (usual lesson delivery) rather than active controls. In addition to any non-specific intervention benefits, students receiving the mindfulness intervention may have actually missed out on classes they enjoyed or which also contributed to improved wellbeing (e.g. physical education), or lost extra time to complete homework compared to their peers, which may have elevated stress. Our use of anonymous surveys and explaining that there were no “right or wrong answers” reduced common method bias with our self-report surveys. However, due to constraints of school timetables, the need to minimise survey length for adolescents and the use of validated scales, we were not able to separate measurement of predictor and outcome variables (temporally or by source) nor employ different response formats to eliminate other potential common method biases (Podsakoff et al., 2003). Finally, the anonymous design of our student feedback surveys precluded us from any quantitative analysis utilising acceptability ratings (testing whether term of intervention delivery impacted engagement and whether engagement impacted outcomes). We recommend collecting course acceptability ratings linked to individual outcomes in future research designs.

Ongoing research into acceptable and effective adaptations of MBPs for early to mid-adolescents is needed (Carsley et al., 2018; Volanen et al., 2020). Based on our research results and classroom experience during four school-based RCTs, we propose that “overlearning” a small number of key concepts (Batterham et al., 2018; Scott, 1992) may be more helpful for teens still developing abstract thought capacity during a developmental dip in emotional dysregulation. The following ideas are proposed for future research into MBPs for this age band: (1) a smaller number of key ideas repeated and reinforced during activities and games across a longer period of time; (2) frequent practice of short anchoring techniques, avoiding formal meditations; (3) modelling and scaffolded application of key ideas and practices to real-life classroom challenges by teachers to embed learning and (4) ideally, reinforcement at home in the same way by parents. Identifying the key mindfulness skills to target matched to developmental age remains an important goal for future research.

In sum, despite the limitations of real-world research in schools, this RCT study adds to the small but growing number of higher quality investigations suggesting that mindfulness programmes with early- and mid-adolescents are not yet robust. We offer cautions and suggestions for continuing to refine universal school-based programmes for early–mid-adolescent students.