Attention-deficit hyperactivity disorder (ADHD) is characterized by inattention and/or hyperactivity, as well as impulsivity, which negatively affects executive function and emotion regulation (Beheshti et al. 2020; Sprafkin et al. 2016; Vitola et al. 2017). Inattentive or hyperactive/impulsive symptoms must be present in at least two settings, for example, at work and with relatives, and symptoms should be present before the age of 12 years (American Psychiatric Association 2013).

The prevalence of ADHD in adults has been estimated to be about 4.4% (Almeida Montes et al. 2007), and it is higher in males than in females (Amiri et al. 2014; London and Landes 2019). The most common treatment of ADHD consists of the administration of psychostimulant medications. However, pharmacotherapy has its limitations and is not well tolerated by some individuals due to undesirable side effects. This is why there is a search for new, non-pharmacological therapies for ADHD symptoms, including mindfulness-based interventions (MBIs). MBIs have already proven to be beneficial in reducing symptoms of various other mental health problems including depression/anxiety (Franca and Milbourn 2015; Hofmann and Gomez 2017) and substance use disorders (Bautista et al. 2019; Fahmy et al. 2018). Most MBIs involve somatically focused meditative techniques (e.g. seated and walking meditations, body scans, gentle yoga), and non-judgmental, mindful awareness of the present moment.

ADHD is often comorbid with internalizing and externalizing problems (Jacob et al. 2014). Among internalizing behaviors in adult ADHD, negative affect such as anxiety and depression often occur. Regarding externalizing behavior, adults with ADHD show higher novelty-seeking (exploratory interest in response to new stimuli), impulsive decision-making, loss of temper, and aversion to frustration (Downey et al. 1997; Jacob et al. 2007). Internalizing and externalizing behaviors might reflect an underlying emotional dysregulation disorder (Jacob et al. 2007). Moreover, emotional dysregulation is complicated by cognitive distortions (Barlow et al. 2011). Prevalence rates for emotional dysregulation range between 34 and 70% in adults with ADHD (Beheshti et al. 2020; Shaw et al. 2014), especially concerning anger (Able et al. 2007). Moreover, ADHD with emotional dysregulation has been associated with generally more severe inattention and/or hyperactivity (Corbisiero et al. 2017), and worse social or occupational functioning (Surman et al. 2013). Among problems with negative affect, depression (Strohmeier et al. 2016; Yang et al. 2013) and anxiety symptoms (Biederman et al. 2011; Grogan and Bramham 2016; Kessler et al. 2006; Michielsen et al. 2013; Simon et al. 2013) have been consistently associated with adult ADHD. Indeed, between 65 and 89% of all individuals with ADHD suffer from one or more additional psychiatric disorders in adulthood, above all mood and anxiety disorders. This situation complicates the clinical picture in terms of diagnosis, treatment, and outcome measures (Sobanski 2006).

Relative to the general population and ADHD-remitted individuals, those with persistent ADHD have higher rates of comorbid depression and anxiety disorders (Biederman et al. 2011). These comorbid symptoms increase in severity with age (Bramham et al. 2012). Conversely, ADHD symptom rates are elevated among individuals with a primary diagnosis of depression and anxiety disorders (Bron et al. 2016). Thus, we may say that anxiety and depression can contribute to attentional disorders observed in adult ADHD.

In addition to problems with emotion regulation and the comorbidity of depressive and anxiety symptoms, ADHD in adults is characterized by poor executive functioning (i.e., working memory, inhibitory control, switching) (Boonstra et al. 2005; Thorell et al. 2017). Kamradt et al. (2014) found that executive functions (e.g., time management) accounted for relational, professional, and daily living impairments. Consequently, neurocognitive deficits seem to contribute to impoverished interpersonal functioning in adults with ADHD. Clearly, there is a strong association of ADHD with executive deficits and negative affect. This is why it is important to assess these potential outcomes of MBIs in addition to any inattention and/or hyperactivity improvements.

A recent systematic review assessing cognitive and behavioral effects of MBIs on ADHD symptoms showed that there was an improvement on cognitive task performance and ADHD symptoms following MBIs compared with the pre-intervention levels (Poissant et al. 2019). The authors concluded that MBIs might be a valuable treatment option alongside treatment as usual for adult ADHD. However, the authors noted that the quality of studies was variable and questionable, such that several studies showed a high risk of bias (e.g., blinding of participants, random sequence generation, allocation concealment, and attrition rates). Two recent meta-analyses measured the impact of MBIs on children and adults combined (Cairncross and Miller 2016; Xue et al. 2019) or the effects of MBIs and other non-pharmacological treatments combined (López-Pinar et al. 2019) in adults, leaving space for the examination of MBIs in adult ADHD. Furthermore, the aforementioned meta-analyses focused only on inattention and hyperactivity symptoms, except for López-Pinar et al. (2019) who also considered comorbid-internalizing symptoms. Because of the presence of frequent comorbidity with executive dysfunction and emotion dysregulation in the ADHD population, which appear to be aggravated in adulthood, we considered essential to explore the impact of MBIs on these outcomes. We also examined the impact of the quality of studies (risk of bias) on the effect sizes reported in MBIs for adult ADHD.

Methods

Eligibility Criteria

To conduct the meta-analysis, we used the criteria from the Preferred Reporting Items for Systematic Review and Meta-Analysis Protocols (PRISMA-P) (Shamseer et al. 2015). A datasheet based on the PRISMA-P was completed with information extracted from each selected study based on (1) research design; (2) the characteristics of participants; (3) the type of the intervention; and (4) outcomes (i.e., “ADHD symptoms,” “depression,” “executive function,” and “emotional dysregulation”). A librarian helped the team to refine the search strategy. A reference manager software was used—EndNote X9 (Bramer 2018). We consulted PsycINFO, PubMed, SCOPUS, and ERIC databases from inception up to July 2019 and we included only “peer-reviewed” literature. For search terms, we followed a shortened, more restrictive version of the keywords from the review on meditation therapies for attention-deficit hyperactivity disorder (ADHD) of Krisanaprakornkit et al. (2010). For example, we excluded terms like vipassana, zen, yoga, pranayama, Sudarshan, and qigong present in the previous review. The search included keywords “ADHD AND meditation OR mindfulness,” “impulsivity,” “inattention,” and “hyperactivity,” in different combinations. We included “randomized” (RCT) and “non-randomized control trials” (N-RCT), “single,” and “comparison studies.” Studies were excluded if they (1) did not examine “treatment” effects; (2) did not include a group of “ADHD” or “ADD” or “hyperactivity disorder”; (3) did not include a “mindfulness” or “meditation-based treatment”; and (4) they were conducted with children but we included “adult,” “college student,” and “young adult.” We included different forms of “MBIs” as long as the intervention contained elements of mindfulness (e.g., “dialectical behavior therapy,” “DBT”). As for outcomes, besides the measures of “ADHD symptoms,” “inattention,” and “hyperactivity,” we included measures of “executive” or “cognitive functioning;” “attention;” and “emotional disturbance or dysregulation,” “depression,” “mood disorder,” and “anxiety.” “Self-rating” scales and “objective tasks” were accepted in the inclusion criteria. Measures of outcomes to assess ADHD symptoms included self-report scales, such as the “Conners’ Adult ADHD Rating Scales” (“CAARS-SR” OR CAARS-S OR the “ADHD Self-Report Scale” (“ASR”)). For executive functioning, objective tasks such as “Attention Network Test” (ANT) OR the “Conners’ Continuous Performance Test” (“CPT”) and subjective questionnaires were considered for inclusion (e.g., the “Behavior Rating Inventory of Executive Function,” “BRIEF”). Self-report measures of anxiety, depression, and other emotional disturbances were also included (e.g., “Beck Depression Inventory” and “Beck Anxiety Inventory”). The final literature search resulted in 721 studies. The search was conducted in three consecutive sessions, from October 2016 to January 2017, updated from April 2018 to July 2018 and finally to July 2019 (plus Google Scholar alerts until November 2019). Abstracts of all articles were included in an Excel file for further examination and updates. We first eliminated 178 duplicates and 1 erratum. Eligibility assessment for final selection was based on independent reviews by H.P. and A.M. After reviewing the abstracts, 458 studies were judged as irrelevant. From the remaining studies, 56 were reviews or meta-analyses and eliminated de facto (but we examined the references). Fifteen studies were conducted with children, adolescents, and/or their parents; therefore, they were excluded. A total of 14 studies conducted in “adults,” “young adults,” and “college students” with ADHD were selected according to all our criteria. The initial sample of studies was further trimmed as a result of the availability of the outcome measures: (a) ADHD symptoms (n = 11), (b) depression (n = 8), and (c) executive functioning (n = 9).

We adopted the Cochrane Collaboration (Higgins et al. 2011) guidelines to score potential bias for each study. The listed biases included (1) “sequence generation” (e.g., Is the allocation sequence of participants acceptably generated?); (2) “allocation concealment” (e.g., Is the allocation of participants acceptably concealed?); (3) “blinding of participants, personnel, and outcome assessors” (e.g., Is knowledge of the allocated treatment plenty prevented during the study?); and (4) “selective outcome reporting” (e.g., Are partial outcome data adequately addressed?). A positive answer indicates a low risk of bias whereas; a negative answer indicates high risk. Cochrane guidelines were chosen because they present a formal quality assessment procedure that we found relevant to the psychological nature (rather than strictly medical nature) of the selected studies. Moreover, the consultation of Krisanaprakornkit et al.’s (2010) Cochrane review (with a step by step demonstration of bias assessment) was used as a handy tool to exercise and practice our comprehension of the assessment of bias and thus limit false interpretation. The search strategy, the overview of a PICO description (i.e., Problem/Patient/Population, Intervention/Indicator, Comparison, Outcome), and the analysis of bias using the Cochrane Collaboration guidelines (O’Connor et al. 2008) for each study are fully described in Poissant et al. (2019).

Statistical Analysis

Comprehensive Meta-Analysis version 3 was used to calculate effect size estimates of the differences in performance scores (means and standard deviations) pre- and post-MBIs (Borenstein 2009; Borenstein and Rothstein 1999). The effect size estimates were calculated using Hedges’ unbiased g, which corrects for bias from small sample sizes (Hedges and Olkin 1985). Effect size estimates were separately calculated for ADHD symptoms, depression scores, and executive functioning. The primary analysis consisted of comparison of pre- to post-MBI scores at different time points to explore impact of MBIs on ADHD symptoms, depression levels, and executive functioning. Also, secondary analyses were conducted to estimate separately the effect size of studies with a comparison group (between-subject comparison) and without comparison groups (single-group pre-posttest comparison) to individually explore the effect of MBIs on ADHD symptoms, depression levels, and executive functioning.

For primary and secondary analyses, the direction of the effect was considered negative if ADHD symptoms and depression scores diminished following MBIs, and positive if ADHD symptoms and depression scores increased following MBIs. Inversely, for executive functioning, the direction of the effect size was considered positive when dysexecutive problems diminished following MBIs, and negative when dysexecutive problems increased following MBIs. According to the conventional standard of Cohen (1988), effect size estimates of 0.2, 0.5, and 0.8 were considered small, moderate, and large, respectively.

The usual way of assessing whether a set of studies are homogeneous is using the Q test. However, the Q test only informs about the presence or the absence of heterogeneity (it does not report on the extent of heterogeneity). The I2 index has been proposed to quantify the degree (true extent) of heterogeneity (Higgins and Thompson 2002). For that purpose, heterogeneity among effect size estimates was assessed with both statistics, that is, the Q test (Paulson and Bazemore 2010) and the magnitude of heterogeneity being evaluated with the I2 index (Lipsey and Wilson 2000). The I2 index describes the proportion of heterogeneity across the studies as low (25%), moderate (50%), and high heterogeneity (75%) (Higgins and Thompson 2002; Higgins et al. 2003). Thus, I2 index of 0 essentially means homogeneity of variance indicating that the pooling of studies in a meta-analysis is highly favorable (“apples with apples comparison”).

As the database was characterized by high heterogeneity, we employed random-effects which are more conservative than fixed-effect models, and appear to better address heterogeneity between studies and study populations (Cooper et al. 2009). To reduce heterogeneity, a composite effect size of all studies was calculated and a cutoff of two standard deviations away from this composite effect size was enforced to exclude outlier results (see below).

To estimate the effect of categorical (e.g., self-report versus other) and continuous variables (e.g., bias level: scores 0–14) on the effects of MBIs, we performed sub-analyses and meta-regression analyses, respectively. The influence of the following variables was investigated for the whole sample of selected studies: (a) “self-report” of ADHD symptoms versus “other” (researcher or relative), and (b) risk of bias or quality assessment. Despite their limitations, trimming procedures are recommended by several groups of investigators and developers (CMA, R, and Cochrane), and it is common practice to apply them. In previous meta-analyses from our research team, we have regularly applied such trimming procedures (seePotvin et al. 2014, 2018; Stavro et al. 2013). Limitations in conducting meta-regression and subgroup meta-analysis on a small number of studies are further discussed below.

Finally, the possibility of publication bias was examined with Begg and Mazumdar’s rank correlation test, and visual inspection of the funnel plot for the whole sample of selected studies (Begg and Mazumdar 1994). We chose this alternative over other options (like Duval and Tweedie’s procedure) relying on other assumptions, such as the observed asymmetry is due to publication bias rather than a “small-study effect.”

Results

Of 721 studies, a total of 14 were included in the meta-analysis, for an aggregated sample of 834 adults with ADHD participating in these studies. The effect sizes were calculated for 11 studies per ADHD symptoms, eight studies for depression and the nine studies for executive functioning. Also, the combined effect size was calculated for the three categories. We first look at effect sizes in each given study with the main focus on ADHD symptoms, depression, and executive functioning, as the main outcomes. Effect sizes were calculated with Comprehensive Meta-Analysis, CMA. We report effect sizes for each study by the outcome, as well as the combined effect sizes. Effect sizes were first examined separately for each outcome type: ADHD symptoms, depressive symptoms, and executive functioning. Secondly, we conducted a sub-group analysis to divide between-group studies (studies with a comparison group) from single-group studies (pre-post studies only).

ADHD Symptoms

Three studies were excluded from the analyses. That is, an outlier with an effect size of more than two standard deviations from the average effect sizes of the rest of the studies (Hesslinger et al. 2002) and two studies that did not report the data needed to be included in the meta-analysis (Cole et al. 2016; Fleming et al. 2015). As shown in Fig. 2a and based on eleven studies, the results showed that following MBIs, the combined effect size showed an improvement in ADHD symptoms (Hedge’s g = − 0.591, 95% CI = − 0.858 to − 0.324, p < 0.0001). The assessment of overall heterogeneity across studies indicated a significantly moderate-to-high heterogeneity (Q = 41.15, df = 10, p < 0.0001; I2 = 75.7%). The Begg and Mazumdar test revealed no publication bias (Tau = − 0.29, p = 0.21) (see Fig. 1).

Fig. 1
figure 1

Publication bias in adult ADHD symptoms (all studies)

As shown in Fig. 2b and based on nine studies with comparison groups, the results showed that following MBIs, the combined effect size showed an improvement in ADHD symptoms (Hedge’s g = − 0.610, 95% CI = − 0.988 to − 0.231, p = 0.002). The assessment of overall heterogeneity across studies indicated high heterogeneity (Q = 39.04, df = 8, p < 0.0001; I2 = 79.5%). As shown in Fig. 2c and based on two studies, the results showed that following MBIs, the combined effect size showed an improvement in ADHD symptoms (Hedge’s g = − 0.563, 95% CI = − 0.834 to − 0.292, p < 0.0001). The assessment of overall heterogeneity across studies indicated moderate heterogeneity (Q = 1.986, df = 1, p = 0.159; I2 = 49.6%). The meta-regression results indicated that the overall effect size was not moderated by the estimation based on the risk of bias (β = 0.03, SE = 0.44, p = 0.46). Besides, mixed-effect analyses indicate that the overall effect size was not moderated by the type of informant (i.e., self-report of ADHD symptoms versus other) (Q = 1.38, df = 1, p = 0.24).

Fig. 2
figure 2

Primary analysis of adult ADHD symptoms. a All studies. b Studies with a comparison group. c Pre-post studies

Depression

As shown in Fig. 4a and based on a total sample of eight studies, the results showed that following MBIs, the combined effect size showed a diminution in depressive symptoms (Hedge’s g = − 0.462, 95% CI = − 0.648 to − 0.277, p < 0.0001). The assessment of overall heterogeneity across studies indicated moderate heterogeneity (Q = 12.23, df = 7, p = 0.093; I2 = 42.7%). The Begg and Mazumdar test revealed no publication bias (Tau = − 0.32, p = 0.26) (see Fig. 3).

Fig. 3
figure 3

Publication bias in adult Depression symptoms (all studies)

As shown in Fig. 4b and based on five studies with comparison group, the results showed that following MBIs, the combined effect size showed a diminution in depressive symptoms (Hedge’s g = − 0.355, 95% CI = − 0.559 to − 0.151, p = 0.001). The assessment of overall heterogeneity across studies indicated low heterogeneity (Q = 3.194, df = 4, p = 0.526; I2 = 0%). Based on three single-group pre-post studies, Fig. 4c shows the results that following MBIs, the combined effect size showed a diminution in depressive symptoms (Hedge’s g = − 0.650, 95% CI = − 1.058 to − 0.241, p = 0.002). The assessment of overall heterogeneity across studies indicated high heterogeneity (Q = − 8.292, df = 2, p = 0.016; I2 = 75.8%). The meta-regression results indicated that the overall effect size was not moderated by the estimation based on the risk of bias (β = 0.02, SE = 0.06, p = 0.68). Because all scores were self-reported, we did not conduct additional analysis here.

Fig. 4
figure 4

Primary analysis of depression. a All studies. b Studies with a comparison group. c Pre-post studies

Executive Functioning

Figure 6a is based on a total sample of nine studies indicating an improvement in dysexecutive problems (Hedge’s g = 0.395, 95% CI = 0.136 to 0.653, p < 0.005) following MBIs. The assessment of overall heterogeneity across studies indicated moderate heterogeneity (Q = 19.45, df = 8, p < 0.05; I2 = 58.8%). The Begg and Mazumdar test revealed no publication bias (Tau = 0.000001, p = 1) (see Fig. 5).

Fig. 5
figure 5

Publication bias in adult executive functioning (all studies)

As shown in Fig. 6b and based on seven studies with comparison groups, the results showed that following MBIs, the combined effect size showed an improvement in dysexecutive problems (Hedge’s g = 0.260, 95% CI = 0.027 to 0.492, p < 0.05). The assessment of overall heterogeneity across studies indicated low heterogeneity (Q = 8.57, df = 6, p = 0.19; I2 = 30.02%). As shown in Fig. 6c and based on two single-group pre-post studies, the results indicated that following MBIs, the combined effect size demonstrate an improvement in dysexecutive problems (Hedge’s g = 0.810, 95% CI = 0.493 to 1.128, p < 0.001). The assessment of overall heterogeneity across studies indicated low heterogeneity (Q = 1.10, df = 1, p = 0.29; I2 = 9.53%). The meta-regression results indicated that the overall effect size was not moderated by the estimation based on the risk of bias (β = − 0.04, SE = 0.04, p = 0.40).

Fig. 6
figure 6

Primary analysis of executive functioning. a All studies. b Studies with a comparison group. c Pre-post studies

Discussion

This meta-analysis addresses both negative affect and cognitive functioning of adults with ADHD, in addition to inattention/hyperactivity, following MBIs. Moreover, we controlled our analyses by reporting bias using the Cochrane guidelines (O’Connor et al. 2008). Overall, the results revealed that MBIs are effective to treat global ADHD symptoms in adults supporting previous meta-analyses on the topic. Specifically, following MBIs, inattention and hyperactivity show a moderate reduction in adults with ADHD. Importantly, the risk of bias (quality assessment of studies) did not have a significant impact on the positive effect documented in this meta-analysis. However, a moderate-to-high heterogeneity between studies remains a limitation indicating that this result still needs further confirmation. This meta-analysis also indicates a moderate improvement of depressive symptoms following MBIs. Again, the overall effect size was not affected by the estimation based on the risk of bias. Finally, the results revealed the improvement of the executive functioning in adults with ADHD following MBIs with a low-to-moderate heterogeneity across studies. As with ADHD symptoms and depression, effect size does not appear to be affected by the quality assessment of studies. Finally, publication bias (as measured with the Begg and Mazumdar test) was not found to be significant for any of the three outcomes documented in the current meta-analytic study. However, the small number of considered studies demands caution.

To our knowledge, only Cairncross and Miller (2016) and Xue et al. (2019) have conducted similar meta-analyses with mixed samples of children and adults to measure the impact of MBIs on ADHD symptoms. Cairncross and Miller (2016) found evidence that MBIs were more effective in reducing inattention symptoms. Knowing that adults are more often characterized as inattentive instead of hyperactive, they may show a better response to treatment using MBIs as compared with children. However, the number of studies with adults in their study was very limited (n = 4). Of the eleven eligible studies, Xue et al. (2019) concluded that MBIs had large effects on inattention and hyperactivity/impulsivity. Subgroup analyses between self and other ratings on ADHD symptoms revealed that the effect of MBIs was significant regardless of the rater. Most importantly, Xue et al. (2019) found that the age of the participants significantly affected heterogeneity (adults versus children). The authors suggested that adults may have a better understanding of their condition and thus be more involved in their treatment than children. This was an additional justification for conducting the present meta-analysis with adults only.

Another limitation of the previous studies is the absence of a comprehensive evaluation of the risk of bias other than publication bias. This is an important aspect because it allows identifying studies with poor quality. The current study followed the example of the previous systematic review of Krisanaprakornkit et al. (2010) on meditation therapies for ADHD. By the end of their extensive Cochrane review, they retained only four studies (years 1983, 1987, 2004, and 2006) and could not conclude on the efficacy of any therapies due to insufficient quality of the selected studies. López-Pinar et al. (2019) followed a similar approach to determine the efficacy of different psychotherapies in improving internalizing symptoms such as depression in adults with ADHD. The authors found a significant effect of MBIs (but not dialectical behavior therapy or neurofeedback) to ameliorate emotional dysregulation. It should be mentioned that the authors observed a high risk of bias in the selected studies, which might limit their results.

Overall, the results indicate moderate-to-high heterogeneity among studies, although with a lower heterogeneity for studies addressing executive functions. Heterogeneity is a common problem in previous meta-analyses and can be accounted for by several factors. We included randomized (RCT) and non-randomized control trials (N-RCT). Removing N-RCTs from our analyses could have resulted in more homogeneity, but it would also decrease the statistical power. Another problem may reside in the fact that most participants were receiving medication before and during MBIs, psychostimulants like methylphenidate (MPH) being the most frequently reported. Psychostimulants and other dopamine agonists are commonly used to treat ADHD symptoms; their efficacy is attributed to the inhibition of dopamine and norepinephrine reuptake in the prefrontal cortex (Hosenbocus and Chahal 2012). Comorbidity was present in all examined studies, with major depressive disorders and anxiety disorders being the most frequently reported. Thus, future meta-analyses with more studies should be able to include depression and anxiety as moderators. Other comorbidities, including oppositional defiant disorder, bipolar disorder, borderline personality disorder, and substance abuse, were not frequently reported across the present studies. MBIs were also diverse across studies. They included mindfulness and/or meditation as the main or partial component of the intervention (e.g., that included various adaptations of dialectical behavior therapy, mindful awareness program, and mindfulness-based/cognitive training). The duration of treatment (from 6 to 96 hours) and the degree of experience of therapist (e.g., clinical psychologists, mindfulness instructors, graduate students) also varied considerably across studies. These factors may have acted as confounding variables so it is premature to conclude on the efficacy of one intervention over the other. Another source of heterogeneity arises from the use of self-rating scales versus objective measures of outcomes. Whereas ADHD and depression symptoms were, for the most part, assessed with self-reported scales (e.g., Conners’ Adult ADHD Rating Scale, CAARS-SR, Beck Depression Inventory (BDI)), executive functions were most frequently measured with objective tasks (e.g., Attention Network Test). The existence of a gold standard to assess depression probably accounts for the high homogeneity (I2 = 0%) found in studies with a comparison group for this outcome.

Limitations

One of the main limitations of this meta-analysis is the small number of studies evaluating different outcomes (symptoms, depression, and executive functioning). A small number of studies could limit the statistical power of the statistical analyses (e.g., meta-regressions) and thus the generalizability of the findings. This is meaningful considering that publication bias statistics are generally less appropriate for use in small meta-analyses. Estimates of “self-report” versus “other” and of bias level might as well be tempered by the small number of studies. Also, differences in study design might count for the differences in efficacy. In a previous systematic review, Poissant et al. (2019) found that the majority of studies included in the present meta-analysis were considered at high risk of “selection bias.” Selection bias relates to “random sequence generation” and “allocation concealment.” According to Cochrane Collaboration’s criteria (Higgins et al. 2011), a study is rated as a low-risk study if the way to allocate sequence produces equivalent groups and when the method to conceal the allocation sequence cannot be predicted in advance. Although we did not find a significant impact of the risk of bias on the outcomes, the existence of elevated risk related to random sequence generation, allocation concealment, and blinding of participants and staff requires more consideration. As mentioned by López-Pinar et al. (2019), knowledge of the allocated treatment by participants and/or staff by itself might affect the validity of results. This is a crucial matter since it is not usual or advantageous to blind trainers and participants in this type of interventions. Thus, the risk of bias may be unpreventable. The assessment of the attrition is also of concern since it was found high (or unclear) risk in more than half of the studies in this meta-analysis. The reason for dropout is not specified so it is difficult to decide if it might be related to some incidental reasons or discomfort with treatment.