Introduction

Stress, a ubiquitous component of modern life, has a multifaceted impact on health. It can exert direct effects through autonomic and neuroendocrine pathways or indirectly influence health behaviors [1]. While acute stress can enhance cognitive and physical performance, chronic and intense stress contributes to a variety of health issues, ranging from obesity and hypertension to migraine, asthma, and depression [2].

In light of the pervasive nature of stress in contemporary societies, stress reduction techniques have become quite popular. One influential mind–body technique rooted in Eastern traditions is taiji (also known as tai chi). Originally developed as a martial art, taiji is a form of mindful moving technique, combining slow and flowing movements with mindful breathing [3,4,5]. Over recent years, scientific interest in the health-enhancing potential of taiji has increased, and beneficial effects on physiological and psychological well-being have been confirmed [3, 6].

Several systematic reviews and meta-analyses on taiji included outcomes related to stress and psychological well-being. For example, in adolescents and college students, there is increasing evidence that taiji can improve depression, anxiety, stress, and cortisol levels [1, 2]. Beneficial effects of taiji in mitigating work-related stress among health professionals have also been reported [3]. A meta-analysis in patients with cardiovascular disease demonstrated favorable effects of taiji for anxiety, depression, and quality of life (QoL) [4, 5]. In cancer survivors, taiji positively impacted fatigue and sleep quality, while there were no significant improvements in anxiety, stress, depressive symptoms, or overall QoL [6].

Aside from partly inconsistent results, these findings are limited by small numbers of participants [7] and the partly low quality of included studies [1, 3, 4]. For example, many of these studies do not adhere to current methodological standards, including the incorporation of a control group or randomized allocation to treatment and control groups [8]. Hence, more comprehensive reviews and meta-analyses are needed to conclusively determine the stress-relieving potential of taiji.

This review aimed to systematically evaluate the available evidence from randomized controlled trials that taiji reduces perceived stress. Secondary outcomes encompassed physiological stress indicators, depression, anxiety, and health-related QoL. Our evaluation considered both patients and healthy populations.

Methods

This systematic review was conducted and reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) standard [9].

Literature search

The literature search was performed by two reviewers (KM, JK) as part of a larger systematic review evaluating the stress-reducing potential of various yangsheng techniques, namely taiji, qigong, and acupressure. The following databases were searched: MEDLINE, EMBASE, the Cochrane Controlled Trials Register (CENTRAL), PsychINFO, and Web of Science (from inception to April 2023) using a combination of keywords and text words related to stress, relaxation, and taiji, qigong, and acupressure, combined with validated filters for controlled clinical trials. The search strategy for MEDLINE is shown in Table 1. Further potentially relevant publications were retrieved directly from the references cited in RCTs and systematic reviews.

Table 1 Search strategy for MEDLINE

Eligibility criteria

Only randomized controlled trials were considered. Study protocols were excluded. Participants could come from any demographic, including patients, healthy volunteers, or specific occupational groups. The primary intervention must have been taiji. Studies combining taiji with other interventions, such as relaxation methods or psychotherapy, were excluded. Interventions needed to last at least four weeks. Control groups were required to receive either a placebo treatment, or treatment as usual, or no intervention, such as being placed on a waiting list (WL) or receiving no treatment (NT). Eligible studies needed to provide a measure of perceived stress.

The primary outcome was the level of perceived stress at the conclusion of the intervention in the taiji group compared to the control group (placebo, WL, or NT). Stress had to be measured using a validated questionnaire or, alternatively, through visual analogue scales or numeric analogue scales. Secondary outcomes encompassed perceived stress at the latest available follow-up, stress compared to additional active control groups (typically those engaging in physical activity), as well as biological stress markers such as systolic blood pressure (BP), diastolic BP, heart rate, and cortisol levels. Additional secondary outcomes included psychological distress indicators like depression and anxiety, as well as both mental and physical QoL at the intervention's end.

Study selection

All abstracts identified through the literature search were screened, and irrelevant hits, such as nonrandomized studies and study protocols, were excluded. All remaining articles were obtained in full text and checked for eligibility based on the predefined selection criteria.

Data collection

Data was extracted using a standardized Excel form that captured the following details: bibliographic study information; study population demographics, including the age and gender of participants; numbers of participants involved at each stage (randomization, intervention, and analysis) as well as dropouts; details of the taiji intervention (type, duration, frequency) and, where relevant, sham interventions; nature of the control group; duration of both the intervention and follow-up periods; study design specifics (e.g., parallel-group or crossover); predefinition of primary outcomes; and type of analysis (intent-to-treat or per protocol). The data extraction for each study was independently performed by two of the three reviewers (KM, JK, PW), and any disagreements were resolved through discussion.

Risk of bias assessment

The risk of bias was assessed using the Cochrane Collaboration's risk of bias tool [10]. For each of the following domains – random sequence generation, allocation concealment, blinding of participants and personnel, blinding of outcome assessors, incomplete outcome data, and selective reporting – the risk of bias was categorized as high, low, or unclear. The risk of bias assessment was performed for each study by two of the three reviewers (KM, JK, PW), and any disagreements were clarified through discussion. Summary graphs depicting the risk of bias were generated using Review Manager (RevMan) version 5.3..

Statistical analyses

Meta-analyses were conducted using RevMan. For the meta-analyses of the primary outcomes, standardized mean differences (SMD) between the intervention group and the control group and their 95% confidence intervals (CI) were calculated for each study. Results were then pooled using random-effects models. To check the robustness of results, sensitivity analyses were performed for (1) studies with low risk of bias related to allocation concealment, blinding of outcome assessors, and incomplete outcome data; (2) studies with total sample sizes of ≥ 100; and (3) studies with clearly predefined primary outcome measures (≤ 2 pre-specified primary outcomes, including how and when they were assessed). The effects of study characteristics on primary outcomes were investigated using multivariable meta-regression analyses by in IBM SPSS Statistics (version 25). For the meta-analyses of the secondary outcomes, standardized mean difference (SMD) between the intervention group and the respective control group and their 95% confidence intervals (CI) were calculated for each study.

Statistical heterogeneity was examined using Cochrane’s Q test and the I2 statistics. I2 values of 25%, 50%, and 75% were considered as low, moderate, and high heterogeneity, respectively. Funnel plot asymmetry was assessed using Egger’s test as a measure of publication bias [11]. The Egger’s test was performed using SPSS. For all statistical tests, a p-value < 0.05 was considered significant.

Results

Literature search

The study selection process is shown in Fig. 1 [12]. The literature search yielded 933 hits, of which 108 were duplicates. Upon screening the abstracts, 667 clearly irrelevant records were excluded. The full texts of 158 potentially relevant hits were retrieved, from which 133 full text articles were excluded because they did not meet the inclusion criteria or examined qigong or acupressure interventions. Eleven RCTs on taiji were included in this review.

Fig. 1
figure 1

Flow diagram

Characteristics of included studies

Table 2 provides a summary of the characteristics of the eleven included studies [13,14,15,16,17,18,19,20,21,22,23]. Most of the studies were published in the last ten years. In total, 1323 participants were randomly assigned to either an experimental taiji intervention, or NT (7 trials WL, 3 trials NT, 1 trial TAU), or an additional active control group (5 trials, with different types of exercise). Five trials investigated patients and six trials healthy/stressed populations. The median age of participants was 53 years (IQR, 34–64), with men comprising 29% (median; IQR, 16–36) of the participants. Four of the trials had a clearly predefined primary outcome measure, four trials a partly predefined primary outcome measure (like predefinition of the outcome but not the time point, or more than two outcomes predefined as primary), and three trials had no predefined primary outcome measure. The duration of taiji practice ranged from 45 to 120 min, one to five times weekly, spanning 8 to 24 weeks (not counting self-practice at home). Table 3 gives an overview of the applied taiji interventions.

Table 2 Characteristics of included studies
Table 3 Main characteristics of taiji treatments

Risk of bias assessment

About half of the studies were judged to have significant weaknesses with regard to sequence generation, allocation concealment, and/or blinding of outcome assessment. In none of the trials were the participants and personnel blinded. In four studies, the dropout rates at the post-intervention measurement exceeded 15%, potentially leading to distortions. All trials were considered to have a low risk of selective reporting. An overall Risk of Bias Graph is displayed in Fig. 2, while Fig. 3 provides a detailed summary of the Risk of Bias for each individual study.

Fig. 2
figure 2

Risk of bias graph

Fig. 3
figure 3

Risk of bias summary

Meta-analyses of included studies

Perceived stress after intervention

Figure 4 displays the forest plot for the primary outcome of perceived stress at the end of the intervention period. The pooled effect of taiji on perceived stress was significant (SMD -0.41, 95% CI -0.63 to -0.19; p < 0.001). There was moderate-to-large heterogeneity (I2 = 63%, p = 0.003). The Egger’s test for funnel plot asymmetry was not significant (asymmetry coefficient -2.05, p = 0.123), while the funnel plot suggested the presence of small study bias (Fig. 5).

Fig. 4
figure 4

Forest plot for the primary outcome perceived stress (post-intervention)

Fig. 5
figure 5

Funnel plot for perceived stress (post-intervention)

We performed sensitivity analyses to assess potential influences of study quality characteristics on the SMDs for perceived stress. Specifically, we examined the impact of risk of bias concerning allocation concealment, blinding of study personnel, and incomplete outcome data. Given the possible presence of small study bias, we added a sensitivity analysis including only studies with sample sizes larger than 100 participants. Furthermore, we added a sensitivity analysis that included only studies with clearly predefined outcome measures (≤ 2 pre-specified primary outcomes, including how and when they were assessed). As indicated in Table 4, pooled effect sizes remained significant in all sensitivity analyses.

Table 4 Sensitivity analyses for the primary outcome perceived stress (post-intervention)

Explorative regression analyses were performed to evaluate possible influences of sample characteristics on perceived stress including type of population, gender, age, intervention duration, and sample size (Table 5). Simple linear regressions revealed smaller treatment effects (p < 0.1) in studies with lower proportion of females (p = 0.024), in studies in patients (p = 0.055) and in larger studies (p < 0.001). In a next step, these variables were included in a multiple linear regression analysis to identify possible independent predictors for perceived stress. The overall model was statistically significant (R2 = 0.59, Q = 15.22, p = 0.002), with sample size (p = 0.013) turning out as the only independent predictor (Table 5).

Table 5 Explorative regression analyses for the primary outcome perceived stress (post-intervention)

Perceived stress at follow-up

Five studies provided a continuous outcome measure for perceived stress at follow-up (in median, 24 weeks after randomization) (Fig. 6). The SMD was smaller but still significant (-0.25, 95% CI -0.46 to -0.05; p = 0.02), and heterogeneity was low (I2 = 34%, p = 0.02).

Fig. 6
figure 6

Forest plot for perceived stress at follow-up

Comparison with physical activity

Additional analyses were performed to assess the effects of the interventions on perceived stress in comparison to active control groups (various forms of physical exercise). In the five studies that included such an additional group, the pooled SMD for perceived stress was not significant (SMD 0.10, 95% CI -0.08 to 0.27, p = 0.29; I2 = 0%) (Fig. 7).

Fig. 7
figure 7

Forest plot for perceived stress in comparison to active control groups (post-intervention)

Objective stress measures

Five RCTs provided at least one objective stress measure (5 trials systolic and diastolic BP, 2 trials heart rate). Neither systolic BP (SMD -0.0, 95% CI -0.37 to 0.38; p = 0.98; I2 = 78%, p < 0.001), nor diastolic BP (SMD -0.21, 95% CI -0.56 to 0.14; p = 0.24; I2 = 75%, p = 0.003), nor heart rate (SMD -0.45, 95% CI -1.42 to 0.51, p = 0.36; I2 = 82%, p < 0.02) showed a significant intervention effect in comparison to the NT control groups (Fig. 8).

Fig. 8
figure 8

Forest plot for physiological stress measures (post-intervention)

Depression and anxiety

Six RCTs provided a measure for depression and five trials for anxiety. The pooled SMD for depression did not reach the level of significance (-0.29, 95% CI -0.66 to 0.07, p = 0.12), while heterogeneity was high (I2 = 77%, p < 0.001; Fig. 9). The pooled SMD for anxiety was significant (-0.41, 95% CI -0.71 to -0.1, p < 0.001), with moderate heterogeneity (I2 = 52%, p = 0.08) (Fig. 9).

Fig. 9
figure 9

Forest plot for depression and anxiety

Quality of life

Five studies reported a composite score for mental QoL and 3 studies for physical QoL. A significant intervention effect was revealed for physical QoL (SMD -0.45, 95% CI -0.67 to -0.24, p < 0.001; I2 = 0%), while the SMD for mental QoL was not significant (SMD -0.25, 95% CI -0.61 to 0.12, p = 0.18; I2 = 59%). (Fig. 10).

Fig. 10
figure 10

Forest plot for the secondary outcome quality of life (post-intervention)

Discussion

This systematic review examined and synthesized the evidence regarding taiji’s stress-relieving effects. Based on evidence from eleven RCTs included in the primary meta-analysis, taiji appears effective in reducing stress. The pooled SMD comparing taiji to no intervention showed a moderate effect size, with moderate-to-high between-study heterogeneity. The Funnel plot and the sensitivity analysis that excluded small studies both suggested the presence of small-study bias, with the latter yielding a reduced yet still significant SMD. Approximately half of the studies had notable weaknesses related to the definition of primary outcomes, allocation concealment, blinding of outcome assessment, and/or incomplete outcome data. Sensitivity analyses excluding these studies revealed reduced effect sizes in those with low risk regarding allocation concealment. The pooled SMD for perceived stress remained significant at 24-week follow-up. The stress-reducing effect of taiji in comparison to NT thus appears to be robust, enduring, and small-to-moderate in size.

Our exploratory simple regression analyses suggested smaller effects in patients and females; however, these influences were not confirmed by the multivariate regression analysis, in which only sample size emerged as an independent predictor. This confirms and extends prior research, which has demonstrated that taiji practice benefits mental health in diverse populations [24], comprising young and healthy subjects [1], adults [3], as well as patients [4, 5] and the elderly [25].

Five RCTs in this review included a third treatment group involving physical activity to control for body movement during taiji exercise. There was no evidence suggesting taiji had effects beyond those of regular physical activity. Interestingly, the health benefits of physical activity have been linked to its potential to dampen stress [26], which may explain the comparable effects on perceived stress.

Six studies reported results for one or more objective stress markers: five studies for systolic/diastolic BP and two for heart rate. However, the pooled SMD for neither measure indicated a significant effect of the intervention. This suggests that while taiji may influence perceived stress, it doesn't seem to impact the physiological stress response. In contrast, several reviews have highlighted reductions in BP resulting from taiji practice [27,28,29]. For instance, a recent meta-analysis found that regular taiji exercise could lead to clinically meaningful reductions in both SBP and DBP. This analysis also identified baseline BP as a major effect moderator, revealing larger BP reductions in hypertensive individuals and smaller reductions in those with normal BP [28]. Similarly, our findings showed the most pronounced positive effects of taiji on SBP and DBP in a study with hypertensive adults [23]. The lack of significant effects of taiji on BP in our review might therefore stem from the fact that most participants in the studies had normal BP. However, a preventive cardiovascular benefit of taiji practice might still be inferred, given taiji's regulatory effect on endothelial function and its potential role in maintaining healthy BP [27].

Stress is often accompanied by symptoms of depression and anxiety [30]. Our analyses indicate that practicing taiji had beneficial effects on anxiety. The SMD for depressive symptoms did not reach the level of significance. However, when we excluded one study that focused on patients with schizophrenia [21, 31] – a condition where depressive symptoms can overlap with negative symptoms [31], the SMD became significant (-0.47, 95% CI, -0.67 to -0.27, p < 0.001), and heterogeneity dropped from 77 to 13% (p = 0.33). More RCTs are needed to conclusively determine the impact of taiji on depressive symptoms.

Additionally, five studies presented a composite measure for physical and/or mental QoL. The results suggest small-to-moderate sized pooled effects for physical QoL, while the SMD for mental QoL was not significant. These findings align with a recent meta-analysis that evaluated taiji's effectiveness among older adults, both with and without chronic conditions. When QoL was analyzed only the physical but not the mental component showed significant improvement [32]. Other systematic reviews have highlighted taiji's beneficial impact on health-related QoL across various chronic conditions compared to NT, encompassing psychological factors [33,34,35,36]. However, the majority of these studies emphasized physical functioning and overall well-being. To draw definitive conclusions regarding taiji's specific impact on mental QoL, more research is needed that assesses both sub-dimensions.

Even though the present review clearly indicates that taiji is more effective than NT in reducing stress, it cannot answer the question of how much of the interventional effects of taiji are due to placebo effects, given the lack of placebo control groups. The effects of taiji could only be differentiated from other influences, such as natural history, regression to the mean, and unidentified co-interventions. For physicians, it is a professional ideal to use treatments only if their specific actions are proven [37]. However, one could argue that a self-help strategy like taiji is acceptable as long as it works better than NT. According to surveys in Switzerland and the US, the majority of patients (62%) would even accept medical interventions prescribed by a physician that are effective, even if a specific action has not been proven [38, 39]. Yet, the prerequisite for such a pragmatic approach is that the intervention is not harmful [40, 41], and certainly that necessary medical treatments are not overlooked. Since taiji is a non-invasive treatment and is typically used as an add-on treatment in patients with medical disorders, these prerequisites seem to be met. Nonetheless, taiji styles and forms should be adapted to a patient’s physical and mental state [42].

Limitations

The present results are limited by their unblinded nature and the absence of placebo control groups. This could have led to positive expectations only in the treatment groups, thereby triggering placebo effects as well as a possible response bias. Other nonspecific influences, such as natural history and regression to the mean, could largely be ruled out by comparison to NT control groups. The methodological and reporting quality of the included RCTs varied widely across trials. However, sensitivity analyses revealed significant, albeit sometimes smaller, effect sizes for studies with higher methodological quality. One limitation of the results is the small number of trials, and especially the results for the secondary outcomes should be interpreted with caution. Another potential limitation arises from the inclusion of trials with perceived stress as an outcome, regardless of whether stress was the primary or secondary outcome in the respective trial, and irrespective of whether the study targeted patients or stressed yet healthy populations. Consequently, the primary stressors likely varied among studies, ranging from psychosocial stressors related to a high workload to those typically associated with chronic diseases (e.g., loss of work, financial problems, loneliness). Lastly, the typical duration of the interventions in this review comprised 24 weeks. The potential effects of taiji's long-term use on stress remain to be investigated.

Conclusions

The scientific interest in the health-promoting effects of taiji has increased considerably during the past decades, and several new RCTs have been published in the past five years. Therefore, this systematic review aimed to evaluate the current evidence for the potential of this meditative movement intervention to reduce stress. Results indicate that, in comparison to untreated controls, taiji reduced perceived stress. The effect size was comparable in size to those associated with regular physical activity. Remarkably, taiji not only improved perceived stress but also anxiety and physical QoL. This broad effect on health and well-being is in accordance with the claim of integrative medicine to provide holistic care for the whole person rather than for single symptoms. However, more rigorously performed RCTs with sufficiently large sample sizes are needed to corroborate these findings.