FormalPara Key Points

Aerobic interval training is more beneficial compared with aerobic continuous training in increasing peak oxygen uptake in patients with coronary artery disease and chronic heart failure.

A larger increase of the first ventilatory threshold and peak heart rate was observed after aerobic interval training compared with aerobic continuous training in all cardiac patients, while other cardiorespiratory parameters, cardiovascular risk factors, and quality of life were equally influenced.

1 Background

Cardiovascular diseases are the main cause of death worldwide [1] and in Europe [2], accounting for 31.5% [1] and 45% [2] of all deaths, respectively. Exercise-based cardiac rehabilitation is a cornerstone in the secondary prevention of cardiovascular diseases, reducing cardiovascular mortality by 26% in patients with coronary artery disease (CAD) [3]. These improved survival rates are mediated amongst others through training-induced increases in peak oxygen uptake (peak VO2), which is an important prognostic parameter for all-cause and cardiovascular morbidity and mortality [4]. That is, an increase of 1 mL O2/kg/min has been shown to result in an almost 15% increase in survival [4].

Traditional training methods include continuous training (30–60 min) at moderate intensity (40–80% of peak VO2) [aerobic continuous training; ACT], leading to gains in peak VO2 of approximately 20% after 12 weeks of three-times-weekly exercise sessions [5]. Training at a higher intensity leads to higher increases in peak VO2 [6]. However, it is impossible to sustain higher intensities for longer periods. Therefore, to prolong the time that training could be sustained at higher intensities, interval training was suggested [7]. About 15 years ago, aerobic interval training (AIT) was introduced in cardiac rehabilitation in Norway. These first small studies reported that the increments in peak VO2 were significantly higher after AIT in both patients with CAD [8] and chronic heart failure (CHF) [9]. Since then, research investigating the potential superiority of AIT for improving peak VO2 in different cardiac patient groups has grown rapidly. Results of the individual studies were contradictory and inconclusive, while meta-analyses collating the results showed significantly higher increases after AIT ranging from 1.04 [10] to 2.14 mL/kg/min [11] in both patients with CAD [12,13,14,15] and CHF [10,11,12, 16].

Since the publication of our first meta-analysis in 2014 [15] involving only nine study groups and 206 patients, an increasing number of larger randomized clinical trials have been published comparing the efficacy of AIT and ACT in cardiac patients. This larger number of trials allows now for a more precise estimate of the effect on peak VO2 but also on other relevant secondary outcomes.

Therefore, our aim was to update the original meta-analysis [15] comparing the efficacy of AIT and ACT on peak VO2 in patients with CAD and CHF. Moreover, we focused on secondary outcomes including (1) cardiorespiratory parameters [peak heart rate (HR), oxygen pulse (O2 pulse), first ventilatory threshold (VT1), oxygen uptake efficiency slope (OUES), ventilatory efficiency slope (VE/VCO2 slope), heart rate recovery after 1 min of exercise (HRR 1 min)], (2) cardiovascular risk factors [body weight, systolic blood pressure and diastolic blood pressure (SBP and DBP), resting HR, blood lipids, and blood glucose], (3) cardiac function [left ventricular ejection fraction (LVEF)], (4) vascular function [flow-mediated dilation (FMD)] and (5) quality of life (QoL).

2 Methods

The systematic review and meta-analysis were conducted in accordance with the guidelines from Preferred Reporting for Systematic Review and Meta-Analysis [17].

2.1 Literature Search

We conducted a literature search in the electronic PubMed and SPORTDiscus databases from the earliest available date up to March 2017 with the following search terms (‘All field’ terms): [(Aerobic interval training OR high intensity interval training OR interval training OR intermittent training OR high intensity exercise) AND (coronary artery disease OR coronary heart disease OR heart failure OR myocardial infarction OR coronary artery bypass surgery OR ischemic heart disease OR angina pectoris OR percutaneous coronary intervention)], without any further limitations. Screening of all titles and abstracts was performed by two independent investigators (NP and RB). Results from both investigators were compared and papers were selected after consensus. The reference lists of retrieved papers were manually searched to identify other appropriate studies.

2.2 Eligibility Criteria

We included (1) randomized clinical trials comparing the effects of supervised AIT with ACT in patients with CAD and/or CHF, (2) with a duration of at least 4 weeks, (3) reporting on peak VO2 (mean changes or pre- and post-intervention means and variability measures), and (4) published in a peer-reviewed journal up to March 2017. Exclusion criteria included any study not meeting any of the criteria listed above.

2.3 Measured Outcomes

The primary outcome for this meta-analysis was the change in peak VO2 in mL/min/kg. Secondary outcomes included (1) cardiorespiratory parameters (peak HR, O2 pulse, VT1, OUES, VE/VCO2 slope, HRR 1 min), (2) cardiovascular risk factors (body weight, SBP and DBP, resting HR, blood lipids, and blood glucose), (3) cardiac function (LVEF), (4) vascular function (FMD), and (5) QoL.

2.4 Data Extraction

The two main authors (NP and RB) independently extracted characteristics of patients and intervention, and point and variability data on primary and secondary outcomes, in a standardized form. Results were compared and discrepancies were resolved after mutual agreement or by consulting the senior author (VC). Cohen’s kappa was 0.83, showing a very good interrater agreement.

2.5 Study Quality

Papers were assessed for quality using the TESTEX-scale (Tool for the assEssment of Study qualiTy and reporting in EXercise) [18], which is a 14-point scale recently designed specifically for assessing study quality in exercise training studies. It includes data on eligibility criteria, random allocation, similarity of baseline values, blinding of investigators, key outcome obtained in at least 85% of subjects, reporting adverse events and adherence rates, intention-to-treat analysis, between-group differences, point and variability measures, activity monitoring in control groups, adaptation of relative intensity, and data on exercise characteristics and energy expenditure. A higher score reflects a better quality. No trials were excluded based on quality.

2.6 Statistical Analysis

Statistical analyses were performed using Review Manager Software (RevMan 5.3; Cochrane Collaboration, Oxford, UK). Descriptive data are reported as mean ± standard deviation (SD) or mean and 95% confidence interval (CI). Prior to the statistical analysis, outcome data of the included papers were converted to mean ± SD if necessary. The formula used to calculate SD from standard error (SE) was: SD = SE × √(n; the formula to calculate SD from 95% CI was: SD = [√(n) × (upper limit − lower limit)]/3.92. The mean baseline values were calculated by combining mean values from the intervention groups, weighted by the number of participants included in the final analysis in each study group.

Effect sizes were calculated by subtracting the pre-intervention value from the post-intervention value of each trial. The net treatment effect was then calculated by subtracting the change in the ACT group from the change in the AIT group. Review Manager Software calculated the variances from the inserted pooled SDs of the change in the intervention groups. However, some studies reported only the SDs or SEs at baseline and post-intervention. Therefore, missing change score SDs were calculated from pre- and post-SD values, using the following formula: SDchange = √(SDpre2 + SDpost2 − 2 × corr(pre, post) × SDpre × SDpost) [19], for which we used a correlation coefficient (corr) of 0.5 for each outcome. We used random-effects models to pool the data, given the small sample sizes of study groups and the differences in study populations. Each effect size was weighted by the inverse of its variance. The results are reported as weighted means and 95% CI. Two-sided tests for overall effects were considered significant at p ≤ 0.05.

Statistical heterogeneity among the trials was assessed using Cochrane’s Q statistic and an alpha value for statistical significance of 0.10 indicated significant heterogeneity. In addition, the I2 parameter was used to quantify inconsistency of treatment effects across trials (I2 = [Q − df] × 100%, where Q is the χ2 statistic and df are the degrees of freedom). A value for I2 > 50% has been considered to be substantial heterogeneity.

To examine the influence of each trial on the overall results of our primary outcome peak VO2, sensitivity analyses were performed with each study deleted from the model once. For the primary outcome peak VO2, different a priori-defined subgroup analyses were performed. Subgroups were compared according to pathology [patients with CAD vs. patients with CHF with reduced ejection fraction (HFrEF) vs. patients with CHF with preserved ejection fraction (HFpEF)], duration of the high-intensity interval (short ≤ 1 min, medium 1–3 min, long ≥ 4 min), intensity of the interval [vigorous effort (70–89% of peak HR; 60–84% of HRR; 60–79% of peak VO2) or very hard effort (≥ 90% of peak HR; ≥ 85% of HRR; ≥ 80% of peak VO2) (in case of training HR zones, the mean of the HR zone was used for classification) [20], training mode (cycle ergometer, treadmill) and total duration of the intervention (< 12 weeks vs. ≥ 12 weeks), energy expenditure (isocaloric vs. not iscaloric), and sample size (< 20 patients in at least one group, 20–50 patients in both groups, > 50 patients in both groups). Fixed-effects models were used to compare the subgroups. If p ≤ 0.05, we checked for a non-overlapping CI to identify which groups differed significantly. Finally, funnel plots of the effect size vs. the SE of the effect size were visually inspected to assess publication bias.

3 Results

3.1 Study Selection

A Preferred Reporting for Systematic Review and Meta-Analysis flow diagram of our literature search and selection is presented in Fig. 1. The initial search identified 2592 potentially relevant studies of which 37 studies were retrieved for full-text review. From these, we excluded six studies that did not fulfill the inclusion criteria (training intervention < 4 weeks, [21] no randomization [22, 23], deviant exercise protocols, [24, 25], or no peak VO2 data) [26]. Seven studies produced more than one publication [27,28,29,30,31,32,33]. If these duplicate papers, however, contained data that were not reported in the main paper, they were included for quantitative analyses of the secondary outcomes [27, 28, 31, 32]. In total, 28 (14 in CAD; 12 in HFrEF; two in HFpEF) publications were included for quantitative analysis, of which 24 were used for the primary outcome peak VO2 [8, 9, 34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55].

Fig. 1
figure 1

Preferred reporting for systematic review and meta-analysis flow diagram of the included studies. VO 2 oxygen uptake

3.2 Characteristics of the Participants and Study Design

A summary of the 24 studies including the primary outcome peak VO2 is presented in Table 1. A total of 1080 patients (mean age 60.7 ± 10.7 years) were randomized to either AIT (n = 540) or ACT (n = 540). The drop-out rate ranged from 0 to 48%, with a mean of 12.4%. Drop-out rates after AIT and ACT were 13.0% (range 0–50% [45]) and 11.9% (range 0–47% [45]), respectively. A total of 470 patients completed the AIT intervention (mean age 60.4 ± 11.3 years) and 476 completed the ACT intervention (mean age 61.1 ± 10.4 years). Main reasons for drop-out were low compliance with the study protocol or training sessions, patients’ withdrawal of consent, intolerance of imposed training protocol, or lack of data.

Table 1 Study and intervention characteristics

Six studies were randomized controlled trials [9, 41, 47, 49, 50, 55], 18 were randomized clinical trials. Two large multicenter studies were included: one in patients with CAD [51] and one in patients with HFrEF [55]. Three studies included only men [34, 42, 54]. and overall 83.9% of the analyzed participants were male. The New York Heart Association class ranged from I to III, with most patients being in class II. All of them had to be stable at the time of inclusion. In the studies with patients with CAD, all had normal to slightly reduced LVEF (lowest exclusion criterion on LVEF was > 30% [52]).

Studies were published between 2004 and 2017, of which 12 (50%) were published since our previous meta-analysis in 2014. Fourteen studies were performed in Europe, accounting for 72% of the study population [8, 9, 35,36,37, 42, 43, 45,46,47, 49, 51, 53, 55]. Seven studies were performed in North America [34, 40, 44, 48] and Latin America [38, 50, 54], two studies originated from Asia [41, 52] and one from Australia [39].

All patients were requested to take their medication as prescribed; beta-blockers, angiotensin converting enzyme inhibitors, statins, diuretics, and antiplatelet agents being the main drug categories. Two studies excluded patients with type II diabetes mellitus [45, 49], one study excluded smokers [40]. All studies stated that baseline characteristics were similar among the intervention groups.

Overall, study quality was good, with a median TESTEX-score of 10 (range 8–13) [Table S1 of the Electronic Supplementary Material (ESM)]. Shortcomings were unblinding of assessors, no intention-to-treat analysis, lack of activity monitoring in the control group, and no adaptation of the relative intensity according to the progress of the patients.

3.3 Outcome Assessment and Intervention Characteristics

The primary outcome for this meta-analysis was the change in peak VO2, which was measured during a cardiopulmonary exercise test until exhaustion on a treadmill [8, 9, 34, 36,37,38, 42,43,44, 46, 48, 50, 52, 54, 55] or cycle ergometer [35, 39,40,41, 45, 47, 49, 51, 53, 55] (Table 1). The exercise test was considered maximal if (1) the criteria of the American College of Sports Medicine or American Heart Association [56,57,58] were fulfilled [8, 41, 48,49,50], (2) patients were exhausted, defined by intolerable leg fatigue or dyspnea [35, 37, 39], (3) there was a leveling off in VO2 [9, 34, 54], and (4) the respiratory exchange ratio exceeded 1.10 [44, 53]. The mode of training was similar to the mode of exercise testing, except for one study, in which a combination of a treadmill, stair climber, and arm and leg cycle ergometer was performed during the training sessions and a treadmill for testing [34].

Median intervention duration was 12 weeks (range 4–24 weeks), with only four studies having a duration longer than 12 weeks [34, 39, 45, 50]. Median training frequency was three times a week (range 2–6 times a week), and at least two sessions were supervised. Three studies added one to three home-based training sessions per week to the supervised intervention (Table 1) [9, 34, 40].

Mean duration of the total training session (including warming up and cool down) was 42 min for AIT (range 30–74 min) and 46 min for ACT (range 30–65 min) (Table 1). The intensity of the interventions was prescribed as a percentage of peak HR in nine studies (AIT 80–95% of peak HR—ACT 60–75% of peak HR) [8, 9, 36, 46, 48, 50, 51, 54, 55], as a percentage of HR reserve in six studies (AIT 75–95% of HR reserve—ACT 60–85% of HR reserve) [34, 41,42,43,44, 52], as a percentage of the maximal workload in five studies [35, 37, 40, 45, 49], using the first (ACT) and second (AIT) ventilatory thresholds in three studies [37, 38, 45], as a percentage of peak VO2 in two studies [39, 47], or another method [53]. Twelve studies reported that the energy expenditure was equal for both interventions (Table 1) [8, 9, 34,35,36, 38, 39, 41, 42, 46, 52, 55]. In the other studies, the training protocols differed in total training work, with ACT expending more calories than AIT [40, 45, 51, 55], or did not report on energy expenditure [37, 43, 44, 47,48,49,50, 53, 54]. Overall, compliance with the training sessions and protocol was high, ranging from 75 to 100% of scheduled training sessions. However, patients with poor compliance were mostly excluded from the final analyses and counted as drop-outs. All studies reported similar compliance ratios between groups.

3.4 Primary Outcome

As shown in Fig. 2, a significantly larger improvement was observed after AIT compared with ACT in the total group (+ 1.40 mL/kg/min; 95% CI 0.69–2.11; p < 0.001), and in the subgroups of patients with CAD (+ 1.25 mL/kg/min, 95% CI 0.49–2.02; p = 0.001) and patients with HFrEF (+ 1.46 mL/kg/min; 95% CI 0.10–2.82; p = 0.03). No difference was found between both interventions in the small group of patients with HFpEF (+ 0.37 mL/kg/min; 95% CI − 1.59–2.32; p = 0.71). The subgroup analysis showed no differences between patients with CAD, HFrEF, and HFpEF (p = 0.65; I2 = 0%) (Fig. 2). With each trial deleted from the model once, significance did not change in the patients with CAD. When we excluded the studies of Freyssin et al. (p = 0.08) [37], Fu et al. (p = 0.08) [41], or Wisløff et al. (p = 0.28) [9] in the HFrEF group, differences between both interventions were no longer significant.

Fig. 2
figure 2

Forest plot of peak oxygen uptake according to pathology. ACT aerobic continuous training, AIT aerobic interval training, CADpEF coronary artery disease with preserved ejection fraction, CI confidence interval, HFrEF heart failure with reduced ejection fraction, HFpEF heart failure with preserved ejection fraction, IV intervention, SD standard deviation

Subgroup analyses on the duration of the peak load of AIT (Fig. S1 of the ESM), the duration of the total intervention (Fig. S2 of the ESM), and the training mode (Fig. S3 of the ESM) revealed no significant subgroup differences on peak VO2 (test for subgroup differences, p = 0.71, p = 0.66, p = 0.11, respectively). Studies prescribing a vigorous effort during the interval showed only a trend for larger increases after AIT compared with ACT (+ 1.06 mL/kg/min; − 0.03 to 2.16; p = 0.06), while studies with intervals at very hard effort were in favor of AIT (+ 1.52 mL/kg/min; 0.48–2.56; p = 0.004) [test for subgroup differences, p = 0.55] (Fig. S4 of the ESM). Studies with a sample size of > 50 patients in each group showed smaller differences between both interventions compared with studies with < 20 patients in each group (test for subgroup differences, p = 0.01) (Fig. 3). In addition, studies stating that they were comparing isocaloric AIT and ACT programs showed that AIT was more effective for improving peak VO2 (+ 2.08 mL/kg/min; 95% CI 0.91–3.25; p < 0.001) while interventions with a lower energy expenditure in the AIT group showed similar effects (+ 0.21; 95% CI − 0.48–0.90; p = 0.55) [subgroup difference, p = 0.007] (Fig. 4).

Fig. 3
figure 3

Forest plot of peak oxygen uptake according to the sample size. ACT aerobic continuous training, AIT aerobic interval training, CI confidence interval, IV intervention, SD standard deviation

Fig. 4
figure 4

Forest plot of peak oxygen uptake according to the energy expenditure of the training protocols. ACT aerobic continuous training, AIT aerobic interval training, CI confidence interval, IV intervention, SD standard deviation

3.5 Secondary Outcomes

In Table 2, meta-analytic results of the secondary outcomes are presented. Analyses were performed for the total group and patients with CAD and CHF separately, the latter including both patients with HFrEF and HFpEF.

Table 2 Meta-analytic results of secondary outcomes

For the total group, a significantly higher increase in VT1 (0.88 mL/kg/min; 95% CI 0.16–1.60; p = 0.02) and peak HR (3.78 beats per minute; 95% CI 1.10–6.45; p = 0.006) was observed after AIT compared with ACT. In addition, peak O2 pulse (p = 0.07), OUES (p = 0.10), LVEF (p = 0.07), and FMD (p = 0.09) tended to improve more after AIT compared with ACT.

In the subgroup of patients with CAD, peak HR increased more after AIT compared with ACT (5.11 beats per minute; 95% CI 1.94–8.28; p = 0.002). In patients with CHF (HFpEF and HFrEF), LVEF was significantly more improved after AIT compared with ACT (2.76%; 95% CI 0.98–4.55; p = 0.002). Excluding the papers of Angadi et al. [48] and Aksoy et al. [47] with patients with HFpEF did not change the results (p = 0.02). Similarly, in patients with CHF, VT1 (p = 0.10) and FMD (p = 0.05) tended to increase more after AIT compared with ACT. For FMD, however, results were no longer significant (p = 0.20) after excluding patients with HFpEF [48].

All other secondary outcomes responded similarly to both training interventions (p > 0.05 for all) (Table 2). Thirteen studies reported on QoL using a variety of questionnaires. Some papers used a generic QoL questionnaire including the 12-Item [51] or 36-Item Short Form Health Survey [39, 41, 45, 47, 49, 53, 54]. Other trials used disease-specific questionnaires such as the MacNew Heart Disease Health Related Quality of Life Questionnaire [9, 36, 46, 53] or the Minnesota Living with Heart Failure Questionnaire [39, 41, 45, 49, 54]. Other questionnaires that were reported less frequently were the Cardiac Depression Scale [39], the Hospital Anxiety and Depression Scale [37, 55], the 36-Item Left Ventricular Dysfunction questionnaire [47], the Kansas City Cardiomyopathy Questionnaire [55], the Global Mood Scale [55], and the Type D Scale [55]. Given this large variety in questionnaires, only a qualitative analysis was performed for QoL.

The 12-Item and 36-Item Short Form Health Surveys showed similar effects after AIT and ACT, with some trials reporting improvements in one or more subscales [39, 47, 51, 53, 54] and others showing no changes in any of the subscales [45, 49]. In the paper of Fu et al., AIT increased the physical and mental score of the 36-Item Short Form Health Survey while ACT did not [41].

For the MacNew Heart Disease Health Related Quality of Life Questionnaire all trials reported significant improvements following both AIT and ACT [9, 36, 46, 53]. In only one study, the improvements were larger after AIT compared with ACT [9].

The Minnesota Living with Heart Failure Questionnaire was used in five trials [39, 41, 45, 49, 54]. Two trials reported significant improvements in the total score after AIT [41, 54] and three trials after ACT [39, 41, 54]. There were no significant group differences in any of the studies.

Six studies did not report on adverse events [35, 38, 41, 42, 49, 50], while 15 studies reported no records of training-related adverse events [8, 34, 36, 37, 39, 40, 43, 44, 46,47,48, 51,52,53,54]. In three studies, adverse events did occur [9, 45, 55]. Wisløff et al. reported one death in the ACT arm, but it was unrelated to exercise [9]. Next, Ellingsen et al. reported three serious adverse events within 3 h of supervised exercise in the AIT group (resuscitation after ventricular arrhythmia; inappropriate implantable cardioverter defibrillator discharge; dizziness without detectable cardiovascular cause) [55]. Finally, Koufaki et al. reported one syncope during AIT and one anxiety/panic attack in the ACT group [45].

3.6 Publication Bias

The funnel plot for our primary outcome peak VO2 showed a likelihood of small publication bias (Fig. S5 of the ESM) because of an asymmetric relationship between treatment effect and SE of the treatment effect. The papers by Wisloff et al. [9] and Ellingsen et al. [55] were outside the inverted funnel.

4 Discussion

This systematic review with a meta-analysis compared the effects of AIT with ACT in patients with CAD and CHF (HFrEF and HFpEF). In this updated meta-analysis, we were able to include 15 additional trials and five times more participants, allowing us to make more precise estimates on the effects on peak VO2 and a number of secondary outcomes.

4.1 Primary Outcome

The pooling of the results of 24 trials showed a larger increase in peak VO2 in favor of AIT (1.40 mL/min/kg; p < 0.001) confirming the results of our previous report [15]. Although small, the magnitude of this mean difference in peak VO2 between both interventions is not only statistically significant but is likely to be also clinically relevant as it has been established previously that for every 1-mL/min/kg increase in peak VO2, an approximate 15% reduction can be expected in all-cause mortality [4]. The larger number of trials and participants enabled us to perform some subgroup analyses based on underlying pathology, duration of the high-intensity interval, study duration, sample size, training mode, and energy expenditure. We found that AIT was also superior in the subgroups of patients with CAD (1.25 mL/min/kg; p = 0.001) and patients with HFrEF (1.46 mL/min/kg; p = 0.03). No significant difference in effect size was observed in the subgroup of patients with HFpEF, but the current paucity of published studies warrants further research [47, 48]. Our overall results are in concordance with most previous meta-analyses performed in study populations of patients with CAD [12,13,14,15] or CHF [10,11,12], all showing significant mean effect sizes ranging from 1.04 to 2.14 mL/kg/min in favor of AIT; but our results are in contrast to Cornelis et al. [16], who reported no significant difference among patients randomized to AIT vs. ACT. The latter might be explained by the inclusion of trials of shorter duration and some low-quality studies (i.e., non-randomized) [16].

Though meta-analyses show a benefit from AIT over ACT for peak VO2, a large variety exists among training protocols. To date, it is unknown which AIT protocol is the best. In our meta-analysis, the intervals differed in duration ranging from 20 s to 4 min, and peak load ranging from 50 to 75% of peak VO2 [39, 47] up to ≥ 90% of peak HR [8, 9, 34, 35, 51, 54]. Subgroup analysis revealed no difference with regard to the duration of AIT intervals. However, previous research showed a dose–response relationship [6] between increases in peak VO2 and the intensity of the intervals, with higher intensities (> 92% of peak HR) resulting in a 2-mL/kg/min larger increase compared with lower intensities (< 88% of peak HR). Therefore, it seems that not the duration of the interval but the intensity is of major importance for effectiveness. Indeed, we found that studies with AIT protocols performed at high intensities only tended to improve peak VO2 more compared with ACT (p = 0.06), while extremely high-intensity AIT protocols found significant results in favor of AIT (p = 0.004). Yet, there was no significant subgroup difference (p = 0.55). Finally, two other exercise characteristics that could potentially influence the magnitude of the effect size are the training mode, which was either cycling [35, 39,40,41, 45, 47, 49, 51, 53] or walking/running [8, 9, 36, 38, 42,43,44, 46, 48, 50, 52, 54], and energy expenditure. No subgroup differences in effect size could be found according to training mode (p = 0.11). In contrast, studies comparing isocaloric programs of AIT and ACT found more pronounced increases following AIT, but if energy expenditure of the AIT intervention was less compared with ACT, the effect sizes were similar. This suggests that AIT could be seen as a more time-efficient intervention resulting in similar improvements with less time.

Subgroups according to the duration of the total intervention (< 12 weeks vs. ≥ 12 weeks) found no differences between short and longer duration training periods, although it should be highlighted that the maximal duration was only 6 months [45]. In addition, the two largest multicenter studies [51, 55] found similar effects after AIT and ACT, while pooled results of smaller studies (< 20 patients per group or 20–50 patients per group) were in favor of AIT. It must be mentioned that these two multicenter studies prescribed isocaloric programs but reported that intensity of AIT was decreased and the intensity of ACT was increased to satisfy the subjective feelings of the patients. These two multicenter trials also investigated longer term effects after supervised AIT and ACT [55, 59]. While Pattyn et al. reported a maintenance of the peak VO2 in patients with CAD 9 months after finishing the training intervention [59], Ellingsen et al. reported that improvements were not maintained in patients with CHF [55]. Future research is needed to develop effective cardiac rehabilitation programs and follow-up strategies to improve the lifelong physically active lifestyle in cardiac patients.

4.2 Secondary Outcomes

A significantly larger increase in VT1 (3.78 mL/kg/min; p = 0.006) was found after AIT compared to ACT, with the effect size being more than double the effect size of peak VO2. This improved ability to use oxygen aerobically at higher intensities is likely to translate into improved performance of daily living activities. Yet, it should be acknowledged that the majority of studies observed no differences in QoL among AIT and ACT patients [9, 36, 39, 46, 47, 51, 53, 54]. Based on the Fick principle [VO2 = Q * (CaO2CvO2), where Q is the cardiac output (HR * stroke volume) and CaO2CvO2 the arteriovenous O2 difference], the larger improvement following AIT might partly be explained by the ability to reach higher peak HR (5.11 beats per minute; p < 0.01) and an improved stroke volume, as assessed by peak O2 pulse. It seems that higher exercise intensities challenge the cardiac muscle more to provide the working muscles with oxygen, resulting in an increased stroke volume and HR. Furthermore, increases in peak VO2 can be mediated through improvements in cardiac function and vascular function. Both tended to increase more after AIT in the total group (LVEF, p = 0.07; FMD, p = 0.09), which was driven by the results in the subgroups of patients with CHF (LVEF, 2.76%, p = 0.002; FMD, 2.27%, p = 0.05). The results of LVEF are in line with the meta-analyses of Haykowsky et al. (patients with CHF; five studies; + 3.29%; 95% CI − 0.7–7.28; p = 0.11) [11] and Cornelis et al. [16] (patients with CHF; six studies; + 3.39%; CI 1.62–5.16, p < 0.001). Even though this 2.76% larger effect is significantly different, it might not be clinically relevant. For FMD (2.27%, p = 0.05), our results are in line with a previous meta-analysis of Ramos et al. [60] including seven studies with both healthy and diseased individuals (2.26%, p < 0.001). This difference in favor of AIT might be clinically relevant because a 1% increase in FMD is associated with a 13% reduction in the risk of cardiovascular events [61, 62].

In line with the work of Xie et al. [12], we found no differences in any other cardiorespiratory parameter (VE/VCO2 slope, HRR 1 min) or cardiovascular risk factor (body weight, resting HR, SBP, DBP, high-density lipoprotein cholesterol, low-density lipoprotein cholesterol, triglycerides, fasting glucose). This might be explained by the optimal pharmacological management of blood pressure, cholesterol, and diabetes mellitus in the included cardiac patients. An additional effect of exercise training is therefore often absent or very small.

4.3 Safety

Even though higher intensity AIT results in more beneficial effects on peak VO2, it may also lead to an acute and transient increased risk for severe cardiac events or even sudden cardiac death [63, 64]. Epidemiological data suggested earlier that sedentary healthy individuals have a 50% increased risk for myocardial infarction during or following a bout of physical activity at high intensity compared with individuals who are used to performing high-intensity physical activity on a regular basis [65]. In the current meta-analysis, 18 studies (n = 775) reported on adverse events [8, 9, 34, 36, 37, 40, 43,44,45,46,47,48, 51,52,53,54,55]. Whereas the majority of these studies reported no adverse events, the largest multicenter study comparing AIT and ACT in patients with CHF found three adverse events, all within 3 h after AIT (resuscitation after ventricular arrhythmia; inappropriate implantable cardioverter defibrillator discharge; dizziness without detectable cardiovascular cause) [55]. In addition, Koufaki et al. reported one event of dizziness during an AIT session [45]. However, it remains difficult to derive conclusions based on these data. Using retrospective data, Rognmo and colleagues found a somewhat higher risk following AIT (i.e., 1/23,182 AIT exercise hours vs. 1/129,456 ACT hours) [66]. However, they also provided a power calculation showing that for an adequately powered randomized trial, we would require > 20,500 patients (and generate > 750,000 exercise hours) to determine the safety of AIT. To be able to draw unified conclusions about the safety of AIT, more and larger studies are needed.

4.4 Strengths and Limitations

This is the first systematic review and meta-analysis including both patients with CAD and CHF, focusing on secondary outcomes including cardiorespiratory parameters, cardiovascular risk factors, cardiac and vascular function, and QoL. In addition, we are the first to analyze the data on peak VO2 depending on interval duration, total study duration, sample size, training mode, and energy expenditure of the training protocols. However, there are some limitations. Only two large multicenter trials were identified for inclusion [51, 55], while the other studies had quite small sample sizes. Moreover, there was a large heterogeneity between study protocols and in the treatment effects. Therefore, random-effects models were used. The funnel plot showed minimal publication bias.

5 Conclusion

This meta-analysis further reinforces that peak VO2 benefits more from AIT compared with ACT in both patients with CAD and CHF. For secondary outcomes, AIT was more beneficial in increasing VT1 and peak HR, and tended to increase LVEF, peak O2 pulse, OUES, and FMD more compared with ACT. Other cardiorespiratory parameters, cardiovascular risk factors, and QoL were equally influenced by both methods of exercise training. Further, AIT seems to be a time-efficient strategy to improve peak VO2, while ACT can be equally effective if the training volume is increased.