FormalPara Key Points

The aim of this systematic review and meta-analysis was to compare the associations between leisure-time physical activity (LTPA) in pregnancy on maternal and child health outcomes between randomized controlled trials and cohort studies.

Our analysis revealed that women who were active during pregnancy were less likely to have excessive weight gain, to have gestational diabetes mellitus, and to deliver a preterm infant or a baby large for gestational age.

The findings presented in our study provide additional evidence of the positive effects of LTPA during pregnancy and support the promotion of LTPA in pregnancy as a strategy to improve maternal and child health.

1 Introduction

Pregnancy is a period characterized by intense physical changes, in which morphological adaptations occur to create an ideal environment for the development of the fetus; such rapid changes produce short- and long-term impacts on health [1]. In addition, the gestational period is an opportunity to promote positive health behaviors, considering that women are concerned with the child’s well-being. In this context, the health effects of physical activity during pregnancy have been extensively investigated in the literature. The current evidence suggests potential benefits of leisure-time physical activity (LTPA) during pregnancy on maternal and child health [25].

Studies have reported that physical activity performed during pregnancy is related to a lower incidence of gestational diabetes mellitus (GDM) [3], preeclampsia [6], and excessive weight gain [2]. In addition, a decreased incidence of preterm birth [7] and obesity in adult life have been linked to maternal physical activity in pregnancy [8]. The prevention of these complications during pregnancy becomes necessary as the development of diabetes and gestational hypertensive disorders, as well as fetal growth restriction and premature birth, are associated with increased risk of cardiovascular disease and mortality in adulthood [9].

Despite substantial advances in the scientific knowledge and evolution of the guidelines to promote physical activity in pregnancy [10], most pregnant women do not reach the current recommendations of at least 150 min of moderate-intensity aerobic exercise per week and continue to be inactive before and after pregnancy [11, 12]. Furthermore, physical activity levels tend to decline during pregnancy [13, 14]. Previous studies from different countries have shown low levels of physical activity during pregnancy, especially in the third trimester [1316]. Data from Norway showed that the proportion of women who performed regular exercise before pregnancy was 46.4 %, with sharp declines to 28 and 20 % at the 17th and 30th weeks of gestation, respectively [15]. Other studies conducted in the USA [17] and Denmark [16] showed a similar reduction in physical activity with advancing pregnancy, while one study in South Brazil showed that only 4 % of mothers were active in leisure time during the entire pregnancy [13].

Recent systematic reviews have summarized the associations of physical activity during pregnancy with specific maternal and child health outcomes [36]. Some reviews focused on experimental studies only [2, 4, 18], whereas others evaluated observational studies [3, 6, 7]. A series of methodological differences mean observational and experimental designs can lead to different findings. For example, whereas the results of previous meta-analyses of cohort studies showed positive associations between LTPA and maternal–child health [3, 6], most RCTs reported no associations [1820]. An exploration of the different findings of cohort and experimental studies is a key literature gap.

The aim of our systematic review was to compare the associations between LTPA in pregnancy and three maternal (GDM, preeclampsia, and weight gain during pregnancy) and three child (preterm birth, birthweight, and fetal growth) health outcomes between randomized controlled trials (RCTs) and cohort studies.

2 Methods

This systematic review and meta-analysis was conducted according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [21] and the Cochrane Handbook for Systematic Reviews of Interventions [22].

2.1 Search Strategy

We searched PubMed, Web of Science, and EBSCO up to 31 August 2015 for eligible studies. The following search terms were used in all databases: (physical activity OR exercise OR sports OR motor activity OR leisure time physical activity OR recreational activities OR fitness OR walking) AND (pregnancy OR pregnant woman OR gestation). The terms were entered individually and combined on the advanced search field on each database. We searched the reference lists of included studies and selected reviews for further analysis. In addition, we conducted searches in the Clinical Trials and Cochrane Database Controlled Trials websites. Only publications in English, Spanish, and Portuguese were included. The screening process was carried out independently by two researchers, and disagreements were solved by consensus. If disagreement persisted, a third reviewer resolved the disagreement.

2.2 Inclusion and Exclusion Criteria

The inclusion criteria for experimental studies were (1) RCT; (2) the intervention had to include at least one physical activity component in a structured program; (3) the study had to include a control group; and (4) outcomes had to be presented separately for the intervention and the control groups. Intervention studies were excluded from this review if the sample comprised only women with comorbidities, such as diabetes, preeclampsia, or obesity.

The inclusion criteria for cohort studies were (1) for studies using self-reported physical activity as the exposure variable, only information on LTPA was extracted; (2) studies using accelerometry as the exposure variable; (3) only physical activity during pregnancy was considered (studies evaluating exclusively physical activity before pregnancy were excluded); and (4) the study had to include one of the following three maternal (GDM, preeclampsia, gestational weight gain) or child (birth weight, preterm birth, fetal growth) health outcomes. Cohort studies were excluded if the sample was selected among a specific group of women with a high risk of developing a given outcome.

2.3 Definitions of Outcomes

Excessive gestational weight gain (EWG) was defined in accordance with the recommendations from the US Institute of Medicine (IOM), which establishes proper weight gain parameters according to categories of body mass index [23]. GDM is a specific disorder of pregnancy, defined as “any degree of glucose intolerance with onset or first recognition during pregnancy” [24]. Preeclampsia is also a specific disorder of pregnancy characterized by hypertension (blood pressure ≥140/90 mmHg) and the presence of protein in the urine from the 20th week of pregnancy in previously normotensive women [25]. Low birth weight was defined as birth weight <2500 g in term babies [26]. Preterm birth was defined as births occurring before the 37th week of gestation [27]. Fetal growth was divided into two separate outcome variables: [24] large for gestational age (LGA) was defined as a fetus or infant larger or more developed than expected for the baby’s sex and gestational age [28]; small for gestational age (SGA) was defined as a fetus or infant smaller or less developed than expected for the baby’s sex and gestational age [28].

2.4 Data Extraction

Data from selected studies were screened by two reviewers separately. From experimental studies, we extracted the characteristics of the study (author, year, country), participants (number in each group, total number), intervention (type, duration, frequency, and intensity of physical activity intervention), and findings. From cohort studies, we extracted characteristics of the study (author, year, country); cohort characteristics (gestational period, number of participants, measurement methods of physical activity), and findings.

2.5 Quality Assessment

We used the Jadad Scale [29] to evaluate the quality of experimental studies. The scale comprised three main topics: randomization, blinding, and dropouts. Points were also given for appropriate use and description of the randomization and blinding method. Because double blinding was not possible for exercise interventions, the final score ranged from 0 (worst) to 4 (best) points. For the quality assessment of cohort studies, we used the Newcastle–Ottawa Scale [30], comprising eight items on sampling methods, comparability, and outcome accuracy. Two researchers conducted the evaluation process independently. The proportion of disagreement was <10 %, and the two reviewers agreed by consensus in these instances.

2.6 Statistical Analysis

Statistical analyses were conducted using Stata version 13.0. We performed a meta-analysis to assess a summary estimate of the effects in each article by calculating a random model. I 2 was used to test heterogeneity. Usually I 2 values of <25, 25–50, and >50 % are considered to represent small, medium, and large levels of inconsistency. For binary outcomes, we calculated the odds ratios (OR) and 95 % confidence intervals (CIs) for the categorical outcomes of GDM, preeclampsia, and fetal growth (SGA vs. others, and LGA vs. others).

For studies using continuous outcomes, such as gestational weight gain (kg), birth weight (g), and gestational age (weeks), we calculated mean differences (MDs) and difference in standard error (DSE). In cohort studies, gestational weight gain and preterm births (<37 weeks) were calculated as binary outcomes. For the cohort studies in which birth weight was the outcome variable, we calculated regression coefficients (β) and 95 % CIs.

We used a random-effects meta-analysis to pool the estimates, which takes into account between-study heterogeneity, since the study design and exposure, definition of physical activity, and intensity were not uniform across cohort studies. We chose the DerSimonian and Laird method for estimating random effects to distribute weights evenly, since this method evaluates the contribution of studies with small sample size as well as heterogeneity between studies [22].

To allow for comparability of exposure across cohort studies, we analyzed the OR for the highest physical activity category versus the lowest (reference) category. All analyses were conducted separately for experimental and cohort studies.

3 Results

The search strategy resulted in 17,925 titles to be examined, of which 1119 were selected for abstract review. After reading the abstracts, we excluded 1001 articles, and therefore read 118 full texts. After we applied the inclusion and exclusion criteria, our review included 30 randomized trials and 51 cohort studies (Fig. 1).

Fig. 1
figure 1

Flowchart reporting the items for systematic reviews of the studies included. a Randomized controlled trial

3.1 Randomized Controlled Trials

Table 1 presents the characteristics of the 30 RCTs included in our review. The countries with more studies were Spain (n = 10), Brazil (n = 4), Norway (n = 4), and the USA (n = 3). A large variability was found in the number of participants for each intervention, ranging from nine to 481 individuals in the intervention group and six to 481 in the control group. Only the study by de Oliveria Melo [31] included three groups, as described in Table 1. All interventions comprised a structured exercise program. Most included moderate-intensity physical activities three times per week. The duration of sessions varied between 20 and 70 min. Exercise strategies varied widely, although the majority included aerobic exercises and muscle resistance or strength training. In terms of methodological quality according to the Jadad scale [29], the mean score was 2.7 points, ranging from 1 (worst) to 4 (best) points.

Table 1 Characteristics of randomized controlled trials included in the systematic review and meta-analysis (n = 30)

In most included studies, the dropout rate was low (<15 %) [1, 8, 3242], four studies had a dropout rate between 15 and 20 % [31, 4345], eight studies between 20 and 25 % [4653], and five studies had a dropout rate >25 % [5458]. The main reasons for dropouts in the intervention group were discontinued intervention [1, 34, 38, 40, 4648, 5557], risk of premature birth [42, 50, 51], pregnancy-induced hypertension [35, 39], logistical difficulties [54], and lack of flexibility in test scheduling [53]. Regarding compliance with the LTPA intervention protocol, nine studies reported a high rate (>90 %) [32, 36, 3941, 43, 44, 48, 49], eight had a rate between 80 and 90 % [8, 34, 35, 44, 46, 47, 50, 51], and six studies had compliance <80 % [1, 33, 38, 42, 54, 58]. In six other studies, the adherence rate could not be identified [37, 52, 53, 5557]. The main reasons to quit the LTPA programs were unwillingness to do the exercise [41], personal reasons [8, 4345], transport issues and logistical difficulties [54]. The main barrier reported by women in these studies was related to the difficulties the women had in regularly attending the scheduled programs sessions. Lack of blinding was a methodological flaw recognized in most interventions; however, it is difficult to overcome this problem in physical activity programs for pregnant women.

3.2 Cohort Studies

Table 2 displays the characteristics of the cohort studies included. Most studies came from the USA (n = 24), followed by Denmark (n = 6), Norway (n = 5), and Brazil (n = 4). Sample sizes ranged from 44 to 87,232 participants. We observed a clear heterogeneity of physical activity definitions, including continuous scores in metabolic equivalents (METs) multiplied by time, counts from accelerometry, and duration. Most instruments used to assess physical activity were self-reported questionnaires; four studies included accelerometry [5962]. The quality assessment resulted in a mean of 6.6 points in the Newcastle–Ottawa Scale, ranging from 4 (worst) to 9 (best) points. The main methodological problems detected were the use of self-reported information on physical activity instead of accelerometry and the lack of a detailed characterization of non-respondents.

Table 2 Characteristics of cohort studies included in the systematic review and meta-analysis (n = 51)

3.3 Meta-Analysis: Weight Gain During Pregnancy

Figures 2, 3, 4, 5, 6, 7 and 8 display the meta-analysis graphs comparing experimental and cohort studies. For gestational weight gain (GWG), the meta-analysis of the RCTs included 1605 women in the control groups and 1598 in the exercise groups. The meta-analysis resulted in a mean difference in GWG of −1.11 kg (DSE −1.53; −0.69). Women exposed to exercise interventions gained less weight during pregnancy than those not taking part in an exercise intervention. There was no heterogeneity across the trials (I 2 = 0 %; p = 0.868; Fig. 2a).

Fig. 2
figure 2

Meta-analysis of the effect of physical activity on gestational weight gain in randomized controlled trials and cohort studies: a mean difference and difference in standard error between intervention group and control group (continuous analysis); b odds ratio ± 95 % confidence interval for exceeding the gestational weight gain guidelines from the US Institute of Medicine [23] (binary analysis). Schlaff et al. [63] (a) = effect measure for moderate physical activity; Schlaff et al. [63] (b) = effect measure for vigorous physical activity; Chasan-Taber et al. [64] (c) and Jiang et al. [65] (c) = effect measure for physical activity during early pregnancy; Chasan-Taber et al. [64] (d) and Jiang et al. [65] (d) = effect measure for physical activity during mid pregnancy; Chasan-Taber et al. [64] (e) and Jiang et al. [65] (e) = effect measure for physical activity during late pregnancy. The point estimate drawn represents graphically the weight of the study in the random-effects analysis. CG control group, CI confidence interval, ES effect size, GWG gestational weight gain, IG intervention group

Fig. 3
figure 3

Meta-analysis of the effect of physical activity on gestational diabetes mellitus: a relative risk ± 95 % confidence interval in randomized controlled trials; (b) odds ratio ± 95 % confidence interval in cohort studies. Barakat et al. [43] (a) = analysis with World Health Organization criteria; Barakat et al. [43] (b) = analysis with International Association for Diabetes in Pregnancy Study Group criteria. Chasan-Taber et al. [66] (c) = effect measure for physical activity during early pregnancy; Chasan-Taber et al. [66] (d) = effect measure for physical activity during mid pregnancy. The point estimate drawn represents graphically the weight of the study in the random-effects analysis. CG control group, CI confidence interval, ES effect size, GDM gestational diabetes mellitus, IG intervention group, RR relative risk

Fig. 4
figure 4

Meta-analysis of the effect of physical activity on pre-eclampsia: a relative risk ± 95 % confidence interval in randomized controlled trials; b odds ratio ± 95 % confidence interval in cohort studies. De Oliveria Melo et al. [31] (a) = exercise initiated at 13 weeks; De Oliveria Melo et al. [31] (b) = exercise initiated at 20 weeks; Vollebregt et al. [67] (a) = effect measure for moderate physical activity; Vollebregt et al. [67] (b) = effect measure for vigorous physical activity. The point estimate drawn represents graphically the weight of the study in the random-effects analysis. CG control group, CI confidence interval, ES effect size, IG intervention group, RR relative risk

Fig. 5
figure 5

Meta-analysis of the effect of physical activity on difference in mean birthweight: a mean difference and difference in standard error in randomized controlled trials; b adjusted regression coefficient (β) ± 95 % confidence interval in cohort studies. De Oliveria Melo et al. [31] (a) = exercise initiated at 13 weeks; De Oliveria Melo et al. [31] (b) = exercise initiated at 20 weeks; Hegaard et al. [16] (a), Sternfeld et al. [68] (f), and Hatch et al. [69] (i) = effect measure for physical activity during mid pregnancy; Fleten et al. [71] (c), Sternfeld et al. [68] (e), and Hatch et al. [69] (h) effect measure for physical activity during early pregnancy; Hegaard et al. [16] (b), Sternfeld et al. [71] (g), Fleten et al. [71] (d), and Hatch et al. [69] (j) = effect measure for physical activity during late pregnancy. The point estimate drawn represents graphically the weight of the study in the random-effects analysis. CG control group, CI confidence interval, ES effect size, IG intervention group

Fig. 6
figure 6

Meta-analysis of the effect of physical activity on gestational age: a mean difference and difference in standard error in randomized controlled trials (continuous analysis); b odds ratio ± 95 % confidence interval in cohort studies (binary analysis). Owe et al. [72] (a), Evenson et al. [73] (d) = effect measure for physical activity during mid pregnancy; Owe et al. [72] (b) = effect measure for physical activity during late pregnancy; Evenson et al. [73] (c) = effect measure for physical activity during early pregnancy. The point estimate drawn represents graphically the weight of the study in the random-effects analysis. CG control group, CI confidence interval, ES effect size, IG intervention group

Fig. 7
figure 7

Meta-analysis of the effect of physical activity on being born small for gestational age (defined as birth weight below the 10th centile or 2500 g): a relative risk ± 95 % confidence interval in randomized controlled trials; b odds ratio ± 95 % confidence interval in cohort studies. De Oliveria Melo et al. [31] (a) = exercise initiated at 13 weeks; De Oliveria Melo et al. [31] (b) = exercise initiated at 20 weeks; Harrod et al. [74] (a), Gollenberg et al. [75] (d) = effect measure for physical activity during early pregnancy; Harrod et al. [74] (b), Gollenberg et al. [75] (e) = effect measure for physical activity during mid pregnancy; Harrod et al. [74] (c) = effect measure for physical activity during late pregnancy. The point estimate drawn represents graphically the weight of the study in the random-effects analysis. CG control group, CI confidence interval, ES effect size, IG intervention group, RR relative risk, SGA small for gestational age

Fig. 8
figure 8

Meta-analysis of the effect of physical activity on being born large for gestational age (defined as birth weight above the 90th centile or 4000 g): relative risk ± 95 % confidence interval in randomized controlled trials. De Oliveria Melo et al. [31] (a) = exercise initiated at 13 weeks; De Oliveria Melo et al. [31] (b) = exercise initiated at 20 weeks. The point estimate drawn represents graphically the weight of the study in the random-effects analysis. CG control group, CI confidence interval, ES effect size, IG intervention group, RR relative risk

For cohort studies (Fig. 2b), most studies dichotomized the outcome as exceeding or not exceeding the GWG guidelines from the IOM [19]. Active women during pregnancy had an 18 % lower risk of GWG that exceeded the IOM recommendations as compared with inactive women (OR 0.82; 95 % CI 0.68–0.99). The sample of cohort studies included 9795 women, and the studies exhibited moderate heterogeneity (I 2 = 60.2 %; p = 0.005). The article by Schlaff et al. [63] included separate findings for moderate- and vigorous-intensity physical activity, and both were included in the meta-analysis. The papers by Chasan-Taber et al. [64] and by Jiang et al. [65] included measures of early, mid, and late pregnancy, and all results were included in the meta-analysis.

3.4 Meta-Analysis: Gestational Diabetes

In terms of GDM, ten trials were included, with 1907 women in the control groups and 1883 in the exercise groups. Barakat et al. [43] included analysis with two criteria for GDM diagnostics: World Health Organization criteria and the International Association for Diabetes in Pregnancy Study Group criteria; both findings were included in the meta-analysis. The meta-analysis suggested a protective role of exercise interventions on the development of GDM (relative risk [RR] 0.67; 95 % CI 0.49–0.92). The studies showed low heterogeneity (I 2 = 33 %; p = 0.135).

When taking cohort studies into account, the total sample size was 6754. The summary OR for GDM comparing high/moderate and low/no LTPA was 0.75 (95 % CI 0.55–1.01), with no evidence of heterogeneity across studies (I 2 = 0 %; p = 0.615). The paper by Chasan-Taber et al. [66] included measures of early- and mid-pregnancy physical activity; both findings were included in the meta-analysis (Fig. 3).

3.5 Meta-Analysis: Preeclampsia

Data from three trials were included in the meta-analysis of preeclampsia, with 708 women in the control groups and 709 in the exercise groups. No evidence of an association between exercise interventions in pregnancy and risk of preeclampsia was observed (RR 0.93; 95 % CI 0.55–1.57). There was no evidence of heterogeneity across the trials (I 2 = 0 %; p = 0.872).

In the cohort analysis, eight studies were included, with a sample size of 155,414 women. Similar to the findings of the randomized trials, there was no evidence of an association between LTPA in pregnancy and preeclampsia. There was low heterogeneity across cohort studies (I 2 = 19.4 %; p = 0.270). The paper by Vollebregt et a l. [67] included effect measures for moderate-intensity and vigorous-intensity physical activity, and both were included in the meta-analysis (Fig. 4).

3.6 Meta-Analysis: Birthweight

A total of 22 randomized trials (2431 women in exercise groups and 2478 in control groups) evaluated the effect of exercise interventions on birthweight. There was no evidence of an effect of LTPA on average birthweight (MD −31.09 g; DSE −69.91; 7.73). However, high heterogeneity was detected across randomized trials (I 2 = 98.8 %; p < 0.001).

The analysis of the six cohort studies (n = 62,127 women) showed a small effect of LTPA on mean birthweight (β −1.05 g; 95 % CI −1.49; −0.62) and no heterogeneity (I 2 = 0 %; p = 0.445). Sternfeld et al. [68] and Hatch et al. [69] included measures of early, mid, and late pregnancy physical activity; again, all estimates were included in the meta-analysis. Hegaard et al. [70] and Fleten et al. [71] included measures of early and mid/late pregnancy physical activity; both were included in the meta-analysis (Fig. 5).

3.7 Meta-Analysis: Gestational Age

Data from 17 trials were included in the meta-analysis of gestational age, comprising 2169 women in the control groups and 2109 in the exercise groups. The meta-analysis showed no difference between the groups in gestational age at delivery (MD −0.07 weeks, DSE −0.29; 0.16). There was no heterogeneity across the trials (I 2 = 0 %; p < 0.001).

The 11 cohort studies (n = 81,595) showed an inverse association between LTPA and the risk of preterm birth (OR 0.80; 95 % CI 0.70–0.91). A low heterogeneity across studies was found (I 2 = 13.4 %; p = 0.310). Owe et al. [72] and Evenson et al. [73] included more than one measure of LTPA in pregnancy, and all findings were included in the meta-analysis (Fig. 6).

3.8 Meta-Analysis: Fetal Growth

Four trials (754 women in exercise groups and 745 in control groups) evaluated the effect of exercise interventions on the risk of being born SGA, and three trials (302 women in exercise groups and 301 in control groups) evaluated the effect of exercise interventions on the risk of being born LGA. No association was observed between exercise and SGA (Fig. 7), but women undergoing exercise interventions during pregnancy had a lower risk of having an LGA baby (RR 0.51; 95 % CI 0.30–0.87) (Fig. 8). There was no heterogeneity across the trials (I 2 = 0 %; p < 0.001) in both analyses, LGA and SGA.

In cohort studies, it was only possible to analyze the outcome ‘SGA’ given the low number of studies including the outcome ‘LGa’. Data from three studies were included in the meta-analysis for SGA. There was no association between LTPA and SGA (OR 1.03; 95 % CI 0.81–1.30), and low heterogeneity across studies (I 2 = 25.2 %; p = 0.245). Harrod et al. [74] and Gollenberg et al. [75] included more than one measure of physical activity during pregnancy; all estimates were included in the meta-analysis (Fig. 7).

4 Discussion

To the best of our knowledge, this is the first meta-analysis comparing associations between LTPA during pregnancy and maternal and child health between experimental and cohort studies. Previous reviews on the associations of LTPA on maternal–child outcomes were limited to specific groups of women of specific study designs [76, 77].

Consistent associations for LTPA in pregnancy were observed for weight gain (active women gained less weight during pregnancy), GDM (active women were less likely to develop GDM), preterm birth (active women were less likely to deliver a preterm infant), and fetal growth (active women were less likely to deliver an LGA baby). In methodological terms, the comparisons presented here are particularly relevant because of the different features of experimental and cohort studies.

The first difference between these two study designs relates to the nature of the exposure variable (i.e., physical activity). Cohort studies relied on self-reported LTPA and moderate–vigorous physical activity from accelerometry, whereas most experimental studies delivered structured exercise interventions to pregnant women. Observational studies therefore face higher variability in physical activity levels, whereas randomized trials are more prone to homogeneous physical activity levels in the intervention groups.

Another issue is the possibility of confounding in cohort studies, which is not the case for RCTs of sufficient sample size. For example, high socioeconomic status women are more active in leisure time [13] and have better maternal and child health indicators. In unadjusted analysis, it is therefore likely that active women will present better health indicators than inactive women, at least partly because they are from high socioeconomic groups. Adjusting for socioeconomic status could theoretically remove this artefact, but residual confounding is always a possibility.

Sample sizes also varied considerably between cohort and experimental studies. Most trials had fewer than 100 participants per group, whereas most cohort studies had sample sizes of more than 1000 women. Therefore, differences of the same magnitude could have been captured as significant in cohort studies, and not in experimental studies. Furthermore, another important issue when comparing RCTs and cohort studies is the selection of exposure to physical activity during pregnancy. While some cohort studies considered physical activity before pregnancy to be a confounding variable in the analysis, most did not specify whether women were active before pregnancy; in most RCTs, being inactive before pregnancy was an inclusion criterion. The positive effects found in cohort studies, but not in intervention studies could be attributed to long-term physical activity participation and not just to physical activity during pregnancy.

Despite differences between cohort and experimental studies, some of the findings were markedly consistent. In terms of weight gain during pregnancy, findings were similar when comparing cohort (n = 11) and experimental (n = 18) analyses; regardless of the study design, active women had lower weight gain during pregnancy. These results are consistent with the meta-analysis of 12 studies conducted by Streuling et al. [2], which showed an average weight gain significantly lower in the intervention groups (−0.61; 95 % CI −1.17 to 0.06) compared with controls. On the other hand, our findings differ from those of a Cochrane review of four articles on aerobic exercise during pregnancy, in which Kramer and McDonald [78] concluded that exercise showed no significant effect on weight gain during pregnancy.

Regarding GDM, our meta-analysis of 11 experimental studies showed a protective effect in women who engaged in physical activity during pregnancy. The same direction of association was observed in seven cohort analyses, although the difference was borderline in terms of statistical significance. In a previous meta-analysis of five cohort and case–control studies, Tobias et al. [3] showed a protective effect of physical activity on the development of GDM, but the same was not observed in the meta-analysis of five RCTs conducted by Yin et al. [18].

A protective effect of physical activity on the incidence of preeclampsia was not confirmed in our meta-analysis of four experimental and nine cohort studies. Aune et al. [6] conducted a systematic review and meta-analysis of seven cohort and four case–control studies, and found an inverse association between physical activity and preeclampsia. However, this association was not observed in a Cochrane review of three trials conducted by Meher and Duley [19].

There were no differences in mean birthweight of newborns according to maternal LTPA in pregnancy, consistent with two previous meta-analyses [20, 79]. A 2015 meta-analysis of 15 RCTs found that active women delivered lighter babies [4]; however, this study did not include articles published in 2014 and 2015, which were the studies tending to show results in the opposite direction. All weights were similarly distributed, with the exception of the meta-analysis of mean birth weight in cohort studies, where two studies carried almost 100 % of the weighted effect. The observed difference was because Fleten et al. [71] had a sample of 43,705 individuals, whereas other studies had fewer participants. However, we used a model that takes into consideration the size of the sample in the meta-analysis to distribute the weights evenly considering the contribution of studies with small sample size as well as heterogeneity between studies. In addition, we performed an additional analysis to observe a possible change in the effect estimation with the withdrawal of the Fleten et al. [71] study. No significant changes in the pooled effect were observed.

In terms of preterm births, experimental evidence does not support an association with physical activity in pregnancy, similar to the findings by Sanabria-Martinez et al. [4]. In the past, there was a concern in the literature as to whether exercise during pregnancy could cause adverse effects on maternal and fetal growth and increase the risk of premature births [80]. According to Goldenberg et al. [81] working long hours and undertaking hard physical labor under stressful conditions are probably associated with an increase in preterm birth, especially in relation to occupational physical activity. However, several studies that focused on vigorous sports training several times a week during pregnancy found it was associated with a decreased or unchanged risk of preterm delivery [69, 73, 82]. Observational studies have shown that regular LTPA during pregnancy can even reduce the incidence of preterm births [83, 84], an association that was confirmed in our meta-analysis of 13 cohort studies.

Our meta-analysis of RCTs suggests that exercise during pregnancy leads to decreased odds of delivering an LGA newborn. Similar results were found in the meta-analysis conducted by Wiebe et al. [5]. It is important to highlight that the available literature focusing on fetal growth is still limited compared with other outcomes.

Most physical activity interventions designed to prevent prenatal complications have focused on LTPA of moderate intensity. Less attention has been paid to sedentary behavior or light-intensity activity during pregnancy [85]. Increasing time spent in light-intensity activity could have important health implications during pregnancy by directly reducing time spent in sedentary behavior. Future studies might help address this literature gap. Another important research question is the impact of exercise intensity on maternal and child health outcomes. In our meta-analysis, it was not possible to evaluate the separate effects of moderate- and vigorous-intensity activities due to the small number of studies focusing on vigorous-intensity activities.

Our systematic review and meta-analysis differs from others published in the literature in some respects because we adopted methodological criteria such as the inclusion of studies in three languages (English, Portuguese, and Spanish), allowing a greater number of studies to be included, whereas most previously published reviews included only studies in English. Furthermore, we performed a meta-analysis only with LTPA from observational studies to enable a better comparison with intervention studies, whereas other meta-analyses included all domains of physical activity (occupational, leisure-time, domestic, and active commuting) in the same analysis [6].

5 Conclusion

The available evidence supports associations between LTPA during pregnancy and the following outcomes: weight gain in pregnancy, GDM, preterm births, and LGA. No evidence of an association with preeclampsia was detected, but only three trials have been conducted on this topic so far. Collectively, our findings support the promotion of LTPA in pregnancy as a strategy to improve maternal and child health.