Background

Depression during pregnancy, defined hereafter as antepartum depression, is highly prevalent, affecting approximately 20–25% of pregnant women globally (Gavin et al. 2005; Gelaye et al. 2016). Antepartum depression can have devastating affects for both the mother and infant and is linked to a number of adverse health-related behaviors and outcomes. Such behaviors and outcomes include poor maternal nutrition (Barker et al. 2013), increased substance use (Horrigan et al. 2000), preeclampsia (Kurki et al. 2000), spontaneous preterm delivery (Orr et al. 2002), postpartum depression (Dietz et al. 2007), and impaired fetal and infant growth (Rahman et al. 2004; Hoffman and Hatch 2000). Estimates for the increase in risk for certain outcomes are as high as 39% for preterm birth, 45% for intrauterine growth restriction, and 49% for low birth weight delivery (Grote et al. 2010). Based on such findings, the American College of Obstetricians and Gynecologists (ACOG) now recommends that clinicians screen for depression at least once during the perinatal period, and an understanding of the underlying neurobiological pathways continues to be an important research objective.

One such candidate pathway is the hypothalamic pituitary adrenal (HPA) axis (Pariante and Lightman 2008; Penninx et al. 2013; Knorr et al. 2010; Stetler and Miller 2011; Kino 2015). The HPA axis is one of the body’s stress-response systems and plays a crucial role in homeostatic regulation. Pregnancy is a period of profound physiological changes during which increases in maternal and placental corticotrophin-releasing hormone increase circulating maternal cortisol levels (Brunton et al. 2008; Lachelin 2013; Burke and Roulet 1970; Kirschbaum et al. 2009). Such naturally occurring adaptions are thought to occur to aid in fetal lung maturation (Ballard and Ballard 1972) and fetal growth (Bolten et al. 2011) and to prime the placenta for childbirth (Sandman et al. 2006). Saliva, blood, and urine are biological matrices commonly used to assess cortisol levels during pregnancy and reflect cortisol levels in the past 1 to 24 h. Using such matrices, cortisol has been observed to follow a diurnal trend with peak levels at awakening and nadir levels in the evening. Differences in peak and nadir cortisol levels have also been observed to widen as pregnancy progresses (Brunton et al. 2008). In order to assess long-term cortisol secretions reflecting the past 1 to 3 months, hair has emerged as another matrix of choice (Wosu et al. 2013). A commonly cited concern of hair is cortisol degradation due to prolonged environmental exposure (Dettenborn et al. 2012). Given such differences in the time windows these matrices represent, reviews of the literature for the association between maternal cortisol and antepartum depression can be increasingly complicated.

Previous findings for the association between maternal cortisol levels and antepartum depression have been mixed. When evaluating maternal cortisol concentrations (basal or in response to awakening or stressful stimuli), some investigators report statistically significant differences comparing pregnant women with and without depression (Bjelanovic et al. 2015; Diego et al. 2009; Field et al. 2009; Hoffman et al. 2016; Lommatzsch et al. 2006; Murphy et al. 2015; O’Connor et al. 2014; O’Keane et al. 2011; Parcells 2010; Peer et al. 2013; Voegtline et al. 2013). However, a number of investigators have reported no such differences (Braithwaite et al. 2016; Davis et al. 2007; Deligiannidis et al. 2013; Evans et al. 2008; Glynn and Sandman 2014; Goedhart et al. 2010; Hellgren et al. 2013; Iliadis et al. 2015; Kaasen et al. 2012; Katz et al. 2012; Luiza et al. 2015; Monk et al. 2011; Pluess et al. 2010; Rouse and Goodman 2014; Salacz et al. 2012; Shaikh et al. 2011; Wikenius et al. 2016; Deligiannidis et al. 2016). Reasons for such inconsistencies may be due to differences in methodologies across studies, such as differences in the timing and method of maternal cortisol assessment. Recent reviews on the association between maternal cortisol levels and antepartum depression conclude that pregnant women with antepartum depression present with elevated cortisol concentrations and blunted awakening responses when compared to non-depressed pregnant women (Seth et al. 2016; Serati et al. 2016; Iliadis et al. 2015). However, the role of cortisol collection method and timing of assessment during pregnancy requires additional investigation. Therefore, the objective of this systematic review was to further examine the extent to which reported associations between maternal cortisol levels and antepartum depression differed across studies according to timing of cortisol collection, thereby building upon the work of previous reviews on the topic.

Methods

Studies evaluating the relationship between maternal cortisol and antepartum depression were identified by searching PubMed/MEDLINE (National Library of Medicine), EMBASE (Elsevier; 1974–), Cumulative Index to Nursing and Allied Health Literature (CINAHL, EBSCO), PsycINFO (EBSCO), and Web of Science Core Collection (Thomson Reuters) from inception through December 15, 2016. Controlled vocabulary terms (e.g., MeSH or Emtree terms) were included when available and appropriate. The search strategies were designed and executed by a librarian (PAB). No language limits or year restrictions were applied, and bibliographies of relevant articles were reviewed to identify additional studies. The exact search terms used for each of the databases are provided in the supplementary document.

Records were identified through databases, and duplicates were removed. The abstracts of the remaining records were screened for inclusion using the following eligibility criteria: (1) maternal cortisol assessed during pregnancy; (2) antepartum depression assessed during pregnancy using a screening instrument; (3) reports on the association between maternal cortisol and antepartum depression; (4) provides information on timing of cortisol assessment during pregnancy, including time of day and gestation; and (5) not a review article or a case study. We did not restrict inclusion according to depression screener; however, studies that did not report a depression screener or assessed “blues” or stress more generally were excluded. Findings from all studies were reported, and reported p values ≤ 0.05 were deemed statistically significant.

Results

One thousand three hundred seventy-five records were retrieved from the database search, resulting in 826 unique records for screening. Twenty-seven studies met all inclusion criteria, and a scan of the references of these studies resulted in two additional studies that also met all inclusion criteria. In total, 29 studies were included in our systematic review (Fig. 1). The majority of the screened studies failed to meet inclusion because investigators reported on the association between maternal cortisol and infant outcomes (or between maternal depression with infant outcomes), rather than on the association between maternal cortisol and antepartum depression (N = 774 out of 826, 93.7%). Table 1 provides a description of each of the 29 studies; 12 reported a statically significant association between maternal cortisol and antepartum depression (Bjelanovic et al. 2015; Diego et al. 2009; Evans et al. 2008; Field et al. 2009; Hoffman et al. 2016; Lommatzsch et al. 2006; Meliska et al. 2013; Monk et al. 2011; O’Connor et al. 2014; O’Keane et al. 2011; Peer et al. 2013; Voegtline et al. 2013), and 17 studies did not (Braithwaite et al. 2016; Davis et al. 2007; Deligiannidis et al. 2016; Glynn and Sandman 2014; Goedhart et al. 2010; Hellgren et al. 2013; Iliadis et al. 2015; Kaasen et al. 2012; Katz et al. 2012; Luiza et al. 2015; Pedersen et al. 1993; Pluess et al. 2010; Rouse and Goodman 2014; Salacz et al. 2012; Shea et al. 2007; Susman et al. 1999; Wikenius et al. 2016).

Fig. 1
figure 1

Flow diagram of selection process. Superscript letter a denotes reasons for exclusions: does not report on the association between cortisol during pregnancy and depression during pregnancy (n = 774), a review article on the topic (n = 6), reports on the association between cortisol during pregnancy and depression during pregnancy but depression instrument either not mentioned or not compatible with criteria (n = 10), missing important information on time of depression and cortisol collection during pregnancy (n = 4), and met inclusion criteria but not a full article (n = 5)

Table 1 Findings on the association between cortisol and depression during pregnancy from studies meeting all inclusion criteria

Study design characteristics of eligible studies

Among the 29 studies, publication dates ranged from 1993 to 2016 and study populations were based exclusively in European or North American countries. The majority of studies sampled study participants from clinical settings (N = 18). Total sample sizes ranged from 29 (Meliska et al. 2013) to 2810 (Goedhart et al. 2010), and reported mean ages ranged from 17 to 37 years. The most common depression screeners reported were the Edinburgh Postnatal Depression Scale followed by the Center for Epidemiologic Studies Depression Scale. The most common matrix reported for the determination of maternal cortisol levels was saliva (N = 14), followed by blood (plasma (N = 4) and serum (N = 3)), urine (N = 3), and hair (N = 2). Two studies used both urine and plasma (Luiza et al. 2015; Pedersen et al. 1993), and one study used both saliva and serum (Kaasen et al. 2012). Among the studies using saliva, blood, and urine, morning collections were most common (8:00 a.m.–11:30 a.m.). Some studies reported multiple collections of maternal cortisol throughout the pregnancy period (N = 18 studies). However, such studies were predominately restricted to the second and third trimesters, and two studies reported no evidence of group by time interactions (Deligiannidis et al. 2016; Hellgren et al. 2013). No clear differences in association were observed across choice of laboratory assay or depression screener.

Study findings according to gestational age of cortisol assessment

Five studies assessed cortisol prior to or at 16-week gestational age (approximately first trimester) (Hoffman et al. 2016; Glynn and Sandman 2014; Goedhart et al. 2010; Luiza et al. 2015; Pluess et al. 2010). Among these studies, one reported a statistically significant association between first-trimester hair cortisol concentrations and antepartum depression scores in week 40 of pregnancy (r = 0.27, p = 0.02) (Hoffman et al. 2016). Eight studies assessed cortisol between 16 to 27-week gestational age (approximately second trimester) (Braithwaite et al. 2016; Davis et al. 2007; Diego et al. 2009; Field et al. 2009; Glynn and Sandman 2014; O’Connor et al. 2014; O’Keane et al. 2011; Hoffman et al. 2016). Among these studies, four reported a statistically significant association. Diego et al. observed elevated mid-morning urinary cortisol concentrations in the second trimester comparing women with and without depression and found that CES-D scores correlated with urinary cortisol concentrations (r = 0.37, p < 0.01) (Diego et al. 2009). O’Connor et al. reported lower initial wakening salivary cortisol in the second trimester among those with depression (r = − 0.22, p < 0.05) but found no evidence of an association between depression and cortisol awakening response (CAR) (O’Connor et al. 2014). Despite no observed difference in second-trimester morning salivary cortisol levels comparing women with and without antepartum depression (16.6 vs. 17.2 nmol/l, p = 0.78), O’Keane et al. observed 67% higher evening salivary cortisol concentrations comparing the two groups (7.1 vs. 4.8 nmol/l, p < 0.02) (O’Keane et al. 2011). Lastly, Hoffman et al. observed an association between second-trimester hair cortisol concentrations and antepartum depression symptom scores at 16 weeks (r = 0.32, p = 0.002) (Hoffman et al. 2016). Sixteen studies assessed cortisol at or after 28-week gestational age (approximately third trimester) (Bjelanovic et al. 2015; Braithwaite et al. 2016; Davis et al. 2007; Deligiannidis et al. 2016; Evans et al. 2008; Field et al. 2009; Glynn and Sandman 2014; Hellgren et al. 2013; Hoffman et al. 2016; Iliadis et al. 2015; Lommatzsch et al. 2006; Monk et al. 2011; O’Connor et al. 2014; Pedersen et al. 1993; Pluess et al. 2010; Salacz et al. 2012). Among these studies, seven reported a statistically significant association, two of which also reported associations in previous trimesters (O’Connor et al. 2014; Hoffman et al. 2016). One study reported elevated third-trimester salivary cortisol among women with antepartum depression at both 8 a.m. (r = 0.239, p = 0.001) and 5 p.m. (r = 0.206, p = 0.005) (Bjelanovic et al. 2015). In another study, first morning urinary cortisol levels were found to be nearly sevenfold higher among the depressed (71.7 vs. 10.5 units, p value = 0.01) (Field et al. 2009). Nearly twofold increases in 3–7 p.m. serum cortisol levels were observed comparing women with antepartum depression to women without antepartum depression (medians 620 vs. 392 nmol/l; p < 0.01) (Lommatzsch et al. 2006). However, correlations between depression scores and cortisol concentrations did not reach statistical significance (data not shown) (Lommatzsch et al. 2006). Two studies reported elevated third-trimester morning salivary cortisol concentrations among small subsets of women with comorbid depression and anxiety, and no difference comparing “depressed only” to healthy controls (Monk et al. 2011; Evans et al. 2008).

Discussion

Reported associations between maternal cortisol and antepartum depression varied across studies. On balance, we observed that (1) the majority of eligible studies reported no statistically significant association between maternal cortisol and antepartum depression (N = 17 out of 29), (2) saliva and blood were the most common biological matrices for cortisol detection, (3) morning was the most common time of day for cortisol collection, (4) second and third trimesters were the most common time of pregnancy for cortisol collections, and (5) among studies reporting an association (N = 12), second-trimester and third-trimester cortisol assessments more consistently reported an association, and elevated cortisol concentrations were observed in expected recovery periods.

Our review confirms and expands upon previous review articles (Brummelte and Galea 2010; Gelman et al. 2015; Glover and Kammerer 2004; Pariante 2014; Serati et al. 2016; Seth et al. 2016; Workman et al. 2012; Iliadis et al. 2015). For example, a recent review of 47 studies, exceeding our number primarily due to the inclusion of studies assessing depression in the postpartum period as well as during pregnancy (Seth et al. 2016), reported that cortisol awakening responses were blunted in the cases of major maternal depression (including antepartum and postpartum depression). The authors further concluded that hypercortisolemia during pregnancy was associated with transient depressive states, while hypocortisolemia during pregnancy was associated with chronic postpartum depression. Another recent review evaluated a wide range of biomarkers and concluded that hypercortisolemia was associated with depression in the weeks immediately before and after delivery (Serati et al. 2016). Observations of hypercortisolemia among individuals with major depression have been observed in both non-pregnant (Vreeburg et al. 2009) and pregnant populations (Pariante and Lightman 2008; Serati et al. 2016; Seth et al. 2016; Burke et al. 2005). Specifically, individuals (males and females) with major depression have been observed to have higher cortisol concentrations in expected recovery periods where cortisol concentrations are expected to be lower (Burke et al. 2005). Stress-related factors have also been associated with other atypical patterns in cortisol, including blunted and steeper diurnal declines (Stawski et al. 2013; Agbedia et al. 2011). Our review adds to this literature by noting that higher cortisol levels have been observed among pregnant women with antepartum depression using matrices reflecting evening levels and that this may be particularly noticeable in the second and third trimesters when circulating cortisol levels are at their highest.

Study design methodologies varied widely, impacting comparisons across eligible studies. First, few of the eligible studies reported on the association between maternal cortisol and antepartum depression in the first trimester of pregnancy. Therefore, we cannot distinguish whether the association between maternal cortisol and antepartum depression is more common in the second and third trimesters due to physiologic changes later on in pregnancy, or whether it appears more common due to limited data in the first trimester of pregnancy. Second, some studies may have been underpowered to detect an association due to small sample sizes. Third, studies likely differed in their determination of gestational age (example: self-reported last menstrual period vs. ultrasound). Therefore, comparison across studies within our trimester categories should be interpreted with some caution. Fourth, the majority of studies utilized biological matrices that reflected acute cortisol levels in the 1 to 24 hours prior to collection (saliva, blood, and urine). Such matrices allow investigators to observe deviations in expected diurnal patterns, however are limited in their ability to assess differences in long-term cortisol secretion. Two studies utilized hair that reflected long-term cortisol levels in the months prior to collection. A commonly cited limitation of this biologic matrix is the potential for cortisol degradation and leaching (Russell et al. 2012). Despite the aforementioned caveats, an understanding of both acute and long-term maternal cortisol levels is necessary in order to further our understanding of the underlying neurobiological pathways at play in antepartum depression.

Our findings highlight specific gaps in the literature for the association between maternal cortisol and antepartum depression. These gaps include the need for findings that integrate multiple maternal cortisol collections spanning all trimesters of pregnancy. Maternal cortisol collections ranging from the first trimester to delivery would allow for analyses of cortisol trajectories across the pregnancy period. Such analyses would help to determine whether women with antepartum depression present with different long-term cortisol trajectories. An additional gap noted in our review of the literature is the need for data from low-income and middle-income countries, where the burden of antepartum depression is often two to three times higher (Lara et al. 2009; Schatz et al. 2012; Barrios et al. 2015; Shidhaye and Giri 2014). Therefore, additional research across a diverse range of study populations and study participants is warranted.

Conclusion

Antepartum depression is an important research priority, and reviews of the existing literature help shed light on strategic next steps, thereby accelerating the pace at which research findings are translated into clinical practice. These strategic next steps potentially include the evaluation of maternal cortisol secretion profiles across pregnancy using longitudinal study design approaches and an expansion of the research question to diverse study populations and participants.