Introduction

Bipolar disorder is an especially high-risk condition during pregnancy and postpartum. Prospective studies from psychiatric tertiary care settings have found that approximately 60–70 % of women with bipolar disorder experienced mood episodes during pregnancy and/or the postpartum period (Freeman et al. 2002; Viguera et al. 2007). Risks of untreated perinatal depressive and manic episodes include preterm labor, decreased birth weight, elevated levels of fetal stress hormones, and changes in neurobehavioral function of the infant (Lee and Lin 2010; Davalos et al. 2012; Grigoriadis et al. 2013). Bipolar disorder can also adversely affect mother-infant dyadic interactions (Vance et al. 2008).

Despite its high prevalence and the substantial risks of untreated symptoms, perinatal bipolar disorder is under-diagnosed and under-treated (Freeman et al. 2002). As the risks of untreated perinatal depression have become increasingly understood, large-scale perinatal depression screening initiatives have become widespread throughout the USA. While such initiatives are likely to substantially improve the detection of perinatal depression, they may inadvertently lead to misclassification if there is no simultaneous screening for bipolar disorder. Misdiagnosis of depressive episodes as unipolar rather than bipolar can be especially problematic, as treatment with antidepressants without a concomitant mood stabilizer increases the risk of hypomania/mania and rapid cycling and psychotic episodes in patients with bipolar disorder and may increase the risk of recurrence (Ghaemi et al. 2003; Ghaemi et al. 2004). It is not yet known how many women who “screen positive” for antenatal depression have bipolar rather than unipolar depression. In a primary care clinic population, more than 20 % of patients receiving antidepressants for depression screened positive for bipolar disorder (Hirschfeld et al. 2005). In a study of women diagnosed with postpartum depression, more than half met the criteria for bipolar disorder based on structured interviews (Sharma et al. 2008). In addition, it has been found that a disproportionate number of women with bipolar disorder who experience a recurrence during pregnancy will experience either depressive or dysphoric mixed symptoms early in pregnancy (Viguera et al. 2007). This indicates not only that the likelihood of misdiagnosis is high, but also that in the absence of bipolar disorder screening early in pregnancy, offspring of women with bipolar disorder may be exposed to a lengthy period of maternal psychopathology in utero.

In order to determine the degree to which screening for bipolar disorder as part of prenatal care adds value, the first step is to ascertain the prevalence of positive bipolar screens in an obstetric clinic population, with or without screening positive for depression. This could help determine whether screening for perinatal depression and bipolar disorder should be simultaneous or sequential.

For these reasons, this pilot study aims to answer the following three questions:

  1. 1.

    What proportion of a pregnant population screens positive for bipolar disorder?

  2. 2.

    What percent of women who screen positive for depression also screen positive for bipolar disorder (risking misdiagnosis with unipolar depression in the absence of bipolar screening)?

  3. 3.

    What percent of women who screen positive for bipolar disorder screen negative for depression (risking lack of detection if bipolar screens were only used for women screening positive for depression)?

Materials and methods

Study participants

The study consisted of 342 women consecutively presenting for initiation of prenatal care in the Brigham and Women’s Hospital Resident Obstetrics clinic between February 2012 and September 2012. The mean age of the clinic patients during that time frame was 30.95 years (SD 7.21 years), with a racial/ethnic distribution of 35.9 % black, 23.3 % Hispanic, 21.6 % non-Hispanic white, 6.8 % Asian, 7.5 % other, and 4.0 % not recorded.

Screening tools

As part of routine clinical care within that clinic, every patient initiating prenatal care was given the Edinburgh Postnatal Depression Scale (EPDS) and the Mood Disorder Questionnaire (MDQ).

The EPDS is a depression screening tool specifically designed and validated for use during pregnancy and postpartum. Cutoff scores between 10 and 12 have consistently yielded a sensitivity of at least 0.8 and specificity of at least 0.7 in detecting perinatal major depression, which is appropriate for most clinical settings (Miller et al. 2011). In this study, we chose to use the cutoff score of 11 to indicate a positive screen, though we present results for all three cutoff points.

The MDQ is the most widely researched brief self-report screen for bipolar disorder. It has been validated for use in primary care settings (Hirschfeld et al. 2003), and one study validated its use in the postpartum period (Sharma and Xie 2011); it has not yet been validated during pregnancy. Because it assesses lifetime bipolar disorder rather than current symptoms, there is less concern that normal pregnancy changes would confound the results. Based on studies to date, there are two methods of scoring the MDQ. The initially developed scoring method requires endorsement of at least seven of 13 lifetime manic symptoms, several co-occurring symptoms, and moderate or serious associated functional impairment. With the Structured Clinical Interview for the DSM IV as the gold standard, the original validation study reported a sensitivity of 0.73 and a specificity of 0.90 in a psychiatric outpatient population (Hirschfeld et al. 2000). However, a subsequent study in the general population revealed a sensitivity of 0.28 and specificity of 0.97 (Hirschfeld et al. 2003). Dodd and colleagues studied a community sample of women, with similar results (Dodd et al. 2009). These results indicated that the psychometric properties of the MDQ may vary according to the population studied, and that the sensitivity could be unacceptably low in non-psychiatric populations. Subsequent studies found that the sensitivity was improved by excluding the last two supplementary questions (clustering of symptoms and impairment of functioning) in both general (Chung et al. 2009) and perinatal (Sharma and Xie 2011; Frey et al. 2012) populations. For the current study of a non-psychiatric perinatal population, reducing false-negatives was a more central concern than reducing false-positives, because the central intent of screening was to detect risk of perinatal mood episodes early in pregnancy. Therefore, to best meet the purposes of this study, we used the modified MDQ scoring system to enhance sensitivity.

Procedure

As part of routine clinical practice, patients initially presenting to the clinic were given the EPDS and MDQ screens by a front desk staff and returned completed screens to a nursing staff. Each patient seen for prenatal care underwent an initial nursing assessment during which these screening tools were scored and reviewed. Scores were recorded by nurses in the patients’ electronic medical records, within templated nursing assessment notes. For this study, investigators recorded whether there was documentation that the EPDS and/or MDQ were given and completed, and if so, the numeric score for each. This study was approved by the Institutional Review Board of the hospital. To protect patient confidentiality, no data containing personal identifiers were collected.

Data analysis

Positive screens for major depression and bipolar disorder were examined in terms of counts and frequencies. Associations between positive screens for bipolar disorder and positive screens for major depression were represented as odd ratios and evaluated for statistical significance using chi-square tests.

Results

Data from 342 participants were collected from February 2012 through September 2012. Of these participants, 298 (87.1 %) were screened with the EPDS and 277 (81.0 %) were screened with the MDQ. A total of 274 participants (80.1 %) completed both screens.

Figure 1 shows the proportion of study participants who screened positive for depression and bipolar disorder. As shown in Table 1, the odds of having a positive MDQ screen are significantly higher for those with a positive EPDS screen (with cutoff score of 11, odd ratio = 12.38, p < 0.001) than for those with a negative EPDS screen. At the same time, five out of 14 participants (35.7 %) with a positive MDQ screen would be missed by only screening those with an EPDS-positive screen. Since the percentage of missed positive MDQs is influenced by choice of EPDS cutoff, Table 1 shows positive bipolar screens with EPDS cutoff scores of 10, 11, and 12.

Fig 1
figure 1

EPDS and MDQ screening results for pregnant women at initial prenatal care visits using EPDS cutoff of 11 and MDQ cutoff of 7. 1 total participants screened with the EPDS, with or without MDQ screening. 2 total participants screened with both the EPDS and MDQ. 3 total participants screened with the MDQ, with or without EPDS screening

Table 1 The relationship between EPDS and MDQ screening results at varied EPDS cutoff scores

Discussion and conclusion

This study found that 5.1 % of a prenatal, non-psychiatric population screened positive for a high risk of bipolar disorder. This is similar to the 4.4 % lifetime prevalence of bipolar spectrum disorders in the general US population reported in the National Comorbidity Survey Replication (Merikangas et al. 2007). Additionally, Hoertel et al. found the lifetime prevalence of bipolars I and II in the general population to be 3.31 % using data from the 2001–2002 National Epidemiologic Survey of Alcohol and Related Conditions (NSARC), though this increased to 5.84 % if cases with subthreshold hypomania were included (Hoertel et al. 2013). Studies of primary care clinic populations have found rates of bipolar disorder between 0.5 and 4.3 % using structured interviews, though studies using screening tools reported higher rates of 7.6–9.8 % (Cerimele et al. 2014).

Guidelines from the US Preventive Task Force (Pignone et al. 2002) and UK National Screening Committee (Buist et al. 2002) on determining whether screening for a particular disorder should be initiated are based in part on prevalence. The findings from this study suggest that the proportion of pregnant women in an obstetric clinic setting screening positive for bipolar disorder is similar to the prevalence of gestational diabetes (2–6 %) (Buckley et al. 2012) and gestational hypertension (5.0 %) (Haddad and Sibai 1999). Although the latter two are part of routine prenatal screening, screening for bipolar disorder is not. The finding that more than 80 % of women completed the MDQ on their initial prenatal visit, with nursing-recorded scores entered into the medical record, suggests that screening for bipolar disorder is largely acceptable and feasible for both patients and practitioners.

In this study, 21.4 % of patients who endorsed active depressive symptoms on the EPDS also screened positive for bipolar disorder on the MDQ and may have been at risk for being misdiagnosed with unipolar depression. This is similar to Hirschfeld’s findings from a primary care clinic, showing that more than 20 % of patients receiving antidepressants for depression also screened positive for bipolar disorder (Hirschfeld et al. 2005). These findings underscore that depressive symptoms detected during pregnancy may be part of a bipolar diathesis.

Additionally, the results of our study suggest that if a clinic were to use bipolar screening only for women who had already screened positive for depression, a substantial proportion of patients with bipolar disorder would be missed. Depending on the EPDS cutoff score chosen, at least 21.4 % (EPDS cutoff 10) or as many as 57.1 % (EPDS cutoff 12) of MDQ-positive women would not have been detected if only EPDS-positive women had been screened for bipolar disorder in this study.

Our study has important limitations. Because this was a pilot study collecting only de-identified data, we were not able to investigate factors, such as parity or psychotropic medication use, which might differentiate positive responders to the MDQ from positive responders to the EPDS and from the general population. Another limitation is the unresolved controversy about optimal scoring of the MDQ. Because of the significant clinical implications of under-diagnosing bipolar disorder in pregnancy, a high sensitivity is especially important in any screening tool used for this condition. Thus, we used the modified MDQ scoring to maximize detection of bipolar disorder among these high-risk patients. However, this modified scoring has not yet been rigorously validated. In addition, our use of the modified scoring limits the comparison of our findings to those from studies using the initial MDQ scoring system.

In summary, our study provides support for universal screening of bipolar disorder during pregnancy. Detecting bipolar disorder in the beginning of pregnancy allows for the implementation of interventions early enough to prevent exacerbation during the particularly high-risk postpartum period. It also reduces the inadvertent increase in risk that could otherwise come from misdiagnosis of bipolar disorder as unipolar depression. As a result, screening for bipolar disorder in pregnancy may prevent adverse clinical events for both the pregnant patient and her fetus. Further research is needed to validate optimal screening tools and scoring methods for a perinatal population.