Introduction

Frozen-thawed embryo transfers (FETs) have been increasingly utilized in assisted reproductive technology (ART) since the implementation of more effective cryopreservation methods. As a result, endometrial preparation protocols have received greater attention as clinicians seek to optimize pregnancy outcomes. Artificial cycles (AC), involving exogenous endometrial preparation, are sometimes necessarily adopted over natural cycles for medical reasons, such as in the case of anovulation, or oocyte donation [1]. Furthermore, ACs have practical benefits over natural cycles (NC) and can be used to facilitate timely access to FETs for both clinicians and patients. In ACs, clinicians aim to reproduce the physiological hormonal milieu through sequential administration of estrogen and progesterone (P4). Hormonal replacement protocols have evolved since the initial introduction of FETs; nevertheless, clinician-dependent variations remain, with no clear consensus regarding the optimal method of artificial endometrial preparation to increase the success of FETs, or how to monitor its efficacy [2].

P4 plays a vital role in both conception and the maintenance of pregnancy; as such, it has been a central focus of investigation aimed at optimizing hormonal replacement therapy [3]. Each route of P4 administration is associated with a distinct serum and endometrial tissue response. While serum P4 levels following IM administration are greater than following PV administration, it has been demonstrated that PV administration produces greater endometrial concentrations of P4 than IM [4,5,6]. Ideally, P4 monitoring would be tailored to the administration method; however, given the significant practical difficulties in obtaining endometrial P4 levels, serum P4 concentrations continue to be the mainstay marker for P4 monitoring.

In accordance with physiological P4 fluctuations, the optimal serum P4 range correlated with improved pregnancy outcomes varies throughout the ART cycle stages. In the late follicular phase of ART cycles, elevated serum P4 levels have been associated with significantly poorer pregnancy rates [7,8,9,10]. However, in the mid-luteal stage, higher P4 concentrations are preferable for improved pregnancy outcomes [11,12,13,14]. It is between these two well-studied stages, the day of embryo transfer, that the literature on whether there is an optimal serum P4 range becomes more scant. While poorer pregnancy outcomes have been previously associated with lower P4 levels on the day of FET, all the studies supporting this contain small sample sizes [1, 15,16,17]. Furthermore, contradictory results have also been demonstrated, with P4 levels above a certain threshold on FET day being correlated with significantly lower pregnancy rates [18]. Given the heterogeneity and limited strength of the literature available in this area, it was our aim to investigate whether P4 levels on the day of embryo transfer are in fact predictive of pregnancy outcomes in artificial FET cycles. To our knowledge, this is by far the largest data set examining this topic to date.

Materials and methods

This is a large single-center retrospective cohort analysis. A standardized data set spanning 2015–2018 was analyzed. This comprised of N = 2010 frozen embryo transfers into hormonally prepared endometria. Patient characteristics, including age and BMI, were registered as routine practice in the clinic database.

The hormonal replacement protocol for artificial FETs included estradiol 2 mg BD from day 5 of the cycle. This was adjusted as required to achieve a transvaginal ultrasound measurement of endometrial thickness of ≥ 8 mm. Progesterone supplementation was then commenced PV 200 mg TID. After 5 full days of P4 replacement, embryo transfer occurred. Both E2 and P4 were continued until pregnancy testing, with beta-HCG samples taken 10 days following FET. If positive, both were continued until 12 weeks gestation; if negative or in the event of miscarriage, all medications were ceased. Serum P4 was measured on the day of FET, and if levels were less than 8 ng/mL, then P4 replacement was increased at the discretion of the treating clinician.

Embryos were derived from IVF or intracytoplasmic sperm injection (ICSI) cycles. The vitrification and warming protocols on either D5 or D6 were performed as per the now-standard method first described by Sifer et al. [19]. The quality of each blastocyst was recorded on day of freeze as per the Gardner classification [20]. A high quality (Q+) transfer was considered to occur where a blastocyst had an expansion grade of 4 or higher (expanded blastocyst to hatched blastocyst), an inner cell mass grade of A or B, and a trophectoderm grade of A or B, i.e., 4BB or superior.

Similar to significant previous studies in the area, a day of transfer P4 threshold of 10 ng/mL (31.8 nmol/L) was used to simulate the currently accepted level for physiological corpus luteal function, with FET outcomes compared between patients measuring above and below this threshold [1, 21, 22]. A total of 807 FETs were identified in patients with a P4 level below 10 ng/mL and 1203 FETs in patients with P4 levels at or above 10 ng/mL. Additionally, analysis was further completed looking at day of transfer P4 thresholds of 5 ng/mL and 20 ng/mL to see if these could create further prognostic power for clinicians attempting FET.

In these artificial cycles, serum P4 levels on the day of transfer were assessed in relation to pregnancy outcomes. The main outcomes measured were biochemical pregnancy (beta human chorionic gonadotropin ≥ 5), ultrasound diagnosed clinical pregnancy (here defined as the presence of a fetal heart beat), and live birth rates.

Univariate analyses of patient characteristics and pregnancy outcomes were compared between these cohorts using the Chi-squared test, or the student’s t test for continuous variables. A multivariate logistic regression, controlling for factors including female age at the time of egg collection, day 5 or day 6 blastocyst, and blastocyst quality at freeze, further evaluated the relationship between P4 levels and outcomes.

This study had local human research and ethics committee approval, HREC ID 71/19-MIVF.

Results

A total of 2010 FETs were included in the available data set. The characteristics of patients were compared between the two cohorts of those that recorded P4 levels ≥ 10 on day of embryo transfer, versus those that did not (Table 1). The baseline characteristics of the two cohorts were not significantly different with the single exception of previous clinical pregnancy, i.e., whether a patient previously conceived an ultrasound diagnosed pregnancy (p = 0.035). Cycles using donor oocytes were excluded from this cohort; thus, all transfers were homologous. Furthermore, endometrial thickness was similar in both groups, and all FETs occurred into normal endometrial cavities with no polyps of submucosal fibroids present.

Table 1 Characteristics of cycles according to the serum P4 threshold of 10 ng/mL on day of FET

There was a wide range of observed P4 levels on the day of FET (0.5–73.7 ng/mL), with a mean value of 12.33 ± 6.70 ng/mL (Figure 1).

Fig. 1
figure 1

Frequency distribution of P4 levels on day of FET

We observed no statistically significant difference in pregnancy outcomes between these two cohorts (P4 on day of embryo transfer < 10 ng/mL versus ≥ 10 ng/mL). This was uniformly demonstrated across biochemical pregnancy rates (39.53% vs. 40.98%, p = 0.516), clinical pregnancy rates (20.82 vs. 22.78, p = 0.299), and live birth rates (14.25 vs. 16.21 p = 0.233) (Figure 2).

Fig. 2
figure 2

Pregnancy outcomes by P4 threshold of 10 ng/mL

Further analysis was subsequently completed, assessing pregnancy outcomes following P4 thresholds on day of FET of 20 ng/mL and 5 ng/mL (Figs. 3 and 4, respectively). No statistically significant differences in pregnancy outcomes (in either direction) were observed in those patients whose day of transfer P4 met the threshold of 20 ng/mL. However, while there was no difference for biochemical pregnancy or clinical pregnancy rates, a statistically significant improvement in live birth rates was observed for those with a day of transfer P4 level ≥ 5 ng/mL compared to those with a level < 5 ng/mL (15.86% vs. 7.55% respectively, p = 0.02).

Fig. 3
figure 3

Pregnancy outcomes by P4 threshold of 20 ng/mL

Fig. 4
figure 4

Pregnancy outcomes by P4 threshold of 5 ng/mL

Additionally, a multivariate logistic regression was performed, controlling for variables including female age, D5 vs. D6 blastocyst, the blastocyst expansion grade (as a marker for the full Gardner classification of embryo quality), and previous clinical pregnancy. Age, D5 vs. D6 blastocyst and embryo quality were significantly associated with pregnancy outcomes. However, multivariate analysis failed to demonstrate a significant result for P4 levels above 10 ng/mL on day of FET as an independent prognostic factor for pregnancy outcomes (Table 2). A second multivariate analysis using the P4 threshold of 5 ng/mL demonstrated statistically significant improvements in live birth rates with P4 above this level, but like the primary analysis using this P4 value, no differences were noted in biochemical or clinical pregnancy rates (Table 3).

Table 2 Multivariate logistic regression analysis of factors related to pregnancy outcomes, P4 threshold 10 ng/mL
Table 3 Multivariate logistic regression analysis of factors related to pregnancy outcomes, P4 threshold 5 ng/mL

Discussion

This study shows that P4 levels above 10 ng/mL on day of FET are not a significant factor in predicting pregnancy outcomes. A P4 threshold of 10 ng/mL was identified based on previous studies in this area that determined such a P4 level to be an indicator of adequate corpus luteal function during the luteal phase [21, 22]. Univariate analyses comparing FETs with P4 levels that reached this threshold with those that did not demonstrated no significant differences in biochemical pregnancy, clinical pregnancy, and live birth rates between the two cohorts. Subsequent multivariate analyses, controlling for potential confounders such as female age and embryo quality, further supported this negative finding. This result contradicts what little literature is currently available.

A further analysis looking at P4 threshold of 5 ng/mL on day of FET did show that while cycles below this threshold produced equivalent biochemical and clinical pregnancy rates, there was a higher live birth rate observed in cycles starting out with serum P4 levels > 5 ng/mL.

Prior evidence in this area has demonstrated a positive correlation between P4 levels on the day of embryo transfer and pregnancy outcomes; however, in reviewing these findings, some inherent limitations demonstrated the need for further investigation. Similar to our study design, the majority of reports have been retrospective in nature; however, unlike the current study, prior literature has been additionally limited by small sample sizing [1, 15,16,17,18]. In line with the value chosen here, a number of papers used a P4 threshold of 10 ng/mL [1, 17] with one prospective study using a similar threshold of 9.2 ng/mL [15]. These studies illustrated a beneficial effect for FET outcomes where P4 reaches these thresholds. There are two papers that utilized 20 ng/mL as a threshold for analysis; while one found that results commensurate with the abovementioned studies [16], the other demonstrated reduced ongoing pregnancy and live birth rates associated with P4 levels above this threshold [18]. Another limitation of available studies in this area is the lack of uniformity in P4 replacement protocols as well as a limited understanding of the extent of correlation between serum and endometrial P4 concentrations.

In artificial cycles, P4 replacement is needed to simulate physiological hormonal function to augment endometrial receptivity; however, there is still no consensus on the optimal route of P4 administration and whether serum P4 is a reliable marker [2]. P4 improves endometrial receptivity through the regulation of cytokines, resulting in both improved trophoblast function at the site of implantation and the suppression of a maternal immunologic response against the trophoblast cells [23]. Furthermore, under the influence of P4, stromal cells produce several growth factors that help maintain the pregnancy until the placenta takes over P4 production at around 10 weeks gestation [24]. Serum and endometrial P4 concentrations correlate well in the setting of endogenous P4 or exogenous IM administration [25, 26]. However, IM injection of P4 results in significantly higher mean serum P4 levels than PV but possibly lower endometrial concentrations [4,5,6]. This is postulated to be the result of the uterine first-pass effect [27]. Additionally, peak serum P4 levels following PV administration demonstrated high inter-individual variability likely secondary to variations in vaginal absorption, vagina mucosal surface area, and differences in the vaginal microbiome [1, 28, 29]. Thus it becomes difficult to equate endometrial P4 action with serum P4 levels, particularly where PV luteal phase support is used. This is one potential explanation for the current study’s findings.

Another possible theory to explain these findings could be that our cohort received adequate P4 replacement therapy to mitigate the negative effect of lower P4 levels during the luteal phase. Similar to other studies, it was standard practice in our center to increase P4 replacement if day of transfer levels were deemed poor (< 8 ng/mL). As such, even though P4 may have been inadequate on day of transfer, this was potentially remedied by additional P4 administration, reinforcing that intervention might still be possible beyond the day of transfer. With respect to the positive association between live birth rates and P4 levels ≥ 5 ng/mL, it may be that additional P4 administration cannot mitigate the effects of an initially very low circulating level of P4 that falls below this threshold. An admitted limitation of this study is that additional replacement was not standardized and the efficacy of this was not monitored with further serum P4 levels. While a single P4 measure on day of transfer in isolation may not be helpful for predicting pregnancy outcomes, is it possible that serial measurements of P4 may be useful? This will be an important avenue for future investigation.

In conclusion, we demonstrated that serum P4 levels at or above 10 ng/mL on the day of FET are not associated with statistically significant improvements in biochemical pregnancy, clinical pregnancy, or live birth rates. It could be that increasing P4 replacement following low P4 levels on transfer day mitigated the negative effect of low initial P4 concentrations. However, P4 below 5 ng/mg was associated with worse live birth rates, suggesting that there is a threshold below which increasing P4 replacement after embryo transfer cannot be expected to prevent pregnancy loss. Additional research is clearly needed in this area to further characterize ideal P4 replacement as well as the role of P4 monitoring in artificial cycles.