Purpose

The effect of late follicular phase progesterone elevation (PE) in controlled ovarian hyperstimulation (COH) cycles has been a major controversy that is of ongoing importance. Although many studies and meta-analysis have been carried out so far, and increased serum progesterone (P) levels have been shown to have a negative impact on pregnancy outcomes, discussions about the precise P cut-off value and the use of different assays are still ongoing. Most authors conducted the discussion through embryo-endometrium asynchrony. Recent publications have verified the significant negative impacts of PE on pregnancy outcomes, especially when fresh embryo transfer is performed [1,2,3]. Moreover, gene expression studies have related this negative effect to premature development of the endometrium [4, 5].

Today, the most commonly used threshold for serum progesterone (P) level is 1.5 ng/ml, and it ranges between 0.8 and 2 ng/ml among published studies [6]. The aforementioned gene expression studies have found significant differences in gene expression profiles between two groups with P levels above and below 1.5 ng/ml on the day of triggering ovulation [4].

There are only a few published reports regarding the effect of P levels on oocyte or embryo quality, and there is still no consensus on this issue. Initial studies were not able to find detrimental effects of increased P on oocyte/embryo quality [3, 7]; however, recent evidence suggests a strong association between rising levels of P and decreasing quality of embryos [8,9,10]. Two published studies focused on the embryo development process at the cleavage and blastocyst stages and found PE to be responsible for decreased embryo development potential [8, 9]. In another study, embryo development in the cleavage and blastocyst stage and pregnancy outcomes as well as fresh and cumulative live birth rates (CLBR) were evaluated, and PE was associated with worse embryo development and decreased LBRs [10].

Due to the paucity of literature, we aimed to determine whether there was a negative relationship between PE and gamete and/or embryo quality throughout the blastocyst culture period. In addition, we tried to answer the following question: where is the effect first occurring?

Methods

This is a retrospective, single-center cohort analysis of fresh/autologous intracytoplasmic sperm injection (ICSI) cycles performed in Bahceci IVF center, Istanbul, between January 2014 and December 2018. Study protocol was approved by the institutional review board. In total, 20,001 oocytes from 1485 treatment cycles were included in the study. Repetitive treatment cycles of the same patient were not included. Based on our experience and reports in the literature, we decided to switch to freeze-all and subsequent frozen-thawed embryo transfer (FET) strategy in all treatment cycles since 2013, due to its superior reproductive outcomes compared to fresh transfer [11, 12]. Inclusion criteria were the use of a GnRH antagonist protocol, available data on the serum P levels on the day of maturation induction, and blastocyst culture up to the 5th or 6th day. To prevent bias, intrinsic prognostic factors, such as advanced maternal age (> 37 years old), BMI ≥ 30 kg/m2 and severe male factor (sperm concentration ≤ 2 million/ml), were considered as exclusion criteria. Furthermore, patients who had two or more failed IVF/ICSI attempts were excluded from the analyses.

COH was initiated on the second day of the menstrual cycle with either recombinant FSH (150–300 IU, Gonal-F; Serono) and/or highly purified hMG (75–150 IU, Merional; IBSA). Dosages were adjusted according to the patient’s characteristics and ovarian response to the stimulation. Pituitary down regulation was achieved by using daily 0.25 mg of cetrorelix (Cetrotide, Merck Serono Pharmaceuticals) injections when the leading follicle was 14 mm in diameter. When at least two or more follicles reached ≥ 18 mm in mean diameter, final oocyte maturation was induced either with the use of 250 µg of human chorionic gonadotropin (hCG; Ovitrelle, Serono) or 0.2 mg triptorelin (Gonapeptyl, Ferring), depending on the physician’s preference. Cycles were monitored by means of serial vaginal sonography and serum estradiol (E2) and P measurements. Final blood analyses were performed on the same day with oocyte maturation induction, and egg retrieval was scheduled 34–36 h later.

Hormonal measurement

Samples were tested by an electrochemiluminescence immunoassay (Cobas® Elecsys Progesterone III, Roche diagnostics GmbH, Germany) with a measured sensitivity and total imprecision of 0.03 µg/l and < 7%, respectively.

Laboratory process, embryo grading

ICSI was performed almost 1 h after denudation, and microinjected oocytes were cultured individually in a special pre-equilibrated culture dish. Fertilization was confirmed 16–18 h after insemination. Cleavage-stage controls of embryos were initiated on day 3 of development. The number of blastomeres, any observed variation in blastomeric symmetry, the percentage of fragmented blastomeres, and the presence of vacuolization, multinucleation and granulation were evaluated. Embryo grading throughout the entire process was performed by two senior embryologists. An embryo with 6–10 even blastomeres and < %20 fragmentation, without vacuolization, granulation and multinucleation, was regarded as a top-quality embryo (TQE) [13]. Second and third assessments were performed on the 5th and 6th days, respectively. Blastocyst assessments were performed according to a morphology-based three-part scoring system as described previously [14, 15]. Thereafter, every blastocyst reached an expansion score of at least 3 and was frozen either on day 5 or day 6. Categorization of blastocysts was as follows: excellent (≥ 3 AA), good (3, 4, 5, or 6 and AB, BA, BB, or BC), poor (3, 4, 5, or 6 and CB, CC, or CA).

Outcome measures

The main outcome measures in the study were the TQE and excellent-quality blastocyst (EQB) formation rate per 2PN [8, 9]. As secondary outcomes, the fertilization rate (FR) and the blastulation rate (total blastocyst/2PN) were also evaluated.

Statistics

Continuous variables in the study did not follow a normal distribution. Therefore, such parameters were reported as median (minimum–maximum) values. Categorical variables are presented as a proportion and equivalent percentage figures. Chi square tests were used to test whether there was a statistically significant difference regarding the proportion of categorical variables between the groups.

Patients were divided into three groups according to their P levels on the day of maturation induction: P levels less than 0.8 ng/ml (n: 806), P levels between 0.8 and 1.49 ng/ml (n: 475) and P levels greater than and equal to 1.5 ng/ml (n: 204). These cut-off levels were selected from previously published studies [1, 6, 9].

The association between serum P levels on the day of maturation induction and the patient’s basic and COH characteristics were calculated using Spearman’s correlation coefficient. In addition, multicollinearity analysis was performed to prevent bias formation and to determine whether there was a correlation between independent variables. Existing correlation was re-scaled by grand mean centering that helps the interpretation of the terms associated with the intercept.

Generalized estimating equation (GEE) models were performed to further investigate which factors affected the cycles in which a TQE on day 3 or an EQB on day 5/6 were generated. Since the dependent variable was dichotomous (in reference to whether or not the patient had TQE or EQB), the GEE models were evaluated by logistic regression using GEE logit estimates. Serum P levels were evaluated as a continuous variable rather than a categorical variable in the models.

All statistical analyses were performed using Statistical Package for Social Sciences (SPSS) version 25.0 (SPSS Incl. USA), and statistical significance was considered if the p value < 0.05.

Results

The baseline characteristics and all of the embryological parameters obtained from 1485 cycles, 20,001 oocytes and 5279 blastocysts according to P groups are detailed in Table 1. The average age in this study was 32 years (ranging between 18 and 37). There was no significant difference between groups with respect to female age, total sperm concentration and BMI. Although the total dose of gonadotrophins was different between groups (p = 0.026), binary group comparisons revealed statistical significance only between groups 1 and 2 (p = 0.008). Peak E2 level, total number of oocytes retrieved and MII oocytes displayed a significant upward trend with PE. FR was not affected by elevated P levels (p = 0.108). Regarding embryo development during the first 3 days, the TQE formation rate did not show any significant difference (p = 0.82). Similarly, the blastulation rate (total number of blastocysts/2PN) was comparable between groups (p = 0.68). PE seemed to have a negative influence on the number of excellent-quality day 5 blastocysts among the groups (p < 0.001), but no difference was seen in any category of day 6 blastocysts (p = 0.869). The EQB formation rate per 2PN showed a significant inverse relationship with PE (p = 0.01).

Table 1 Descriptive characteristics of patients separated by serum P levels (ng/ml) on the day of maturation induction

Table 2 shows the correlation analysis of factors related to PE, and it was found that there was a statistically significant negative association between serum P level on the day of maturation induction and age, meaning that a year increase in age decreased the serum P level by 0.0039. Conversely, statistically significant positive relationships were found between the serum P level and the total dose of gonadotrophins, type of gonadotrophins (r-FSH and hMG), peak E2 level and total number of oocytes retrieved. No statistically significant correlation was found between BMI and serum P levels.

Table 2 Relationship between serum P level on the day of maturation induction and the patient’s basic and COH characteristics

Before proceeding to regression analysis, we first tried to find out whether multicollinearity exist among the independent variables when considering both the TQE and EQB as the dependent outcomes. For all the parameters except total dose of gonadotrophins, variance inflation factor (VIF) values were less than 3 which meant that there was no multicollinearity generated due to age of the patient, type of gonadotrophins, peak E2 level, serum P level, sperm concentration, total number of retrieved and mature oocytes and fertilization rate. Therefore, to overcome bias formation, grand mean centered levels of the total dose of gonadotrophins were re-scaled and used in the further GEE models.

As shown in Tables 3, 4, when considering all data from the same patient using the GEE model with logistic regression and independent covariance structure, p values of the Wald Chi square test indicated that the only parameter affecting whether the patient would have a TQE or an EQB was the FR [OR 2.96; 95% (1.99–4.43) p = 0.000 and OR 2.65; 95% (1.46–4.81) p = 0.001, respectively]. Serum P levels on the day of maturation induction were not related to neither the TQE nor the EQB formation rate [OR 1.07; 95% (0.98–1.16) p = 0.113 and OR 0.93; 95% (0.80–1.07) p = 0.32, respectively].

Table 3 Generalized Estimating Equations (GEE) Model using logit link function on the outcome of Day 3 TQE
Table 4 Generalized Estimating Equations (GEE) Model using logit link function on the outcome of EQB

Discussion

Today, blastocyst culture is the major tool used to evaluate embryo survival. PE is a factor that has been criticized as to whether it may harm embryo development at any stage during the process. In our large retrospective analysis, we observed the embryo characteristics throughout the blastocyst culture period. Even though the number of EQB within day 5 blastocysts seemed negatively affected by PE, we could not find any detrimental effect of PE on the TQE formation rate at day 3 or on the EQB formation rate.

It is well known that premature PE on the day of triggering oocyte maturation is linked to increasing number of antral follicles, peak E2 level and oocyte yield [1, 3, 4, 16]. In accordance with the literature, this phenomenon was also observed in our study. However, the exact mechanism of PE is still unresolved. One translational research study using in vitro tools has shed light on this mechanism and has shown a direct stimulating effect of FSH on 3β-hydroxysteroid dehydrogenase (3β-HSD) enzyme activity in human granulosa cell sections. The same effect has not been shown for 17α-hydroxylase (17α-OH) enzyme activity, which mainly acts through the conversion of progesterone (P) into androgens within the theca layer by means of LH activity. As a consequence, because of the suppressed levels of LH up to the preovulatory follicular stage during IVF stimulation cycles, the enhanced enzymatic activity of 3β-HSD exceeds the conversion capacity of the theca layer; hence, accumulated P may leak into systemic circulation [17].

Until now, only a small number of studies have been published about PE and its effect on embryological parameters, and they have had conflicting results. In the last decade, most IVF practitioners thought there were detrimental effects of PE on endometrial histology rather than embryo quality, but debate is ongoing with respect to a precise cut-off level for PE due to different progesterone assays and study designs. In recent years, three publications have rekindled the discussion again. Huang et al. [8] stratified groups according to five distinct P levels and investigated the effect of PE on day 2 and 3 TQEs. Contrary to our findings, the TQE formation rate was negatively influenced by increased levels of P. The cut-off level of 2 ng/ml was found to be the most significant predictor. Vanni et al. [9] went one step further and followed-up the embryos up to the blastocyst stage. They included only cycles in which at least one blastocyst was formed and tried to determine whether there was any inverse relationship between PE and top-quality blastocyst (TQB) formation rate. Four distinct groups were evaluated, and PE was indeed found to be a significant factor. Binary comparisons of each P group that had levels increased above 1 ng/ml with a group in which the P level was below 1 ng/ml revealed statistical significance in terms of TQB formation rate. According to the ROC curve analysis, a P level > 1.49 ng/ml was the best cut-off value for identifying patients at risk for the absence of TQB. Similar to our findings, the blastulation rate was not different between groups. In our study, we tried to clarify the real effect of PE in all stages of embryo development, even for cycles in which no blastocyst was developed; thus, we thought that the results would be more representative of a general population. In addition, the average age of patients in our study was lower, which might have a positive effect on embryo quality. Finally, Racca et al. [10] investigated the deleterious impact of PE on embryo quality and CLBR in their retrospective analysis. They categorized patients according to three distinct P cut-off levels. The embryo utilization rate, LBR after fresh transfer and CLBR were lower in the high P group. In this study, embryos on day 3 were reported to be of good quality even if they had fragmentation up to 50% and were selected either to be transferred or to be frozen. Although they could not find a significant difference in terms of LBR between FET among P groups after a failed fresh embryo transfer, the number of FET cycles to reach a healthy baby was greater in the high P group, which was the opposite of our findings (unpublished data). In a retrospective analysis, the relationship between elevated P levels (cut-off level 1.5 ng/ml) on the day of oocyte maturation and ploidy status as well as pregnancy outcomes in the subsequent FET cycle was investigated [18]. Elevated P levels were not associated with the percentage of available blastocysts suitable for biopsy or pregnancy outcomes. In this study, the quality of the blastocysts was assessed through their genetic structure not through morphological criteria.

The same topic has been discussed in many studies of donation cycles. In a retrospective analysis, the effects of PE on embryological parameters as well as pregnancy outcomes were analyzed among repeated oocyte donation cycles using the cut-off level as 1.2 ng/ml [19]. The same donor, acting as its own control, had two stimulations in which the peak P level was less than or greater than the cut-off level. No detrimental effect of PE could be found on fertilization and cleavage rates, embryo quality parameters or pregnancy outcomes. Another study that was conducted on oocyte donors and in which the cut-off level was 1.0 ng/ml reported similar pregnancy rates (PR) between groups. They concluded that PE had an adverse effect on the endometrium rather than on embryo quality [20]. Three other studies commenting on embryo quality and PR published the same results by using the cut-off P level as 1.1, 1.2 and 1 ng/ml, respectively [21,22,23]

Despite all the developments in the IVF field, 35–40% of patients still cannot achieve the desired goal, live birth, even after many attempts [24]. Analyzing the poorly understood factors affecting gamete and embryo qualities may help us to illuminate them along with the embryonic development process to improve the IVF outcomes. As far as we know, no previous research has been published focusing on both day 3 and day 5/6 embryo development potentials under the effect of PE. The uncertainty of whether there could be a negative impact of PE on embryo quality should be illuminated; hence, we built our study on a large sample size to make our findings more reliable. The main limitation of the present study is its retrospective design. To potentiate our findings, we analyzed various confounders and adjusted them via advanced methods. Although a homogenous population was constructed, using only the ICSI method and including only antagonist cycles might be potential confounders regarding the usage of these findings in global practice.

Conclusions

In conclusion, our study failed to demonstrate a negative effect of PE on embryo development characteristics, which is compatible with many previous studies. Embryo morphology as well as quality assessment may vary in various clinics. Thus, comparing PGT results or pregnancy outcomes as final end-points in well-designed studies would produce a more reliable result. Furthermore, homogenization of the stages of embryos that are destined for transfer and freezing may provide more accurate information than what has been previously found.