Introduction

Clinical and laboratory advances have ushered in the widespread standard practice of single frozen embryo transfer and have lent increased importance to cycle optimization prior to embryo transfer, a crucial component of which is endometrial preparation. There are multiple protocols that may be utilized prior to embryo transfer—which include artificial (programmed, controlled, medicated), stimulated, and natural—and none has been definitively proven to be the superior preparation [1,2,3]. Recent evidence, however, suggests an increased risk of hypertensive disorders of pregnancy with artificial FET cycles, which has largely been attributed to the absence of corpus luteum formation [4,5,6,7]. This may well lead to a shift towards a higher utilization of natural-based FET cycles that use the body’s intrinsic physiology when possible. There has also been growing evidence in support of letrozole use during endometrial preparation [8,9,10,11,12]. As an aromatase inhibitor, letrozole reduces androgen conversion into estrogen in the ovarian granulosa cells. This initial decrease in estradiol (E2) upregulates estrogen receptors and increases endometrial sensitivity to rising estrogen levels, thereby enhancing endometrial proliferation prior to FET [8, 13,14,15,16,17]. There is also evidence to demonstrate letrozole’s role in improved endometrial receptivity in patients with endometriosis; increased endometrial integrin expression in response to letrozole has been shown to result in improved implantation and pregnancy rates [16, 18].

Despite this growing use, our understanding of the impact of letrozole on pregnancy outcomes in FET cycles is still in the early stages. In their large 2017 study from Japan, Tatsumi et al. found letrozole use in single blastocyst FET to be associated with increased rates of clinical pregnancy and live birth as well as a decreased rate of miscarriage [11]. More recently, Li et al.’s 2021 study from China and Takeshima et al.’s 2022 study from Japan found no significant differences in pregnancy outcomes with letrozole use [19, 20]. However, both studies included only transfer of cleavage stage embryos, and Takeshima et al. studied letrozole use for ovarian stimulation prior to natural IVF with only fresh embryo transfers, thereby limiting the generalizability to the USA as well as other countries where IVF has largely moved towards frozen embryo transfer of blastocyst stage embryos. Thus, our original study aims to evaluate whether there are differences in pregnancy outcomes between letrozole-stimulated and natural cycle frozen embryo transfer of euploid blastocysts.

Materials and methods

Patients

Our retrospective cohort study included all completed letrozole-stimulated FET (LTZ-FET) and natural FET (NC-FET) cycles with a single, autologous, and euploid blastocyst transferred at Stanford Fertility and Reproductive Health Center from June 2016 to June 2020. The LTZ-FET cohort included 134 individuals who underwent a total 161 cycles; the NC-FET cohort included 420 individuals who underwent 575 cycles. Patients with uterine anomalies and cycles that utilized donor oocytes, that were canceled, or that resulted in multiple gestations were not included. Information regarding demographic and clinical characteristics, fertility treatments, and pregnancy/neonatal outcomes were collected from medical records. Demographics and clinical characteristics included maternal age at FET, gravidity and parity, BMI, smoking status, race/ethnicity, and infertility diagnosis; cycle characteristics included serum estradiol, luteinizing hormone (LH), progesterone, and endometrial thickness at the time of ovulation trigger. The Stanford University Institutional Review Board approved the study protocol.

Natural cycle and letrozole FET treatment

The decision regarding which protocol to undergo (natural cycle or letrozole) was mainly at the discretion of the provider, utilizing shared decision-making with the patients. The standard protocol for LTZ-FET entailed the daily administration of 5 mg of letrozole for 5 days starting on cycle day 2 or 3 [8, 12]. Otherwise, patients in both cohorts underwent regular ultrasound monitoring until the dominant follicle was ≥ 18 mm, or a positive LH surge was noted (defined as LH ≥ 20 mIU/mL), at which point serum E2, progesterone, and LH levels were collected. Recombinant hCG (250 mcg Ovidrel, EMD Serono) was then used to trigger or supplement ovulation, and patients proceeded with FET only if endometrial thickness was ≥ 7 mm or if the current cycle was a personal best among a history of endometrial thicknesses below threshold (n = 4 for LTZ-FET and n = 7 for NC-FET). Two days after ovulation, patients started twice daily vaginal micronized progesterone (100 mg; Endometrin). Six days after natural LH surge or 7 days after ovulation trigger [21], FET was performed. Nine days after the transfer, serum βhCG was obtained, followed by a series of transvaginal ultrasounds 6–8 weeks after FET to assess for clinical pregnancy. Serum E2, LH, progesterone, and βhCG levels were assayed with the Roche Cobas E411 analyzer (Roche Diagnostics). The majority of patients received low-dose aspirin (81 mg) starting on cycle day 2 until 36 weeks of gestation unless they had a medical contraindication.

All embryos transferred were blastocysts derived from autologous oocytes. Blastocysts were graded from AA to DD based on the inner cell mass and trophectoderm morphology. Our clinic practice is to biopsy embryos with grade CC or higher for preimplantation genetic testing (PGT). Biopsy was performed by pipette removal of 5–8 trophectoderm cells from day 5 to day 6 fully expanded blastocysts. The survival rate of blastocyst thawing within our laboratory is 95–97%.

Study outcomes

The primary outcomes studied were rates of clinical pregnancy (presence of fetal cardiac activity), clinical miscarriage (pregnancy loss prior to 20 weeks of gestation), and live birth rate (live infant born after 24 weeks of gestation) per transfer. Additional outcomes examined were biochemical miscarriage (rise and fall in βhCG without evidence of a clinical pregnancy), ectopic pregnancy, intrauterine fetal demise/pregnancy loss after 20 weeks of gestation, caesarean delivery, gestational age at delivery, birth weight, low birth weight (< 2500 g, as defined by the World Health Organization), very low birth weight (< 1500 g), and sex assigned at birth.

Statistical analysis

Study data were managed in Stanford’s REDCap electronic data tool [22]. Patient and cycle characteristics for the LTZ-FET and NC-FET were compared using absolute standardized differences (ASD), which measure the difference in means or proportions between two groups in units of standard deviations [23]. ASD values of 0.2, 0.5, and 0.8 correspond to small, moderate, and large differences, respectively. Thus, the larger the ASD value, the larger the difference. Multivariable logistic and linear regression models determined differences in pregnancy and neonatal outcomes between the two groups while adjusting for maternal age at FET, BMI, nulliparity, embryo grade (categorized into AA, AB/BA, BB, and any C), endometrial thickness on day of trigger, race/ethnicity, ovulatory dysfunction diagnosis, polycystic ovary syndrome (PCOS) diagnosis, and recurrent pregnancy loss (RPL) diagnosis. We utilized generalized estimating equations (GEE) to account for correlation between cycles per patient. We calculated adjusted odds ratios (aOR) and 95% confidence intervals (CIs) to evaluate the association between FET type (LTZ or NC) and our outcomes of clinical pregnancies, clinical miscarriages, and live births; the LTZ-FET cohort was the reference group for all odds ratios. We calculated adjusted mean differences and 95% CI for gestational age at delivery and neonatal birth weight. Additional pregnancy and neonatal outcomes were compared between the two groups using ASD only.

Analyses were performed using the R statistical software version 3.6.2, and GEE analyses were performed using library geepack [24,25,26,27]. All statistical tests were two-sided and performed at the 0.05 significance level.

Results

Participant and cycle characteristics

From June 2016 to June 2020, a total of 134 individuals underwent 161 cycles of LTZ-FET, and a total of 420 individuals underwent 575 cycles of NC-FET. The demographic characteristics were overall similar between the two cohorts, including maternal age, BMI, smoking status, and race/ethnicity. A greater proportion of the LTZ-FET cohort was multiparous, transferred higher-grade embryos, and had an infertility diagnosis of PCOS or other ovulatory dysfunction, though these differences were not substantial with all the ASDs < 0.5 (Table 1). The cycle characteristics including peak estradiol, progesterone, and endometrial thickness at trigger were also overall similar, other than a small difference in median LH levels at trigger (Table 2).

Table 1LE: Tables 1 to 2 contain entries in boldface but without significance. If deemed to have significance, please provide their significance in the form of a table footnote; otherwise, please set them upright. Patient demographics and baseline clinical characteristics. Data are presented as mean ± SD, n (%), or median (interquartile range (IQR)) if data skewed
Table 2 Comparison of cycle characteristics, pregnancy outcomes, and neonatal outcomes between the NC-FET and LTZ-FET. Data are presented as n (%) or median (IQR)

Pregnancy outcomes

The study’s overall clinical pregnancy rate was 63.2%, miscarriage rate was 5.8%, and live birth rate was 59.1%. The LTZ-FET cohort’s clinical pregnancy rate was 67.1%, which was similar to NC-FET cohort’s 62.1% (ASD = 0.11). The two cohorts also had similar clinical miscarriage rates (8.3% for LTZ vs. 5.0% for NC; ASD = 0.13) and live birth rates (60.9% for LTZ vs. 58.6% for NC; ASD = 0.05). Furthermore, the cohorts had similar rates of biochemical miscarriage, ectopic pregnancy, intrauterine fetal demise, and caesarean delivery (Table 2). After adjusting for multiple confounders, the difference in clinical pregnancy between the two cohorts was not significant (aOR 1.31, 95% CI 0.86, 1.99; P = 0.21). Multivariable regression yielded similar results for clinical miscarriage (aOR 1.56, 95% CI 0.61, 3.99; P = 0.36) and live birth (aOR 1.17, 95% CI 0.77, 1.8; P = 0.46) (Table 3).

Table 3 Adjusted odds ratios (aORs) for pregnancy and neonatal outcomes (NC-FET reference)

Neonatal outcomes

The LTZ-FET neonates had a mean gestational age of 38 weeks and 1 day at delivery (267 days; range 224–284 days), which was earlier than the NC-FET neonates’ mean gestational age of 39 weeks and 0 day (273 days; range 178–293 days) (ASD = 0.57). There was also a greater proportion of prematurity within the LTZ-FET cohort with 18.6% (n = 18) of neonates born before 37 weeks of gestation, compared to 8% (n = 27) of NC-FET neonates (ASD = 0.32) (Table 2). Notably, both cohorts of preterm neonates had similar rates of mothers with prior history of preterm birth (1.0% LTZ-FET vs 1.1% NC-FET). Among the premature neonates, the mean gestational age was 35 weeks and 3 days for LTZ-FET and 35 weeks and 5 days for NC-FET (ASD = 0.17). There were similar rates of medically indicated preterm delivery (6.1% LTZ-FET vs 4.5% NC-FET, ASD = 0.08). The medical indications for preterm delivery for LTZ-FET cohort were preeclampsia with severe features (n = 4), placenta previa (n = 1), and non-reassuring fetal status (n = 1). The indications for the NC-FET cohort were placenta previa (n = 6), preeclampsia with severe features (n = 4, one with concurrent intrahepatic cholestasis of pregnancy), history of abdominal myomectomy (n = 2), other maternal co-morbidity (n = 2), and non-reassuring fetal status (n = 1).

After accounting for multiple potential confounders, there was a statistically significant adjusted mean difference in gestational age between the two cohorts with LTZ-FET neonates born on average 5.7 days earlier than NC-FET neonates (95% CI − 8.3, − 3.0; P < 0.001) (Table 3). Furthermore, the LTZ-FET neonates overall weighed less and had more than double the rate of low birth weight (< 2500 g): 10.4% (n = 10) for LTZ-FET compared to 5.0% (n = 17) for NC-FET (ASD = 0.20; Table 2). Rates of very low birth weight neonates (< 1500 g) were similar and rare, for both cohorts. The difference in birth weight was not statistically significant after multivariable adjustment (Table 3).

Discussion

In light of the rising use of letrozole for endometrial preparation, more information is needed on the association of pregnancy and neonatal outcomes with use of letrozole. This study is the first in the USA to examine the pregnancy and neonatal outcomes of LTZ-FET cycles compared to NC-FET cycles. After controlling for multiple possible confounders, we found that LTZ-FET cycles had similar clinical pregnancy, miscarriage, and live birth rates as NC-FET cycles. While there was a higher prevalence of prematurity and low birth weight among LTZ-FET neonates, after adjusting for confounders, there was no difference in birth weight between cohorts. Furthermore, while LTZ-FET neonates arrived on average 6 days earlier, it is unclear if this difference is clinically meaningful since the overall mean and median gestational age in both cohorts were full term.

Thus far, only few studies have examined the use of letrozole for endometrial preparation, and only a small proportion of these use NC-FET cycles as the comparison; even fewer capture neonatal data. An updated 2020 Cochrane review comparing various endometrial preparations for FET noted “low-quality” evidence that suggests LTZ-FET cycles have improved clinical pregnancy rate compared to artificial cycles and “uncertain” evidence regarding miscarriage or live birth rates [1]. Zeng et al.’s 2021 metanalysis comparing stimulated to artificial FET cycles in PCOS patients found similar rates of implantation, clinical pregnancy, and live birth, but their subgroup analysis of LTZ-FET cycles noted lower miscarriage rates [28]. Another 2021 meta-analysis by Wu et al. that was not limited to one infertility diagnosis found LTZ-FET to have a significantly higher live birth rate than artificial cycles [3].

In examining outcomes compared to NC-FET, Tatsumi et al.’s large 2017 study from Japan demonstrated letrozole use in single blastocyst FET to be associated with increased rates of clinical pregnancy and live birth as well as a decreased rate of miscarriage [11]. Li et al.’s 2021 study from China found no significant differences in pregnancy rates [19], and a more recent 2022 study from Japan by Takeshima et al. found no significant differences in pregnancy or neonatal outcomes with letrozole use [20]. Both latter studies included only transfer of cleavage-stage embryos, and Takeshima et al.’s study solely examined letrozole for ovarian stimulation prior to natural cycle fresh embryo transfers.

The applicability of prior letrozole studies to current clinical settings in the USA and other countries with similar clinical practice is considerably limited by the fact that they were (1) all international with limited diversity of the population, (2) mostly included (some exclusively) cleavage-stage embryos, and (3) did not account for the use of preimplantation genetic testing (PGT) and thus embryo ploidy. Nonetheless, studies overall seem to favor the use of letrozole for endometrial preparation, suggesting that it may prime the endometrium for implantation and create a more physiologic hormonal profile to that of normal spontaneous ovulation [8,9,10,11,12]. This is thought to be achieved by letrozole’s initial reduction of serum estradiol during its use in the early follicular phase, leading to the upregulation of endometrial estrogen receptors, increased endometrial sensitivity to rising estrogen levels, increased integrin expression, and prevention of premature progesterone action; all of which could potentially enhance endometrial proliferation and receptivity prior to FET [13,14,15,16, 29]. However, given that our knowledge regarding letrozole’s effect on the endometrium is still limited, the initial reduction in estrogen biosynthesis and the postulated endometrial changes could negatively impact outcomes as well. It is our clinical experience that patients have quite variable responses to letrozole. This results in serum E2 levels that fall well outside what is thought to be physiologic in the follicular phase [30, 31], and there is some evidence that lower estradiol levels during LTZ-FET may be associated with both a higher miscarriage rate and a lower live birth rate [32].

The neonatal impact of letrozole stimulation remains largely unknown. The pharmaceutical maker of letrozole (Novartis Pharmaceuticals) warns in their report for healthcare providers the potential for letrozole to cause embryo and fetus toxicity based on animal studies findings that were corroborated by Tiboni et al.’s 2008 study [33]. Importantly, however, these studies were examining direct fetal exposure to letrozole during pregnancy. When used for endometrial preparation, letrozole’s short half-life of around 2 days implies that it should be largely cleared from the body by the time of FET [34, 35]. Thus, it theoretically should not affect the embryo and subsequent fetus [36]. A small single center study presented at the 2005 American Society for Reproductive Medicine conference suggested a higher rate of fetal malformation with the use of letrozole for ovulation induction [37], but the use of normal, spontaneously conceived deliveries as controls was non-ideal, and the study was never published. Subsequent larger, well-designed studies from multiple countries did not find increased rates of congenital anomalies with letrozole use [11, 38, 39]. In fact, data from studies such as Tatsumi et al.’s 2017 study on letrozole use in ART (n = 694 neonates) and Takeshima’s 2022 study (n = 510 neonates) reassuringly found no increased risk of major congenital anomalies following letrozole use prior to conception. Additionally, a recent 2021 meta-analysis found no evidence of increased risk of congenital fetal anomaly with letrozole use [40].

Only a few studies have looked at neonatal outcomes of letrozole for endometrial preparation, and they had similar findings to that of ours. Tatsumi et al. found slightly higher rates of prematurity and low birth weight among the letrozole cohort, while Li et al. found lower birth weights after matching for confounders, but this was not statistically significant [11, 19]. Of note, the latter study’s criteria for live birth were a gestational age of > 28 weeks, which may impact their neonatal data as extremely preterm births were not captured. Overall, these findings along with ours suggest that though there is no direct embryo or fetal exposure to letrozole, and its impact on the peri-implantation hormonal milieu and endometrium may influence the resulting conception. In fact, prior baboon studies found alterations in some indices of placental function and structure with letrozole treatment [41,42,43]. Given the lifelong consequences of prematurity, our findings highlight the need for larger and more longitudinal studies to adequately examine whether letrozole stimulation during the follicular phase impacts neonates and if there is any clinical significance to these findings.

Our cohort study has several unique characteristics. It is the first study in the USA to compare the pregnancy and neonatal outcomes of letrozole-stimulated to natural FET cycles. By including all completed cycles in our study’s 4-year period, this study minimized selection bias; by gathering all fertility, pregnancy, and neonatal data from medical records, this study minimized recall bias; and by accounting for multiple potential confounders, this study achieved a robust comparison. Lastly and importantly, all the embryos in our study were euploid blastocysts, which more accurately reflects current practice models in many countries as well as minimizes potential outcome confounders. With PGT utilization rapidly on the rise and now included in almost half of all cycles in the USA [44], it is increasingly important to account for PGT use when investigating pregnancy and neonatal outcomes [45,46,47,48].

The main limitations of our study were the sample size, the observational (non-randomized) nature, and the fact that our participant pool was from a single academic center. The lack of a unified medical record system in the USA continues to pose a significant challenge to accessing detailed pregnancy and neonatal medical records for a larger cohort of women who have undergone LTZ-FET and NC-FET cycles [49]. Furthermore, a portion of our patients delivered outside of the Stanford Health Care system, which limited our ability to ascertain the specific details of every pregnancy, including complications or detailed neonatal outcomes. The majority of our center’s patients were of Asian or Caucasian race, which could potentially limit the generalizability of our findings to other populations. Thus, further studies with broader populations are warranted.

It is important to note that, similar to all prior letrozole studies, the selection of which protocol to undergo for endometrial preparation was mainly based upon clinician judgment with no clear algorithm to guide selection. Though this study did account for multiple potential confounders in its analysis, there still remains the potential for systematic biases such as selection bias when such choices are at the discretion of the physician as they incorporate patient preferences and history, among other numerous factors. There are understandably limited means of accounting for this outside of a randomized control trial.

In conclusion, our study found similar rates of clinical pregnancy, miscarriage, and live birth between LTZ-FET and NC-FET cycles, which adds to a growing body of evidence suggesting letrozole as a prudent option for endometrial preparation. However, there was a higher prevalence of prematurity and low birth weight among the LTZ-FET neonates, who were on average born 6 days earlier. We hope that this study proves valuable in guiding patient-counseling, helps clinical decision-making prior to FET, and highlights the importance of further investigation into the neonatal consequences of letrozole use.