Introduction

Advances in laboratory and clinical techniques have allowed single embryo transfer to become more common practice, lending more importance to the process of embryo selection. One parameter that is often used to inform such decision-making is embryo morphology [1,2,3,4,5,6]. Blastocyst grading is based on the morphology of the inner cell mass (ICM) and trophectoderm (TE), as well as the expansion of the blastocyst cavity. A commonly utilized method follows the grading system described by Gardner and Schoolcraft in 1999, with grades ranging from “A” for best quality to “D” for worst [1, 7]. Multiple studies have since demonstrated blastocyst morphology to be a good predictor of live birth rates following both fresh and frozen embryo transfers [1,2,3,4,5,6]. Alfarawati et al. further investigated the potential of morphology as a means to predict ploidy status but found a weak association between morphology and aneuploidy; almost half of the top-grade blastocysts in their study were aneuploid, while over a third of the low-grade blastocysts were euploid [8].

With the growing use of preimplantation genetic testing (PGT) to assess ploidy, there has been debate about whether embryo morphology plays a significant role in predicting pregnancy outcomes. Capalbo et al. examined 215 FET cycles performed with euploid blastocysts and reported that embryo morphology was not predictive of implantation potential [9]. A more recent 2019 study by Viñals Gonzalez et al. that examined 179 FET cycles of euploid blastocysts similarly found that embryo morphology did not significantly affect implantation, clinical miscarriage, or live birth rates [10]. Irani et al.’s study of 417 FET of euploid blastocysts, however, suggested that blastocyst morphology could prove valuable as an adjunct to PGT for embryo selection [11]. Importantly, none of these (nor any prior) studies examined the impact morphology had on neonatal outcomes. Furthermore, the majority of prior studies did not account for the FET protocol utilized, which could confound outcomes [11,12,13,14,15,16,17,18,19]. Thus, our original study aims to evaluate whether embryo morphologic grading affects pregnancy and neonatal outcomes when euploid blastocysts are used in NC-FET.

Materials and methods

Patients

Our retrospective cohort study included all NC-FET cycles performed at Stanford Fertility and Reproductive Health Center in which a single autologous, euploid blastocyst was transferred between June 2016 and June 2020. Prior to embryo transfer, patients were confirmed to have normal uterine cavity via hysteroscopy, hysterosalpingogram, or saline infusion sonogram. All demographic, fertility, pregnancy, and neonatal information were collected from medical records. Demographics and clinical characteristics included maternal age at retrieval and at FET, number of prior embryo transfers, gravidity and parity, BMI, smoking status, race/ethnicity, and infertility diagnosis. Cycle characteristics included serum estradiol, luteinizing hormone (LH), progesterone, and endometrial thickness at the time of trigger. The Stanford University Institutional Review Board approved the study protocol.

NC-FET protocol

Our standard protocol for NC-FET starts with a baseline ultrasound on cycle days 2–5, followed by ultrasound monitoring 3–4 days prior to expected ovulation. After the dominant follicle is > 15 mm in size, patients had their serum estradiol, LH, and progesterone checked and subsequently underwent daily ultrasound monitoring until either the dominant follicle was ≥ 18 mm or a positive LH surge was noted (defined as LH ≥ 20 mIU/mL). Ovulation was then triggered or reinforced with recombinant hCG (250 mcg Ovidrel, EMD Serono). If a LH surge was detected, FET was performed 6 days later, and if not, FET was performed 7 days later [20]. In general, we would only proceed with FET if endometrial thickness was ≥ 7 mm, but if the patient had a history of endometrial thicknesses below this threshold, exceptions were made if the current cycle was a personal best; of note, our study included only 7 such cases, all of which transferred blastocysts graded BB or better. Patients were instructed to start progesterone supplementation (Crinone 8% gel daily vaginally or Endometrin 100 mg twice a day vaginally) 3 days after spontaneous LH surge or 4 days after hCG trigger. Serum βhCG was obtained 9 days after FET, and clinical pregnancy was confirmed on transvaginal ultrasound at 6–7 weeks gestational age for the presence of a viable intrauterine pregnancy. Serum estradiol, LH, progesterone, and βhCG levels were assayed with the Roche Cobas E411 analyzer (Roche Diagnostics).

All embryos transferred were blastocysts (typically frozen on day 5 or 6) derived from autologous oocytes and confirmed to be euploid by PGT. Prior to freezing, blastocysts were graded from AA to DD based on the inner cell mass (the first grade) and trophectoderm morphology (the second grade); the survival rate of blastocyst thawing within our laboratory is 95–97%. Our clinical practice is to biopsy embryos with grade CC or higher for PGT.

Study outcomes

The primary outcomes studied were clinical pregnancy (presence of fetal cardiac activity) and live birth (live infant born after 24 weeks of gestation). Secondary outcomes included clinical miscarriage (pregnancy loss prior to 20 weeks of gestation), biochemical miscarriage (rise and fall in βhCG without evidence of a clinical pregnancy), ectopic pregnancy, cesarean delivery, gestational age at delivery, birthweight, and sex assigned at birth.

Statistical analysis

The study cohort was divided into five groups based on embryo grade: AA (n = 197), AB (n = 124), BA (n = 63), BB (n = 177), and “any C” (including AC, CA, BC, CB, and CC; n = 49). Study data were managed using the Stanford REDCap electronic data tool [21] and analyzed by a biostatistician who was not part of the data collection. Patient and cycle characteristics for the different embryo grade groups were compared using absolute standardized differences (ASD), which measure the difference in means or proportions between two groups in units of standard deviations [22]. Since we compared multiple groups, ASD was calculated by taking the average of the pairwise comparisons. ASD values of 0.2, 0.5, and 0.8 correspond to small, moderate, and large differences, respectively. We performed a secondary analysis including only AA euploid embryos to compare pregnancy and neonatal outcomes between three maternal age groups at transfer: < 35 (n = 70), 35–39 (n = 97), and 40 + (n = 30) years.

Multivariable logistic regression models were performed to assess associations in pregnancy outcomes with maternal age at time of embryo transfer, nulliparity, endometrial lining thickness, inner cell mass grade (A-C), and trophectoderm grade (A-C). Due to some variation in embryo grade with respect to race, diminished ovarian reserve (DOR), and male factor infertility, we assessed whether there was a relationship between these factors and pregnancy outcomes. These factors were not included in the logistic regression after univariate analysis showed no significant relationship with pregnancy outcomes. We utilized generalized estimating equations (GEE) to account for correlation between patients with repeat cycles. We calculated adjusted odds ratios (aOR) and 95% confidence intervals (CIs) to evaluate the relative odds for live birth and clinical pregnancy. Due to few live births with prematurity and/or low birthweight, comparisons between embryo morphology groups were made using descriptive statistics. Analyses were performed using the R statistical software version 3.6.2, and GEE analyses were performed using library geepack [23,24,25,26]. All statistical tests were two-sided and performed at the 0.05 significance level.

Results

Participant and cycle characteristics

A total of 431 women underwent a total of 610 NC-FET cycles between June 2016 and June 2020 at our institution. Cycles were divided into five groups based on embryo grade: AA (n = 197), AB (n = 124), BA (n = 63), BB (n = 177), and “any C” (n = 49). The maternal age at transfer ranged from 25 to 48 years (mean of 36.2), and the maternal age at retrieval ranged from 25 to 48 years (mean of 35.7). The respective mean age data for each embryo grade cohort is detailed in Table 1, ranging from mean age at transfer of 35.4 years (AA) to 37.3 years (“any C”) and mean maternal age at retrieval from 35.0 years (AA) to 36.8 years (“any C”). The overall cohort’s mean BMI was 25.0 kg/m2, with similar mean BMI among the five embryo grade groups. Other than participants’ race/ethnicity, there were no substantial differences between the embryo grade groups with respect to BMI, nulliparity, number of prior embryo transfer cycles, smoking status, or infertility diagnoses (Table 1).

Table 1 Patient demographics, baseline clinical and cycle characteristics

Cycle characteristics on the day of ovulation trigger—which included serum estradiol, LH, and progesterone—were similar between the embryo grade groups (Table 1). There was a moderate difference in the endometrial thickness on the day of hCG trigger, ranging from a mean thickness of 8.5 mm for the “any C” group to 9.5 mm for the AB group (ASD = 0.33).

Pregnancy outcomes comparing euploid embryo grades

The “any C” group had a clinical pregnancy rate of 40.8%, compared to 60.5% for BB, 61.9% for BA, 68.5% for AB, and 67.5% for AA (ASD = 0.26). Similarly, the “any C” group had a live birth rate of 40.8%, which was substantially lower compared to 56.5% for BB, 55.6% for BA, 67.5% for AB, and 64.5% for AA (ASD = 0.26). This indicates a notable drop-off in the incidence of clinical pregnancy and live birth when an euploid embryo had a “C” morphologic grade. The incidences of biochemical miscarriage, ectopic pregnancy, and clinical miscarriage were not substantially different between the groups. The “any C” and AA groups had a higher rate of Cesarean delivery (40% and 37%, respectively) compared to the other groups (30% for BB, 20% for BA, 24% for AB; ASD = 0.28) (Table 2).

Table 2 Comparison of pregnancy outcomes and neonatal outcomes among different euploid blastocyst grades

After accounting for patients undergoing repeat cycles using GEE methodology in a multivariable logistic regression, the pregnancy outcome patterns persisted (Table 3). Cycles that transferred euploid embryos with “any C” grade trended towards lower odds of clinical pregnancy and live birth. Specifically, the cycles with ICM graded “C” had a statistically significant decreased odds of clinical pregnancy (aOR 0.33; 95% CI 0.12, 0.93; p = 0.04) and live birth (aOR 0.32; 95% CI 0.11, 0.91; p = 0.03) when compared with cycles with ICM graded “A”. The differences when comparing trophectoderm grades were not statistically significant. Of note, 114 patients had repeat cycles; of those with repeat cycles, 67 patients transferred embryos of varying grades, 38 (56.3%) of whom achieved pregnancy with a lower quality embryo than previously transferred.

Table 3 Adjusted odds ratios (aORs) for clinical pregnancy and live birth

Pregnancy outcomes comparing maternal age

Our secondary analysis that included only the top-graded (AA) embryos demonstrated moderate ASD differences in the rates of clinical pregnancy and live birth between the age groups (Table 4). The 40 + age group had a lower rate of clinical pregnancy at 50%, compared to 63.9% for the 35–39 group and 78.6% for the < 35 group (ASD = 0.44). The 40 + age group also had a lower live birth rate of 43.3%, compared to 60.8% for the 35–39 group and 78.6% for the < 35 group (ASD = 0.51). The 40 + age group had a higher incidence of miscarriage at 13.3%, compared to 4.8% for the 35–39 group and 0% for the < 35 age group (ASD = 0.39). The 40 + age group also had a higher incidence of cesarean delivery at 53.8%, compared to 39% for the 35–39 age group and 30.9% for the < 35 age group (ASD = 0.38). Of note, the mean maternal age at FET for every cohort was within 8 months of mean maternal age at oocyte retrieval. Thus, overall, the age at which participants underwent FET was not substantially remote from when they underwent oocyte retrieval.

Table 4 Comparison of pregnancy and neonatal outcomes of grade AA euploid blastocysts among different maternal age groups (< 35, 35–39, 40 + years)

GEE adjusted regression demonstrated maternal age at transfer to be independently associated with outcomes (Table 3). Compared to the < 35 reference group, the 40 + age group had a significantly decreased odds of clinical pregnancy (aOR 0.54, 95% CI 0.31, 0.96; p = 0.04) and live birth (aOR 0.48; 95% CI 0.27, 0.85; p = 0.01); the 3539 age group also had a statistically significant decreased odds of clinical pregnancy (aOR 0.53, 95% CI 0.34, 0.82; p < 0.01) and live birth (aOR 0.54, 95% CI 0.35, 0.84; p < 0.01).

Neonatal outcomes comparing euploid embryo grade

The median gestational ages were similar among the different embryo grades, ranging from 273.0 days [interquartile range, IQR, 269.5, 277.5] for “any C” neonates to 275.0 days [IQR 267.0, 279.3] for AA neonates (ASD = 0.17) (Table 2). Thus, there were similar incidences of prematurity among the groups. The median neonatal birthweights were also similar among the different grades, ranging from 3200 g [IQR 2896, 3590] for AA neonates to 3450 [IQR 3319, 3650] for “any C” neonates.

The secondary analysis comparing different maternal age groups (< 35, 35–39, and 40 + year) showed comparable median gestational ages (ASD = 0.07), prematurity rates (ASD = 0.21), median birthweights (ASD = 0.23), and neonate sex distribution (ASD = 0.23) (Table 4).

Discussion

Our present study is the first to our knowledge to examine the impact of euploid blastocyst morphology on both pregnancy and neonatal outcomes in NC-FET, with the goal of informing the process of embryo selection. We demonstrated that higher graded blastocysts are associated with better pregnancy outcomes with regard to both clinical pregnancy and live birth rates. Also of note, the ICM grade specifically conveys a more significant impact than the TE grade. In addition, maternal age continues to play a significant role in pregnancy outcomes even when accounting for the embryo quality and ploidy; the odds of clinical pregnancy and live birth markedly decrease with advanced age. Nonetheless, our data reassuringly suggests that blastocyst morphology and maternal age are not associated with neonatal outcomes such as prematurity or lower birthweight.

It has been widely recognized that human oocytes retrieved for assisted reproduction have a high prevalence of aneuploidy, and multiple studies have demonstrated that the produced embryos are often affected by chromosome abnormalities [27,28,29,30,31]. Prior to PGT, the embryo selection process relied heavily on embryo morphology [1,2,3,4,5,6]. With the introduction and now rapidly growing use of PGT, clinicians can better evaluate ploidy status. The question that naturally follows is whether embryo morphology still plays a role in predicting outcomes of euploid blastocysts. Thus far, only a handful of studies have investigated such question. Capalbo et al. first examined the outcomes of euploid blastocysts in 2014 when investigating the correlation between blastocyst morphology and ploidy status. Their additional analysis of 215 FET cycles performed with euploid blastocysts found that embryo morphology was not predictive of implantation potential, though they were limited by a small sample size of lower quality embryos [9]. Similarly, Viñals Gonzalez et al. examined 179 FET cycles of euploid blastocysts and found that embryo morphology did not significantly affect implantation, clinical miscarriage, and live birth rates [10]. However, their study only included women with advanced maternal age (AMA), thereby limiting generalizability. Irani et al.’s 2016 study of 417 FET cycles included all infertility diagnoses and had a robust sample size of lower quality embryos; their findings were consistent with those of our study, suggesting that blastocyst morphologic grading was useful in predicting ongoing pregnancy rates [5]. Furthermore, they also found ICM grade to be of more significant utility when compared to trophectoderm grade. Nonetheless, the evidence regarding which component of the blastocyst grading is more consequential remains mixed. Although our study is consistent with Irani et al. among other studies [5, 32,33,34], multiple studies have found TE to be more significant, though these studies did not use PGT screened embryos [3, 4, 35, 36]. This range of conclusions highlights the need for further investigation.

Even after controlling for the morphologic grade of euploid blastocysts, our data suggests that maternal age still significantly impacts pregnancy outcomes. We continue to see a significant decline in the odds of clinical pregnancy and live birth with advancing maternal age even after euploid blastocysts of the best grade (AA) are transferred. It has long been known that aging decreases female fertility due to diminished oocyte yield and increased aneuploidy risk, and the decreased reproductive potential of older women has largely been attributed to this increased rate of aneuploidy [27, 28, 30, 31, 37,38,39,40,41]. However, when looking at studies that have only included euploid blastocysts—thereby removing the postulated main cause of age-related decline—the evidence has been mixed on whether maternal age continues to affect reproductive outcomes. Scott et al.’s 2012 study found a significant decline in the implantation rates of euploid embryos in the ≥ 35 cohort [29], while Harton et al.’s 2013 study of euploid embryos found comparable ongoing pregnancy rates among different age groups (< 35, 3537, 3840, and 4142 years) [31]. Neither study reported embryo morphology, and both included cleavage-stage embryos. More contemporary, Irani et al.’s 2019 study found maternal age only significantly impacts the quantity of euploid embryos achieved rather than implantation potential [42], while Reig et al.’s 2020 study found implantation rates to negatively correlate with age among euploid embryo transfers [43]. Thus, further investigation is warranted, and our data suggest that there are potentially age-related factors beyond aneuploidy and morphologic quality that impact reproductive potential.

Our cohort study is unique in several aspects. It is the first to study the effect euploid blastocyst morphology has on both pregnancy and neonatal outcomes. Additionally, we studied all completed cycles using euploid blastocysts over the course of four years, thereby minimizing selection bias and allowing for a robust sample size. The inclusion of only NC-FET cycles (the most common protocol for FET at our clinic) also minimized potential confounding of pregnancy and neonatal outcomes by different endometrial preparations. Furthermore, we gathered all fertility, pregnancy, and neonatal data from medical records, minimizing recall bias.

Our study has some limitations. Although we had a relatively large sample size compared to prior studies, the number of cycles that transferred grade C blastocysts was small (49 out of 610). The majority of patients in our cohort were Caucasian and Asian, potentially limiting generalizability to other ethnic groups. Thus, further studies are needed to confirm our findings and to understand the mechanisms behind these differences in outcomes.

PGT utilization is on the rise and now used in over 40% of all cycles in the USA, prompting investigation into the utility of morphologic grades during embryo selection [44]. Our findings suggest that even when euploidy is confirmed, embryo grade—particularly the ICM grade—is still associated with pregnancy outcomes, and age independently remains a significant factor in the odds of success. Reassuringly, morphologic grade and maternal age are not associated with neonatal outcomes such as preterm delivery and birthweight, suggesting that lower grade euploid embryos still could be viable options for transfer. The results from our study may not only help guide embryo selection, but also facilitate patient counseling during this critical shared-decision-making process.