Introduction

Prior to the introduction of intracytoplasmic sperm injection (ICSI) in the early 1990s, low fertilization and/or fertilization failure was anticipated in conventional insemination in vitro fertilization (IVF) cycles with known abnormal pretreatment semen analysis parameters [1, 2]. However, suboptimal fertilization or failed fertilization following conventional insemination is usually unpredictable in patients/couples with normal pretreatment semen parameters and risk factors for this outcome have not been confirmed. A low rate of normal fertilization (defined as the number of two pronuclear (2PN) zygotes/number of metaphase 2 (M2) oocytes of less than 30%) or absent fertilization complicates 5–20% of cycles with conventional insemination [3, 4]. Failed fertilization occurs in approximately 5–10% of IVF cycles with conventional insemination compared to 2–3% in IVF ICSI cycles [5, 6]. Since the estimated probability that a single mature oocyte will fertilize is approximately 30–40%, it is anticipated that the rate of fertilization failure would be directly related to the number of mature oocytes retrieved [3]. However, the incidence of fertilization failure is higher than would be expected based on these odds and warrants further investigation.

It has been suggested that gamete quality (oocytes and sperm, respectively) at the time of oocyte retrieval and insemination may play a role in mediating this low fertilization [7]. The inability of sperm to bind and/or penetrate the zona pellucida results in the absence of sperm nuclei in the unfertilized IVF oocyte [8, 9]. Unfortunately, there is a paucity of data investigating predictors for a normal rate of fertilization at the time of IVF, and it has proven difficult to predict the likelihood of low or no fertilization during conventional insemination among cycles with normal semen parameters. Given the utility of identifying those at risk for suboptimal fertilization during first IVF attempt with conventional insemination, and the potential benefit of offering ICSI preemptively, the objective of this study was to evaluate factors associated with low fertilization in autologous conventional insemination IVF cycles from couples without a diagnosis of male factor infertility.

Materials and methods

The study included couples that underwent controlled ovarian hyperstimulation and IVF at the Massachusetts General Hospital Fertility Center from 2004 through 2014. After institutional review board approval was obtained, data were derived from the electronic medical record system. The first IVF cycle attempt was exclusively examined to exclude any bias that prior cycles may confer on clinical management in subsequent cycles. Fresh IVF cycle attempts that utilized autologous gametes for IVF were reviewed (N = 1,370). Cycles which utilized ICSI, donor oocytes, pre-implantation genetic testing, had abnormal pretreatment diagnostic semen parameters (male factor infertility), or a Society for Assisted Reproductive Technology (SART) diagnosis of male factor infertility were excluded. Final analyses were limited to only cycles (N = 663) with complete information for ethnicity, age, body mass index (BMI), stimulation protocol, diagnosis, antral follicle counts, duration of infertility, ovulatory induction hormones levels, semen sample level, embryo transfers, and clinical outcomes. Patients with incomplete cycle or birth information were excluded. While it was anticipated that there would be female and male characteristics and diagnostic testing results that would be associated with the rate of normal fertilization, we had no a priori hypothesis on which specific tests would be related to this outcome.

Semen analysis assessment

Semen analyses were performed prior to beginning any treatment and again on the day of oocyte retrieval. Semen analyses were processed at Massachusetts General Hospital, as previously described [10]. Briefly, male patients were instructed to have between 2 and 3 days of abstinence prior to collection of the semen sample. After collection, the sample was allowed to liquefy for 15 minutes. The volume and viscosity were assessed using a wide-bore pipette. Semen aliquot was placed on a prewarmed Leja slide (Spectrum Technologies, CA). A computer-aided semen analyzer (CASA, HTM-IVOS Version 10HTM-IVOS, Beverly, MA, USA) was used to assess sperm concentration (million/ml) and motility (% motile). Motility was classified using World Health Organization (WHO) into progressive and non-progressive sperm cells [11]. Total progressive motile sperm count (million/ejaculate) was calculated by multiplying total sperm count by progressive motility. Sperm morphology was assessed was assessed with the Kruger strict criteria [12]. In addition to the evaluation above, the samples provided on the day of egg retrieval were processed by double-layer density gradient isolate (Irvine Scientific 99264, Santa Ana, CA). The sample was then washed twice to remove the isolate. The supernatant was removed, and the resulting pellet was resuspended in 1 ml of Quinn’s Advantage Protein Plus Fertilization Medium (Sage 1520, Sage, Trumbull, CT).

Clinical protocols

Controlled ovarian hyperstimulation was performed with luteal-phase gonadotropin-releasing hormone (GnRH) agonist, GnRH-antagonist downregulation or GnRH agonist flare protocol, as clinically indicated [13]. Women were pretreated with oral contraceptive pills (OCPs) (30 μg ethinyl E2/0.15 desogestrel, Apri, Teva Pharmaceuticals, North Wales, PA) or ethinyl estradiol patch (0.1 mg/day, Vivelle-Dot estradiol transdermal system, Novartis Pharmaceuticals Corporation, East Hanover, NJ) for follicular synchronization and/or priming. Pituitary downregulation with leuprolide acetate (Sandoz Inc., Princeton, NJ) was started 5 days before stopping OCPs in the by luteal-phase GnRH agonist cycle or 5 days after stopping OCPs in the GnRH agonist flare protocol. On the third day of induced menses, patients began controlled ovarian hyperstimulation with recombinant gonadotropins (Follistim, Merck, White House Station, NJ; Gonal-F, EMD-Seron, Rockland, MA; and Menopur, Ferring Pharmaceuticals, Parsippany, NJ). Patients were serially monitored with serum estradiol and transvaginal ultrasound to assess follicular measurements and endometrial thickness. In the GnRH antagonist protocol, the GnRH antagonist (0.25 mg, ganirelix acetate, Organon, Roseland, NJ; or 0.25 mg, Cetrotide, EMD Serono, Rockland MA) was initiated when the serum estradiol concentration exceeded 1000 pg/ml or the lead follicle size reached 14 mm. When there were three lead follicles > 16 mm, intramuscular human chorionic gonadotropin (hCG) (10,000 IU, Novarel, Ferring Pharmaceuticals or 10,000 IU, Pregnyl, Merck) was administered to induce final oocyte maturation. The patients underwent transvaginal ultrasound-guided oocyte retrieval 35–37 h after human chorionic gonadotropin trigger [13].

Following egg retrieval, oocytes underwent conventional insemination. Fertilization status of each oocyte was determined 24 h after insemination. Normal fertilization rate was defined as the ratio of the total fertilized oocytes with resultant 2 pronuclei zygotes (2PN) per the total number of MII phase matured oocytes inseminated. Treatment cycles were classified into two discrete categories based upon a threshold fertilization rate of ≤ 30%. Biochemical pregnancy rate was defined as a positive hCG (hCG > 6 IU/l) 17 days after oocyte retrieval. Clinical pregnancy was defined as the presence of gestational sac on ultrasound during the 6th week of gestation and live birth was defined as delivery of a neonate born > 24 weeks of gestation. All outcomes are reported as per cycle initiated.

Statistical analysis

Since the goal of the study was to identify the set of covariates that best predicted low fertilization, we randomly divided the 663 treatment cycles into two halves by using a random number generator. The first half served as the training set (392 cycles) and was used to identify the most predictive set of covariates. The second half served as the validation set (391 cycles) where the results of the findings from the training dataset were corroborated. Descriptive statistics were calculated and compared for demographic and reproductive characteristics between these two datasets. A chi-square test or Fisher’s exact test was used to test for differences across categories for discrete variables and the Kruskal-Wallis test for differences across categories for continuous variables.

Logistic regression with fixed effects was used to predict the risk of fertilization rate ≤ 30%. A stepwise variable selection process was used to identify predictors associated with low fertilization. Variables were allowed to enter the model when the univariable p value was < 0.05 and remained in the model if they remained associated with low fertilization rate at p < 0.05 when other variables entered the model. In order to avoid overrepresentation of cycles with few available oocytes for fertilization, we weighted the models by the total number of M2 oocytes retrieved. The female partner individual-level variables that were considered in this automated selection process were age (continuous, years), BMI (continuous, kg/m2), ethnicity (discrete, non-Hispanic Caucasian vs. all others), initial infertility diagnosis (discrete, idiopathic infertility vs. all others), and antral follicle count classified into three discrete levels: diminished ovarian reserve count (< 6 follicles), normal antral follicle count (6–24 follicles, reference), and polycystic morphology (≥ 25 follicles) [14]. The male partner individual level variables considered were sperm concentration (continuous, million/ml), normal morphology (continuous, percent), and total progressive motility after swim-up processing (continuous, million/ml) from the most recent pretreatment semen analysis, as well as sperm concentration (continuous, million/ml), total motile count (continuous, million/ml), and postprocessing sperm progressive motility (continuous, percent) from the semen sample obtained on the day of egg retrieval. The cycle level variables considered were stimulation protocol (luteal phase GnRH agonist [0.25 mg per day], regular dose luteal leuprolide [0.5 mg per day], leuprolide flare, or GnRH antagonist); assisted hatching (binary), duration of infertility (continuous, months), day 3 FSH (IU/l), and day 3 estradiol (continuous, pg/ml); estradiol (continuous, pg/ml) at ovulation trigger day, day of hCG trigger (continuous), and number of ampules of IVF medications used (continuous, units).

The cutoff values for each significant predictor remaining in the model were identified through an iterative process where the logistic regression models were fitted that included all the significant predictors. In each iteration, predictors were individually dichotomized at different values, in increments of 5 units. The predictor value that maximized the significant odds ratio for low fertilization was retained. Next, each cycle was classified according to the number of dichotomized risk factors and estimates were obtained of the odds ratio for low fertilization for cycles with increasing number of risk factors relative to cycles without any risk factor.

The predictors and cutoffs from model fit from the validation set were incorporated with those from the training set. For all models, the receiver operating characteristic (ROC) curve statistics, the training and validation models’ overall area under the curves (AUC) with their respective 95% confidence interval (CI), and the p value of the difference among the training and validation sets’ AUC were analyzed. The individual odds ratios (ORs) for each predictor from their regression coefficient were predicted. All analyses were conducted using the Statistical Analysis System Software package SAS 9.4 (SAS Institute Inc., Cary, NC) and considered two-sided significance levels less than 0.05 as statistically significant.

Results

Sixty-six of the 663 cycles had a fertilization rate ≤ 30%, including 32 cycles with absent fertilization utilizing male partner’s autologous sperm. The most common initial infertility diagnosis was idiopathic infertility (49%). Overall, the mean fertilization in the training and validation sets was 71%. Patient characteristics for the training and the validation sets are shown on Table 1. There were no differences between the training and validation sets with respect to baseline demographic and clinical characteristics, response to ovulatory induction (amount of gonadotropins used, day of hCG trigger, estradiol on day of hCG trigger, etc.), fertilization rate, and clinical treatment outcomes (clinical pregnancy and live birth rates). Semen parameters were comparable between the training and validation sets although sperm morphology was lower (but within the normal range) in the validation group compared to the training group (7.0% vs. 8.0%; p value = 0.02). There were no differences between pregnancy or ongoing pregnancy rates between the two groups, but there was a higher live birth rate in the training set per cycle initiated.

Table 1 Baseline characteristics of the participants (N = 663 women)

Stepwise regression retained only two significant predictors: postprocessing sperm concentration and postprocessing sperm progressive motility (Table 2). The cutoff value for postprocessing concentration was 40 million/ml and for postprocessing progressive motility was 50%. Overall, 150 of the couples in the training set and 154 in the validation set met had either postprocessing concentration or progressive sperm motility below these parameters. In the training set, the odds of low fertilization were 3.57-fold higher in cycles with a postprocessing concentration below 40 million/ml and 6.2-fold higher for cycles with a postprocessing progressive motility below 50% (Table 3). The results were similar, albeit reduced, in the validation set (Tables 2 and 3). Additionally, the change in concentration between the pre- and postwash samples were evaluated and were not related to low fertilization in the multivariate analysis model.

Table 2 Odds ratio for low fertilization rate predictors
Table 3 Predictors of low fertilization

Each cycle was classified based on having postprocessing concentration or progressive sperm motility below the cutoff. Cycles with one or both adverse variables were compared relative to cycles without either predictor (Table 4, Supplemental Table 1). The presence of either low postprocessing concentration or low postprocessing progressive motility was associated with a 5-fold greater odds of fertilization failure in the training set (Table 4). Moreover, having low levels of both postprocessing concentration and progressive motility was related to a 22-fold greater odds of low fertilization in the training set. These associations were reduced in the validation set, where cycles with adverse values of both postprocessing semen parameters had a 13-fold greater odds of low fertilization (Table 4).

Table 4 Predictive model for low fertilization rate

Discussion

In the absence of abnormal pretreatment semen parameters, low fertilization following conventional insemination during the first IVF attempt has been difficult to predict. In addition to limited fertilization, these cycles frequently result in limited embryo cohorts (number and quality) and may also result in diminished clinical outcomes. The present study investigated the impact of male, female, and IVF cycle characteristics on the rate of normal fertilization. We observed that fresh ejaculate sperm concentration and total progressive motility on the day of the oocyte retrieval were predictive of low fertilization in IVF conventional insemination cycles. To our knowledge, this is the first study to report specific factors that predict low fertilization during first attempt conventional insemination IVF cycles.

A comprehensive evaluation of risk factors for low or no fertilization in couples without identifiable indications for ICSI has not been reported previously. Studies investigating conventional insemination with low fertilization were primarily conducted more than a decade ago, did not establish semen parameter threshold for low or no fertilization, were comprised of relatively small numbers of (fewer than 150 patients), and primarily identified only male-related factors [6, 15]. In an early observational study, abnormal parameters for sperm viability, motility, and morphology were associated with low fertilization [1]. A retrospective cohort by Repping et al., it was reported that the decreased total motile sperm count on the day of the egg retrieval increased the likelihood of fertilization failure (zero 2PNs). However, the investigators of this study were unable to determine a specific cut off value to predict fertilization failure [16]. In patients who have had a history of fertilization failure with ICSI and were undergoing a subsequent cycle, higher numbers of oocytes retrieved, mature oocytes and higher estradiol at trigger were associated with improved fertilization during the subsequent attempt [17]. In contrast to prior studies, our study examined the largest population over a 10-year period of time to establish thresholds for semen parameters on the day of oocyte retrieval which identify patients with an increased risk of low fertilization that may be clinically actionable for providers.

It has been speculated that the inability to predict low fertilization during conventional insemination has contributed to the 51.5% increase in ICSI utilization for non-male infertility [18]. However, lower pregnancy rates in couples undergoing elective ICSI have also influenced the recommendation of several investigators not to implement the universal use of ICSI for all IVF patients [5, 19, 20]. It has been estimated that ICSI would need to be utilized in 33 cases to prevent one occurrence of fertilization failure [5]. The literature on the safety of ICSI warrants additional investigation with conflicting results with respect to adverse long-term outcomes [21,22,23,24]. The impact of the widespread use of ICSI on healthcare system expenditures or resulting offspring has yet to be fully appreciated. However, following one fresh IVF cycle with failed fertilization, the estimated recurrence rates for future cycles is 30% [25, 26]. The findings of our study may allow clinicians to better identify IVF cycles “at risk” for suboptimal fertilization with conventional insemination and provide guidance for recommending ICSI on a more selective basis as a means to potentially improve fertilization results.

A primary strength of this study is the inclusion of female demographic, fertility, and stimulation factors in addition to male semen parameters in our analyses. By using half the patients in the training set, the validation set was used to account for confounding variables (such as few numbers of oocytes and fertility diagnosis) on the outcome. The univariable analysis provides a comprehensive model to identify those with a positive score (concentration below 40 million/ml, total progressive motility less than 50% factor, or both). The probability of fertilization failure was highest when both the concentration and total progressive motility were below the cutoff, Supplemental Table 2. In models presented herein, having either low concentration and/or total progressive motility on the day of retrieval has a positive predictive value of 87.3% and a negative predictive value of 4.4%. The overall sensitivity was 70.4%, with a specificity of 57.0%.

There are several limitations of this study. Due to the retrospective nature, there may have been factors that influenced the clinical decision-making that were not reflected in the data collected. An example includes the 100 patients (excluded from our study) with normal pretreatment and day of egg retrieval semen parameters who utilized ICSI without clear indications. Approximately half of those subjects had idiopathic infertility, which may reflect increased utilization in those couples whose infertility remains unexplained. Additionally, the fertilization was based on the number of mature oocytes retrieved. In couples undergoing ICSI, cumulus cells are stripped from the oocytes to allow for definitive assessment of oocyte maturity. In conventional insemination, assessment of oocyte maturity is not as accurate as ICSI, since cumulus stripping is not performed and may lead to an overestimation of oocyte maturity. The assumption is that these oocytes were mature at time of insemination and thus negatively impact fertilization rate. An additional limitation to consider is that history of prior paternity was inconsistently and incompletely captured in our electronic medical record system and could not be included as a variable. Lastly, the patients that had low fertilization often required additional cycles of IVF to obtain pregnancy, which was not reviewed here; however, the result of the first IVF cycle remains a pivotal point for clinical decisions of subsequent cycles. While many studies have shown improved outcomes utilizing ICSI after previous low fertilization, others have not corroborated those findings [4, 27, 28]. Furthermore, a study by Tomas et al. showed lower pregnancy rates in those with prior low fertilization when compared to couples with male factor infertility utilizing ICSI during subsequent IVF cycles [29], raising the issue of whether ICSI would have prevented the low fertilization during prior cycles. Unfortunately, it has not been demonstrated if preemptive ICSI performed due to the thresholds for semen parameters we report would result in better fertilization rate and/or improved live birth rates in all couple with a variety of infertility diagnoses. This highlights the complex interplay of multiple factors impacting fertilization that requires further investigation.

In conclusion, specific semen parameters on the day of oocyte retrieval are associated with low and absent fertilization during conventional IVF cycles even with normal pretreatment semen parameters. The findings of this study provide further insight into possible predictors of suboptimal fertilization outcomes. Unanticipated low semen concentration and/or progressive motility on the day of egg retrieval in cycles with planned conventional insemination may warrant a discussion of ICSI with the goal to prevent low fertilization. If validated by prospective investigation, these data may have the potential to optimize ART results by identifying those at risk for low fertilization.