Introduction

For infertile men, microdeletion in the Y chromosome is one of the common genetic factors leading to spermatogenic disorders. The incidence of microdeletions in the azoospermia factor (AZF) region of the Y chromosome is about 10–15% in patients with azoospermia and severe oligozoospermia [1]. The deletion region involves three loci: AZFa, AZFb, and AZFc. The Y chromosome is not necessary for an individual’s life, but it is required for male sexual differentiation, and Y chromosome disruptions cause male infertility. The Y chromosome has a complex structure that includes repeated elements [2], which are associated with various genomic rearrangements, including involvement in proposed mechanisms of structural rearrangements, such as non-allelic homologous recombination [3].

Because of the structural complexity of the Y chromosome, it took 30 years to accurately detect the AZF region. Structural rearrangements involving these repetitive sequences have been difficult to detect. Current chromosome screening methods are mostly dependent on the diploid genome and may not allow detailed investigation of the haploid state of the Y chromosome. To date, several methods have been employed to detect copy number variation of Y chromosome. The choice of method largely depends on the resolution and throughput provided by each method, as well as the inherent characteristics of the information generated.

Cytogenetic methods can show altered structures down to 5 Mb in size and aneuploidies. However, chromosome karyotyping analysis has a low throughput. The first report of the AZF locus on the Y chromosome used cytogenetic analysis. For higher resolution analysis, fluorescence in situ hybridization (FISH) techniques can be utilized on interphase nuclei or metaphase chromosomes. The FISH method can provide more detailed information, from ~ 100 kb down to a few kilobases. Moreover, it can be used to reveal inversions, by the use of multiple probes, mainly targeting specific variants. In comparison, the PCR method is ideal for screening a limited number of specific variants, as well as multiplexing and the simultaneous analysis of multiple regions. The PCR approach has been used to design tests for AMELX /Y [4] and AZF microdeletions [5]. One study reported using the sequence-tagged-site PCR (STS-PCR) method to evaluate the AZFc structure in more than 20,000 individuals [6]. Multiplex STS-PCR assays are currently considered the standard protocol according to the best practice guidelines of the European Molecular Genetics Quality Network (EMQN: http://www.emqn.org). Compared with the two methods mentioned above, microarray technologies can yield more information and offer the advantages of high-throughput analysis. However, array methods have critical limitations in the detection of long identical repeats. Although genome-wide or Y chromosome copy number variation (CNV) studies have been reported using array methods [7,8,9], the method could not specify clear signal for the long, nearly identical repeated units, for which array-based methods are based on shorter probes and the similarity of sequence between a probe and a target genomic region; in particular, if there is a small change in copy number, the method cannot tell which one has mutated [10].

Next-generation sequencing (NGS) allows the detection of variants or mutations by DNA or RNA sequencing, and the potential of base-pair resolution, and was recently the primary source of data for studying CNVs on the Y chromosome [3, 11] and other parts of the genome [12, 13]. However, the genomic context of Y chromosome amplifies the inherent limitations of NGS.

At present, the STS-PCR method is often used to detect Y chromosome microdeletions, but it cannot be used to detect previously undescribed deletion types. In this study, we applied a high-performance, accurate, and low-cost method suitable for clinical use, where the classical PCR method was used to detect STS loci on the Y chromosome.

Materials and methods

Patients

The present study selected cases from men seeking fertility consultation, from January 2011 to May 2017, at the Center for Reproductive Medicine, First Hospital of Jilin University. Medical history, physical examination, and questionnaire counseling were required for all subjects, and clinical examination results, such as hormone levels and semen analysis, were also collected. All subjects included in the present study did not exhibit known clinical causes of infertility, such as infection and varicocele.

A total of 326 subjects were recruited, in which 222 were infertile patients with abnormal semen results, 58 were male family members of these infertile men, and 46 were sperm donors with normal semen (from the Jilin Province sperm bank). In the present study, all subjects were selected for NGS analysis from the STS-PCR database that all infertility cases (excluding those with infection or varicocele) with microdeletion detected by STS-PCR. The inclusion criteria were to ensure that most carriers of AZF microdeletions, from STS-PCR assays, were included in the study. Therefore, clinical statistics of incidence are for reference only.

Semen analysis

Ejaculates were obtained after 3–5 days of sexual abstinence. Semen analysis was performed according to the protocol of the World Health Organization (5th edition) (http://www.who.int/en/). Patients were diagnosed as exhibiting azoospermia when no sperm was found in at least three ejaculates after sample centrifugation, or having oligozoospermia when sperm concentrations of < 20 × 106/mL were obtained for the last three semen samples, taken at intervals of at least 1 week.

Y chromosome microdeletions analysis by STS-PCR

The STS-PCR analysis was performed following the European Academy of Andrology and the European Molecular Genetics Quality Network (EAA/EMQN) guidelines [14]. All patients and control samples were tested by multiplex PCR using classical AZF markers; STS markers were AZFa (sY84, sY86), AZFb (sY127, sY134), and AZFc (sY254, sY255). The internal controls were sY14 and ZFX/ZFY markers. The PCR analysis was performed twice to confirm any deletion.

Confirmation of detected partial AZFc deletions using STS-PCR

To verify partial AZFc deletions, the subject samples were tested by STS-PCR using established STS markers sY1191 and sY1291 [14]. DNA was then assessed by gel electrophoresis. Sample DNA was replaced with water for a blank control, and a normal male sample was used as a negative control.

The partial deletion termed gr/gr was identified by the absence of marker sY1291 and presence of marker sY1191. The b2/b3 deletions were characterized by the absence of marker sY1191 and presence of marker sY1291. The b1/b3 deletion was characterized by the absence of both sY1291 and sY1191 markers [14].

High-throughput sequencing of the Y chromosome AZF region

Specific oligonucleotides were designed as markers for high-throughput sequencing of the AZF region on the long arm of the Y chromosome. All 138 loci in the AZF region and 10 loci of housekeeping genes on the Y chromosome were selected based on the human genome database (http://genome.ucsc.edu/). The oligonucleotides had uniform melting temperatures and minimal complementarity with universal amplification sequences. Each of the 138 AZF loci had confirmed repeat numbers (denoted the theoretical copy number) and each locus was not found in other regions of the human genome. Six traditional STS markers used for multiplex PCR were also represented. Three oligonucleotides per locus were used: a left locus-specific oligo, a 5′-phosphorylated middle locus-specific oligo, and a 5′-phosphorylated right locus-specific oligo. All oligonucleotides were pooled together to create an AZF microdeletion assay oligo pool.

End-repair, acetylation, and adapter ligation were performed for library preparation following standard personal genome machine protocols, with sequencing performed on an Ion PGM Analyzer (Life Technologies, Carlsbad, CA), according to the manufacturer’s protocol [15].

For each selected locus, sequencing reads were aligned to the expected sequence using TMap software (Karolinska Institute, Solna, Sweden), and reads with fewer than three mismatches were counted. The read count for each locus was normalized to the percentage value using the total sequencing read count and housekeeping loci value. Next, the copy number of each locus was then compared with its theoretical copy number. Specific experimental details were described previously [16].

Results

Flowchart of the present study

A flowchart of the study is shown in Fig. 1.

Fig. 1
figure 1

Flowchart on detection of Y chromosome microdeletion with STS-PCR and NGS method

Comparison of AZF microdeletions detected by the STS-PCR and NGS methods

We selected 326 subjects for analysis by the STS-PCR and NGS methods. Microdeletions were found in 120 patients with male infertility by the STS-PCR, and all these microdeletions were also detected by NGS method. In the 160 patients and male family members with no detectable microdeletions by STS-PCR, we found 51 cases exhibiting structural variations of the AZF region by NGS analysis, with a detection rate of 31.88% (51/160). In the 46 sperm donors without detected microdeletions by STS-PCR, 11 were found to have structural variations by NGS analysis, and the detection rate was 23.91% (11/46) (Table 1).

Table 1 Overall results of 326 subjects screening for AZF microdeletions

NGS results for 120 patients with AZF microdeletions

The 120 patients with AZF microdeletions detected by STS-PCR and NGS analysis, the most frequent structural variation shown by NGS was the AZFc microdeletion (64.17%, 77/120), followed by the AZFb+c microdeletion (22.5%, 27/120). There were also three novel structural variations: AZFa+partial c microdeletion (Table 2, No. 4), AZFb+c microdeletion combined with a g1/g3 inversion (Table 2, No. 16), and AZFc microdeletion combined with a b1/b2 fragment repeat (Table 2, No. 18).

Table 2 Results of NGS method in 120 patients with AZF microdeletions detected by STS-PCR method

NGS analysis of 160 patients and male family members with no detectable AZF microdeletions by STS-PCR

Of the 160 patients and male family members with no detectable AZF microdeletions by STS-PCR, the NGS analysis found that 51 subjects had structural variations in the AZF region (31.88%, 51/160). The remaining 109 cases had no structural variations detected by NGS. Of the 51 subjects with AZF structural variations, 31 cases had four types of deletions (19.38%, 31/160), nine cases exhibited six types of fragment repeat (5.63%, 9/160), and 11 cases had nine types of deletion and repeat (6.88%, 11/160). Inversion structural variation also occurs with deletion or repeat (Table 3).

Table 3 Results of NGS method in the 51 subjects without AZF microdeletion detected by STS-PCR method

NGS analysis of 46 sperm donors without AZF microdeletions detected by STS-PCR

Of the 46 sperm donors with no detectable microdeletions by the STS-PCR method, NGS analysis revealed 11 cases with structural variations, with a detection rate of 23.91% (11/46). Of these 11 subjects, five cases had three types of deletion (10.87%, 5/46), two cases had two types of fragment repeat (4.35%, 2/46), and four cases had three types of deletion and repeat (8.70%, 4/46). Inversion structural variation also occurs with deletion or repeat (Table 4).

Table 4 Results of the 46 sperm donors on AZF region revealed by NGS method

STS-PCR confirmed of AZFc partial deletions detected by NGS

Fifteen cases with AZFc partial deletions revealed by NGS were selected for STS-PCR analysis, including three cases of b1/b3 deletions, five cases of b2/b3 deletions, and seven cases of the gr/gr deletion.

The deletions present in these cases were detected by STS-PCR using the sY1291 and sY1191 markers. The absence of sY1291 alone indicated deletion of the gr/gr region. The absence of sY1191 indicated deletion of the b2/b3 region. The absence of both sY1191 and sY1291 indicated deletion of b2/b3 region. The detection results are shown in Fig. 2.

Fig. 2
figure 2

Results of sY1191 and sY1291 by STS-PCR method. Note: ① represent detection of marker sY1191; ② represent detection of marker sY1291. JX009 and Y053 are cases with b1/b3 deletion. Y108 is a case of gr/gr deletion. Y046 is a case of b2/b3 deletion

Discussion

Y chromosome microdeletions have clinical significance for the diagnosis of infertile men [17]. The Y chromosome contains a high proportion of segmental duplications, and CNVs generated from these structural features are usually challenging to detect. At present, STS-PCR analysis is usually used to screen Y chromosome AZF deletions. However, the limited STS markers do not allow for comprehensive detection of duplications or rare deletions on the Y chromosome. The detection of such variants requires analysis with more throughput and higher resolution of the Y chromosome.

The application of NGS provides the possibility of a low-cost and high-throughput screening method. We have developed a novel capture sequencing method that utilizes unique STSs on the Y chromosome and offers the potential to detect established and unknown structural variations.

We selected 326 subjects, including 222 infertile patients with abnormal semen results, 58 family members of these infertile male patients and 46 sperm donors (Fig. 1). The 46 sperm donors had normal semen, and no deletion of the AZF region was detected by STS-PCR. In the remaining 280 subjects, STS-PCR analysis found that 120 patients had AZF region microdeletions, and 160 patients and male family members had no detectable AZF microdeletions (Table 1).

All patients with male infertility and microdeletions in the present study were selected after initial STS-PCR analysis, so the cases examined by NGS do not reflect the overall clinical incidence of Y chromosome variations.

All the microdeletions identified in 120 patients by STS-PCR were also detected by the NGS method. The exhibited structural variations included partial deletions, inversions, and duplications in the AZF region. To verify the repeatability and stability of the technique, two researchers independently analyzed each batch.

Targeted NGS has detected copy number variation in the field of pre-implantation diagnosis and screening of birth defects, and had the same sensitivity as examples of array-based methods, which are the gold standard for the detection of CNV [18, 19]. The NGS method had good sensitivity and specificity for detection of Y chromosome microdeletions. Eighteen different deletion types were detected by the NGS method, while seven of these were detected by the STS-PCR method, demonstrating that the NGS method is more accurate and it detected more types of variation. Another advantage of the NGS method was that it detected the detailed the specific range and fragment size of structural variations, such as a partial deletion (Table 2, No. 2), structural inversion (Table 2, No. 16), and fragment repeat (Table 2, No. 18). A graph of the pattern of normal male NGS results is shown in Fig. 3.

Fig. 3
figure 3

The pattern graph of normal male NGS method results

In the present study, the NGS method identified structural variations of the AZF region in 31.88% of subjects that had no detectable microdeletions by STS-PCR analysis (Table 3).Four different deletion patterns, six fragment repeat patterns, and nine deletions with repeat patterns were detected by NGS analysis. These structural variations were concentrated in the AZFc region, and the size of the altered regions ranged from 0.2 to 5 Mb, including a 5.7 Mb repeat.

For sperm donors with no microdeletions detected by the STS-PCR method, 23.91% of donors were found to exhibit structural variations by NGS analysis (Table 4). Of these variations, five cases were deletions (10.87%, 5/46), two cases were fragment repeats (4.35%, 2/46), and four cases were a deletion with a repeat (8.70%, 4/46). There were three types of deletion, two types of fragment repeats, and three types of deletion with repeat variations. These structural variations were also concentrated in the AZFc region, and the size of the variations ranged from 0.4 to 3.5 Mb.

It should be pointed out that the partial deletion types detected above, such as b1/b3 deletion and gr/gr deletion, can also be detected by extended analysis of STS-PCR method [14].

The effects of AZF structural variations upon fertility have been previously summarized by diagnostic infertility screening studies [20]. Microdeletions in the AZFa, AZFb, AZFb+c, and AZFc regions were associated with oligospermia or azoospermia. Patients with complete deletions of AZFa, AZFb, or AZFb+c regions typically have no testicular sperm, so intracytoplasmic sperm injection is usually not possible for these subjects.

AZFc deletions are the most frequent types of Y chromosome deletion and produce various clinical phenotypes for carriers [21]. Patients with AZFc deletions and oligospermia may exhibit a progress decrease in sperm concentration over time [22]. Sperm cryopreservation is recommended for assisted reproductive treatment of these carriers [14, 23].

Partial AZFc deletions involving three subregions, gr/gr, b1/b3, and b2/b3, were significantly more frequent among sterile males, although frequencies varied widely across different populations [24]. A common partial deletion within AZFc involves a 1.6 Mb gr/gr deletion, which removes four genes and is also associated with the risk of spermatogenic failure. Nevertheless, this deletion has only minor phenotypic effects in some haplogroups, suggesting that compensating variants may exist elsewhere. It was reported that the gr/gr deletion may lead to complete AZFc region deletions [25]. These couples should be aware that such deletions provide genetic risk factors for impaired spermatogenesis in male offspring and that there is a high risk of transmitting complete AZFc deletions.

In one report, the incidence of partial AZFc duplication was significantly higher in infertile men than in fertile men [26]. Therefore, AZFc structural variants may have variable effects on the risk of spermatogenic failure. If there are no phenotypic consequences of structural variants, should Y chromosome AZF variation be considered neutral? Like as different types of Y chromosome repeats. Further research is required to address this question.

The structural variation of the Y chromosome reflects its special haploid state and high content of repeat sequences. Research into CNV of these repeats has developed over the years, and CNV has been linked to fertility and human evolutionary traits. Microdeletions in the AZF regions are associated with spermatogenic dysfunction and male infertility. Other forms of structural variation have been recognized, such as the inversion/repeat structural variation of the AZF region associated with azoospermia/ oligospermia [27,28,29]. Therefore, in addition to AZF microdeletions, AZF region structural variation may lead to spermatogenic disorders. The current study designed and developed a NGS detection scheme for the AZF region, and we hope to further detect structural variations of AZF region, improve the detection accuracy of variations, and meet the needs of scientific research and future clinical practice.

In the present report, an extended STS-PCR test with STS markers sY1291 and sY1191 was carried out for AZFc partial deletion patterns detected by NGS but not detected by conventional PCR. The selected samples included three cases of b1/b3 deletions, five cases of b2/b3 deletions, and seven cases of gr/gr deletions. The extended STS-PCR analysis was 100% consistent with the findings of NGS analysis (Fig. 2). One previous study also utilized the NGS method with STS markers to reveal deletions in the male-specific region of the Y chromosome in subjects with non-obstructive azoospermia [30].

The current NGS detection technique provides the possibility of detecting new structural variations. At present, deletions with established clinical significance can be detected by the STS-PCR method, while the clinical significance of other variants identified by the NGS method remains mostly unknown. And the detection of new variations may present a dilemma for the application of assisted reproduction technology. However, from a research perspective, NGS technology provides a useful tool to uncover the structural variability of highly dynamic regions of the human genome, such as the Y chromosome. Understanding the clinical significance of structural variations in AZF regions will require more clinical data and breakthroughs in functional verification.

Conclusion

The established NGS analysis can replace the conventional STS-PCR method to detect Y chromosome microdeletions. The NGS method can detect CNVs, such as partial deletion or duplication, and provide details of abnormal ranges and sizes of variations.