Introduction

Y-chromosome short tandem repeats (Y-STRs) are inherited with the characteristics of the non-coding region no switching and non-recombination in the genetic inheritance from father to son [1, 2]. Because of this, they have unique application value in forensic science, particularly in paternal lineage differentiation, male identification in sexual assault cases, male component detection in male-male or male-female DNA mixtures, and in DNA genealogy construction [3,4,5]. In the past few years, several commercial Y-STR testing kits have been developed, for example PowerPlex® Y23 system, AGCU Y24 kit, and Goldeneye 20Y kit [6,7,8]. However, these kits are limited in their ability to differentiate male lineages in isolated and inbred populations with insufficient discrimination power [9, 10]. The Yfiler® Plus kit was recently developed in part to resolve this problem [11, 12]. It has a higher discrimination ability which is to a large part due to being based on the identification of seven rapidly mutating Y-STR loci (DYS387S1, DYS449, DYS518, DYS570, DYS576, DYS627) [13, 14].

This study aimed to develop a novel, feasible, amplification system with an increased number of Y-STR loci, high levels of haplotype diversity, and discriminatory capacity. The minimal haplotype (MHT) with nine Y-STRs [15] and the haplotype of the Scientific Working Group for DNA Analysis Methods (SWGDAM) with 11 Y-STRs [16] are still included in the commonly used Y-STR kits in current forensic applications. Thus, all the loci of the Yfiler® Plus kit were selected for inclusion in the novel amplification system. In addition, nine Y-STR loci that exhibited a high degree of polymorphism were also selected for inclusion (DYS388, DYS444, DYS447, DYS522, DYS527a/b, DYS549, DYS596, DYS643). With the exception of DYS596, these additional Y-STR loci were contained in either the AGCU Y24 kit or the PowerPlex® Y23 kit [17, 18]. Thus, we designed the new amplification system based on 36 Y-STR loci in a 6-dye configuration and named it the “36-plex Y-STR multiplex amplification system”, or “36-plex Y-STR system” for short. To evaluate the efficiency of the system, we tested a variety of PCR conditions, including the amount of added Taq polymerase and primers, PCR reaction volume, and annealing temperature, and performed a series of validation studies. This developmental validation study was based on the Scientific Working Group on DNA Analysis Methods (SWGDAM) developmental validation guidelines [16, 19].

Materials and methods

DNA samples

Blood card samples from 247 unrelated males were obtained from the Guangzhou Forensic Science Institute, Guangzhou, China. Nonhuman samples from cattle, sheep, pigs, cats, dogs, chickens, rats, fish, and Escherichia coli were obtained from AGCU ScienTech Incorporation, Wuxi, China and were selected for species specificity studies. The blood stain samples were punched to 1.2 mm in diameter using a BSD 600-DUET stiletto instrument (BSD, Australia) and placed into 96-well plates for direct PCR amplification. The biological material that could not be amplified directly was extracted using the Chelex-100 method for PCR amplification [20]. The project was approved by the Ethics Committee of Southern Medical University, China before the study, and signed informed consent was obtained from all participants prior to sample collection.

Primer design

The 36 selected Y-STR loci were divided into five groups based upon the needs of multiplex amplification. Information about the STR loci is presented in Table 1. Primers were developed for each locus based on unified parameters using Oligo 6.0 software (Premier Biosoft International, Palo Alto CA, USA). Preliminarily amplification specificity of the primers was evaluated using the BLAST function of GeneBank on the NCBI website (https://blast.ncbi.nlm.nih.gov/Blast.cgi). The resulting primer for each locus was optimized and redesigned to meet the needs of multiplex amplification.

Table 1 The information of 36 Y-STR loci

Primers were labeled by fluorescent dyes: primers of DYS392, DYS389I/II, DYS447, DYS438, DYS527 a/b, DYS522, and DYS596 were labeled with 6-FAM fluorescent dye (blue); primers of DYS391, DYS456, DYS19, DYS388, DYS448, DYS385a/b, and DYS549 were labeled with HEX fluorescent dye (green); primers of DYS437, DYS481, DYS533, DYS390, DYS627, DYS458, and DYS460 were labeled with TAMRA fluorescent dye (yellow); primers of DYS393, Y_GATA_H4, DYS439, DYS635, DYS444, and DYS643 were labeled with ROX fluorescent dye (red); and primers of DYS576, DYS570, DYF387S1, DYS449, and DYS518 were labeled with VIG fluorescent dye (purple). Fragments included in the internal lane standard were detected in the orange channel and labeled with SIZ-500. A schematic diagram of the fluorescence detection method is shown in Fig. 1. All primers were synthesized by Shanghai Sangon Biological Engineering Technology & Services Company, Shanghai, China.

Fig. 1
figure 1

The schematic diagram of 36-plex Y-STR system

PCR amplification

Unless otherwise mentioned, PCR amplification was performed on a GeneAmp® PCR System 9700 thermal cycler (Life Technologies, CA, USA). Each amplification reaction contained 10.0 μL PCR buffer mixture containing 125 mM Tris-HCl buffer, 125 mM KCl, 7.5 mM dNTPs, and 5.0 mM MgCl2, as well as 5.0 μL primer set (concentration 0.045–0.360 μM, 1 μL heat activated Taq polymerase (5 U/μl), 0.3–1 ng of template DNA, or a 1.2 mm punch of the blood card sample and ddH2O, to obtain a final reaction volume of 25 μL.

Standard thermal cycling conditions included an initial denaturation step at 95 °C for 2 min, followed by subsequent cycles of denaturation at 94 °C for 30 s, annealing at 60 °C for 1 min, elongation at 72 °C for 1 min repeated 30 times, and followed by a final extension step at 60 °C for 30 min. The DNA sample was amplified under standard conditions, and both positive and negative controls were included in all experiments.

Sample electrophoresis and data analysis

Electrophoresis was performed by Applied Biosystems 3500xl Genetic Analyzer using 36-cm capillary arrays with POP-4® Polymer (Life Technologies, CA, USA). Spectral calibration was performed using the J6 Dyeset with the 6 Dye Matrix Standards. Capillary Electrophoresis (CE) was performed by adding 1.0 μL of the PCR product, or 36 Y-STR allelic ladders, to 10.0 μL of formamide/SIZ-500 mixture (comprising 9.5 μL of deionized Hi-Di™ Formamide and 0.5 μL of Marker SIZ-500 size standard). The samples were denatured for 3 min at 95°C then chilled on ice prior to immediately performing electrophoresis. The PCR products for CE were analyzed on an Applied Biosystems 3500xl Genetic Analyzer with a 3 kV, 10 s injection, and electrophoresis at 15 kV for 1500 s at 60 °C. Fragment sizes and genotyping were determined by GeneMapper ID-X software (Life Technologies, USA) using a peak amplitude of 150 RFU as the peak detection threshold for allele calls.

Species specificity

Cattle, sheep, pigs, cats, dogs, chickens, rats, fish, and E. coli samples were used for the species specificity studies. For each species, a 10 ng of purified DNA was amplified by the 36-plex Y-STR system following the standard PCR protocol.

Sensitivity

Sensitivity studies were performed with control DNA 9948 (Ori-Gene company, USA), and total DNA inputs were prepared in a serial dilution to 1 μL with the following template amounts: 1.0, 0.5, 0.25, 0.125, 0.0625, and 0.03125 ng.

DNA mixtures

A total of 1 ng of human genomic DNA was prepared in a reaction volume of 25 μL for the mixture samples: Male-male mixtures were prepared using 9948 and 007 human genomic DNA, with mixture ratios of 1:1, 1:2, 1:4, 1:8, and 1:15, respectively. Each mixture was tested in triplicate to reduce the accidental error and ensure the accuracy of the results analysis.

Inhibitors study

To validate the anti-interference capability of the 36-plex Y-STR system, six common forensic inhibitors including hematin, indigo, humic acid, calcium ion, EDTA, and hemoglobin were tested. The quantity of control DNA 9948 was held constant at 0.5 ng with the inhibitors at the following concentrations: 25, 50, 75, 100, or 125 μmol/L of hematin; 8, 10, 12, 14, or 16 mmol/L of indigo; 30, 40, 50, 60, or 70 μg/μl of humic acid; 0.6, 0.8, 1.0, 1.2, 1.4, or 1.6 mmol/L of calcium ion; 0.6, 0.8, 1.0, 1.2, or 1.4 μmol/L of EDTA; and 20, 30, 40, 50, or 60 μmol/L of hemoglobin.

Reproducibility

Fifty randomly selected samples from the population study (n = 247) were analyzed by three separate laboratories (Guangzhou Forensic Science Institute and two of its affiliated institutes) to validate the genotyping concordance. The allelic ladder of the 36-plex Y-STR system was detected by both an Applied Biosystems 3130xl Genetic Analyzer (upgraded to the 6-dye module) and a 3500xl Genetic Analyzer, to facilitate comparison of the genotyping results.

Parallel testing

Parallel testing was conducted using two blood FTA® Cards, two buccal swabs, two muscle samples, three nail samples, three salivary stains, and three hair root samples which had been obtained during daily practice. All the above samples were amplified separately using the 36-plex Y-STR system, Yfiler® Plus kit, and PowerPlex® Y23 system, and the genotype results were compared.

Casework samples

Thirty-five real casework samples were used to test the efficiency of the 36-plex Y-STR system. These samples included blood stain samples on different substrates, mixed seminal stains, cigarette butt, and epithelial fractions (swab of bottleneck, chopsticks, padlock, et al.).

Size precision and stutter calculation study

All samples from the population study (n = 247) were used to evaluate the sizing precision of the developed 36-plex Y-STR system. Sizing precision was assessed by running three full injection of allelic ladder. Allelic size was calculated for each allele with GeneMapper ID-X software and then compared with the allele size of corresponding ladder. All samples were analyzed in order to estimate the stutter ratios of the loci contained in the system. The stutter percentage was determined as the stutter peak height divided by the true peak height.

Population study and statistical analysis

A total of 247 unrelated Han males in Guangdong were detected using the developed 36-plex Y-STR system. Allele frequencies were determined using the direct counting method, where the frequency of i-th alleles in locus pi = the frequency of i-th allele /n, where n is the population sample number. The global discrimination capacity (DC) was ascertained by dividing the number of different haplotypes by the number of samples in the studied population [21]. The match probability (MP) was calculated as the sum of the squared haplotype frequencies. Genetic diversities (GD) and haplotype diversities (HD) were calculated according to Nei [22] as HD or GD = n(1 − ∑Pi2)/(n − 1), where n is the total number of samples and Pi is the relative frequency of the i-th allele or haplotype. Population Rst genetic distances and multidimensional scaling (MDS) analyses were obtained using the analysis of molecular variance (AMOVA) tool to assess the genetic structure and diversity among the following populations from the YHRD database: Henan Han [7, 23, 24], Shandong Han [25], Shanghai Han [26,27,28], Shenzhen Han [29], Liaoning Hui [30], Liaoning Mongolian [31], Fuzhou She [32], Qinghai Tibetan [33, 34], Xinjiang Uighur [35,36,37], Rennes French [38], Greek [39], Shizuoka Japanese [40], Daejeon Korean [41], Tripoli Libyan [42], and Singapore Malay [43]. To describe the relationships between populations, a neighbor-joining phylogenetic tree was constructed based on the Rst value using MEGA 6.0 software [44].

Quality control measures

All methods were carried out in accordance with the recommendations of International Society for Forensic Genetics (ISFG), as described by Parson on the analysis of genetic population data [45].

Results and discussion

PCR reaction: cycle number

The 36-plex Y-STR system was tested over a range of amplified cycle numbers. One nanogram of 9948 male DNA was amplified over 28, 29, 30, 31, and 32 cycles on an Applied Biosystems GeneAmp 9700. As expected, full profiles were observed, and an increase in cycle numbers generated higher peak heights in the 25-μL reaction system. The results indicated that a PCR cycle number of 30 was optimal for the developed 36-plex Y-STR system. This cycle number maximized assay sensitivity and minimized the occurrence of off-scale peaks, as demonstrated in Supplemental Fig. S1.

PCR reaction: annealing steps

Operating temperatures can vary slightly between different PCR thermocycler instruments and across different laboratories. The optimal annealing temperature of the 36-plex Y-STR system was determined by conducting the annealing step at 56, 58, 60, 62, 64, 66, and 68 °C, and the effect on the stability and accuracy of the genotyping results was studied. As seen in Fig. 2, accurate genotype profiles were observed at an annealing temperature between 56 and 62 °C. At 64°C, the peak heights of some loci were less than 1000RFU, and when the annealing temperature was increased to 66 or 68 °C, few loci were detected. Nonspecific peaks were not observed at an annealing temperature of 62 °C, which was therefore concluded to be the optimal, recommended, annealing temperature.

Fig. 2
figure 2

Seven annealing temperature were tested:56, 58, 60, 62, 64, 66, and 68 °C. Amplification of 1 ng 9948 DNA was performed for 30 cycles on Applied Biosystems GeneAmp 9700, and 1 μL of each reaction was simultaneously analyzed on an Applied Biosystems 3500 Genetic Analyzer with a 3 kV, 10 s injection

PCR reaction: master mix, primer mix and Taq DNA polymerase

The concentration of the master mix, the primers mix, and of Taq polymerase used in PCR analysis are closely correlated to the quality and accuracy of the genotyping results. One nanogram of 9948 male DNA was amplified in different concentrations of reaction volume (0.5×, 0.75×, 1.0× (standard), 1.25×, 1.5×) in a 25-μL reaction system, and the genotyping results were compared. Results from the testing of the PCR master mix concentration revealed that over 50% of the loci were detected at a reaction mix concentration of 0.5×, and more than 20% were detected at a concentration of 1.5×. Full profiles were obtained from the other concentrations of master mix. Few loci were observed to have allelic drop-out at a primer concentration of 0.5×. All loci were detected from a primer concentration of 1.0× and showed balanced peak height. Variable concentrations of Taq DNA polymerase had little effect on the genotyping results. (Fig. S2, S3, S4).

Species specificity

Non-human genomic DNA samples from common animal species (dogs, pigs, cattle, sheep, cats, chickens, rats, fishes, and E. coli) were amplified using the 36-plex Y-STR system. The results did not reveal the presence of any allele peaks within the genotyping range (Fig. S5). On this basis, it was concluded that the developed STR system was robust and unlikely to be affected by the presence of genetic material from these animal species.

Sensitivity studies

The sensitivity of the 36-plex Y-STR system was tested with a range of DNA inputs from 0.03125 to 1 ng. When the DNA template concentration was between 1 and 0.125 ng, the peak height ratios of the majority of loci exhibiting the same fluorescent color were above 60%. When the amount of template DNA was reduced to 0.0625 ng, the peak height ratios of loci exhibiting the same fluorescent color were less than 60%, but no allelic dropout was observed. When the amount of template DNA was 0.03125 ng, allelic dropout occurred randomly at some loci, especially in those with longer DNA fragments such as DYS627, DYS460, DYS570, DYS449, and DYS518 (Fig. 3). On this basis, it was concluded that the 36-plex Y-STR system could satisfy the Chinese criteria for human fluorescent STR multiplex PCR reagents and provide reliable profiles at a threshold of 150RFU with DNA inputs higher or equal to 0.0625 ng. The sensitivity under these conditions were determined to be 0.0625 ng/25 μL.

Fig. 3
figure 3

Heat maps summarizing the sensitivity results of six serial dilutions (from 1 to 0.03125 ng) of control DNA 9948 (Stochastic threshold was 100RFU). Green represents peak height ratio > 60% within the same color. Yellow represents peak height ratio < 60% within the same color. Red represents allele drop-out

DNA mixtures

Forensic casework samples often comprise a mixture of DNA (male-male or male-female). To distinguish the major and minor male contributors, it is necessary to detect the performance of the 36-plex Y-STR system when applied to DNA mixtures. Male-male mixtures of 9948 DNA and 007 DNA in known ratios of 1:1, 1:2, 1:4, 1:8, and 1:15 were amplified to produce a total amount of 1-ng mixed DNA. Results showed that the peak heights of minor alleles decreased as the mixture ratio decreased. Full profiles were obtained for the minor contributor at non-overlapping and non-stutter positions with mixture DNA ratios ranging from 1:1 to 1:4 (Fig. 4). When the mixture ratio increased to 1:8, a general loss of alleles was observed. These findings showed that the developed 36-plex Y-STR system could satisfy the Chinese criteria for human fluorescent STR multiplex PCR reagents, which requires all alleles to be correctly detected for 1:4 DNA mixtures.

Fig. 4
figure 4

Male:male mixture analysis. Male:male mixture samples (total of 1 ng input DNA) of DNA 9948 and 007 in ratios of 1:1, 1:2, 1:4, 1:8, and 1:15 were amplified using the 36-plex Y-STR system for 30 cycles. A minor contributor allele (alleles located in non-overlapping and non-stutter positions) is presented with the arrow

Inhibitors study

Crime scene DNA samples often contain inhibitors [46], which may interfere with the PCR amplification and can sometimes cause complete amplification failure. One nanogram of control DNA 9948 containing the most common inhibitors (hematin, indigo, humic acid, calcium ion, EDTA, and hemoglobin) was tested to evaluate the robustness of the 36-plex Y-STR system. As seen from the results presented in Table 2, complete DNA profiles were observed with concentrations of up to 50 μmol/L of hematin, 80 μmol/L of hemoglobin, 25 ng/μL of humic acid, 12 mmol/L of indigo, 0.9 mmol/L of calcium ion, and 0.9 mmol/L of EDTA. When the concentrations exceeded these levels, allelic drop-outs were observed in the loci with long fragment size alleles. The exception to this was calcium ion in which allelic drop-out was observed in the loci with short fragment size alleles (such as DYS19, DYS388, DYS447, and DYS533) when the concentration was increased to 1.2 or 1.5 mmol/L.

Table 2 The concentration of the inhibitors added per reaction and the corresponding percentage of the alleles called

Reproducibility

A reproducibility study was performed to validate the reliability and accuracy of the developed 36-plex Y-STR system with correctly identifying DNA samples when used by different laboratories. Fifty samples were tested using the 36-plex Y-STR system in three participating laboratories. The results demonstrated that the genotypes of all samples were consistent with their known profiles (data not shown). To demonstrate the suitability of the system for use on different capillary electrophoresis platforms, both the Applied Biosystems 3130xl (upgraded to 6-dye module) and the 3500xl Genetic Analyzer were used to run the allelic ladder of the 36-plex Y-STR system. The results showed genotype consistency of the same locus across the two CE methods. For example, the electropherograms of the DYS448 allelic ladder were concordant between the two tested CE platforms (Fig. S6).

Parallel testing

To evaluate the compatibility of the different commercial kits available for Y-STR testing, parallel testing was performed using the 36-plex Y-STR system, Yfiler™ plus kit and PowerPlex® Y23 system to amplify control DNA 9948 and 15 samples. The results showed that the genotypes of control 9948 DNA and of the samples were detected consistently by the three systems, as shown in the Supplemental Table S2.

Casework samples

Thirty-five real forensic casework samples were processed using the 36-plex Y-STR system, and the results showed that full DNA profiles were successfully obtained from all samples with the exception of some epithelial cell fractions. Most of the 36 STR loci were successfully detected on swabs from toothbrushes and padlocks, while fewer loci were successfully detected on swab of wallet, and none of the loci were detected from swabs of brick (Table S3).

Stutter calculation and size precision study

Stutter peaks are common artifacts that may be caused by slippage of the Taq Polymerase during the elongation step [47, 48]. The stutter ratio of the 36-plex Y-STR system was evaluated with 247 DNA samples in population studies. The minimum and maximum stutter, the stutter mean and associated standard deviation (SD), and the recommended filter thresholds are shown in Table 3. The mean stutter ratios of all loci were lower than 15%, except those of DYS481 (17.68%) and DYS518 (16.24%).

Table 3 Observed stutter ratios per locus and recommended stutter filter threshold

Sizing precision is critical for accurate genotyping. Size precision of the 36-plex Y-STR system was evaluated by running three full injection of allelic ladder using a 3500xl Genetic Analyzer. Size variability was determined by calculating the standard deviation for each allele [49]. As expected, the fragment size increased with increasing standard deviation of the allele. Very little variation at each locus was seen in the size of the 36-plex Y-STR allelic ladder mix, and most allele deviations were nearly 0.06 base. The maximum SD was close to 0.1 base at locus DYS549 (Fig. 5). These results demonstrate that the 36-plex Y-STR system can ensure proper allele detection that is consistent and within the bin window.

Fig. 5
figure 5

Precision across 24 CE lanes of the 36-plex Y-STR allelic ladder on a 3500 Genetic Analyzer

Population studies

The DNA of 247 unrelated Han males was genotyped by the 36-plex Y-STR system. The data was submitted to the Y-chromosome STR haplotype reference database (YHRD) under accession number YA004330. Relevant forensic parameters were investigated, among which, the gene diversity (GD) value is used to assess the polymorphism of Y-STR loci [50]. As seen in Fig. 6, the highest GD value among the 36 Y-STR loci was 0.9620 at locus DYS385, and the lowest was 0.3992 at locus DYS438. Compared with the Yfiler™ Plus kit, an additional 9 Y-STR loci were included in the 36-plex Y-STR system, and most of them were highly polymorphic, except locus DYS388. Haplotype diversity (HD) and discrimination capacity were calculated to evaluate system efficiency [51]. As presented in Supplemental Table S3, a total of 246 haplotypes were obtained from 247 individuals, from which the obtained HD value was 0.9998641 and the DC value was 0.995951. These results demonstrate that the 36-plex Y-STR system is highly genetically informative.

Fig. 6
figure 6

The values of gene diversity (GD) for 36 Y-STR loci in Guangdong Han population. Markers with an asterisk represent added Y-STR loci compared with Yfiler™ Plus kit

The five existing Y-STR marker sets (MHT, SWGDAM, PowerPlex® Y23, Yfiler® Plus, and AGCU Y24) were compared with the 36-plex Y-STR system in terms of their haplotype-based forensic parameters (haplotype diversity, discrimination capacity, and match probability) using the aforementioned 247 DNA samples obtained from unrelated individuals (Table 4). The 36-plex Y-STR system contained 99.19% of unique haplotypes and thus provided the highest power of forensic discrimination. Overall, the results showed that the 36-plex Y-STR system would be useful in the identification of related individuals.

Table 4 Forensic parameters and diversity values for six different Y-STR systems (n = 247)

In detecting Guangdong Han males, a null allele was observed at locus DYS448 (Fig. 7) in two samples; and triallelic genotypes were detected at locus DYS527a/b in five samples, locus DYS385a/b in three samples, and locus DYS387S1 in two samples. These findings were compared with the other commercial kits such as the Yfiler™ Plus kit and PowerPlex® Y23 kit, and the abnormal genotyping results were found to be consistent across kits (Fig. 8).

Fig. 7
figure 7

Electropherogram showing a null allele at locus DYS448 in a sample detected by 36-plex Y-STR system and checked by Yfiler™ Plus kit and the PowerPlex® Y23 kit

Fig. 8
figure 8

Electropherogram showing three alleles at locus DYS385a/b in a sample detected by 36-plex Y-STR system and checked by Yfiler™ Plus kit and the PowerPlex® Y23 kit

As shown in Fig. S7, a multidimensional scaling (MDS) plot based on 21 shared Y-STRs showed that Guangdong Han population and other Chinese Han populations tend to cluster together at the center of MDS plot. In the neighbor-joining tree (Fig. S8), Guangdong Han population was first clustered with Shenzhen Han population, and second with Henan Han and other Chinese Han populations. The degree of differentiation between the studied population and the reference population available in YHRD database was evaluated by calculation of the genetic distances (Rst) and p values (Table S4). The results indicated that Guangdong Han population was not significantly genetically different from Shandong Han, Shenzhen Han, or Fuzhou She populations (p > 0.0004, 120 pairs), but was significantly genetically different from the other compared populations (p < 0.0004).

Conclusion

This study has optimized and validated a novel 36-plex Y-STR system for forensic genetic testing, that incorporates all of the loci of the Yfiler® Plus kit and additionally includes a further nine Y-STR loci. Developmental validation studies that included PCR conditions as well as testing the cross-reactivity, sensitivity, anti-interference, and stability of the method and population data analysis have demonstrated it to be a sensitive, robust, and highly informative tool for use in forensic casework.