Introduction

Recently, Y-chromosome research has become more important in forensic laboratories. In actual cases, especially in sexual assault cases when the male component of samples is largely dominated by a high background concentration of female component, it is difficult to detect minor male contributors by relying on autosomal short tandem repeat loci (STRs) [1, 2]. In contrast, Y-chromosome-specific STRs (Y-STRs) are more efficient for targeting minor male DNA components. Furthermore, the rapid mutation Y-STRs targeted by the Microreader 40Y ID System are useful for kinship identification, and the kit also contains most of the loci found in other common commercial kits, thereby making the kit more efficient and stable when analysing the samples. The Microreader 40Y ID System contains 20 core loci, 15 preferred loci, and four common candidate loci. Among these loci, DYS570, DYS449, DYS518, DYS576, DYS404S1, DYS387S1, and DYS627 are hypermutant loci which use six-dye fluorescent chemistry. Except for the size standard fluorescence, the blue channel (FAM) includes Y-Indel, DYS393, DYS570, DYS19, DYS392, DYS549, Y GATA H4, DYS444, and DYS593. The green channel (HEX) includes DYS460, DYS458, DYS481, DYS635, DYS438, DYS447, and DYS596. The yellow channel (TAMRA) includes DYS456, DYS389I, DYS448, DYS533, and DYS449. The red channel (ROX) includes DYS391, DYS439, DYS437, DYS385a/b, DYS643, and DYS518. The purple channel (PURP) includes DYS576, DYF404S1, DYF387S1, DYS627, DYS527a/b, DYS557, and DYS645 (Table 1). With the mature six-dye fluorescence system, the selection of loci with higher compatibility can effectively realize the family identification and achieve the precise location of the target family. Meanwhile, in the case of family system determination, accurate individual identification can be achieved effectively, and the overall solution of public security database construction and case analysis can be created [3]. It provides greater identification capabilities for Y database applications.

Table 1 Y-chromosome-specific short tandem repeat loci targeted by the Microreader 40Y ID System. The kit targets 20 core loci, 15 preferred loci, and four common candidate loci, including seven rapidly mutating loci

The aim of the present study was to verify the efficiency and accuracy of the Microreader 40Y ID System. This was achieved by investigating sensitivity, amplification conditions, male-male and male-female mixtures, PCR inhibition, species specificity, reproducibility, and efficacy with degraded samples. For the forensic genetic analysis study, the Y-STRs were also tested using 437 male samples from Tibetan, Han, and Yi populations. In this study, the validation analysis of the Microreader 40Y ID System was performed as the guidelines for DNA analysis methods published by the Scientific Working Group on DNA Analysis Methods (SWGDAM) [4, 5].

Materials and methods

The Microreader 40Y ID System was validated for forensic use under conditions recommended by the manufacturer (Beijing Microread Genetics Co., Ltd.) [6] and several control DNA samples, namely M308 (male 2 ng/μl), F312 (female 2 ng/μl), and 2800 M (male 10 ng/μl). Whole-genome DNA was extracted using the new rapid extraction kit (centrifugal column; Beijing BioTeke Co., Ltd.) from human blood. The blood samples used for direct amplification were collected using 1.2-mm FTA blood cards, whereas the semen samples were collected and extracted using the Chelex 100 method (5% Chelex, w:v).

Samples for genetic analysis

The samples used for genetic analysis were obtained from 437 individuals, including 169 Sichuan Han people, 168 Tibetans, and 100 Yi people. Human blood samples were collected with the approval of the Ethics Committee at the Institute of Forensic Medicine, Sichuan University (K2015008) [7]. DNA was extracted from the Yi samples using the Chelex 100 method (1 ml Chelex, 5%, and 15 μl proteinase K, 20 mg/ml) and from the Han and Tibetan samples using the new rapid extraction kit for whole blood genomic DNA.

All the participants in this study gave their written informed consent and the approval of the study was granted by the Ethical Committee of the Sichuan University. The samples were processed in anonymous form, and the confidentiality of personal information of each study participant was assured.

DNA amplification

All the samples were amplified using the GeneAmp PCR System 9700 (Thermo Fisher Scientific, Waltham, MA USA), according to the Microreader 40Y ID System instructions. Briefly, the 25-μl reactions mixtures included 10 μl Microreader 2.5× Master Mix III, 5 μl Microreader 40YD 5× Primer Mix, 1 μl DNA (0.5–2 ng) or 1.2 mm FTA card, and 9 μl water [4] and were amplified under the following conditions: 95 °C for 5 min, followed by 28 cycles of 94 °C for 20 s and 59 °C for 90 s, 60 min at 60 °C, and a final temperature of 4 °C [8]. A series of gradient experiments were performed to determine the effects of Master Mix, primer concentration, cycle number, reaction volume, annealing temperature, and final extension temperature. Different inhibitors were also used to determine inhibitor tolerance.

To determine the minimum and maximum effective Master Mix concentrations, five concentrations (0.5X, 0.75X, 1X, 1.25X, and 1.5X) were tested, and to determine the minimum and maximum effective Primer Mix, five concentrations (0.5X, 1X, 1.25X, and 1.5X) were tested, and the experiments were all repeated three times. The gradient of the number of cycles was set to 27, 28, 29, 30, and 31, of which 28 cycles were the recommended number of cycles. To investigate the effect of PCR reaction conditions, various reaction volumes (5, 10, 12.5, and 25 μl), annealing temperatures (57, 59, 60, 61, and 63 °C), final extension temperatures (56, 58, 60, 62, and 64 °C), and cycle numbers (27, 28, 29, 30, and 31) were tested.

Amplification analysis

The Microreader 40Y ID System is a six-dye multiplex amplification kit. The kit provides microreader size standard (QD 550) and allelic ladder. The 6-Dye Matrix standard can be used to spectral collected for correcting all the types of fluorescent dyes used in the kit. The separation of amplification products was performed using a 3500XL (Thermo Fisher Scientific, Waltham, MA, USA). And the instrument was used as the default for most studies. Sample setup for capillary electrophoresis (CE) was performed using a 36-cm capillary array and the performance optimized polymer-4 (POP-4 polymer), and the following run condition for a 3500XL:3.0kv for 10 s, input of 1 μl amplification product to 9 μl Hi-Di formamide. The electrophoresis results were analysed using GeneMapper ID-X v1.2 and a threshold of 150 RFU [9].

Sensitivity study

For sensitivity analysis, a serial dilution of male control DNA (M308) was established (1 ng, 500 pg, 250 pg, 125 pg, and 62.5 pg), and the average peak heights of the resulting Y-STR profiles were calculated for each dilution.

DNA mixtures

Male-male DNA mixtures were prepared using two male control DNAs, namely M308 (Beijing Microread Genetics) and 2800M (Life Technologies, Applied Biosystems); the two DNAs were selected to minimize the number of coincident loci types. And male-female DNA mixtures were prepared using M308 in the presence of a high female DNA (F312) background. For amplification, the total male input DNA was maintained at 1 ng, whereas the total female input DNA was maintained at 400 pg. Mixture ratios used for the male-male DNA mixtures were 19:1, 9:1, 3:1, 1:1, 1:3, 1:9, and 1:19. The total input DNA remained at 1 ng for the male-female DNA mixtures, whereas the different concentration gradients of male DNA corresponds to 1 ng, 500 pg, 250 pg, 125 pg, and 62.5 pg input range for the minor male component [8, 10].

Detection and genotyping (sizing accuracy and stutter calculation)

The average base size and standard variance for each locus were determined using the kit’s allelic ladder, and 437 samples were amplified and analysed using a 3500 series Genetic Analyzer and a threshold of 150 RFU. Peaks that were one repeat smaller or larger than the true allele (± 0.5 bases) were considered stutter peaks, and the peak height threshold of stutters was 20 RFU [11].

Species specificity

Total DNA was extracted from Escherichia coli, Mycobacterium tuberculosis, and whole-blood samples from goat, rat, rabbit, chicken, dog, cat, duck, and pig using the new rapid extraction kit for whole blood genomic, and 1 ng of DNA was used for amplification.

PCR inhibition

Because the real world, from which samples are obtained, is relatively complex, a large portion of forensic samples are affected by chemical reagents or other factors. Therefore, to evaluate the effort of PCR inhibitors on the efficacy of the Microreader 40Y ID System, three inhibitors were tested, including humic acid, which binds to DNA, and both haemoglobin and EDTA, which inhibit the action of Taq polymerase. The three inhibitors that were used for testing were humic acid (50, 75, or 100 ng/μl), haemoglobin (250, 500, 750, or 1000 μM), and EDTA (0.25, 0.5, 0.75, or 1 mM).

Effects of ultrasound on DNA

Ultrasound can damage DNA and break it into small fragments, eventually destroying even long DNA fragments. DNA was degraded using an ultrasonic instrument as follows: 200 watts for 10 s per cycle, with 4 s between cycles and cycles repeated 100, 200, 300, or 400 times. The resulting DNA samples then subject to amplification and electrophoresis.

Reproducibility

In actual cases, many different types of body fluid spots are collected. Therefore, the ability of the kit to type the three most common types of body fluids (i.e. blood and DNA from buccal swabs, and semen) were tested. The kit can directly type blood samples without extraction. The detection of DNA samples from different types of body fluids can be used to assess the consistency of genotyping. In the present study, three male DNA samples were extracted from three sampling types: oral swab, whole blood, semen samples, and the fourth type blood sample without extraction. The male oral swab samples used in the experiment were collected using cotton swabs.

Forensic genetic analysis

To evaluate the value of the Microreader 40Y ID System for forensic genetics, the 437 samples were subject to population genetics studies and mutation analyses. Single-copy loci were calculated using an online tool [7] (STRAF, STR Analysis for Forensics, http://www.cmpg.iee.unibe.ch/services/shiny/index_eng.html), and the forensic parameters of the multi-copy loci (DYS385a/b, DYF404S1, DYF387S1, DYS527a/b) were calculated separately. The samples contained three populations: Tibet (168), Han (169), Yi (100). Haplotype diversity (HD), discrimination capacity (DC), haplotype match probability (HMP), and distinct haplotypes were calculated [12].

Results and discussion

Y-STR loci information

All the Y-STR data are presented in Table 1. The kit contained 20 core loci, 15 preferred loci, and four common candidate loci. Among these loci, DYS570, DYS449, DYS518, DYS576, DYS404S1, DYS387S1, and DYS627 are rapidly mutating Y-STRs loci.

PCR reaction component study

Many factors affect PCR amplification, and changes in the composition and conditions of PCR reactions are important factors.

Microreader 2.5× Master Mix III

When the concentration of the master mix increased or decreased to a certain degree, the loci called suffered a corresponding decreased. At 0.75X, 92.59% of the Y-STRs were typed successfully, and only eight loci were not accurately classified. At 0.5X, only 22.22% of the Y-STRs were typed successfully. However, all the Y-STRs were successfully typed at 1X and above (Fig. 1).

Fig. 1
figure 1

Effect of Master Mix concentration on the mean percent of successfully typed Y-chromosome-specific short tandem repeat loci

Microreader 5X Master primer

When the concentration of master primer was 0.5X, 84.26% of the Y-STRs were typed successfully, and all the Y-STRs were successfully typed at 1X and above (Fig. 2).

Fig. 2
figure 2

Effect of Primer Mix concentration on the mean percent of successfully typed Y-chromosome-specific short tandem repeat loci

PCR reaction conditions

The recommended reaction conditions were 25 μl, 59 °C, 60 °C, and 28 cycles, respectively. Reaction volume had no effect on the success of Y-STR typing (figure not shown). In regard to annealing temperature, all the Y-STRs were successfully typed at 57, 59 (figure not shown), and 60 °C, whereas only 98.15 and 83.33% could be typed at 61 and 63 °C, respectively (Fig. 3). Neither final extension temperature nor cycle number affected the success of Y-STR typing either. However, unrelated peaks were observed when using 30 cycles, with peak heights that exceeded the detection threshold of 150 RFU, and when using 31 cycles, the unrelated peaks increased significantly, with even greater peak heights (figure not shown).

Fig. 3
figure 3

Effect of PCR annealing temperature on the mean percent of successfully typed Y-chromosome-specific short tandem repeat loci. Values and error bars indicate mean ± standard (n = 15)

Sensitivity studies

The control male DNA from the kit (M308) was used to evaluate the kit’s sensitivity, with 31.25, 62.5, 125, 250, 500, or 1 ng DNA in 25-μl reaction volumes, and three replicates were performed for each concentration. All the Y-STRs were successfully typed at DNA concentrations of 125 pg and above, and mean peak height increased with increasing DNA concentration, from 443.46 RFU (31.25 pg) to 4002.31 RFU (1 ng), with a peak height threshold of 80 RFU (Fig. 4). At 62.5 pg, the Y GATA H4 locus could not be typed, and at 31.25 pg, only 50% of loci could be typed.

Fig. 4
figure 4

Microreader 40Y ID System sensitivity. Allele recovery is expressed as a percentage. Blue line indicates the average peak height of each DNA concentration

DNA mixture study

Because the real world is relatively complex and mixed spots are often obtained from actual situations, verification of the Microreader 40Y ID System is crucial for ensuring the accurate identification of secondary components in mixed spots. For Y-STR kits, in particular, it is important to identify the performance of male minor contributors in a high female background. Therefore, the ability of the kit to type Y-STRs from both male-male and male-female mixtures was investigated.

Male-male mixture

When the M308:2800M ratio was 1:19, 96.83% of the minor contributor Y-STRs could be typed successfully, and only DYS627 (24), DYS385a/b (11/14), DYS391 (11) DYS449 (type 30), and Y GATA H4 (12) were not typed. When the M308:2800M ratio was 9:1, 99.47% of the Y-STRs could be typed successfully, and only DYS635 (21) was not typed. When the M308:2800M ratio was 19:1, 97.35% of the Y-STRs could be typed successfully, and only DYS635 (21), DYS389I (14), DYS527 (type 19), and DYS438 (type 9) were not typed (Fig. 5).

Fig. 5
figure 5

Successful typing of minor contributors in male:male DNA mixtures. Values and error bars represent means ± SD (n = 3)

Male-female mixture

The M308 and F312 DNAs were mixed at ratios of 1:19, 1:9, 1:3, 1:1, 3:1, 9:1, and 19:1 using 50, 100, 250, 500, 750, 900, and 950 pg of M308 DNA, respectively, in a total of 1 ng DNA. When the M308:F312 ratio was 1:9 (100 pg M308), 92.5% of the M308 Y-STRs could be typed successfully, and only DYS533 (12), DYS527a/b (21), and DYS19 (type 14) were not typed. However, when the M308:F312 ratio was reduced to 1:19 (50 pg M308), only 65% loci of the M308 Y-STRs could be typed successfully (Fig. 6).

Fig. 6
figure 6

Successful typing of minor contributors in male:female DNA mixtures. Values and error bars represent means ± SD (n = 3). X-axis represents the absolute amount of male DNA

Sizing accuracy, precision, and stutter effects

The allelic ladder profile is presented in Fig. 7. Twenty-one allelic ladder samples were run on the 3500XL (Life Technologies; Fig. 8). The standard deviation (SD) of the allelic ladder ranged was from 0.031 (DYS390) to 0.115 (DYS449), which indicated that the detection system was suitably precise (Table 2).

Fig. 7
figure 7

Electropherogram of the Microreader 40Y ID System allelic ladder

Fig. 8
figure 8

Sizing variation of all allelic ladders. Ladders were analysed using a 3500xl Genetic Analyzer (n = 21)

Table 2 Stutter values for the Microreader 40Y ID System markers. The average value plus SD value*3 obtained as the stutter threshold

Species specificity

Capillary electrophoresis (3500xL Genetic Analyzer, Thermo Fisher Scientific, Waltham, MA, USA) results of eight species of non-primate (Fig. 9a). The animal DNA yielded few peaks larger than 200 RFU, but the peaks could not be typed. Meanwhile, the two bacteria yielded no peaks larger than 200 RFU (Fig. 9b).

Fig. 9
figure 9figure 9

a Species specificity of the Microreader 40Y ID System. Few peaks were higher than 150 RFU. b Species specificity of the Microreader 40Y ID System. Using the standard protocol, 1 ng of bacterial DNA was amplified

PCR inhibition

The performance of the Microreader 40Y ID System was compared against three PCR inhibition models. The experimental reactions were performed using 1 ng of control DNA (M308). The humic acid (Fig. 10a), haemoglobin (Fig. 10b), and EDTA (Fig. 10c) yielded expected results. When the concentration of humic acid was increased to 200 ng/μl, all the Y-STRs could be typed successfully, which indicated that the inhibitory effect of humic acid on the kit’s efficacy was insignificant. At 250 μM haemoglobin, all the Y-STRs could be typed successfully, and when the concentration was increased to 500 μM, 99.07% of the Y-STRs could be typed successfully. However, at 750 μM haemoglobin, only 32.41% of the Y-STRs could be typed successfully, and at 1000 μM haemoglobin, PCR amplification was completely inhibited. At 0.75 mM EDTA, all the Y-STRs could be typed successfully, and at 1 mM, 90.65% of the Y-STRs could be typed successfully. However, when the concentration was increased further, to 1.5 mM EDTA, PCR amplification was completely inhibited (Fig. 11).

Fig. 10
figure 10figure 10figure 10

a Effect of humic acid (HA) on the successful typing of male DNA. b Effect of haemoglobin (HE) on the successful typing of male DNA. c Effect of EDTA on the successful typing of male DNA

Fig. 11
figure 11

Effect of three PCR inhibitors. EDTA (blue, five concentrations), humic acid (orange, four concentrations), and haemoglobin (green, four concentrations)

Effects of ultrasound on DNA

To investigate the effect of sample degradation, three samples were irradiated using an ultrasonic apparatus. The samples were fresh Han male DNA. The DNA was exposed to either 100, 200, 300, or 400 cycles of ultrasound. After the DNA was exposed to 100 cycles of ultrasound, all the Y-STRs could be typed successfully. After 200 cycles, the DYS645 locus was not typed, and after 300cycles, even more loci were not typed or yielded type errors. After the DNA was exposed to 400 cycles of ultrasound, even more loci were not typed (Fig. 12). Detailed typing results are presented in Supplementary Table 1.

Fig. 12
figure 12

Effects of ultrasound on DNA. DNA was degraded using an ultrasonic instrument as follows: 200 watts for 10 s per cycle, with 4 s between cycles and cycles repeated 0 (top), 100, 200, 300, or 400 (bottom) times

Case samples

As shown, profiles were obtained for all the samples, which indicated that the kit is suitable both for direct typing and for typing DNA extracted from different types of samples (figure not shown).

Forensic genetic analysis

In addition to the Yfiler plus loci, additional loci (DYS549, DYS643, Y-Indel, DYS447, DYS444, DYS557, DYF404S1, DYS527a/b, DYS596, DYS593, DYS645) were also highly polymorphic. The sample types are presented in Supplementary Table 2.

Genetic diversity (GD), or haplotype diversity (HD), was calculated using as follows: GD = HD=\( \frac{N\left(1-{\sum}_{i=1}^k{p}_i^2\right)}{N-1} \), where pi represents the frequency of the ith haplotype, N represents sample size, and k represents haplotype number. Analysis was performed on 437 samples, which included samples from 169 unrelated Han males, 168 unrelated Tibetan males, and 100 unrelated Yi males. DYS385a/b, DYF404S1, DYF387S1, and DYS527a/b were each considered two separate single-copy markers (Table 3). Other relevant forensic parameters were calculated using an online tool (STRAF, STR Analysis for Forensics, http://www.cmpg.iee.unibe.ch/services/shiny/index_eng.html). Genetic diversity was > 0.5 for all the Microreader 40Y ID System markers, except DYS645, Y-Indel, DYS438, DYS437, DYS391, and DYS593 (Fig. 13). Other parameters included polymorphism information content (PIC), match probability (MP), and power of discrimination (PD; Supplementary Table 3).

Table 3 Forensic parameters and diversity values for Y-chromosome-specific short tandem repeat data from three populations
Fig. 13
figure 13

Variability of the Microreader 40Y ID System markers within three populations. Loci are ranked by genetic diversity

Conclusion

The Microreader 40Y ID System includes 40 polymorphic Y-STR loci that can be used for the sensitive and rapid analysis of samples from a variety of real-world situations. The kit contains all the loci included in the Yfiler Plus PCR Amplification Kit and PowerPlex Y23 System, as well as others. Therefore, the Microreader kit is more accurate and efficient. The present study verified the sensitivity, reaction volume, PCR conditions, and inhibitor effects of the kit, as well as the kits ability to handle mixes or degraded samples and usefulness in local populations. Many factors influencing kit performance in real cases, and the standard system is often not suitable for the analysis of imperfect samples. The Microreader 40Y ID System can generate complete DNA profiles in reaction volumes as low as 6.25 μl, and male minor contributors can be accurately detected when the ratio of either male to female components or of male to male components is as low as 1:19. The most noteworthy characteristic of the kit is its anti-interference ability. In particular, humic acid failed to inhibit PCR amplification, even at 200 ng/μl humic acid. Haemoglobin partially inhibited PCR amplification at concentrations up to 750 μM and completely inhibited amplification at 1000 μM. The inhibitory effect of EDTA was obvious and increased with its concentration. Furthermore, ultrasound significantly damaged the DNA samples, and the number of missing loci increased with the number of ultrasound cycles.

This kit can also be applied to different types of body fluids and, in some cases, can be used for direct amplification, which ultimately saves time (no DNA extraction) and facilitates the progress of actual cases. This kit with more loci also possesses greater recognition capabilities for Y-STR applications compared to other common commercial kit and is compatible with other Y-STR ID systems, thereby ensuring maximum data capacity and consistency, as well as database comparison. In summary, the Microreader 40Y ID System is a powerful, sensitive, and balanced typing tool for forensics.