Introduction

Y-chromosomal microsatellites (Y-STRs) have proven to be an excellent tool in sexual assault cases, as the male component of DNA mixtures can be selectively detected without interference of female victims’ DNA [1]. The resulting Y-STR haplotypes also aid in the differentiation of paternal lineages as well as in deficiency paternity cases. In situations of missing persons or paternity cases, a paternal male relative’s profile can be used to support/exclude relatedness when a father’s reference sample is not available [1]. However, due to lack of recombination on the Y chromosome, the discriminatory powers of a single multiplex Y-STR analysis system are insufficient, especially for the differentiation of male lineages in inbred populations [2, 3]. Thus, many studies have attempted to add candidate loci, with high gene diversity, to improve current Y-STR systems [1, 4,5,6]. Adding rapidly mutating (RM) Y-STR loci [4] will increase the potential to distinguish related males [7]. But it may also lead to unexpected exclusion when determining whether a suspect is contained in a paternal lineage [5, 8,9,10]. For familial searching, low-medium mutation Y-STR loci are a better choice and have already played a vital role in China [11].

In this study, a novel Y-STR system, SureID® PathFinder Plus, was developed by us. The SureID® PathFinder Plus Kit allows co-amplification of 41 loci in a 6-dye configuration. It includes all 17 loci from the Yfiler® kit [12] (DYS19, DYS385a/b, DYS389I/II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS456, DYS458, DYS635, and Y-GATA-H4) plus 24 Y-STR loci with reported mutation rates. Among the additional loci, DYS557 [13], DYS593 [14, 15], DYS596 [15, 16], and DYS645 [16] are highly polymorphic; DYS449, DYS518, DYS549, DYS570, DYS576, DYS627, DYS643, DYF387S1a/b, and DYF404S1 are rapidly mutating Y-STR loci [1, 17, 18]; and DYS388, DYS444, DYS447, DYS460, DYS481, DYS522, DYS527a/b, and DYS533 are low-medium mutation loci [11, 19]. Thus, this kit may become a novel application kit both for the discrimination of related males [8, 20] by including loci with high mutation rates and for familial searching due to the use of Y-STR loci with low-medium mutation rates. Moreover, the use of three Y-InDels—rs199815934, rs771783753, and rs759551978—that are least likely to mutate could also contribute to familial searching.

Through optimization of primer sequences and the master mix, the 41-plex Y-STR system can overcome challenges in direct amplification of buccal and blood reference samples on cards, gauzes, or swabs with a fast PCR cycle time of < 100 min. In addition, male DNA can be specifically amplified in the presence of large amounts of female DNA background. To verify the efficiency of the system, in this work, we performed a series of validation studies, including for sensitivity, species specificity, inhibition, precision, and DNA mixture assays as per the guidelines of the Scientific Working Group on DNA Analysis Methods (SWGDAM) [21,22,23].

Materials and methods

DNA samples

Reference samples were collected from randomly selected subjects (834 males and 10 females) for the validation study. In addition, samples from 754 father–son pairs were obtained, and among them 734 pairs were composed of a father and a son, and 10 pairs contained a father and two sons. All donors were recruited from the Han ethnic group in Zhejiang province.

Concordance testing was performed with samples on a variety of substrates. Blood was spotted on FTA® Cards (GE Healthcare, Piscataway, NJ, USA), sterile gauzes, sterile cotton balls, and tissues. The samples on FTA® Cards were created following the manufacturer’s instructions. The other samples included hair roots as well as buccal cells that were collected on sterile cotton-tipped swabs. Samples were stored for up to 24 h at room temperature until they were further processed (up to 2 weeks). For further direct amplification, 1.2-mm disks punched from FTA® Cards and 2-mm-long threads of gauze or hair roots were added directly into reactions. About 2 mm2 of stains was clipped from the dried surfaces of the swabs, blood-spotted cotton balls, or blood-spotted tissues for direct amplification. Because the entire swab is not consumed in the reaction, resampling was a feasible option if the profile was not acceptable.

The control DNAs 9948 and 9947A were purchased from Promega Corporation (Madison, WI, USA), while control DNA 007 was supplied with the Yfiler® Plus Kit. The 9948 DNA was used for the sensitivity and inhibition studies, and all three control DNAs were applied in mixture studies. DNA concentrations were validated by measuring the absorbance of samples at 260 nm by using a NanoDropTM 2000 Spectrophotometer (Thermo Fisher Scientific Inc., Wilmington, DE, USA). When necessary, DNA input was diluted with TE buffer (10 mM Tris-HCl pH 8.0 and 0.1 mM EDTA).

Species studies were conducted using blood samples from domestic animals (dog, chicken, horse, bovine, rabbit, duck, sheep, fish, and pig) and Escherichia coli amplified for 28 cycles.

All human samples were collected from healthy volunteer donors after informed consent was given according to the ethical guidelines of the Helsinki Declaration. This project was approved by the Ethics Committee of the Medical College, Shaoxing University, before the study.

PCR amplification and thermal cycling conditions

The SureID® PathFinder Plus Kit was optimized in a 10-μL total reaction volume consisting of 5.0-μL Master Mix and 2.5-μL Primer Mix. PCR samples were amplified in MicroAmp® Optical 96-well reaction plates (Thermo Fisher Scientific, Waltham, MA, USA) with a GeneAmp PCR system 9700 (Thermo Fisher Scientific) using the following conditions: enzyme activation at 95 °C for 5 min; 28 cycles of 94 °C for 10 s, 60 °C for 1 min, and 70 °C for 30 s; and 60 °C for 15 min, followed by a final hold at 4 °C. Cycle optimization was carried out using control DNA 9948. The optimal annealing temperature of the SureID® PathFinder Plus Kit was determined by conducting the annealing step at 58 °C, 59 °C, 60 °C, 61 °C, and 62 °C using control DNA 9948 or a bloodstain on an FTA® Card.

PCR primer set and PCR master mix components

The primers for each locus were designed with Primer Premier 5.0 (Premier Biosoft International, Palo Alto, CA, USA). The robustness of the primer set was determined by varying each primer pair in the multiplex in increments of 10% up to ± 20% compared with the optimized formulation and comparing performance based on intracolor balance, peak height, and alleles recovered. The five dyes used in the SureID® PathFinder Plus Kit to label amplified sample products were 6-FAMTM (blue), HEXTM (green), TAMRATM (yellow), ROXTM (red), and VIGTM (purple) (Fig. 1). The sixth dye, LIZTM (orange), was used to label the SIZE-580 internal size standard. The fragments of SIZE-580 were 75, 87, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 515, 530, 550, and 580 bp. The use of an additional dye channel (purple) enabled the addition of new loci while maintaining the marker range to less than 550 bp. All primers were synthesized by Ningbo Health Gene Technologies Co. Ltd., Ningbo, China.

Fig. 1
figure 1

Diagram illustrating PCR product size ranges of the 41 Y-STR loci in the SureID® PathFinder Plus PCR Amplification Kit. The five dye panels were 6-FAMTM (blue), HEXTM (green), TAMRATM (yellow), ROXTM (red), and VIGTM (purple). The sixth panel was for internal size standard labels with an orange dye: SIZE-580 (75, 87, 100, 125, 150, 175, 200, 225, 250, 275, 300, 325, 350, 375, 400, 425, 450, 475, 500, 515, 530, 550, and 580 bp)

The components of the SureID® PathFinder Plus Master Mix included DNA polymerase, buffer, salts, dNTPs, carrier protein, detergents, and 0.05% sodium azide. Three DNA samples on an FTA® Card were tested in triplicate for the standard PCR master mix formulation and six modified master mixes.

PCR product electrophoresis and data analysis

The Applied Biosystems 3500xL Genetic Analyzer (Thermo Fisher Scientific) was used as the default for most studies to separate and detect the PCR products unless mentioned otherwise. The dye set of HGT 6-Dye Matrix Standards (Ningbo Health Gene Technologies Co. Ltd.) was used for spectral calibration. Capillary electrophoresis (CE) was as follows: 1 μL of amplified product or allelic ladder and 0.5-μL SIZE-580 were added to 8.5 μL of deionized Hi-Di™ Formamide (Thermo Fisher Scientific). Samples were denatured at 95 °C for 3 min then chilled on ice prior to electrophoresis. Sample injection was performed in a POP-4 polymer (Thermo Fisher Scientific) with the following run conditions: injection at 1.2 kV for 15 s and electrophoresis at 15 kV for 1500 s. GeneMapper® ID-X Software v1.2 (Thermo Fisher Scientific) was used to analyze the results with a 150 relative fluorescence unit (RFU) peak amplitude threshold for allele calls.

Sensitivity

To evaluate the sensitivity of the SureID® PathFinder Plus Kit, allele recovery performance was measured for control DNA inputs ranging from 0.125 to 2 ng. Samples were prepared by a serial dilution of male control DNA 9948 (2 ng/μL stock) in DNA Suspension Buffer. Total DNA inputs tested were 125, 250, 500, 1000, and 2000 pg. Non-template controls (NTC) were included.

Male specificity

All of our selected loci are located on the non-recombination region of the Y chromosome. Moreover, the Master Mix and PCR thermal cycling conditions of the SureID® PathFinder Plus Kit were optimized to achieve male specificity. Female control DNA 9947A and 10 female blood samples were utilized to validate the male specificity of the kit. In a reaction volume of 10 μL for the mixture samples, a total of 1 ng of male DNA input in the presence of total 1 μg female DNA input was maintained. The potential amplified artifacts from the females were analyzed.

DNA mixtures

Male–male mixtures of control DNAs 9948 and 007 in known ratios of 1:1, 1:2, 1:4, 1:8, and 1:15 were amplified, respectively. Each mixture was tested in triplicate to reduce the accidental error and ensure the accuracy of the results.

Species specificity

Genomic DNA from blood samples of nine male domestic animals were extracted and purified by organic extraction. E. coli DNA was isolated using the Ezup Column Bacteria Genomic DNA Purification Kit (Sangon Biotech, Co., Ltd., Shanghai, China). As per the standard PCR protocol, 10 ng each from the 10 species was used to assess the species specificity of the kit. Non-template controls (NTC) were included. The corresponding products were analyzed in triplicate.

PCR inhibition models

Three common forensic inhibitors including humic acid (Sigma-Aldrich, Darmstadt, Germany), hematin (Sigma-Aldrich), and indigo (Sigma-Aldrich) were chosen to assess the performance of the SureID® PathFinder Plus Kit. Stock solutions of high concentration were prepared by dissolving each of the inhibitors in 0.1 N NaOH (hematin and indigo) or in DNA Suspension Buffer (humic acid). They were further diluted in water to achieve working stocks. One nanogram of control DNA 9948 along with each inhibitor in various concentrations was amplified with the SureID® PathFinder Plus Kit using the standard protocol.

Population and forensic statistical analysis

The genotypes of 834 unrelated male subjects from the Zhejiang Han population were determined using the SureID® PathFinder Plus Kit under standard conditions. The data have been submitted to Y chromosome STR haplotype reference database (http://www.yhrd.org) under accession number YA004678. Allele frequencies were determined using the direct counting method. The discrimination capacity (DC) was ascertained by dividing the number of different haplotypes by the total number of individuals in the population [19]. The match probability (MP) was calculated as the sum of the squared haplotype frequencies. Genetic diversities (GD) or haplotype diversities (HD) were calculated as HD or GD = n(1 − ∑Pi2)/(n − 1) according to Nei [24], where n stands for the total number of samples and Pi is the frequency of the ith haplotype.

In addition, the total number of meioses (n = 754) that were previously confirmed by autosomal STR was used to detect the mutation rate. These father–son pair samples, in the form of hair roots, buccal swabs, or blood samples cards, were obtained from the Zhejiang Han population. Mutation rates were calculated as the number of mutations observed divided by the total number of meioses. The 95% confidence intervals (CI) of the mutation rates were estimated from a frequentist approach assuming a binomial distribution using the website http://statpages.org/confint.html.

Performance testing

For performance testing of reference samples on a variety of substrates, blood spotted on FTA® Cards (n = 53), sterile gauze (n = 76), sterile cotton balls (n = 3), and tissues (n = 6) was amplified with the SureID® PathFinder Plus Kit as per the standard conditions. Buccal samples on swabs (n = 5) and hair roots (n = 4) were also amplified with the kit using standard conditions.

Sizing accuracy, precision, and stutter effects

Sizing accuracy was defined as the deviation in size of each sample allele from the corresponding allelic ladder allele. Bloodstain samples on 54 FTA® Cards and 76 gauze samples were used to measure sizing accuracy. Punches (1.2-mm disks) of blood on FTA® Cards or threads of blood on gauze (2 mm long) were amplified with the SureID® PathFinder Plus Kit as per the standard PCR conditions of 28 cycles on an Applied Biosystems 3500xL Genetic Analyzer. Allelic ladder sizing precision was measured by calculating the average base pair sizes and standard deviation for each allele from multiple injections of the SureID® PathFinder Plus allelic ladder on the same CE instrument.

Stutters of the SureID® PathFinder Plus Kit were measured by dividing the stutter peak height (n ± 1 repeat units, or n ± 2 nt) by that of the true allele peak height (n repeats) and are expressed as a percentage. The same batch of reference samples (54 blood samples on FTA® Cards and 76 blood samples collected with gauze) was run on an Applied Biosystems 3500xL Genetic Analyzer using a 150 RFU peak height minimum (minimum stutter peak height of 20 RFU) to calculate the stutter ratios. All saturating allele peaks were removed from the stutter calculation.

Statistical analysis

Intracolor balance was calculated as the lowest peak height in each dye channel divided by the highest peak height in the same dye channel and is reported as a percentage. For the four multi-copy loci (DYS385, DYF387S1a/b, DYS527, and DYF404S1), the peak heights were averaged prior to calculating the intracolor balance, and if the alleles were of the same length, the peak heights were halved prior to intracolor balance calculation. Statistical analyses were performed using Minitab® (Minitab Inc., State College, PA, USA) software.

Results and discussion

PCR reaction-cycle number

The SureID® PathFinder Plus Kit was tested over a range of amplifying cycle numbers. One nanogram of control DNA 9948 was amplified over 26, 27, 28, and 29 cycles in the 10-μL reaction system. As a group, an increase in the cycle number generated a higher peak height as demonstrated in Supplementary Fig. S1. Full profiles were observed over 28 and 29 cycles. The results indicated that a PCR cycle number of 28 was optimal with respect to maximizing assay sensitivity while minimizing the possible occurrence of off-scale peaks or allelic drop-out events. A representative profile generated under the standard condition of 28 amplification cycles is shown in Supplementary Fig. S2 for control DNA 9948.

As mentioned in a previous study [1], many factors such as the collection method, sample type (blood vs. buccal), collection device, and even donors’ white blood cell (WBC) counts or buccal-shedding levels can affect the amount of DNA available for direct amplification. Thus, for a better amplification effect, a laboratory may need to adjust the cycle number based on their samples and CE instruments. Representative profiles generated with in-house sample sets for six different substrates under optimal conditions of 28 amplification cycles are shown in Supplementary Fig. S3.

PCR reaction—denaturing, annealing, and final extension steps

Diverse key parameters in the thermal cycling conditions (denaturing temperature, annealing/extension temperature, and final extension time) can affect the robustness of a genotyping system. One nanogram of male control DNA 9948 and the FTA® Cards were used to search for the optimal thermal cycling conditions for the SureID® PathFinder Plus Kit. There was no significant alteration upon changing the denaturing temperature by ± 1.0 °C from the optimal 95 °C (data not shown). Temperatures of 58 °C, 59 °C, 60 °C (optimal condition), 61 °C, and 62 °C were tested for the annealing/extension step, and full profiles were obtained at each of the conditions. For male control DNA 9948, average peak heights at 58 °C, 59 °C, and 61 °C were relatively higher than those at 61 °C and 62 °C (Fig. S4). For 1 ng of 9948, it appeared to have a relatively intense distribution dispersal in intralocus regions and had better performance in intracolor balance at 60 °C, although elevations of the average peak heights over all dye channels were relatively low compared with the others. For blood samples on FTA® Cards, better intralocus and intracolor balances were observed at temperatures of 61 °C, and significant elevations of the average peak heights over all dye channels were observed at temperatures of 60 °C (Fig. S5). Thus, a denaturing temperature of 60 °C was considered to be the right balance between the two kinds of samples. Furthermore, a range of final extension times (5–20 min) was tested in order to ensure nucleotide addition on all amplified products. The results show that shortening the extension time (less than 8 min) will lead to incomplete adenosine addition, especially for loci DYS19, DYS390, and DYS448 (data not shown). The optimal final extension time was determined to be 15 min.

Sensitivity studies

The sensitivity of the SureID® PathFinder Plus Kit was tested with a serial dilution of DNA 9948 from 125 pg to 2 ng in triplicate. Full Y-STR profiles were obtained for DNA inputs greater than or equal to 250 pg (Fig. S6) with the analysis threshold set at 150 RFU peak height. When the 9948 DNA inputs decreased to 125 pg, allele drop-outs were observed. On this basis, we concluded that the SureID® PathFinder Plus Kit can provide reliable profiles at a threshold of 150 RFU with DNA inputs equal to or greater than 250 pg. The sensitivity under these conditions was determined to be 0.25 ng/10 μL.

Male specificity

The SureID® PathFinder Plus Kit is designed to test only male samples. Using amplified pure female DNA as an input, even with 1 μg of control DNA 9947A there were no meaningful profiles produced. However, biology samples from forensic casework often contain high levels of female DNA background such as in sexual assault mixtures. In such situations, female cross-reactive amplification products could be observed. To guarantee an accurate profile interpretation, the SureID® PathFinder Plus Kit was optimized to minimize the artifact peak height to be well below the 50 RFU peak amplitude threshold in the presence of female DNA inputs. One nanogram of control DNA 9948 was mixed with 1 μg 9947A or 1 μg DNA isolated from each of the 10 female blood samples for amplification. The results showed that none of the possible amplified artifacts from the above female samples would affect profile interpretation in the read region (82–540 bp). A representative profile generated from a male:female mixture in the presence of 1 ng 9948 and 1 μg 9947A is shown in Supplementary Fig. S7.

DNA mixture studies

Male:male DNA mixtures are often encountered in forensic casework. In order to determine the performance of the SureID® PathFinder Plus Kit when dealing with such samples, the minor and major male contributors were distinguished. Results showed that as the mixture ratio decreased, the peak height of minor alleles decreased. Full alleles of the minor contributor at non-overlapping and non-stutter positions were obtained from a 1:1 to a 1:4 ratio (125-pg minor contributor DNA input). A representative profile generated at a ratio of 1:4 is shown in Supplementary Fig. S8. General losses of alleles were observed for ratios of 1:8 and 1:15, and the unique minor profile was called for an average of 80% and 50%, respectively. Thus, the SureID® PathFinder Plus Kit requires all alleles to be correctly detected for 1:4 male:male DNA mixtures. However, direct amplification of mixtures is not recommended when the original DNA ratios of contributors in the starting material are unknown.

Species specificity

Non-human genomic DNA samples from common animal species (dog, chicken, horse, bovine, rabbit, duck, sheep, fish, pig, and E. coli) were amplified using the SureID® PathFinder Plus Kit. Results show that no cross-reactive peaks were observed within the genotyping range above the peak amplitude threshold of 150 RFU (Fig. S9). For fish, an “OL” peak (peak height 140 RFU) was observed for locus DYS635, which may disturb genotyping analysis. However, on this basis, the novel Y-STR system is unlikely to be affected by the presence of genetic material from these animal species.

Models of inhibition

The wide range of forensic sample types and variety of sampling conditions encountered make forensic analysis particularly difficult. For as long as forensic scientists have used the PCR, PCR inhibitors that often lead to locus dropout have been an obstacle to success [25]. PCR inhibitors exert their effects mainly through direct interaction with DNA or interference with thermostable DNA polymerases [26]. Common sample types that are known to contain inhibitors include blood, tissues, fabrics, and soil. Other pivotal sources of inhibitors are the materials and chemistries that come into contact with samples during the upstream extraction.

The SureID® PathFinder Plus Kit is designed for the direct amplification of pristine reference samples such as blood and buccal samples. Therefore, DNA purification, which is the method used most often to remove inhibitors, is not used. However, actions have been taken to cope with samples from a variety of substrates and inhibitors such as increasing the amount of DNA polymerase in the reaction and using BSA that has been rigorously quality-tested. Mock inhibition samples with 1 ng of male control DNA 9948 containing varying concentrations of humic acid [27, 28], hematin [28], and indigo [29] were prepared to evaluate the inhibitor tolerance of the SureID® PathFinder Plus Kit. Full profiles were obtained with humic acid concentrations up to 200 ng/μL, hematin concentrations up to 40 μM, and indigo concentrations up to 10 mM (Fig. S10). Allelic dropouts were observed in loci with long fragment size alleles when the concentrations exceeded these levels. Meanwhile, smaller loci in the kit were preferentially amplified. It should be noticed that inhibited samples may mistakenly be assumed to be degraded, as the same pattern is typical of highly degraded DNA templates.

Population studies and mutation rates

The population data were used to calculate GD, HD, DC, and MP values. The number of unique haplotypes observed was higher with the SureID® PathFinder Plus Kit compared with three former Y-STR kits (Table 1), and our novel kit showed improved haplotype diversity and discrimination capacity. Using the SureID® PathFinder Plus Amplification Kit, 807 (98.32%) unique Y-STR haplotypes (Table 1) were generated from the Zhejiang Han population. The increased number of analyzed loci improved the DC and HD. Compared with Yfiler®, the SureID® PathFinder Plus Kit provided higher power forensic discrimination, which raised the DC from 94.00 to 98.32%. Results showed GD values above 0.4 for all markers of the SureID® PathFinder Plus Kit, except for DYS645 and DYS438 (Fig. 2). The multi-copy loci DYS385a/b, DYF387S1a/b, and DYS527a/b exceeded a GD value of 0.9.

Table 1 Haplotype diversity, match probability, discrimination capacity values as well as number of unique and shared haplotypes observed of the SureID® PathFinder Plus were calculated and compared with three other Y-STR Kits
Fig. 2
figure 2

Y-STR markers of the SureID® PathFinder Plus Kit ranked by genetic diversity. Gray bars indicate the multi-copy loci

A total of 754 father–son pairs were analyzed at 41 Y-STR loci as well as three Y-InDel markers. At all of these loci, 119 mutations were observed from 113 father–son pairs, while no mutations were found in rs199815934, DYS388, DYS19, DYS645, rs771783753, rs759551978, DYS643, DYS437, DYS557, DYS438, DYS389I, DYS596, Y-GATA-H4, DYS393, DYS390, or DYS593. Overall, there was no appreciable difference between the numbers of repeat gains (57) and losses (62). The DYS458 locus had the highest mutation rate (> 1 × 10−2) among the single-allele loci, while DYF404S1 had the highest mutation rate among the multi-copy loci (Table 2). Additionally, among the 119 observed mutations, 117 were single-step mutations, and only two followed two-step mutations (father–son pairs no. 1 and no. 2), which are shown in Supplementary Fig. S11. Five of the father–son pairs had mutations at two loci, in which DYS456-DYS449, DYS533-DYS391, DYS389II-DYS570, DYS458-DYS527, and DYS527-DYF404S1 each appeared once. One of the mutations occurred during the meiotic transfer from the father to one of his two sons (father–son pair no. 3), which is also shown in Supplementary Fig. S11. In addition, a mutation was only observed at one locus in each of the meiotic transfers, except for a special father–son pair. In this case, for father–son pair no. 4 shown in Supplementary Fig. S11, both alleles of DYF404S1 had mutations simultaneously if the single-step mutation mode is given precedence over the two-step mutation mode.

Table 2 The mutation rates of the 41 Y-STR loci in the Han nationality of Zhejiang province

Performance testing

In routine forensic applications, blood samples are often stored on FTA® Cards or sterile gauze. Sometimes, non-professionals choose to reserve blood samples on sterile cotton balls and even disposable tissues. Hair roots or buccal samples on sterile swabs are also very common both in criminal cases and in paternity identification. As the former sample types are less contaminated, performing isolation and purification of DNA is often unnecessary. The performance of the SureID® PathFinder Plus with DNA samples on five different substrates as well as hair roots was determined with respect to intracolor balance and average peak height (Fig. 3). The results showed no clear difference in performance among the six different kinds of samples with respect to average peak height, except the blood samples on sterile gauze had relatively high average peak heights compared with the others. When comparing the average intracolor balance (expressed as a percentage) values among the six sample types, the balance of the 6-FAMTM dye channel was relatively weak compared with the others. This unbalance may be explained by the greater number of loci that comprise the 6-FAMTM dye channel or influence from the storage substrates. Without using extracted DNA, the obvious deviations of intracolor balance among the six sample types might be improved by adopting other storage substrates and/or collecting devices, such as Bode Buccal DNA CollectorsTM (Bode Technology), Omni SwabsTM (GE Healthcare/Whatman), WhatmanTM EasiCollectTM Devices (GE Healthcare Life Sciences), PunchSolutionTM Reagent (Promega Corporation), or SwabSolutionTM Reagent (Promega Corporation) prior to amplification. Of course, the increased cost of genotyping would be obvious. In addition, a soaked swab is difficult to reuse.

Fig. 3
figure 3

Performance comparison of reference samples on a variety of substrates. Blood spotted on FTA® Cards (n = 53), sterile gauze (n = 76), sterile cotton balls (n = 3), and tissues (n = 6) was amplified with the SureID® PathFinder Plus Kit. Buccal samples on swabs (n = 5) and hair roots (n = 4) were also amplified with the kit using the same standard conditions. Electrophoresis was conducted on the same 3500xL Genetic Analyzer for all samples. Each box represents the 25th through the 75th percentile of the observed values, with the center line indicating the median. The range of the data was indicated by whiskers. The black dots represent outliers that are different from the mean by more than twice the pooled standard deviation

Accuracy, precision, and stutter

Sizing accuracy for the SureID® PathFinder Plus Kit was determined by comparing size differences between bloodstain samples (54 FTA® Cards and 76 gauze samples) and the alleles of the allelic ladder. All sample alleles were within ± 0.5 bp of a corresponding allele in the allelic ladder (Fig. S12). Sizing precision was assessed by calculating the standard deviation in the size values obtained for the SureID® PathFinder Plus Kit allelic ladder alleles of 10 injections on a 3500xL. The targeted standard deviation of less than 0.2 was observed for all alleles (Fig. 4). Thus, the accuracy and precision levels of the novel kit are sufficient to distinguish off-ladder peaks or microvariants. This self-imposed size limitation enables effective 1-bp resolution of microvariants on a 3500xL.

Fig. 4
figure 4

Sizing precision was assessed by using allelic ladder alleles of 10 injections on a 3500xL

Stutters derive from slippage at the polymerase/strand interface and appear as minor byproducts of PCR amplification [30,31,32]. To avoid complications in profile interpretation, the expected stutter percentage at each STR marker should be evaluated for a new commercial kit. In this study, 130 blood samples (54 FTA® Cards and 76 gauze samples) were directly amplified using the SureID® PathFinder Plus Kit to obtain the percentage of stutter products. Mean stutter percentage values for each locus were calculated for both of the sample types, respectively. The values plus three standard deviations were used to make a stutter file for GeneMapper® ID-X v1.2 (Table S1). One repeat unit (n − 1) smaller than the adjacent true allele peak (n) dominated the stutter products. One repeat unit (n + 1) larger than the target allele (n) came in a close second. The trinucleotide repeat locus DYS481 had the highest minus (n − 1 repeat unit) stutter filter value. The differences in stutter occurrences between blood spotted on FTA® Cards and sterile gauze were discernible. Stutter effects of blood spotted on FTA® Cards were generally better than with gauze.

Conclusion

The SureID® PathFinder Plus Kit is compatible with the Yfiler® Kit, Yfiler® Plus Kit, and PowerPlex® Y23 System. Co-amplification of 41 loci and 3 Y-InDel markers enhances the multiplexing capability. Therefore, this novel kit has the power to distinguish related males and familial searching. In order to accelerate and simplify the process, optimization of primer sequences and master mix enables the SureID® PathFinder Plus Kit to deal with direct amplification challenges within 100 min for hair roots, blood, or buccal reference samples on different substrates. We performed a series of developmental validations of the SureID® PathFinder Plus Kit and found that it has stable allele sizing accuracy and precision, good intracolor balance, good repeatability, male specificity, and high sensitivity. Three external labs confirmed and reproduced the results of the SureID® PathFinder Plus Kit in beta-testing. All of the above results demonstrate the robustness and validity of the SureID® PathFinder Plus Kit for use with forensic reference samples. Quality Y-STR genotypes can be obtained with significantly less labor. This novel kit will be a dependable and convenient tool for distinguishing related males, for familial searching, and in DNA database construction.