Introduction

Within the past few years, a variety of commercial kits have been developed. As different regions usually choose different loci which are suitable for their own population, data sharing and comparison are becoming a problem that needs to be paid attention to. In 2012, the CODIS Core Loci Working Group recommended 25 STR loci as Expanding CODIS, focusing on facilitating greater discrimination, assisting missing person investigations, and promoting compatibility for international data sharing [1]. However, multiplexing such a great number of STR loci in a traditional 5-dye system seems difficult, so we conceived the idea of developing a 6-dye fluorescent multiplex STR with high tolerance which include more STR loci as comprehensive as possible [supplemental Table 1]. The kit is designed to be more sensitive to male component in mixtures and be more accurate in gender determination, even with Amelogenin-Y dropout [2].

In this study, we developed and validated the EX27 System. This is a 6-dye multiplex system that enables typing of 24 autosomal STR loci, a Y-STR loci, an Indel site, and Amelogenin. The blue channel consists of fluorescein-labeled STR amplicons for the D3S1358, D13S317, D7S820, D16S539, D1S1656, and Penta E loci. The green channel includes DNA fragments labeled with HEX for the TPOX, TH01, D2S1338, CSF1PO, Penta D, and D10S1248 loci. The yellow channel consists of amplification products for the D19S433, vWA, D21S11, D18S51, and D6S1043 loci which are labeled with TAMRA. The amplified products for Amelogenin, D8S1179, D5S818, D12S391, and FGA loci are in the red channel and labeled with ROX. The DNA fragments labeled with VIG for the D22S1045, YIndel, D2S441, DYS391, and SE33 loci are consisted in the purple channel. Fragments included in the internal lane standard are detected in the orange channel and are labeled with SIZ-500 (Supplemental Fig. 1).

SWGDAM developmental validation guidelines [3] are followed in the development and validation of the EX27 System.

Materials and methods

Samples

The sensitivity studies were performed using male 9948 human genomic DNA (Promega, Madision, WI, USA) which amplified three times in a serial dilution with the following template amounts: 1 ng, 500 pg, 250 pg, 125 pg, 62.5 pg, 31.25 pg.

Species specificity studies were conducted using 500 pg purified DNA from human, 1 ng each of purified DNA from chimpanzee and macaque, which were donated from Guangzhou Zoo (Guangzhou, China), 10 ng each of dog, pig, sheep, chicken, mouse, fish, diatom [4] DNA, which were amassed in Guangzhou Forensic Science Institute over years, and a microbial pool (1 ng each purified DNA from Escherichia coli, lactobacillus acidophilus, salmonella, Pseudomonas aeruginosa, Staphylococcus epidermidis, Streptococcus salivarius, Corynebacterium pyogenes, Saccharomyces cervisiae, Candida albicans, and Aspergillus flavus mixed together. All of them were provided by the Institute of Human Virology of Sun Yat-Sen University (Guangzhou, China) and a negative control. The quantity was determined by NANODROP 2000 Spectrophotometer (Thermo Scientific, USA).

Six common forensic inhibitors, including hematin, indigo, calcium ion, humic acid, EDTA, and hemoglobin, were tested in stability studies. The male 9948 human genomic DNA template quantity was held constant at 500 pg while the concentration of inhibitors was varied: 0, 50, 100, 150, or 200 μM of hematin; 0, 5.0, 7.5, 10.0, or 15.0 mM of indigo; 0, 1.0, 2.0, 3.0, or 4.0 mM of calcium ion; 0, 10, 20, 30, or 50 μg/ml of humic acid; 0, 0.5, 0.8, 1.0, or 1.2 mM of EDTA; and 0, 300, 500, 800, or 1000 μM of hemoglobin.

Mixture studies were tested by two sets: male/male mixtures were prepared using male 9948 and male 2800 M human genomic DNA (Promega) with mixture ratios of 1:1, 1:4, 1:9, or 1:19; male/female mixtures were prepared using male 9948 and female 9947a human genomic DNA with the same ratio. These samples were formulated at a concentration of 1 ng in 10 μl, and each of them was amplified in triplicate.

Casework samples were tested at Guangzhou Forensic Science Institute and Forensic Identification Institute of the Third Affiliated Hospital of Guangzhou Medical University. These samples include blood stains, bones, epithelial fractions, hairs, amniotic fluid, underfur, and buccal Indicating FTA® Cards. Buccal Indicating FTA® Cards were amplified directly while others were extracted by QIAamp® DNA Ivestigator Kit (QIAGEN GmbH, Germany).

Reproducibility samples included buccal Indicating FTA® Cards, blood FTA® Cards (GE Healthcare/Whatman), and extracted hair DNA from one male donor. The hair DNA was extracted by QIAamp® DNA Ivestigator Kit (QIAGEN GmbH, Germany), and the quantity was determined by NANODROP 2000 Spectrophotometer (Thermo Scientific, USA).

Stutter calculation study contains 100 extracted DNA samples which were amplified and analyzed on a 3130xl Genetic Analyzer. These samples were collected with informed consent from 100 male donors of Guangdong Han population. These samples were directly amplified by blood FTA® Cards (GE Healthcare/Whatman). These samples were a subset of the 200 population study samples.

Population studies were performed with 200 unrelated individuals (100 males and 100 females) from Guangdong Han population which were collected with informed consent. The blood samples were maintained on blood FTA® Cards (GE Healthcare/Whatman).

DNA amplification

Amplifications were performed on GeneAmp® PCR System 9700 thermal cycler (Life Technologies) using the max mode on a gold plated block. Two PCR amplification reactions were tested: for the population studies, each amplification reaction contained 4 μl of reaction mix, 2 μl of EX27 primers, and 0.4 μl of C-Taq DNA polymerase with up to 3.6 μl of template; for other studies, each amplification reaction contained 10 μl of reaction mix, 5 μl of EX27 primers, and 1 μl of C-Taq DNA polymerase with up to 9 μl of template. According to the manufacturer’s technical manual, unless specifically noted, the following amplification set-up and cycling parameters were used to amplify both extracted DNA and FTA card: initial denaturation of 95 °C for 2 min, 30 cycles of 94 °C for 15 s, 59 °C for 45 s, and 65 °C for 90s, then final extention time for 20 min, and a 4 °C soak. Direct amplification reactions were performed using one 1.2 mm punch from blood FTA® Cards and three punches from buccal Indicating FTA® Cards. Negative and positive controls were included in all experiments.

Electrophoresis and analysis

CE was performed on the 3130xl (16-capillary) Genetic Analyzer using 36 cm capillary arrays with POP-4® Polymer. Reactions were prepared for CE by combining 12.0 μl of Hi-DiTM formamide, 0.5 μl of AGCU Marker SIZ-500, and 1.0 μl of sample or allelic ladder. Samples were denatured for 3 min at 95 °C, then immediately quenched on ice. Standard run conditions involved the following parameters: sample injection for 8 s at 2 kV, electrophoresis at 15 kV for 1500s with the run temperature of 60 °C as indicated in the HID36_POP run module. Spectral resolution was established using the AGCU 6-dye Matrix Standards to assess each fluorescent dye. A peak amplitude of 50 RFU was implemented as the peak detection threshold when analyzing data from CE instruments.

Statistical analysis

The fluorescence balance of heterozygous alleles (intra-locus balance) was calculated as the height of the lower peak divided by the height of the higher peak at one locus. The balance within same fluorescent dye (intra-color balance) was the peak height ratio between the minimum peak height (average height at one locus) and the maximum heights (average height at one locus) in the same group. The balance across all loci (inter-color balance) was calculated in a similar way as intra-color balance.

The sizing precision of STR allele peaks were assessed by calculating the average base pair size and standard deviation of the 16 allelic ladder with ILS (Internal Ladder Standard).

The stutter ratios were calculated as the stutter peak height divided by the peak height of the associated allele.

Population statistics such as heterozygosity (H), discrimination power (DP), polymorphism information content (PIC), and power of exclusion (PE) were calculated by Power Stats v1.2 software. Total discrimination power (TDP) and cumulative probability of paternity exclusion (CPE) were calculated according to the formula from article [5].

Results and discussion

PCR condition studies

In the PCR-based procedure studies, we tested reaction components, cycling number, annealing temperature, and final extension time of the EX27 system.

The reaction components of EX27 system involve the Primer mix, Maste Mix, C-Taq, and ddH2O. As pipetting errors may cause fluctuations, 500 pg 9948 was, respectively, amplified with 0.5×, 0.75×, 1×, 1.25×, 1.5× Primer Mix, Master Mix, and C-Taq to help us see the influence. Large fragments dropout and off-ladder peaks were common at 0.5× (Supplemental Figs. 24). The full profiles were obtained at other concentrations.

For the test of cycling number, 500 pg 9948 was amplified at 28, 29, 30, 31, 32 cycles. The average peak height, respectively, was 674, 1153, 2113, 2724, and 3324 RFU (Fig. 1). As expected, peak heights increased with additional cycles. As pull-up peaks were observed at 31 and 32 cycles, the recommended optimal cycling number is 30 (Supplemental Fig. 5).

Fig. 1
figure 1

Average peak height per dye channel vs the number of cycles for PCRs with 0.5 ng of 9948 genomic DNA. Dye colors are FAM (“F”), HEX (“H”), TAMRA (“T”), ROX (“R”), and VIG (“V”)

For annealing temperature test, the experiments were designed in increments of 2, 55, 57, 59, 61, and 63 °C. The average peak height of each was 1664, 1513, 2070, 1019, and 730 RFU (Supplemental Figs. 6 and 7). As the graph of 59 °C showed nice balance and suitable average peak height, we choose 59 °C as recommended temperature.

For final extension time test, 1 ng 9948 was amplified three times, respectively, with the final hold time 0, 10, 20, and 30 min. Under the conditions of 0 min, D13S317, vWA, D21S11, FGA, D22S1045, and YIndel showed small minus A peaks (Supplemental Fig. 8). As minus A peaks [6] sometimes appeared when we amplified directly with FTA cards with the final hold time of 10 and 20 min, 30 min was chosen to be the appropriate final extension time.

Sensitivity studies

Sensitivity study samples were tested with a serial dilution of following template amounts: 1 ng, 500 pg, 250 pg, 125 pg, 62.5 pg, and 31.25 pg. Each template amounts were amplified three times in three different 9700 thermal cycler. Full profiles were obtained with 9948 from 62.5 pg to 1 ng by the threshold of 50 RFU. Allele CSF1PO and TPOX dropouts were observed with 31.25 pg; obviously, the dropout rates were nominal. Though the new-building 6-sye system got full profiles with 62.5 pg, the peak height of YIndel, SE33, D8S1179, D5S818, D2S441, D12S391, TPOX, and FGA loci was detected below 100 RFU. The percentage of complete profiles is shown in Fig. 2.

Fig. 2
figure 2

Percentage of complete profiles of 9948 human genomic DNA by threshold of 100 and 50 RFU

Species specificity

Ten kinds of non-human genomic DNA samples were tested for cross-reactivity with the EX27 system. Excluding the chimpanzee and macaque, which showed expected cross-reactivity, no peaks were detectable for the nonprimate mammals, fish, or the microbial samples. The result of the diatom showed one peak at 120 bases with the height of about 1500 RFU in the green channel and the other peak at 125 bases with the height of 498 RFU in the purple channel. Diatom widely exists in the air and could be checked out in the liver and lung in drowned corpse (Supplemental Fig. 9). Though the two peaks are non-specific amplifications, we still need to pay attention when testing the drown bodies.

Stability studies

Six common inhibitors (i.e., hematin, indigo, calcium ion, humic acid, EDTA, and hemoglobin) were tested with the EX27 system [79]. When added nothing, full profiles were obtained. With 50 μM of hematin; 5 and 7.5 mM of indigo; 1 mM of calcium ion; 0.5 mM, 0.8 mM, and 1 mM of EDTA; and 300 μM of hemoglobin, we also got full profiles. When the concentration of hematin was raised to 200 μM, complete dropout occurred, the same is true for the 15 mM of indigo, 1000 μmol/l of hemoglobin (Table 1). We compared the inhibitor tolerance for other products from Thermo, Promega, and Qiagen [1012]. For EDTA, the EX27 kit showed a similar performance with GlobalFiler ® express kit (Thermo Fisher Scientific, USA) and the Investigator 24plex QS Kit (Qiagen, Germany), for hematin, indigo, calcium ion, and humic acid, the GlobalFiler® express kit and the Investigator 24plex QS Kit showed better tolerance. As PowerPlex ® Fusion System (Promega, USA) used different DNA template for validation, it is unable to compare these two kits.

Table. 1 Stability studies

Mixture studies

Male/male mixtures and male/female mixtures were prepared and tested with the ratio of 1:1, 1:4, 1:9, or 1:19. All alleles were called for the 1:1 and 1:4 mixtures. At the 9:1 ratio, 81.5∼88.9% of all minor contributor alleles were detected with all six PCR replicates. For the 1:19 mixtures, some autosomal STR alleles dropped out, while no amelogenin, DYS391, or YIndel dropout were found. The call out rates were among 66.7 and 77.8% (Table 2).

Table 2 DNA mixture studies

Case samples

Guangzhou Forensic Science Institute tested the blood stains, bones, and epithelial fractions. Forensic Identification Institute of the Third Affiliated Hospital of Guangzhou Medical University tested the amniotic fluid, underfur, and buccal indicating FTA® Cards (Supplemental Fig. 10). All these samples got accurate and balance profile.

Reproducibility

Three laboratories participated in the reproducibility study which aimed to determine if profiles are reliable and suitable for comparison among laboratories when using the same DNA sample. Each participating laboratory amplified buccal indicating FTA® Cards, blood FTA® Cards, and extracted DNA from hairs. The results show that loci from different samples of the same donor were consistent and same sample got same typing results among the three participating laboratories.

Electrophoresis sizing and stutter calculations

Sizing precision is critical for accurate genotyping, and it was determined by running allelic ladder (Supplemental Fig. 11) standards on multiple CE instruments. To determine the sizing precision of this system, allelic ladder with internal ladder standard (ILS) was injected on 16 capillaries on a 3130 Genetic Analyzer. The average base pair size and standard deviation were calculated for every allele. The largest standard deviation observed was 0.0784, the lowest was 0.0283 (Fig. 3), which was well below the target specification of 0.15 bp.

Fig. 3
figure 3

Precision across 16 injections of the EX27 allelic ladder mix on a 3130 series instrument

To evaluate the sizing accuracy, the size differences between sample alleles and allelic ladder alleles were investigated with 42 samples. The results showed that over 95% alleles were sized within ±0.25 bp of their corresponding allelic ladder peaks and all sample alleles were within ±0.5 bp, which supports that there was low risk of alleles being mistyped due to insufficient sizing accuracy.

DNA samples from 100 different male individuals were used to estimate the stutter ratios. Stutter peaks are amplification by-products which are likely due to slippage [13]. The average stutter plus three standard deviations were used to set the stutter filter threshold (Table 3).

Table. 3 Observed stutter ratios per locus and recommended stutter filter threshold

Population and concordance studies

Population studies were performed with 200 unrelated individuals (100 males and 100 females) from Guangdong Han population that were collected with informed consent. The related genetic parameters such as heterozygosity (H), discrimination power (DP), polymorphism information content (PIC), and power of exclusion (PE) of 24 autosomal STR loci are shown in Table 4. The total discrimination power of the 24 autosomal STR loci was above 0.999999999999, and the cumulative probability of paternity exclusion was 0.999999999943.

Table. 4 The forensic population genetic parameters of 24 autosomal STR loci

As the EX27 System multiplex 24 autosomal STR loci (i.e., D3S1358, D13S317, D7S820, D16S539, D1S1656, Penta E, TPOX, TH01, D2S1338, CSF1PO, Penta D, D10S1248, D19S433, vWA, D21S11, D18S51, D8S1179, D5S818, D12S391, FGA, D22S1045, D2S441, and SE33), DYS391, YIndel, and Amelogenin together, high confidence was placed in the new system because it conserves the proven primer-binding sites from previously released AGCU STR systems. However, five loci (D1S1656, D10S1248, D22S1045, SE33, and Y-Indel) are not contained in the existing concordance data, and further tests will be needed.

Conclusion

The intent of developing a sensitive multiplex kit for forensic testing with increased discrimination power was started from the year 2012. The aim was to facilitate single source comparisons, mixtures, and kinship analyses as well as detecting male only components in the samples. These developmental validation studies tested the EX27 System following the SWGDAM guidelines and Chinese criteria [14]. In conclusion, the new building system provides precise, accurate, and sensitive profiling data.