Introduction

Polymorphic Y-chromosome-specific short tandem repeats (STRs) (Y-STRs) have become a very useful tool both in evolutionary studies and in forensic casework. In particular, Y-STRs can be very useful in paternity testing involving male children to exclude or establish paternity or paternal lineage [6, 10]. Furthermore, the use of Y-STR in sexual assault cases containing a mixture of male and female DNA can improve the chances of detecting low levels of male DNA in a high background of female DNA [5, 9, 12]. Nevertheless, because of the lack of Y-chromosome recombination, Y-STR haplotype diversity (HD) is lower than that of similar autosomal STR panels. Thus, to achieve a power of discrimination (PD) similar to that for autosomal STR sets, a large number of Y-chromosome markers are needed.

A practical approach to improve the potential DC of Y-STR haplotype is to combine the most polymorphic markers into a Y-STR multiplex.

In this study, we report allele frequencies and basic forensic parameters for a new 17 Y-STR multiplex set commercially available as the AmpFLSTR Yfiler polymerase chain reaction (PCR) amplification kit (Applied Biosystems, Forest City, CA, USA) and includes all nine loci of the European minimal haplotype (DYS19, DYS385, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393), two loci recommended by SWGDAM (DYS438, DYS439), and six other markers: DYS456, DYS458, DYS635 (Y GATA C4), Y GATA H4, DYS437, DYS448.

Materials and methods

DNA samples

A total of 155 unrelated, autochthonous healthy males, living in Northeast Italy and the male children of 50 among these, confirmed by autosomal STR analysis using AmpFLSTR Identifiler PCR amplification kit (Applied Biosystems), with paternity probability >99.9%, were studied. Genomic DNA was extracted from peripheral blood by GenomicPrep Blood DNA Isolation Kit (Amersham Biosciences, Piscataway, NJ, USA). The amount of human DNA was determined by spectrophotometry.

Y-STR typing

Aliquots of 10–20 ng DNA were amplified in a total reaction volume of 25 μl consisting of 9.2 μl AmpFLSTR Yfiler PCR reaction mix, 5.0 μl of AmpFLSTR Yfiler kit primer set, and 0.8 μl of AmpliTaq Gold DNA polymerase. Amplification was carried out in a GeneAmp PCR System 9700 Thermal Cycler (Applied Biosystems) under the conditions reported in the user’s manual of the AmpFLSTR Yfiler PCR amplification kit.

Samples were prepared by mixing 8.7 μl of Hi-Di formamide with 0.3 μl of GeneScan-500 LIZ size standard (Applied Biosystems) and 1 μl of PCR product. Separation and detection of Y-STR 17plex PCR products were accomplished with the ABI Prism 3100 Avant Genetic Analyzer four-capillary array system (Applied Biosystems) following the manufacturer’s protocols using POP4 polymer, 36 cm×50 μm capillary array, and the G5 matrix filter set to detect the five dyes 6FAM, VIC, NED, PET, and LIZ. Prior to sample analysis, a spectral matrix was established using matrix standard set DS-33 (Applied Biosystems). Following data collection, genotyping of each samples was carried out automatically using the allelic ladders provided with the Y-17plex amplification kit and AmpFLSTR Yfiler kit template 9 macro (Applied Biosystems).

Statistical analysis

Allele and haplotype frequencies were estimated by gene/haplotype counting. DYS385 shows variation at two loci simultaneously which cannot be differentiated, and it is analyzed as a “genotype,” as recommended by DNA Commission of International Society of Forensic Genetics [4]. Gene diversity (GD) was calculated for each locus using the equation \(GD = (1 - {+AFw-sum {q^{{2+AFw,+AFw,}}_{i} )(n +AFw-mathord{+AFw-left/ {+AFw-vphantom {n {n - 1)}}} +AFw-right. +AFw-kern-+AFw-nulldelimiterspace} {n - 1)}} }\), where n is the sample size and q I is the allelic frequency. The HD for all 17 Y-STR loci, corresponding to the PD or chance of exclusion for unrelated males, was computed with the same formula using haplotype frequencies instead of allele frequencies. The haplotype match probability (HMP), i.e., the probability of finding an identical haplotype in a pair of randomly unrelated males, was calculated as HMP=1−HD [2]. The discriminatory capacity (DC) was determined by dividing the number of different haplotypes by the number of samples in a given population [6]. Mutation rate was estimated as the number of mutations, divided by the number of meiosis analyzed, and statistical calculations were carried out using StatCalc 1.1 (http://www.ucs.louisiana.edu/kxk4695).

Results and discussion

In our sample of 155 unrelated, autochthonous healthy males living in Northeast Italy, we observed a total of 153 distinct haplotypes, among which, 151 were unique (97.40%) and 2 occurred two times (Table S1). Allele frequencies and GD for each Y-STR loci analyzed in this population are shown in Table S2. The number of alleles found for each locus ranged from 3 to 7 with the exception of DYS385 loci, which presented 33 genotypes (Table S3). Almost all Y-STR systems reached values for GD higher than 0.5, except DYS393 (GD=0.492). The present haplotype data were compared with those reported by [1] for Austrian population, the only Caucasian data available for the same set of the 17 Y-STR loci, and no significant differences were observed.

Furthermore, in our population study, alleles 12 of 99.85 bp at DYS456 locus and 14 of 146 bp at Y GATA H4 locus were observed two times (haplotypes n.1, n.2 and n.126, n.128 in Table S1, respectively, as shown in Fig. 1). These alleles are not included in the allelic ladders provided with the AmpFLSTR Yfiler PCR amplification kit (Applied Biosystems) but have been described in other population studies [11, 13, 14].

Fig. 1
figure 1

An example of two off-ladder alleles: allele 12 with 99.85 bp at DYS456 locus (a) and allele 14 with 146 bp at Y GATA H4 locus (b). These alleles are not included in the specific allelic ladders that are shown in the upper part of the picture. The estimated size in base pair of the amplification products is reported below the allele designation

In the 50 father–son pairs analyzed, three mutation events were identified at the DYS389I, DYS456, and DYS458 loci, and, in two events, a reduction of allele size by a single unit repeat was observed (Table 1). In one father–son pair, two mutation events at the DYS389I (allele 12 to allele 11) and DYS389II (allele 30 to allele 29) loci were observed. In this case, both DYS389I and DYS389II differed by a single-unit repeat change in the same direction, and, because the primers designed for these two Y-STR loci encompass the PCR product of DYS389I within that of DYS389II, mutations in DYS389I would automatically lead to lengthening of the whole locus. In other words, this apparent double mutation is a single original event and should be counted as a single mutation. For the three fathers with a mutation, their age at the child’s birth was 23–39 years [average age 30±8.19 (DS) years], while for the 47 fathers without mutations (Table 2), their age at the child’s birth was 20–54 years [average age 33.30±10.14 (DS) years]. A nonparametric test (Mann–Whitney U test) showed that the difference in age between the two groups is not significant (p=0.182). The average mutation rate was estimated at 3.52×10−3 per locus per generation (3/850), with a 95% confidence limit of 0.73×10−3 to 1.03×10−2. This rate is not significantly different (p>0.05) from that reported in other Y-STR studies [3, 7, 8].

Table 1 Y-STR mutations observed in three father–son pairs
Table 2 Age distribution of 47 fathers without mutation events

With regard to the correlation between mutation rate and Y-STR’s structure, our observations fitted in with those of [7]: the mutation always occurred either in the most frequent alleles (as for locus DYS458 when a single repeat is gained) or in alleles longer than the most frequent ones (as for DYS456 and DYS89II when a single repeat is lost). Furthermore, it was noted that mutations occurred mostly in the Y-STR loci with less than 170 bp: DYS456, DYS458, and DYS389I.

The most common way to measure the ability of a Y-STR assay to resolve two unrelated male samples is determined by the HD value and corresponding random HMP. For this 17 Y-plex, HD was 0.9998, HMP was 0.0002, while the overall DC was 98.70%.

Comparing the number of haplotypes produced and the respective HD between the nine Y loci of the European minimal haplotype, 12 Y-plex (PowerPlex Y System, Promega), and the new 17 Y-STR set (Table 3), we can see that the 17 Y-plex has a higher PD and is therefore more informative. The possibility of erroneous sample identification with few Y-STR markers is high if compared with a panel of 17 Y-STRs. This means that it is very important to increase the number of Y-microsatellites for individual identification and paternity testing in forensic genetics to improve the discrimination power.

Table 3 Number of different haplotypes, unique haplotypes, and HD values found for three combinations of Y-STR loci in 155 unrelated individuals