Introduction

Short tandem repeats (STRs) are indispensable tools in forensic science and population genetics due to the ease of typing and high intra-population and inter-population heterogeneity. During the last decade, researchers have described highly polymorphic Y-chromosome-specific STRs (Y-STRs), which are useful in investigations of such crimes as sexual assaults and in male sibship analysis where the alleged father is deceased or missing. Y-STRs also appear to be useful for the study of genetic relationships among closely related populations [1]. Although a number of multiplex PCR-based typing systems for Y-STRs have been developed [2, 3, 4, 5], (unpublished results C.M Ruitberg and J.M. Butler) and databases are now being constructed, there are few studies on mutations at Y-STR loci, especially in the Japanese population. Valid mutation rate estimates are necessary for paternity testing to avoid false exclusions in cases where there is an allelic discrepancy between the biological father and son due to mutation. The possibility of encountering false exclusions increases when highly polymorphic loci are used to increase discriminating power since mutations may create new alleles.

In the present study we used two multiplex PCR-based typing systems to estimate the overall mutation rate of 14 Y-STRs in the Japanese population from analysis of 161 father/son haplotype transfers in 147 pedigrees. In addition, we determined the haplotypes and the haplotype diversity value observed in our Japanese population to evaluate the discriminatory power of the 14 Y-STRs.

Materials and methods

DNA sample preparation

Blood samples or buccal swabs were collected from 307 healthy Japanese males in 147 pedigrees, and a total of 161 meioses were investigated. All relationships had been confirmed as biological father-son relationships using autosomal STR markers and/or multi-locus or single-locus probe methods. Signed informed consent was obtained from all participants. Prior to collecting the samples, the study was reviewed and approved by the ethics committee of the School of Medicine, Nagoya University. DNA was extracted using a routine organic extraction method. The DNA concentration of each sample was quantified fluorometrically with bisbenziamide Hoechst 33258 trihydrochloride (Sigma, St. Louis, MO) using the DyNA Quant 200 fluorometer (Hoefer Pharmacia Biotech, San Francisco, CA).

PCR amplification and genotyping

We analyzed a total of 14 Y-STRs with 2 multiplex PCR-based typing systems. The loci DYS389I, DYS439, DYS435, DYS19, DYS460, Y-GATA-H4, DYS391, DYS392, DYS437, and DYS438 were analyzed with a 10-plex system originally devised by Ruitberg and Butler (unpublished results, personal communication). In the present study, DYS 389I was substituted for DYS436 because the gene diversity of the latter was estimated to be low in the Japanese population [6]. The six loci, DYS393, DYS19, DYS389II, DYS390, DYS391, and DYS385 were genotyped with the commercially available kit, Y-PLEX6 (Reliagene Technologies, New Orleans, LA). DYS19 and DYS391 are included in both systems. The sequence information of the 14 Y-STRs is shown in Table 1. Alleles were named according to the recommendation by the DNA Commission of the International Society for Forensic Genetics [7]. In the 10-plex system, PCR amplification was performed using the conditions described in our previous study [6] except that the newly designed primer set (1 µM DYS389InF: ATAGATAGATTGATAGAGGGAGGGAT, 1 µM DYS389InR: CTCTCATCTGTATTATCTATGTA) for DYS389I was added to the mixture instead of those for DYS436. The PCR products were loaded onto a Genetic Analyzer 310 (PE Applied Biosystems, Foster City, CA) with GeneScan-500 (ROX) size standard (PE Applied Biosystems), and genotyped using Genotyper 2.5 software (PE Applied Biosystems). The allelic ladder markers were made by ourselves after confirming the sequences of all alleles observed in a Japanese population. In the Y-PLEX6 system, the PCR reaction and subsequent analysis of the amplified products were performed according to the manufacturer’s instructions (Reliagene Technologies).

Table 1 The sequences of repetitive regions and the mutations at 14 Y-STRs in the present study

Haplotyping and sequence analysis

After genotyping all the samples with the two multiplex systems, we listed the haplotypes and checked to determine if there were any allele mismatches within the father/son pairs. In cases where an allele mismatch was observed, we performed sequence analysis.

Statistical analysis

The gene or haplotype diversity (h) value was calculated according to the formula h=n(1−∑xi 2)/(n−1), where n is the number of individuals and x i is the frequency of each allele or haplotype. The maximum-likelihood estimate of the mutation rate across the 14 Y-STRs in the Japanese population was computed as the number of mutations divided by the total number of allelic transfers analyzed. The number of mutation events within a given number of allelic transfers under a given mutation rate follows a binomial distribution. The 95% binomial confidence interval (CI) was determined by the Blyth-Still-Casella procedure [8]. Fisher’s exact test was used to compare our mutation rate estimates with those of Brinkmann et al. [9] for 9 autosomal STRs in German and Austrian populations, Kayser et al. [10] for 15 Y-STRs in German and Polish populations, and Dupuy et al. [11] for 9 Y-STRs in a Norwegian population. We also used Fisher’s exact test to examine gain versus loss of repeats mutations. Statistical calculations were all carried out with StatXact-5.0.3 (Cytel Software, Cambridge, MA).

Results

The haplotypes detected for the 14 Y-STRs in the Japanese population are listed in Table 2, where the pre-mutation allele was regarded as the pedigree allele in each haplotype with mutation events. An unusual triplicate allele was also observed instead of a duplicate allele at DYS385 between both father and son in the haplotype 71 pedigree (Fig. 1). This triplicate allele was regarded as a single haplotype at DYS385 instead of a mutated allele because there was no allelic discrepancy between them. As shown in Table 2, 138 haplotypes were observed in 147 pedigrees, yielding a haplotype diversity value of 0.9983. The values for the 10-plex and Y-PLEX6 were 0.9912 and 0.9898, respectively.

Table 2 The 138 haplotypes at 14 Y-STR loci detected in 147 Japanese father-son pedigrees
Fig. 1
figure 1

Genotyper plot of a triplicate allele at DYS385 locus in No. 71 haplotype shown in Table 1. Upper column allelic ladder marker, middle column father, lower column son

From the 161 meioses in the present 147 pedigrees, 4 mutation events consisting of 4 extra nucleotides were identified each at DYS389I in the haplotype 103 pedigree, DYS439 in the haplotype 68 pedigree, Y-GATA-H4 in 1 of the 5 haplotype 98 pedigrees and DYS389II in the haplotype 101 pedigree, and 1 mutation event consisting of 4 reduced nucleotides at DYS391 in the haplotype 33 pedigree. All the fathers in which mutations were observed were less than 50 years old. Sequence analysis revealed that all of the mutations were single-repeat gain/loss events that occurred in uninterrupted arrays of ≥11 homogeneous tetranucleotide repeats as shown in Table 1. The maximum-likelihood estimate of the mutation rate across the 14 Y-STRs in the Japanese population was 0.22%/locus/meiosis (95% C.I. 0.09–0.51%). Fisher’s exact test showed that this estimate did not differ significantly (p>0.05) from those reported by Brinkmann et al. [9], Kayser et al. [10] and Dupuy et al. [11]. The test did not reveal a significant bias toward gain versus loss-repeat mutations (p>0.05). With the exception of DYS389II, 4 of the 5 mutated alleles were either as long as or longer than the most frequent allele. In addition, all the loci with a mutation event followed a unimodal allele frequency distribution, and 4 out of the 5 loci were “compound” STRs as shown in Table 1.

Discussion

In the present study, 138 haplotypes at 14 Y-STR loci were identified in 147 pedigrees. The allele frequency distribution at each locus and the diversities were not significantly different from those of the Honshu-Japanese in our previous study [6].

With regard to mutation studies on autosomal STRs, various strategies have been adopted such as computer simulations [12, 13, 14], analysis of non-human species [15, 16, 17], and genotype comparisons of family members using blood [9, 13] or cell lines [18, 19]. Such studies revealed the following: there is a positive correlation between the mutation rate and the number of homogeneous repeats [9, 16, 17]; that interspersed irregular repeats have an inhibitory effect on mutation [9, 15]; that the average mutation rate is approximately 0.2%/locus/meiosis [9, 18, 19]; that the majority of mutations comprise one-repeat gain/loss [9, 13, 18, 19]; and one major mechanism for creating a new allele is slipped-strand mispairing [9, 12, 20, 21]. Brinkmann et al. [9] analyzed 23 cases of mutation among 10,844 parent/child allelic transfers for 9 autosomal STRs and reported that no mutations were observed in loci where the allele length versus frequency distribution was not unimodal.

Alternatively, a few studies on Y-STR mutations have been reported previously. Heyer et al. [22] estimated an average mutation rate of 0.21%/locus/meiosis (95% CI, 0.06–0.49%) among 9 Y-STRs in multigenerational pedigrees. Bianchi et al. [23] observed 2 mutations within 1,743 meioses in cell lines but attributed these to somatic, not germline, mutation events. Kayser et al. [10] identified 14 mutations among 4,999 allelic transfers of 15 Y-STRs in German and Polish father/son pairs; the estimated overall mutation rate was 0.28%/locus/meiosis (95% C.I. 0.17–0.43%). Dupuy et al. [11] also identified 5 mutations among 1,200 allelic transmissions of 8 Y-STRs in Norwegian father/son pairs; the estimated overall mutation rate was 0.42%/locus/meiosis (95% C.I. 0.14–0.97%). Although accumulation of data is still not enough, the rate estimates and characteristics of Y-STR mutations reported so far were similar to those of autosomal STRs, which suggests that the general mechanism for both autosomal and Y-STR mutations is slipped-strand mispairing, and is independent of recombination [10]. Kayser et al. [10] also noted that mutations always occurred in either the most frequent alleles or in alleles longer than the most frequent ones, and that the majority of mutations (13 out of 14) were observed at “compound” microsatellites.

The best way to obtain an accurate understanding of mutations at STRs is to investigate allelic transfers within families with biological parental relationships confirmed using other DNA markers [10]. We followed this approach to study Y-STR mutations in the Japanese population, and compared our results to the studies using the same approach [9, 10, 11]. There were no statistically significant differences between our mutation rate (0.22%) and those of other authors, and the characteristics of the mutation events observed in the present study were similar.

With regard to the direction (gain/loss) of Y-STR mutations, there appears to be a bias toward allele expansion mutations. The ratio of increased repeats to decreased repeats was 4:1; that of Kayser et al. [10] was 10:4, but neither of the differences was statistically significant. However, it was impossible to draw any conclusions because the sample sizes and the number of loci analyzed were extremely small compared to the study by Xu et al. [24], who examined 236 mutations in 122 autosomal STR loci and found neither bias despite the fact that the rate of contraction mutations increased exponentially with allele size.

It is generally assumed that different loci have different mutation rates and that different alleles of the same locus have different mutation rates. Because allele frequencies differ among populations, allele-specific or locus-specific mutation rates can vary among them. Recently, Rolf et al. [25] advocated a method for calculating the probability of paternity using the overall mutation rate of each locus, however, we feel it is best to use population, locus, and allele-specific Y-STR mutation rate estimates for accuracy in forensic analyses and population studies. Furthermore, it is noticeable that some Y-STRs are slowly mutating and that such loci are useful for phylogenic studies on humans [26]. The accumulation of data will make it possible to construct alternative combinations of Y-STRs for different purposes.

The allele triplication at DYS385 was observed in one father/son pair in the present study. Since the same event was also reported by Kayser et al. [10], such an event may not be rare in this extremely polymorphic locus. The allele duplications at DYS19, DYS390, and DYS391 have also been reported previously [27, 28, 29]. Consequently, such additional alleles should been taken into consideration in forensic practice because they can be wrongly interpreted as a mixed profile.

For forensic purposes, further studies on Y-STRs, including the construction of a worldwide database with more loci and mutation events at each locus are necessary.