Introduction

The aim of this work was to establish a database for the Afghan population for forensic purposes including paternity testing. We present here the allele frequencies, mutation rates and forensic efficiency values for the 16 STR loci D3S1358, VWA, FGA, TH01, TPOX, CSF1PO, D5S818, D13S317, D7S820, ACTBP2, D2S1338, D16S539, D19S433, D21S11, D18S51, and D8S1179 in a sample of 333 unrelated Afghan immigrants, 169 men and 164 women, seeking asylum in Germany.

Materials and methods

Genomic DNA was extracted from oral cotton swab samples by the proteinase K/Chelex method [1]. The 16 different STR systems were amplified using various kits (e.g. AmpFISTR Profiler and Identifiler [2] PCR amplification kits (Applied Biosystems, Darmstadt, Germany), Power ES (Promega, Mannheim, Germany), MPX3-SE ([3] Serac, Bad Homburg, Germany). Typing was performed using denaturing capillary gel electrophoresis on an ABI PRISM 310 Genetic Analyzer according to the manufacturer’s instructions.

Evaluation of Hardy–Weinberg expectations and other forensic statistical parameters was done with the computer programme HWE-Analysis 3.2 (Chr. Puers, Münster). Observed de novo mutations were included in the biostatistical evaluation according to Essen–Möller and, if necessary, additional STR systems were typed to reach a paternity probability value W≥99.997%.

Variant alleles and alleles from the mutation cases were isolated as described elsewhere [4] and directly sequenced using BigDye Terminator Cycle Sequencing Kit (ABI) with primers for both strands to check if the mutation had occurred in the repeat array.

Results and discussion

No deviation from Hardy–Weinberg equilibrium was observed for the 16 STR loci (Table 1). These 16 systems show a combined matching probability of 1 in 3.6 × 1014 and a combined mean exclusion chance greater than 0.9996 in the Afghan population investigated. According to these statistical parameters, this combination is a powerful tool for forensic identification and paternity testing.

Table 1 Characteristics of the 19 mutation cases from Afghanistan

A total of 19 one-step mutations were observed under approximately 12,000 meiotic transfers, 12 in the male and 2 in the female germ line, while five mutations could not be assigned. The ratio of repeat gains and losses was relatively balanced (7:9), while three mutations could not be assigned (Table 2). The observed mutation rates were in the range from 0 to 1.3 × 10−2 per locus per gamete per generation (Table 3) and, thus, are in the range reported by Brinkmann et al. [5] for Germans.

Table 2 Mutation rates of the 16 STR systems in Afghanistan
Table 3 Age distribution in mutation cases from Afghanistan

A slight increase of the mutation rate with age could be observed (Table 3), but the numbers are too low and do not allow statistically significant conclusions to be drawn.

In the hypervariable system ACTBP2 allele 4.2, the second smallest allele known at present and included in commercially available ladders (e.g. [6]) was observed nine times. Sequencing revealed a AA insertion directly 5′ of the core repetitive region of ACTBP2 (Fig. 1). As this allele 4.2 has not yet been found in other populations investigated (e.g. [1, 7]), it may help to determine the population of origin if detected in a stain from an unknown donor.

Fig. 1
figure 1

Sequences of the regular ACTBP2 allele 10 and of allele 4.2 that has an AA insertion directly 5′ of the core repetitive region

Sequencing also helped to assign the mutational events, e.g. in case E042/2002a, the filial allele 31.2 might have originated from the paternal alleles 30.2 or 32.2. Sequencing of allele 31.2 revealed the following motif (AAAG)12 AAAAAG (AAAG)18, which is compatible with the paternal allele 32.2 [motif (AAAG)12 AAAAAG (AAAG)19] but incompatible with the allele 30.2 [motif (AAAG)10 AAAAAG (AAAG)20 + a deletion of AAAG in the 5′ flanking region]. Thus, this mutation could be classified as a paternal one-step loss.

In case E112/2003, the presence of iso allele 14 at D8S1179 in the child [(TATC)2 TGTC (TATC)11 and (TATC)14] assisted in assigning the mutation to the paternal germ line because the mother inherited the iso allele 14 [(TATC)2 TGTC (TATC)11], while the father’s repeat structures were (TATC)13 and (TATC)15. This could be either a repeat gain or loss but they cannot be distinguished.

To conclude, we have established a forensic database for allele frequencies and mutation rates in the Afghan population that is useful for identity and parentage testing.