Abstract
Allele frequencies and forensic statistics of 21 autosomal short tandem repeat loci (i.e., D6S474, D12ATA63, D22S1045, D10S1248, D1S1677, D11S4463, D1S1627, D3S4529, D2S441, D6S1017, D4S2408, D19S433, D17S1301, D1GATA113, D18S853, D20S482, D14S1434, D9S1122, D2S1776, D10S1435 and D5S2500) were estimated in Han population from northern China (n = 220). Significant deviation from Hardy–Weinberg equilibrium was detected only for D22S1045. The observed heterozygosity, the expected heterozygosity, the discrimination power, the probability of paternity exclusion in trios, the probability of paternity exclusion in duos and the polymorphic information content ranged from 0.591 to 0.836, 0.594 to 0.830, 0.762 to 0.948, 0.341 to 0.659, 0.189 to 0.487 and 0.535 to 0.807, respectively. Triallelic patterns were observed at D19S433 and D10S1435. Mutations occurred at D22ATA63, D10S1248, D19S433 and D14S1434 loci with all single-step mutations. The expected mutation rates of these four loci are 0.0042 with 95% confidence interval [0.0001, 0.0232] in a total of 238 meioses. Our results show that these 21 non-CODIS STR loci are highly polymorphic and can be useful for human identification and kinship analysis in Northern Han population in China.
Avoid common mistakes on your manuscript.
Introduction
Short tandem repeats (STRs) are important genetic markers and widely used in forensic applications because of their highly polymorphic characteristics [1, 2]. The STRs in the Combined DNA Index System (CODIS) are commercially available in many multiplex amplification kits (e.g., AmpFℓSTR Identifiler®). Testing with these loci usually can meet the requirements of human individual identification and standard triopaternity test. However, duo paternity testing [3, 4], complex kinship analysis cases (e.g., full-sib, grandparents–grandchildren, etc.) [5], and cases with mutations [6] may need more STR loci to obtain reliable identifications. Extra STR loci can provide additional genetic information and are a complementary tool to the conventional STR analysis [7–10]. There have been studies on seeking additional polymorphic STR loci independent from the current CODIS loci [11, 12]. This study is to validate a panel of 21 autosomal non-CODIS STR loci for Northern Han population in China.
Materials and methods
Population and DNA samples
Han population is a native ethnic group in China and, by most modern definitions, the largest single ethnic group in the world. In this study, 220 healthy unrelated Han volunteers in northern China were sampled. One hundred fifty-eight two-generation families including 80 father–child–mother trios and 78 mother–child or father–child duos were collected to estimate the mutation rates of the STR loci. All samples were genotyped with a AmpFℓSTR Identifiler® multiplex STR kit (Applied Biosystems, Foster City, CA, USA). Paternity index of each family is at least 10,000 to confirm the relationships based on the STRs in Identifiler.
DNA extraction and quantification
Genomic DNA was extracted by using the Chelex-100 protocol as described by Walsh et al. [13]. The quantity of recovered DNA was determined by Qubit® Quantitation System (Invitrogen, CA, USA) according to the manufacturer's specifications.
DNA amplification
Amplification of STRs was carried out using a multiplex PCR system AGCU 21 + 1 fluorescence amplification reagents (AGCU ScienTech Incorporation, Wuxi, Jiangsu, China), which includes Amelogenin, D6S474, D12ATA63, D22S1045, D10S1248, D1S1677, D11S4463, D1S1627, D3S4529, D2S441, D6S1017, D4S2408, D19S433, D17S1301, D1GATA113, D18S853, D20S482, D14S1434, D9S1122, D2S1776, D10S1435 and D5S2500. A multiplex PCR amplification was performed with a total volume of 10.0 μl containing 0.2–1.0 ng genomic DNA, 4 μl Reaction Mix, 2 μl 21 + 1 Primers, 1 U HS-Taq DNA polymerase and ddH2O. PCR was conducted with a GeneAmp PCR System 9700 Thermal Cycler (Applied Biosystems, Foster City, CA) following a protocol with an initial denaturation step at 95°C for 11 min, followed by 10 cycles at 94°C for 1 min, 62°C for 1 min, 72°C for 1 min, and 20 cycles at 90°C for 1 min, 60°C for 1 min, 72°C for 1 min, a terminal extension step at 60°C for 60 min. The experiments were conducted in accordance with quality control measures. Cell lines 9947A (Promega, Madison, WI, USA) were used as positive standard reference materials [14] and ddH2O was used as negative control.
Genotyping
The PCR products were separated by Capillary Electrophoresis on ABI PRISM 3130 Genetic Analyzers (Applied Biosystems, Foster City, CA, USA). Raw data were analyzed using GenemapperID 3.2 software (Applied Biosystems, Foster City, CA, USA), and alleles were determined by comparing with the allele ladder (AGCU ScienTech Incorporation, Wuxi, Jiangsu, China). If off-ladder peaks, triallelic patterns [15, 16] or mutations between children and parents were encountered, the samples would be typed again to confirm the genotypes. Off-ladder alleles were determined by the method described by Gill et al. [17].
Statistical analysis
Hardy–Weinberg Equilibrium (HWE) and expected heterozygosity (He) of each locus as well as Linkage Disequilibrium (LD) between each pair of loci were tested with the Genepop Version 4.0.10 software package (http://genepop.curtin.edu.au). Polymorphism information content (PIC), observed heterozygosity (Ho), discrimination power (DP) and probability of paternity exclusion (PE) of Han population were calculated by PowerStats program (http://www.promega.com/geneticidtools/). Further, the Han population data were compared with Tibet population [18] by the shuffling testing method described in Ref. [19].
Quality control
All laboratory procedures are accredited according to ISO17025. Furthermore, laboratory internal control standards were employed according to recommendation published by the Paternity Testing Commission of the International Society for Forensic Genetics [20].
Results and discussion
The genomic mapping information of the 21 non-CODIS loci showed in Table 1 is based on UCSC genome data (http://www.genome.ucsc.edu) and STRbase (http://www.cstl.nist.gov/biotech/strbase). No peak appeared in negative control. The genotypes of 9947A were the same as the standard reference in all experiments and listed in Table 1. Allele frequencies and forensic statistics of each locus were shown in Table 2. Triallelic patterns were observed at D19S433 (e.g., 12, 13 and 14) in a male and D10S1435 (e.g., 12, 13 and 14) in a female, respectively. Deviations from Hardy–Weinberg equilibrium were only detected at D22S1045 (p-value = 0.0000) and D14S1434 (p-value = 0.0047). After Bonferroni correction (i.e., 0.05/21 = 0.00238), only D22S1045 was still significant in HWE test.
Sixteenout of 21 loci had the observed heterozygosity (Ho) greater than 0.7. The highest Ho was 0.836 at D19S433 and the lowest Ho was 0.591 at D1GATA113. Discrimination power (DP) varied between 0.762 at D1S1627 and 0.948 at D19S433. The probability of paternity exclusion (PE) in trios varied between 0.341 at D1S1627 and 0.659 at D22S1045. The probability of paternity exclusion (PE) in duos varied between 0.189 at D1S1627 and 0.487 at D19S433 and D22S1045. Cumulative DP of these 21 loci is 0.999999999999999999991. Combined probabilities of exclusion in duos and trios are 0.999025 and 0.9999997, respectively. These statistics indicate that these 21 loci have high or medium probability of exclusion as well as discrimination power and can be useful in human identification.
Thirteen pairs of loci were detected in significant Linkage Disequilibrium (LD) in a total of 210 pairwise comparisons in Table 3. After Bonferroni correction (e.g., p-value = 0.05/210 = 0.000238), only two pairs D2S441 and D2S1776 (P = 0.0000), D12ATA63 and D19S433 (P = 0.0000) were still significant in the LD test.
Table 1 also shows that some loci in the Identifiler® kit (TPOX, D2S1338, D3S1358, FGA, D5S818, CSF1PO, TH01, vWA and D18S51) are on the same chromosomes with the 21 non-CODIS loci. However, almost all physical distances between STR loci on the same chromosome are closed to or more than 50 Mb, except for D3S1459 and D3S1358 (i.e., 40 Mb). Thus, all 21 non-CODIS loci together with loci in the Identifiler® kit may be treated as independent loci, although further genetic linkage study on recombination fraction between D3S1459 and D3S1358 may be required.
In addition, we compared Northern Han population in China and Chinese Tibetan population in Lhasa [18] with these 21 non-CODIS loci. Table 4 shows the p-values of the shuffling tests [19] for population differentiation. Eleven out of 21 markers have p-value less than 0.05. Even after Bonferroni correction (0.05/21 = 0.00238), there are eight markers with significant p-values. Apparently, Northern Han and Tibet populations are significantly different in a good proportion of the tested markers.
In some complex kinship analysis cases (e.g., distant relatives or with mutations [21]), current commercial kits with CODIS core STRs may not be able to provide high enough likelihood ratio or probability of paternity to obtain reliable identifications. More STRs can provide extra information to raise the confidence of identifications. However, increasing the number of STR loci detection will also increase the probability of mutation. It would be better to select markers with low mutation rates. In the family genotype data, mutations were detected in four cases at four different loci (i.e., D22ATA63, D10S1248, D19S433 and D14S1434). All mutations were one step mutation (Table 5). With a total of 238 meioses, the expected mutation rates of these four loci are 0.0042 with 95% confidence interval [0.0001, 0.0232]. More pedigree samples will be tested to obtain more precise mutation rates.
References
Butler JM (2007) Short tandem repeat typing technologies used in human identity testing. Biotechniques 43:ii–v
Gill P, Werrett DJ, Budowle B, Guerrieri R (2004) An assessment of whether SNPs will replace STRs in national DNA databases—Joint considerations of the DNA working group of the European Network of Forensic Science Institutes (ENFSI) and the Scientific Working Group on DNA Analysis Methods (SWGDAM). Sci Justice 44:51–53
Von Wurmb-Schwark N, Malyusz V, Simeoni E et al (2006) Possible pitfalls in motherless paternity analysis with related putative fathers. Forensic Sci Int 159:92–97
Poetsch M, Ludcke C, Repenning A et al (2006) The problem of single parent/child paternity analysis: practical results involving 336 children and 348 unrelated men. Forensic Sci Int 159:98–103
Egeland T, Sheehan N (2008) On identification problems requiring linked autosomal markers. Forensic Sci Int Genet 2:219–225
America Association of Blood Banks. Annual report summary for testing in 2002 [DB/OL]. (2003-11-01) http://www.aabb.org/sa/facilities/Documents/ptannrpt02.pdf
Uchihi R, Yamamoto T, Yoshimoto T et al (2007) Population data of six STR loci using hexaplex PCR amplification and typing system, “Midi-6” in four regional populations in Japan. Forensic Sci Int 169:255–259
Kuzniar P, Jastrzebska E, Ploski R (2006) Validation of nine non-CODIS STR loci for forensic use in a population from Central Poland. Forensic Sci Int 159:258–260
Becker D, Vogelsang D, Brabetz W (2007) Population data on the seven short tandem repeat loci D4S2366, D6S474, D14S608, D19S246, D20S480, D21S226 and D22S689 in a German population. Int J Legal Med 121:78–81
Hill CR, Butler JM, Vallone PM (2009) A 26plex autosomal STR assay to aid human identity testing. J Forensic Sci 4:1008–1015
Odriozola A, Aznar JM, Celorrio D et al (2011) Development and validation of I-DNA1: a 15-Loci multiplex system for identity testing. Int J Legal Med 125:685–694
Hwa HL, Chang YY, Lee JC et al (2011) Fourteen non-CODIS autosomal short tandem repeat loci multiplex data from Taiwanese. Int J Legal Med 125:219–226
Walsh PS, Metzger DA, Higuchi R (1991) Chelex 100 as a medium for simple extraction of DNA for PCR-based typing from forensic material. Biotechniques 10:506–513
Szibor R, Edelmann J, Hering S et al (2003) Cell line DNA typing in forensic genetics—the necessity of reliable standards. Forensic Sci Int 138:37–43
Crouse CA, Rogers S, Amiott E et al (1999) Analysis and interpretation of short tandem repeat microvariants and three banded allele patterns using multiple allele detection system. J Forensic Sci 44:87–94
Lane AB (2008) The nature of tri-allelic TPOX genotypes in African populations. Forensic Sci Int: Genetics 2:134–137
Gill P, Urquhart A, Millican E et al (1996) A new method of STR interpretation using inferential logic-development of a criminal intelligence database. Int J Legal Med 109:14–22
Zhu BF, Shen CM, Wang HD et al (2011) Genetic diversities of 21 non-CODIS autosomal STRs of a Chinese Tibetan ethnic minority group in Lhasa. Int J Legal Med 125:581–585
Ge J, Budowle B, Planz JV et al (2010) US forensic Y-chromosome short tandem repeats database. Leg Med 12:289–295
Morling N, Allen RW, Carracedo A et al (2002) Paternity Testing Commission of the International Society of Forensic Genetics: recommendations on genetic investigations in paternity cases. Forensic Sci Int 129:148–157
Junge A, Brinkmann B, Fimmers R, Madea B (2006) Mutations or exclusion: an unusual case in paternity testing. Int J Legal Med 120:360–363
Lu DJ, Liu QL, Zhao H (2011) Genetic data of nine non-CODIS STRs in Chinese Han population from Guangdong Province, Southern China. Int J Legal Med 125:133–137
Acknowledgements
This work was supported by opening research grants from Shanghai Key Laboratory of Forensic Medicine (Institute of Forensic Science, Ministry of Justice, PR. China). We are very grateful to Jian Ye and Chengtao Jiang for their valuable technical assistance.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Yuan, L., Ge, J., Lu, D. et al. Population data of 21 non-CODIS STR loci in Han population of northern China. Int J Legal Med 126, 659–664 (2012). https://doi.org/10.1007/s00414-011-0664-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00414-011-0664-4