Mongol-like-horsemen-turned-merchants are Manchus from Manchuria whose homeland was originally centered around what is now the city of Shenyang in Northeast China. Manchus are the third largest group of population in China after Han and Zhuang. Manchus are among the most integrated ethnic minorities but still they have retained their own identity. They had established two dynasties in China, the first dynasty called Jin dynasty which was founded by Nvzhen people. The second one was the Qing dynasty which was founded by Huang Taijin in 1635. Manchu history could be traced back to 6000–7000 years ago [1]. The Manchu people are settled across of China, but 9.5% of their total population is settled in Liaoning province. According to demographic report of 2000, Manchu population size is 11 million so far. Most of Manchu people speaks mandarin language but few elderly people who are still living in remote areas of Northern China speak their own language.

The human Y chromosome has stretched non-recombining portion in the genome. The applications of short tandem repeats of the Y chromosome (Y-STRs) have played an important role in studies of population histories and forensic investigations such as patrilineal relationship evaluation, mixture identification, and ancestry inference [2,3,4,5,6]. Short tandem repeats (STRs) are more polymorphic and have high degree of mutability when compared with single-nucleotide polymorphisms (SNPs); as a result of these characteristics, STRs give you more information about recent events related to population history [7]. Formerly, a worldwide investigation of Y-chromosomal genetic diversity for 23 STR loci and Y-STR databases were created with collaborative effort by using Power-Plex Y23 System (PPY23, Promega Corporation, Madison, WI) [8] and there was a lack of data for Manchu population. This database provides information for the construction of phylogenetic trees and the interpretation of ethnic origin. The Manchus are the third largest ethnic group among 56 ethnic groups. Specifically, regarding Y STRs in forensic casework analysis and population genetics studies, few studies are available related to Manchu population residing in Liaoning province of China and those are only conducted using limited number of Y STRs [9, 10]. In the current study, we have used PowerPlex® Y23 System (Promega Corporation Madison, USA) kit to provide a more precise reference database for forensic investigation and population studies.

Blood samples were randomly collected from 328 unrelated healthy male individuals of Manchu population living in the Xiuyan and Huanren Manchu autonomous counties in Liaoning province (Fig. S1) with informed consent after the approval of ethical review board of China Medical University Shenyang China.

Chelex-100 method [11] was used for extraction of DNA from blood stains collected on FTA cards (Changchun Bokun Biotech CO., Ltd., China). PowerPlex® Y23 System kit was used to amplify samples using GeneAmp PCR System 9700 Thermal Cyclers (Applied Biosystems, Foster City, CA, USA). PCR amplification was performed according to the manufacturer’s instructions. PCR products were detected and separated by capillary electrophoresis with ABI Prism 3130 Genetic Analyzer (Applied Biosystems, Foster City, CA, USA) according to manufacturer’s instructions. The raw data were analyzed using Gene Mapper v.3.2. Allelic nomenclatures were based according to allelic ladder provided by PowerPlex® Y23 System kit. The nomenclature used was in accordance with the latest recommendations for the DNA commission of the International Society of Forensic Genetics [12]. Arlequin V 3.5.2 package software [13] used to calculate the gene diversity (GD), haplotype frequencies and allelic frequencies. Match probabilities (MP) were calculated as the sum of squared haplotype frequencies. The discrimination capacities (DC) were calculated by dividing the number of different haplotypes by the total number of samples. Genetic distances (Rst) between Manchu and 17 other Chinese populations were calculated by analysis of molecular variance and visualized in a multi-dimensional scaling plot from the Y chromosome haplotype reference database (YHRD: http://yhrd.org/amova). Linear discriminant analysis (LDA) was performed on Manchu, and other ethnic groups of China using R program [14] and correlation between STRs were calculated using XLSTAT (http://www.xlstat.com/en/).

We successfully obtained genotypes of total 23 Y-STRs from 328 male individuals in a Manchu population residing in Liaoning province of China (Table S1). The haplotype data were already made accessible via the Y-chromosome Haplotype Reference Database (YHRD) under accession number YC000301. The locus or haplotype diversity (GD) and the number of observed alleles or haplotypes for 21 single-copy STR loci and 1 multi-copy STRs are summarized in supplementary material Table S2. Among single-copy loci, the DYS570 showed the highest diversity of 0.8266 with 11 different alleles, and the DYS391 was the least diverse locus 0.3820 with seven different alleles. The multi-copy Y-STR DYS385 showed more diversity than single-copy Y-STRs with diversity values of 0.9695.We evaluated the haplotype resolution at five levels (Table 1), the minimal 9 Y-STRs loci, the extended 11 Y-STRs loci, PowerPlex Y12 STRs loci, Y-filer 17 STRs loci, and PowerPlex Y23STRs loci. A total of 323 haplotypes were observed at PowerPlex Y23 STRs with haplotype diversity (HD) 0.9999 and discriminatory capacity (DC) 0.9847. Among 323 haplotypes, 98.45% haplotypes were unique. When we reduced the numbers of STRs to 17, the differences in haplotypes (322), haplotype diversity (HD) 0.9999, and discriminatory capacity (DC) 0.9817 were minor, when it is compared to PowerPlex Y23 STRs. For PowerPlex Y12 STRs loci, the extended 11 Y-STRs loci, and minimal 9 Y-STRs loci, numbers of different haplotypes decreased to 313, 310, and 294 respectively, of which 300 (95.84%), 294 (94.83%), and 269 (91.49%) haplotypes were unique, respectively. The significant uplift of unique haplotypes showed that PowerplexY23 kit has superior power of discrimination in Manchu population.

Table 1 Forensic statistical parameters at five levels

The genetic distances (Rst) between Manchu population and other Chinese-integrated populations are listed in Table S3. Tibetan population was the most distantly related to Manchu population (Rst = 0.1505), while Manchu residing in Liaoning was the most closely related to Henan Han (Rst = − 0.0029). Moreover, the MDS plot (Fig. 1) showed a tight cluster consisting of Manchu populations from Liaoning and Chengde. While Han populations from Shanghai, China, and Jiangsu were loosely clustered. Zhuang, Guzhou, Uyghur, and Tibetan populations were placed on the edges of plot distant from Manchu populations. In the current study, linear discrimination analysis (LDA) was also performed (Fig. 2), which showed that Manchu population was probably an admixture of other Chinese populations with the exception of Uyghur and Tibetan populations.

Fig. 1
figure 1

Two-dimensional plot from multi-dimensional scaling analysis of Rst-values based on Yfiler haplotypes for Manchu population with other reference populations

Fig. 2
figure 2

a LDA between Hazara and 9 other ethnic groups from China. b Correlation coefficient of different Y STR markers and the two LDA factors

In conclusion, our study showed that the PPY23 kit discovers high haplotype diversity in Manchu population from China, which in general makes it suitable for forensic casework in this ethnic group. The recent inclusion of these data in the YHRD allows widespread use for forensic and other purposes. This paper followed the guidelines of publication of population genetic data in this journal [15].