Introduction

Thailand is located in the center of mainland Southeast Asia bordered by several countries: Myanmar to the North and West, Laos to the North and Northeast, Cambodia to the Northeast and East, and Malaysia to the South. With a population size ∼69 million, the major ethnic groups are Khonmueang, Lao Isan, central Thais and southern Thais who live respectively in four main regions; North, Northeast, Central and South (Myers 2005; Eberhard et al. 2020). The Northeast is the largest region of Thailand, comprising nearly one-third of the total area with a population of ∼15 million (Eberhard et al. 2020). The major group Lao Isan historically relocated from Laos during the fourteenth–eighteenth century AD (Myers 2005; Mishra 2010). At that time, besides the Lao people, other ethnic groups from both Laos and Vietnam were also relocated to the area of present-day Northeastern Thailand as minor groups, including several Tai-Kadai (TK) speaking-Phutai, Seak, Nyaw and Kalueang. However, prior to Lao Isan settlement, the indigenous people in this region were of the Austroasiatic (AA) speaking groups, e.g. Khmer and Mon (Myers 2005; Mishra 2010; Higham 2014).

Previous genetic studies utilizing mitochondrial DNA and the Y chromosome indicated contrasting paternal and maternal genetic ancestries of Lao Isan (Kutanan et al. 2017; 2019). However, analysis of autosomal DNA markers revealed that the Lao Isan people exhibited a genetic mixture with neighboring AA-speaking groups and displayed regional genetic differences among major groups in Thailand (Srithawong et al. 2020, 2021; Kutanan et al. 2021). While there have been numerous genetic investigations in northeastern Thai populations, with a particular focus on uni-parental and bi-parental genetic features (Kutanan et al. 2014, 2017, 2019, 2021; Srithawong et al. 2015, 2020, 2021; Than et al. 2022), the genetic characteristics of the X chromosome have not yet been reported.

According to gender, X-chromosome exhibits a specific inheritance pattern; father transmits it to daughter as an unchanged block while mother transmits one recombined X-chromosome to her offspring in the same way as autosome (Gomes et al. 2020). These unique properties make X-chromosome a complementary informative marker to autosomes and Y-chromosomes in forensic and population genetics (Gomes et al. 2020; Schaffner 2004). X-chromosomal short tandem repeats (X-STRs) are utilized in forensic investigations as additional evidence for several complicated scenarios of personal identification, e.g. identification cases of missing persons and mass disaster victims, and complex kinship analyses, e.g. deficiency paternity cases and incest cases (Asamura et al. 2006; Liu et al. 2007). In addition, some properties of X-STRs, e.g. high polymorphic, simple analyses and easy genotyping make X-STRs suitable for population-wide genetic study. However, the usefulness of X-STRs in both forensic and population genetic studies requires precise knowledge of not only allele and haplotype frequencies but also the genetic linkage and linkage disequilibrium (LD) status among markers (Inturri et al. 2011).

Within a decade, X-STRs have been continuously developed for commercial kits, e.g. the Investigator Argus X-12 kit (Qiagen GmbH, Hilden, Germany) with high standard production based on numerous makers, methodologies and a large database (Garcia et al. 2022) [6]. X-STRs data have been reported worldwide (Asamura et al. 2006; Li et al. 2011; Shin et al. 2005; Tetzlaff et al. 2012; Pepinski et al. 2007; Illescas et al. 2012; Baeta et al. 2013) but there is a paucity of X-STRs data in Thailand with data available from the northern and central regions of Thailand (Vongpaisarnsin et al. 2016; Khacha-ananda et al. (2020) but not in the Northeast and the South. A previous study suggested that variations in allele frequency of X-STRs might differ among individuals living in different geographic locations (Liu et al. 2007). Furthermore, our earlier study, based on autosomal STRs, also indicated regional genetic differences among Thai populations (Srithawong et al. 2020). Therefore, regional heterogeneity in Thailand may also hold true for X-STRs. To address this question, X-STRs data from northeastern Thailand must be developed. Here, we aim to characterize the genetic variations and forensic parameters of X-STR markers using the Investigator® Argus X-12 kit (Qiagen, Germany) on 10 northeastern Thai populations, encompassing both major and minor groups. Additionally, we compare genetic relatedness based on X-STRs between northeastern Thais and other Asian populations.

Materials and methods

Materials

Population samples

A total of 896 samples belonging to 10 populations included Lao Isan (n = 498), Khmer (n = 40), Mon (n = 82), Nyahkur (n = 38), Bru (n = 36), Kuy (n = 56), Phutai (n = 57), Kalueang (n = 41), Nyaw (n = 36) and Seak (n = 12). Almost all of the genomic DNA samples were obtained from our previous studies (Kutanan et al. 2017, 2018), except for 402 newly collected individuals of Lao Isan, using the 1.2-mm-diameter size blood spot on the FTA card. Sample donors were unrelated for at least two generations and provided samples with written informed consent. Ethical approval for this study was provided by Khon Kaen University (HE642161).

Methods

PCR amplification and STR typing

Genomic DNA was extracted using the Chelex-100 method. According to the manufacturer’s instructions, each sample was amplified for 12 X-STR loci using reaction mixtures included in the Investigator® Argus X-12 QS kit (Qiagen, Hilden, Germany). The PCR amplicons were separated by capillary electrophoresis on Genetic Analyzer ABI 3500 (Applied Biosystems, USA). The GeneMapper® ID-X software v1.4 (Thermo Fisher Scientific, Inc., Waltham, MA, USA) was used for determining STR allele calling. Raw genotypes of 12 X-STRs of females and males are shown in Online Resource 1, (ESM_1).

Statistical analyses

Allele frequencies for each locus and haplotype frequency of the four linkage groups (linkage group I: DXS10135-DXS10148-DXS8378; linkage group II: DXS7132-DXS10079-DXS10074; linkage group III: DXS10103-HPRTB-DXS10101; and linkage group IV: DXS10146-DXS10134-DXS7423) as well as the gene diversity (GD) and haplotype diversity (HD) were estimated using StatsX v2.0 software (Lang et al. 2019). StatsX v2.0 software was also used to calculate polymorphism information content (PIC) (Botstein et al. 1980), power of discrimination (PD) for males (PDm) and females (PDf) (Desmarais et al. 1998) and mean exclusion chances (MEC) for deficiency cases (MEC_Kruger) (Krüger et al. 1968), for normal trios (MEC_Kishida) (Kishida et al. 1997), and for duo cases (MEC_Desmarais duo) (Desmarais et al. 1998). Expected heterozygosity (HE) (Nei 1974), observed heterozygosity (HO), Hardy–Weinberg equilibrium (HWE) for females, and the pairwise linkage disequilibrium (LD) in the male and female using the exact P test for the 12 loci were performed by Arlequin version 3.5 software (Excoffier and Lischer 2010). Arlequin was also used to perform analysis of molecular variance (AMOVA) by grouping populations according to linguistic categories; i.e. Tai-Kadai and Austroasiatic languages.

A genetic distance matrix based on the number of different allele (Fst) was also created by Arlequin, then a matrix was plotted in two dimensions by means of multidimensional scaling (MDS) using Statistica v. 10 demo (StatSoft, Inc, United states). Furthermore, the genetic relationship between 10 studied populations and 17 availably published populations were illustrated by a neighbor-joining tree based on Nei’s DA distances using POPTREE2 software (Takezaki et al. 2014) and two dimensions multidimensional scaling (MDS) using Statistica.

Results and discussions

We genotyped 12 X-STRs using the Investigator® Argus X-12 kit on a total of 896 samples from major northeastern Thai (Lao Isan) and minor groups (AA-speaking-Khmer, Mon, Nyahkur, Kuy and Bru and TK-speaking-Phutai, Kalueang, Nyaw and Seak) from northeastern Thailand. The comprehensive results, encompassing allelic frequency, haplotype frequency, and forensic parameters of X-STRs for both Lao Isan and other minor groups in northeastern Thailand, are presented in Online Resource 2–6; ESM_2–6. For the Lao Isan, all loci are highly polymorphic and informative, indicating that this marker set is a valuable tool for forensic investigations. The newly generated reference X-STRs database of Lao Isan holds significance for forensic applications not only in Thailand but also in Laos. The X-STRs data also support regional genetic differences among major Thai groups in each region: northeastern Thai, Northern Thai, and Central Thai, as well as genetic distinctions of some minor groups, such as Kuy, aligning with findings from other genetic marker systems in our previous studies (Srithawong et al. 2020; Kutanan et al. 2021). This marker set could serve as an optional tool in forensic practice and population genetic studies in various populations. For more details, we present and discuss results in three parts: genetic and forensic parameters of the major group, genetic and forensic parameters of minor groups, and genetic relatedness among populations in northeastern Thailand and other Asian groups.

Genetic and forensic parameters of major northeast Thai population (Lao Isan)

Forensic parameters

There was a total of 498 genotypes of Lao Isan (NE-TH) samples (males = 228 and females = 270). The HWE tests were performed on female samples and some loci were deviated from the HWE: DXS10148, DXS7132, DXS10074, DXS10079 and DXS7423, after applying Bonferroni correction (0.05/12 = p < 0.004) (Online Resource 7, ESM_7) The deviation of some loci could be a result of the effect of population substructure (Tao et al. 2018) which was observed in genetically heterogeneous Lao Isan groups (Srithawong et al. 2020).

We calculated allele frequencies of 12 X-STRs in both genders; there were 189 and 149 alleles with corresponding allelic frequencies ranging from 0.0019 to 0.4963 and 0.0043–0.4868 in females and males, respectively (Online Resource 2–3, ESM_2–3). Based on the exact p values, the frequencies of almost all of allele between genders were not significantly different, with the exception of DXS7132 and DXS10074 loci (p < 0.0007) (Table 3). We then calculated the allele frequencies of pooled 12 X-STR data of male and female (Online Resource 2–3, ESM_2–3); a total of 191 alleles with allele frequencies ranging from 0.0013 to 0.4935 were found. With high numbers of alleles (23 alleles) and the highest value for all forensic parameters, DXS10148 is the largest informative marker (PIC = 0.9416), while the least informative marker (PIC = 0.5007) is DXS7423 with 7 alleles and low values of forensic indexes (Online Resource 4, ESM_4). GD and PIC were lowest in DXS7423 (0.5888 and 0.5007, respectively) and highest in DXS10148 (0.9946 and 0.9416, respectively). The PDm and PDf spanned 0.5880 (DXS7423) to 0.9444 (DXS10148) and 0.7430 (DXS7423) to 0.9941 (DXS10148), respectively, while the ranges of MEC_Kruger, MEC_Kishida, MEC_Desmarais Trio and MEC_Desmarais Duo were 0.2999 (DXS7423)—0.8875 (DXS10148), 0.5008 (DXS7423)—0.9415 (DXS10148), 0.5007 (DXS7423)—0.9416 (DXS10148) and 0.3593 (DXS7423)—0.8926 (DXS10148), respectively (Online Resource 4, ESM_4). The combined MEC indices were much greater than 0.99999959 for deficiency CMEC_Kru, CMEC_Kishida, CMEC_Des-trio and duo cases (CME CDes-duo) (Table 1).

Table 1 Genetic variation and forensic efficiency parameters of 12 X-STR loci based on allelic frequencies in the ten studied populations

Linkage disequilibrium (LD) and four linkage groups (LG1- 4)

The LD was estimated for all pairs of loci (Online Resource8, ESM_8) and their p-values for the LD exact tests between the male and female are shown Table 3. Technically, linkage expectations were only based on physical distances between loci; however, LD may also result from random genetic drift, founder effects, mutations, selection and population admixture or population stratification (Pereira et al. 2012). In this study, the significant associations were found both within LG and between LGs. After Bonferroni correction calculated from 66 pairwise comparisons, (0.05/66 = p < 0.0007), there were six pairs of linkage in male: DXS8378(LG1)/DXS10135(LG1), DXS7132(LG2)/DXS10074(LG2), DXS10074(LG2)/DXS10079(LG2), DXS10074 (LG2)/DXS10146(LG4), DXS10101(LG3)/DXS10103(LG3) and DXS10103(LG3)/HPRTB(LG3) and five pairs in female: (DXS10074 (LG3)/DXS7132(LG3), DXS10146(LG4)/DXS7132(LG2), DXS10103(LG3)/DXS10101(LG3), HPRTB(LG3)/DXS10101(LG3) and HPRTB(LG3)/DXS10103(LG3) (Online Resource 8, ESM_8). The significant associations of linkage disequilibrium were previously reported in a northwestern Italian population (Robino et al. 2006), four Chinese ethnic groups (Han, Tibet, Uighur and Hui) (Yang et al. 2017) and Liaoning Manchu population from China (Xing et al. 2019). Interestingly, here we also found strong linkage groups between the STR trios within LG3 (p < 0.05); DXS10101/DXS10103, DXS10103/HPRTB and HPRTB/DXS10103 in both Lao Isan males and females (Online Resource 8, ESM_8).

Haplotype frequency

Due to linkage in the studied STRs, we also classified the haplotypes into four linkage groups based on their physical localizations (Diegoli 2015): linkage group 1 (LG1): DXS8378-DXS10135-DXS10148, linkage group 2 (LG2): DXS7132-DXS10074-DXS10079, linkage group 3 (LG3); DXS10101-DXS10103-HPRTB and linkage group 4 (LG4): DXS7423-DXS10134-DXS10146. The haplotypes and corresponding frequencies of these four linkage groups (LG1-4) in Lao Isan males along with forensic parameters are shown in Table 2, Online Resource 5–6; ESM_5–6. We found a total of 195, 115, 103 and 119 haplotypes in LG1 to LG4, respectively, with frequencies ranging from 0.057 to 0.0044. The most common haplotypes were observed in LG3 as H 31-19-11 with haplotype frequency at 5.7% (13 males) and H 31-19-12 (5.26% haplotype frequency in 12 males). Haplotype diversity (HD) ranged from 0.9985 (LG1) to 0.9851 (LG3) whereas PIC varied from 0.9941 (LG1) to 0.9804 (LG3) and the PDm and PDf were greater than 0.9 (Online Resource 6, ESM_6). The combined MEC (CMEC) indices based on haplotype frequencies for deficiency cases CMEC_Kru (0.999999792953847), CMEC_Kishida (0.999999992808508), normal trio CMEC_Des-trio (0.999999978393156) and duo cases CMEC_Des-duo (0.9999996836695) were equally high for Lao Isan males (Table 2 and ESM_5).

Table 2 Genetic variation and forensic parameters of 12 X-STR loci based on haplotype frequencies

Genetic and forensic parameters of minor groups in Northeastern Thailand

Forensic parameters

A total of 398 genotypes of 12 X-STRs of nine ethnic groups were tested for HWE (Online Resource 7, ESM_7); there was no deviation from the HWE after Bonferroni correction (p > 0.05/12 = 0.0041) in each individual population with the exception of DXS7423 in Mon population (Online Resource 7, ESM_7). Allele frequencies of all loci between genders in all ethnic groups were not significantly different based on the exact P-value, excepting DXS7132 and DXS10074 (Table 3). We then pooled samples from both genders in each individual population and calculated allele frequencies and forensic parameters (Table 1, Online Resource 2–4, ESM_2–4).

Table 3 Differentiation test between all pairs of loci of male and female based on exact test

Within these nine groups, the number of observed alleles varied from 2 to 21 across the different loci. The allele frequencies and the forensic parameters, including the PD in females (PDf) and males (PDm), PIC, GD, MEC, the CDP for females (CDPf) and males (CDPm), and the combined MEC (CMEC) indices for the 12 X-STR loci in individual groups are shown in Online Resource 2–4, ESM_2–4. The combined power of discrimination for both males (CPDm > 0.99999) and females (CPDf > 0.99999) as well as the CMEC indices (MEC_Krüger, MEC_Kishida, MEC_Desmarais Trio and MEC_Desmarais Duo) were high for all populations (Table 1 and Online Resource 4, ESM_4). The GD values of all 12 X-STR markers were greater than 0.7 in all populations, indicating variation of forensic markers and allelic diversity in the studied populations (Khacha-ananda et al. 2020) (Table 1). The PIC values of DXS7423 and DXS8378 markers in all groups were less than 0.6, in agreement with previous study of the northern Thai populations (Khacha-ananda et al. 2020). DXS10148 showed the highest value for all forensic parameters, followed by DXS10135 and DXS10101, suggesting that these loci were the most informative markers, while the least informative was DXS7423 (Online Resource 2–4, ESM_2–4). Previous studies also reported the less informative nature of DXS7423 in several Asian populations, i.e. Southern Han, Tibetan, Uighur and Hui (Yang et al. 2017), Guizhou Sui from southwest China (Guo et al. 2019), Central Thai and northern Thai populations (Vongpaisarnsin et al. 2016; Khacha-ananda et al. 2020) and Sri Lankan ethnic groups (Perera et al. 2021).

Linkage disequilibrium (LD) and four linkage groups (LG1- 4)

Elevated LD is expected for populations which are either small, reproductively isolated or have low population growth (Ardlie et al. 2002), thus we estimated LD for all pairs of markers in seven populations, excluding Seak and Bru males due to small sample sizes (Online Resource 8, ESM_8). These linkage groups did not form stable haplotypes as indicated by LD of STRs within the groups and significant LD between the groups (Online Resource8, ESM_8). The high LD in populations, particularly Kuy’s male is probably driven by either genetic drift or consanguineous groups, which was also supported by the presence of highly shared haplotypes without any unique haplotypes (Online Resource 8, ESM_8). Previous study on autosomal STRs of Kuy indicated their genetic divergence from other northern Thai groups; lesser gene flow (compared with their neighbor the AA-speaking Khmer) and genetic drift could have promoted Kuy’s genetic distinction (Chantakot et al. 2017).

The high values of gene and haplotype diversities (Tables 1 and 2), along with elevated LD values in AA-speaking Khmer and Mon (Online Resource 8, ESM_8), likely contributed to the admixed population structure observed in these groups. This pattern is consistent with findings in several Asian populations, such as the Sinhalese from Sri Lanka (Perera et al. 2021), and Liaoning Manchu from China (Xing et al. 2019). Previous genome-wide studies have provided evidence of South Asian genetic ancestry in several present-day Thai populations (Kutanan et al. 2021; Changmai et al. 2022). South Asian ancestry is identified as one of the parental sources shaping the current admixed genetic structure of Khmer and Mon in Thailand, although in minor proportions (∼10–30%) (Kutanan et al. 2021; Changmai et al. 2022).

Haplotype frequencies

The forensic and statistical parameters of the four haplotypes (including PIC, HD, PDm, PDf, MEC_Kruger, MEC_Kishida, MEC_Desmarais and MEC_Desmarais_duo) are shown in Table 2, Online Resource 5–6, ESM_5-6; the PIC, PDm and PDf values for these haplotypes were greater than 0.9, while MEC values ranged above 0.8. Among the four linkage groups, LG1 proved more informative than other groups for almost all populations, mirrored by the haplotype diversity (Online Resource 5–6, ESM_5–6). Nevertheless, all four clusters showed a high haplotype diversity for all seven ethnicities. Approximately 75% to 85% of haplotypes were identified as unique haplotypes; in exception the Kuy did not show any unique haplotypes (Online Resource 5–6, ESM_5–6). The most common haplotypes are different among the studied populations; H 10-26-19 in LG1 is common in Kaleung (18.18%), while H 14-17-19 and H 13-17-20 in LG2 are prevalent haplotypes with a frequency of 26.09% and 21.74%, respectively in Nyahkur. In LG3, the most common haplotypes were observed: H 32-16-13 in Phutai with 15.79%, H31-19-11 in Nyaw with 21.05%, and H33-16-13 in Mon (11.36%), (Online Resource 5–6, ESM_5–6).

Genetic variation and genetic relatedness within northeastern Thai groups and between Asian populations

Results of Analysis of Molecular Variance (AMOVA) indicated that total genetic variation among populations accounts for 3.2%. Interestingly, the TK groups (3.13%, p < 0.05) exhibited small greater genetic heterogeneity than AA groups (2.97%, p < 0.05) (Table 4). The genetic variation showed non-significant differences among the TK and AA language families, coupled with much lower variation among groups than the variation among populations within each group, indicating that language families do not correspond to genetic structure. In addition, a direct comparison of Lao Isan vs. minor TK-groups, and Lao Isan vs. minor AA-groups, showed no significant differences between groups.

Table 4 AMOVA results

Among 45 pairwise comparisons of genetic distances (Fst) calculated from ten populations, there were 41 pairs (91.11%) with statistical differences (p < 0.05) and 4 pairs (8.89%) without significant differences (p > 0.05). No significant difference was observed between the pair of Phutai-Nyaw (Fst = 0.00625), Phutai-Kalueang (Fst = 0.00682), Phutai-Khmer (Fst = 0.00149) and Khmer-Kalueang (Fst = 0.00919). Subsequently, we carried out multidimensional scaling (MDS) plot based on Fst matrix to visualize genetic relationship among populations (Fig. 1). Based on MDS dimension 1, Seak, Lao Isan, Kuy and Nyahkur were quite distinct from other populations but Nyaw and Bru were separated from the others based on dimension 2 and 3, respectively. The Mon, Kalueang, Khmer and Phutai were positioned in the center of the plots, mirroring their close genetic relationship (Figs. 1A and B). In general, population clustering is not consistent with language family, in agreement with AMOVA results (Table 4).

Fig. 1
figure 1

The three-dimensional MDS plot based on Fst distance matrix for 10 populations for a dimension 1 vs. 2 and B dimension 1 vs.3. Black and red dots represent Tai-Kadai and Austroasiatic-speaking populations, respectively. The stress value is 0.0409

To reveal genetic relationship among populations worldwide, the allele frequencies of 12 X-STRs of ten northeastern Thai groups were compared with available data from 17 populations (Vongpaisarnsin et al 2016; Khacha-ananda et al. 2020; Shrivastava et al. 2015; Uchigasaki et al. 2013; Zeng et al. 2011; Sufian et al. 2017; Elakkary et al. 2014; Almarri and Lootah 2018; Salvador et al. 2018; Messaoudi et al. 2019; Poulsen et al. 2016). We constructed a neighbor-joining (NJ) tree and the first two dimensions of MDS based on pairwise Nei’s Da genetic distance averaged over 12 X-STRs (Fig. 2A and B). In general, the Lao Isan population was clustered with other northeastern Thai minorities with the divergence of Seak (Fig. 2A). Distinction of Seak from the other groups in Thailand was also reflected by linguistic evidence. In contrast to other TK groups in Thailand who speak the Southwestern Tai branch, the language of Seak is of the Northern Tai branch in TK family that was spoken mainly by the Tai in Gwangsi Province of China (Eberhard et al. 2020). Historically, the Seak originated from the area of Thua Thien-Hue Province in Vietnam and then migrated westward into central Laos and settled scattered there (Schliesinger 2003). In 1851 CE, some groups of Seak were taken to present-day Northeastern Thailand from the Khummuan Province of Laos by the Siamese army as war prisoners (Schliesinger 2000, 2001, 2003). Previous mtDNA studies revealed low genetic diversity and large divergence of Seak that were probably influenced by genetic drift during population movement (Kutanan et al. 2014; 2017). However, the Y chromosome showed an opposing genetic result, indicating the contrasting maternal and paternal genetic history of Seak.

Fig. 2
figure 2

Genetic relationship among 27 populations based on Nei’s genetic distances. A the NJ tree and B the two dimensional MDS plots with stress value at 0.0973. Black and red dots represent Tai-Kadai and Austroasiatic-speaking populations from northeastern Thailand, respectively, whereas green and blue dots represent Thai populations from other regions and other Asian populations, respectively

The AA-speaking Mon and Bru are positioned in between northeastern Thai clade and clades of other populations (Fig. 2A); Mon and Bru were laid far away from northeastern Thai group shifting to cloud of other Asian groups (North and South Asians) in dimension 1 of MDS (Fig. 2B), indicating genetic difference of both from other northeastern Thais. In agreement with previous studies, autosomal STRs indicated genetic differentiation of Bru (Inturri et al. 2011) while genome-wide data supported unique genetic structure of Mon that resulted from admixture with South Asian group (Kutanan et al. 2021).

In agreement with autosomal STRs and genome-wide data (Srithawong et al. 2020; Kutanan et al. 2021), genetic difference according to geographic locations of major Thai groups were observed using X-STRs; almost all of the northeastern Thai populations were clustered together and separated from the northern Thai, which were genetically closer to Chinese, and central Thai who were more related to South Asian populations (Indian, Bengali and Bangladeshi) than the other Thais These results are consistent with several lines of investigation that indicated South Asian genetic ancestry in the central Thai group (Srithawong et al. 2020, 2021; Kutanan et al. 2017, 2019).

Conclusion

Previous reports on X-STRs data from Thailand were primarily focused on the North and Central regions. This study expands upon existing research by establishing additional X-STRs data in the northeastern region, utilizing the Investigator® Argus X-12 kit. Forensic parameters within the major group, Lao Isan, demonstrated that all loci are highly polymorphic and informative, rendering this marker set a valuable tool for forensic investigations in northeastern Thailand. For the minor ethnic groups, it is important to note a limitation in this study—the low sample size. Caution is advised in applying these findings in forensic applications and interpreting them in anthropological studies. Nevertheless, a rigorous recruitment procedure for sample collection, coupled with an overall agreement with previous studies that employed various genetic markers, suggests that the results are representative of these smaller populations. The present results also support regional genetic differences among major groups in each region: northeastern Thai, Northern Thai, and Central Thai. Consequently, different haplotype frequency databases should be considered for forensic purposes. Additionally, more exhaustive sampling in this area would be beneficial for further forensic and anthropological genetic studies. Furthermore, incorporating forensic X-STRs data from southern Thailand could strengthen the forensic X-STR database for the entire country.