Introduction

Amomum tsao-ko, commonly known as black cardamom, is a perennial evergreen tufted herb belonging to the family Zingiberaceae. It is mainly distributed in Southwest China and Northern Vietnam, and found at altitudes ranging from 1300 to 1800 m. It is a type of economical crop that can grow in humid forests in tropical and subtropical areas [1, 2]. Yunnan province is the main A. tsao-ko-producing area in China, contributing to 95% of the nation’s output [3]. The whole plant of A. tsao-ko has a spicy smell, is edible and used medicinally. Its dried fruit is often used to treat internal resistance to cold and dampness, epigastric distension, fullness and vomiting, and malaria among other ailments [4, 5]. Recently, A. tsao-ko was one of the clinical prescriptions for patients with light, ordinary and severe cases of COVID-19 in China, demonstrating its important role in antiviral therapy [6, 7]. In food, A. tsao-ko has become a top-grade seasoning, and widely used in food processing [8].

Crop breeding effectiveness not only depends on the number of germplasm resources but also largely depends on the mastery of genetic characteristics of the diversity of these germplasm resources. Previous studies on A. tsao-ko have primarily concentrated on chemical composition analysis and pharmacological action, and there are few reports on genetic diversity [9,10,11]. Microsatellite markers, also called simple sequence repeat (SSR) markers, are effective in estimating genetic diversity and relationships within and among populations, and are widely used in plant genetic research. Yang et al. [12] developed 24 pairs of SSR primers by FIASCO technique for A. tsao-ko, but only nine of them were polymorphic, such limited SSR markers are not enough to conduct comprehensive genetic studies. The objectives of this study were to develop a set of novel microsatellite markers using restriction-site-associated DNA sequencing (RAD-seq) technique. These markers will provide a useful tool for A. tsao-ko genetic research.

Materials and methods

A total of 72 accessions from six cultivated populations of A. tsao-ko were collected from Yunnan Province, China (Fig. 1, Table S1). The collections covered the main producing area, with the longitudes ranging from 99°49′12″ (LC, Lancang population) to 103°31′48″ (PB, Pingbian population), latitudes ranging from 22°53′24″ (LVC, Lvchun population) to 24°17′24″ (YX, Yunxian population), and elevations spanning from 1665 to 2108 m. Each population consisted of 12 individuals, and the linear distance between individuals in the same population is not less than 50 m to decrease the possibility of sampling from the same clone.

Fig. 1
figure 1

Map of A. tsao-ko samples collection sites. PB, Pingbian population; JP, Jinping population; YY, Yuanyang population; LVC, Lvchun population; YX, Yunxian population; LC, Lancang population (see Table S1 for full description of the populations)

The genomic DNA of A. tsao-ko was extracted using a cetyltrimethylamine bromide method [13], and the concentration and quality of DNA were detected by NanoDrop 2000 ultra-micro spectrophotometer. The working solutions of genomic DNA (20 ng/μL) of different samples were stored at 4 °C. Six genomic DNA samples (one sample from each population) of A. tsao-ko were mixed with the same amount, and the RAD-seq technique was used to simplify sequencing. The bidirectional sequencing of 350–650 bp recovery sequence was carried out by HiSeq 4000 sequencing platform with PE150 mode. To obtain clean reads, the adapters and sequences <50 bp and low-quality reads were removed. The clean reads were assembled using SOAPdenovo2.04 [14]. The sequencing data were deposited in the Sequence Read Archive (SRA) of NCBI database (SRA Accession: PRJNA529582).

The software MISA was used to search the SSR sites of the DNA sequences obtained by sequencing [15]. The configuration parameters are as follows: the minimum repetition times of single nucleotide, dinucleotide, trinucleotide, tetranucleotide, pentanucleotide and hexanucleotide were 10, 5, 4, 4, 4 and 4, respectively, and the sequence length between two adjacent SSR is ≥12 bp. SSR primers were designed based on the flanking sequence of SSR locus using Primer 3.0 software [16]. The parameters were set as follows: GC content 40–60%, primer length 18–27 bp, annealing temperature 50–65 °C, and amplification product length 100–300 bp.

100 pairs of SSR primers were randomly selected for synthesis and were screened by four A. tsao-ko samples. Finally, 23 pairs of SSR primers with clear bands and obvious polymorphism were selected for genetic diversity analysis of 72 A. tsao-ko accessions. The 10 μL polymerase chain reaction (PCR) system consists of 1.0 μL template DNA (20 ng/μL), 0.8 μL SSR primers (0.2 μmol/L), 0.8 μL deoxyribonucleoside triphosphates (dNTPs) (2.5 mol/L), 1.2 μL 10 × PCR buffer (including Mg2+), 0.2 μL Taq DNA polymerase (2.5 u/μL), and 6 μL ddH2O. The PCR procedure included pre-denaturation (95 °C/5 min), followed by 95 °C/30 s, 51–55 °C/30 s, 72 °C/30 s, a total of 30 cycles, and finally extended at 72 °C for 10 min to preserve at 4 °C. All of the PCR amplifications were conducted using a thermal cycler (ETC-811, Eastwin). The amplified products were separated on 8% non-denaturing polyacrylamide gel (acrylamide:bisacrylamide = 19:1), stained with 1% silver nitrate at the end of electrophoresis and photographed after the bands were clear.

For each locus, the evaluation parameters including the observed number of alleles (Na), effective number of alleles (Ne), observed heterozygosity (Ho), expected heterozygosity (He), fixation index (F), genetic distance, and pairwise FST values were calculated by GenAlEx 6.501 program [17], which are also used for the Hardy–Weinberg equilibrium (HWE) test. Frequencies of null alleles were estimated using Micro-checker 2.2.3 [18]. The polymorphism information content (PIC) of polymorphic loci was calculated using PowerMarker V3.25 [19]. 72 accessions were assessed by multivariate principal coordinate analysis (PCoA) performed with GenAlEx 6.501. A Mantel test was performed to determine the correlation between genetic distance and geographic distance matrices using NTSYS-pc 2.11 software [20].

Results and discussion

By RAD-seq, 5,269,456 (1,317,364,000 bp) raw reads were obtained, and 5,117,655 (1,185,485,454 bp) clean reads were obtained after filtering. A total of 115,482 microsatellites were identified using MISA software, and 13,411 SSR primer pairs were designed using Primer 3.0. 100 pairs of randomly selected primers were validated in four A. tsao-ko accessions, of which 95 pairs could amplify bands in the target area, 64 pairs were polymorphic (Table S2). 23 primer pairs with clear bands and obvious polymorphism were selected for further analysis (Table 1).

Table 1 Characterization of 23 microsatellite loci developed for A. tsao-ko

A total of 90 bands were amplified by 23 SSR markers in 72 A. tsao-ko, with an average of 3.913 bands per primer pairs. The total number of alleles (Na) per loci varied from 2 (AM65) to 6 (AM49 and AM89), and the effective number of alleles (Ne) ranged from 1.315 (AM43) to 3.776 (AM89). The average Ne (2.305) was lower than the Na (3.913) thus reflecting imbalanced allelic frequencies in populations, which could imply the presence of rare or unique alleles. The observed heterozygosity (Ho) and expected heterozygosity (He) in each locus vary from 0.208 (AM43) to 0.779 (AM42) and from 0.239 (AM43) to 0.735 (AM89), respectively. All the genetic parameters were higher than those of SSR markers developed by Yang et al [12]. The PIC ranged from 0.228 (AM43) to 0.709 (AM89). The mean fixation index was 0.072, and values ranged from −0.181 (AM42) to 0.272 (AM13), indicating that there was no inbreeding depression in A. tsao-ko population. Among all the loci, 5 loci (AM4, AM12, AM13, AM40, and AM44) were exhibited heterozygote deficiency, as indicated by the positive and significant fixation index. Heterozygote deficiency could be the result of rare alleles, correlations among loci, or the sampling of closely related individuals. HWE analysis showed that 10 loci (AM4, AM6, AM8, AM12, AM13, AM42, AM43, AM44, AM45, and AM49) significantly deviated from HWE (P < 0.05), and the remaining 13 loci (AM1, AM7, AM9, AM17, AM27, AM29, AM36, AM40, AM41, AM65, AM89, AM94, and AM99) were in accordance with HWE (Table 2). Micro-checker analysis showed that null alleles frequencies were lower than 0.2 for each locus, indicating these HWE deviations may be caused by anthropic selection and short-cloning history in cultivated rather than null alleles [12, 21].

Table 2 Genetic variation of the 23 polymorphic microsatellite loci in 72 accessions of A. tsao-ko

In this study, PCoA analysis did not clearly separate the populations, and the germplasm from different populations were mixed with each other (Fig. 2). Consistent to this result, the Mantel test revealed that there was no significant correlation between geographic and genetic distance (r = 0.076, p = 0.579), which suggested no significant geographic restriction to gene flow among the populations. The pairwise FST values ranged from 0.029 (LVC and LC) to 0.074 (YY and LC), with an average of 0.047. The genetic distance varied from 0.056 (JP and YX) to 0.146 (YY and LC) with an average of 0.072 (Table 3). These results showed a relatively low degree of genetic differentiation and high genetic similarity among these populations. A. tsao-ko is a small group of species that is geographically restricted to southern China (Yunnan), northern Laos and Vietnam at altitudes ranging from 1300 to 1800 m, where it thrives in well-drained, moist, organic-matter-rich soil in shade or partial shade [4, 22]. Narrow distribution areas and special habitats lead to the high genetic similarity between A. tsao-ko populations, similar results were also observed in Dendrocalamus membranaceus [23], Thuja sutchuenensis [24] and Stemona parviflora [25]. Furthermore, a high level of gene flow (Nm = 5.425) was detected among A. tsao-ko populations based on random amplified polymorphic DNA (RAPD) markers in our previous study [8]. Such gene flow was enough to resist the genetic differentiation caused by genetic drift. In conclusion, we developed and characterized 23 novel microsatellite markers in A. tsao-ko. The set of markers significantly increases the availability of polymorphic molecular markers for the species, and will provide a valuable resource for genetic diversity, core collection construction, genetic mapping and marker-assisted selection breeding in A. tsao-ko.

Fig. 2
figure 2

Principal coordinate analysis based on the genotyping of 72 accessions of A. tsao-ko with 23 microsatellite loci

Table 3 Pairwise of FST (above diagonal) and genetic distance (below diagonal) for six A. tsao-ko populations