Introduction

Thalassemia, also referred to as Mediterranean anemia, is an autosomal inherited defect caused by the mutations in the α- or β-globin gene. Thalassemia is categorized into two major types, α- and β-thalassemia according to the mutations that occur in these globin genes, either of which can be further subdivided into another two forms, the α0 and α+ thalassemia and β0 and β+ thalassemia, respectively. The heterozygous form of α+ thalassemia or α0 thalassemia may not have symptoms of anemia, while the compound heterozygous form for α+ thalassemia and α0 thalassemia usually causes hemoglobin H. The homozygous state for α0 thalassemia causes Hemoglobin Bart’s, which is lethal in utero or soon after birth. The heterozygous form of β thalassemia, which is called β thalassemia minor, usually shows asymptomatic microcellular anemia, while others are silent carriers. In contrast, the homozygous or compound heterozygous form of β mutations, also named the β-thalassemia major and intermediate respectively, can cause severe anemia and patients carrying these mutations need transfusion for life, which poses financial burden to their families and society.

The high frequency of inherited hemoglobin variants is present in tropical and sub-tropical areas such as Mediterranean countries and Southeast Asia [1]. In China, the prevalence of α-thalassemia, β-thalassemia, and α- and β-thalassemia ranges from 1.20 ~ 19.87%, 0.53 ~ 6.84%, and 0.08 ~ 1.22%, respectively [2]. Southern China including Guangxi, Guangdong, Fujian, and Hainan Provinces are the high incidence areas of the disease, and the spectrum of the mutations of these areas have been examined previously [3,4,5]. In addition, the spectrum of thalassemia gene mutations appears to be region- and ethnic-dependent.

As thalassemia is an inherited autosomal recessive disease and has serious impacts on the quality of patient’s life, couples who are both heterozygous for thalassemia gene have 25% probability to have a child with thalassemia major. Presently, prevention programs like molecular diagnostics, genetic counselling, and prenatal diagnosis have achieved great success in preventing the occurrence of thalassemia major, as demonstrated by a decline in the birth rate of thalassemia major in some countries such as Iran, Pakistan, Thailand, and China [6,7,8]. A previous study has provided important information about prenatal diagnosis of thalassemia in Han and Li people in Hainan [9]. However, few studies have shown geographical distribution and family-based prenatal diagnosis of thalassemia gene mutations in Hainan Province. Furthermore, the main aim of a thalassemia prevention and control program is to prevent the birth of infants carrying thalassemia major. Hence, it is necessary to enroll pregnant women and their husbands in the study cohort.

In this study, we analyzed α- and β-thalassemia genotypes in the fetal specimen collected from 3049 pregnant women using molecular prenatal diagnosis and conducted a large-scale familial investigation in eighteen regions of Hainan Province. The aims of this study were to reveal the detailed geographical and ethnic distribution and familial prevalence of thalassemia, and provide scientific basis for thalassemia prevention and control in the province, and construct a detailed frequency map of the spectrum of thalassemia mutations in Hainan. This study will provide comprehensive data of thalassemia’s prevalence in Hainan, which will significantly contribute to its control and management, genetic counseling and prenatal diagnosis.

Materials and methods

Subjects

This study included pregnant women and their husbands who registered for a program of diagnosis of thalassemia at The First Affiliated Hospital of Hainan Medical University in Hainan Province between January 2004 and March 2020. All couples had resided in Hainan Province, and came from eighteen regions of Hainan Province and all subjects signed the informed consent. This study was carried out in accordance with the Declaration of Helsinki and approved by the Ethics Committee of The First Affiliated Hospital of Hainan Medical University.

Samples collection

Depending on the gestational age of the fetus, chorionic villus sampling (CVS) at 10–12 weeks, amniotic fluid at 16–21 weeks, and cord blood at 18–28 weeks of gestation were collected. All procedures were performed under the guidance of ultrasonography. Peripheral blood from parents and umbilical cord blood from the fetus were collected via venipuncture of an antecubital vein using Ethylenediaminetetraacetic acid (EDTA) anticoagulant tube. The villus samples were isolated under a dissecting microscope. Amniocentesis fluid cells were isolated by centrifugation at 2000 rpm for 2 min. Genomic DNA of peripheral blood of both parents, chorionic villus, amniotic fluid, and umbilical cord blood were extracted by Tiangen DNA extract kit (Tiangen Biotech CO., LTD., China) following the manufacturer’s instructions.

Molecular diagnosis of α- and β-thalassemia

A thalassemia gene diagnostic kit (Shenzhen Yaneng Biotechnology Co., Ltd., China) combining GAP-PCR and hybridization technology was used to test 17 known β-globin mutations including codon 41/42(− CTTT), IVS-II-654 (C → T), − 28 (A → G), codon 71/72 (+ A), codon 17 (A → T), codon 26 (G → A), codon 31(–C), codon 27/28 (+ C), IVS-I-1 (G → T), codon 43 (G → T), − 32 (C → A), − 29 (A → G), − 30 (T → C), codon 14/15 (+ G), Cap (− AAAC), Int codon (ATG → AGG) and IVS-I-5 (G → C). The amplified products were analyzed by 1.5% agarose gel electrophoresis. Predenaturation at 94 °C for 10 min, denaturation at 94 °C for 1 min, annealing at 55 °C for 30 s, extension at 72 °C for 30 s, at 35 cycles of amplification, and extension at 72 °C 5 min. The amplified products were reverse hybridized with the reverse dot-blot membrane of the specific labeled probe. After washing, the signal on the membrane was determined.

The same kit was also used to detect the three common α-globin gene deletions, (−SEA/αα, − α3.7/αα, − α4.2/αα). PCR for deletion of α-thalassemia was performed according to the following protocol: initial denaturation at 94 °C for 10 min, 96 °C for 5 min, 98 °C for 45 s, 65 °C for 90 s, 72 °C for 3 min at 10 cycles of amplification, 98 °C for 30 s, 65 °C for 45 s, 72 °C for 3 min at 25 cycles of amplification and then 72 °C 10 min. The amplified products were analyzed by 1% agarose gel electrophoresis. All mutations identified are presented in Table S1.

Result

Prevalence of α- and β-globin gene mutations among couples in Hainan Province

We first examined 3049 couples of Hainan Province and diagnosed (1) 4923 cases as carrying α-thalassemia, accounting for 80.73%, with the mutation rates of 31.53% for –SEA/αα deletions, 11.15% for − α4.2/αα deletions and 11.02% for − α3.7/αα deletions, respectively (Table S2); (2) 2225 cases as carrying β-thalassemia, accounting for 34.69% of the total number of patients, with the mutation rates of 30.27% for CD41/42, 2.56% and 1.02% for − 28 and IVS-II-654, respectively; (3) 1322 cases as carrying compound α- and β-thalassemia, accounting for 20.99% of the total number of patients; and (4) 1322 cases as carrying both α and β-globin gene mutations, and among them, the five most frequent types were − α3.7/αα, − α4.2/αα, − αWS/αα, − α3.7/− α4.2 and–SEA/αα, with all accompanied by CD41/42, accounting for 4.49, 4.25, 2.35, 1.67, and 1.53% of all the subjects, respectively.

Geographical distribution of α- and β-thalassemia gene mutations in 18 regions of Hainan

Characterization of thalassemia mutations in the risk couples had been performed before the fetal samples were collected. During the 16-year period (2004–2020), a total of 3049 subjects were screened for thalassemia at Prenatal Diagnosis Center of The First Affiliated Hospital of Hainan Medical University. The rates of α-thalassemia carrier in 3049 couples of Hainan varied between 1.38 and 13.63% in eighteen regions. The rate was higher in coastal region (Haikou, Sanya, Lingshui, Danzhou, Ledong, Fig. 1A). The β-thalassemia carrier showed less variation, ranging from 0.67 to 4.39% in eighteen regions, which was higher in the coastal regions (Sanya, Haikou, Lingshui, Dongfang, and Ledong, Fig. 1B). The rate of α- and β-thalassemia carrier presented less variation in eighteen regions, ranging from 0.23 to 3.1%. The distributed status was similar to that of β-thalassemia carriers (Fig. 1C). We then analyzed the prevalence of α- and β-thalassemia mutations in eighteen regions of Hainan Province. The geographical distribution of α-thalassemia mutations is shown in Table 1. A significantly higher frequency of the –SEA/αα mutation was found in the general population of Hainan Province. Twenty-four types of α-thalassemia were identified in eighteen regions of Hainan Province. –SEA/αα, − α4.2/αα and − α3.7/αα are the most frequent α-thalassemia types in Haikou, Wenchang, Wuzhishan, Lingao, Ding’an, Sanya, Danzhou, Qionghai, Dongfang, Tunchang, Chengmai, Changjiang, Baisha, Baoting, Lingshui, and Dongfang, while the most common genotypes were − SEA/αα, -α4.2/αα and -αWS/αα in Wanning and Qiongzhong, five mutations (− αQS/− αQS, − αCS/− αWS, − α4.2/− αCS, − αCS/–SEA, − α3.7/− αCS) were observed at a frequency of less than 1% in eighteen regions. In general, the percentage of α-thalassemia mutation genes were in a similar tendency in different regions.

Fig. 1
figure 1

Frequencies of carrier status among couples in the eighteen regions of Hainan Province, China. A α-thalassemia only, B β-thalassemia, C α- and β-thalassemia

Table 1 Distribution of α-thalassemia mutations in Hainan Province

The geographical distribution of β-thalassemia mutations is shown in Table 2. Fourteen different point mutations of β-thalassemia were identified. CD41/42 (> 50%) was the most common mutation in eighteen regions, and four mutations (CD43, CD27/28, Cap, CD14/15) were observed at a frequency of less than 1%. Interestingly, the distribution trend of β-thalassemia mutation genes in eighteen regions was similar to that of α-thalassemia.

Table 2 Distribution of β-thalassemia mutations in Hainan Province

Fifty-six types of compound gene mutations were detected. The specific phenotype and frequency are shown in Table S3. 1319 cases carried both α- and β-globin gene mutations and, among them, the four most frequent types were − α4.2/αα, − α3.7/αα, αWS/αα, and − SEA/αα, all of which were accompanied by CD41/42 in eighteen regions of Hainan Province.

Spectrum of α- and β-thalassemia mutations identified in Li and Han people of Hainan Province

Apart from geographical factors, ethnic difference also plays an important role in gene diversity. Since Hainan Province is a place, whose population is mainly composed of Han and other minority groups, which generates diverse genotypes. We then analyzed the frequency and spectrum of α- and β-thalassemia mutations of the Han people in Hainan Province and compared these findings with the minority. As is shown in Table 3, twenty-four α-globin genotypes were found in 4,597 cases of α-thalassemia, with –SEA/αα, − α4.2/αα and − α3.7/αα being the four most frequent α-thalassemia types among the Han and Li people, accounting for 57.50, 12.37, and 11.26% in Han people and 13.43, 16.43, and 16.76% in Li people, respectively. − SEA/αα and − α3.7/αα are the two most frequent in Miao and Zhuang people, accounting for 62.75 and 13.73% in Miao and 55.88 and 14.71% in Zhuang people.

Table 3 Spectrum of α-thalassemia mutations of Han people and minority in Hainan Province

Surprisingly, CD41/42 was the dominant mutation of β-thalassemia patients among the Han, Li, Miao, Zhuang people, accounting for 70.03, 94.84, 83.64, and 56.25%, respectively (Table 4).

Table 4 Spectrum of β-thalassemia mutations of Han people and minority in Hainan Province

Fifty-seven types of compound gene mutations were detected. The specific phenotypes and frequency are shown in Table S4. The three most frequent types in Han people were − α3.7/αα, − α4.2/αα, and − αWS, all of which were accompanied by CD41/42, accounting for 21.45, 21.29, and 11.28%, respectively. The three most frequent types in Li people were − α3.7/αα, − α4.2/αα, and –SEA/αα, all of which were accompanied by CD41/42, accounting for 19.48, 16.29, and 16.86%, respectively. On the contrary, the patients who carried compound gene mutations in Miao and Zhuang were less than 1% of the total subjects.

Prenatal diagnosis

The frequencies of thalassemia genotypes obtained from 3049 prenatal diagnosis in Hainan Province are shown in Table 5. Among these 3049 cases of α-thalassemia, 3.80% of the fetus were α-thalassemia hydrops, 70.35% were α-thalassemia carriers, 4.39% were β-thalassemia major, 27.75% were β-thalassemia carriers, and 18.24% were normal. The spectrum of α-thalassemia genotypes among pregnant women, their husbands and fetuses in Hainan Province are shown in Table 6. The most frequent genotype was –SEA/αα, accounting for 21.42% of total fetuses, and is followed, in order of frequency, by the mutations -α4.2/αα and − α3.7/αα, accounting for 15.45 and 14.89%, respectively. The top three α-globin gene mutations in the fetuses were the same as those observed in the pregnant women and husbands.

Table 5 Genotypes of thalassemia identified by prenatal diagnosis in Hainan Province
Table 6 Spectrum of α-thalassemia genotypes among pregnant women, their husbands and fetuses in Hainan Province

For the β-thalassemia mutations, the most frequent genotype was CD41-42 (67.36%), followed by CD41/42/CD41-42, − 28, CD71/72 and − 28/CD41/42, accounting for 15.28, 5.01, 3.42, and 2.93%, respectively. In the pregnant women and husbands, the most common mutation was CD41/42 followed by IVS-II-65 and CD17 in parents (Table 7).

Table 7 Spectrum of β-thalassemia genotypes among pregnant women, their husbands and fetuses in Hainan Province

A total of 485 genotypes were determined in both α and β-thalassemia carriers. The top three frequent types were − α3.7/αα (17.53%), − α4.2/αα (16.91%), and − αWS/αα (8.87%), all of which were accompanied by CD41/42. The top 3 α- and β-combination globin gene mutations in the fetuses were the same to those of the pregnant women and husbands (Table S5).

For couples undergoing prenatal diagnosis, we conducted telephone follow-ups 6 months after the expected date of delivery, and a total of 2842 follow-ups were completed. 241 cases of termination of pregnancy, of which 208 cases were thalassemia severe, 33 cases were due to abnormalities including chromosomal and embryonic development, and 2601 successfully delivered newborns.

Discussion

Previous studies have shown the prevalence and molecular spectrum of α- and β-thalassemia mutations in Hainan Province, but those studies had limited sample size and information [9, 10]. This study was the first to provide a large scale, sampling of the eighteen regions of Hainan Province. Also, we analyzed the family-based sampling, as we knew that the couples who carried the same type of thalassemia mutations had high risk to have a moderate or severe thalassemia fetus. These pregnant women and their husbands are critical subjects of thalassemia intervention, which will help to prevent birth of moderate to severe thalassemia fetus. Our center is the earliest established Prenatal Diagnosis Center in Hainan Province, and nearly all carriers of thalassemia in the whole province come here for prenatal diagnosis. Hence, our study presented the prevalence and molecular variations of α- and β-globin gene mutations in Hainan Province.

Our data indicated a high prevalence of thalassemia in Hainan Province. The frequencies of α- and β-thalassemia were 80.73 and 34.69% in the total subjects of our study, respectively. The frequency of α-thalassemia shown in our study were higher than that reported in other provinces of China, such as Guangdong[3], Fujian[5], Guangxi[4], Sichuan[11] and the frequency of β-thalassemia reported were lower than that reported in Fujian[5], but higher than that in Guangdong[3], Guangxi[4] and Sichuan[11]. In addition, the frequency of α-thalassemia demonstrated in our report were higher than that reported in other countries of Southeast Asia, such as Cambodia, Laos, Thailand, Vietnam, and Malaysia [12] and in other countries/regions of the world, such as the north of Southwest Iran [13] and lower than that reported in Pakistan [8].

In this study, the regions with the highest prevalence of α-thalassemia were Haikou, followed by the Sanya, Lingshui, Danzhou, and Dongfang. The three regions with the highest prevalence of β-thalassemia mutations were Sanya, Haikou, and Lingshui. The lowest prevalence of α- and β-thalassemia was Lingao, Wenchang, and Ding’an. In our study, the prevalence of thalassemia carrier status in Sanya City was lower than that reported by Li [14]. Among α-thalassemia genotypes, –SEA/αα, which is consistent with the previous research by Sanya City and Ding’an County of Hainan Province, Sichuan, Fujian, Guangdong, and Guangxi [3,4,5, 11, 14, 15], but different from that of Lingshui Autonomous County of Hainan Province [16], accounts for the highest frequency in the total couples of eighteen regions. Comparing with other countries, the most frequent α-thalassemia mutations are consistent with that Laos [17], but different from those reported Thailand [18], Cambodia [19], and Iran [20]. Among the β-thalassemia genotypes, CD41/42 accounts for the greatest proportion among couples of eighteen regions and it is consistent with the previous study that reported to be the most frequent genotype in Sanya City, Lingshui, and Ding’an County of Hainan Province, Yunnan [21], Guangdong [3], Sichuan [11], Guangxi Province [4], while IVS-II-654 ranked first in Hubei Province [22]. These findings may be explained with the special location of Hainan Province which is located on a small island in southeast China. Comparing with other countries, the most frequent genotype of β-thalassemia is similar to that in Thailand [23], while that were different from Pakistan [24], Syria [25], Iran [20], Vietnam [26] Malaysia [27], Philippines, and India [28].

Race difference may also play a significant role in gene difference, so we do not exclude racial diversity. As population of Hainan Province is composed of Han and the minority ethnic group such as Li, Miao, Zhuang, so we also analyzed thalassemia genotype of couples based on their nations. Interestingly, –SEA/αα and CD41/42 are the most frequent thalassemia genotype in Han, Li, Miao, and Zhuang people, different from that in an early study [10], whose dominant genotype is − α4.2/αα. This result may be explained by continued migration or the screened population we selected.

Due to geography, culture, customs, and other reasons, the proportion of inter-marriages between some regions of Hainan Province and other regions is relatively small, which consequently led to an overlapping distribution of thalassemia genes and resulting in a high number of thalassemia carriers. Prenatal diagnosis is one of the most effective and direct method to prevent thalassemia major. In this study, we showed the results of prenatal diagnosis in our regions from 2004 to 2020, all participants were lived in Hainan Province. This study is the first systematic family-based prenatal diagnosis of thalassemia in the Hainan Province.

The findings of fetal diagnosis presented 9.17% at risk for thalassemia major, 18.26% for the normal type. Compared with one parent being a carrier, both parents that were carriers were more likely to have a child with thalassemia major or carrier. Also, the couples who carried the same thalassemia genotype had a high risk to have a fetus with thalassemia major. Interestingly, –SEA/αα deletion and CD41/42 were found to be the most common types of thalassemia of prenatal diagnosis, which were also the most predominant mutations of the full set of couples in our study, such couples had a 1/4 chance of producing a fetus with Hb Bart’s and β-thalassemia major. We identified 118 fetuses with Bart’s hydrops syndrome, accounting for 3.88%. We also identified 161 fetuses with β-thalassemia major, accounting for 5.29%. This study had a limitation that we did not test the α12 allele which also have the impact with α-thalassemia on clinical phenotype [29].

At present, Hainan Province has established several Prenatal Diagnosis Centers and Health Departments which have initiated free screening program since 2011. Other organizations such Li Ka-shing Health Poverty Alleviation fund projects and Prevention and Control Committee and Volunteers Association of Hainan Province, will help the prevention and diagnosis of thalassemia. We also recommend further studies on the factors that can affect the accessibility of thalassemia interventions to provide a scientific basis for government decision-making.

Conclusion

The gene types of α- and β-thalassemia in Hainan Province in China are characterized by a wide range of distribution, high carrier rate, genetic diversity, genetic heterogeneity, geographical, and ethnic differences. Therefore, it is necessary to detect the thalassemia genes in the suspected population in this area so as to provide clinical genetic counseling and prenatal diagnosis of for thalassemia major in Hainan Province.