Introduction

Type 1 diabetes (T1D; MIM #222100) is the most common chronic metabolic disease in children, responsible for 5–10 % of the total cases of diabetes worldwide [1], and is suggested to increase globally by 3 % a year [2]. According to the Diabetes Atlas, 2015, more than 500,000 children developed T1D in 2015 (www.idf.org). The global annual incidence of T1D in the age group 0–14 years varies remarkably in different populations; for example, the three Nordic countries, Finland, Sweden, and Norway, are in the top three places in the list of countries with annual rates of 57.6, 43.1, and 32.8 per 100,000, respectively. In contrast, Venezuela has the lowest annual incidence rate (0.1/100,000) [3••]. The frequency of T1D varies greatly worldwide. Caucasians are considered to be at the greatest risk for T1D, with a frequency of approximately 0.4 [4], and the frequency is much lower among Asians, reaching less than 0.04 in Japanese populations [5]. Arabs have one of the highest incidence and prevalence of T1D worldwide.

T1D is characterized by immune destruction of the insulin-producing pancreatic islet β cells [6], resulting in insulin insufficiency. The cause of T1D remains unknown; however, first-degree relatives of T1D patients are at increased risk compared to the general population, and several other studies showing that approximately 10 % of patients with T1D cluster in families [7], the concordance rate in monozygotic twins for T1D is ~70 % [8, 9] and the heritability of T1D is estimated to be ~88 % [10], these factors add further evidence of a significant genetic contribution to T1D susceptibility. Nerup et al. [11] were the first to associate HLA inheritance with T1D; subsequently, scientists were able to demonstrate positive and negative correlations of particular HLA haplotypes to T1D [1214]. HLA-DQ- and HLA-DRB1-containing genotypes were found to be responsible for approximately 50 % of the risk associated with developing T1D [15, 16]. HLA class I molecules [17] and other non-HLA genes, such as CTLA4, PTPN22, INS, and IL2RA [18], were also reported as potential predisposing genetic risk loci for T1D in different ethnic groups.

Epidemiology of Type 1 Diabetes in the Arab World

Who Is an Arab?

The Arab world is comprised of 22 Arab-speaking countries (Fig. 1), according to the World Bank latest classification for 2015 (http://data.worldbank.org). Arab regions include high-income countries, such as Bahrain, Kuwait, Oman, Saudi Arabia, Qatar, and United Arab Emirates; middle-income countries such as Algeria, Egypt, Iraq, Jordan, Lebanon, Libya, Morocco, Palestine, Somalia, Sudan, Syria, and Tunisia; and low-income countries such as Comoros, Djibouti, Mauritania, and Yemen. They occupy a large area extending from the Atlantic Ocean in the west to the Arabian Sea in the east. The Arab population is approaching 0.5 billion, and this region has been extensively exposed to many successive invaders from Turkey, Rome, and Europe, as well as traders and immigrants, which has contributed to the mixed ethnic demographic of the population.

Fig. 1
figure 1

Classification of Arab countries according to their income using the latest classification of the World Bank (Source: http://www.arabspatial.org/map)

Incidence and Prevalence in Arabs

Incidence

In the Middle East and North African (MENA) region, which is mostly Arabs (18 out of 21 countries), it is estimated that there are approximately 60,000 cases of T1D in children less than 15 years old. Several studies reported large variations in T1D incidence among Arab countries, ranging from a low of 2.54/100,000 in Oman [19] to a high of 29/100,000 in Saudi Arabia [20]. These variations can be attributed to the vast diversity of socioeconomic status among Arabs, wide geographical range, and differences in marriage culture practices. The reported incidence details for different Arab countries are summarized in Table 1 [2129, 31]. According to the International Diabetes Federation (IDF)’s latest estimates for the incidence rates, 2015 (www.idf.org), Kuwait and Saudi Arabia are featured in the top 10 list of countries with the highest incidence rates in the world. Saudi Arabia and Egypt contribute almost half of the known cases in the MENA region [3••].

Table 1 The incidence rates of type 1 diabetes among Arab countries

Does Sex Matter among People with Type 1 Diabetes in the Arab World?

Generally, T1D is characterized by a male/female ratio of close to 1, although there is a slight deviation in this ratio among different ethnic groups. Among Arab patients, the male/female ratio of patients with T1D in high-income countries is lower than that of the middle- and low-income countries (Table 1). Overall, there is a slight excess of women with T1D among Arabs within the age group 0–14 (Table 1); in contrast, patients from Europe show a slight male excess within the same age group [32]. However, there is no sex bias for T1D in Swedish patients [33].

Prevalence

The latest prevalence estimates according to the IDF atlas indicate that there are almost half a million children aged less than 15 years living with T1D worldwide (www.idf.org). Unfortunately, most of the epidemiological studies of T1D have investigated the disease incidence, few dealt with its prevalence, and most data sources do not distinguish between T1D and type 2 diabetes (T2D) in reporting prevalence. Therefore, it is difficult to separate the prevalence of the two subtypes of diabetes, which might explain the lack of data regarding the prevalence of T1D, particularly in Arab countries. A few studies have reported on T1D prevalence in children and adolescents in Algeria, Egypt, Kuwait, Saudi Arabia, and Sudan [30, 3437], and it is not representative of the actual prevalence rates in the Arab world.

Patterson et al. [3••] have reported the number of children with T1D (in 1000s) globally, including Arab countries. They employed a comprehensive analysis of patients with T1D using variable sources of available data on the prevalence worldwide and determined the T1D prevalence among Arabs. Saudi Arabia features at the top of the list of prevalence of the disease in Arab countries, and Qatar and Kuwait show lower prevalence estimates than several of the middle-income countries, in contrast to their incidence estimates (Table 1), which are the highest in the Arab world together with Saudi Arabia. Overall, among all Arab countries, the highest observed prevalence was in the middle-income countries, headed by Egypt, Sudan, and Morocco. A complete and accurate snapshot of the epidemiological situation of diabetes will help to customize health care in regions where there is a lack of information available for T1D. With more thoughtful studies, it is anticipated that both of incidence and prevalence in Arab countries will be estimated at significantly higher levels.

Arabs are an Ideal Population in which to Study the Genetics of Type 1 Diabetes

Considering that T1D is a genetic disease, areas with a high incidence and prevalence of T1D should be a priority for genetic studies related to T1D to understand the genetic etiology of the disease and improve its prognosis. This makes the Arab world very interesting due to the distinctive genetic make-up of Arabs, resulting from highly prevalent endogamous and consanguineous marriages (10–70 %), with an extreme preference towards first-cousin marriage [38], which is still rising among Arabs and is responsible for the high prevalence of gene-based disorders in Arab countries [39]. For example, consanguineous marriage rates in Qatar increased from 41.8 to 54.5 % in just one generation, with first-cousin marriage of ~35 % [40]. Moreover, endogamous marriages approach 100 % in many Arab countries, especially in the Gulf States (Saudi Arabia, Bahrain, Kuwait, Oman, Qatar, and United Arab Emirates). In contrast, Finland and Norway, which top the list of countries for T1D prevalence, with Finland having the highest global annual incidence of T1D in children, IDF Atlas, 2015 (www.idf.org), have very low consanguineous mating rates compared to Arabs. In addition, the Norwegian [41] and Finnish [42] population show much lower frequency of first-cousin marriage, with rates of 0.2 and 0.17 %, respectively. These differences lead to variable genetic architecture between Nordics and Arabs, which likely results in a distinctive susceptibility or protective genetic profile between the two ethnic groups for T1D. Although environmental factors and socioeconomic background of patients with T1D are potentially important contributors to the variation between the two ethnic groups, considering the high disease heritability (88 %) and concordance rates in monozygotic twins (70 %), genetics of diabetes play a major role. Therefore, given the uncertainty of the etiology of T1D, high consanguinity rates across Arab countries are expected to contribute to our understanding of the molecular pathology of T1D.

Immunogenetics of Type 1 Diabetes among Arabs

The advances in sequencing technology and genotyping techniques have improved our understanding of the contribution of the HLA loci to T1D susceptibility in different ethnicities. Because the DQ and DR alleles exhibit a high degree of linkage disequilibrium (strong correlation) [15, 43], the associations of HLA alleles with T1D disease must be considered to be haplotype-specific, not allele-specific, because the haplotype context shows variable degrees of susceptibility and/or protection that are distinct from the individual alleles; for example, the DQB1*0302 allele is reported to be a critical susceptibility factor for T1D in Caucasians [44] but shows neutral and protective phenotypes when combined with other alleles in different ethnic groups (Table 2). The DQB1*0201 allele is almost universally reported to be associated with risk to T1D, including in Japanese populations [5], but in Arabs, it is associated with both susceptibility and protection, depending on the haplotype context (Table 2). Therefore, the HLA loci associated with either susceptibility or protection to T1D will be referred to in this manuscript in terms of haplotypes, not alleles.

Table 2 Contribution of HLA haplotypes in susceptibility and protection to type 1 diabetes in Arab population compared to other ethnic groups

Among Arabs, the DRB1*030101-DQB1*0201 haplotype is associated with increased susceptibility to T1D among individuals from Bahrain, Lebanon, and Tunisia [45, 48, 49], similar to Caucasians (Table 2) [46, 47], while the DRB1*040101-DQB1*0302 haplotype confers strong susceptibility in Tunisians and Bahrainis [49], shows a weak association in Caucasians [46, 47] and is protective among Lebanese [48]. However, the DRB1*040101-DQB1*050101 haplotype is protective among Lebanese but confers susceptibility to T1D in Bahrainis and is unknown in different ethnicities (Table 2). The strongest association between HLA region and T1D among Egyptians is observed with DRB1-DQA1-DQB1 haplotypes: the DRB1*0301-DRB3*0201-DQA1*0501-DQB1*0201 [51] and DRB1*04:02-DQA1*03-DQB1*03:02 [52] haplotypes confer susceptibility to T1D, and DRB1*04:03-DQA1*03-DQB1*03:02 is a resistant haplotype [52]. Both the protective and susceptible haplotypes are unique to the Egyptian population and have not been described in other ethnicities. In contrast, Moroccans seem to share the susceptible (DRB1*08-DQA1*0401-DQB1*0402) and protective (DRB1*07-DQA1*0201-DQB1*020 and DRB1*03-DQB1*02) haplotypes [53] with other Mediterranean populations, such as Algerians [63], Tunisians [49], French [54], and Spanish [47] populations. Although Arabs share several haplotypes with different ethnic groups (Table 2) [44, 46, 47, 54, 56, 5860], they still have unique haplotypes which confer susceptibility and protection to T1D [45, 4852, 62], which are distinctive from other ethnic groups (Table 2).

Among Arab patients with T1D, the c.49A>G polymorphism of the CTLA4 gene in the homozygous form (G/G genotype) is found to confer an increased risk for patients originating from Egypt [64, 65], Tunisia [66], Lebanon [67], and Morocco [68]. Interestingly, all Lebanese patients carrying the G/G genotype developed an early onset T1D compared to the heterozygous A/G genotype [69]. Among Tunisians, the rs3181096 gene polymorphism of the C allele of CD28 was associated with protection against T1D in the heterozygous form C/T and was significantly associated with the disease in homozygous CC carriers. Other variants in the PTPN22, TCRβ, CD3z, ZAP70 [70•], IL15, and BANK1 [71•] genes were found to be significantly associated with genetic susceptibility to T1D among Tunisians.

Homozygosity and Susceptibility to Type 1 Diabetes among Arabs

Although the pattern of inheritance of the HLA susceptibility genes is still debatable, there are studies that support mixed inheritance patterns that are either dominant or recessive [4, 6]. Others support the recessive inheritance theory [72, 73]. The DRB1*0405-DQB1*0401 haplotype is significantly higher in homozygous than in heterozygous alleles in both Japanese and Koreans patients [74], and the high-risk haplotype DRB1*04-DQB1*0302, which is found frequently in Caucasian patients, is associated with higher susceptibility to T1D in homozygous than heterozygous combinations, which has been noted in Filipino patients [75]. Among Tunisians, homozygosity for the DRB1*030101-DQB1*0201 haplotype represented the highest-risk genotype, while homozygosity for the DRB1*070101-DQB1*0201 haplotype was associated with a significantly decreased risk [50]. The frequency of the homozygous DRB1*03011-DQB1*0201 haplotype was higher in Bahraini than in Lebanese patients [48], and the homozygous haplotypes (DRB1*0301-DQB1*0201 and DRB1*0405-DQB1*0302) were strongly associated with T1D among Saudi patients [61].

The aspartic acid residue 57 of the HLA-DQB1 was found to be strongly associated with protection against T1D [76]. Haider et al. [77] studied the prevalence of the polymorphic amino acids at position 57 of the HLA-DQB1 allele in Kuwaiti children with T1D and found that the homozygous Ala/Ala is more prevalent among patients (34/55; 62 %) than heterozygotes with several combinations (19/55; 35 %). The study reported a high prevalence of homozygous Ala/Ala in different ethnic groups, such as Caucasians, African Americans, Chinese, Sardinians, and Norwegians, and it is the highest prevalence reported thus far.

To date, the studies performed to link the homozygous HLA haplotypes with susceptibility to T1D identified only 132 patients with homozygous HLA genotypes, and most of them are in Saudi Arabia (83/132); however, there is no mention in these studies of the familial relationship of the patients [61]. Endogamy is close to 100 % in Saudi Arabia and might be responsible for this frequency. Only 14 individuals had homozygous haplotypes, which were associated with protection against T1D in Lebanon and Tunisia.

Although the heterozygous combinations of several alleles and HLA haplotypes have been reported to correlate with increased susceptibility to T1D compared to homozygosity [6, 78, 79], the homozygosity of HLA haplotypes and the class I homozygosity of INS-IGF2 5ʹVNTR are responsible for the increased risk of T1D among Caucasian patients originating from Poland [80]. Because the homozygous and heterozygous HLA haplotypes in patients with T1D are in Hardy-Weinberg equilibrium, there is strong evidence suggesting that HLA susceptibility for T1D follows simple recessive inheritance [81]; however, this hypothesis has been rejected due to the excess of heterozygous rather than homozygous haplotypes in patients with T1D [79]. This is a perplexing scenario and needs further research to pinpoint the exact mode of HLA inheritance. Inbred unions among Arabs with a long familial history with T1D could provide a solution with a clearer segregation pattern of HLA haplotypes and therefore carry the potential to provide valuable insight to decode these situations.

Because approximately 50 % of the genetic risk of T1D is conferred by HLA genotypes, that leaves some 50 % which could be caused by non-HLA genes. The homozygosity of these genes, which is mostly observed in inbred communities, could provide valuable insight into disease prognosis and pathogenicity. For example, the c.49A>G polymorphism of CTLA4 gene was found to confer the highest susceptibility to T1D when observed in homozygous Arab patients from Tunisia [66], Egypt [64, 65], and Lebanon [67]. In this regard, this study will aid in discovery of novel genes and genetic variants that might be associated with T1D disease development and subsequently therapeutic development for T1D.

Interestingly, the patients in the studies reported here are mostly referred to as unrelated. The number of patients with T1D screened in this report is 1235 patients with T1D versus healthy controls of 1684 [45, 4853, 55, 57, 62, 63, 77]. The observed homozygosity might be due to the endogamous practices among Arabs, which reach approximately 100 % in Gulf States. For example, women in Saudi Arabia are prohibited to marry men other than Arab men from the Gulf countries without special dispensation from the king (http://web.archive.org/web/20120614045804/http://travel.state.gov/travel/cis_pa_tw/tw/tw_931.html), and men require a government permit to marry a foreign woman, which is applicable to the six Gulf States. The prevalence of endogamy in other Arab countries is very high due to deep-entrenched, centuries-old traditions, which strongly favor marriage within the same Arab subcultures. Investigating the genetic etiology for T1D among even closer relatives and inbred Arab communities with high disease incidence is anticipated to provide more instructive homozygous combinations.

Genotype-Phenotype Correlation of Type 1 Diabetes

Establishing meaningful genotype-phenotype relationship for patients with T1D is important for prognosis and treatment of the disease. The ability to perform high-resolution HLA genotyping using deep sequencing [82•], screen for variants in potential candidate genes that might contribute to the pathogenicity of T1D, and map variations on the crystal structure of both susceptible HLA epitopes [83] and protective loci [84] has paved the way to sorting out the intricacies of the genotype-phenotype correlation for patients with T1D. At the present time, diagnosed patients with T1D are treated with insulin injections. However, studying inbred Arab communities may allow high-risk families to be offered heightened surveillance before onset based on carrier screening of genetic risk polymorphisms segregating in their families, or immunomodulatory prevention if such a strategy proves successful in the future.

The correlation between the different HLA genotypes and the clinical phenotypes of T1D is still enigmatic and has several challenges, including the population-specific haplotypes, geographical variation [85], etiological heterogeneity, and the complex nature of the highly polymorphic HLA loci. Therefore, the idea of reducing the complexity of HLA is quite appealing to help us to understand the HLA mode of inheritance and provide insights into the complex nature of these correlations. Consistent with this concept, the non-obese diabetic (NOD) mice, that spontaneously develop T1D and have served as a useful animal model for understanding the immunological foundation, genetic basis, and treatment, of T1D [86], are genetically homogenous (created from inbred colonies), while humans are diverse; therefore, these mice are very useful for studying the correlation between the genetic variations and clinical phenotypes of the disease. The human equivalents are those Arab inbred communities living in areas where the incidence and prevalence are very high and that are considered to be genetically homogenous. Unfortunately, one of these mines has not been earnestly tapped yet in the Arab world. However, the fear of spreading genetic diseases in Arab countries due to the prevalent consanguineous mating and endogamy has ignited a genomic revolution, which is being witnessed particularly in the Gulf region.

Genome Arabia

This Arab genomic revolution started recently with launching the “Genome Arabia” for example, Saudi Arabia started a genome project (SGP) in 2013 to sequence 100,000 Saudi genome (http://www.bbc.com/news/health-25216135). SGP aims to identify Saudi-specific genetic variants, which are linked to high incidence genetic diseases in Saudi Arabia, such as diabetes, deafness, cardiovascular, cancer, and neurodegenerative diseases. The SGP’s task is to establish a genotype-phenotype correlation for genetic disease and enable the creation of a foundation for personalized medicine. This will allow for therapeutic development for genetic disease based on the DNA blueprint of each Saudi individual, which will reduce the cost of healthcare, as the health care expenses in Saudi Arabia is related to human genetic disease tops US$30 billion annually. A few days after the SGP announcement, Qatar announced the intention to launch the Qatar Genome Project (QJP) and their plan to sequence all of their population (~300,000 citizens). Recently, they started a pilot sequencing sample of 3000 Qatari citizens (http://www.qatarbiobank.org.qa). As the SGP, the QJP, seeks for future protection of Qatari citizens from the spread of genetic diseases due to the endogamous and consanguineous marriage culture, and to facilitate future molecular diagnosis of coming generations in Qatar, where diabetes is a top priority disease in the Qatar national health strategy. The wealth of these data will, in the long-term, have global impact to understanding the molecular pathology of T1D.

Inbred Arab Communities and Rare Variant Discovery

A practical example for the benefit of studying the inbred communities among Arabs was provided by Verge et al. [87], who analyzed an inbred Bedouin Arab community living in Israel with a long history of first-cousin marriage. The family consisted of 19 close relatives affected with T1D, who carried rare predisposing haplotypes to the disease that have not been found in other families. A novel T1D susceptibility locus (IDDM17; MIM#603266) was mapped to chromosome 10 (10q25.1) [88] using linkage analysis of the 19 affected relatives. Of the 19 patients, 16 harbored a common haplotype (DRB1*0301-DQA1*0501-DQB1*0201) and an unusual haplotype, DRB1*0301-DQA1*0102-DQB1*0502 [87]. Interestingly, this common haplotype exists in different ethnic groups, including Caucasians [6, 47], and in 23 relatives within this extended family with T1D [89]. The unusual haplotype occurred disproportionately in individuals who test positive for islet cell autoantibodies. Interestingly, the Bedouin Arabs show a significantly lower frequency of T1D compared to the Israeli population; however, with the high rates of consanguinity (~50 %) and endogamy (~100 %) among Bedouin Arabs, these rare haplotypes have a much greater chance to occur, causing susceptibility to T1D. This is probably applicable not only to T1D but to all complex diseases, where “genetic simplicity” might provide significant insights into the genetic etiology of the disease through studying large multiplex families from homogenous populations.

Conclusive Versus Inconclusive Genotype-Phenotype Correlation for Type 1 Diabetes

Among Arabs, the haplotype DRB1*030101-DQB1*0201 (Table 2), which is shared with Caucasians, confers strong susceptibility to T1D in individuals from Bahrain, Lebanon, and Tunisia [4549]. Unfortunately, in the “genetically wealthy” Arab states, where diabetes is a chronic problem, there are not enough reports to identify a conclusive genotype-phenotype correlation for patients with T1D. Nonetheless, a meaningful risk correlation could be drawn for the HLA haplotypes in Japanese for the high-risk haplotypes DRB1*0405-DQB1*0401 and DRB1*0901-DQB1*0303 [74, 90] and in Caucasians for the haplotypes DRB1*0301-DQB1*0201 and DRB1*0401-DQB1*0302 [46, 47].

However, this correlation is not straightforward for several reasons. Different haplotypes have different degrees of susceptibility and/or protection in different ethnic groups; for example, the DRB1*040101-DQB1*050101 haplotype is protective among Lebanese but susceptible or neutral among Bahrainis [48] and unknown in different ethnic groups (Table 2). This complicates the characterization of such relationships and might suggest other interfering genetic relationships, including epistatic gene interactions or genetic modifiers that might vary among different ethnic groups. Another challenge to these correlations pertains to the nature of the interaction of an uncharacterized number of genes with unidentified environmental factors, which together, cause the disease phenotype. Moreover, the incomplete penetrance and the missing heritability data may complicate such correlations, and finally, the complexity and lack of complete understanding of the mode of inheritance of HLA haplotypes still represent a barrier to the understanding of this relationship.

Looking Forward

Children with T1D represent a significant burden not only for their families but also for the healthcare systems in all countries, even for governments that have the means for treatment and intervention. Therefore, one of our priorities is to increase the predictability of the disease to improve the prognosis and treatment of T1D. This should start with a clear understanding of the nature of the relationship between the genetic variants, different HLA genotypes and the phenotypic characterizations and associated complications of T1D. However, large numbers of patients must be scrupulously characterized for their clinical presentation, disease-associated complications, and genetic background; it is likely that genome-wide association studies, using targeted deep sequencing and/or whole genome sequencing of well-characterized Arab patients, belonging to inbred communities with family history of the disease, in areas of elevated incidence and prevalence, will lead to the discovery of novel genes, rare variants, and modifier or epistatic genes. This may help in sorting out the mysteries of such correlations, leading to a clear-cut understanding of the HLA mode of inheritance and to novel insights of the molecular pathogenesis underlying T1D. Once novel-associated genetic factors of T1D have been identified, their direct contribution to the pathogenicity of T1D will still need to be validated.

Unfortunately, in the wealthy genetically Arab communities, there are rich genetic scenarios that are buried and still need to be discovered. Very few studies have dealt with the genetic risk factors that make Arabs susceptible to T1D. Policymakers and lead scientists in the Arab world, together with the international funding organizations that are interested in limiting the global spread of diabetes, should recruit more funds and carry out more field studies to discover what the genomic studies in Arab societies could offer by examining inbred Bedouin and tribal communities where disease prevalence is high. This should teach us a great deal about the genetic etiology of T1D, promoting a meaningful genotype-phenotype correlation to improve disease prognosis, increase the predictive ability for diagnosing T1D prenatally and in newborns, and provide effective molecular diagnostic approaches to the disease.

There is an urgent need to create a nationwide Arab registry for patients with diabetes, which can separate the data for both T1D and T2D and would provide the basis for further epidemiological efforts. The registry should aim to enroll each Arab family with diabetes in a single centralized database, which must be strictly regulated to protect patients’ privacy rights. It will collect patients’ information related to family history, including clinical, genetics, and epidemiological data. Establishing such a registry is important for reducing the health care cost and in saving not only Arab lives but lives on a global scale now and in the future.

Conclusion

This review article sheds light on the importance and benefits of studying the Arab population, since there is a high level of inbreeding which can result in recessive T1D risk variants having a large effect. Such populations would be useful not only for assessing the role existing T1D risk variants but also for identifying novel risk variants through linkage analysis, due to the availability of suitable sample groups to study and search for genetic determinants of T1D. Arabs represent an interesting mix of many other ethnicities across their history, which allowed for gene flow among Arabs, resulting in a more diverse gene pool. This admixture includes, but is not limited to, Caucasians, Africans, Iranians, Indians, Berbers, and Asians, and this was observed in shared susceptible and protective haplotypes. However, it appears that Arabs have a distinctive molecular genetics profile that confers either susceptibility or protection to T1D.