Introduction

Natural killer (NK) cells are critical components of the innate immune system and they are controlled by many activating and inhibitory receptors (Colucci et al. 2002; Lanier 2005), including members of the killer cell immunoglobulin-like receptor (KIR) family. The KIR region exhibits an extensive degree of diversity, which it achieves through a combination of variable gene content and allelic polymorphism (Carrington and Norman 2003). Population studies using genomic DNA to determine the presence or absence of each KIR gene have demonstrated that across individuals, KIR gene content can vary widely. Evaluation of differences in nucleotide sequences have documented an additional dimension of KIR diversity achieved through allelic polymorphism (Shilling et al. 2002). Indeed, estimates of the extent of KIR genotype diversity within the population suggest that <0.24% of unrelated individuals can expect to have identical KIR genotypes (Shilling et al. 2002).

While gene content varies across humans, the framework genes 3DL3 and 3DL2 at either terminus of the region and 2DL4 in the middle of the locus are consistently observed in all individuals (Wilson et al. 2000). KIR haplotypes can be divided into two basic groups, A and B, both of which are found in most populations (Denis et al. 2005; Hsu et al. 2002; Jiang et al. 2005; Norman et al. 2002; Rajalingam et al. 2002; Toneva et al. 2001; Uhrberg et al. 1997; Velickovic et al. 2006; Whang et al. 2005; Yawata et al. 2002) but sometimes at very different frequencies. Haplotype A, which is estimated to be as frequent as 47–59% within Caucasian populations, comprises a common complement of KIR genes: 3DL3, −2DL3, −2DL1, −2DL4, −3DL1, −2DS4, and −3DL2. In contrast, haplotype B varies in gene content and, in general, contains more activating receptor genes (2DS and 3DS genes) than the A haplotype (Carrington and Norman 2003). Frequencies of each gene can vary remarkably across world populations, which might reflect differential selection pressures as well as persistence of ancestral genotypes. KIR and HLA ligand combinations also appear to be important in pathogenesis of several autoimmune diseases, cancers, viral infections, and reproductive fitness (Carrington and Martin 2006; Parham 2005).

The study of KIR gene and profile (the gamut of KIR genes present in a given individual irrespective of copy number) variability across distinct populations might predict the extant diseases to which a given population may be particularly resistant or susceptible. In India, KIR genotypes have been reported only in North Indian Hindus (Rajalingam et al. 2002). The present study describes KIR genotypes in Maharashtrians and Parsis who have been geographical neighbors for several centuries, but who are of distinct ethnic backgrounds; Parsis being of Persian descent and the Maharashtrians being natives of India. The Parsis have maintained their ancestral, cultural, and religious practices, with a strong tendency to marry only within the Parsi Zoroastrian community. We tested for the presence/absence of 15 KIR loci in unrelated, healthy Parsi (n = 145) and Maharashtrian (n = 139) women and compared characteristics of the Parsi KIR locus to that of Maharashtrians and the previously reported North Indians (Rajalingam et al. 2002). The study of Parsis is of particular importance, as this community is facing dwindling numbers and high incidence of cancer, neurodegenerative diseases, autoimmune diseases, and low birth rates (Desai 2004).

Materials and methods

Study subjects

Blood was collected from 145 Parsis (Zoroastrian) from Mumbai (Maharashtra, India) and 139 Maharashtrians (Marathi-speaking Hindus) residing in Mumbai with their informed consent. All individuals were healthy and unrelated. Peripheral blood lymphocytes (PBLs) were separated on Ficoll Hypaque mixture (Boyum 1968).

DNA extraction

DNA was extracted from 5 × 106 PBLs using phenol–chloroform-based extraction followed by precipitation of DNA with ethanol. DNA was washed in 70% ethanol and then dissolved in TE buffer (Wallace 1987).

Polymerase chain reaction (PCR)

Genomic DNA was amplified for presence or absence of the following genes: 2DL1, 2DL2, 2DL3, 2DL4, 2DL5, 2DS1, 2DS2, 2DS3, 2DS4, 2DS5, 3DL1, 3DL2, 3DL3, 3DS1, and 2DP1. Genotyping was performed using PCR amplification with two pairs of locus-specific primers (PCR-SSP) as previously described (Martin and Carrington 2007). Amplification control primers that amplify a 796-bp fragment of the third intron of DRB1 were also included in each PCR to confirm robust PCR. Amplification was performed in a volume of 5 μl containing 200 μM dNTP, 500 nM primer, 1.5 mM MgCl2, 20 mM Tris–HCl, pH 8.4, 50 mM KCl, and 0.5 U of Platinum Taq DNA polymerase (Invitrogen). Cycling was performed as follows: 2 min at 94°C; five cycles of 94°C for 5 s, 65°C for 15 s, 72°C for 30 s; 21 cycles of 94°C for 5 s, 60°C for 15 s, 72°C for 30 s; four cycles of 94°C for 5 s, 55°C for 1°min, 72°C for 2 min; and a final extension step of 10 min at 72°C. PCR products were electrophoresed in 3% agarose gels containing ethidium bromide, and predicted size products were visualized under ultraviolet light.

Statistical analysis

LD coefficients for two-locus associations were calculated from unrelated Parsi and Maharashtrian individuals according to Imanishi et al. (1991). The statistical significance of LD was calculated using Fisher’s exact test. p values for the comparison of carrier frequencies were corrected using Bonferroni’s method based on the number of genes compared. Figure 2 presents three panels from a study of activating KIR and their corresponding ligands (Single et al. 2007). Results for the Parsis, Maharastrians, and North Indians are superimposed upon the global distribution. Each colored circle represents a distinct population, with the colors identifying the geographic origin of the populations. The solid line represents the estimated regression line from the Single et al. study.

Results

The comparison of highly polymorphic genomic regions between the Parsis and Maharashtrians serves as a unique opportunity to test for signatures of locale-specific selection vs. ancestral ties over a fairly short period of time. The Parsis of Mumbai, sampled here, migrated to this region from Gujarat (the region of their original immigration into India) in the late seventeenth century and have maintained their ancestral genetic roots by avoiding marriage outside of the Parsi community and disallowing conversions to the faith. Thus, similarity between the two groups in frequencies of KIR genes might suggest selection for these genes, whereas distinctions may be better explained by ancestry. KIR3DL2, 3DL3, 2DL4 were present in all individuals from both populations, a characteristic that is common across the populations that have been typed to date (Wilson et al. 2000). KIR2DL1 and 3DL1 were also very common in both Parsis and Maharashtrians; Table 1). The frequencies of all other KIR were comparable in the two populations, with the exception of KIR2DS1 and KIR3DS1, the latter of which was significantly higher in Parsis than in Maharashtrians after correction for multiple tests (p < 0.001; Table 1). Thus, in general, the Parsis and Maharashtrians showed remarkably similar KIR gene frequencies except for two activating KIR that are in strong linkage disequilibrium with one another (Cook et al. 2003; Norman et al. 2001).

Table 1 Comparison of carrier frequency of 15 KIR genes in three Indian populations

The North Indians genotyped previously for KIR (Rajalingam et al. 2002) were, in general, more distinct from Parsis and Maharashtrians than the latter two were from one another (Table 1 and Fig. 1), again, with the exception of activating 3DS1 and 2DS1. In particular, the frequencies of 2DL1 and 2DL3 were significantly higher in Parsis and Maharashtrians than in the North Indians, although the Parsi–North Indian comparison was not significant after adjusting for multiple tests.

Fig. 1
figure 1

KIR genotype profiles and their frequencies in Parsis, Maharashtrians, and North Indians. A total of 19 distinct profiles were observed in Parsis and 24 in Maharashtrians. The frequencies for these profiles are compared with those of the 47 distinct profiles in North Indians reported by Rajalingam et al. (2002). Presence (shaded boxes) and absence (white boxes) is indicated for 12 KIR loci. The framework genes 3DL3, 3DL2, and 2DL4 were present in each individual and are not shown in the figure. 2DP1 has also been excluded because it was not typed in the North Indians. The genotype profiles are organized by the degree to which they are shared among the three populations

KIR gene profiles represent a listing of all genes present in a given individual, without regard to copy number, and serve as an alternative molecular genetic characteristic when segregation (i.e., haplotypic) analysis is not an option. KIR profiles were compared among the subjects for whom all fifteen KIR genes were typed unambiguously (137 Parsi women and 118 Maharashtrian women). The number of distinct profiles in Parsis, Maharashtrians, and North Indians was 19, 24, and 47, respectively, supporting cohesiveness of the Parsi community and a very genetically diverse North Indian population (Fig. 1 and Rajalingam et al. 2002). The most common profile by far, in both the Parsis and Maharashtrians, was indicative of homozygosity for the A haplotype: 3DL3–2DL3–2DL1–2DL4–3DL1–2DS4–3DL2 (profile #17; 24.1%, and 23.7%, respectively). This profile was observed at a frequency of only 5.6% in the North Indians, whose KIR profiles were exceptionally diverse and evenly distributed in general (Rajalingam et al. 2002). The second most common profile in Parsis, 3DL3–2DS2–2DL2–2DL3–2DL1–2DL4–3DL1–3DS1–2DL5–2DS3–2DS5–2DS1–2DS4–3DL2 (profile #15), contained all 15 KIR genes (f = 17.5%), and this profile was also common in the Maharashtrians (11.9%), but was missing from the North Indian sample (Table 2).

Table 2 Frequencies of HLA ligand groups

The profile 3DL3–2DS2–2DL2–2DL3–2DL1–2DL4–3DL1–2DL5–2DS3–2DS4–3DL2 (profile #18), was observed at an even greater frequency in the Maharashtrians (13.6%), but was observed in only three Parsis (2.2%). This was the most common profile in North Indians (6.9%) and it represents the only clear case in which the Parsis were more similar to the North Indians than to the Maharashtrians. This is the second most common KIR genotype in populations of African origin, including African American, Afro-Caribbean, Guadeloupe Caribbean, and Senegalese (11–17%) (Denis et al. 2005; Du et al. 2007; Norman et al. 2002). KIR3DS1 is missing in this profile and the low frequency of the profile in the Parsis parallels the high frequency of KIR3DS1 in the Parsis relative to that in the other two Indian populations. There was one individual with the profile 3DL3–2DL3–2DL1–2DL4–3DL1–3DS1–2DS4–3DL2 (profile #7), which is identical to the A haplotype except for the presence of 3DS1. This profile has been observed at a frequency of 6% in French, 4.2% in Basque, and 1–2% in Han Chinese, Hispanics, American Caucasians, Finnish, Guadeloupe Caribbean and a population from the island of Reunion (Denis et al. 2005; Du et al. 2007; Jiang et al. 2005; Santin et al. 2006).

LD between pairs of KIR genes in each of the three populations also reflected the endogamy among the Parsis relative to the North Indians. In particular, 38 pairs of KIR genes were in significant positive or negative LD in the Parsi sample, 31 in the Maharashtrian sample, and only 19 in the North Indian sample (Tables 3 and 4; LD pairs that include 2DP1 were excluded from Table 4 because it was not typed in the North Indians). Further, only one of the significant LD pairs in the Maharastrians was unique to that population. There were eight pairs of KIR genes that showed significant LD in the Parsis but not in the Maharashtrians or North Indians (2DL2–3DS1, 2DL3–2DS4, 2DS2–3DS1, 2DS3–2DS1, 2DS3–2DS5 in positive LD; 2DL3–2DS1, 2DL3–2DS5, 2DL3–3DS1 in negative LD). Five of these pairs involved the 2DS1 or 3DS1 genes, which had quite different carrier frequencies in the two populations and can be attributed to genotypes carrying both 3DS1 and 2DS1 (particularly profiles 21, 22, 23, and 26), which are highly represented in the Parsis compared to the other two groups. Interestingly, in the Parsi population, KIR3DS1 was in significant LD with both KIR2DL2 and KIR2DL3, which segregate as alleles of a single locus. This situation was unique to the Parsis and, as far as we know, has not been observed in other populations. The lack of significant LD between KIR2DL3 and KIR2DL1 in the Parsis was also notable, as these genes are closely linked and in strong LD across other populations that have been tested including the Maharashtrians and the North Indians. Like the KIR gene frequency and the KIR profile frequency results, the LD analysis underscores a closer genetic relationship between the Parsis and Maharashtrians than either of these two groups to the North Indians, with the exception of the KIR3DS1 analyses, which clearly distinguish the Parsis.

Table 3 Linkage disequilibrium analysis for 15 KIR genes in Maharashtrians and Parsis
Table 4 KIR locus pairs in significant linkage disequilibrium (LD) for Parsis, Maharashtrians, and North Indiansa

Discussion

The Parsis of India are of Persian descent (Nanavutty 1997), settling in the Indian province of Gujarat around 900 A.D. and eventually moving to the state of Maharashtra. They follow the Zoroastrianism faith, which was once the official religion of the Persian Empire over 2,000 years ago. Although the Parsis have integrated themselves into Indian society, they do not marry outside of the Parsi community and conversions to the faith are strictly unacceptable. Thus, they have remained a closely knit, endogamous community, although there is evidence of admixture with native Indian groups. The Maharashtrians are natives of the Western Maharashtra region since ancient times, distinguished mainly by the language they speak, Marathi. They are far less homogeneous genetically than their Parsi neighbors and constitute a mixture of populations from the North (North Indians) and South (Dravidians) of India. The North Indians are a particularly heterogeneous group derived from the four northern states of Punjab, Haryana, Utter Pradesh, and Delhi. They are distinguished from the Parsis and Maharashtrians by geographical location, linguistics, cultural, and social activities. Further, since Delhi was the capital city of Mogul and British emperors, North India experienced massive racial migrations and admixture, which is likely to have contributed to the high KIR diversity in the North Indians detected previously (Rajalingam et al. 2002).

In this study, we have compared variation across the KIR locus in a sample of Parsis and Maharashtrians, and contrasted these results with published data available for 72 individuals from North India (Rajalingam et al. 2002). The KIR gene frequencies in Parsis did not differ significantly from those in the Maharashtrians except for KIR3DS1 and KIR2DS1, only the former of which remained significant after correction for multiple tests. However, there were 24 distinct KIR profiles identified in the Maharashtrians and only 19 in the Parsis in spite of the larger Parsi sample size (n = 137 for Parsis; n = 119 for Maharashtrians), emphasizing the limited information provided by gene frequency alone in distinguishing populations at the KIR locus. LD values between pairs of KIR genes also supported greater homogeneity at the KIR locus in the Parsis as compared with the Maharashtrians. In general, however, the Parsis and Maharashtrians do show strong genetic similarity at the KIR locus, a phenomenon that is most clearly substantiated by the observation that KIR profiles distinct to one group or the other were observed in that population only one to three times, with the exception of a profile that was observed eight times in the Maharashtrians and was missing in the Parsis. Interestingly, this latter profile was missing KIR3DS1, a gene found at high frequency in the Parsis. KIR gene profiles and LD analysis emphasize the striking diversity of the KIR locus in North Indians relative to both the Parsis and Maharashtrians. Overall, the results suggest that there is greater variability at the KIR locus in Maharashtrians as compared to Parsis, but that these two populations are more similar to one another than either one is to the North Indians.

KIR frequency data from an Iranian sample has not been reported to date, so it was not possible to determine whether the Parsi KIR frequencies are more similar to Maharashtrians or the descendants of the Persian ancestors from whom both the Parsis and Iranians stem. KIR frequency data from a sample of 120 Arabs, which represents the closest group to the Iranians geographically that has been typed so far (Mahfouz et al. 2006), suggests that Parsis are more similar to Maharashtrians than they are to Arabs. Notably, however, the Parsis have a significantly higher frequency of KIR3DS1 (62%) than do the Maharashtrians (37%), North Indians (39%), and the Arab sample (36%; Table 1 and Mahfouz et al. 2006; Rajalingam et al. 2002).

A study of Y chromosome markers has indicated that the Parsis are genetically closer to Iranians than they are to their geographical neighbors in Pakistan (Qamar et al. 2002), the other primary residence of the extant Parsi population. Conversely, a more recent study of mitochondrial DNA (matrilineal as opposed to the patrilineal Y chromosome markers) showed that Parsis display greater genetic similarity with Gujaratis in India than they do with Iranians (Quintana-Murci et al. 2004). Thus, male ancestors of the extant Parsi population may have admixed with Gujarati females, perhaps early after their migration to India and to a degree that led to the replacement of Persian mitochondrial DNA with that of Indian origin (Quintana-Murci et al. 2004). If so, nuclear DNA of present-day Parsis would be expected to be an admixture of ancestral Parsi and Western Indian background.

Based on HLA allele frequencies, the Parsis living in Maharashtra are more closely related to Parsis living in Karachi, Pakistan than to their geographical Indian neighbors (unpublished data, S. Kulkarni, S. Chhaya, R. Badwe and N. Joshi). In fact, amongst the various world populations analyzed, the Parsis were the only group that was phylogenetically separated from its geographic cluster. Such a separation is unlikely to be observed using a KIR dataset since Parsis are not easily distinguished from the Maharashtrians with regard to the KIR locus. However, the difference in KIR3DS1 frequency among the Parsis, which is the only significant distinction between the Parsis and Maharashtrians at the KIR gene complex, may actually be related to the difference in their HLA-B frequencies relative to other Indian populations. We have recently reported an inverse correlation between activating KIR gene frequencies and that of their HLA class I ligand (or putative ligand) group, especially for KIR3DS1 and HLA-B Bw4-80I (Single et al. 2007). Interestingly, the gene frequency of the Bw4-80I grouping was low in Parsis (11.1%) compared to Maharashtrians and North Indians (21.5% and 29.5%, respectively), corresponding with the higher frequency of KIR3DS1 in Parsis (62.2%) relative to the other two native Indian groups (37% and 39%, respectively; Fig. 2). A similar pattern was observed for KIR2DS1 and its ligand group, HLA-C group 2, particularly between Parsis and North Indians, although the inverse correlation for this receptor–ligand pair across all three groups was not as striking as that for KIR3DS1 and Bw4-80I. In general, these data reflect those involving worldwide frequency correlations between activating KIR and their HLA ligand groupings reported previously (Single et al. 2007) and the high frequency of KIR3DS1 in the Parsis may be a function of the low frequency of HLA-B Bw4-80I in this population.

Fig. 2
figure 2

KIR and HLA ligand frequencies—correlations in a global context. Results for the Parsis (P), Maharashtrians (M), and North Indians (NI) are superimposed upon the global distribution reported in Single et al. (2007). There is a strong negative correlation between the carrier frequencies for activating KIR3DS1 and gene frequencies for its putative HLA Bw4 ligand group (a). This correlation is even stronger for the subset of Bw4-80I alleles (b). Differences between the Parsis and the other two populations are most evident for this subset of Bw4 alleles. The activating KIR2DS1 also shows a negative correlation with its C-group2 ligands (c). Each colored circle represents a distinct population, with the color corresponding to the geographic region, and the solid line represents the estimated regression line from Single et al. (2007)

HLA class I does not evolve as quickly as the KIR locus (Hao and Nei 2005; Khakoo and Carrington 2006; Khakoo et al. 2000; Vilches and Parham 2002), so it is possible that the KIR3DS1 locus in the Parsi population is under pressure to retain the high frequency observed in the founding Parsi ancestors such that an appropriate balance between KIR3DS1 and its putative HLA ligand group will be maintained. Although the frequency of KIR3DS1 is significantly higher in the Parsis relative to the Maharashtrians, the frequency of individuals with both KIR3DS1 and HLA-B Bw4-80I is much closer in the two populations (10.2% in the Parsis and 18.2% in the Maharashtrians). Further analyses of additional, historically related populations will be of great interest in elucidating the genetic evolution of the Parsi people since their migration east to India and Pakistan.