Introduction

It is well established that CD8+ T-cell activation is triggered through recognition of the MHC class I molecule loaded with an antigenic peptide by an antigen-specific T-cell receptor. The MHC molecules of the mammals including primates are known to influence the outcome of many diseases such as infectious diseases, cancer, and metabolic disorders. HLA class I genes are divided into three different categories, classical (HLA-A, -B, and -C), non-classical (HLA-E, -F, and -G), and pseudogene (HLA-H, -J, -K, and -L), according to their degree of polymorphism and cell surface expression, and the presence of orthologues of the human HLA-A, -B, -E, -F, and -G genes were identified in several species of the Old World monkeys (Alvarez et al. 1997; Boyson et al. 1996a, b; Evans et al. 2000; Lafont et al. 2004; Otting and Bontrop 1993; Prilliman et al. 1996; Sidebottom et al. 2001; Uda et al. 2004). Cynomolgus monkeys as well as rhesus monkeys are preferentially used for biomedical research; however, cynomolgus MHC class I was not extensively studied compared with those in rhesus monkeys. We have previously reported the nucleotide sequences of cynomolgus MHC class I A locus and have shown that at least 14 Mafa-A alleles were present in cynomolgus monkeys (Uda et al. 2004). Although the MHC class I B locus is the most polymorphic MHC locus in primates, little information is available concerning the MHC class I B locus of cynomolgus monkeys. In this study, therefore, we have expanded our analysis on cynomolgus MHC class I genes and identified 26 B locus alleles by analyzing 16 monkeys. We have also found the presence of a novel locus that is very similar to MHC class I I locus recently identified in rhesus monkeys.

Materials and methods

Animals

All cynomolgus monkeys were raised and reared in the Tsukuba Primate Center for Medical Science, the National Institute of Infectious Diseases (NIID). Both genders were involved, and the cynomolgus monkeys were between 5 and 24 years old. This study was conducted in accordance with the Guides for Animal Experiments Performed at the NIID.

RT-PCR and nucleotide sequencing

Preparation of mRNA from peripheral blood mononuclear cells (PBMC) and RT-PCR were performed as described before (Uda et al. 2004). Primers used in this study are listed in Table 1. 5′ MBS and 3′ MBS primers designed to amplify the gene products of the rhesus MHC class I B locus by Boyson et al. (1996b) were also used to amplify the cynomolgus MHC class I B locus. PCR amplification was performed at least twice for each animal. PCR products were cloned into pCR4Blunt-TOPO plasmids (Invitrogen, Carlsbad, Calif., USA) and 48 clones were sequenced by 310 Capillary DNA Sequencer (Applied Biosystems, Foster City, Calif., USA) or 3100-Avant Capillary DNA Sequencer (Applied Biosystems). The Mafa-B nucleotide sequences were assembled with the Contig Manager of the DNASIS pro (Hitachi Software, Yokohama, Japan). The Clustal W algorithm provided in DNASIS PRO was used to align sequences.

Table 1 Primers used for the amplifications and sequencing of MHC class I cDNA from cynomolgus monkeys

Phylogenetic analysis

The full-length nucleotide sequences of Mafa-B, Mafa-I, Mafa-A, Mamu-A, Mamu-B, Mamu-I, HLA-A, and HLA-B were aligned using Clustal W provided online by the DNA Data Bank of Japan [(DDBJ) http://www.ddbj.nig.ac.jp]. A phylogenetic tree of these nucleotide sequences was constructed by the neighbor-joining method of the Molecular Evolution Genetics Analysis, version 2.1 (MEGA 2.1). Genetic distances were estimated using the method of Jules-Canter. At the sites in which alignment indicated a gap, nucleotides at this position in all the sequences were deleted. The reliability of the tree topology was tested by the bootstrap method. Thousand relationships and 64,238 random seeds were used for determining bootstrap values (Fig. 2a, b). Since the bootstrap values of less than 50% were unreliable, the values of less than 50% were not shown in Fig. 2a, b.

GenBank accession numbers

The Mafa-B and Mafa-I sequences described in this manuscript had been deposited in the DDBJ and were assigned accession numbers AB195431 to AB195465. We previously deposited Mafa-A alleles in the DDBJ, and these alleles were assigned accession numbers AB154760 to AB154773. The GenBank accession numbers for other sequences used in this study are as follows: HLA-A*0201, U07161; HLA-B*2702, L38504; HLA-B*5701, AJ458991; Mafa-E*01, U02976; Mamu-A*01, U50836; Mamu-A*02, U50837; Mamu-A*03, U41379; Mamu-A*04, U41380; Mamu-B*02, U41833; Mamu-B*03, U41825; Mamu-B*04, U41826; Mamu-B*05, U41827; Mamu-B*06, U41828; Mamu-B*07, U41829; Mamu-B*08, U41830; Mamu-B*36, AJ556886; Mamu-I*01011, AF161865; Mamu-I*02012, AF161869, Mamu-I*04, AF4161874; Mamu-I*07, AF161875; Mamu-I*08, AF161876; Mamu-I*09, AF161877; Mamu-I*10, AF161878; and Mamu-I*11, AF161879.

Results

Detection of 26 MHC class I B locus alleles and nine I locus alleles in cynomolgus monkeys

To amplify cynomolgus MHC class I B locus genes, PCR was carried out using primers that were successfully used for amplification of rhesus MHC class I B locus genes along with newly designed ones (Table 1; Boyson et al. 1996b). We obtained 48 clones from each animal. The nucleotide sequences that were found in just one clone were excluded from the subsequent analyses to avoid incorporation of artificial sequences generated by PCR error or during the cloning procedure into public databases. Ambiguous sequences were also excluded. When the nucleotide sequence was shared by more than two clones, regardless of whether they were derived from one animal or multiple animals, the sequences were regarded as a consensus sequences representing a particular alleles of each animal. Eventually, 43 candidate alleles were obtained, and 34 of 43 were found to have substantial homology with Mamu-B alleles. Amino acid sequences deduced from the nucleotide sequences of these 34 candidate B alleles were further subjected to phylogenetic analysis using the neighbor-joining method (Saitou and Nei 1987; data not shown). When the predicted amino acid sequence variation between two candidates was negligible (d<0.025), the amino acid sequence shared by a majority of the clones was regarded as representing a particular allele. The other sequence shared by a minority of clones was excluded from the subsequent analyses. As the result of the analysis, 26 Mafa-B alleles were identified. It was found that the remaining nine candidate alleles were closely related to those of Mamu-I locus reported by Urvater et al. (2000b). Since Urvater et al. also identified two Mafa-I alleles (Mafa-I*01011 and Mafa-I*01012), we named tentatively alleles identified here Mafa-I*01013 through Mafa-I*09. The Mafa-I*01013 allele was identical in amino acid sequence with Mafa-I*01011 and Mafa-I*01012, but there were several synonymous nucleotide changes scattered around the sequence. We therefore considered that this alleles was a variant of Mafa-I*01, although reported sequences of Mafa-I*01011 and Mafa-I*01012 were incomplete. The deduced amino acid sequences of Mafa-B and Mafa-I alleles were shown in Fig. 1 along with those of alleles reported for other primates. The total numbers of clones obtained and the numbers of animals having the allele were shown in the figure. The putative glycosylation site was located at residue 86, and the conserved cysteine residues occurred at positions 101 and 164 in α2 and at positions 203 and 259 in α3. To evaluate whether the nucleotide sequences of Mafa-B and Mafa-I alleles established in this study were gene products of class I B and I loci, respectively, Mafa-B and Mafa-I alleles were phylogenetically analyzed (Fig. 2a). The full-length nucleotide sequences of Mafa-B, Mafa-I, Mafa-A, Mamu-A, Mau-B, Mamu-I, HLA-A, and HAL-B were aligned by Clustal W. A phylogenetic tree was constructed by the neighbor-joining method of MEGA2.1 software. The reliability of the tree topology was tested by the bootstrap method, and the bootstrap values are shown in Fig. 2a. Since the bootstrap values of less than 50% were unreliable, the bootstrap values of greater than 50% are shown in Fig. 2a. Several Mafa-B alleles (Mafa-B*21, 22, 24, 25, and 26) appeared to cluster with Mamu-I or Mafa-I allele rather than B locus alleles. Since amino acid difference between alleles of I and B loci were more apparent at the carboxy half of the protein, we reconstructed the phylogenetic tree using the amino acid sequences of the exons 5 to 8. The result clearly showed that these nine alleles clustered with Mamu-I alleles (Fig. 2b). These results strongly suggested that these cDNA clones were derived from distinct alleles on MHC class I B and I loci of cynomolgus monkeys.

Fig. 1
figure 1figure 1figure 1figure 1figure 1

Deduced amino acid sequences of Mafa-B and Mafa-I alleles. Amino acid sequences of HLA-A, HLA-B, Mamu-A, Mamu-B, Mamu-I, Mafa-A, and Mafa-E alleles were also included. Amino acids identical to those of HLA-B*2702 are indicated by dots. The deletions of amino acids are indicated by hyphens. The total numbers of clones obtained and the numbers of animals having the allele were indicated after the allele name

Fig. 2
figure 2figure 2

Phylogenetic analysis of primate class I MHC molecules. The phylogenetic tree was constructed using a full-length and b exon five to eight nucleotide sequences by neighbor-joining method with MEGA2.1. The bootstrap values of more than 50% were shown

Inheritance of Mafa-B and Mafa-I in a family of cynomolgus monkeys

A family consisting of three parents (one sire and two dams) and four offspring was subjected to genetic analysis to study inheritance of Mafa-B and Mafa-I alleles. By nucleotide sequence analysis, ten Mafa-B alleles and four Mafa-I alleles were detected in this family as shown in Table 2. Since certain alleles appeared to be inherited in this family as a complex, we considered those gene complexes as haplotypes and assigned letters A through F to those combinations of alleles (Table 2). Haplotype A (Mafa-B*03 and Mafa-I*09) was detected in 3028, 1113, and 0068, whereas haplotype B (Mafa-B*24 and Mafa-B*25) was carried by 3028, 0079, and 7071 (Fig. 3). Haplotype C (Mafa-B*17, Mafa-B*20, and Mafa-I*06) was found in 1159 and 7071, haplotype D (Mafa-B*01 and Mafa-B*04) in 1159 and 0068, haplotype E (Mafa-B*16, Mafa-I*03) in 3032, 0079, and 1113, and haplotype F (Mafa-B*06, Mafa-B*23, and Mafa-I*02) in 3032 (Fig. 3). We could not detect Mafa-I alleles in monkeys bearing haplotypes B and D. It was evident that Mafa-B alleles were inherited in a Mendelian fashion. Moreover, cynomolgus monkeys in this family were shown to have two to four Mafa-B alleles. The presence of multiple Mafa-B alleles was confirmed by nucleotide sequences analysis of two additional cynomolgus monkeys unrelated to this family. Table 3 showed that 2010 had four Mafa-B alleles and 5076 had three Mafa-B alleles. These results indicated that MHC class I B locus of cynomolgus monkeys was duplicated as in the case of rhesus monkeys (Boyson et al. 1996b).

Table 2 Segregation of alleles with haplotypes
Fig. 3
figure 3

The family pedigree demonstrating the inheritance of alleles of the MHC class I B and I loci of cynomolgus monkeys. Male and female are denoted by squares and circles, respectively. The animal number assigned to the animal is shown. The haplotypes of each animal are given by capital letters

Table 3 The presence of multiple Mafa-B alleles in cynomolgus monkeys

Discussion

Although cynomolgus monkeys are widely used as animal models in a variety of biomedical researches, there are no nucleotide sequence data on cynomolgus MHC class I B locus. In this study, we tried to identify the alleles of cynomolgus MHC class I B locus, using PBMC cDNA from 16 cynomolgus monkeys.

Nucleotide sequence analyses and following phylogenetic analysis identified 26 Mafa-B alleles (Figs. 1, 2a). We also found nine clones with the nucleotide sequences showing high homology with those of Mamu-I alleles. Phylogenetic analysis showed that these clones were derived from nine Mafa-I alleles. It was reported that novel MHC class I I locus in rhesus monkeys, Mamu-I, could be amplified with B locus-specific primers, and that the I locus was recently evolved from a classical MHC class I B locus by duplication (Urvater et al. 2000b).

The haplotypes of rhesus MHC class I composed of at least one A locus and at least two B loci (Boyson et al. 1996b). In cynomolgus monkeys, we previously reported that the A locus had been duplicated, because one to four Mafa-A alleles were found in an animal (Uda et al. 2004). The presence of up to six Mamu-B alleles in a rhesus monkey (Urvater et al. 2000a) indicates that rhesus monkeys have three class I B loci. In this study, we also showed that two to four Mafa-B alleles were present in each individual, strongly suggesting that cynomolgus monkeys have multiple MHC class I B loci. Regarding the I locus, it seemed possible that at least one locus was present in each animal, although some individual appeared not to have the locus. The apparent lack of the I locus in some individual was probably due to low efficiency of amplification of the I locus because of the presence of the multiple B loci.

Information on MHC class I molecule is particularly important in better understanding of pathogenesis of various infectious diseases including HIV infection. So far the nucleotide sequence data are available for the alleles of Mafa-A (Uda et al. 2004), Mafa-E (Alvarez et al. 1997; Boyson et al. 1995), Mafa-G (Arnaiz-Villena et al. 1997; Castro et al. 1996), Mafa-I (Urvater et al. 2000b), Mafa-DRB (Gaur et al. 1997; Kriener et al. 2000; Leuchte et al. 2004), Mafa-DQA (Kenter et al. 1992), and Mafa-DQB (Otting et al. 2002) in cynomolgus monkeys. The identification of Mafa-B alleles would, therefore, greatly help understand the pathogenesis of various pathogens that naturally or experimentally infect cynomolgus monkeys.