Introduction

The major histocompatibility complex (MHC) plays a crucial role in vertebrate adaptive immunity. The genes encoding the MHC class I and II molecules are highly polymorphic and pathogen-mediated selection is generally thought to be a major factor maintaining this polymorphism (Bernatchez and Landry 2003; Doherty and Zinkernagel 1975; Jeffery and Bangham 2000). MHC class I molecules are expressed on all nucleated cells and are involved in presentation of peptides originating from intracellular pathogens (e.g. many viruses), whereas MHC class II molecules only are expressed on immune cells and present peptides derived from extracellular pathogens (e.g. many bacteria). The presentation of non-self peptides on MHC molecules leads to an immune response that ultimately terminates the infection (Janeway et al. 2001).

Mating preferences, for compatibility or “good-genes” (Mays and Hill 2004), guided by MHC-determined olfactory cues has also been suggested to play a role for maintaining the MHC polymorphism (Penn and Potts 1998; Penn 2002; Penn and Potts 1999; Yamazaki et al. 1976). The ability to “sniff out” MHC differences between individuals has been found in various vertebrates including humans (Chaix et al. 2010; Havlicek and Roberts 2009). Mice and rats can distinguish individuals that only differ genetically at the MHC loci (Brown et al. 1987; Yamazaki et al. 1979) and both MHC class I and II loci contribute to personal odours in rats (Brown et al. 1989). In the fish arctic charr, Salvelinus alpinus, and the three-spined stickleback, Gasterosteus aculeatus, MHC class II differences between individuals can be evaluated (Aeschlimann et al. 2003; Olsén et al. 1998). Whether olfactory MHC discrimination is a general feature of considerable evolutionary importance among vertebrates is unclear (Penn and Potts 1998), and for birds this information is by large lacking.

Procellariiform birds (petrels) have a well-developed olfactory system (Bang and Cobb 1968), are long-lived and engaged in lifetime monogamy (Warham 1996), which should make mate choice very important to maximize individual fitness (Bonadonna and Nevitt 2004; Zelano and Edwards 2002). The blue petrel, Halobaena caerulea (Procellariiformes), is a burrow nesting bird with a circumpolar distribution in Antarctic and subantarctic seas, and breeds in dense colonies on small islands close to the Polar Front in the Southern Ocean (Brooke 2004). Daily life of the blue petrel is facilitated by olfaction: food search is odour mediated (Bonadonna et al. 2006) and the blue petrel can identify its underground nest and partner based solely on olfactory cues (Bonadonna et al. 2004; Mardon and Bonadonna 2009; Mardon et al. 2010). In addition, the ability to recognize the partner and avoid self in favour of other individuals in odour choice tests (Mardon and Bonadonna 2009) raises the question whether the blue petrel MHC system could be as important for mate choice/personal recognition as described in some mammals (Penn 2002; Penn and Potts 1999). These characteristics make this species a good model for investigating MHC in birds.

The copy number of MHC genes in birds is highly variable even within orders. The MHC of the galliform red jungle fowl, Gallus gallus, has been shown to be “minimal essential” containing few loci (Kaufman et al. 1999, 1995) whereas the opposite is found in the quail, Coturnix japonica, a bird belonging to the same order (Shiina et al. 2004). Among passerine birds the number of MHC loci is generally higher than for the quail (Balakrishnan et al. 2010; Bollmer et al. 2010; Westerdahl et al. 2000, 2004). The bird order Procellariiformes emerged after Galliformes but before Passeriformes (Hackett et al. 2008) and based on phylogeny we therefore predict the number of MHC loci in the blue petrel to be intermediate to birds in these two orders. Additionally, pathogen pressure and immune investment in birds are lower in colder than in warmer regions, and in marine compared to terrestrial environments (Møller 1998; Piersma 1997). Hence, the pathogen pressure should be comparatively low in the Subantarctics and not selecting as strongly for a multi-loci MHC in the blue petrel as in birds in warmer climates.

Here, we set out to investigate the structural arrangement of the blue petrel MHC in comparison to other birds. This initial characterization is essential for addressing future questions on the role of MHC in mate choice in this species. We analyse both MHC class I and II since odour-mediated discrimination in other species has been shown correlated to either or both MHC classes. More specifically, we examine the number of MHC class I and II B loci, the gene structure and diversity as well as patterns of selection pressure. We base our MHC characterization on both partial cDNA and genomic DNA sequences.

Material and methods

Birds and sampling

A colony of blue petrels, H. caerulea, was studied on Ile Verte, a small island of the Morbihan gulf of Kerguelen, in the southern Indian Ocean (49°51′ S, 70°05′ E), where we yearly followed about 80 nest burrows since 2001. In spite of the fact that the actual population of the island is not estimated, we can consider that all breeding birds in the gulf represent a population of 100,000-200,000 individuals (Weimerskirch et al. 1989). Almost nothing is known about wintering areas of blue petrels. Nevertheless, it has been inferred by diet samples that these Procellariiforms feed in southern Antarctic waters, >1,000 km from the breeding colonies and that blue petrels also foraged on their way back to Iles Kerguelen from these distant foraging grounds (Cherel et al. 2002).

Most birds from the nests we followed are ringed and burrows have been fitted with a closable aperture above the incubating chamber to facilitate capture. Removing birds from the burrow for a brief time does not appear to affect incubation behaviour or the hatchability of the eggs (Bonadonna and Nevitt 2004; Bonadonna et al. 2004). In this study, 25 individuals were included, the individuals were randomly selected and most likely are not members of extended families; hence, the number of individuals should be sufficient for an initial characterization of the blue petrel MHC. Blood samples (0.1–0.5 ml of blood) were collected with syringe from the brachial vein and were kept in Queen’s lysis buffer (0.5 ml; 10 mM Tris–HCl, 10 mM NaCl, 10 mM EDTA, 1% n-lauroylsarcosine, pH 8.0), stored at +4°C until DNA extraction. Additionally, blood (0.1–0.5 ml) was collected for RNA extraction from five individuals in 250 μl K2EDTA (0.2 M) and 500 μl Trizol-LS (Invitrogen, Paisley, UK) was added immediately. The samples were kept at +4°C for approximately 2 months (due to transportation from the remote field site) until RNA extraction.

DNA and cDNA preparations

DNA was extracted from the blood samples, stored in Queen’s buffer, either with standard phenol-chloroform extraction (Sambrook et al. 1989) or with the DNeasy Blood & Tissue Kit (Qiagen, Hilden, Germany) according to the manufacturer’s protocol. The DNA was kept at −20° until used. Total RNA was extracted from the samples collected in Trizol-LS (Invitrogen, Paisley, UK) following the manufacturer’s instructions (carefully collecting only the RNA-containing aqueous phase to avoid DNA contamination) and the integrity of the RNA and the 260/280 ratio was checked. Until cDNA synthesis, the RNA was kept at −80°C. The RNA was reverse transcribed to cDNA using the RETROscript Kit (Applied Biosystems/Ambion, Austin, USA) according to the instructions supplied by the manufacturer.

Sequencing of MHC class I and II B fragments

The main focus in our sequencing efforts was targeting the MHC class I since it has not been described in the Procellariiform order before, whereas the MHC class II B was previously characterized in the thin-billed prion. Pachyptila belcheri (Procellariiformes) (Silva and Edwards 2009). Primers for amplification of MHC class I and class II B genes in the blue petrel were designed (Table 1). A first set of degenerated primers for amplification of 650-bp long MHC class I fragments using cDNA templates, from the middle of exon 2 almost to the end of exon 4, were designed in conserved regions identified in bird MHC sequences available in the GenBank database (Benson et al. 2008). A second primer set for amplification of the variable exon 3 region of class I sequences was designed covering the main part of the exon. Both cDNA and genomic DNA (gDNA) were used as templates. For MHC class II B, a first primer set for amplification of 749-bp long fragments from cDNA, starting at the first base of the coding sequence ending nearly at the end of exon 4, was designed from the full length information of the gene in the closely related thin-billed prion, P. belcheri (Silva and Edwards 2009). Two sets of primers were designed from a number of the 749-bp long sequences resulting from the first class II B amplification, for amplification from cDNA and gDNA of the main part of class II B exon 2 fragments. The MHC fragments were amplified using standard PCR protocols and annealing temperatures specified in Table 1. PCR products were assessed by gel electrophoresis and cloned using the commercially available cloning kits, TOPO TA Cloning Kit with TOPO TA PCR 2.1 vector and One Shot Chemically competent cells (Invitrogen, Paisley, UK) or pGEM-T Easy Vector Systems (Promega, Madison, USA), according to the manufacturers’ instructions. Plasmid clones were transferred to 100 μl of molecular grade H2O in 96-well microtiter plates and immediately lysed at 98°C for 5 min and then frozen at −20°C until analysis. PCR amplifications of the plasmid inserts were performed on the bacterial lysates using pUC/M13 forward and reverse sequence-binding site primers M13F (5′ GTAAAACGACGGCCAG) and M13R (5′ CAGGAAACAGCTATGAC) and standard PCR protocols, followed by assessment of the size and quality of the products by gel electrophoresis. The PCR products were purified and DNA was sequenced using the dideoxy chain-termination method (Sanger et al. 1977). The long (class I, 650 bp; class II B, 749 bp) fragments were sequenced in both directions whereas the class I exon 3 and class II B exon 2 fragments were analysed by single-pass sequencing in the forward direction. Clones containing fragments of expected size were sequenced. From the respective long fragment clonings (class I and II B), 40–50 clones/individual were sequenced for mainly two individuals (Supplementary Table 1). Altogether 313 MHC class I exon 3 clones from 23 individuals (seven out of these individuals were extensively investigated with 20–35 clones/individual) and 92 MHC class II B exon 2 clones from four individuals (maximum 27 clones/individual) were sequenced (Supplementary Table 1). Only presumably functional sequences (without deletions resulting in an incorrect reading frame) verified in two independent PCR reactions were included in further analysis. Before analysing the sequences, the primer sequences were removed. For the data analysis of MHC class I exon 3 and class II B exon 2 sequences, sequence information from the long sequences covering those exons was also included.

Table 1 Primers used for amplification of blue petrel, Halobaena caerulea, major histocompatibility complex (MHC) class I and class II B sequences

Southern blot hybridization

Genomic DNA (8–10 μg) from blue petrel was digested with Pvu II restriction enzyme and then size separated in an agarose gel as described (Westerdahl et al. 1999). The DNA was transferred to a nylon membrane (GE Osmonics labstore, Minnetonka, USA) and two southern blot hybridizations were performed (Westerdahl et al. 1999) using either a blue petrel MHC class I or a MHC class II B probe. The probes were made from purified PCR products of randomly picked clones transformed with a class I exon 3 fragment (primers A21P2–A23H7) or a class II B exon 2 fragment (primers P114–P115). The PCR products were labelled using Amersham Nick Translation Kit (GE Healthcare, Piscataway, USA) according to the manufacturer’s instructions.

Data analysis

The obtained sequences were checked for homology to known MHC genes deposited in GenBank (Benson et al. 2008) and positive sequences were analysed within each MHC class (I or II B) for similarity, divergence and selection. Peptide-binding regions (PBR) and exon boundaries for MHC class I were inferred from chicken and human (Wallny et al. 2006) and for MHC class II from human (Brown et al. 1993).

Apart from the sequences obtained in this study, MHC class I sequences from other bird species, with full coverage of the 606-bp (650 bp including primers) MHC class I fragments amplified in the blue petrel, were downloaded from GenBank and used in a comparative analysis. The species and sequences included were the greylag goose, Anser anser (GenBank accession numbers, Acc nr AM114924, AY387648–AY387652, DQ251182), the mallard, Anas platyrhyncos, (Acc nr AB115241–115246, AF393511, AY294416–AY294419 and AY841881–AY841884), the red jungle fowl, G. gallus (Acc nr AB159063–AB159076, AB178042–AB178045, AB178047, AB178048, AB178050, AB178051, AB178596, AY234769, AY234770, AY821520, GU451331 and NM_001097530) and the great reed warbler, Acrocephalus arundinaceus (Acc nr AJ005503–AJ005509). The sequences were trimmed to the same length as the blue petrel sequences (606 bp) and will from now on be referred to as the “Aves class I dataset”.

Sequences were aligned with ClustalW2 under an IUB cost matrix (Chenna et al. 2003) and bootstrap consensus trees (2,000 replicates) were constructed with the neighbour-joining method in MEGA4 (Felsenstein 1985; Saitou and Nei 1987; Tamura et al. 2007). Nucleotide distances using the Kimura 2-parameter (Kimura 1980) and amino acid p-distances were estimated in MEGA4 (Tamura et al. 2007).

The number of nonsynonymous substitutions per nonsynonymous site (d N) and synonymous substitutions per synonymous site (d S) were estimated from averaging over all sequence pairs. Analyses were conducted using the Nei-Gojobori method in MEGA4 (Nei and Gojobori 1986; Tamura et al. 2007). Standard error estimates were obtained by a bootstrap procedure (1,000 replicates).

Positively selected amino acid sites were estimated in CODEML, a codon-based likelihood programme in the PAML 4 package (Yang 2007), likelihood analyses were conducted under the following random-sites models: M1a (nearly neutral), M2a (positive selection), M7 (β) and M8 (β+ω), where the two latter models should be robust against recombination (Anisimova et al. 2003). If the selection models M2a or M8 fit significantly better than their neutral counter parts, Bayesian statistics integrated in CODEML were used to identify codon sites under positive selection (Yang 2006).

Results

MHC class I

Gene structure

We obtained eight verified unique 606-bp long MHC class I cDNA sequences (partial exon 2–exon 4) from two blue petrel individuals (four per individual) designated by Haca-UA*06L/22L/25L/27L/33L/34L/35L/42L (L, long; Acc nr JF276877–JF276884) according to established nomenclatural rules (Klein et al. 1990) (Supplementary Table 1). In addition, we found another 41 non-verified sequences (in total 16 versus 25 sequences per individual). The overall structure of the 606-bp Haca-UA(L) sequences was similar to the corresponding genes in four other bird orders (Fig. 1). The closest GenBank homolog (89–90% identity) to the eight blue petrel sequences was the Florida sandhill crane, Grus canadensis pratensis, MHC class I mRNA (Acc nr AF033106) which was also the only described gene that clustered with the blue petrel sequences (Supplementary Fig. 1). For the Aves class I dataset (greylag goose, mallard, red jungle fowl and great reed warbler) the MHC class I sequences all clustered according to species (Supplementary Fig. 1). The same pattern was found also when analysing the sequences exon by exon (2–4).

Fig. 1
figure 1

Amino acid alignment of major histocompatibility complex class I translated sequences (Haca-UA*xxL, GenBank accession numbers, Acc nr JF276877–JF276884) from the blue petrel, H. caerulea, five other bird species (Acc nr and species codes are given: Grca Grus canadensis pratensis, Anan Anser anser, Anpl Anas platyrhynchos, Gaga Gallus gallus, Acar Acrocephalus arundinaceus) and human HLA-A (Hosa_NM002116). Peptide-binding regions were inferred (a pocket a, f pocket f, c other peptide contacts) (Wallny et al. 2006). Positively selected amino acid sites (plus sign) identified with model 8 in CODEML, PAML (Yang 2007). Primer sites, A21P2 (F) and A23H7 (R), in exon 3, are underlined

The screening of 168–174-bp MHC class I exon 3 sequences in 23 individuals, resulted in 45 verified sequences, Haca-UA*01–Haca-UA*45 (Acc nr JF276832–JF276876), and 172 non-verified sequences (Supplementary Table 1). The sequences Haca-UA*30 (JF276868) and Haca-UA*39 (JF276872) contained deletions corresponding to a one and two amino acid deletion, respectively (Supplementary Fig. 2).

Alleles and loci

The maximum number of verified genomic exon 3 Haca-UA sequences in one individual was 15 (one of these sequences contained a stop codon), which means that the blue petrel genome should harbour at least eight MHC class I loci. However, the maximum number of transcribed class I sequences (cDNA) in one individual was only nine (Supplementary Table 1) indicating at least five expressed loci. The Southern blot probed with an MHC class I exon 3 fragment showed 11–16 bands per individual (Supplementary Fig. 3). Assuming that each band approximately corresponds to one allele, the southern blot suggests that there are at least eight class I loci in the blue petrel genome.

Divergence and selection

The amino acid distance between the 606-bp MHC class I sequences differed between exons 2, 3 and 4. The distance was significantly larger in the partial exon 2 sequences compared to exons 3 and 4 of the Haca-UA(L) sequences (Fig. 2). This pattern of difference in amino acid distance was also observed in all birds in the Aves class I dataset with exception for the great reed warbler where there was no difference in amino acid distance between exons 2 and 3 (t test, t = 0.1067, p = 0.9168; Fig. 2). The amino acid distance between the Haca-UA(L) sequences was not different from that of any of the other species in exons 2 and 3 but in exon 4 the distance was smaller between the blue petrel sequences than it was between the mallard sequences (t test, t = 2.6101, p = 0.0164; Fig. 2).

Fig. 2
figure 2

Estimates of average evolutionary divergence (amino acid distance) within different exons of major histocompatibility complex class I sequences between five bird species: Haca Halobaena caerulea, blue petrel (Acc nr JF276877–JF276884); Anan Anser anser, greylag goose (Acc nr AM114924, AY387648–AY387652 and DQ251182); Anpl Anas platyrhynchos, mallard (Acc nr AB115241–115246, AF393511, AY294416–AY294419 and AY841881–AY841884); Gaga Gallus gallus, red junglefowl (Acc nr AB159063–AB159076, AB178042–AB178045, AB178047, AB178048, AB178050, AB178051, AB178596, AY234769, AY234770, AY821520, GU451331 and NM_001097530) and Acar Acrocephalus arundinaceus, great reed warbler (Acc nr AJ005503–AJ005509). Sequence information from the middle of exon 2 to the end of exon 4. Analyses were conducted in MEGA4 (Tamura et al. 2007) and standard errors were obtained by bootstrapping (1,000 replicates)

The nucleotide and amino acid distances in the PBR of the 606-bp MHC class I sequences did not differ between the five compared species (the blue petrel and the Aves class I dataset; Table 2). The Haca-UA(L) sequences were less divergent than the greylag goose and the mallard in the non-PBR of exons 2 and 3, as well as exon 4 (Table 2), but not than the red jungle fowl and the great reed warbler.

Table 2 Evolutionary divergence among major histocompatibility complex class I sequences in the blue petrel, Halobaena caerulea, this study, compared to four other bird species

The overall nucleotide distance between the exon 3 Haca-UA*01–Haca-UA*45 sequences (Acc nr JF276832–JF276876) was found to be 0.076 ± 0.012 (SE) and nucleotide diversity showed higher values in the PBR than in the non-PBR (Table 3).

Table 3 Codon-based evolutionary divergence and nucleotide diversity of major histocompatiblility complex class I exon 3 (UA) and class II B exon 2 (DAB) sequences in the blue petrel, Halobaena caerulea

The nonsynonymous to the synonymous nucleotide substitutions (d N/d S ratio) in the Haca-UA(L) sequences was above one in the PBR of both exons 2 and 3 which indicates that the PBR is under positive selection. For the non-PBR of exons 2 and 3 the d N/d S ratio was lower than one indicating that purifying selection is active in this region (Fig. 3). The estimation of selection on Haca-UA(L) conformed well with the Aves dataset (Fig. 3). Signatures of purifying selection were found for the non-PBR of exon 3 sequences from the screening, whereas the d N was not significantly different from d S in the PBR, although the d N/d S ratio was above one (Table 3).

Fig. 3
figure 3

Average codon-based evolutionary divergence in major histocompatibility complex class I sequences within five bird species: Haca Halobaena caerulea, blue petrel (Acc nr JF276877–JF276884); Anan Anser anser, greylag goose (Acc nr AM114924, AY387648–AY387652 and DQ251182); Anpl Anas platyrhynchos, mallard (Acc nr AB115241–115246, AF393511, AY294416–AY294419 and AY841881–AY841884); Gaga Gallus gallus, red junglefowl (Acc nr AB159063–AB159076, AB178042–AB178045, AB178047, AB178048, AB178050, AB178051, AB178596, AY234769, AY234770, AY821520, GU451331 and NM_001097530) and Acar Acrocephalus arundinaceus, great reed warbler (Acc nr AJ005503–AJ005509). Nonsynonymous (d N) to synonymous (d S) substitution rates. Rates above the broken line signifies sequences under positive selection pressure. Sequence information from the middle of exon 2 to the end of exon 3. Peptide-binding regions (PBR) were inferred (Wallny et al. 2006). Analyses were conducted using the Nei-Gojobori method in MEGA4 (Nei and Gojobori 1986; Tamura et al. 2007)

We also used a codon-based maximum likelihood approach to study positive selection and then found 12 positively selected amino acid sites in exons 2 and 3 for the eight 606-bp Haca-UA(L) sequences with the CODEML programme in the PAML package (Yang 2007) (Table 4; Fig. 1). In the 45 exon 3 Haca-UA sequences, seven positively selected sites were detected with the same method, out of these, four sites were novel compared to the sites identified in the 606-bp sequences (Table 4; Supplementary Fig. 2).

Table 4 Positively selected sites in major histocompatibility complex class I and II B genes of the blue petrel, Halobaena caerulea, found implementing different selection models in CODEML, the PAML package (Yang 2007)

MHC class II B

Gene structure

Two 702-bp (749 bp including primers) MHC class II B cDNA sequences, Haca-DAB*01L and Haca-DAB*06L (partial exon 1–partial exon 4, Acc nr JF276893–JF276894), were verified from two blue petrel individuals. In addition, 32 non-verified sequences with 92–99% nucleotide identity to the verified sequences were found (in total 20 and 12 sequences per individual). The Haca-DAB(L) sequences in the blue petrel (Fig. 4) were very similar (92–93% identity) to the coding sequence of Pabe-DAB1 (Acc nr FJ588549) in the Procellariiform thin-billed prion, P. belcheri, that also served as a template for primer design.

Fig. 4
figure 4

Amino acid alignment of major histocompatibility complex class II B translated sequences (Haca-DAB*xxL, Acc nr JF276893–JF276894) from the blue petrel, H. caerulea, four other bird species (Acc nr and species codes are given: Pabe Pachyptila belcheri, Anpl Anas platyrhynchos, Ancy Anser cygnoides, Gaga Gallus gallus) and human HLA-DRB1 (Hosa_AM493435). Peptide-binding regions inferred from Brown et al. (1993) are shown (c peptide-binding site, p polymorphic peptide-binding site). Primer sites, P141 (F) and P144 (R), in exon 2, are underlined

Eight verified blue petrel 159-bp MHC class II B alleles exon 2 sequences, Haca-DAB*01–Haca-DAB*08 (Acc nr JF276885–JF276892), and another 29 non-verified sequences were obtained from seven birds. The exon 2 Haca-DAB sequences intermixed with the corresponding sequences from the thin-billed prion (Supplementary Fig. 4) and formed a well-supported cluster indicating trans-species evolution of MHC class II B polymorphism between the two (Fig. 5; neighbour-joining of nucleotide sequences, bootstrap value 100%).

Fig. 5
figure 5

Phylogenetic tree of major histocompatibility complex class II B exon 2 (159 bp) nucleotide sequences in the blue petrel, H. caerulea (filled circles Haca-DAB*01–Haca-DAB*08, Acc nr JF276885–JF276892) and the thin-billed prion, P. belcheri (Pabe, Acc nr are given). Two sequences from pheasant, Phasianus colchicus (Phco, Acc nr are given) were used as outgroup. Neighbour-joining bootstrap consensus tree from 2,000 replicates (Felsenstein 1985; Saitou and Nei 1987) constructed in MEGA4 (Tamura et al. 2007). The evolutionary distances were computed using the Kimura 2-parameter method (Kimura 1980)

Alleles and loci

For MHC class II B there were considerably fewer alleles found per individual compared to MHC class I. There were at the most three different verified exon 2 Haca-DAB sequences in one individual (at least two MHC class II B loci). All of them were found in cDNA and were hence expressed. The MHC class II B blot showed three to four bands per individual that hybridized with the exon 2 probe (Supplementary Fig. 3), which suggests that there are at least two class II B loci in the blue petrel genome.

Divergence and selection

The nucleotide distance between the eight exon 2 Haca-DAB*01–Haca-DAB*08 sequences (Acc nr JF276885–JF276892) was estimated to 0.108 ± 0.017 which is slightly, though not significantly, higher than the distance between the class I exon 3 sequences (t test, t = 1.0850, p = 0.2830).

There was support for strong positive selection in the PBR of the exon 2 Haca-DAB sequences (d N/d S ratio = 4.09; Table 3). In the codon-based maximum likelihood analysis, we detected as many as 12 positively selected amino acid sites identified in the exon 2 Haca-DAB sequences with the CODEML-programme in the PAML package (Yang 2007). All of the positively selected sites are within the postulated PBR in the corresponding human HLA-DR1 (Brown et al. 1993).

Discussion

We have characterized the major histocompatibility complex class I and class II B gene structure in a subantarctic Procellariiform seabird, the blue petrel, H. caerulea. We found evidence for at least eight MHC class I loci but only two class II B loci in the blue petrel genome. The high number of class I loci is more similar to the numerous MHC class I loci found in songbirds (Bollmer et al. 2010; Westerdahl et al. 2000, 2004) than to the “minimal essential” MHC found in chicken (Kaufman et al. 1999, 1995). In respects like gene structure, divergence and differential selection pressures in PBR versus non-PBR, the blue petrel MHC conforms well to other bird MHCs.

MHC class I

The 606-bp Haca-UA(L) cDNA sequences had one close GenBank homolog, the Florida sandhill crane MHC class I mRNA (Acc nr AF033106). The blue petrel and the crane belong to Procellariiformes and Gruiformes, orders that separated at least 45 MYA (Sibley and Ahlquist 1990). Neither of the other available long bird MHC class I sequences clustered with the blue petrel sequences but rather by species. The separation between Procellariiformes and Passeriformes is estimated to at least 50 MY and the distance to Galliformes and Anseriformes is at least 64 MY (Sibley and Ahlquist 1990), hence a larger distance than between Procellariiformes and Gruiformes. Divergence times above 45 MY thus reflect species trees rather than gene-trees for the MHC class I among these avian orders. The clustering of sequences exon by exon (2–4) did not change the phylogeny, i.e. there was no evidence for local conservation within the gene structures between species either.

The short distance between the Haca-UA sequences combined with the estimation of loci supports the idea that recent gene duplications and/or gene conversion are common in the avian MHC (Hess and Edwards 2002). On average with few exceptions the blue petrel MHC class I sequences showed the same degree of divergence as their counterparts in other bird species, both regarding the PBR and the non-PBR of the genes.

Curiously, the amino acid distance between the 606-bp MHC class I sequences was significantly larger in the partial exon 2 information (second half of the exon) compared to exons 3 and 4 in all examined bird species except for the great reed warbler. The pattern of a more variable exon 2 compared to exon 3 is also found in the African clawed frog, Xenopus laevis, (Bos and Waldman 2006) which gives some support to this being the ancestral state.

The 3 and 6 bp deletions that were found in the exon 3 sequences Haca-UA*30 (JF276868) and Haca-UA*39 (JF276872) are to date not found at the same positions in other birds, whereas 3 and 6 bp deletions have been reported in transcribed MHC class I exon 3 sequences from the house sparrow Passer domesticus (Bonneaud et al. 2004), Since both Haca-UA*30 and Haca-UA*39 were transcribed and the position of the deletions are within the PBR, the deletions most likely have functional effects in terms of what peptides can bind to the antigen-binding pocket in the encoded MHC molecule. In cattle, Bos taurus, an amino acid deletion detected in the PBR of a MHC gene was predicted to alter the peptide specificity and was shown to be associated with a higher immune response to a vaccine (Sitte et al. 2002).

MHC class II B

The MHC class II B genes in the blue petrel seems to have undergone fewer duplication events than the class I genes since there were considerably fewer alleles found, which was supported also by the southern blot probed with MHC class II B. Perhaps the bacterial diversity in the Antarctics is too low to maintain many class II B loci. Alternatively, there may be loci that our primer sets and probe did not pick up. Compared to what has been found in other birds, especially passerines, the number of class II B loci in the blue petrel was low. However, there are some examples of birds with only one MHC class II B locus found (see comparison in Bollmer et al. 2010). The zebra finch, Taeniopygia guttata, is the complete opposite of the blue petrel considering the MHC loci number, the zebra finch has many MHC class II loci and few class I loci (Balakrishnan et al. 2010). Recently, five MHC class II B loci was found in the little spotted Kiwi, Apteryx owenii, showing that also a species of a basal avian superorder (Paleognathae) can harbour multiple MHC loci (Miller et al. 2011).

The blue petrel and all the prions form two sister clades in a well-supported phylogenetic node where the split probably occurred 24–29.5 MYA (Nunn and Stanley 1998; Penhallurick and Wink 2004). The similarity of the 702-bp Haca-DAB(L) sequences in the blue petrel to the coding sequence of Pabe-DAB1 (Acc nr FJ588549) in the thin-billed prion, P. belcheri, is thus expected phylogenetically even if the divergence time is long. Exon 2 sequences that encode the PBR, which is the most variable part of the gene, from the two sister species clustered together in a phylogenetic tree instead of by species. Hence, there was evidence for trans-species polymorphism between the two species that could either result from their phylogenetic relationship and/or as a result of convergent evolution due to similar selection pressure from pathogens. Trans-species evolution of MHC polymorphism between avian species was reported e.g. among different penguin species, genus Spheniscus, (Kikkawa et al. 2009) and between great reed warbler and Seychelles warbler, Acrocephalus sechellensis (Richardson and Westerdahl 2003).

Signatures of selection in MHC class I and II B

Positive selection acting on the PBR has been found in almost all non-model studies through estimation of nonsynonymous to synonymous nucleotide substitutions (reviewed in Bernatchez and Landry 2003). In the blue petrel we compare sequences in some cases originating from different loci since it was not possible to assign alleles to loci, which is a well-known problem when dealing with the polymorphic MHC region in loci-rich non-model bird species (Westerdahl et al. 2000). We found strong differences in selection pressures between the postulated PBR (positive selection) and non-PBR (negative selection) both for class I and class II B genes, in the range of other avian MHC genes.

Most of the positively selected sites identified in the blue petrel MHC genes are known to be involved in antigen peptide binding in human and/or chicken (Brown et al. 1993; Wallny et al. 2006) which indicates that (1) the structure and function is similar to the MHC in well-known model systems and (2) the genes most likely are transcribed and translated into functional MHC molecules.

MHC and ecology of the blue petrel

Since pathogens presumably are rare among birds in Antarctica (Barbosa and Palacios 2009) the maintenance of so many class I loci in the blue petrel genome may be difficult to explain as an effect of pathogen-mediated selection, which in other systems probably is the main factor keeping MHC polymorphism (Bernatchez and Landry 2003; Jeffery and Bangham 2000). On the other hand, these birds are long-lived so they might encounter quite a few pathogens in a lifetime, even with low environmental pathogen levels. Moreover, the blue petrel is a colonial nester that may reuse underground nests (Bonadonna and Mardon 2010). These factors are associated with increased probability of horizontal pathogen transmission and higher immune responses in other bird species (Møller and Erritzøe 1996; Møller et al. 2003), meaning that pathogen-mediated selection may be stronger for the blue petrel than generally expected in cold climates.

Given the ecology of the blue petrel, where individuals return to their nestling areas to breed with only one partner for life (Brooke 2004), a system for kin recognition and inbreeding avoidance should be important (Penn 2002; Penn and Potts 1999; Zelano and Edwards 2002). If the MHC fills that function in the blue petrel, this may further explain the maintenance of many MHC class I loci. Moreover, the well-developed olfaction as well as the avoidance of self-odour (Mardon and Bonadonna 2009) in the blue petrel, suggests that olfactory MHC discrimination is possible in this species (Zelano and Edwards 2002). Mates with dissimilar MHC alleles could be chosen, thereby kin is avoided and MHC polymorphism is propagated to the next generation (Brown 1997; Jordan and Bruford 1998; Mays and Hill 2004; Tregenza and Wedell 2000). There is some evidence for a MHC-based mate choice in birds (Bonneaud et al. 2006; Freeman-Gallant et al. 2003; Gillingham et al. 2009; Richardson et al. 2005) but also reports on no correlation between MHC and bird mate choice (Ekblom et al. 2004; Westerdahl 2004).

Conclusion

Blue petrels have surprisingly many MHC class I loci, whereas they have fewer MHC class II B loci. Different evolutionary histories for MHC class I and II B with more recent gene duplications or gene conversions for class I than class II B may explain this pattern. Signatures of positive selection in the PBR were detected for both classes of MHC genes. Given the individual odour profiles and olfactory recognition of partner in the blue petrel, it is tempting to assume that the described allele richness serves as a basis for mate choice in this species and that the polymorphism thereby is maintained.