Introduction

The Western honey bee, Apis mellifera L., is native to the Old World where it has five major evolutionary lineages: the A lineage of Africa, the C and M lineages of Europe, and the Y and O lineages of North Eastern Africa, parts of the Middle East and Asia (Alqarni et al. 2011; Franck et al. 2001; Ruttner 1988; Whitfield et al. 2006). These lineages are delineated geographically, morphologically, and genetically and they include approximately 24 subspecies (Arias and Sheppard 1996; Franck et al. 2000; Garnery et al. 1992, 1993; Palmer et al. 2000; Ruttner 1988; Wallberg et al. 2014; Whitfield et al. 2006). The current honey bee populations of North America are the result of centuries of importation, chiefly from the two European lineages (C and M). Canada’s A. mellifera populations originated from importations beginning with European settlers. The first introductions were likely from the M lineage (e.g. A. m. mellifera) (Cornuet 1986; Root 1985; Seeley 1985) likely followed by subsequent introductions from the C lineage (e.g. A. m. ligustica and A. m. carnica) and some minor introductions from the O lineage (e.g. A. m. caucasica) (Sheppard 1989a, b) and A lineage (e.g. A. m. intermissa and A. m. lamarckii) (Pinto et al. 2007; Root 1985; Sheppard 1989a, b). Each new introduction of a lineage or subspecies into Canada was usually an effort by beekeepers to introduce “favourable” traits, akin to Brother Adam’s work in the United Kingdom to breed what he thought was the ideal honey bee for beekeeping (Adam 1983; Root 1985). Brother Adam’s own “Buckfast Bee” is a mix of several subspecies from each lineage and is still bred by a small number of Canadian beekeepers today.

Remarkably, there has been no large-scale investigation of the genetic ancestry of Canadian honey bees despite a history older than the country itself (Crane 1999). We undertook a study of the genetics of Canadian honey bees using a citizen science approach to characterize their genetic ancestry and to study how geography and management practices influence their genetics. We also used the population genetic dataset to test the hypothesis that beekeepers in Northern Canada maintain honey bees more related to the Northern European subspecies A. m. mellifera (M lineage). A similar pattern has been noted in Australia, where colonies in colder regions of Tasmania have higher proportions of A. m. mellifera ancestry relative to colonies in warmer regions (Oldroyd et al. 1995), supporting the hypothesis that A. m. mellifera is locally adapted for colder climes than the more Mediterranean C lineage (Le Conte and Navajas 2008; Ruttner 1988). We would, therefore, expect that M lineage bees would be more abundant in Northern Canada.

Finally, we tested the utility of the SNP panel for discriminating between Canadian honey bees and Africanized honey bees from the United States and Brazil. Africanized honey bees can be highly aggressive and are continuously distributed from central South America to the Southern United States (Collet et al. 2006; Rinderer et al. 1991; Sheppard et al. 1991; Szalanski and Magnus 2010). Africanized honey bees are the result of an introduction of the African lineage subspecies A. m. scutellata into Brazil in 1956 (Kerr 1967). Controlled crosses of Brazilian commercial honey bees with imported A. m. scutellata were performed to test if the resulting hybrid colonies would be better suited for Brazilian beekeeping. Unfortunately, the resulting “Africanized” colonies are often highly defensive (Breed et al. 2004; Collins et al. 1982; but see: Galindo-Cardona et al. 2013), swarm frequently, and typically abscond in response to adverse conditions (Winston 1992). Current tests for detecting Africanized honey bees based on mtDNA and wing morphometrics are not reliable: mtDNA tests miss cases of paternal Africanization (Sheppard and Smith 2000) and morphological tests are unable to detect low to medium levels of Africanization (Guzman-Novoa et al. 1994). Canadian beekeepers import hundreds of thousands of queens from the United States annually and the chance of accidental importation of Africanized honey bees is rated as moderate to high by the Canadian government (AHRA 2013). We had previously shown that an ancestrally informative SNP panel was very successful at identifying Africanized bees in commercial honey bee populations from the United States and Australia (Chapman et al. 2015b), and we wanted to examine if the same panel will be suitable for use in Canada.

Methods

Citizen science project and population sampling

From July 2013 to June 2014, we solicited beekeepers across Canada to voluntarily take part in a honey bee genotyping study. Solicitations were made through social media, our personal websites, telephone, and announcements at beekeeping meetings. Beekeepers indicated their willingness to join our study by filling in an online form. This information was used to send each beekeeper a pamphlet containing sampling instructions, a small survey, sampling tubes, and a return envelope (Supplemental Files 1 and 2). Beekeepers were instructed to sample two workers (diploid females) per colony, from up to six colonies in their operation. We asked beekeepers to identify the location of their colonies, the number of colonies they manage, and the location of their queen breeder. In total, 145 beekeepers from 9 provinces and 1 territory submitted a total of 857 sampling tubes (Fig. 1a; Supplemental Datasets 1, 2).

Fig. 1
figure 1

a Map of sampling locations (red dots) and b average proportion ancestry in each province with province code (yellow C; black M, red A). Ancestry of Canadian honey bees to major honey bee lineages. The first 29 solid bars are known reference samples of C, M, and A lineage bees. All bars following the white gap represent 855 Canadian honey bee samples. c Proportion of ancestry derived from each major lineage within each sampled Canadian province. We found small but significant differences in the proportions of C (P = 1.9 × 10−8) and M ancestry (P = 3.7 × 10−7) among provinces, but did not detect differences in the level of A ancestry (P = 0.091). High C (low M) ancestry is more common in the Prairie Provinces (Alberta, Saskatchewan and Manitoba) than in the Western Provinces and Territories (Yukon and British Columbia). Quebec and Ontario, and Maritime Provinces (Newfoundland, New Brunswick, and Nova Scotia), which had significantly lower C ancestry (color figure online)

DNA extraction and SNP genotyping

We extracted DNA from a single diploid worker from each sampling tube returned to us (N = 857 individual bees). High molecular weight DNA was extracted with phenol–chloroform isoamyl alcohol (25:24:1) from half of a bee’s thorax. Each sample was then purified using EMD Multiscreen Millipore purification (Merck) and genotyped using the Sequenom MassARRAY MALDI-TOF (Agena) system in four multiplexes at Génome Québec's Innovation Centre.

The SNP panel was created to differentiate between each of the three major lineages thought to be most abundant in North American honey bee populations: C, M, and A (Chapman et al. 2015a, b). Briefly, SNPs were randomly chosen from a set of more than 20,000 with high genetic differentiation as measured by pairwise F st (Weir and Cockerham 1984) between each population group from a previous full-genome re-sequencing study (Harpur et al. 2014) and conditioned on being at least 5000 bp apart. We included an additional 19 SNPs from a previous study that also showed high genetic differentiation between European and African honey bees (Table S3 in Whitfield et al. 2006). The final panel of 144 SNPs was chosen based on its ability to be multiplexed in an inexpensive SNP genotyping platform using the Sequenom Assay Design Suite (v1.0 Sequenom, CA, USA). In the final panel, all SNPs were separated by at least 45.9 kb (average 1734.8 kb) and were effectively unlinked because of the honey bee’s very high recombination rate (19 cM/Mb; Beye et al. 2006).

Population admixture analyses

To estimate each sample’s ancestry, we used STRUCTURE v2.3.4 (Pritchard et al. 2000) using all polymorphisms with minimum allele frequency >0.05 and only for markers with a call rate >0.66 (N = 91 markers of 144 on the panel met these criteria). Similarly, we only included samples that could be successfully genotyped at 66 % of all markers (N = 855 samples). We evaluated population structure using a burn-in phase of 50,000 iterations followed by 100,000 Markov-Chain Monte Carlo iterations with admixture assumed and uncorrelated allele frequencies. We included in each STRUCTURE run a set of 29 reference bees, known to be of pure descent from each of the three major lineages (African: A, Western and Northern Europe: M, and Eastern and Southern Europe: C). The reference bees were used in three previous population genetic analyses performed by our group (Harpur et al. 2012, 2014; Harpur and Zayed 2013); their genotype at each of the 91 SNPs was extracted from their full genome sequences. To reduce the influence of the large query population compared against a smaller reference population and to increase processing speed by parallelizing runs, we divided the dataset into ten smaller datasets consisting of the reference population and 85 randomly selected samples. No a priori information was provided regarding population identity or location. We performed ten replicates for each of K = 1 to 4 populations. We used STRUCTURE HARVESTER v0.6.94 (Earl and Vonholdt 2012) to estimate the most appropriate fit of K and to implement Evanno’s method for estimating ΔK (Evanno et al. 2005). For each sample, we then identified the genomic contribution of each ancestral lineage (e.g. 70 % C, 20 % M, and 10 % A) and the level of admixture (1 − maximum ancestry; e.g. if a bee is 70 % C, 20 % M, and 10 % A, then admixture = 1 − 0.7). Finally, we used GENEPOP v4.0.11 (Raymond and Rousset 1995) to report F st statistics among provinces and countries and tested if pairwise F st was significant using ARLEQUIN v3.5.12 with a False Discovery Rate at α = 0.05 (Excoffier and Lischer 2010).

Accuracy of a SNP panel as a diagnostic test for Canada

To investigate the utility of our SNP panel to detect Africanized honey bees among Canadian imports, we determined how well our panel is able to accurately classify Africanized honey bees as being African, and Canadian honey bees as European based on their genotypes. We replicated a two-stage procedure used previously to identify a cut-off at which the proportion of African ancestry is indicative of an Africanized honey bee (Chapman et al. 2015a, b).

We first estimated the True Positive rate by identifying at what minimum proportion of African ancestry (5–60 % in 5 % increments) would samples known to be Africanized (true Africanized bees) correctly be identified as such. True Africanized samples were obtained from populations in Brazil (N = 55) and the United States (N = 86) (Chapman et al. 2015a, b). We also included A. m. capensis clonal lineage (N = 3), A. m. capensis (N = 104) and A. m. scutellata × capensis hybrids (N = 17), and 128 A. m. scutellata as samples that should be correctly identified as African. At each cut-off we determined the proportion of African/Africanized samples correctly identified as African.

We next estimated the False Positive rate by repeating the above analysis to identify at what maximum proportion of African ancestry true non-Africanized bees would be incorrectly classified as African. Our true non-Africanized samples were represented by the reference C and M populations (see above) as well as commercial and feral Australian samples (N = 93) and commercial non-Africanized samples from Canada (N = 10) and the United States (N = 55). All reference samples were previously genotyped in an Australian study at 95/144 of the SNPs available on the panel (those having minimum allele frequency >0.05 and call rate >0.66) (Chapman et al. 2015a, b). Of the 95 SNPs in this previous study, 81 were shared with the current study (81/91 markers from N = 855 bees with minimum allele frequency >0.05 and call rate >0.66). Therefore, for all between-country analyses, including identifying cut-offs (above), we took only the genotypes of our Canadian samples at these 81 SNPs common between the two studies SNPs.

Statistical analyses

All statistical analyses were performed in R v3.2.0 (R Core Team 2010). For geographic analyses, we binned statistics into 0.5° latitudinal and longitudinal bins. We identified trends across provinces both between provinces and as groups. We grouped Prairie Provinces (Alberta, Saskatchewan and Manitoba) and compared admixture levels among bees from Western Provinces and Territories (Yukon and British Columbia), Ontario and Quebec, and the Maritimes (Newfoundland, New Brunswick, and Nova Scotia). When performing multiple family wise statistical tests, we corrected for False-Discovery Rate using the Benjamini and Hochberg method (1995) at α = 0.05. Our datasets are available on GitHub (https://github.com/harpur/CanadAdmix) and as Supplemental Material (Supplemental Dataset 1–2).

Results

Sampling overview

We sent a total of 1633 individual sampling tubes across Canada and received back 857, a 52.4 % return rate. Most samples came from British Columbia (N = 243) and Ontario (N = 199; Fig. 1a; Supplemental Dataset 2). From each returned sampling tube, we genotyped a single (diploid) worker honey bee. Only two workers could not be successfully genotyped at 66 % of all markers, so all population genetic and ancestry analyses were performed on 855 Canadian samples. Beekeepers could self-report the origins of their colonies. Of those who did self-report we found that most of the samples were bred in Canada (N = 665). Samples of workers from queens bred outside of Canada originated from the United States (N = 71), New Zealand (N = 27) or Denmark (N = 2). The beekeepers that responded to our study managed between 1 and 10,500 colonies (mean = 400 ± 58.9 SE), indicating that we successfully solicited interest from both hobbyists with a few colonies and commercial beekeepers with hundreds to thousands. We asked beekeepers to self-identify the subspecies or race of their bees. We received this information for 574 out of 855 colonies sampled and genotyped in this study. The largest proportions of beekeepers identified their bees as “Italian” (30.2 %) or “Mixed” (13.7 %) (Supplemental Dataset 1).

Admixture of Canadian honey bees

Analyses using STRUCTURE significantly supported models with K = 3 ancestral populations (A, M and C) both with the lowest average Ln[P(D)] = −1436.21 method and by using Evanno’s method to calculate ΔK (Supplemental Figure 1; Fig. 1b). Canadian bees were not classified as a distinct population, but instead a mix of the three ancestral lineages (Fig. 1b). Canadian samples had, on average, a large proportion of their ancestry originating from the C group (mean 74.2 %), with the remainder consisting of M group (19.6 %) and A group (6.2 %; Fig. 1b, c). As a result, differences in admixture between Canadian honey bee populations were driven by the level of M and A ancestry: increasing M and/or A ancestry lead to increased admixture (Spearman Correlation; M: ρ = 0.56; P < 2.2 × 10−16; A: ρ = 0.51; P < 2.2 × 10−16). We found small but significant differences in the level of admixture between provinces (Supplemental Figure 2; ANOVA; F 9,845 = 6.167; P = 1.9 × 10−8). These differences tended to be between Prairie Provinces (Alberta, Saskatchewan and Manitoba) and others (Supplemental Figure 2). We confirmed this trend by pooling the bees from the Prairie Provinces and comparing their admixture to bees from Western Provinces and Territories (Yukon and British Columbia), Ontario and Quebec, and the Maritimes (Newfoundland, New Brunswick, and Nova Scotia). From this comparison, we found populations in the Prairie Provinces had lower levels of admixture when compared to populations in each of Canada’s other major geographic regions (ANOVA; F 3,851 = 8.424; P = 1.6 × 10−5; Tukey’s HSD P < 0.0035; Supplemental Figure 2).

Provinces also differed in their patterns of ancestry. We found small but significant differences in the average mean proportion (per sample) of C (Fig. 1c; ANOVA; F 9,845 = 6.167; P = 1.9 × 10−8) and M ancestry (ANOVA; F 9,845 = 5.36; P = 3.7 × 10−7) among provinces, but did not detect differences in the level of A ancestry (P = 0.091). High C (low M) ancestry is more common in the Prairie Provinces than in the Western Provinces, Quebec and Ontario, and Maritime Provinces, which had significantly lower C ancestry (Supplemental Figure 3; ANOVA; F 3,851 = 8.424; P = 1.6 × 10−5) and a trend towards higher levels of M ancestry (ANOVA; F 3,851 = 2.81; P = 0.0382; Tukey’s HSD P > 0.052). Finally, Canadian provinces had very low levels of differentiation at the loci examined (mean F st = 0.0078; Table 1).

Table 1 Pairwise fixation index (F st) between each Canadian province in this study

We found no significant evidence that samples from any self-identified subspecies or group has more A lineage ancestry than any other; however, we tested if Buckfast bees, known to have been crossed with A lineage subspecies, had higher A ancestry than other self-identified subspecies or groups. We found that Buckfast bees (A = 8.1 %; N = 33) tended to have higher levels of A ancestry than non-Buckfast bees (A = 6.0 %; one-tailed t test; P = 0.06) and the sample with the highest proportion of A ancestry within Canada (A = 30.1 %) is of Buckfast origin.

Distributions of honey bee lineages across Canada: local adaptation or management practices?

We predicted that Northern Canada could favour genotypes derived from honey bee subspecies accustomed to similar environments, such as the M group subspecies (Le Conte and Navajas 2008; Ruttner 1988). To test this hypothesis, we investigated associations between ancestry (C, M, or A) and geographic location (Fig. 2). Following corrections for False Discovery Rate (Benjamini and Hochberg 1995), we found a significant negative correlation between M ancestry and latitude (P = 0.008; r = −0.48) and a positive relationship between C ancestry and latitude (P = 0.0046; r = 0.51) indicating that colonies in Northern Canada tended to have higher proportions of C lineage (Fig. 2). There was no trend for A ancestry (P = 0.45). In addition, there was a negative correlation between admixture and latitude (P = 0.0066; r = −0.14). We found that there was a significant positive correlation between M ancestry and longitude (P = 0.014; r = 0.29), a negative relation between C ancestry and longitude (P = 0.0006; r = −0.4), and a trend for more A ancestry in Eastern Canada (P = 0.052; r = 0.23; Fig. 2).

Fig. 2
figure 2

Relationships between latitude, longitude and percent ancestry (percentage C, M and A). Latitude is negatively correlated with M ancestry and is positively correlated with C ancestry, but was not significantly correlated to A ancestry. Longitude positively correlated with M ancestry, negatively with C ancestry but was not significantly correlated with A ancestry

It may be that the relationship between M lineage ancestry and geography are not a result of local adaptation but by differences in beekeeping practices. For example, small-scale beekeepers may prefer different subspecies of honey bee than commercial beekeepers or there may be regional importation differences. On the former, we found no relationship between the number of colonies managed by a beekeeper and the levels of C, M nor A ancestry of his/her samples (Spearman’s rank correlation, P > 0.38 for all comparisons), nor level of admixture of his/her colonies and the number of colonies managed (P = 0.46).

We did, however, find that regional importation practices influenced ancestry. Beekeepers at latitudes >50° self-reported purchasing more queens outside of Canada than beekeepers at lower latitudes (<50°; 22 vs 14 %; Fisher exact test; P = 0.039). Western beekeepers (longitude <−100°) also reported importing more queens than Eastern beekeepers (>−100°; 17.1 vs 9.4 %; P = 0.018). We found that imported colonies (colonies reported to have been bought outside of Canada) had significantly more C ancestry (ANOVA; F 1,763 = 18.21; P = 2.2 × 10−5) and significantly less M (F 1,763 = 5.096; P = 0.024), and A ancestry (F 1,763 = 5.82; P = 0.0218; Fig. 3) than Canadian-bred and purchased colonies.

Fig. 3
figure 3

Ancestry and country-of-origin of Canadian honey bee stocks. Canadian-bred colonies had significantly less C ancestry relative to internationally purchased colonies

Admixture on a global scale

We compared our dataset of Canadian honey bee ancestries to those of commercial honey bee populations in Australia and non-Africanized populations in the United States that were genotyped using the same SNP panel (Chapman et al. 2015a). We found significant differences in the levels of admixture between countries (ANOVA; F 2,1000 = 33.1; P < 1.2 × 10−14), with Canadian samples (mean = 25 %) having similar levels of admixture as Australian samples (Tukey’s HSD; P = 0.054; mean = 31 %) and both having higher admixture than United States commercial samples (Tukey’s HSD; P < 0.00001; mean = 23 %). We found no differences in the level of African ancestry of commercial colonies between these countries (P = 0.297), but we did find significant differences in the level of M ancestry (F 2,1002 = 96.95; P < 2.2 × 10−16) with significantly higher levels in commercial Australia (mean = 30.5 %) relative to both Canada (19.3 %) and the United States (Tukey’s HSD; P < 0.0001; 18.0 %;). An inverse trend was found for C ancestry: Canada (74.2 %) and the United States (76.6 %) had more C ancestry than Australia (Tukey’s HSD; P < 0.0001; 64.1 %). Even with these differences, we found low levels of differentiation between countries: average F st between countries was 0.04 (Table 2).

Table 2 Pairwise fixation index (F st) between Canadian, Australian and United States honey bee colonies

Accuracy of the SNP based Africanization test in Canada

Africanized bees have higher levels of African ancestry (over 50 %), compared to non-Africanized bees (less than 25 %) allowing us to distinguish potentially Africanized samples using a predetermined threshold of African ancestry (Chapman et al. 2015a, b). We re-evaluated this cut-off in light of the ancestry of Canadian honey bee populations (Fig. 4a). When we used a minimum cut-off threshold of 15 % African ancestry or greater, we obtained a True Positive rate of 1 and all true Africanized samples were correctly identified as such. When this same threshold was applied to true non-Africanized commercial stocks we obtained a False Positive rate of 0.05 (95 % of true non-Africanized commercial stocks were classified as not African). At a more conservative threshold (25 % A), we obtained a False Positive rate of 0 (100 % of true non-Africanized commercial stocks were classified as not African; Fig. 4b). Therefore, using the more conservative cut-off threshold of 25 %, which has the maximum True Positive rate and minimum False Positive rate, we found that 99.82 % of the 855 Canadian honey bees genotyped herein could be classified as not African, as expected (Fig. 1).

Fig. 4
figure 4

a To identify an optimal true positive rate, we estimated the proportion of African ancestry at which true-Africanized (N = 393) bees collected from source populations in Africa, Brazil and the United States would be correctly identified as Africanized. b To identify an optimal false positive rate, we estimated the proportion of African ancestry at which all true-non-Africanized bees (N = 187) would be correctly identified as not Africanized (i.e. not incorrectly identified as African) using 5 % increments of A ancestry

Discussion

Patterns of admixture within Canada

Canada has no native populations of A. mellifera; resident populations are the result of centuries of importation predominately from the C and M lineages of Europe (e.g. A.m. ligustica and A. m. mellifera) (Cornuet 1986; Pinto et al. 2007; Seeley 1985; Sheppard 1989a, b). We have demonstrated here that contemporary Canadian honey bees are largely derived of C lineage subspecies, similar to populations in the United States (Delaney et al. 2009; Pinto et al. 2007; Seeley 1985; Sheppard 1988, 1989a, b) and Australia (Chapman et al. 2015b; Oxley and Oldroyd 2009). This pattern is likely a result of both North American and Australian beekeepers long favouring C lineage bees for their docility and higher honey production (Langstroth and Dadant 1889). Beekeepers have regularly imported and admixed local populations with A. m. ligustica (a practice once called “Italianizing”) to introduce these favourable phenotypes (Jensen et al. 2005; Moritz et al. 2005). The large C lineage component of Canadian honey bees is likely a result of past importation preferences and the use of “Italianized” colonies that continues today.

Previous studies have discovered differences in ancestry between feral and commercial populations (Chapman et al. 2008, 2015a; Delaney et al. 2009; Schiff and Sheppard 1995; Sheppard 1988), with feral bees having higher levels of M ancestry. This pattern is thought to be the result of beekeepers either favouring the use of C lineage bees or selection in feral populations favouring M ancestry (Pinto et al. 2005). We did not include feral populations in our Canada survey. However, we did find that a colony’s location was correlated with its ancestry. North-western Canada had more C ancestry (less M) than South-eastern Canada. This is counter to expectation as northern colonies were expected to be comprised of more northern-derived (i.e. M lineage) ancestry (Oldroyd et al. 1995). We attribute this pattern not to selective differences between parts of the country, but rather to beekeepers in North-western Canada self-reporting that they import more queens (colonies) from international sources that have higher C ancestry than colonies reported to be purchased within Canada.

Commercial honey bee populations have been noted previously for their relatively low levels of differentiation within their introduced ranges (Chapman et al. 2015a, b; Delaney et al. 2009; Harpur et al. 2012). Three factors contribute to this pattern: high gene flow, similar importation histories, and the relatively young age of commercial populations. The Canadian samples included in this study were separated by as many as 4772 km and our international samples much further. Our data suggest that gene flow within Canada is very high; most beekeepers (86.9 %) reported purchasing queens from breeders within Canada rather than rearing their own queens. Inter-population comparisons herein support that commercial colonies have very low levels of differentiation at the loci examined: we found the lowest levels of differentiation between commercial US and Canadian populations (Table 2), two populations that exchange honey bees frequently. Canadian beekeepers import 150,000–200,000 queen bees from the United States each year (Tavares 2014). Commercial populations in Canada, the United States and Australia are also relatively young and originate from similar source populations. North America has only had resident populations of honey bees since the seventeenth century (Sheppard 1989a, b) and much like Australia and the United States (Chapman et al. 2008; Hopkins 1886; Jolly 2004; Koulianos and Crozier 1996, 1997; Oldroyd et al. 1992, 1995; Oxley and Oldroyd 2009; Ruttner 1976; Sheppard 1989a, b), the Canadian populations examined herein were likely first derived from the M lineage and later shifted to C lineage. Because North American and Australian populations are relatively young, drift has less time to alter allele frequencies, and potential differences are flooded by gene flow. Taken together, the young age of these populations, their similar importation histories, and high gene flow have likely contributed to the current low levels of genetic differentiation and patterns of admixture.

Admixture in global commercial populations

While introgression can be detrimental to the conservation of honey bees within their native ranges (De la Rua et al. 2009, 2013; Meixner et al. 2010; Pinto et al. 2014), it is actively sought after in regions without native A. mellifera populations, such as North America (Cobey et al. 2012; Sheppard 2012). It has been well documented that genetic diversity is important to the health of colonies (Jones et al. 2004; Mattila and Seeley 2007; Tarpy 2003), and beekeepers seek novel genotypes resistant to pests (Cobey et al. 2012; Rinderer et al. 2010; Sheppard 2012). Admixture has been shown to increase levels of genetic diversity in honey bees (Harpur et al. 2012, 2013) and beekeepers have been intentionally interbreeding subspecies of honey bee for at least a century in North America (e.g. Root 1985). Using tools such as the SNP panel herein (Chapman et al. 2015b) it can be possible for regulators to target and manage the introduction of novel genetic stock. A corollary is that these SNP panels can also be used to test for introgression of unwanted genetic stock such as C lineage in M lineage bees (Munoz et al. 2015; Pinto et al. 2014) or Africanized bees in North America.

The utility of a SNP based assay for monitoring Canadian imports

The current tests available to distinguish Africanized from non-Africanized colonies prior to importation can be unreliable and as such the risk of importation of Africanized honey bees into Canada has been declared moderate to high by the Canadian Food Inspection Agency (AHRA 2013). The SNP panel used in this study was designed to identify bees with African ancestry regardless of their maternal or paternal backgrounds. Using the frequency of African ancestry, we were able to demonstrate that Africanized honey bees can confidently be detected: we were able to detect true Africanized bees with 100 % accuracy with a False Positive rate less than 5 %.

We found low but pervasive levels of African ancestry in Canadian honey bees. Levels of African ancestry in Canadian bees ranged from 0.1 to 33 %, very similar to levels of African ancestry found in Australia (0.3–32.8 %; Chapman et al. 2015b). To our knowledge, there have been no deliberate introductions of Africanized bees into Canada. We instead propose that the low level of African ancestry in Canadian bees likely resulted from early importations of A lineage subspecies other than A. m. scutellata. Most likely, Canadian beekeepers imported A. m. intermissa (Seeley 1985), A. m. lamarkii (Nielsen et al. 2000), or other North African subspecies. As well, Canadian beekeepers have imported bees admixed with other African lineages, such as the Buckfast bee. Beekeepers in Ontario have maintained Buckfast bees—those developed by Brother Adam—since the 1960s (Otis, Pers. Comm.). Buckfast bees were originally crossed with the A lineage subspecies A. m. saharensis (Adam 1983). The original Buckfast bees brought into Ontario were partially derived from A. m. saharensis and A. m. monticola (Otis, Pers. Comm.). Even with the few Buckfast bees represented in our dataset (N = 33), we found a trend for Buckfast bees having higher A lineage ancestry relative to all other subspecies or groups identified by beekeepers. Although we are unable to differentiate African subspecies with the current version of the SNP panel, the addition of informative alleles for each A lineage subspecies, particularly A. m. scutellata, will enable us to better determine the origins of this pattern in the future.

Conclusions

Our data are the first in-depth assessment of the genetic structure of honey bees in Canada. Honey bees in this country, like most in the world, live predominantly in managed populations and management practices have significantly impacted their genetic structure and admixture, as we have demonstrated here and elsewhere (Harpur et al. 2012, 2013). How these management practices influence feral populations or contribute to the long-term success of managed populations still remain largely unanswered questions. Using our approach as a model and incorporating public engagement and high-throughput genetic data could help to answer these questions as well as provide valuable tools to beekeepers.