Introduction

Social insects represent integrated and adaptive units, the result of selective forces operating at both the individual and colony level (Pinter-Wollman and Hubler 2012). Phenotypic differences between individuals of the same social insect colony can be striking, and in fact are regarded as a feature of insect societies (Jeanson and Weidenmüller 2013). Indeed, inter-individual variability is acknowledged as a necessary condition for one mainstay of eusociality, the division of labor among colony members (Jeanson and Weidenmüller 2013). The division of labor within social insect colonies has been explained by variability in (1) individual, innate, thresholds and (2) experience-based, self-reinforcement processes; two non-mutually exclusive mechanisms (Beshers and Fewell 2001). At the individual level, the role of experience is recognized as central in shaping the individual behavioral phenotype, changing over time via learning processes (Jeanson and Weidenmüller 2013).

Inter-individual differences have been long overlooked in quantitative ecological studies, but have gained new attention during the last decade, notably in the field of “animal personality/behavioral syndrome” research (Violle et al. 2012). A number of studies across the animal kingdom have demonstrated dramatic differences among individual behavioral phenotypes (Bolnick et al. 2003). Since individual phenotypes are the focus of natural selection, any variation at this level has major evolutionary and ecological consequences (Bolnick et al. 2011; Wolf and Weissing 2012).

In biological terms, aggression is a widespread solution by animals to the problems of self-preservation, protection of young and resource competition (Archer 1988). Aggressiveness is a fundamental behavioral and personality trait, varying greatly among individuals (Dingemanse and Wolf 2013; Grinsted et al. 2013). In social hymenopterans, the study of aggressive or defensive behavior has been classically undertaken in the context of nestmate recognition and enemy identification (e.g., Guerrieri and D’Ettorre 2008; Breed et al. 2004). For honey bees and other social Aculeata, one important component of aggressive/defensive behavior is the act of stinging (Paxton et al. 1994).

The stinging response (stinging reflex, sting extension response) to electric stimulation was described in Apis mellifera during the 1980s (Balderrama et al. 1987; Kolmes and Fergusson-Kolmes 1989) and has been used as a bioassay since this time. The sting extension response (SER) was measured via the stimulus intensity (Kolmes and Fergusson-Kolmes 1989; Paxton et al. 1994) or the time necessary to elicit it (Uribe-Rubio et al. 2008). The SER was initially recorded as a binary response (Balderrama et al. 1987, 2002), with some authors refining the scoring system by recording the extent to which the sting was extruded, adding intermediate levels of response corresponding to partial sting extension (Lenoir et al. 2006).

The SER has been quantified to measure honey bees’ level of aggressiveness and to discriminate among different strains (Kolmes and Fergusson-Kolmes 1989; Uribe-Rubio 2013), colonies, worker groups (Paxton et al. 1994; Uribe-Rubio 2013) and individuals from the same hive (Lenoir et al. 2006). Variation in SER was found in relation to honey bees’ age and within-colony social environment (Paxton et al. 1994), spatial segregation in the hive (Uribe-Rubio et al. 2008), task performed (Uribe-Rubio et al. 2008, 2013), and paternity line (Lenoir et al. 2006).

It is important to highlight that, depending on the organism under examination, the same type of behavioral response (e.g., SER) can have a radically different meaning in biological terms. Although honey bees can sting other insects without harming themselves, their sting is strongly barbed and remains embedded in mammalian flesh (Vetter et al. 1999). Given the characteristic sting autotomy, anchoring and the consequent abdominal rupture, honey bees’ stinging behavior can represent an extreme form of suicidal colony defense (Paxton et al. 1994; Shorter and Rueppell 2011). Relatively little is known about the stinging behavior of other eusocial bees (reviewed in Breed et al. 2004) and we are not aware of any studies quantifying the stinging response in wasps.

For wasps, aggression is a common practice, e.g., in foraging activities (Parrish 1984; Raveret Richter 2000), social dynamics and in high-risk situations (Ross and Matthews 1991). The majority of aculeate wasps are solitary and use their stings to paralyze hosts more or less permanently and then carry them to a shelter where an egg is laid on the host (Raveret Richter 2000). For social aculeate wasps, prey are generally killed with the characteristically strong mandibles, rather than stinging (Edwards 1980; Raveret Richter 2000). Nevertheless, several authors (reviewed in Spradbery 1973; Olson 2000) documented cases in which foraging social wasps used their sting when grappling with particularly large and active prey. Vespa orientalis was reported to make regular use of the sting when attacking A. mellifera in Israel (Ishay et al. 1967, cited in Spradbery 1973). This evidence contradicts some statements that social wasps kill their prey using exclusively their mandibles (e.g., Akre and Myhre 1992, in Olson 2000; Archer 2012).

Social wasps also utilize their stings during frequent inter- and intra-specific conflicts among gynes, workers and castes (Archer 2012). During spring, mortal fights between queens competing for nesting sites are the norm (Greene 1991; Matsuura and Yamane 1990; Hunt 2007). In some yellowjackets (e.g., Vespula, Dolichovespula), usurpation may be more common than nest initiation (Greene 1991), and disputes appear more frequent among conspecifics (Buck et al. 2008). The sting can also be used by workers to kill the males (Monceau et al. 2013), or during matricide when workers kill their queen (Loope 2014). In general, aggression (e.g., mauling, biting and grappling) among workers is common, both at the foraging sites (Parrish 1984; Raveret Richter 2000) and inside the nest (Akre et al. 1976; O’Donnell and Jeanne 1995). Fighting, including attempted stinging, usually involves workers with developed ovaries, both in colonies with and without a queen (Archer 2012).

Stinging is the immediate response in a life-threatening situation, both for the individual wasps and the colonies. Intruders inside a social nest are stung in rapid actions, involving one or more co-operating wasps (Edwards 1980). If the nest is disturbed from outside, agitated workers rush out and eventually take flight, with a minority of individuals normally stinging the target (Greene 1991). In species with large colonies, active nest defense can involve hundreds of individuals and mass stinging can potentially be lethal to humans (Vetter et al. 1999). Vespine wasps, including yellowjackets, are particularly aggressive and are the group usually responsible for hymenopteran stinging events involving humans (Vetter et al. 1999). Wasps defend their nests fiercely (Edwards 1980) and may also sting disturbers while foraging, especially in the late summer and early fall, when many hungry queen larvae must be fed with flesh (Spradbery 1973). Given the sting morphology, social wasps are usually able to sting multiple times (Vetter et al. 1999), although sting anchoring routinely occurs in some species such as V. maculifrons (Greene 1991).

Because of their aggressiveness, predatory habits and sting features, workers of social wasps have the potential to sting more than once throughout their life, in a variety of contexts. As a consequence, it is possible that the stinging phenotype of one individual may be shaped by previous experience. As highlighted in the past, there are a number of unresolved problems with the study of wasps’ stinging behavior (Greene 1991; Olson 2000). In general, the descriptions of specific aggressiveness for wasps are mostly anecdotal, qualitative and contradictory. Knowledge of the factors regulating individual wasp response thresholds for various disturbing stimuli is very limited. Alarm pheromones, worker age, colony size, brood population, previous disturbance, and meteorological conditions represent some commonly discussed factors influencing the stinging behavior (Gaul 1952; Ross and Matthews 1991; Potter 1964; London and Jeanne 1996, 2003; Reed and Landolt 2000).

In the present paper we quantify individual wasps’ stinging propensity, with a laboratory bioassay. As a proxy for aggressiveness, we describe and measure the sting extension of the common wasp, V. vulgaris, looking at its individual plasticity and inter-individual variation. Taking into account individual age and size, we measured variation in individual SER throughout the adult life of these wasps, relating it to previous experience, presumptive task performed by the tested individuals and intensity of noxious stimulation.

Materials and methods

A V. vulgaris nest (colony 1) was excavated in Christchurch, New Zealand, on the 19th of January 2014. The nest was boxed and carried to a greenhouse facility at the Plant and Food Research Biosecurity laboratory, Lincoln. Wasps were allowed to forage outside of the nest box, in the grounds surrounding the laboratory. On the 18th of March 2014, the colony was anesthetized with carbon dioxide, the box opened, and three pieces of nest combs with capped brood were removed and placed in an incubator (30 °C, 50 % relative humidity, and complete darkness). A total of 292 known-age workers were obtained between the 19th of March and the 7th of April 2014. Newly emerged adults were collected daily, caged and tested for their SER. The cages were 50-ml plastic jars modified by covering either end with a metal mesh. The individually caged wasps were fed daily by imbibing with 1 ml of 30 % sugar solution one cotton bud suspended from the lid mesh. The cages were numbered and organized on trays lined with absorbent paper that was changed daily, and kept in the incubator under the same conditions as the nest combs. Two additional V. vulgaris nests were located in the grounds of the research facility at Lincoln and exiting wasps from both colonies (colony 2, 3) were collected and tested (see below).

SER scoring

The caged, known-age individuals were lightly anesthetized with carbon dioxide and, when subdued but still moving, harnessed by holding their petiole from the side with metal forceps (Bioquip, feather weight forceps, narrow tip), kept closed with one clip (Fig. 1, Fig. 1S). Each individual was allowed 30 min to acclimate to the harnessing before testing. The forceps restraining the wasp were connected to one polarized cable, while the other one was connected to a metal arch covered in conductive gel (Spectra electrode gel, contents 8.5 OZ, 250 GMS, salt free, Parker laboratories Inc.) (Figure 2S). The electric stimulation was delivered for 2 s by touching the wasp’s cervix with the metal arch. The standard stimulus intensity was 7.5 Volts (Balderrama et al. 2002). Each wasp was tested four consecutive times (rounds), with an inter-round interval of 1 min. Responses during each stimulation were recorded by observing the tested individuals’ abdomen from the side, adopting the four-level scoring system proposed by Lenoir et al. (2006) for honey bees. The absence of any response was scored 0, the response was scored 0.33 when the sting was extended less than the half of its length, 0.67 when it was extended between half and all its length and finally, the response was scored 1 when the sting was fully extended and the sting chamber was completely open. The wasps’ final SER score was obtained by averaging the four consecutive measures and hence varied between 0 (minimum aggression response) and 1 (maximum aggression response) (Fig. 3S). Student’s pairwise t tests were used to compare the individual SER between the first trial’s rounds.

Fig. 1
figure 1

a Individual plasticity of the sting extension response (SER, circles) throughout life, on the basis of previous noxious experience. Points are mean ± SE. Continuous lines significant differences between consecutive trials. Pictures phenotypes at the two extremes (docile vs aggressive). The inset histogram in the box shows the distribution of all the first trial’s SER scores (n = 122) and the two subgroups (white and black bar). b Individual SER plasticity for the initially aggressive group (SER MAX, black points, individuals with the highest stinging score recorded on the first trial) and the initially docile group (SER MIN, white points, individuals showing no response on the first day trial). Trials were separated by approximately 48 h. Points are mean ± SE. p values refer to difference in SER score between aggressive and docile group, tested trial by trial. ***p < 0.001, ns p > 0.05. Please refer to Table 2S (supplementary material), for details of the statistical comparisons

SER in relation to individual age and size

A total of 264 known-age individuals were tested for the first time at different ages (1–27 days of adult life). To study the relationship between individual size and stinging propensity, a subset of 112 wasps was subsequently preserved in 90 % ethanol at −20 °C. To estimate individual body size, we measured the wasps’ head width, using a digital caliper to the nearest 0.01 mm. To separately test for an effect of age and size on the SER score, linear models were used. We also performed a multiple regression testing the effect of age and size simultaneously, using data from 105 individuals where we had both size and age data.

Plasticity in individual SER throughout life

One subset of 122 known-age wasps was kept in individual cages (see above) and the individuals were retested at 48 h intervals (trials) throughout their remaining life. A total of 112 individuals were tested at least twice (1st test age: 10.0 ± 5.5, 1–19 days). Their average adult lifespan was 18.1 ± 5.1, with a range of 8–28 days.

On the base of the initial SER score at day 1, two subgroups were distinguished: 31 individuals had an SER score = 0 for the first trial (no sting extension response on the four consecutive rounds). Of these, 26 individuals could be retested and were pooled in the “docile group” (1st test age: 9.4 ± 4.2, 1–13 days). The 29 individuals showing the highest SER score on the first trial (=0.83) were assigned to the “aggressive group” (1st test age: 8.1 ± 5.7, 1–16 days).

To analyze the change of the individual SER during lifetime, repeated measures analysis of variance with Bonferroni confidence interval adjustment was performed in pairwise comparisons among trials (days). Since data were not normally distributed, Kruskal–Wallis test was used to compare the SER score of the docile group vs aggressive group, trial by trial.

SER and presumptive task (forager/guard)

Two task-related groups of wasps were sampled from colony 1. Over 3 days, we captured 52 incoming foragers with a butterfly net, while flying with their foraging loads in proximity of the nest. On the same days, after the capturing the foragers, we hit the substrate supporting the nest with a stick, provoking a colony defense reaction, and captured a total of 45 “guards” that emerged from the nest. Wasps from both groups were transferred in individual cages, and their SER examined within 2 h. Since data were not normally distributed, Kruskal–Wallis test was used to test for differences in the SER between the groups of foragers and guards.

SER and stimulus intensity

Individuals exiting the three colonies were captured and then tested at 7.5 and 12.5 Volts (colony 1: 78 individuals; colony 2: 26 individuals; colony 3: 12 individuals). For each individual, the inter-test interval was set to 30 min and the voltage order established randomly. Student’s pairwise t test was used to compare the individual SER between the two tests.

All data analyses were performed using R 3.1.0 (R Development Core Team Team 2012) and IBM SPSS Statistics 21 (IBM Corp. 2012).

Results

SER in relation to age and size

Substantial and consistent variation in individual stinging response was found between wasps (average SER score ± SD: 0.64 ± 0.40; n = 292). On the first trial, one individual out of five showed no response to the stimulation and half of the wasps showed the full sting extension on at least one round. Within each trial, the individual SER score was consistent across rounds (Table 1S). Only starting from the second trial, five individuals (one for the next two) scored 1 (full sting extension on the four consecutive rounds). Despite wide variability in SER scores, a positive relationship between age and SER was found (multiple R 2 = 0.066, \( R_{\text{adj}}^{ 2} \) = 0.062, F 1,262 = 18.45, p < 0.0001) (Fig. 2). When both age and size were considered, for a subset of 105 individuals, no significant effect of the two factors was found (effect of age: F 1,102 = 1.35, p = 0.251; effect of size: F 1,102 = 0.09, p = 0.765. Model R 2 = 0.013, \( R_{\text{adj}}^{ 2} \) = −0.006).

Fig. 2
figure 2

Regression between the sting extension response (SER) and the age of the wasp. Point symbols are mean ± SE. The point’s diameter is proportional to the sample size of each age (total n = 264, n per day = 2–47)

Individuals varied considerably in size (3.08–3.71 mm head width). However, no relationship was found between individual size and SER score (Multiple R 2 = 0.0024, \( R_{\text{adj}}^{ 2} \) = −0.007, F 1,110 = 0.264, p = 0.608) (Fig. 3).

Fig. 3
figure 3

Correlation between the sting extension response (SER) and the size of the wasp. Each point represents one individual and its SER score at the first test

Individual SER plasticity throughout life

There was a significant change in individual SER over successive trials and days. Overall, the average stinging response was stronger on the second trial and subsequently decreased (Fig. 1a). For the group of individuals retested, the distribution of SER on the first trial was bimodal, with 21 % of individuals displaying an SER score of 0 (pooled in the “docile” group) and 24 % showing an SER score of 0.83 (pooled in the “aggressive” group) (Fig. 1a inset). The docile and aggressive groups showed opposite trends over time. The aggressive group displayed a significantly higher SER score than the docile group for the first three trials, but with a decline in SER over time, while the docile group showed an increase in SER over the first seven trials. The two groups showed a similar score from the fourth to the seventh trial (Fig. 1b; Table 2S). Some individuals within these groups never changed, always displaying very low or very high SER scores.

SER and presumptive task (forager/guard)

On average, the SER score of presumptive guards was threefold higher than the SER score of the foragers (χ2 = 19.54, df = 1, p < 0.0001). Nevertheless, inter-individual variation within the two groups was wide (Fig. 4).

Fig. 4
figure 4

Sting extension response (SER) and presumptive task undertaken by tested individuals. “Foragers” were captured on their way back to the nest. “Guards” were collected after striking the nest box. Boxes represent the lower and upper quartile, the bold line is the median and whiskers represent extreme values of SER score, with the circles identifying outliers. ***p < 0.001

SER and stimulus intensity

The average individual from all the three colonies showed a significantly stronger stinging response in correspondence to higher stimulus intensity. The SER score at 12.5 V was up to almost four times higher than at 7.5 V (colony 1: t = −13.70, df = 77, p < 0.0001; colony 2: t = −7.84, df = 25, p < 0.0001; colony 3: t = −3.06, df = 11, p = 0.0110). Some individuals showed no or little response at both stimulus intensity levels (Fig. 5).

Fig. 5
figure 5

Sting extension response (SER) of individuals from three V. vulgaris colonies, tested twice at different intensities of noxious stimulus (electric current), in random order. Boxes represent the lower and upper quartile, the bold line is the median and whiskers represent extreme values of SER score, with the circles identifying outliers. ***p < 0.001, *p < 0.05

Discussion

Given the ubiquitous occurrence of direct competitive interactions, aggression plays a major role in fitness outcomes (Archer 1988). Despite being reported numerous times in many contexts, differences in defensiveness/aggressiveness among social hymenopteran species (e.g., Breed et al. 2004; Greene 1991), between and within specific colonies (e.g., Jeanne 1988; Pinter-Wollman 2012) have been described mostly qualitatively. Scoring the sting extension response, as a proxy for aggressiveness, can quantify an important trait of the individual behavioral phenotype (Dingemanse and Wolf 2013; Grinsted et al. 2013). The SER bioassay hence represents a potential tool for the study of inter-individual variation and animal personalities in an evolutionary perspective. A growing body of literature is showing the importance to ecological studies of quantifying individual variability (Bolnick et al. 2003, 2011; Violle et al. 2012; Wolf and Weissing 2012). We believe that the sting extension response is an opportunity to study behavioral variability among the social Aculeata, both at an individual level and in comparative studies.

We tested for, and found, substantial and consistent variation in the stinging thresholds of common wasp nestmates. Although if data for colony 2 and 3 are very limited, this behavioral pattern was consistent for all wasp colonies examined. The individual SER measure proved to be repeatable across rounds and trials (Fig. 1b, Table 1S, 2S). The plasticity of SER recorded throughout a wasp’s life highlights the variability in time of individual behavior. Initially docile wasps developed a stronger SER, while initially aggressive wasps showed an opposite trend, almost having a lower SER score compared to the first group later in their life (Trial 6: Fig. 1b; Table 2Sb). These inverse patterns seem to exclude the possibility of a simple sensitization to the electric stimulation (Shettleworth 2010). Also, artificial, asocial and stressful captivity conditions would likely affect individuals similarly. Our data do not allow to speculate on the significance of these plastic changes in a natural context, but still show that extremely different phenotypes can converge and express similarly, on the base of the same previous experience. Taken together, our empirical findings support theoretical frameworks where individuals show innate differences but change their behavioral phenotype throughout their life (Theraulaz et al. 1998; Beshers and Fewell 2001).

Age was correlated to the stinging propensity of V. vulgaris individuals. This result finds a parallel in field-based studies performed on other social wasp species, where older individuals were found to be more likely to defend the colony (Jeanne et al. 1992; Togni and Giannotti 2010; Monceau et al. 2013). However, when including head width as a covariate, no relationship between age and SER was found, probably due to the high intra-trial variability and reduction in sample size. It must be noticed that age poorly explained the variation observed. Moreover, the isolation of known-age adults might have influenced their behavioral phenotype. The stinging reaction was observed in individuals of all ages, including in one-day-old individuals. Ontogenetic variation in stinging behavior has been reported for honey bees, with older individuals (20-day-old) showing the lowest stinging thresholds (Paxton et al. 1994). The sting reflex was found to be fully developed in bees older than 5–7 days (Breed et al. 2004). The median age at which A. mellifera workers become guards was reported to be 12 days (Lenoir et al. 2006). Older individuals have lower life expectancy and theoretically are predicted to be more likely to perform risky tasks, e.g., nest defense, their anticipated future value to their society being lower (Kolmes and Fergusson-Kolmes 1989).

We found no significant relationship among individual size and SER. Still, it should be noted that small individuals with head width <3.3 mm never displayed an SER score greater than 0.5 (Fig. 3), perhaps suggesting a non-linear relationship. To our knowledge, no study has so far investigated a possible relationship between sting extension and individual size. Vespula spp. colonies are good systems, being characterized by seasonal morphometric changes of workers, with increasing average worker size and variation as the colony develops (Spradbery 1972). Size relationships represent a crucial factor in most predator–prey and intra-specific competitive interactions (e.g., MacNulty et al. 2009; Santoro et al. 2011). Together with age, size is also extremely important in establishing dominance hierarchies in social insects (O’Donnell and Jeanne 1995; Hogendoorn and Velthuis 1999). Dominance relationship among colony members can, in turn, influence the division of labor among and within castes (O’Donnell and Jeanne 1995). Size was also shown to directly determine individual behavioral thresholds: bigger bumble bees (Bombus terrestris), for example, are more likely to forage and have more sensitive antennae, showing lower thresholds to odors (Spaethe et al. 2007).

Despite their intrinsic limitations, measures obtained in solitary, standardized laboratory settings can be reliable predictors of behavior in a social context in the field (Grinsted et al. 2013). The SER bioassay detected wide variability in stinging propensity and discriminated individuals previously involved in colony defense from foragers. These results match field observations that stinging and nest active defense by social hymenopterans involves only a subset of workers (Greene 1991; Breed et al. 2004). For example, Africanized honey bee guards were faster to sting than nest bees (Uribe-Rubio et al. 2008). Given that our wasp “guards” were tested within 2 h from the colony provocation, they might have still been under the effect of an alarm pheromone, for which the composition and latency are still to be investigated (Reed and Landolt 2000).

The stinging response of individuals from the three colonies was more pronounced in association with a more intense electric stimulation. The same individual showed a more aggressive reaction to the stronger of the two noxious stimuli, regardless of which one was experienced first. Similar results were obtained with different groups of honey bee workers, whose responsiveness to electric stimulation was found to increase with higher voltages, up to 8 V (Balderrama et al. 2002).

Variation in SER can have a genetic basis. A number of studies on A. mellifera have demonstrated that guarding is a specialized task performed by a few, genetically predisposed individuals (Robinson 1992; Breed et al. 2004; Hunt 2007). Inter-individual variability in the honey bee SER has been observed as associated with different paternity lines (Lenoir et al. 2006). Polyandry is rare among social hymenopterans (Strassmann 2001) and, despite having costs in terms of colony cohesion (Crozier and Fjerdingstad 2001), was linked to colony fitness in, e.g., V. maculifrons (Goodisman et al. 2007). Polyandry is thought to have an adaptive significance by tuning group responses of insect colonies, determining worker subgroups with different task-related thresholds (Oldroyd and Fewell 2007). Both A. mellifera and Vespula are peculiar amongst hymenopterans, showing marked polyandry (Strassmann 2001, Hanna et al. 2013). Paternal effects in wasp’s aggressiveness await to be tested. However, we predict that colonies and species with single-mating queens will show lower levels of aggressive variability among nestmates, measurable with the SER bioassay.

A very promising avenue is the SER classical conditioning for research focusing on the differential learning performances among Aculeata. Successful aversive conditioning was done on A. mellifera (e.g., Vergoz et al. 2007). Wasps are traditionally described as explorative, opportunistic foragers, capable of quick associative learning (Raveret Richter 2000; Moreyra et al. 2014). Wasps’ associative learning capabilities await to be scored in the laboratory.

Our findings add to the existing literature by extending earlier SER studies on A. mellifera to a new system, vespine wasps. We quantified the degree individuals from the same insect colony can differ in the SER, providing an example of experience-based convergence of initially different behavioral phenotypes. The quantification of the stinging behavior through the SER score proved to be a repeatable, easily performed procedure to test for innate threshold differences between individual wasps. We encourage the development of standardized SER testing methods and more studies to better understand the biological significance of SER, focusing on the relationship between the measures obtained with this laboratory bioassay and aggressive/defensive interactions in different natural contexts. At these conditions, the SER bioassay has the potential to become a standard measure of aggressiveness, a cardinal personality trait. Even more interestingly, the SER comes with the promise of exciting advancement in comparative and learning studies.