To show that a behavior is a signal, it is necessary though not sufficient to demonstrate that receivers respond to the behavior. How practical such demonstrations are depends strongly on signal modality. Animals use a variety of modalities to signal the following: visual, auditory, chemical, seismic, electrical, and tactile (Marler 1967; Bradbury and Vehrencamp 2011). Most animal groups emphasize a subset of the available modalities; anurans, for example, emphasize auditory signals (Gerhardt and Huber 2002), while social insects emphasize chemical ones (Wilson 1965). In birds, two signaling modalities predominate, visual and auditory. Receiver response to auditory displays in birds has long been easy to investigate through sound playback, using a variety of playback designs (Weeden and Falls 1959; King and West 1977; Krebs et al. 1978). Receiver response to static visual displays has also been easily tested, using experimental manipulations such as dyeing (Rohwer 1977; Hill 1991) or the cutting or extending of feathers (Andersson 1982; Møller 1988; Petrie and Halliday 1994). What has proven much more difficult to study is another type of visual display—display movements. Here, we examine a display movement, “wing waving,” in swamp sparrows (Melospiza georgiana), to test for receiver response using a wing waving robotic swamp sparrow.

Wing waving is a distinctive display movement (Fig. 1; Supplementary Material video 1) in which sparrows raise and flutter one or both wings while remaining perched (Nice 1943; Nelson and Marler 1989). Swamp sparrows perform wing waves in the context of aggressive territory defense (Nelson and Marler 1989; Ballentine et al. 2008). In addition to wing waving, male swamp sparrows produce a variety of vocal signals during territorial aggression, including normal “broadcast” songs, rasp and wheeze calls, and low-amplitude “soft songs,” (Anderson et al. 2008; Ballentine et al. 2008). Wing waves and song are commonly produced together; in the Ballentine et al. (2008) study, for example, 93 % of the 308 bouts of wing waving which were observed were given together with soft song, broadcast song, or both (i.e., in the same 10-s block), and 7 % were given with neither category of song (B. Ballentine et al. unpublished data). Of the vocal and visual display behaviors given in response to territorial intrusion, only the numbers of soft songs and wing waves differ significantly between males that do and do not attack a conspecific model, with attackers giving more of each display than non-attackers (Ballentine et al. 2008). Soft songs and wing waves thus satisfy the “predictive” criterion for an aggressive signal (Searcy and Beecher 2009). Here, we test receiver response to wing waving in the context of singing during a territorial intrusion.

Fig. 1
figure 1

A male swamp sparrow, M. georgiana, performing the wing wave display in response to conspecific song playback

Ideally, receiver response to any display should be demonstrated experimentally, using designs in which the display of interest is manipulated independently of any other signal or cue provided by the animal, and response to the manipulated display is measured. For auditory signals, experiments of this sort have long been possible via playback of recorded sounds through loudspeakers (e.g., Martof and Thompson 1958; Weeden and Falls 1959; Wishart and Riordan 1959; Falls 1992). Playback experiments of this sort have already demonstrated response to soft song in song sparrows (Melospiza melodia) (Anderson et al. 2012), a close relative of swamp sparrows. Manipulation and playback of display movements such as wing waving have historically been more difficult. One solution has been video playback (Clark and Uetz 1992; Rowland 1995; Burford et al. 2000), but this technique is best suited for laboratory use. What seems more appropriate, especially for birds, is to use independently moving motorized models, popularly known as “robots.”

Robots have been used previously to examine the function of display movements given both alone and as part of multimodal displays (Taylor et al. 2008). In the satin bowerbird (Ptilonorhynchus violaceus), females perform a crouching movement in response to male courtship. Patricelli et al. (2002) showed using a robotic female that the crouch elicits male courtship, with males courting more intensely the more often the robot crouched. Balsby and Dabelsteen (2002) studied a jump display given by female whitethroats (Sylvia communis). Using a mechanical model, they showed that male whitethroats responded with more courtship song when the model jumped than when it did not. Narins et al. (2003) studied the aggressive display of the dart-poison frog Epipedobates femoralis, which combines calling and vocal sac movements. Experiments with an electromechanical model frog demonstrated that neither calling nor vocal sac movement alone elicits attack, but that frogs will attack a model that produces both the auditory and visual display (Narins et al. 2003). A fourth example involves the Eastern gray squirrel (Sciurus carolinensis), which gives both alarm calls and tail movements in response to predators. Partan et al. (2009) used a mechanical model squirrel to show that Eastern gray squirrels respond with alarm behavior to either the vocal or the visual component of the display and show an enhanced response when the two modalities are presented together.

To test the hypothesis that wing waving is a signal and, specifically, that receivers respond to wing waving as an aggressive signal, we conducted two experiments. First, we compared the aggressive response of territorial male swamp sparrows to either a wing waving intruder or a motionless intruder, both coupled with broadcast song playback (henceforth, the “wing waving experiment”). The intruder was a robotic taxidermic mount of a male swamp sparrow designed to perform the wing wave display. We coupled song with the visual display because wing waves are typically given with either soft song or broadcast song. We used broadcast song for playback in order to draw the subject’s attention to the robotic mount, and because broadcast song rate does not correlate with wing wave rate, with approach to the simulated intruder, or with attack likelihood (Ballentine et al. 2008). Because only wing waving differs between the wing waving and motionless treatments, any difference in response indicates response to wing waving.

In a second experiment, we controlled for the possibility that movement alone, rather than the wing wave motion in particular, affects aggressive response. This could occur because a moving mount is more likely to draw and keep a subject’s attention than a motionless mount even if the movement has no signal value, or because movement makes the mount appear more realistic. For this experiment, we presented subjects with a robotic mount designed to perform an arbitrary nonaggressive movement, turning from side to side, coupled with broadcast song playback. Here again, we compared aggressive response between trials with the turning mount versus trials with the mount motionless (henceforth, “turn-motion experiment”).

Methods

We conducted trials in a population of swamp sparrows in Conneaut Marsh, Crawford County, Pennsylvania, USA, between May 22 and June 16, 2011 (wing wave experiment, N = 30) and May 19 and June 1, 2012 (turn-motion experiment, N = 30). Birds were not individually marked. We identified territorial males by location, as territories in this species remain stable during the height of the breeding season. The two experiments were performed in consecutive years, and because we visited some of the same territories in both years, some males may have been subjects in both experiments. We completed all trials between 0600 and 1000 hours.

During both experiments, males received both a motion treatment (either wing wave or turn motion) and a stationary treatment (mount not moving). Within each experiment, tests were performed 48 h apart (except for six subjects, which received treatments 72 h apart due to weather or other logistic constraints). Whether the motion or stationary treatment was presented first was counter-balanced across subjects.

We constructed the wing waving sparrow by placing a robotic frame (designed and built by DP) actuated with a custom-built linear motor inside the body cavity of a taxidermic mount of a male swamp sparrow. The mount was affixed to a wood base in a standing position. The motor drove movement of the right wing, which waved at a speed and duration consistent with displays given by wild birds. Wing waving with one wing at a time is a common pattern in swamp sparrows (see Fig. 1). Wing waves occurred in bouts of 2.5 s at 1.2-s intervals. The turn-motion sparrow was a second taxidermic mount positioned the same as the wing waving sparrow. Its feet were affixed to a small dowel attached to a rotating motor hidden beneath the wood base. The motor turned the mount from side to side at roughly a 45° angle. Turns occurred in bouts of four (left, right, left, and right) lasting 2.0 s at 1.5-s intervals to roughly match the motion duty cycle of the wing waving sparrow (see Supplementary Material 2 for videos of both robotic mounts). For both mounts, motion was automated using electronics housed in a control box and held by an experimenter who stood approximately 10 m from the mount/loudspeaker setup. The motion routines were timed commands written in Basic and controlled with a PicAxe 28X2 microcontroller (Revolution Education, Ltd., Bath, UK). The motors were powered through two STMicroelectronics L298N H-Bridges (STMicroelectronics, Coppell, TX). The mounts were very similar visually in size and plumage characteristics. Both mounts were male swamp sparrows from the same population caught during the breeding season. The most obviously variable male plumage trait in swamp sparrows is the brightness and rustiness of the crown feathers (Greenberg 1988). The two robotic mounts did not differ noticeably in this regard. Both sparrow mounts were enclosed in a wire-mesh cage (15 cm W × 15 cm L × 17 cm H) to protect them from attacks by study subjects.

Using playback of broadcast song during trials was important for luring the subjects to the vicinity of the mount and ensuring they would notice it in a reasonable amount of time. Song playbacks consisted of a single exemplar of a swamp sparrow song presented at a rate of six songs per minute, a typical rate for this species. All song stimuli were recorded in the Conneaut Marsh population between 2003 and 2008. We played each subject a different song exemplar during each trial (i.e., 60 different song exemplars were used for the 30 pair-wise trials in each experiment). We calibrated playback amplitude to 80–83 dB at 1 m from the speaker using a BK Precision 732A sound level meter and broadcast playbacks using a Marantz 660 PMD digital recorder and an Advent Powered Partners AV570 speaker. During trials, we placed the robotic sparrow mount on a post at approximately 1 m height and approximately .5 m above the speaker.

With the exception of the motion performed by the two mounts, testing procedures were identical in the two experiments. We began trials by luring the subject to within 4 m of the speaker using playback. We played the lure song (one song exemplar) a maximum of six times (1 min total duration). If the subject did not approach within 4 m of the speaker during this time, we waited 2 min and then played the lure six more times; if the subject failed to approach within 4 m during this second set of lure songs, the trial was abandoned. Once the subject approached within 4 m of the speaker, we began the trial by restarting the playback and by starting the motion cycle if the trial was a motion treatment. A trial consisted of 2 min of song playback (12 total songs) accompanied by either a moving or stationary sparrow mount. During trials, one experimenter controlled the song playback and the mount’s motion cycle, while a second observer recorded the subject’s behaviors on a digital recorder for later transcription.

We measured the aggressive response to the mount and playback using the subject’s proximity to the mount/speaker averaged over the trial. This measure is strongly associated with attack likelihood in swamp sparrows (Ballentine et al. 2008) and related species (Searcy et al. 2006) and thus provides a good single-measure proxy for aggressiveness. In addition to proximity, we measured two display behaviors performed by subjects, wing waves and broadcast songs. With wing waves, we were especially interested in whether receivers adjust their own wing wave rate in response to an intruder that wing waves compared to one that does not, as this would be further evidence that wing waving is a signal. We did not consider soft song rate because it is strongly positively correlated with wing wave rate (Ballentine et al. 2008). Broadcast song rate does not correlate with wing wave rate or proximity and contains no information on attack likelihood (Ballentine et al. 2008). Nonetheless, it is a conspicuous auditory display performed in response to territorial intrusion, and features of broadcast song have been shown to influence male response (Nowicki et al. 2001, DuBois et al. 2009, 2011).

We sampled the subject’s proximity to the speaker at 5-s intervals and binned the samples into the categories 0–2, 2–4, 4–8, 8–16, and >16 m. Distance estimation was aided by placing markers at 2, 4, and 8 m from the speaker before the trial. We calculated each subject’s average proximity to the speaker following the method of Peters et al. (1980); we considered a bird in the 0–2-m range for a given 5-s interval to be at 1 m from the speaker during that interval, and a bird in the 2–4-m range at 3 m, in the 4–8-m range at 6 m, in the 8–16-m range at 12 m, and in the >16-m range to be at 24 m. We then averaged the proximity measures from the 5-s interval estimates across the 2-min observation period.

Wing wave displays are rapid upward flips of one or the other wing typically given in bouts of varying length and are hard to count in the field as discrete behaviors. Accordingly, we calculated wing wave rate as the number of 5-s intervals (out of a possible 24 intervals) in which any wing waves occurred (Altmann 1974). In one analysis, we asked whether subjects differed in the strength of their aggressive response to the motion and stationary treatments by conducting a within-subjects comparison of our proximity measure using paired t tests (two tailed). In a second analysis, we asked whether subjects adjusted their signaling behaviors (wing waves and songs) between treatments, also using paired t tests (two tailed). We assessed the relationship between wing waves and aggressive response across subjects using Pearson product–moment correlation coefficients. All analyses were performed in JMP Pro v10 (SAS Institute, Inc.).

Results

The majority of subjects in both experiments approached the mount/speaker within 4 m during the first lure playback (77 % of trials in the wing wave experiment and 85 % in the turn-motion experiment). Subjects met the criterion during the second lure in the remainder of trials, with one exception; this subject was tested successfully the following day.

In the wing wave experiment, subjects approached significantly closer to the mount/speaker (more aggressive) during wing waving trials than during stationary trials (t = 3.27, 29 degrees of freedom (df), p = 0.003, Fig. 2a). The numbers of wing wave displays and songs performed by subjects did not differ significantly during trials with the wing waving mount compared to trials where the mount was stationary (wing waves t = 1.53, 29 df, p = 0.136; songs t = 1.170, 29 df, p = 0.252, Fig. 2b).

Fig. 2
figure 2

Behavioral responses during trials of conspecific playback coupled with a stationary or a moving taxidermic mount. a Mean (±SE) proximity to loudspeaker/mount during trials with the stationary mount versus the wing waving mount; b mean (±SE) broadcast songs (dark) and bouts of wing waves (light) during stationary and wing waving trials; c mean (±SE) proximity to loudspeaker/mount during trials with the stationary mount versus the turn-motion mount; d mean (±SE) broadcast songs (dark) and bouts of wing waves (light) during stationary and turn-motion trials; **p < 0.01

In the turn-motion experiment, subjects responded with similar aggressiveness during motion and stationary trials (comparing averaged distance to the mount/speaker, t = 0.81, 29 df, p = 0.428, Fig. 2c). Subjects also gave similar rates of wing wave displays (t = 0.585, 29 df, p = 0.576) and songs (t = 0.24, 29 df, p = 0.812, Fig. 2d) during motion and stationary trials.

As in Ballentine et al. (2008), we found a positive relationship between aggressiveness and wing waves during trials with the stationary mounts. Subjects that were more aggressive (lower proximity scores) produced more wing wave displays (stationary wing wave mount, Pearson r s = −0.46, p = 0.011, Fig. 3a; stationary turn-motion mount, r s = −0.42, p = 0.023, Fig. 3b). This relationship did not occur, however, during trials with the moving mounts (wing wave mount r s = −0.038, p = 0.84, Fig. 3c; turn-motion mount r s = −0.15, p = 0.443, Fig. 3d). In the case of the wing waving trials, this effect was largely due to subjects approaching the mount more closely and producing wing waves regardless of their proximity.

Fig. 3
figure 3

Relationships between bouts of wing wave displays given by subject birds and the subjects’ proximity to the loudspeaker/mount during trials with moving and stationary mounts. The wing waving mount with movement disabled was used as the stationary mount for the wing wave experiment, and the turn-motion mount with movement disabled was used as the stationary mount for the turn-motion experiment. The panels show subjects’ wing waves and proximity during trials with the stationary mount in the wing wave experiment (a), the stationary mount in the turn-motion experiment (b), the wing waving mount (c), and the turn-motion mount (d)

We found no effect of year when comparing responses during the stationary mount trials between the two experiments. Subjects responded similarly to the stationary mounts in the 2 years using both aggression and signaling measures (proximity to speaker, 2011 vs. 2012 mean ± standard error (SE) = 3.42 ± 0.47 m vs. 2.62 ± 0.30 m; t = 0.60, 58 df, p = 0.551) wing waves (5.3 ± 1.1 vs. 6.37 ± 1.4 bouts; t = 0.6, 58 df, p = 0.585), and songs (2.0 ± 0.62 vs. 0.77 ± 0.41 songs; t = 1.65, 58 df, p = 0.104).

Discussion

Determining whether a putative signal conveys aggressive intent requires demonstrating that (1) the signal increases in aggressive contexts, (2) the signal predicts aggressive escalation by the signaler, and (3) receivers show differential responses to the signal versus a control stimulus (Searcy and Nowicki 2005; Searcy and Beecher 2009). The wing wave display in swamp sparrows satisfies the first two criteria as it occurs predominantly during aggressive interactions and has been shown to predict subsequent physical attack by the signaler (Ballentine et al. 2008). Here, we provide evidence for the third criterion, showing that male receivers respond more aggressively to a simulated singing and wing waving intruder compared to an intruder that sings only. Because only the presence and absence of wing waving differ between these two treatments, the difference in aggressive response to the treatments demonstrates that male swamp sparrows respond to wing waving. We also show that motion of the mount alone cannot account for this difference, as subjects did not respond more aggressively to a mount that turned side to side compared to the mount when stationary. Taken together, our results support the hypothesis that the wing wave display in swamp sparrows is perceived as an aggressive signal. As we did not test response to wing waving as an isolated display, we must be circumspect about whether wing waving by itself would affect receivers in a similar way.

Our measure of aggressive response was the subjects’ proximity to the mount/speaker setup, and we also measured wing waves and broadcast songs given by subjects (discussed below). We focused on proximity as the sole measure of aggressiveness because proximity is a strong predictor of attack in this species and thus directly measures aggressive intentions. Attacks are obviously the most direct measure of all, but only three of the 60 birds we tested attacked the caged mount in the course of 120 trials, so we did not include this measure. We also did not measure the number of soft songs, which, however, strongly correlates with two behaviors that we did measure, wing waves and proximity. Soft songs may prove useful in future studies that seek to test additional questions about how the wing wave display is perceived, for example, when paired with soft song versus broadcast song.

The within-subjects comparisons of wing wave rates between motion and stationary trials showed that on average, subjects are consistent in their wing waving behavior across treatments. However, in line with Ballentine et al. (2008), we found a positive relationship across subjects between wing waving and aggressiveness during stationary mount trials such that individuals that approached closely produced many wing waves, and those that did not approach produced few or no wing waves. This relationship did not arise during trials with the moving mounts.

In the case of the wing waving experiment, one explanation for this result is that the addition of the wing waving movement to the mount simulated a more realistic intruder and resulted in subjects responding more aggressively overall, including closer approach and more wing wave displays. In addition, wing waving may serve as part of a hierarchical signaling system in swamp sparrows. Hierarchical signaling systems are known for several bird species, in which signals are switched or added to escalate the encounter (e.g., Beecher et al. 1996; Burt et al. 2001). In song sparrows, a close relative of swamp sparrows, wing waving has been hypothesized to occur along with soft song at or near the top of the hierarchy of aggressive signals (Ç. Akçay et al. unpublished). The same may be true in swamp sparrows; wing waves reliably predict attack (Ballentine et al. 2008), so it is reasonable to conclude that wing waving is a highly escalated signal that evokes a highly escalated response from subjects.

In addition to the wing wave display, we also examined the rate of broadcast songs given by subjects. Song rate appears to convey no information on attack likelihood in swamp sparrows (Ballentine et al. 2008), yet an increase in broadcast singing is a typical response to a territorial challenge. This behavior could be explained by the “multiple messages” hypothesis, which states that different signals convey different kinds of information or information about different aspects of signaler quality (Møller and Pomiankowski 1993; Johnstone 1996; Hebets and Papaj 2005). The rate of broadcast songs, for example, might contain information on the singer’s condition (Davies and Lundberg 1984; Gottlander 1987) rather than on its aggressive intentions, information that would still be of interest to aggressive opponents. Alternatively, the intended receiver of broadcast songs could be neighboring males, the signaler’s social mate, or other nearby females that are potential extra-pair mates. Broadcast song rate has not been examined with regard to male qualities in swamp sparrows, but it has been shown that both male and female receivers respond to features of song such as vocal performance (Ballentine et al. 2004; DuBois et al. 2009, 2011), and vocal performance predicts male age and size (Ballentine 2009). Additional work is needed, then, to determine what information is being signaled by broadcast songs during agonistic interactions, and how the message may differ when broadcast songs are paired with wing waves.

Of the various behaviors, male swamp sparrows perform in response to a rival male, two displays, wing waves and soft songs, predict attack, and thus appear to be signals of aggressive intentions. These displays are sometimes given on their own but are most frequently given together. We suggest that the combined display constitutes a case of multimodal signaling (Rowe 1999; Candolin 2003; Hebets and Papaj 2005; Partan and Marler 2005). In some cases of multimodal signaling, signals in two modalities must be given together because of linked production mechanisms; an example is the coupling together of the auditory signal of male calling and the visual display of vocal sac inflation in male anurans (Narins et al. 2003; Pauly et al. 2006). In other cases, signals in two modalities are not coupled together by production mechanisms and so can be given alone as well as together; an example here is the combination of tail flagging and alarm calling used by gray squirrels in response to predators (Partan et al. 2009). Wing waving and soft song in swamp sparrows fall in the latter category; though they are strongly associated with each other, there is no link between their production mechanisms.

Various hypotheses have been proposed to explain the evolution and maintenance of multimodal signals (Hebets and Papaj 2005; Partan and Marler 2005). Ballentine et al. (2008) suggested that wing waves and soft song fit with the “redundant signals” hypothesis, which states that multiple displays broadcast the same information, either to intensify the signal (Hasson 1989; Rowe 1999) or to counteract error-prone signals (Møller and Pomiankowski 1993; van Doorn and Weissing 2006). In either case, receivers gain more information by attending to both signals rather than to either alone. Further, it may be advantageous for an animal to use different signaling modalities for redundant signals, so that one signal can be perceived if the other cannot, as suggested by the efficacy backup hypothesis (Hebets and Papaj 2005). We speculate that soft songs and wing waves may work as redundant signals in this way. Soft songs are defined by their low amplitude, and because of this trait, they are easily masked by other sounds or noise. Thus, wing waves may serve to draw and keep the receiver’s attention so that soft songs may be heard, as well as to amplify the aggressive message. Wing waving may often be visually occluded by reeds and other vegetation in the swamp sparrow’s habitat, making an auditory backup important. This hypothesis could be tested by measuring the latency to respond to either signal in isolation compared to the signals combined or by examining the strength of response over a longer period to test whether response wanes differentially to single versus combined signals. As wing waves are also given with broadcast songs, future studies should test whether receiver response differs when wing waves are paired with soft songs versus broadcast songs.

In their review of complex signal function, Hebets and Papaj (2005) suggest that four questions are a key to understanding how a multiple signaling system functions: (1) Do the signals covary and is the relationship positive or negative? (2) Do the signals covary with some attribute of the signaler? (3) Are the signals necessary or sufficient to elicit a receiver response? (4) Does the presence of one signal influence the receiver’s response to a second? We are making progress toward answering these questions for the swamp sparrow, allowing us to narrow in on possible hypotheses for the function of complex display in this species. With regard to the first two questions, we know that wing waves and soft songs covary positively with each other and with aggressiveness (Ballentine et al. 2008), while broadcast songs do not predict aggression. With regard to the third question, we show in this study that wing waves, in the absence of soft songs, are sufficient to elicit a strong aggressive response from receivers. What remains to be tested is whether wing waves alone in the absence of any vocalization are sufficient to elicit aggression, whether soft song is sufficient to elicit aggression in swamp sparrows as it is in song sparrows (Anderson et al. 2012), and whether the combination of soft songs and wing waves is more effective than either signal alone.