Introduction

Simple, inexpensive, field-readable markers are commonly used in studies of migration, behavior, habitat use, population dynamics, and to help form management decisions (Culik et al. 1993). Examples of these markers include colored bands, leg flags, neck bands, and nasal and wing tags. Unfortunately, the problems associated with markers have often been overlooked (Calvo and Furness 1992; Culik et al. 1993). These may include skin abrasion, feather wear, changes in behavior, reduced flying or swimming efficiency, lower reproductive success, increased predation risk, and death (Calvo and Furness 1992; Culik et al. 1993; Zuberogoitia et al. 2012). In a review of the effects of various bird markers, Calvo and Furness (1992) identified a need for careful testing of the effects that marking methods have on the parameters under study, as the use of tags may influence the accuracy of data collected when using them. Here, we test the impact of wing tags as well as other common handling protocols on nest success in Magnificent Frigatebirds (Fregata magnificens). To our knowledge, this is the first experimental test of the effects of wing tags on this seabird family (Fregatidae). We initiated this experiment after finding much lower success in nests where at least one adult was handled (5 %, n = 75) relative to control nests (no adult handled, 57 %, n = 81) in the 2009/2010 breeding season. Our handling protocol used similar methods to previous studies of frigatebirds, including night capture, banding, blood sampling, taking morphological measurements, and marking individuals using alpha-numeric wing tags (e.g., Osorno 1999; Dearborn et al. 2003, 2005; Madsen et al. 2007). Our 2009/2010 data, however, did not allow us to separate effects of tagging from other interventions.

To extend our findings more generally, we also examined the overall evidence for the effects of avian wing markers using meta-analysis, the recommended approach for this type of analysis (Stewart 2010). A quantitative synthesis will clarify how wing-marking techniques affect survival, reproductive, and behavioral parameters.

Methods

Frigatebird nest success

Study site

Barbuda, West Indies (17°37′N, 61°48′W) is a small (160 km2), low-lying island in the Eastern Caribbean (Online Resource 1). On the northwest edge of the island is the semi-enclosed Codrington Lagoon, where mangroves (primarily Rhizophora mangle and Avicennia germinans) provide nesting habitat for a Magnificent Frigatebird breeding colony. Diamond’s (1973) extrapolation from a partial census of the colony was 2,500 nests in June 1973 (late in the breeding season), and Kushlan (2009) counted 1,743 active nests in March 2009 (mid-season).

Study organism

Frigatebirds are large tropical seabirds that exhibit reversed sexual size dimorphism (females are larger than males) and male ornaments, including a red inflatable gular pouch used in courtship displays; this combination is unique in birds. They rear one highly altricial chick for 12–18 months (Nelson 1975). As in most seabirds, both parents care for the young, but in Magnificent Frigatebirds the male abandons when the chick is 3–4 months old, leaving the female to feed the chick for the remaining long period of parental care (Diamond 1972, 1973; Osorno 1996). Fledging success in frigatebirds is typically low, but probably varies between 15–50 % (Nelson 1975; Diamond and Schreiber 2002). Frigatebirds do not have waterproof plumage, and have small, semipalmate feet, so avoid landing on the surface of the water. They feed by aerial dipping, or more notoriously, by harassing other birds until they disgorge their stomach contents (Nelson 1975). For their size, frigatebirds have a large wing area and wing span, and extremely low wing loading (mass load per unit area); they take advantage of these features by gliding on thermals at the base of trade wind cumulus clouds to travel great distances in search of food (Pennycuick 1983).

Experimental design

In December 2010, we randomly assigned 12 nests to each of nine treatments [n = 108 (56 males, 52 females); Table 1]. Our design tested for main effects of the handling methods used in the previous season: applying a size “7B short” aluminum Bird Banding Lab leg band, taking morphological measurements, bleeding, and attaching wing tags. All nests were uniquely marked with numbered, blue, metal markers approximately 3 cm across with a hole in the top through which snare wire was threaded. The wire was wrapped around a mangrove branch adjacent to the nest. We captured adult Magnificent Frigatebirds on nests by approaching them at night when the moon was set and dazzling them with a headlamp. Eighty-five percent of the nests in this experiment had an egg or chick in December 2010. An adult in breeding condition on a stick nest was considered a nest attempt, since roosting adults not defending nest sites were flightier and more difficult to capture. We processed birds in a small boat in the shallow water adjacent to the colony before returning them to their nests and ensuring that they settled back onto their egg or chick. We did not band the chicks, as their legs are larger in diameter than adults’ and problems associated with guano buildup and infection have been reported (Schreiber 1999). We checked nest status in March and May 2011 by walking around the colony to look for nest markers. The final nest check occurred just prior to the fledging of the oldest chicks (it is difficult to determine nest fate after fledging). Therefore, “nest success” was defined as a chick surviving five months post-treatment.

Table 1 Description of handling treatments assigned to breeding Magnificent Frigatebirds (Fregata magnificens) on Barbuda, West Indies, in December 2010

Our wing-tag design was based on those used by others studying Great (F. minor) and Magnificent Frigatebirds (Osorno 1999; Dearborn et al. 2003, 2005; Madsen et al. 2007). Made of yellow, 18-oz vinyl-covered nylon, the wing tags were 22.7 cm × 8.5 cm and weighed approximately 8.5 g each (Fig. 1). The number–number–letter combination was hand-painted using black screen ink (System 2 S2 gloss vinyl; Nazdar) and overprint clear (S227; Nazdar) to reduce fading. The tags were placed around the radius and ulna, while facing the underside of the wing, with the central notch on the proximal side (i.e., towards the wrist). The narrowest portion of the tag was passed between two secondary feathers, and each end was rolled up and slipped through the slit in the opposite end to hold the tag in place.

Fig. 1
figure 1

a Wing tags used on adult Magnificent Frigatebirds (Fregata magnificens). Left tag goes on the bird’s left wing. An actual tag is in yellow with black numbers and letters. The narrowest portion of the tag is fitted between two secondary feathers. White lines illustrate slits; the tag ends are rolled up and slipped through the slit in the opposite end to secure. Photo of male with wing tags while perched (b) and in flight (c) (color figure online)

Blood samples were taken by puncturing the brachial vein with a 25-gauge needle and drawing blood up with heparanized microcapillary tubes. We collected up to 120 μL of blood from each bird receiving a “bleed” treatment. Measured birds were weighed using a Pesola spring balance after being placed in a pillowcase, and the weight of the pillowcase was subtracted from the total mass. We measured the right wing from the wrist to the tip of the longest primary, using a butt-ended meter stick, and including the natural curve in the wing (natural chord). Culmen (bill length) was measured using calipers, and semispan (from spine to wing tip) was measured using a meter stick and digital camera.

Data analysis

To test the hypothesis that nest success in the treatment groups was not significantly different from control nests, we used a generalized linear model (GLM) with R v.2.8.1 (R Development Core Team 2008) with a binomial error structure (since nest fates were assigned a success/fail status) and a logit link function. Considering the apparent handling effect of the previous season, we considered it biologically relevant to conduct model simplification to better distil the cause(s) of lowered nest success. Therefore, treatments with similar model estimates were pooled one at a time. Each reduced model was compared to the previous model and accepted if not significantly different. Models were ranked using Akaike Information Criterion (AIC).

Our experimental design was not fully factorial, because treating birds (e.g., bleed, measure) requires a handling treatment. In addition, since our goal was specifically to compare each treatment to the control group, we used a priori linear contrasts to test the significance of these comparisons. The “control” and “band” groups had identical nest success (42 %), so we concluded that banding alone was not contributing to nest failures and we pooled these two groups under a single “controls” group against which every other group was compared. Except for this pooling of the first two rows, treatments were coded as they appear in Table 1. Coding the data as presence or absence of each manipulation treatment (i.e., combinations of four treatments) resulted in qualitatively similar results, with no significant interactions among main effects. The coding scheme presented here resulted in seven non-orthogonal contrasts. Since opinion is divided about the need for orthogonality in planned comparisons, we follow the advice of Quinn and Keough (2002), who argue that, if the number of comparisons is small and there is a firm theoretical basis for them, it is more important to examine all the hypotheses than to insist on orthogonality.

Though scientists often focus heavily on the risk of Type I statistical errors (“false positives”), in some applied studies it is important to also consider the risk of Type II statistical errors (“false negatives”), which may have substantial repercussions (Brosi and Biber 2009). Here, aware of the risk of failing to detect a treatment effect if there was one, we adopted a more cautionary approach by setting alpha to 0.10.

Wing-marker meta-analyses

In addition to our experiment, we conducted meta-analyses on the general effect of avian wing markers. Studies using both wrap-around style wing tags, like those used on frigatebirds, and patagial tags, which are placed similarly to wing tags, but require a pin or rod pierced through the patagium to hold them in place, were included.

To obtain wing-marker studies, we first checked all articles in Calvo and Furness’ (1992) review for usable studies published prior to 1992. We then searched the Web of Science database for all articles related to the subject using “wing tag*” and “patag* tag*” as key words. Literature cited in relevant papers was also examined. We included experimental and observational studies of both types of wing markers that used “tagged” and “control” (banded or unmarked) treatments. To reduce publication bias, unpublished data were also included: our own “controls” and wing tag treatments presented above, and “control” and “wing tag” nest success from a study on Great Frigatebirds, using the same wing tags we describe (Don Dearborn, 2005, unpublished data). Model results were not qualitatively different when these unpublished data were excluded. In Dearborn’s study, unpaired adults in breeding plumage were captured and wing-tagged at night in February 2005. Nests of wing-tagged birds scattered throughout the colony and those of birds nesting in three plots (controls) were monitored twice daily to determine incubation success. In spite of extensive research, we failed to discover suitable studies of some large, commonly wing-tagged groups of birds, such as eagles and vultures.

Eighteen published studies on 17 species met our selection criteria and were included in our meta-analyses (Table 2; Online Resource 2). Twelve studies had multiple responses (2–10) that we used to calculate the effect sizes (Online Resource 2). The following data were extracted from the studies: author, publication year, type of tag used (patagial or wing tag), age class of marked birds, study species, and variable(s) measured. When data were presented as means, the treatment and control sample sizes, means, and standard deviations were extracted. In one study the error type was not specified and was assumed to be standard error (Southern and Southern 1985). Standard deviations were calculated as the standard error multiplied by the square root of the sample size if not provided. When data were not presented as means, the number of “successful” and “failed” birds for the tagged and control treatments were used. Birds were coded as “successful”, for example, if they returned or were recaptured, or were successful at pairing or hatching eggs, and “failed” if they were depredated or lost chicks.

Table 2 Species considered in the meta-analyses, and classification of order and type of wing marker used

Meta-analytical procedures

We calculated effect sizes and conducted random-effects meta-analyses on the extracted data using the metafor package (Viechtbauer 2010) for R 2.14.0 (R Development Core Team 2011). Random-effects models were chosen because they incorporate both within- and between-study variability (Hedges and Vevea 1998). We chose to conduct two separate meta-analyses because roughly half of the studies presented data as means and errors, which can be used to calculate a standardized mean difference (SMD) effect size, and the others presented numbers of “successes” and “failures”, suitable to calculate a log odds ratio (LOR) effect size. Effect sizes were estimated via weighted least squares, with weight equal to the inverse variance (Viechtbauer 2010). In cases where studies provided data to calculate SMD and LOR effect sizes, we analyzed them separately (e.g., Southern and Southern 1985).

Studies included in the SMD meta-analysis measured condition, reproductive (e.g., number of chicks or fledglings, hatch date), behavioral, and in one case, migratory variables (Online Resource 2). The LOR meta-analysis dealt with survival, hatch success, nest success, and in one case whether tagged birds were successful in establishing a breeding territory. The I 2 statistic, representing the percentage of variability across studies due to heterogeneity rather than chance, is presented for each meta-analysis.

Running a meta-analysis with multiple effect sizes from a single study violates the assumption of the analysis, since data from the same individuals are treated as independent. To test the effect of this violation on the models, we ran 100 iterations of each meta-analysis, randomly selecting one effect size per study in each iteration. These results were not qualitatively different from the meta-analyses on the entire datasets, so for completeness the models and figures presented here include all data. However, the effect of moderator variables (tag type and order) on heterogeneity was explored via a mixed-effects model on one of the random iterations (therefore, using one data point per study). We present funnel plots (scatter plots of the treatment effects against a measure of study size) for each analysis and discuss the possibility of publication bias.

Results

Frigatebird nest success

Behavioral observations

We monitored frigatebirds for up to 15 days after handling, and did not observe any behavioral differences between wing-tagged and control birds. This was consistent with observations from the previous season. We have never observed wing-tagged birds pecking at or pulling their tags, even the morning after capture (cf Jackson 1982; Brua 1998). Apparently normal pair bonding behavior, nest exchanges, and copulations occurred. Adults wearing wing tags continued to incubate, brood, lay eggs, and feed young.

Nest fates

Nest success was 42 % for “controls” (birds only handled or banded), 58 % for band + measure, 33 % for band + measure + bleed, 25 % for both band + bleed and band + wing tag treatments, 17 % for band + bleed + wing tag, and 8 % for both band + measure + wing tag and band + measure + bleed + wing tag treatments (Fig. 2). In the initial GLM, the estimate of nest success for each treatment was contrasted with the pooled controls, with an overall AIC score of 131.75 (Table 3a). The two treatments where nest success was significantly lower relative to the controls were “band + measure + bleed + wing tag” and “band + measure + wing tag”. The final reduced model (Table 3b) had 3 groups: controls, non wing-tag treatments, and wing-tag treatments, and an AIC score of 126.59. Overall, wing-tag treatments were significantly different from the controls (Table 3). Controls had an overall nest success of 42 %, non wing-tag treatments 39 %, and wing-tag treatments 15 %.

Fig. 2
figure 2

Nest fates of Magnificent Frigatebirds (F. magnificens) breeding on Barbuda, West Indies, were generally lowest for wing tag treatments. Treatments were applied in December 2010 to one adult per nest, and the final nest check occurred in May 2011 prior to fledging

Table 3 (a) Full model output of the effects of treatments applied to adult Magnificent Frigatebirds early in the breeding season on nest success; we used a priori contrasts in a generalized linear model analysis, so slope estimates are relative to the controls; two treatments with p values in bold were significantly different from controls at α = 0.10. (b) Reduced model contrasting nest success of wing tag and all non-wing tag treatments; n = sample size

Wing-marker meta-analyses

From the SMD meta-analysis, which included papers with data on means of body condition, reproductive, behavioral, and migratory distance parameters, wing markers had no overall significant effect (z = 0.549, p = 0.583, mean effect size ± SE = 0.0748 ± 0.1362, I 2 = 88 %; Fig. 3).

Fig. 3
figure 3

Forest plot of SMD meta-analysis, showing the results of 34 effect sizes calculated from 11 studies, examining the effects of wing markers on condition, reproduction, behavior, and migratory distance. For each effect size, the standardized mean difference in the marked versus control groups is shown with corresponding 95 % confidence intervals, based on a random-effects (RE) model. Observed effect sizes are drawn proportional to the precision of the estimates. A diamond indicates the summary estimate based on the model, with the outer edges representing the confidence interval limits. Reference superscripts indicate number of within-study effect size calculations, and are the same as in Online Resource 2

The LOR meta-analysis model, which included “success” and “fail” data from studies measuring survival and hatch or nest success, showed that wing tags and patagial markers had a significant effect (z = −5.18, p < 0.0001, mean effect size ± SE = −1.277 ± 0.2466, I 2 = 69 %). The negative value of the effect size indicates that the probability of success for the treatment (tagged) group is 21.8 % that of the control group, suggesting a negative effect of wing markers on survival and reproduction (Fig. 4).

Fig. 4
figure 4

Forest plot of LOR meta-analysis, showing the results of 17 effect sizes calculated from 12 studies, examining the effects of wing markers on survival and reproduction. For each effect size, the log odds ratio (success versus fail) of the marked versus control groups is shown with corresponding 95 % confidence intervals, based on a random-effects model. Observed effect sizes are drawn proportional to the precision of the estimates, and the summary estimate and confidence intervals of the model are indicated by the polygon. Reference superscripts indicate number of within-study effect size calculations, and are the same as in Online Resource 2

Comparing the effects of marker types, having patagial (z = −1.960, p = 0.050, mean effect size ± SE = −1.950 ± 0.995) or wing (z = −1.993, p = 0.046, mean effect size ± SE = −1.677 ± 0.841) tags had an overall significant negative effect; tagged birds were 12.3 and 15.7 % as successful as controls, respectively. Setting order as a moderator variable, Galliformes were not negatively affected by tags (z = 2.768, p = 0.0056, mean effect size ± SE = 1.824 ± 0.659), while all other orders were.

A common concern when conducting a meta-analysis is the potential for selective publication of studies showing significance. Funnel plots—scatter plots of the treatment effects estimated from individual data points against a measure of study size—can be useful for diagnosing heterogeneity and publication bias (Sterne et al. 2005). For example, if small studies failing to detect a significant negative effect remain unpublished, then the funnel plot will appear skewed to the right, and the meta-analysis will overestimate the effect (Sterne et al. 2005). In general, there was heterogeneity across data points, for studies with small and large sample sizes (Fig. 5).

Fig. 5
figure 5

Funnel plots of the complete data set for a the SMD and b LOR meta-analyses. Model estimates are indicated by the vertical lines, with bounds around the line equal to ± 1.96 × SE

Discussion

We suggest that, though wing tags have been used on frigatebirds for over 40 years (Diamond 1975), their use be re-evaluated in light of our experimental evidence. The behavior of adult breeding Magnificent Frigatebirds shortly after wing tagging suggested to us that there was no immediate negative effect of the handling protocol. However, the wing tag treatments had significantly lower nest success relative to controls and to other treatments such as measuring and bleeding. In the original model, our “band + wing tag” treatment had 25 % nest success, which alone may not have raised any alarms had we not tested controls. The nearly three-fold difference in percent nest success of wing-tagged versus control nests raises concerns for us about this marking technique.

Frigatebirds are heavily dependent on soaring flight performance and on agility near the water’s surface; their body structure and foraging strategies may cause them to be particularly susceptible to a “wing tag effect”. However, our meta-analyses demonstrated that a negative impact of wing tags is not unique to our study. Overall, in studies measuring survival and nest success, wing- and patagial-tagged birds were less successful than controls, while this was not the case in studies measuring condition, behavior, and variables such as number of chicks and hatch date. This suggests that the effects of wing markers on nesting success and mortality are not easily predicted from obvious effects on condition, behavior, or short-term reproductive parameters.

While we did find differences in the effect of tags among orders, we acknowledge small sample sizes and high heterogeneity between studies used in our meta-analyses, and there may be other important variables such as age at marking and study species that we lack the power to test. In this case, publication may be biased by fewer studies showing significant effects of wing markers being published, rather than the reverse, perhaps in part because finding such an effect may invalidate the study. Rigorous tests, particularly of longer-term survival and reproductive parameters (and not only behavior and condition, or other short-term measures), are necessary to demonstrate that wing and patagial tags are safe to use for a given species before being widely applied. Experiments testing for a “tag effect” should be designed with sensitivity to Type II error in mind, since failure to detect a small but real effect should be of greatest concern in this instance.

We did not attempt to demonstrate a mechanism by which wing tags reduce nesting success, which is an important step in designing field-readable markers that will not significantly alter the parameters under study or have deleterious effects on survival and reproduction. One possible mechanism leading to lowered nest success is that the color or general appearance of the wing tags caused behavioral changes that affect reproduction, as in Seamans et al. (2010). However, based on our observations of paired birds interacting, tagged bird behavior appeared to mirror that of untagged birds. A second possibility is that the extra mass of two wing tags was great enough to decrease foraging efficiency and mass gain at sea, as in Cory’s Shearwaters (Calonectris diomedea) wearing 30- and 60-g weights (Passos et al. 2010). However, a pair of wing tags was 1.1 and 1.3 % of female and male mass, respectively; much less than the 3 % limit often used in seabird studies (Phillips et al. 2003). We suggest the likely mechanism causing higher nest failure is that wing tags impair the aerodynamic functioning of the wing. Wing tags likely cause the boundary layer to detach from the wing, which would reduce lift and increase drag, and lower the lift-to-drag ratio; in effect acting like an airbrake on a glider (Pennycuick 2008; C. Pennycuick, personal communication). Aerodynamically, the upper leading edge is the most sensitive part of the wing, which would be disrupted by any wrap-around style wing tag. The tags may also affect maneuverability, by adding mass (albeit small) at a point on the wing that is subject to large accelerations in flapping flight (Pennycuick 2008). The ends of the wing tags, which occasionally hang down, and the gap sometimes created between the secondary feathers, may also affect flight performance by increasing drag or reducing lift. Wing- and patagial-tag designs, by altering the flow pattern of air over the wing, may increase the energy requirements for flying. With this consideration, other marking strategies should be considered which have less direct impact on a bird’s main mode of locomotion.

In light of our evidence of negative effects caused by wing and patagial tags, we recommend against placing field-readable markers on wings, and suggest researchers opt for body-, tail-, or leg-mounted markers, with less aerodynamic consequences. While weight is often taken into consideration with designing markers, it is increasingly clear that reducing drag is also important (e.g., Bowlin et al. 2010; Saraux et al. 2011). For species where colored bands are not practical, one option could be a uniquely marked “fin” made of a thin vertical plastic sheet mounted by attachment to the base of the tail feathers. Another option could be to mount the “fin” on a horizontal base on the bird’s back, and hold it in place with loops around the thighs. Such a design would have very little aerodynamic effect (C. Pennycuick, personal communication), but would still need to be rigorously tested for other detrimental effects.

We are not the first to suggest that casual observation of wing-tagged birds’ behavior and breeding success is not adequate to determine that tags do not have a significant impact, and that detailed comparison of wing-tagged and untagged birds is necessary to assess the impact of markers on behavior, reproduction, and survival (Howe 1980; Green et al. 2004). The general mechanisms by which we predict wing tags to affect flight performance are not species-specific, and our meta-analysis on survival and reproductive parameters extends our concern regarding the use of wing and patagial tags to other species. The costs to the birds and the reliability of the data obtained may outweigh the benefits of their use.