Introduction

Insects are by far the most species-rich group of animals, representing over 50% of the world’s biodiversity (May 1988; Gaston 1991; Groombridge 1992). Contrary to most other groups of insects, butterflies are well-documented, easy to recognize and popular with the general public (De Heer et al. 2005; Thomas 2005). Many European butterflies have decreased considerably in recent years (Van Swaay et al. 2006). As a result, nowadays 71 out of the 576 European butterfly species are considered as threatened in Europe (Van Swaay and Warren 1999). The decline of butterfly species has largely been assessed by examining the change in their area of distribution (Van Swaay 1990, Maes and Van Swaay 1997; Telfer et al 2002). But this approach has several shortcomings. First, it underestimates the rate of population decline because generally species decrease in population numbers first before they disappear locally and regionally (Thomas and Abery 1995). Secondly, most available distributional data suffer from differences in sampling effort over time, which makes it difficult to separate changes in distribution from changes in sampling effort (Dennis et al. 1999). Reliable estimates of trends can only be based on long series of distributional data, because only then correction for sampling effort is possible (Van Swaay 1990, Maes and Van Swaay 1997; Telfer et al 2002), but even then the results should be treated with caution. In order to get early warning signals, it is better to assess trends in population numbers based on monitoring schemes with standardized sampling efforts.

These were the reasons for setting up a national butterfly monitoring scheme in the UK in 1976 (Pollard 1977). This has inspired many others and the number of schemes has gradually increased in Europe (Table 1; Fig. 1; see Kühn et al. 2005, and contributions therein). New schemes are being planned, e.g. in Denmark and Sweden. The number of transects differs much between the current schemes, ranging from just a few transects per country to several hundreds in the UK and the Netherlands. In 2004 Butterfly Conservation Europe (www.bc-europe.eu) was founded and had an important role in bringing together and co-ordinating work on butterfly monitoring in Europe.

Table 1 Active butterfly monitoring schemes in Europe
Fig. 1
figure 1

Location of Butterfly Monitoring Schemes in Europe (light shading—active schemes, dark shading—planned schemes)

In this paper we describe the main methods used in the current schemes and give a few examples of applications of the data. We discuss the use of butterflies as biodiversity indicators and the perspectives of European butterfly monitoring.

Butterfly monitoring methodology

Field methods

All schemes apply the method developed for the British Butterfly Monitoring Scheme (Pollard and Yates 1993). The counts are conducted along fixed transects of about 1 km consisting of smaller sections, each with a homogeneous habitat type. The fieldworkers record all butterflies 2.5 m to their right, 2.5 m to their left, 5 m ahead of them and 5 m above them (Van Swaay et al. 2002). Butterfly counts are conducted between March–April and September–October. Visits are only conducted when weather conditions meet specified criteria. In the Dutch (and German) scheme this means temperature above 17°C, or 13–17°C in sunny weather, wind less than six on the scale of Beaufort and no rain (Van Swaay et al. 2002). Most of the transects are recorded by skilled volunteers, but their results are usually checked by butterfly experts.

The number of visits varies from every week in the UK and the Netherlands to three to five visits annually in France. In the Netherlands, transects dedicated to rare species can be visited only during the expected flight period of the species. In normal transects, weekly counts cover the entire flight period of species and thereby offer the opportunity for assessing temporal population trends per transect, but the precision of the trend estimates may be limited (Harker and Shreeve 2008). Weekly visits may however also be demanding for observers. If the objective is only to produce large scale (e.g. national) trends, the efforts may be reduced to much fewer visits (Heliölä and Kuussaari 2005; Roy et al. 2007). Such a reduced-effort scheme is planned in the UK for the wider countryside where mainly common butterflies occur and few volunteers can be recruited. This proposed reduced-effort scheme is based on only a few annual visits, targeted to the period when most information can be gathered, i.e. three visits in July–August plus in some cases an additional one in May (Roy et al. 2005, 2007). Yet a problem with the reduced effort schemes can be that the inevitable ‘why’ question can be hard to answer: it will often not be possible to compare different regions, habitats or management regimes to find the underlying drivers for population changes. Furthermore much more transects will be needed in a reduced effort scheme than in a traditional scheme. The main characteristics of the ‘Traditional’ and ‘Reduced effort’ schemes are summarized in Table 2.

Table 2 The main characteristics of the ‘Traditional’ and ‘Reduced effort’ Butterfly Monitoring Schemes (based on Roy et al. 2005, 2007; Heliölä and Kuusasaari 2005; Van Swaay 2007)

Observers never detect all butterfly individuals present during their visit in the study area (Dennis et al. 2006; Kéry and Plattner 2007). Therefore, transect counts do not provide information on absolute butterfly numbers but rather yield species-specific relative abundance indices that are assumed to reflect year-to-year population changes over the entire study area. The assumption of constant detection probability has been underpinned by the demonstration of close correlations between transect counts and population estimates based on mark-recapture data (Pollard 1977; Thomas 1983). However, if for some reasons the detection probability for a given species varies over time then trends inferred from transect count results uncorrected for this probability may be biased (Kéry and Plattner 2007).

The likely sources of between-year variation in detection probability are e.g. weather, time of day, observer experience, and vegetation height changing due to succession or more generally any habitat changes (Pollard et al. 1986; Harker and Shreeve 2008; Pellet 2008). Variation due to weather and time of day can be reduced by standardisation of the conditions in which transect counts are conducted (Pollard 1977; Pollard et al. 1986). In addition, in the case of large-scale and long-term monitoring such variation may be assumed to be random only, thereby decreasing the precision of the results, without inducing any bias. Still, any systematic changes in observer experience, vegetation height or even the behaviour of species cannot be ruled out completely. We therefore suggest to test any long-term changes in detection probabilities using methods to analyse multiple visits as applied for butterflies by Kery and Plattner (2007) and Pellet (2008) or distance-sampling methods (Pollock et al. 2002). Distance-sampling has already been applied in butterfly population studies in Northern America (Brown and Boyce 1998), and there are currently attempts to incorporate it in the UK Butterfly Monitoring Scheme (K. Cruickshanks, personal communication).

A related problem is that of the variable longevity in adult butterflies and its effect on transect count reliability. Since adult butterflies typically enclose in daily cohorts, their numbers recorded on transects depend not only on seasonal population sizes, but also on longevities, and consequently transect count results do not necessarily follow year-to-year population changes precisely (Zonneveld 1991; Nowicki et al. 2005, 2008). Nevertheless, the effect of between-season variation in butterfly longevity is likely to become random with extensive data sets.

Transect selection

To be able to draw proper inferences on the temporal population trends at national or regional level, transects should best be selected in a random or stratified random manner (Sutherland 2006). Several recent schemes, e.g. in Switzerland and France, have been designed in this manner (Henry et al. 2005). Unfortunately, such a procedure would yield many data for common butterflies, but few data for rare butterflies, unless an unrealistic high number of transect is selected. If a scheme aims to monitor rare species, scheme coordinators preferably locate transects in areas where rare species occur, leading to an overrepresentation of special protected areas. In the older schemes, such as in the UK and the Netherlands, but also in the recently established scheme in Germany, transects were selected by free choice of observers, which in some cases has led to the overrepresentation of protected sites in natural areas and the undersampling of the wider countryside and urban areas (Pollard and Yates 1993); while in Germany this effect was not that pronounced (Kühn et al. 2008). Obviously, in such a case the trends detected may be only representative for the areas sampled, while their extrapolation to national trends may produce biased results. Such bias can however be minimized by post-stratification of transects. This implies an a posteriori division of transects by e.g. habitat type, protection status and region, where counts per transect are weighted according to their stratum (Van Swaay et al. 2002, see also Henry et al. 2008, this volume for the principles of weighting).

Calculating indices and population trends

The traditional way of testing temporal population trends in yearly count data is to apply ordinary linear regression. But linear regression assumes the data to be normally distributed, which does not hold for most count data especially if the data contain many zero values. Also log transformation does not work properly in such cases. Generalized Linear Models (GLM; McCullagh and Nelder 1989) offer an alternative to analyse count data. In GLM models, the normality assumption is replaced by the assumption of a distribution of the user’s choice. For count data this distribution is often the Poisson distribution. To apply these models transformation of raw data is no longer required. Poisson (or loglinear) regression is implemented in the widely used program TRIM (TRends and Indices for Monitoring data—Pannekoek and van Strien 2005). Regarding butterflies, this program is used in the UK and the Netherlands and new schemes plan to use it as well (Kühn et al. 2008). Based on a model with year effects and site effects, TRIM produces yearly indices as well as overall trend estimates and is particularly useful if the data contain missing counts due to the coming and going of the voluntary observers in a scheme. TRIM has also options to incorporate serial correlation between counts in consecutive years, testing of covariates and testing of changepoints. An important feature of TRIM is the possibility to incorporate weight factors per transect in order to adjust for oversampling and undersampling of particular habitat types, regions or other characteristics of transects. These weights may be based on e.g. the surface area of heathland in different regions for heath butterflies, or the population shares of species per region (Van Swaay et al. 2002). One might also consider to apply detection probabilities as weights in TRIM, if these probabilities appear to change over time.

A weakness of TRIM is that the model does not include week effects. The counts per week need to be combined first into a yearly sum and only this sum enters the TRIM model. Rothery and Roy (2001) explored the possibilities to apply Generalized Additive Models (GAM) to butterfly monitoring data. A GAM is an extension of GLM methods and allows the smoothing of yearly indices.

Applications

National and regional trends

The main objective of most butterfly monitoring schemes is the production of regional and/or national population trends. These trends are being produced on a routine basis every year in e.g. the UK and the Netherlands, and are meant to evaluate at a large scale the need for or the progress made in butterfly conservation.

Relationships with environmental factors

The transect counts can be used to study the relationships with environmental factors, such as climate change, nutrient load, heavy metals, drainage, land use, fragmentation and management practice. Pollard and Yates (1993) describe detailed studies based on monitoring data. Here we mention only a few examples:

  • Climate change: Several schemes were used to examine the changes in phenology (Roy and Sparks 2000; Stefanescu et al. 2003; Kühn et al. 2008; Van Strien et al. 2008, see Fig. 2). WallisDeVries and Van Swaay (2006) used transect data to study the effects of the combination of nitrogen deposition and climate change on the abundance of butterflies.

    Fig. 2
    figure 2

    Mean Julian date of the first 10% of all observed individuals of 19 spring butterfly species in 1992–2004 (January 1 = day 1 etc.). For each species the date was assessed per year of the first 10% of all observed individuals in the entire flight period on all transects together. For details see Van Strien et al. (2008). Trends and confidence intervals were assessed by structural time-series analysis and the Kalman Filter using the program Trendspotter (Soldaat et al. 2007)

  • Nutrient load and heavy metals: Oostermeijer and Van Swaay (1998) examined relationships between butterfly absence/presence data obtained form monitoring transect and Ellenberg indicator values for nutrients, acidity and moisture (Fig. 3). Mulder et al. (2005) examined the effects of heavy metals on butterflies on a particular transect.

    Fig. 3
    figure 3

    Relationships between the probability of occurrence obtained from monitoring transect data and Ellenberg indicator values for nutrients (from Oostermeijer and Van Swaay 1998)

  • Management practice. Brereton and Warren (2005) found the trend of Lysandra coridon on calcareous grasslands with butterfly friendly management to be more positive than on other grasslands.

  • Multiple environmental factors. Other perspectives for the application of monitoring data are by testing predictions or expectations from envelope approaches, which form the basis of many biodiversity impact and risk assessments (as e.g. in the ALARM project; Settele et al. 2005). This may in particular be relevant to large scale predictions/expectation of changes and trends derived from the combined effects of a multitude of pressures (compare Schweiger et al., in press) and to extrapolations of historically reconstructed trends (Settele et al. 1992).

Butterflies as indicators

Government representatives at the 2002 World Summit of Sustainable Development pledged ‘a significant reduction in the current rate of biodiversity loss by 2010’. The commitment of the EU to protecting biodiversity is even stronger by aiming at halting biodiversity loss by 2010 (Balmford et al. 2005; Gregory et al. 2005). Butterflies may be useful as biodiversity indicators for reporting on the development towards the EU 2010 target. Contrary to most other groups of insects, butterflies have considerable resonance with both the general public and decision-makers (Kühn et al. 2008). Butterflies are also relatively easy to recognize and data on butterflies has been collected for a long time and by many voluntary observers. The method is well described, extensively tested and scientifically sound (Pollard 1977; Pollard and Yates 1993). As a result butterflies are the only invertebrate taxon for which it is currently possible to estimate rates of decline in many parts of the world (de Heer et al. 2005; Thomas 2005). However, butterflies can only be regarded as good biodiversity indicators if it is possible to generalise their trends to a broader set of species groups (Gregory et al. 2005). Admittedly, there is currently a heated debate on how well butterflies meet this criterion. Hambler and Speight (1996, 2004) claimed that this group is likely to experience greater declines than other organisms due to their herbivorous life strategies and thermophily, but Thomas and Clarke (2004) convincingly rejected both arguments. Based on a comprehensive review of studies into their life-history traits, relative sensitivity to climate change, and adjusted extinction rates, Thomas (2005) concluded that butterflies may be considered representative indicators of trends observed in most other terrestrial insects, which together form a major fraction of biodiversity.

Trends per butterfly species can be combined into a unified measure of biodiversity. We followed Gregory et al. (2005) in averaging indices of species rather than abundances in order to give each species an equal weight in the resulting indicators. When positive and negative changes of indices are in balance, we would expect their mean to remain stable. If more species decline than increase, the mean should go down and vice versa. Thus, the index mean is considered a measure of biodiversity change. We used geometric means rather than arithmetic means, because we consider an index change from 100 to 200 equivalent, but opposite, to a decrease from 100 to 50. Buckland et al. (2005) discussed a number of possible composite indicators and found the geometric mean of indices a useful approach.

The results of national butterfly monitoring schemes may be combined to create an indicator at a supra-national level (see also Henry et al. 2008, this volume). Based on the procedure described for European birds (see Gregory et al. 2005), a preliminary grassland butterfly indicator has been developed (Van Swaay and Van Strien 2005). The procedure was as follows:

  1. 1.

    National level. The indices for each species were produced for each individual country with a butterfly monitoring scheme, using TRIM (Pannekoek and Van Strien 2005). Figure 4 shows the national indices as an example for three countries for the grassland species Lasiommata megera.

    Fig. 4
    figure 4

    National abundance indices (±standard error) for Lasiommata megera in three European countries. In the first year the index can be calculated it is set to 100 (1992 for The Netherlands, 1994 for Catalunya, 1976 for the United Kingdom)

  2. 2.

    Supranational level. To generate supra-national trends, the difference in national population size of each species in each country was taken into account. This weighting allows for the fact that different countries hold different proportions of a species’ European population (Gregory et al. 2005). Here, we applied as weights the proportions of each country (or part of the country) in the European distribution of a species (based on Van Swaay and Warren 1999). The missing year totals are estimated by TRIM in a way equivalent to the treatment of missing counts for particular transects within countries (Gregory et al. 2005). Figure 5 gives the weighted and combined trend for Lasiommata megera. The same procedure may be used to establish European trends for the Habitats Directive species e.g. Euphydrias aurinia, Maculinea arion and M. nausithous (which are included in the grassland indicator).

    Fig. 5
    figure 5

    Collated index (±standard error) for Lasiommata megera in the European countries with Butterfly Monitoring Schemes

  3. 3.

    Multispecies level. For each year the geometric mean of the supranational indices is calculated. The preliminary grassland indicator was based on seven widespread grassland species (Ochlodes venata, Anthocharis cardamines, Lycaena phlaeas, Polyommatus icarus, Lasiommata megera, Coenonympha pamphilus, Maniola jurtina) and ten grassland-specialists (Erynnis tages, Thymelicus acteon, Spialia sertorius, Cupido minimus, Maculinea arion, Maculinea nausithous, Polyommatus bellargus, Polyommatus semiargus, Polyommatus coridon, Euphydryas aurinia).

The countries covered were mainly from Western Europe (Table 1). The average grassland butterfly abundance appeared to decline by almost 50% (Fig. 6), which is most probably linked with the agricultural intensification in Western Europe (Van Swaay and Warren 1999; Gregory et al. 2005). The decline is much stronger than the decline of the farmland bird indicator, which has fallen by 19% in the same period (Gregory et al. 2008). This corresponds with the findings in the UK where butterflies have experienced greater losses than birds (Thomas et al. 2004).

Fig. 6
figure 6

European grassland butterfly indicator. Trends and confidence intervals were assessed by structural time-series analysis and the Kalman Filter using the program Trendspotter (Soldaat et al. 2007)

Perspectives

The number of countries with butterfly monitoring schemes is increasing. In addition, the quality of schemes is improving, because any lack of representativeness of transects is taken into account, either by choosing an adequate design or by adjusting any bias during the stage of analysis. As the number and quality of butterfly monitoring schemes grows, the coverage of Europe by supranational species trends and multispecies indicators improves. The European Environmental Agency has already recommended to develop European butterfly indicators (European Environment Agency 2007), and these developments may lead to indicators that are comparable to the farmland bird indicator, which has been adopted by the EU as biodiversity indicator (Gregory et al. 2005). Where possible and feasible, one might even think of combining butterflies and birds in indicators to report against EU’s 2010 target, in order to generalize changes well beyond the set of species.

The grassland butterfly indicator offers the possibility to detect large scale effects of either abandonment of agricultural land (especially occurring in Eastern and Southern Europe) or intensification of agricultural practices (a process already stopped in parts of Western Europe, but ongoing in many European regions).

Apart from a grassland butterfly indicator, we see good perspectives to create a climate change indicator, summarising changes in occurrence of species driven by climate change, as well as a woodland indicator . The same indicators are also in progress for European birds (Gregory et al. 2007). A woodland indicator may however not have such a simple message as the preliminary grassland indicator. That is because woodland butterflies are made up of two different species groups. The first group of woodland butterflies are characteristic for woodland edges and open spots, e.g. Euphydryas maturna and Coenonympha hero. The second group are canopy species, who profit from high forest, e.g. Apatura iris. Though both these groups are genuine woodland butterflies, their expected trends differ entirely. Species from the first group probably suffer in large parts of Europe, because traditional coppicing has been replaced by management for high forest. In Western Europe, where this process has been going on for a few decades, these species are virtually extinct, but in Eastern Europe strong populations still exist (Van Swaay and Warren 1999, 2003). The few species of the second group, which tolerate dense forests (e.g. Pararge aegeria; Shreeve 1984) or the handful of European canopy dwellers (e.g. Neozephyrus quercus, Apatura spp., or Limenitis populi) are rather the exception. Thus, a woodland indicator might have to consider a differentiation of these two groups. As a rule, the majority of European woodland butterflies utilises sunny habitats within woodlands, such as sparse stands, bogs, streamsides, clearings, rides, or edges (Settele et al. 2009).

Over thirty years butterfly monitoring has developed from one test site in Monks Wood in the United Kingdom to more than 2,000 transects scattered over Europe. Almost every year new countries join in to start up a monitoring network. Further extension of butterfly monitoring schemes to other countries in Europe should be encouraged and supported by the European Union and its Member States. The further development and use of butterflies as a European biodiversity indicator will further stimulate new butterfly monitoring schemes,