Introduction

Aquatic ecosystems, being the final receptacle of pollutants, suffer from high levels of perturbation (Vörösmarty et al. 2010). To face this issue, international policies (Water Framework Directive and Marine Strategy Framework Directive (MSFD) in Europe) have emerged with the aim to evaluate, protect and restore freshwater and marine ecosystems. These policies rely on several strategies of ecosystem health evaluation, essentially based on chemical concentrations and on biological communities structure and composition. However, some claim for an additional intermediate approach focusing on biomarker responses at the individual level have emerged recently for the future MSFD application (Schlenk 1999; Galloway et al. 2006; Hagger et al. 2008; Lam 2009; Sanchez and Porcher 2009; Lyons et al. 2010; Artigas et al. 2012).

The multi-biomarker approach is already widely used for in situ assessment of ecotoxicological effects of contaminants and for understanding the relationships (1) between biomarkers and (2) between biomarkers and contamination levels of studied sites. However, to transfer these procedures from scientists to environmental managers, integrative tools need to be proposed. By now, some indexes exist to synthesise the responses of biomarkers in a single and simple measure (Beliaeff and Burgeot 2002; Chèvre et al. 2003; Aarab et al. 2004; Broeg et al. 2005; Dagnino et al. 2007; Yeom and Adams 2007; Izagirre and Marigómez 2009). Among them, one of the most popular is the integrated biomarker response (IBR) proposed by Beliaeff and Burgeot (2002).

The IBR is a method that provides both a graphical synthesis of the different biomarker responses and a numeric value that integrates all these responses at once. The IBR is the sum of the area defined by the k biomarkers arranged in a radar diagram (Fig. 1), following a prior step of biomarker responses standardisation. All the calculation procedure and data representation could be performed with classical spreadsheet programs. However, in their initial publication, the authors provided two calculation methods: the first one is a complicated formula (Fig. 1) that works whatever the number of biomarker is, while the second one is a simplified formula that works only when four biomarkers are used (Beliaeff and Burgeot 2002).

Fig. 1
figure 1

Calculation method of the IBR defined by Beliaeff and Burgeot (2002). Each axis of the star plot represents the standardised value S i of a biomarker (Bmk). Two successive biomarkers in the plot define a triangle with an area A i , and the IBR value is the sum of the k areas. On the figure are presented the standard (a) and simplified (b) formulas

The attractiveness for simplicity led to frequent misuse of the IBR. On the 75 publications citing the original publication (citations of the article of Beliaeff and Burgeot (2002) were collected with ISI web of knowledge—Thomson Reuters), 31 only cite the IBR with no application and 44 were applications with a multi-biomarker approach. We finally evidenced a misuse of the simplified formula in more than 50 % of them, with 23 publications with formula errors, 15 with a good use and 6 where the calculation method was not described. This misuse led to an increase of the IBR value with the number of biomarkers considered that Broeg and Lehtonen tried to correct by dividing the IBR value by the number of biomarkers studied (Broeg and Lehtonen 2006).

Moreover, the final outcome of the calculation process highly depends on the sequential organisation of the biomarkers (Beliaeff and Burgeot 2002; Kammann et al. 2005; Leinio and Lehtonen 2005; Broeg and Lehtonen 2006). Considering that this index is classically used to realise a site classification according to a pollution gradient, the identified misuse can lead to important consequences regarding ecosystem health evaluation. Indeed, a change of the sequential organisation of biomarkers in the diagram can change the IBR value, and so completely modify the score of a site that can move from a “more polluted” status to a “less polluted” one or inversely. This will be illustrated in the case study presented further.

In order to limit the effect of biomarker arrangement, two suggestions were made by Beliaeff and Burgeot (2002). They proposed to arrange biomarkers according to (1) similarities in their biological function or (2) their ability to discriminate sites with different levels of contamination. The main objection to the first proposition is that, in many biomarker batteries, authors looked for responses that represent a wide range of biological functions that are, if possible, not correlated to each other. For their second proposition, the ability to discriminate sites depend on the site contamination profiles; some biomarkers will be efficient to identify an organic contamination, others will be better for a metallic contamination, thus in multi-sites surveys, it will not be always the same biomarkers that will be the most efficient to discriminate two given sites (and that is precisely why we use batteries).

In this context, we thus aim to propose a new procedure to resolve the main problems in the IBR application that are formula misuse and arbitrary choice of biomarker arrangement.

Calculation

Formula simplification

The IBR calculation is based on four major steps described in Beliaeff and Burgeot (2002). The first three one are not modified, and we only simplified the formula on the fourth step.

  1. 1.

    The mean value for a site (X) was standardised using the mean value for all sites (m) and the standard deviation for all sites (s) to produce a value we call Y:Y = (X − m)/s

  2. 2.

    For each biomarker, we compute the value Z = Y or Z = −Y according to the expected biological effect activation or inhibition, respectively.

  3. 3.

    The S value was computed with S = Z + |Min|, where Min is the minimal value observed for all sites for each biomarker.

  4. 4.

    Finally, all the S i values were plotted on a radar diagram. The IBR is calculated as the total area displayed by the radar diagram. Here, we go back to trigonometry basics to propose a new formula for the IBR that is far simpler than the original one.

The area of the triangle defined by two successive biomarkers in a k biomarker study where at least four biomarkers are considered is defined in (Fig. 2):

Fig. 2
figure 2

New calculation method for the triangle area; h height of the triangle formed by two successive biomarkers, α the angle formed by the two corresponding axes of the star plot

$$ {A}_i={S}_i\times {S}_{i+1}\times \sin \left(2\pi /k\right)/2. $$

And the IBR value is calculated as follow:

$$ \mathrm{IBR}={\displaystyle \sum_{i=1}^k\mathrm{Ai}} $$

This new formula can be applied only when four or more biomarkers are measured. However, this is not a limitation compared to the previous formula, since no study using the IBR was published with less than four biological responses measured.

Calculation procedure

The second weakness of the IBR is the biomarker sequence in the radar diagram, which is user-defined and does not always rely on conceptual basis. The risk is to produce by chance a particular structure of the diagram. Thus, we wrote a procedureFootnote 1 that creates all the possible circular permutations of k biomarkers. It results on a (k − 1)! matrix of IBR values that allows to calculate the median IBR for a site and to prioritise IBR values among sites in a more confident way. It results on a matrix of 6 values for four biomarkers, 24 values for five biomarkers, 120 values for six biomarkers and so on.

Those values can be used to perform statistical analysis and to look for between-site differences. As it will be illustrated by the case study presented thereafter, we recommend non-parametric tests because the permutation procedure does not always lead to a normal distribution of the IBR values within a site.

Case study

A case study using this method is presented, based on the results of a survey of eight sites (Fig. 3) presenting contrasted levels of sediment contamination (Fig. 4, Table 1). A set of eight biomarkers measured in the bivalve Dreissena polymorpha, not correlated to each other, is used. The nature and the response of these biomarkers is not the topic of this study, thus we will develop neither their interest nor their measurement method. Natural populations of zebra mussels were sampled in spring, in a short time frame to avoid a bias link to strong variations of their physiological status. We focused on antipollution defences which are early warning systems involved in (1) protection of organisms against the entry of contaminants, (2) their sequestration, (3) their inactivation (metabolism) and (4) their elimination. The following endpoints were studied: multi-xenobiotic defence MXR (transport assay), pi glutathione-S-transferases in the gills and in the digestive gland (gene expression), lysosomal defence (histochemical determination), anti-oxidant defence (selenium-dependent glutathione peroxidase gene expression) and metallothioneins (polarographic determination in digestive gland and gene expression in gills). Malondialdehyde was also assessed in order to provide information about toxic effects in collected organisms.

Fig. 3
figure 3

Location of the eight studied populations (France). Sites A and B are located on shipping channels connected to the Meuse River; C, D and E on the Moselle River; F and G on the Seine River and H on the Vilaine River

Fig. 4
figure 4

Sediment contamination of the eight studied sites. Contamination is calculated as the mean rank of a site among all sites for all the pollutants of a family (i.e. PAHs, PCBs and metals)

Table 1 Contaminant concentrations in the sediments for the eight studied sites

The permutation performed for this set of biomarkers resulted in a matrix of 5,040 IBR values. First, this case study enabled to point out that the distribution of the compiled values is not always normal (Fig. 5). We thus recommend to describe the information provided by the permutation procedure by indicators that are meaningful for non-normal data distribution, i.e. by using the median and the quartile rather than the mean and the standard deviation. Similarly, between-site comparisons should rely on non-parametric method. The hypothesis tested by non-parametric tests like the Kruskal–Wallis one is not a simple comparison of the main tendency of several samples, but the overall comparison of value distribution. It means that the outcome of a non-parametric test performed on the computed IBR values provides information about the similarity of IBR distribution between sites. Thus, when IBR distributions are not similar, it can be concluded that sites present different patterns of stress.

Fig. 5
figure 5

Distribution of the computed IBR values for two sites, with an example of a non-normal (site A) and a normal (site G) distribution

Our results also highlight the importance of the arrangement in the representation of ecotoxicological effects in each site (Fig. 6). When the 5,040 values are compiled, it evidenced the variability of the IBR values, which is our main concern. Indeed, the direct consequence of this variability is a different prioritisation of contaminant effects depending on the biomarker sequence (Table 2) that could lead to misunderstanding of contamination consequences on biota if only one arrangement is considered. Table 2 shows that both the value and the rank of the IBR index for a site exhibit high variations (e.g. the site labelled E, that could be either the less or the most contaminated one depending on the permutation chosen), and that a more significant IBR value is obtained through the median calculated across all the possible arrangement.

Fig. 6
figure 6

a Star plot for two permutations among the 5,040 possibilities. The final area defined by the eight triangles is modified according to the biomarker arrangement (b). c Site classification resulting from those two permutations—sites are arranged from the lowest to the highest IBR values

Table 2 Variation of the IBR values and ranks across the 5,040 possible arrangements of the eight biomarkers considered

If we try to compare the old version of the IBR calculation and the new one proposed here, it is necessary to remind that the value calculated by the original procedure is just one of the 5,040 values computed that could be either far or near of the “true” value estimated by the median of all the possible values. For example, Fig. 6c presents two possible site classifications according to two particular arrangements of the biomarkers on the radar diagram. The old version of the IBR calculation would have given us only the results of either the 273 or 4928 classifications without considering these two possibilities.

Finally, our case study evidenced that the IBR is a pertinent index to evaluate site contamination, with a correlation between PAHs contamination levels in sediment and the median IBR value (r pearson = 0.721, p = 0.04). No correlation with PCBs and metal levels were found. However, the contamination levels considered here only reflect site quality, but no assessment of contaminant biodisponibility, nor accumulation in mussel tissues were performed to better describe exposure of each population. Our approach is global and needs to be refined through the application of other batteries of biomarkers and also by including model organisms at other trophic levels.

Conclusions

The results of the case study evidenced the influence of the biomarker order on the diagram and the need to adopt neutral procedure that is not user-defined. Indeed, the consequence of this biomarker sequence is a different prioritisation of contaminant effects that could lead to misunderstanding of contamination consequences on biota. In their study, Raftopoulou and Dimitriadis (2010) criticized the IBR that was not able to classify sites correctly on a pollution gradient. However, as we illustrated in the previous case study, it could have been linked to the specific biomarker arrangement on the radar diagram. The proposed procedure thus needs to be confronted to numerous other case studies, involving other stressors and other species to confirm its ability to reflect pollution gradient and population health in variable contamination contexts.

Finally, the new calculation procedure avoids the order bias, and makes this tool statistically more powerful and biologically more suitable. Considering the need of multi-biomarker approaches to understand (1) the complexity and the variability of biological responses and (2) the relationships between population health and site contamination, our study provides an efficient and optimised tool to integrate these data and avoid subjectivity in the final outcome of the method.

Perspectives

Some points still need to be improved to use this index for biomonitoring: (1) the IBR is a useful value for site comparison, but not an absolute index of biological stress levels because biomarker responses are standardised based on the studied sites and (2) the user should define a priori whether the biomarker response to contamination is an induction or an inhibition. To eliminate this drawback, we should be able to define reference values for each biomarker used in the IBR to express the value as a percentage of variation from this reference value. Thereby, an absolute scale of IBR variation could be established and each IBR calculation would be independent of the set of site considered. Thus, studies focusing on the natural variations of biomarkers and aiming to understand, besides contaminants, the environmental variables and physiological status that influence biomarker values have to be developed (Munkittrick et al. 2009; Xuereb et al. 2009; Coulaud et al. 2011).