Introduction

Acute aquatic toxicity tests with algae, invertebrates and fish form an integral part of hazard identification for the process of environmental risk assessment (Crane et al. 2008). In the interests of the principle of mutual acceptance of data (MAD principle) by member states, the Organisation for Economic Co-operation and Development (OECD) have been instrumental in establishing the acceptance of these standardised regulatory toxicity tests. The basis of the MAD principle is that conducting a test in accordance with OECD Test Guidelines and OECD Principles of Good Laboratory Practice (GLP) should provide confidence in the data from a single study in order to avoid repetition of the experiments in each country that may use the data (Handy et al. 2012a). Other test protocols are also in use internationally with slight variations from the OECD tests, such as the fish acute toxicity test, freshwater and marine (OPPTS 850.1075, 1996), and the fish early-life stage toxicity test (EPA OPPTS 850.1400, 1996) in the USA. However, all these tests were originally designed for chemicals that either were soluble in water or could at least be put into the aqueous phase. Recently, attention has focussed on the hazard assessment of engineered nanomaterials (ENMs) which form dispersions rather than being soluble in water, and there are concerns that the current regulatory toxicity tests are not fit for purpose or may need modifying to work for these new materials (Handy and Shaw 2007; Handy et al. 2012a; Petersen et al. 2015).

The exact definition of an engineered nanomaterial is still being debated, but for regulatory purposes, they can be described as a material with one or more external dimensions in the size range of 1–100 nm and with ≥50 % of the particles in the number size distribution in the nanoscale range (European Commission 2011). ENMs often have very different physico-chemical properties compared to their bulk or metal salt counterparts, including size, shape, specific surface area, surface charge and chemical reactivity (Handy et al. 2008; Klaine et al. 2008, 2012), and often form dispersions, not aqueous solutions in water. Depending on the water and particle chemistry, the ENM for testing may settle from the water by processes such as aggregation or agglomeration. Consequently, the handling of ENMs in regulatory toxicity tests requires careful consideration with respect to maintaining the exposure, as well as interpreting the data for hazard assessment (Crane et al. 2008; Handy et al. 2008).

Acute fish toxicity tests are required as part of the base set or first tier of regulatory hazard assessment. There are currently no specific requirements for ENMs under the European Union’s Registration, Evaluation, Authorisation and Restriction of Chemicals (REACH) programme, but ENMs are considered as new substances and may require testing (see Crane et al. 2008). However, there are reports of problems when conducting unmodified aquatic tests with ENMs, including shading in algal tests (e.g., Schwab et al. 2011), difficulty in the maintenance of exposure concentrations in fish tests (e.g., Bilberg et al. 2010; Christen et al. 2013) and concerns about the suitability of test endpoints for ENMs (e.g., mortality, growth rate, reproduction, Crane et al. 2008; Handy et al. 2012a, b). Nonetheless, several researchers have reported both the lethal and sublethal effects of ENMs to fish and other aquatic life during routine ecotoxicological experiments, albeit with some of the difficulties outlined above (e.g., Ag ENMs in medaka, Chae et al. 2009; Cu ENMs in zebrafish, Griffitt et al. 2007; Ag, Cu, Al, Ni, Co and TiO2 ENMs to zebrafish, daphnids, and algae, Griffitt et al. 2008; Cu ENMs to rainbow trout, Shaw et al. 2012; Au ENMs in zebrafish, Truong et al. 2012).

In 1992, the OECD 210 fish, early-life stage toxicity test (OECD 1992a) was adopted following a proposal in 1988 (OECD 2013) and was one of the first regulatory toxicity tests to use fish embryos for a variety of chemicals. The test includes different life stages from recently fertilised embryos up to 30 days post-hatching for zebrafish and as long as 60 days post-hatching for rainbow trout (OECD 2013). Although the early-life stage (ELS) test is being used for evaluating the hazard of metals and organic chemicals to fishes, it remains to be seen if the test is sensitive for ENMs, which life stage is most appropriate for testing, and whether or not the exposures can be conducted reliably with dispersed materials.

The primary aim of the current work was to initially test selected ENMs with different chemistries using the current OECD 210 fish, early-life stage toxicity test for its suitability for use with these materials, and then, to critically evaluate the test results and procedures at the bench, suggesting improvements or modifications to the test protocol where required for ENMs. Although this work focused on one particular test, large parts of the results are pertinent to other fish tests (e.g., OECD 1992b, 1998). A case study approach was adopted whereby three ENMs with different toxicities and chemistries were tested. The ENMs were obtained from the European Commission Joint Research Centre (JRC, Ispra, Italy). Titanium dioxide ENMs were chosen because this material has shown low acute toxicity in previous studies with fish (e.g., Federici et al. 2007; Zhu et al. 2008; Clemente et al. 2013; Ramsden et al. 2013), and limited free metal ion dissolution (Rasmussen et al. 2014a), whilst silver ENMs were chosen as a material where acute toxicity to fish is a concern (e.g., Asharani et al. 2008; Bilberg et al. 2012; Kwok et al. 2012; Ribeiro et al. 2014), and depending on the water chemistry, Ag ENMs may show some dissolution of the toxic Ag+ ion (Mitrano et al. 2014). Finally, multi-walled carbon nanotubes (MWCNT) were chosen in order to assess differences due to shape and structure. The specific objectives were to evaluate the problems and limitations of the ELS test using zebrafish (Danio rerio), to compare the utility of lethal and sublethal endpoints proposed in the original test protocol for ENMs, and to identify any ambiguity in the original test protocol that may introduce variability into the test method or results for ENMs. Given the concerns about the sensitivity of different life stages of fishes to chemicals, the case studies also evaluated whether it was appropriate to continue using both embryo and larval stages consecutively in the OECD test, or whether the test could be shortened to one sensitive life stage. Modifications of the test method and apparatus were also explored to improve the maintenance of the exposure concentrations in the test when using ENMs. Finally, having critically evaluated the results and test protocols, firm recommendations are made on how the OCED test should be modified for regulatory use with ENMs.

Is the current, unmodified, OECD 210 test fit for purpose for use with ENMs?

The current work was carried out using the original OECD TG (1992a), but these guidelines were updated shortly after the experimental work was finished (OECD 2013). However, the fundamental principles and experimental design are almost identical between the two versions of the test, and so, the work remains valid. All of the points below relate to the current, 2013, version of the TG. The first experimental step was to assess the OECD 210 fish, early-life stage toxicity test for use with TiO2 ENMs, Ag ENMs and MWCNT at the bench, i.e., without modifications for ENMs, and to follow the written protocol as closely as possible. However, even before commencing the laboratory work, it became apparent that the text of the protocol had some ambiguity in places which allowed small deviations that might be important with ENMs in mind. Regardless of ENMs, there were also some concerns from the perspective of fish biology about the purpose and utility of the test description. Many of these ambiguities of the text were carried over into the 2013 updated TG, and some of the concerns about the wording of the current protocol are outlined below.

According to OECD 210 TG (OECD 2013), the aim of the test is to: … define the lethal and sub-lethal effects of chemicals on the stages and species tested. They yield information of value for the estimation of the chronic lethal and sub-lethal effects of the chemical on other fish species. The notion of chronic sublethal effects is immediately problematic for ENMs, the test, whether considered as a time proportion of the life expectancy of a zebrafish or in terms of absolute time duration is an acute (short term) test, and cannot yield information on chronic effects. Unlike existing chemicals, there are also insufficient data on ENMs to extrapolate acute toxicity estimates to chronic effects. In addition, apart from mortality or survival, the only other biological endpoints in the test are hatching success, abnormal appearance (morphological abnormalities), abnormal behaviour and body weight and length. For the abnormal appearance, the TG gives no specific guidance on how to make such morphometric measurements or what exactly to measure.

In terms of dosing the beakers or vessels with the test chemical, the TG states that: The test system should preferably be conditioned with concentrations of the test chemical for a sufficient duration to demonstrate stable exposure concentrations prior to the introduction of test organisms. However, it is currently unknown whether this is a valid approach with ENMs. The length of time before the fish are added could be critical with some ENMs known to quickly precipitate out of the water column. Also, the ecological relevance of the dosing method should be considered. For example, in the wild, a polluted effluent would be added to the fish already in the water, not vice versa. This is most pertinent with embryos where the time to dose (with or without the animals present) would either result in the embryos being placed on top of precipitated ENMs or beneath them, with potential consequences to subsequent toxicity.

The OECD 210 TG states that the … concentrations of the test chemical higher than the 96 h LC 50 or 10 mg l −1 , whichever is the lower, need not be tested… However, from previous experiments within our laboratory, it was clear that this concentration was too low to elicit a toxicological response for some ENMs. Therefore, the upper concentration for all subsequent TiO2 ENM experiments was increased (see below). The TG suggestion of using LC50 values for guidance on where to set the highest concentration in the test was also less useful. Of course, range finding studies can be attempted, but for many ENMs, the practical problems of dispersing high mg or g concentrations of ENMs in water has so far prevented the precise or reliable determination of exact LC50 values for fishes (Handy et al. 2011, 2012a). The OECD 210 TG also stipulates that: Normally five concentrations of the test chemical, with a minimum of four replicates per concentration, spaced by a constant factor not exceeding 3.2 are required. However, these aspects of the TG present two issues. Firstly, for ENMs, it is still not clear if mass concentration is the best metric to describe toxicity data, and secondly, the spacing factor of 3.2 has not been validated for ENMs. Furthermore, whether in a range finding experiment, or the main test, the data will be a composite of both embryo and larval stages. From the perspective of fish biology, this is illogical, given that the toxicity of chemicals to fish embryos and larvae may be very different (e.g., Eaton et al. 1978; McKim et al. 1978a, b; Ma and Diamond 2013), and from the perspective of ENMs, which life stage represents the “sensitive window of toxicity” during development is not yet established.

Finally, the OECD 210 TG stipulates that: … the analytical measure of the test concentrations is compulsory. However, there are currently no routine methods with which to accurately quantify ENMs in solution. The metallic component of metal-based ENMs can be assessed using analytical techniques such as ICP-MS, but several problems exist when doing so (Shaw et al. 2013). Engineered nanomaterials such as carbon nanotubes are also problematic to quantify as there is currently no way to differentiate between these particles and background carbon from other sources. Also, for ENMs such as cadmium telluride quantum dots (QD), there is no routine method to quantify the intact material with respect to stoichiometry of the component substances. Analytical methods such ICP-MS can determine cadmium and tellurium concentrations in the water samples or animals, but these do not necessarily inform on the state of the composite QD. The TG goes on to state: Prior to initiation of the exposure period, proper function of the chemical delivery system across all replicates should be ensured (for example, by measuring test concentrations). Analytical methods required should be established, including an appropriate limit of quantification (LOQ) and sufficient knowledge on the substance stability in the test system. Again, current analytical instruments and techniques are often inadequate and modifications, or new methodologies are still being developed. Lastly, with respect to measured concentrations of the test chemical the TG state that: When the measured concentrations do not remain within 80120 % of the nominal concentration, the effect concentrations should be determined andexpressed relative to the geometric mean of the measured concentrations for semi-static tests (see Chapter 5 in the OECD Guidance Document No. 23 on aquatic toxicity testing of difficult substances and mixtures… For traditional chemicals, the notion is to use nominal concentrations to simplifying the logistics and cost, unless there is a clear reason for more detailed quantification. The complex behaviour and settling of ENMs will always require the latter.

The only notable difference in experimental design between the original 1992 TG and the updated 2013 TG is that they now state that there should be a minimum of four replicates, but in the current work, a triplicate design was used as per the original protocol. Throughout this study, ELS zebrafish were exposed to nominal concentrations of either TiO2 ENMs (NM-105), TiO2 bulk (NM-100), Ag ENMs (NM-300K), AgNO3, MWCNT (MN-400) or carbon black (CB) (Table 1). A dispersant control was also utilised in the Ag ENM experiments (NM-300K-Dis). Here, fish were exposed to a dispersant concentration equivalent to that in the highest Ag ENM or MWCNT treatment, respectively (Table 1). See below for ENM characterisation.

Table 1 Characterisation data and nominal exposure concentrations of the materials used for assessment of the OECD 210 TG

Brood stocks of adult wild Indian karyotype zebrafish (Danio rerio) were kept at Plymouth University in 20-l aerated, flow-through glass aquaria containing dechlorinated Plymouth tap water. Temperature was maintained at 28 °C, and the photoperiod was 14 h light/10 h dark (with an identical temperature and photoperiod used during all experiments). Following fertilisation, the embryos were cleaned (i.e., removal of debris and unfertilised eggs) and viable embryos randomly selected and placed into 500-ml experimental Pyrex beakers (Fisher Scientific, UK; n = 20 embryos per beaker, 60 per treatment) containing 400 ml dechlorinated Plymouth tap water (buffered to circumneutral pH). The TG does not stipulate that a synthetic water should be used, only that: Any water in which the test species shows suitable long-term survival and growth may be used as test water (see Annex 4). It should be of constant quality during the period of the test. The utility of the tap water at Plymouth University has consistently been suitable for fish husbandry purposes with little change in chemistry over time. OECD 210 TG indicates the exposure should continue until all of the control fish are feeding exogenously and the control fish reach a juvenile stage (up to 30 days for zebrafish) (temperature quoted as 26 ± 1.5 °C). Regardless of nano, it is not clear why degree-days are not used as the unit of time of development in the OECD test, so that data can be compared with the scientific literature on fish biology. Nonetheless, in the current experiments, embryos were exposed to the nominal concentrations (as above) of each ENM or appropriate control until 6 day post-fertilisation. This duration allowed the collection of the data required to investigate the overall aims of the work, and involved both the unhatched embryos and the hatched larvae, but also minimised the incursion of the experiments into first feeding where the fish would be considered “free-living vertebrates” from the perspective of animal welfare legislation. Furthermore, conducting the exposure past this point in the life stage would require the feeding of the larvae, and introduce a confounding factor of ENM adsorption to food pellets, or indeed, particulate material from the food inevitably entering the water.

The exposures were carried out using a semi-static regime (100 % water renewal every 24 h with subsequent redosing). The unmodified TG states that: … (at least two thirds) of the test solution/control volume is changed… In the current experiments, initial attempts at 80 % daily water changes with the ENMs present in the test vessels were unsuccessful. Because of particle settling, the remaining 20 % of the water in each beaker contained a disproportionate concentration of ENMs (data not shown). Therefore, to ensure accurate dosing and test water renewal, 100 % water changes were carried out in all subsequent experiments. To accomplish this, new test solutions were prepared in clean beakers and the embryos or larvae were transferred with subsequent redosing as described in the TG.

Nanoparticle stock preparation and characterisation

The OECD 210 TG is intended for use with many different chemical substances, and unsurprisingly it would be impractical to document methods for dissolving or dispersing test chemicals. Instead, the TG indicates: … evidence that the measurements [of expected nominal] refer to the concentrations of the test chemical in true solution… Currently, there are not many validated methods for determining ENM concentrations in water (von der Kammer et al. 2012), and it is accepted that particle size distribution measurements can be problematic in natural waters (Ranville and Montano 2015). Nonetheless, preliminary guidance on ENM characterisation and dispersion from the Working Party on Nanomaterials (WPMN) is in preparation, and some guidance on stock preparation and dosing is already available (OECD 2012). It is therefore prudent to consider these nano-specific guidance notes as supplementary information that can inform on the preparations for running the OECD 210 method specifically with ENMs. The OECD 210 TG states that: The use of a solvent carrier is not recommended…. but is allowable where necessary. For ENMs at least, it may be preferable to avoid dispersing agents where possible, and if they must be used to be mindful of the ratio of dispersant concentration: ENM concentration, and its potential effects on particle shape, coating or reactivity (Handy et al. 2012a, b). The OECD 210 TG directs the reader to the OECD guidance document on the aquatic toxicity testing of difficult substances and mixtures (OECD 2000) for further information when using a solvent in testing. However, this document was not written with ENMs in mind, and there are no provisions in the guidance notes for ENMs, though the document does state that the nature of the difficulty in testing colloids is achieving, maintaining and measuring exposure concentrations, which clearly applies to ENMs. Both the OECD 210 TG (OECD 2013) and the difficult substances and mixtures document (OECD 2000) recommend a maximum solvent concentration of 100 µl l−1 in order to avoid effects from the substance used. Again, this approach has not been validated for ENMs.

In the current study, ENM stocks were produced using our standard laboratory protocol (e.g., Federici et al. 2007; Shaw et al. 2012, 2013). Briefly, all glassware was acid-washed and triple-rinsed in deionised water. The desired mass of powder was carefully weighed out for TiO2 ENMs and MWCNT, and volume accurately pipetted for Ag ENMs and then added to a volumetric flask containing 200 ml ultrapure water (Milli-Q, 18.2 MΩ-cm resistivity, ion free and unbuffered). Following manual mixing and inversion, the stocks were then quickly decanted into glass bottles (Fisherbrand, borosilicate glass with polypropylene cap). The stocks (200 ml) were immediately sonicated for 6 h (bath type sonicator, 35-kHz frequency, Fisherbrand FB 11010, Germany) and a further hour prior to each dosing event along with manual agitation (shaking and swirling). Note that Ag ENM stocks were freshly prepared each day in order to minimise the risk of silver ion dissolution from the ENMs. Stocks of TiO2 bulk, carbon black (CB) and AgNO3 were prepared precisely as above for ENMs, but with stirring for 2 h initially and a further 1 h prior to dosing replacing sonication (magnetic stirrer IKA-Werke RET basic C, at 300 rpm). Both the Ag ENMs and the AgNO3 stocks were stored in dark bottles throughout.

Characterisation of the ENMs used in the current study was conducted in our laboratory, whilst initial JRC characterisation data were provided with the materials (Table 1). Furthermore, extensive characterisation of these materials was carried out in separate studies incorporating several laboratories and published as part of the European Commissions’ JRC Science and Policy Reports (i.e., Rasmussen et al. 2014a for TiO2 materials, Rasmussen et al. 2014a, b for MWCNT and Klein et al. 2011 for Ag ENMs) (data presented in Table 1).

The primary particle size of the ENMs used here was calculated by firstly imaging the particles by transmission electron microscopy (TEM, JEOL 1200EXII) and then using Image J Software to measure the particles on the TEM micrographs. A total of 100 particles from 10 different TEM grids were assessed per material. Further size data were generated at Plymouth University by nanoparticle tracking analysis (NTA, NanoSight LM10), which allows an assessment of the particle size distribution in solution. Briefly, ENM stocks were prepared at 100 mg l−1 (optimum ENM concentration for this technique) in Milli-Q water and sonicated for 1 h prior to analysis. Five subsamples of each stock were measured in duplicate giving a total of ten analyses per material. The presence of natural colloids in the water and the relatively low concentrations of ENMs used in the exposures prevented the accurate assessment of particle size distribution of the dosed, experimental waters.

Assessing the dissolution of ENMs

To better understand the mechanisms of toxicity from ENMs it is important to establish whether dissolved ions contribute to the effects. Therefore, experiments were carried out to ascertain whether ions were released from the TiO2 and Ag ENMs used in the fish exposures. These experiments used the dialysis approach as described in Besinis et al. (2014) and Al-Jubory and Handy (2013) with minor modifications. Briefly, 100 mg l−1 dispersions of each ENM were prepared in both Milli-Q and dechlorinated Plymouth tap water. All glassware and dialysis tubing used in the experiments were acid-washed (5 % HNO3) and triple-rinsed in deionised (Milli-Q) water. Dialysis bags were prepared from dialysis tubing (Sigma-Aldrich, cellulose membrane with a molecular weight cut-off at 12,000 Da) precisely as described in Besinis et al. (2014). Experiments were carried out in triplicate with each bag filled with 8 ml of the appropriate ENM solution and secured with mediclips. The bags were then transferred to beakers containing a magnetic stirring bar and 492 ml of the appropriate media (Milli-Q or dechlorinated Plymouth tap water), and placed onto a multipoint magnetic stirrer (RO 15P power, IKA-Werke) for 24 h at room temperature. Water samples (3 ml) were taken immediately and then at 24 h, and Ti or Ag content was assessed by ICP-OES. These experiments showed minimal dissolution of Ag from the Ag ENM, whilst the TiO2 ENMs were stable in solution (no measurable release of Ti) (Table 1).

Water quality

The current OECD 210 TG has quality criteria that specify the allowable deviations in water chemistry within a valid test. Dissolved oxygen must be maintained over 60 % of the air saturation, and water temperature must not differ by more than ±1.5 °C between test chambers or between successive days. Furthermore, the water used should be: any water in which the test species shows suitable long-term survival and growth… and be of constant quality during the period of the test. The guidance on water quality has its origins in historic experimental work on the physiology of fishes and the requirements for fish health. For example, the range for dissolved oxygen is intended to generally avoid a hypoxia-driven hyperventilation (e.g., Holeton and Randall 1967) that might otherwise alter the exposure dose over the gills of teleost fishes. The primary consideration for water quality measurements was not intended to be in relation to the test substance per se. Nonetheless, it remains good practice to confirm the general water quality during a test, and for ENMs, we have yet to confirm their reactive chemistries and how they might (or might not) alter the freshwater. For example, one theoretical concern for a high surface area of a negatively charged ENM might be cation depletion of the freshwater (by adsorption to the particles) leading to osmoregulatory distress. Such possibilities in data interpretation can be verified only if the composition of the water is monitored during the test.

In order to assess water quality parameters in the current study with ENMs, water samples were taken every 24 h (immediately prior to each water change) for pH, dissolved and saturated oxygen, and temperature (HACH HZ40d multimeter). Separate samples were also collected for trace element analysis to inform on the general water quality and to confirm the exposure (see below). Six beakers were also randomly chosen each day and tested for total ammonia (HACH LANGE GMBH LCK kit 304 read on a HACH LANGE GmbH DR 2800 spectrophotometer). Most water quality parameters were maintained within the limits set out in the OECD 210 TG for zebrafish and thus met the validity criteria. As there were no significant differences throughout in routine water quality measurements between beakers or treatments (ANOVA, P > 0.05), data were pooled by experiment. For example, water quality parameters in the first OECD 210 experiment were (mean ± SD, n = 126) dissolved oxygen (7.20 ± 0.10 mg l−1), saturated oxygen (94.97 ± 0.90 %), pH (7.29 ± 0.09) and temperature (27.13 ± 0.29 °C). Total ammonia was 0.06 ± 0.007 mg l−1 (n = 36). The trace element profile of the water was (mean data ± SD, n = 204 samples) 5.97 ± 0.70, 0.77 ± 0.30 and 15.73 ± 2.20 mg l−1 for Na+, K+ and Ca2+, respectively, whilst total hardness (as CaCO2) was 42.5 ± 5.82 mg l−1. The only aspects of the water chemistry that did not meet the OECD 210 TG requirements were the Cu and Zn concentrations. The Cu and Zn levels were 6.42 ± 4.36 µg l−1 and 7.91 ± 6.23 µg l−1, respectively (mean ± SD), but Annex 4 of the TG states that an acceptable (limit) concentration is 1 µg l−1 for both elements. This seems an unusually low concentration to stipulate and difficult to achieve without using reconstituted water (which may have its own inherent problems), though there is no caveat that the test will be invalidated if exceeded.

Confirmation of exposure

The current OECD 210 TG does not have a detailed protocol for taking and analysing water samples to confirm exposure concentrations in the test vessels apart from: … the concentrations of the test chemical are determined at regular intervals to characterise exposure. A minimum of five determinations is necessary. However, this may not be suitable for ENMs. The TG are intended for many different chemicals, so the absence of specific protocols for analytical chemistry is not surprising, and suggestions are given elsewhere on how frequently to measure ENM concentration and what might reasonably be achieved in aquatic tests (Handy et al. 2012a, b; Petersen et al. 2015). Nonetheless, it would be useful to develop a separate guidance document on this for ENMs, with a similar philosophy to that for organic chemicals that proved difficult to dissolve or handle in aqueous solution (e.g., guidance notes from the US EPA or OECD on difficult substances, OECD 2000).

For all experiments in the current work, water samples were taken daily from the middle of the water column of each beaker 10 min post-dosing and 10 min prior to the water change in order to confirm the 24-h exposure profile (i.e., the time period between water changes). Briefly, 10 ml samples were acidified with 200 µl of concentrated analytical-reagent-grade nitric acid (Fisher Scientific) and stored in the dark until required. All ENM samples were sonicated for 20 min and then vortexed for 10 s immediately prior to analysis (samples from the metal salt treatments were vortexed only). Inductively, couple plasma optical emission spectrometry (ICP-OES, Varian 725 ES) was used to analyse all TiO2 and Ag ENM samples, whilst AgNO3 samples were analysed by inductively couple plasma mass spectrometry (ICP-MS, Thermo Fisher Scientific X Series 2) with calibration and quality control as described in Shaw et al. (2013). Briefly, instrument calibration was achieved by using matrix-matched ICP standard prepared from Arista® plasma emission grade Ti and Ag solutions. All samples for ICP-MS also had internal standards (1 % indium and 1 % iridium) to assess instrument drift during the analyses. For TiO2 analysis, the ICP-OES measures Ti metal only so TiO2 values were calculated using stoichiometric conversion based on atomic weight. Data are presented as routine metric concentration units (e.g., µg l−1) or as measured percentage of the nominal dose.

Measuring the biological endpoints of OECD 210

The current OECD 210 TG aims to assess both the lethal and sublethal effects of the test substance, with observations on survival and hatching made at least once daily. There are no specific instructions on the methods to be used to measure effects on fish embryos in the current OECD 210 TG. The observations should include: the stage of embryonic development, hatching and survival, abnormal appearance, abnormal behaviour, weight and body length. Criteria for death are generally well defined in the TG, but from the perspective of fish biology, the other endpoints are not. This lack of technical detail is unfortunate, and for example, references could be made to agreed anatomical/zoological descriptions of the stages of development of each species of fish so that stage of development in the test is accurately recorded the same way in different laboratories performing the test. Relevant articles on zebrafish anatomy and development are available (e.g., Laale 1977; Kimmel et al. 1995; Schilling 2002; Bird and Mabee 2003; Parichy et al. 2009; Menke et al. 2011; Bryson-Richardson et al. 2012). There is also no guidance on preserving ELS fish so that morphometrics, etc., may be done after the exposure is conducted. From the view point of ENMs, it is a question of whether these observations in the test are possible or not (the experiments below indicate the separate issue of sensitivity).

In situ observations of mortality and behaviour may be hindered by turbidity in the test solutions for some ENMs. This was evident in the current study where high mg concentrations of TiO2 gave the water a cloudy, milky appearance. For these beakers, it was necessary to decant most of the exposure medium out of the beaker in order to view and count the animals. Beakers with lower concentrations of TiO2 ENM as well as those containing Ag ENMs and MWCNT did not experience this turbidity, and therefore, the guidelines to assess mortality/survival were used throughout without problem. Each beaker was observed daily to note any abnormal swimming behaviour including lethargy, loss of equilibrium, unusual positioning in the water column (e.g., vertically positioned), lack of reaction to mechanical stimuli and avoidance behaviour (the latter characterised by increased swimming speed and/or attempting to escape). The numbers of survivors and hatched larvae were then recorded immediately prior to the water change. At the end of the exposure (day 6), remaining larvae were preserved to assess for any effects to development. The following outlines how our laboratory makes morphometric measurements on zebrafish, based on many years of experience of fish anatomy, and might be a useful addition to the current OECD TG 210 document. Briefly, larvae were terminally anaesthetised with buffered tricaine methanesulfonate (MS222) and then fixed in formalin until required for morphometric analysis. Larvae (n = 4 per beaker, 12 per treatment where sufficient numbers remained) were photographed at the same orientation (Meiji binocular microscope with Infinity 2-2C camera). Image J software (1.45 s) was the used to assess total length (from anterior part of the head to the end of the tail) as described in OECD 210 TG, skeletal muscle block width on the tail and yolk sac volume. For muscle block width, the larvae were positioned laterally and the muscle (myotomes) in the tail section (between the dorsal and ventral dermis, including the notochord) measured immediately posterior to the yolk sac (at the thickest portion of the tail). Yolk sac volumes (YSV) were calculated according to Velasco-Santamaría et al. (2011) using the equation:

$${\text{YSV}} = \frac{\pi }{6} \times L \times H^{2} ,$$

where π (Pi) is 3.1416, L represents the yolk sac length (mm), and H represents the yolk sac height (mm), expressed as mm3. Microscopic observations were also made in order to assess for spinal deformities (e.g., lordosis, kyphosis) and abnormal eye positioning and development. However, a lack of deformity in the spinal column and eyes of larvae from all treatments meant that there was no quantitative assessment for these endpoints.

Statistical analysis

All data were analysed by StatGraphics Centurion XVI.I. Where no beaker (test vessel) differences were observed, data were pooled by treatment for statistical analysis. After checking kurtosis, skewedness and unequal variance (Bartlett’s test), data were tested for treatment, time, and treatment × time interaction effects by multifactor ANOVA followed by Fisher’s 95 % least-squares difference (LSD) post hoc test. When a statistically significant effect was indicated by this model, a one-way ANOVA was employed for parametric data to assess for simple effects with Fishers LSD used to locate the differences. Where the data was nonparametric, the Kruskal–Wallis test was used and differences located by notched box and whisker plots. Differences among groups were considered significant at P < 0.05. Coefficient of variation (CV) was calculated by \({\text{CV}}(\% ) = ( {\frac{\sigma }{\mu }} ) \times 100,\) where σ is the standard deviation and µ is the mean.

Experimental case study results

Hatching and survival of the control animals met the validity criteria set out for zebrafish in OECD 210 throughout all experiments (i.e., 70 % hatching success and 75 % post-hatch success for zebrafish). The median lethal concentration that kills 50 % of the population (LC50) was calculated where possible, and these data are presented along with the no observed effect concentrations (NOEC) for each material in Table 2.

Table 2 The 96 and 144 h lethal concentration (LC50) and no observed effect concentration (NOEC) values of early-life stage zebrafish exposed to ENMs and relevant controls according to OECD 210

TiO2 ENM

Confirming the exposure

The highest measured percentage of the nominal dose of TiO2 ENMs in the current experiment was 60.1 %, 10 min post-dosing in the 80 mg l−1 treatment (Table 3). However, 24 h post-dosing, this decreased to just 4.0 %. It was clear that both TiO2 ENMs and the bulk material aggregated and settled to the bottom of the exposure beakers, particularly at the higher concentrations. With water samples typically taken from the middle of the water column, these sedimented particles would therefore not have been collected, leading to erroneously low measured ENM concentrations (i.e., unrepresentative sampling of the heterogeneous test water).

Table 3 Nominal and measured concentrations of TiO2 ENMs and bulk

Acute toxicity

Exposure to TiO2 ENMs or TiO2 bulk at concentrations up to 160 mg l−1 did not produce sufficient mortality to calculate LC50 values (Table 2). The lack of acute effects has been seen previously in experiments where zebrafish embryos were exposed to TiO2 EMNs at concentrations reaching 500 mg l−1 (Zhu et al. 2008; Ma and Diamond 2013). The lowest survival in the current study was seen in larvae exposed to 80 mg l−1 TiO2 ENMs where 71.7 % survived by day 6 (Fig. 1a, b). However, survival in the control animals for this experiment was 85.0 % at day 6 (background mortality), and there were no statistically significant differences between any treatments (ANOVA, P > 0.05). Survival was between 78.3 and 93.3 % in all of the other TiO2 treatments (Fig. 1a, b). Where mortality did occur, the zebrafish were not affected during the embryonic stage, with most of the observed mortality occurring in the hatched larvae following 5 or 6 days of exposure (Fig. 1a, b). In the study by Ma and Diamond (2013), the switch to the use of ultraviolet light in the exposures did cause significant acute toxicity, but here, the hatched larvae (at the eleutheroembryo stage) were more sensitive than the pre-hatch embryos. In the current experiment, a concentration-dependent sedimentation of TiO2 ENMs and bulk was observed in each beaker, which resulted in a settling of material on embryos (smothering) and a pollution avoidance response by hatched larvae (swimming and resting at top of beaker only, compared to swimming and resting throughout the water column in controls).

Fig. 1
figure 1

Survival of zebrafish embryos and larvae exposed in triplicate to a TiO2 ENMs, b bulk TiO2, c MWCNT, d carbon black, e Ag ENMs, f AgNO3 using the original OECD 210 TG. Controls with no added ENM or bulk/metal salt were included throughout, as well as dispersant (NM-300K-Dis) only controls in the Ag experiments. Error bars are omitted for clarity. Where no data are presented, 100 % mortality had occurred at these concentrations by that time point. Data are mean, n = 60 larvae per treatment

Swimming behaviour and sublethal toxicity

Larvae exposed to the highest concentrations of TiO2 ENMs and to a lesser degree TiO2 bulk swam or maintained a resting position towards the top of the beakers, whereas larvae from the control beakers were observed to be swimming or positioned randomly throughout the water column, clearly exhibiting pollution avoidance behaviour.

Exposure to TiO2 ENMs or bulk did not affect hatching success in the current study, although there was a slight concentration-dependent lag in time to hatch which was statistically significant in the 160 mg l−1 TiO2 ENM treatment. Here, there was approximately 20 % less hatching at 48 h than in all of the other treatments (ANOVA, P < 0.05). Larvae length was not affected by exposure to TiO2 ENMs. Similarly, no significant differences were seen in muscle block width or yolk sac volume following exposure to  TiO2 ENMs or bulk.

MWCNT

Confirming the exposure

At present, it is not possible to quantitatively confirm the exposure of MWCNT due to a lack of analytical techniques that can distinguish between the MWCNT and other forms of carbon in the sample. Attempts were therefore made to assess the concentration of cobalt (Co) in the experimental water. Cobalt is used as a catalyst in the production of the MWCNT and is therefore present as a metal impurity. However, there were no measurable levels of Co in the water samples (data not shown). A slight, though noticeable, density gradient was visible in the beakers containing 10 mg l−1 MWCNT (the highest concentration), indicting aggregation of this material. This has obvious implications for exposure and the subsequent interpretation of the dose–response.

Acute toxicity

Overall, MWCNT and carbon black showed low acute toxicity to ELS zebrafish at the concentrations tested here (up to 10 mg l−1). Mortality was less than 15 % at the highest concentration of each material (Fig. 1c, d), and as such, it was not possible to calculate an LC50. None of the mortality occurred during the embryonic stage, which is in agreement with the literature where concentrations of MWCNT up to 100 mg l−1 did not cause mortality in zebrafish embryos (e.g., Rojo et al. 2007; Liu et al. 2014). Similar effects have been seen in studies with single-walled carbon nanotube (SWCNT), with concentrations as high as 360 mg l−1 failing to affect the survival of zebrafish embryos (Cheng et al. 2007). A study exposing early-life stage rare minnows (Gobiocypris rarus) to 0–360 mg l−1 SWCNT found that survival was only reduced after the larvae had hatched (Zhu et al. 2015). Much of the limited mortality seen in the current study also occurred post-hatch, further highlighting the low toxicity of this material to fish embryos. This is most likely due to the protective barrier provided by the chorion. The sensitivity of pre-hatch embryos to MWCNT has been seen in a microinjection study. In their work, Cheng and Cheng (2012) introduced fluorescent-labelled MWCNT (2 ng equivalent weight) directly into the embryonic cells of one-cell-stage zebrafish. They observed 100 % mortality in exposed embryos within 8–10 hpf, again demonstrating the effectiveness of the chorion in protecting the animal.

Swimming behaviour and sublethal toxicity

Hatched larvae avoided the bottom of the beaker and thus the aggregated MWCNT, but showed no other signs of altered swimming behaviour or distress. As with TiO2 ENMs, there were no effects to growth (total length or muscle block width) or yolk sac volume (data not shown).

Ag ENM

Confirming the exposure

As the nominal concentration of Ag ENMs increased, the amount measured by ICP-MS or OES decreased (concentration-dependent effect) (Table 4). However, even at the lower Ag ENM concentrations, it was not possible to measure concentrations equal or greater than 80 % and was as low as 16.9 % of the nominal concentration 10 min post-dosing in the 1000 µg l−1 Ag ENM treatment. The Ag ENMs used here were supplied with a capping agent in order to improve dispersion, and yet, even this approach failed to meet the criteria for maintaining the test substance concentrations.

Table 4 Nominal and measured concentrations of Ag ENMs and AgNO3

Acute toxicity

Survival was not affected in the dispersant control treatment in the Ag experiments indicating that any effects seen in the Ag ENM treatments were as a result of the particles only and not caused by the dispersing agent. Both Ag ENMs and AgNO3 were acutely toxic to zebrafish (Figs. 1,2), though the latter was more toxic with a 6-day (144 h) LC50 value approximately ten-fold lower than for the Ag ENMs (Table 2). A NOEC was calculated for Ag ENMS, but again, the concentrations were not low enough for a NOEC to be calculated for the AgNO3 (Table 2). Interestingly, the NOEC for the Ag ENMs decreased significantly between 96 h (23.1 µg l−1) and 144 h (3.9 µg l−1). The 96-h animals had been exposed for half of that time period as embryos which indicates the increased sensitivity of eleutheroembryos (i.e., hatched, but still using their yolk sac for nutrition) and larvae to Ag ENMs compared to embryos.

Significantly reduced survival was observed in embryos exposed to both Ag ENMs and AgNO3. Survival was 6.67 and 16.67 % in the 1000 µg l−1 Ag ENM and 50 µg l−1 AgNO3 treatments at 24 hpf, respectively, and 0 % in both treatments by 48 h (still prior to hatching, Fig. 1e, f). However, although both Ag materials were able to impart toxic effects to the embryos, the mechanisms of toxicity are likely to be different, particularly as the exposures did not commence until the chorion had hardened (i.e., >1 hpf). The chorion enveloping the embryo and the perivitelline fluid within, both present barriers that offer some protection from exposure to contaminants. Negatively charged sulfhydryl groups within the chorion can tightly bind cations such as Ag, Cu and Hg (Rombough 1985), with 85 % of whole embryo Ag burden bound to the chorion in rainbow trout embryos exposed to waterborne Ag (as AgNO3, Guadagnolo et al. 2001). It is likely, however, that either the binding sites were effectively overwhelmed by the higher Ag (as AgNO3) concentrations used here or that the chorionic structure was compromised, thereby facilitating Ag (from AgNO3) access to the embryo. The permeability of the chorion to Ag ENMs has been demonstrated previously. Asharani et al. (2008) used transmission electron microscopy to assess the distribution of Ag ENMs in zebrafish embryos and found the particles in the brain, heart, yolk and blood of exposed animals. A study assessing the transport of single Ag ENMs into zebrafish embryos found that the particles transported in and out of the embryo via the chorion pore canal using Brownian diffusion (Lee et al. 2007). The diameter of chorion pores is approximately 0.5–0.7 µm (Rawson et al. 2000) and are therefore large enough to allow even aggregated ENMs through (MacCormack et al. 2014). Although no attempts were made during the current experiments to image the ENMs within the embryos, it is feasible that they were also able to enter via the chorionic pore, causing the toxic effects seen. The primary particles size of the NM-300K ENMs used here was <20 nm, and although larger aggregates were formed in solution (around 50–75 nm, Table 1), their relatively small size could still allow a significant portion of them to enter the chorion via the canal.

Fig. 2
figure 2

Mortality of ELS zebrafish at the end of the experiment (day 6) from exposure to a Ag ENMs and b AgNO3. Data are mean ± SEM, n = 60 larvae per treatment. Note, the different nominal exposure concentrations used (x-axis)

Swimming behaviour and sublethal toxicity

There were no obvious density gradients visible in the beakers from the Ag ENM exposure, and hatched larvae were distributed throughout the water column with no opportunity for avoidance behaviours. However, larvae were noticeably lethargic with increased Ag ENM and AgNO3 concentrations, with some only moving following physical stimulus.

An assessment of hatching success and time to hatch in the Ag ENM exposures was difficult due to the high mortality seen in that experiment. However, there appeared to be a minor concentration-dependent, though non-significant, effect on both of these endpoints (ANOVA, P > 0.05, data not shown), though a dedicated sublethal exposure would be necessary to elucidate this further.

The morphometric analysis carried out in the current study did show some effects following exposure to Ag ENMs, though again the mortality seen in larvae exposed to the highest concentrations of both Ag ENMs and AgNO3 meant that it was not possible to assess morphometrics in all fish at all concentrations. Surviving larvae at the end of the exposures showed a statistically significant concentration-dependent reduction in total length following exposure to Ag ENMs (ANOVA, P < 0.05, Table 5) and was shown to be a relevant endpoint for the toxicological testing of this material. The control larvae had a mean total length of 3.72 mm compared to mean lengths of 3.36 and 3.25 mm in the 50 and 100 µg l−1 Ag ENM treatments, respectively, representing 9.7 and 12.6 % reductions in total length compared to control larvae (significantly different to controls, ANOVA, P < 0.05). A similar effect was also seen in larvae exposed to 5.0 µg l−1 AgNO3 (Table 5), showing the utility of this measurement for both ENMs and dissolved chemicals outside of regulatory testing.

Table 5 Morphometric analysis of zebrafish larvae exposed to Ag ENMs or AgNO3 for 6 days using the standard OECD 210 TG

Interestingly, although a reduction in total length was seen in larvae exposed to 100 and 5 µg l−1 Ag ENMs and AgNO3 respectively, both of these treatments also saw a simultaneous increase in muscle block width compared to controls (ANOVA, P < 0.05, Table 5). Although myogenesis can be affected by environmental stress (Müller 2008), when considering the relatively short time frame of the current experiments, a more likely explanation is osmotic stress resulting in muscle turgidity. A relative increase in extracellular space and a concomitant decrease in muscle fibre size has previously been observed with copper ENMs in trout (Al-Bairuty et al. 2013). Also, in the present study, a significant increase in yolk sac volume was seen larvae from the 5 µg l−1 AgNO3 and 100 µg l−1 Ag ENM treatments compared to all other treatments and controls (ANOVA, P < 0.05, Table 5; Fig. 3), suggesting either oedema in the yolk sac or lack of utilisation of the contents.

Fig. 3
figure 3

Micrographs showing yolk sacs (YS) of a control larvae and b larvae exposed to 100 µg l−1 Ag ENMs for 6 days (144 h) with the latter exhibiting oedema in the yolk sac (Meiji binocular microscope, ×70 magnification)

Was the OECD 210 test effective for assessing ENM toxicity to fish?

Several issues arose when conducting toxicity tests with the OECD 210 TG. Some of these were nano-specific, and whilst the recent update (OECD 2013) to the original TG released in 1992 addressed some of the fundamental flaws with respect to fish biology and producing robust data, some generic problems still remain. Both the nano and more general issues are discussed below and are summarised in Table 6 along with suggestions to improve the test.

Table 6 Recommendations on test guidelines

Rationale behind the test

The TG states that: Tests with the early-life stages of fish are intended to define the lethal and sublethal effects of chemicals on the stages and species tested. However, there are some problems with this strategy. Firstly, one must question the validity of using a combination of lethal and sublethal endpoints, over what is essentially a chronic duration. For example, if the embryos experience high mortality, then the test will not inform upon effects to free swimming larvae. Chronic tests were designed to assess adverse effects from sublethal concentrations over time (to provide information not available from shorter, acute tests) (Rand et al. 1995). These data are needed in ENM toxicity, where predicted environmental concentrations are low. Also, it has been seen previously that exposure to chemicals at an early life stage may have consequences for later stages, that perhaps would not occur if the later life stage only was exposed. For example, dissolved Cu exposure to the embryos of a cyprinid fish (Leuciscus idus) resulted in reduced toxicity in the hatched larvae compared to larvae exposed post-hatching only (Witeska et al. 2014). A similar effect was seen with Cd, where free-feeding larvae were more sensitive when not exposed as embryos and yolk sac larvae, than those that were exposed throughout these life stages (Brinkman and Hansen 2007). This has environmental implications and requires further investigation for ENMs (see “Is a test spanning both the embryonic and larval stages of fish development suitable for use with ENMs?” section).

Dosing the test vessels

With regards to dosing the experimental vessels the TG state: The test system should preferably be conditioned with concentrations of the test chemical for a sufficient duration to demonstrate stable exposure concentrations prior to the introduction of test organisms. During a 2014 workshop attended by several international experts, one of the approaches discussed was to allow ENMs to reach a pseudosteady state prior to the addition of the test fish (Petersen et al. 2015), akin to the OECD 210 TG. The pseudosteady state is the point where the ENMs appear to have reached equilibrium in terms of agglomeration, settling and dissolution kinetics, with acceptably low or no further measureable changes (Petersen et al. 2015). However, with this approach, one could essentially be missing the initial and perhaps greatest, ‘window of exposure’ (in terms of concentration). Furthermore, this is likely influenced by several factors including water chemistry, analytical techniques and instrumentation, the individual opinion of the researcher, and even test vessel volume. As such, the methodology would require standardisation. Also, in terms of environmental relevance, it is highly unlikely that early-life stage fish will move into a polluted environment, and therefore, it seems pertinent to add the fish prior to the ENMs.

Indirectly related to dosing is the issue of feeding the fish. Although for short, acute tests, the withdrawal of food is routine and does not harm adult or even juvenile fish (Handy et al. 2012b); this approach is not appropriate for early-life stage fish. The yolk sac stage is relatively short-lived, and when depleted, the larvae are no longer nutritionally self-sufficient and have no fat reserves. At this point, it is critical to feed the fish in order to ensure their survival. However, this presents a special problem for the early-life stage test where feeding may confound particle characterisation in the water or the route of exposure (i.e., unintentional dietary exposure as the ENMs stick to the feed). Furthermore, uneaten feed pellets are often removed from the tanks, but this would result in the simultaneous removal of ENMs from the exposure. These problems create additional uncertainty in experiments with ENMs and require consideration in future TG.

Maintaining the exposure

A principal concern with the aqueous testing of ENMs is the ability to maintain exposure concentrations in the water column in order to meet standard test criteria. This is traditionally ≥80 % of nominal concentrations for a fish test (e.g., OECD 1992b), and a failure to meet this validation criterion may result in a rejection of the data (Handy et al. 2012b). The original OECD 210 TG (OECD 1992a) stipulated that concentrations of the test substance in solution had to be satisfactorily maintained within ±20 % of the mean measured values. However, this has been amended in the updated TG (OECD 2013), and the only indication of desirable measured exposure concentrations now comes from the following statement: When the measured concentrations do not remain within 80120 % of the nominal concentration, the effect concentrations should be determined… The aggregation and subsequent loss of ENMs from a colloidal dispersion in solution has been frequently observed, often resulting in ENM concentration levels in the water column that would be below the acceptable limits in regulatory testing (e.g., Ag ENMs, Lee et al. 2012; Cu ENMs, Griffitt et al. 2007; TiO2 ENMs, Clemente et al. 2014). Furthermore, the relatively unpredictable behaviour of ENMs in solution and subsequent lack of reproducibility in achieving similar exposure concentrations in test vessels would also be likely to invalidate the test criteria. Flow though tests are preferred in the OECD 210 TG and are often used due to their ability to maintain dispersions and concentrations of the test chemical in the water column as well as providing sensitive fish species with well-oxygenated water and low ammonia levels (Handy et al. 2012b). However, even for traditional (i.e., dissolved) chemicals, these tests involve the use of somewhat complicated delivery systems and require large volumes of test water and thus test chemical (Welsh et al. 2008). This in turn produces large volumes of hazardous waste water and overall can prove costly compared to static or semi-static systems, increased further if expensive ENMs are tested (Handy et al. 2012a, b). Furthermore, there are concerns that ENMs may stick to or clog the exposure apparatus and that settled ENMs may not be removed from the exposure vessels, thereby increasing ENM concentrations (Petersen et al. 2015). Another consideration is the maintenance of the ENM stock required to deliver the materials to the test vessels (i.e., within a metering pump or diluter). For example, this may require continuous sonication with currently unknown consequences as well as other issues such as the dissolution of metallic ENMs over time. The combination of these factors means that flows through tests are generally impractical for ENMs.

In the absence of a flow-through system, the renewal of the test substance can be achieved through water changes with subsequent redosing (semi-static design). Two approaches to this are recommended in OECD 210. Firstly, the animals can be transferred into separate, but identical test vessels containing the test substance or secondly a proportion (e.g., 2/3) of the test water can be removed and an appropriate volume of freshwater added and the vessel then redosed accordingly. However, because ENMs readily fall out of suspension, it is very difficult to accurately remove a homogeneous proportion with the latter approach. Therefore, in the current study, it was necessary to adopt the approach of using duplicate test vessels to carry out water changes. This was carried out every 24 h, though this itself caused further problems in terms of handling stress on the fish. The TG state that when transferring larvae they should not be exposed to air. Therefore, plastic Pasteur pipettes were used to transfer embryos and larvae to new test beakers, with care taken to minimise the amount of water transferred with the animals in order to negate any changes to nominal substance concentrations in the new beakers. However, some problems with this approach became apparent. Firstly, some of the embryos were accidently dechorionated when using the pipettes, and secondly, it was difficult to capture swimming larvae (increasingly so with age), and the actions in doing so likely caused stress to the animals. This is exacerbated when using ENMs that cloud the water (e.g., TiO2 ENMs), making it more difficult to see the fish. Some larvae in the high TiO2 ENM treatments also became buried in the aggregates of ENMs found at the bottom of the beakers, particularly as they attempted to avoid capture. Furthermore, if the test was to run up to its full duration (up to 30 days post-hatch for zebrafish), then the logistics of daily water changes could prove difficult and may not be financially viable or practical for a commercial laboratory (Handy et al. 2012b).

Is the test substance concentration criteria in OECD 210 fit for purpose with ENMs?

Many of the original OECD TG, including OECD 210, were designed to assess dissolved chemicals (OECD 2013; Petersen et al. 2015) and as ENMs do not readily dissolve in solution, this can lead to problems with the test. The recently updated OECD 210 TG (OECD 2013) states that if the measured concentrations do not remain within 80–120 % of the nominal value, then the effects concentration should be determinedand expressed relative to the geometric mean of the measured concentrations for semi-static tests (further guidance provided in a separate documents, OECD 2000). Exposure concentrations were not maintained between these values in the current experiments, but this amendment to the TG means that the test validity criteria were not failed per se. This is a welcome addition to the TG, though it should remain desirable to attempt to maintain concentrations within this 80–120 % limit of variation from the nominal values. Doing so does represent a scientifically valid approach, with the established paradigm being that response is proportionally related to dose. However, for some ENMs, this is difficult or even impractical to achieve, and the value of such limits for ENM testing should be considered (Petersen et al. 2014). Furthermore, it may be desirable to assess toxicity by alternative dose metrics to mass concentration, such as particle number concentration, size distribution or surface area (Handy et al. 2012a; Petersen et al. 2015). The reasons why ENMs are lost from the water column also need to be considered. For traditional chemicals, these losses may be attributed to actual removal from the test systems (e.g., by volatilisation and chemical degradation processes) (Petersen et al. 2015), whilst ENMs may still be present, but have settled out or stuck to vessel walls, etc. Moreover, technical limitations in quantifying ENMs do not necessarily mean they are absent from the test media.

When assessing test substance concentrations using the old OECD 210 TG (OECD 1992a), it was apparent how variable the results were with ENMs. In this TG, the measured test chemical concentrations had to be satisfactorily maintained within ±20 % of the mean measured values. For example, in the current experiments, the individual measured values from the 10 mg l−1 TiO2 ENM treatment deviated between 60.8 and 219.2 % from the mean immediately following dosing and from 16.9 to 240.8 % after 24 h. Although not now required as part of OECD 210, these data do highlight the difficulty in achieving uniform and reproducible dispersions with ENMs.

Currently, the TG has an upper concentration limit that should not exceed the 96 h LC50 value or 10 mg−1 (whichever is lower). However, as seen in the current experiments, this was not high enough to elicit a toxicological response in the TiO2 ENM experiments. Raising the upper allowable test concentration is a point of debate. In order to gain specific dose–response data, the concentration needs to be high enough to produce a measurable response in the animal, but from a regulators point of view, these concentrations may be environmentally unrealistic and thus not useful to assess.

Analytical measurements

In the Frequency of Analytical Determinations and Measurements section of the TG, it states: Analytical methods required should be established, including an appropriate limit of quantification (LOQ) and sufficient knowledge on the substance stability in the test system. However, for ENMs, we cannot reasonably claim to fully meet these prerequisites. As discussed elsewhere (e.g., von der Kammer et al. 2012), the development of appropriate analytical techniques and instrumentation is the focus of international research, but is not yet routinely available. Also, although progress has been made in understanding the behaviour of ENMs in solution, there are still no universally accepted paradigms. The dynamics of ENMs in test media changes with particle characteristics (including any changes to surface chemistry), abiotic factors and time, and research is still needed to fully elucidate these behaviours.

In order to characterise (confirm) the exposure, the TG recommend at least five determinations throughout the duration of the exposure. For flow-through tests, this should be carried out in at least one replicate per concentration each week, whilst there are no further instructions for semi-static tests. Given the changes over time and variability between replicates for the ENMs tested here, there is a need to increase the measurements from those recommended in the TG. There is also a requirement in the TG to filter (0.45 µm pore size) or centrifuge water samples prior to measuring. However, this is not practical for ENMs where aggregates may get caught in the filter, even if below the 0.45-µm cut-off size and the filter is saturated before use.

Are the sublethal endpoints appropriate for use with ENMs?

Although nano-specific endpoints or biomarkers are desirable and might simplify ENM toxicity testing, it is unlikely that appropriate endpoints will be discovered or developed in the short term (Handy et al. 2012b). A more plausible task is to identify a suite of toxicological responses following exposure to the major types of ENMs and use these to indicate possible exposure (Handy et al. 2012b). For assessing sublethal effects, the OECD 210 TG suggests recording abnormal behaviour and abnormal (morphological) appearance at adequate intervals. However, there are no specific instructions or recommendations on how to assess any abnormalities or to quantify them (i.e., through a classification or scoring system). Furthermore, due to the small size of zebrafish larvae (approximately 3–4 mm), it is very difficult to accurately assess behavioural abnormalities such as hyperventilation and uncoordinated swimming (as are suggested in OECD 210). Some commentary on sublethal observations in our experiments with ENMs is given below in relation to the test method.

Swimming behaviour and quiescence

Although swimming and quiescence were not assessed quantitatively during the current work (e.g., by video tracking analysis), it was clear that hatched larvae exhibited a pollution avoidance response to both the TiO2 ENMs and the TiO2 bulk. This was most evident at the higher TiO2 ENM exposure concentrations where aggregated particles sedimented on the bottom of the beaker forming a layer of TiO2. This observation implies that the settling of the test substance creates the opportunity for the animals to avoid exposure, thus undermining one of the fundamental assumptions in experimental toxicology that the toxic response is associated with the compound administered. However, the avoidance behaviour itself may be a useful endpoint for substances that settle out of the water column.

Hatching

The numerous studies showing that ENMs affect hatching (review; Celá et al. 2014) suggest that this is a suitable endpoint for assessing the toxicity of ENMs to fish embryos. The approach in OECD 210 of combining both lethal and sublethal endpoints can evidently prove somewhat problematic (e.g., disparate sample numbers due to mortality), though this is a generic rather than a nano-specific issue. Moreover, an issue with the current OECD 210 TG is the lack of more specific and robust instructions on how to accurately assess the effects to hatching. Although the guidance notes do state that hatching should be assessed at least once a day, conducting daily observations only is clearly inadequate and can lead to inaccuracies in the data. For example, if an embryo hatches shortly after the daily observation at 48 hpf, it would not be detected until the following day, classified as hatching at 72 h and possibly as delayed. Therefore, in order for this potentially relevant and useful endpoint to be included in ENM testing, the protocol requires updating to be less ambiguous and more rigorous in its approach. This could include more frequent observations when assessing hatching or a separate experiment assessing these sublethal endpoints.

Growth and morphological abnormalities

Somatic growth and morphometrics have long been utilised as sublethal markers of toxicity in fish for traditional chemical substances (Schlenk et al. 2008; Groh et al. 2015) and are beginning to be used for ENMs (e.g., Ramsden et al. 2009; Bai et al. 2010a, b; Fraser et al. 2011; Clemente et al. 2014). The significant decrease in larvae length following exposure to Ag ENMs along with the absence of such effects in TiO2 ENM exposed animals indicates that body length measurements are ENM-specific, which may aid the differentiating of the effects of one material from another in hazard assessment.

During the course of the current work, it was seen that careful handling of larvae for morphometrics was crucial, particularly when euthanising remaining fish and placing them into fixative. Artefacts such as curved spine or even a disintegration of the body would occur if this was not done correctly and with care. Therefore, the inclusion of a suitable standard operating procedure into future TG would be essential, in order to ensure that the data are robust and comparable, the fundamental rationale behind the MAD principle.

The results from the present work suggest that morphometrics may be considered a valid approach for assessing the sublethal effects of some ENMs with comparable effects seen in other studies (e.g., multi-walled carbon nanotubes, Liu et al. 2014; silica ENMs, Duan et al. 2013; ZnO ENMs, Bai et al. 2010a, b), though one must be aware that similar effects have been seen with dissolved contaminants (e.g., reduced larval length in zebrafish larvae exposed to dissolved Cu, Johnson et al. 2007). These measurements cannot therefore be considered nano-specific biomarkers for the toxic effects in fish, but nonetheless remain useful indicators of ENM toxicity. Differences in fish length and yolk sac volume during the current study showed the utility of these endpoints and should be incorporated into any prospective SOPs, but muscle block width was not a sensitive endpoint for any of the materials tested in the current study.

Are the uses of the NOEC and LOEC endpoints still valid?

As part of the reporting procedure, the TG requires the no observed effect concentration (NOEC) and the lowest observed effect concentration (LOEC). Due to the methods used in their calculation, these metrics are essentially a function of test chemical concentration, data variability, significance level (e.g., 95 %) and sample size (Warne and van Dam 2008). As such they have been criticised for their statistical flaws (e.g., Skalski 1981), which can result in them being poor estimators of safe chemical concentrations and incapable of producing consistent summary statistics (Crane and Newman 2000). There have been numerous calls for the cessation of their use in regulatory toxicology and even their reporting in peer-reviewed journals (e.g., Laskowski 1995; Chapman et al. 1996; Warne and van Dam 2008; Landis and Chapman 2011; Sánchez-Bayo 2012). Clearly, this has not been implemented as the requirement for these metrics remains in use internationally, as seen here in the OECD 210 test. This is despite an OECD workshop in 1998 concluding that NOECs should be phased out, a view echoed by the International Organisation for Standardization (ISO) and the European Chemicals Agency (ECHA) (Jager 2012). The concerns that reduce their worth for traditional chemicals may be magnified when testing ENMs. For example, the issue of variability in the data which appears a far greater problem in tests with ENMs than traditional chemicals. Whilst it is outside the scope of the current work to suggest alternatives to these endpoints, there is clearly the need for an agreement and implementation of viable replacements.

When should the exposures commence?

The OECD 210 TG recommends that: The test should start as soon as possible after the eggs have been fertilised and preferably being immersed in the test solutions before cleavage of the blastodisc commences, or as close as possible after this stage. Cleavage of the blastodisc occurs between ¾ and 2 ¼ hpf in zebrafish (Kimmel et al. 1995). Logistically speaking, this is a narrow window of opportunity in which to set up and commence the experiments. During the current work, concerns were raised that processing the several hundred eggs required to fulfil the additional treatments needed in the design (e.g., bulk materials, metal salt controls, dispersant controls) compromised accuracy. For example, working with haste to get the experimental vessels set up quickly after the eggs are fertilised may result in random human error, such as non-viable embryos being added to the experimental beakers. At best, this would introduce variance, and at worst could skew survival data (i.e., giving a false negative). Therefore, although survival in the control beakers was over that required for validation in OECD 210 (i.e., >75 % post-hatch throughout), a separate study was conducted to assess the worth of beginning the exposure as described in the current TG.

With a minimum of 20 embryos per beaker, a routine ENM experiment using the old OECD TG (OECD 1992a) required 660 viable embryos to fulfil OECD 210 criteria (five concentrations of the ENM in triplicate, five concentrations of an appropriate bulk or metal salt control in triplicate and three control beakers). Therefore, for this assessment on the time to start exposures, 660 embryos between ¾ and 2 ¼ hpf were graded into 33 beakers (as per experimental conditions, n = 20 embryos per beaker) and observed daily until hatched. However, hatching success ranged from 35 to 75 % (data not shown) even though they were all in clean dechlorinated Plymouth tap water with more than adequate water quality and a low biomass loading. An identical experiment was then carried out using a new batch of embryos from the same brood stock, except that grading began when the eggs were at the 8–16 cell stage. This increased hatching success to 95–99 % in clean water (data not shown). This improvement in hatching is clearly due to being able to firmly identify viable embryos at the 8–16 cell stage and exclude poor embryos from the experimental design. The variance introduced by the initial selection of the embryos under the current TG will be dependent upon batch quality and experience of the worker, and raises concerns about the utility of this guideline for chemicals generally. Scoring embryos at the 8–16 cell stage removes most of this uncertainty and could be incorporated into the new OECD 210 TG (OECD 2013); especially, now the revised test method is indicating a minimum of four replicates per treatment. This increase in replicates from three to four would add logistical pressure (speed vs. reliability) to the task of preparing test vessels with animals (e.g., 44 beakers containing 880 animals here), with further increases when solvent controls are necessary.

Some studies have observed an increased sensitivity in embryos exposed prior to hardening of the chorion compared to embryos that were exposed a little later when the chorion has hardened (e.g., zebrafish embryos exposed to wastewater, Gellert and Heinrichsdorff 2001). This was attributed to the reduced permeability of the chorion post-hardening, and therefore, the authors argue that exposure should commence prior to the hardening (approximately <0.5 hpf). This could be considered an environmentally realistic approach as fish embryos in the wild could be exposed at any point during their lifespan, but as described above, this would not be logistically practical with tests involving ENMs. Other workers have advocated dechorionation prior to exposure as a legitimate way to bypass the protective effects of the chorion and assess direct effects to the animal (e.g., Mizell and Romig 1997). However, although this approach may be suitable for proof of principle experiments (i.e., how does a substance affect an embryo without a barrier that may not be ubiquitously protective), it would not be appropriate for regulatory toxicity tests where many different test substances need to be screened, a problem that is exacerbated by the myriad of surface functionalisations available for ENMs. Furthermore, chemical dechorionation has been seen to result in low levels of survival, as low as 20 % in zebrafish embryos incubated in pronase (Henn and Braunbeck 2011). Mechanical dechorionation is feasible, but impractical in embryos <8 hpf, again due to variable survival rates (Henn and Braunbeck 2011). Mechanical dechorionation is also labour intensive and requires skill in order to prevent harm to the embryo.

Is a test spanning both the embryonic and larval stages of fish development suitable for use with ENMs?

The OECD 210 TG involves exposing the fish from fertilisation through hatching, the subsequent eleutheroembryo and larval stages, until the control fish reach a juvenile life stage. However, using a test spanning, these life stages can cause uncertainty in the interpretation of the data, as it has often been seen that the embryonic stage is less sensitive to chemicals than that of hatched larvae (e.g., cadmium, Eaton et al. 1978; copper, McKim et al. 1978a, b) or dechorionated embryos (e.g., silver, Guadagnolo et al. 2001). In order to maximise the efficiency and scientific value of the test, it is necessary to know whether this joint life-stage approach is valid for ENMs. Therefore, experiments were conducted to assess the sensitivity of pre-hatch embryos versus post-hatched larvae to ENMs along with animals that were exposed through both life stages. To this end, three experimental groups were exposed in triplicate to either 58 µg l−1 Ag ENMs or a further three to 5 µg l−1 AgNO3 (these values were the 144 h LC50 values previously determined for early-life stage zebrafish for each material, respectively, Table 2). Group 1 were exposed to Ag ENMs or AgNO3 from 4 hpf until free feeding (144 h). Group 2 were exposed to Ag ENMs or AgNO3 from 4 hpf until hatching (approximately 48 h). Group 3 were exposed to Ag ENMs or AgNO3 from hatching (4 hph) for 96 h. The exposures were carried out using an identical experimental design to that describe above for the unmodified OCED 210 test with appropriate no added Ag and dispersant (NM-300K-Dis) controls.

The results showed that larvae were more sensitive than embryos to both Ag ENMs and AgNO3, though this was more noticeable in the Ag ENM treatment (Fig. 4). After 96 h, there were no surviving larvae following exposure to Ag ENMs from post-hatching only, whereas larvae that were exposed to Ag ENMs from 4 hpf throughout the embryonic stage first experienced approximately 25 % survival after being hatched for 96 h (Fig. 4a). These results indicate that pre-exposure to Ag ENMs during the embryonic stage reduces sensitivity in hatched larvae or at least delays the effects. However, a similar trend was not seen in fish exposed to AgNO3, with larvae from both experimental groups experiencing similar levels of survival (around 40–45 %) after being exposed for 96 h post-hatching (Fig. 4b). Interestingly, embryos were not appreciably affected by exposure to AgNO3 from fertilisation, whilst embryos exposed to Ag ENMs did see some mortality prior to hatching (survival less than 80 % in embryos exposed to Ag ENMs by 72 h, compared to over 90 % in those exposed to AgNO3 by that time point, Fig. 4).

Fig. 4
figure 4

A life-stage comparison experiment investigating the differences in survival of zebrafish embryos and larvae exposed to either a 58 µg l−1 Ag ENMs or b 5 µg l−1 AgNO3 for 96 h. These values represented the previously determined 6 day (144 h) LC50 values for early-life stage zebrafish. Error bars and statistics are omitted for clarity, but there was a statistically significant difference (ANOVA, P < 0.05) in survival between embryos and larvae at every time point in the Ag ENM exposure (a). Data are mean of triplicate beakers, n = 20 embryos or larvae per beaker, 60 per treatment

Clearly these data highlight the need to carefully consider the rationale for conducting a joint embryo–larvae test. Whilst a shift towards the use of fish embryos as ‘alternative’ test organisms has been driven by ethical considerations to reduce the number of free-living vertebrate animals used in toxicity testing (Embry et al. 2010; Halder et al. 2010; Knöbel et al. 2012), this should not be at the risk of increased uncertainty in the process of hazard and risk assessment (Oris et al. 2012). Moreover, the OECD 210 test continues until the juvenile stage and therefore does not necessarily fit the 3Rs criteria in any case. Zebrafish embryos were sensitive to Ag ENMs here, but far less so than larval fish. The inclusion of the embryo stage also has implications for experimental design in tests with ENMs. For example, if the ENMs aggregate and fall out of suspension, those embryos on the bottom of the tanks or beakers are likely to be smothered, and whilst this may represent a novel mechanical mechanism of toxicity to fishes, it also implies that one should not assume that the toxicity reported in a test is solely attributable to chemical toxicity. Therefore, the use of embryos in ENM toxicity testing may have some merit, but it is debatable whether they are a more useful life stage than the larval stage. These data also highlights the difficulty in conducting reproducible toxicity tests with ENMs using the unmodified OECD 210 TG. A previously determined LC50 value using the OECD 210 method (Table 2) was used for the exposure concentrations in the life-stage study. However, this concentration caused 100 % mortality after 96-h exposure in the larvae treatment, an effect not seen in animals exposed to the relevant LC50 concentration of AgNO3 (Fig. 4).

Approaches for maintaining ENM exposure

In the present study, maintaining the exposure concentration of the ENMs was problematic when using the current, unmodified, OECD 210 TG. Two predominant issues in aqueous toxicity tests with ENMs are the loss of the materials from solution (e.g., settling out, adhering to tank/beaker walls) and technical limitations in analytical procedures to quantify the ENMs in samples. The difficulties involved in the quantification of ENMs are well documented (see Handy et al. 2008, 2012b; Tiede et al. 2008; Simonet and Valcárcel 2009; von der Kammer et al. 2012; Shaw et al. 2013), and the development of suitable analytical techniques is beyond the scope of the current work. However, modifications to test criteria and ways of improving the maintenance of exposure concentrations were explored.

Redefine test criteria

Prior to investigating potential changes to test apparatus and protocols, the first consideration is whether a relaxation of the criteria required to validate the test with respect to maintaining the exposure should be made. In the case of ENMs, it is arguable that some relaxation of the criteria may be pragmatic, to ensure that the necessary testing is not delayed. Although the updated OECD 210 TG (OECD 2013) does have provisions for when the data do not test requirements for the exposure, it remains critical that attempts should be made to ensure ENM dispersions in toxicity tests. Therefore, attempts were made to improve this aspect of the test design, with our approach considering the importance of keeping the methodology relatively simple in order to allow routine use within an ecotoxicology contract testing laboratory environment. The strategy included the use of dispersants in test water, stirring and the development of exposure chambers (see below). Initially, these experiments were conducted without fish in order to try and improve the colloidal suspensions and later with embryos and larvae to assess the effect of the presence of the animals on the dispersion. TiO2 ENMs were initially chosen as the test material as they had proven the most problematic in terms of maintaining the exposure concentration in the previous experiments, though further work included Ag ENMs and MWCNT (see below).

Changes to exposure media

A common approach for maintaining ENMs in solution is with the use of dispersants (see Handy et al. 2012a, b; Shaw et al. 2013). During the current experiments, three different dispersing agents were tested: sodium dodecyl sulphate (SDS), Triton X-100 and citrate. However, the only ENM to show an improvement was MWCNT when 0.25 mg l−1 SDS was added, though this was not quantifiable due to technical constraints. The addition of 2 % Triton X-100 not only failed to improve dispersions of the ENMs, but it also resulted in the exposure media being too viscous to support fish life. It is possible that as the Ag ENMs used here (NM-300K) were already mixed with a dispersant media (NM-300K-Dis); this negated any potential effects of the other dispersants. This last point demonstrates the importance of understanding the full constituents of the ENMs to be tested, with capping agents potentially affecting the experimental design. Although there are other dispersants that may be more effective, it is unlikely that many of those used will provide suitable husbandry conditions to sustain healthy aquatic life (Petersen et al. 2015). Another approach is to use naturally occurring substances. For example, natural organic matter (NOM) has been seen to improve ENM dispersions (Zhang et al. 2009; Delay et al. 2011), but increasing NOM in an aqueous fish test will not inform upon possible effects in an environment where levels are naturally low. When considering all of these issues, it is preferable to avoid the use of dispersants in tests with ENMs.

Mixing to improve ENM dispersion

With the use of dispersing agents undesirable, the next approach was to attempt to improve ENM dispersion by use of mechanical mixing in the experimental beakers. A preliminary experiment was carried out to assess the ability to keep the TiO2 ENMs in suspension over a 24-h period by simple stirring (12-mm stirring bars on a magnetic multipoint stirring plate (5.5 MOT min−1, RO 15 IKA-Werke). Triplicate test beakers containing 300 ml of dechlorinated Plymouth tap water in 400-ml beakers (as used in animal exposures) were dosed with either 10 or 160 mg l−1 TiO2 ENMs. Samples analysed by ICP-OES 24-h post-dosing showed that the measured mid-water column concentration of TiO2 from the 10 mg l−1 TiO2 ENM treatment was 63 % of the nominal exposure concentration when it was stirred. Interestingly, in the unstirred beakers, just 5 % of the nominal dose was quantified, meaning there was an almost 13-fold increase with stirring. However, when the 160 mg l−1 treatment was assessed, the results were less prominent. Here, the measured exposure concentrations 24 h post-dosing were 27 and 6.5 % of the nominal concentrations with and without stirring, respectively. This indicated that although stirring was able to improve dispersion of TiO2 ENMs, it was mass concentration dependent. It is notable, however, that this was using a high TiO2 ENM concentration (160 mg l−1), which is far higher than the highest recommended concentration in OECD 210 (10 mg l−1).

From the view point of logistics for a testing laboratory, the initial aim was to enable sufficient mixing of the ENMs so that daily water changes were not required to maintain the exposure. However, this was not possible, as even with stirring, the measured concentration of TiO2 ENMs decreased over 24 h with aggregates clearly observed to settle at the bottom of the stirred beakers in the 160 mg l−1 treatment. Using larger (20 mm) stirring bars partially resolved this problem, visually showing an improved dispersion of TiO2 ENMs at both nominal concentrations, with no discernible settling of TiO2 on the bottom of the beaker (data not shown).

Exposure chambers

Fish larvae are delicate animals, and although stirring was effective at improving the colloidal suspension of TiO2 ENMs in the exposure beakers, it also created a mechanical injury hazard to the organisms. There are also concerns that forcing a larva to swim in the vortex of mechanically stirred beaker would change the energy partitioning in the animal (i.e., higher cost of swimming). This may result in less energy being available to deal with toxicity (Handy et al. 1999), thus inadvertently increasing the stress on the animals. Consequently, several modifications of the test apparatus were explored in attempts to find a way of healthily maintaining the animals in the water column of the beaker from the perspective of animal welfare, whilst also finding a stirring method that could still be utilised. In separate experiments within our laboratory, Boyle et al. (2015) attempted to solve this issue by the use of exposure chambers based upon a modification of methods for suspended sediment experiments. In their study, the use of exposure chambers in 2-l beakers with aeration resulted in significantly increased the quantifiable concentrations of Cu, TiO2 and Ag ENMs. These improved dispersions consequently resulted in lower LC50 values for each ENM tested and negated the need for daily water changes. However, for the current study, it was decided that although this was a good model to follow, in order to conduct multiple exposures simultaneously, the beaker volume (thus size) needed to be reduced. Initial experiments in the current study used fully filled 400-ml beakers, but whilst these were a convenient size with which to work, the amount of the dose measured by ICP-OES did not significantly increase (data not shown). Further experiments revealed that using a 1-l beaker (filled to the 1-l mark) resulted in a significant increase in the measured TiO2 ENM concentration in the test vessels towards the desired nominal value. Though the 1-l beakers still had slightly lower TiO2 levels than that seen in 2-l beakers, there was far less variability. For example, the 24 h measured concentration of a 10 mg l−1 TiO2 ENM dose was 80.0 ± 8.7 and 83.8 ± 16.6 % of the nominal concentration for the 1- and 2-l chambers, respectively (data are mean ± SD, n = 12 beakers per treatment). Further attempts to maintain this for 96 h, with and without aeration, were unsuccessful (data not shown), and it was decided that daily water changes were still a necessity.

These experiments led to the development of a prototype exposure chamber similar to that of Boyle et al. (2015). The chamber was made of polyvinyl chloride plastic (PVC) and allowed water flow, but also protected the animals from mechanical injury. These chambers (one per beaker) were suspended in the water column and had meshed cover apertures strategically placed to allow inwards and outwards flow of the ENMs (Fig. 5). The chambers essentially work via a stirring bar that creates a flow of water up through the water column at the sides of the beaker and then down through the chamber back towards the vortex, thereby evenly circulating the dispersed TiO2 ENMs around the beaker. The initial mesh used was constructed from nylon and was designed for aquarium use (e.g., fish netting). It had a pore size of 441.0 ± 26.8 µm (mean ± SD, n = 6 images), chosen so that neither the embryos nor the larvae could fall or escape through it, but big enough to allow even large aggregates of ENMs to flow through. During these experiments, it was noticed that the speed of stirring that was effective in the original experiments without chambers was no longer optimum. Reducing the speed to 3.5 MOT min−1 in conjunction with the prototype exposure chamber improved the stability of the exposure concentration (data not shown).

Fig. 5
figure 5

Diagrammatic representation of the exposure chambers (a) used to improve the dispersion of ENMs in the test [based upon Boyle et al. (2015)]. Using this design in conjunction with magnetic stirrers improved ENM dispersion in the experimental beakers and therefore the exposure, whilst also maintaining animal health. The chamber was effective for both embryos and larvae. The dimensions of the chamber were: height 125 mm, diameter 68 mm, wall thickness 2 mm, diameter of mesh-covered aperture on the side of the chamber 45 mm. c, d Micrographs of the two types of mesh employed in the exposure chambers following 6-h exposure to 10 mg l−1 TiO2 ENMs with stirring. Nylon mesh (b) remained relatively free of TiO2 ENMs, whilst PTFE-coated mesh (c) had visible aggregates of TiO2 ENMs (arrows) which blocked the pores (Olympus SZX7 with Infinity 2-2C camera, ×4.0 magnification)

Experiments were then conducted using both embryos and larvae, and it was seen that the presence of the chamber and the mesh size was suitable for housing the embryos and larvae with no mechanical damage from the stirring bars or escapees. Furthermore, the animals were not affected by the water flow, and quiescent behaviour was observed as normal for early life stage zebrafish (i.e., not discernibly different from larvae in control beakers with no stirring). Further attempts were made to improve the dispersion of TiO2 ENMs within the chambers by exploring increased mesh size and material and changing the chamber vessels from PVC to borosilicate glass (i.e., to reduce adsorption of the test substance to the apparatus). It was hypothesised that polytetrafluoroethylene (PTFE)-coated mesh would improve the consistency of the exposure as this material is known for its ‘non-stick’ properties (e.g., Teflon). However, measured concentrations of TiO2 ENMs were actually reduced when using PTFE mesh compared to nylon (data not shown). A microscopic analysis of the different mesh types revealed that the TiO2 ENMs had aggregated and blocked the pores of the PTFE-coated mesh, far more than seen in the nylon mesh (Fig. 5b, c).

To assess the efficacy of the exposure chambers, that is, its ability to maintain homogenous dispersions of the test materials, an experiment was carried out where nominal concentrations of TiO2 and Ag ENMs (with suitable controls) were added (in triplicate) to 1-l beakers either with or without a chamber. Samples were taken 10 min post-dosing (designated as 0 h) and at 24 h and assessed for Ti or Ag as appropriate using the methods described above. The results clearly show the benefits of using the chambers, with notable improvements in the maintenance of exposure concentrations over 24 h compared to static beakers (Fig. 6). To further explore the use of the exposure chambers, an experiment was conducted using MWCNT. Though it is not yet possible to quantify MWCNT in solution, it was apparent that when using a chamber and stirring, a concentration of 10 mg l−1 MWCNT did not settle out of suspension as noticeably as when added to a static beaker without mixing. This demonstrates the utility of the apparatus for ENMs with different physico-chemical properties and shape to that of TiO2 and Ag ENMs.

Fig. 6
figure 6

Measured concentrations of both TiO2 ENMs and Ag ENMs decreased over time without exposure chambers (a), and significantly improved over 24 h when glass exposure chambers were used with nylon mesh (b). The TiO2 ENM nominal concentration was 10 mg l−1, whilst the nominal concentration of the Ag ENMs was 50 µg l−1

It is essential that methods used in regulatory toxicology have good reproducibility, and so, the utility of the exposure chambers was assessed for variability with both TiO2 and Ag ENMs. Here, the lowest and highest concentrations of TiO2 ENMs, TiO2 bulk, Ag ENMs and Ag (as AgNO3) used in the original exposures (Table 1) were added to chambers in beakers (n = 6 chambers/beakers per treatment/material) containing 1 l of Plymouth tap water as before. Water samples were taken from the centre of each chamber at 0 and 24 h, and assessed by ICP-MS (AgNO3) and ICP-OES (TiO2 ENMs and bulk, Ag ENMs) as described previously. Six samples per test material were assessed, and the coefficient of variation (CV) between each set of six was calculated. The CV provides information on the similarity of the measured concentrations, with low and high variance indicating good and poor reproducibility, respectively. The variation was significantly lower when the exposure chambers were used (Table 7) compared to the original exposures using the unmodified TG (Tables 3, 4). For example, 10 min post-dosing, the CV ranged from approximately 21–41 and 14–32 % for TiO2 ENMs and Ag ENMs, respectively, without chambers. However, when the exposure chambers were used the CV decreased to approximately 3–10 % for TiO2 ENMs and 2.5–5 % for the Ag ENMs, also at 10 min after dosing. This becomes even more significant 24 h after dosing, where the CV was between 71 and 90 % for TiO2 ENMs and 15–37 % for Ag ENMs without chambers, compared to 6–12 and 3–8 % for TiO2 and Ag ENMs, respectively, with a chamber.

Table 7 Utility and precision of the exposure chambers for maintaining TiO2 and Ag ENM dispersion over 24 h

Finally, the use of glass chambers rather than PVC was examined and this change in material increased the measured concentration from 7.5 ± 0.9 to 8.9 ± 0.7 mg l−1 (data, mean ± SD, n = 10 beakers per treatment) for a nominal TiO2 ENM concentration of 10 mg l−1 after 24 h compared to PVC chambers. This represents an almost 15 %, improvement towards the target concentration during this particular experiment, though previous experiments with PVC chambers had resulted in higher measurable concentrations of TiO2 ENMs (e.g., Table 7).

Together, these data show that the chambers were not only able to produce measured exposure concentrations closer to nominal than the original TG, but also reduced inter-beaker variability. Furthermore, they alleviated handling stress on the animals (refinement of animal welfare) by facilitating the quick and easy removal of the fish from the test beakers when conducting water changes. This also reduced the duration of the task, thus improving the logistics (important for commercial testing laboratories). In order to be fit for purpose for routine use, the exposure chambers should be standardised in terms of their dimensions, shape and material. The use of standardised exposure chambers could increase the consistency of the hazard data generated for ENMs (Petersen et al. 2015), which adheres to the MAD principle that these tests were originally designed to achieve.

Recommendations and conclusions

Our recommendations are presented in Table 6, based on our case study experiences at the bench with the TiO2 ENMs, Ag ENMs and MWCNT tested here. Overall, in its current form, the TG for OECD 210 was not entirely proven fit for purpose for ENMs and requires some modification. Some aspects of the TG for OECD 210 would benefit from further written technical guidance on ENMs, perhaps as supplementary appendices or by referring the reader to the detailed TG documents emerging from the Working Party for Manufactured Nanomaterials (WPMN) at the OECD. The most prominent recommendations from Table 6 with respect to the nano-specific problems with the test are:

  1. 1.

    A shift towards the use of larval rather than embryos in a much shorter test, as the larvae are more sensitive to ENMs. This recommendation also removes the confounding problem of multiple life-stage effects from the data interpretation (for all chemicals), as the current test design has embryonic fish and larval stages in one continuous test. The need for chronic ENM tests remains;

  2. 2.

    The utilisation of an exposure chamber (modification of the test apparatus) to maintain exposure concentrations during tests with ENMs rather than modifying the test criteria to a more lenient expected concentration of the nominal test substance concentration. The exposure chambers considerably improve the dispersion of the ENMs throughout the test vessels so that the current test criteria can be met;

  3. 3.

    Inclusion of written guidance on how to measure the sublethal endpoints in fish, and commentary on which endpoints seem sensitive to ENMs so far.

Table 6 also has several recommendations on the technical details of the OECD 210 TG that are aimed at standardising aspects that are important factors in the behaviour of ENMs, but not previously a worry for traditional chemicals. These include a preference to avoid the use solvents, being more prescriptive about when the test substance should be added, extra controls for ENMs, and when/how to feed fish so that it does not interfere with ENM characterisation in the test water. There are also recommendations that either add a comment or an appendix about stock dispersion and media preparation, or links that prompt the reader towards other TG documents from the WPMN where the information may be found for ENMs.

There are also a number of recommendations that are not necessarily nano-specific, but are a source of variation in the current OECD 210 method that can be improved upon by considering the biology of the fish (Table 6), for example, comments on the scoring of embryos at the start of a test. The scoring of embryos during the 8–16 cell stage drastically improved hatching success, and even if embryos are not used in the exposure, this subsequently improves the efficiency of a larval test. Shortening the test by using only a larval stage would be logistically beneficial to the testing laboratory. There is also scope to improve the artificial water recipes so that they are more physiological for fishes.

Whilst these recommendations are intended to improve the utility of the test as a standardised method for hazard assessment, the opportunity to improve the environmental relevance and ethical aspects relating to animal welfare should be taken. Adding a benthic species to the list of organism will be beneficial for substances that settle from the water column in aquatic ecosystems. Proposing a shorter larvae-only style of test does not undermine the ecological importance of the data obtained. In polluted water, unhatched embryos have some protection from the chorion and buffering power of the perivitelline fluid, and consequently, there may be more environmental concerns about hatching itself, than the subsequent survival of the free swimming larvae. The normal development and survival of larvae are a critical part of recruitment into fish populations, and it is necessary to understand the potential effects of ENMs on these ecologically relevant processes (Handy et al. 2011). A larval-only test would reduce the use of animals (3Rs), but remain sensitive for hazard assessment purposes. For risk assessment, logic dictates that testing the most sensitive life stage will offer the greatest probability of protecting other life stages when the risk analysis is performed. For the Ag ENMs and to a lesser extent the MWCNT tested here, the hatched larvae were more sensitive than the unhatched embryos. There is ethical pressure to move towards embryo testing as an alternative to adult fishes. However, the scientific reasoning for using a particular life stage for hazard assessment needs to be clearly defined and weighed against the ethical concerns for the test organism. For our recommended larval test for ENMs, the first 24–48 h or so after hatching (depending on water temperature/animal species) is prior to first feeding and therefore outside of the licensing required under the current EU directives on animal welfare during scientific procedures.

In conclusion, the utility of the OECD 210 ELS test has been experimentally and critically evaluated using a case study approach involving several ENMs with different physico-chemical properties. There are concerns for using ENMs in the current unmodified test method, but these technical problems have been resolved with recommendations for improving the utility of the test for ENMs, as well as other sources of variation associated with the biology of the organisms and general test design. The ELS test using zebrafish (Danio rerio) remains a useful tool for hazard assessment, but our recommendations will improve the scientific robustness of the method as well as increasing the utility of the protocol from the view point of logistics and time in the laboratory. The experiments here were done with the regulatory toxicology community in mind, and it is hoped that regulatory bodies, such as the OECD, can consider these recommendations for adoption.