Introduction

An increasing number of chemical compounds are being produced worldwide in large amounts (CEFIC 2023; Scheringer et al. 2012; Strempel et al. 2012; Wang et al. 2020). It is very important to know their fate and ecotoxicological behavior in the environment in order to make predictions and to take appropriate measures to prevent harmful effects. Biodegradability as well as possible toxic effects and bioaccumulation are the most important criteria for this purpose. In different recent reviews, this matter has been stressed and available test methods have been discussed (Kowalczyk et al. 2015; Strotmann et al. 2023). An effective assessment of chemical compounds for the aquatic compartment, e.g., under REACH (registration, evaluation, authorization, and restriction of chemicals) is mainly based of acute and chronic tests with representative organisms from different trophic levels (algae, crustacea, fish). Toxicity against bacteria is only routinely assessed for the compartment of wastewater treatment plants (WWTPs), often applying the activated sludge respiration inhibition test according to (ISO 8192 2007; OECD 209 2010) and the nitrification inhibition test (ISO 9509 2006; OECD 209 2010). However, bacterial toxicity is also an important factor when interpreting data from biodegradability tests and analyzing biotechnological processes. Activated sludge from WWTPs is the inoculum most often used in tests for determining biodegradability. The species composition of the activated sludge is very diverse and difficult to determine. An overview of the organisms present (Fig. 1) shows that bacteria play a predominant role. The possibilities to identify and characterize bacterial species are limited and vary from classical microbiological methods, over phenotypical, and molecular biological methods up to new spectroscopic techniques (Fig. 2). The importance of bacteria in this complex system cannot be overestimated and inhibitory effects can have far-reaching consequences. Therefore, many test methods for determining potential bacterial toxicity exist which use different inocula, species, and endpoints.

Fig. 1
figure 1

Main organism groups present in activated sludge. Also, the main bacterial phyla (Proteobacteria, Bacteroides, Firmicutes, and Actinobacteria) present in different environmental compartments are indicated. References: 1 (Brown et al. 2019a), 2 (Maza-Marquez et al. 2018), 3 (Madoni 2011), 4 (Brown et al. 2019b), 5 (Shin et al. 2015), 6 (Xia et al. 2010), 7 (Zhang et al. 2018)

Fig. 2
figure 2

Overall description of the methods used to characterize bacterial cells (the graphics of the bacteria cell was retooled from Vectorstock) References: 1 (Dib et al. 2023), 2 (Rodriguez et al. 2023), 3 (Kassem et al. 2023), 4 (Cui et al. 2022), 5 (Zhang et al. 2022), 6 (Farias et al. 2023)

The importance of biodegradability has been pointed out in an overview where also current test methods have been listed (Strotmann et al. 2023). Microbial toxicity tests can also serve for analyzing biotechnological processes in wastewater treatment such as nitrification, denitrification, and nitrogen elimination as well as the control of toxicity of influent and effluent in WWTPs.

In order not to hinder or prevent degradation processes in the natural environment and in technical facilities, some important criteria must be met. One is the toxic effect of chemicals or wastewaters on microorganisms, especially on bacteria, because they are primarily responsible for biological degradation and, therefore, also for their elimination in the environment. In this context, the determination of possible toxic effects of wastewaters and UVCBs (substances of unknown or variable composition, complex reaction products, or biological materials) using appropriate test methods and the correct evaluation of the test results is an important task. Especially UVCBs have come into focus in the last years. They include chemical mixtures such as detergents, fragrances, and personal care ingredients. But they also serve as fuel and are used for chemical reactions. It also has to be mentioned that about 20 to 40% of the chemicals registered in Europe and the USA are UVCBs (Lai et al. 2022). Therefore, this class of chemical mixtures poses large future challenges concerning persistence assessment including the estimation of biodegradability and microbial toxicity (Dimitrov et al. 2015; Kutsarova et al. 2019; Prosser et al. 2023).

Up to now, a number of biotests have been developed to detect possible toxic effects of chemicals in aqueous systems and to make predictions for natural environments (For reviews, see Escher et al. 2021). As a practical consequence, there are some methods available to measure the quality of effluents of WWTPs. They cover a wide range of possible effects such as cytotoxicity, genotoxicity, estrogenicity, and androgenicity (Bain et al. 2014; Bertanza et al. 2013, 2022; Carvalho et al. 2022; Escher et al. 2014; Leusch et al. 2010, 2014; Smital et al. 2011; Stalter et al. 2011; Välitalo et al. 2017). There also exist methods for determining bacterial toxicity using various defined species and endpoints, but their regulatory significance is limited and their use is a particular challenge. It is not the technical side, which poses the problems. There are simple, reliable laboratory methods available, and even dynamic test systems are known to continuously monitor complex systems such as wastewaters. It is rather the selection of the right organisms and the evaluation of the test results, which plays a crucial role. With microorganisms, more than with other organisms, the question arises, which bacterial species or which mixture of microorganisms, the so-called inoculum (for example, activated sludge in biological WWTPs), should be used to represent the huge abundance of bacteria in the natural and technical environment and their intrinsic enormous biochemical and physiological potential (Wu et al. 2019).

Therefore, the targets of microbial toxicity testing are also very diverse (Bitton and Dutka 1986; Dutka and Bitton 1986; Pagga and Strotmann 1999). Toxic compounds can have an influence on enzymes in the catabolic, anabolic, and intermediate metabolism, on nucleic acids, on membranes and the cell wall as well as on the floc structure in activated sludge. In some cases, organic compounds are toxic for living organisms because they accumulate in cell membranes and damage these. Furthermore, essential metabolic processes such as respiration and cell growth, nitrification, or denitrification can also be inhibited, which can be checked and prevented by appropriate test methods. Random processes such as the inhibition of light emission by bioluminescent bacteria (e.g., Aliivibrio fischeri) are used to determine toxic effects on microorganisms. One should take note that the use of pure bacterial cultures (e.g., Pseudomonas species), was one of the earliest approaches in the field of bacterial toxicity, but these methods are only of limited value, because in technical facilities, such as WWTPs, the main goal of tests and predictions are focused on activated sludge which is always a complex microbial culture.

This article provides an overview of available standardized and non-standardized microbial toxicity tests with bacteria, the evaluation of the test results, and a short discussion of future possibilities. However, these tests do not only affect natural and technical environmental systems as they are also used as pre-tests for biodegradation tests or as parallel test assays. Furthermore, they may also be used for the characterization of the quality of the inocula used in these tests. This is especially true for ready biodegradability tests (RBTs) because these test results play a crucial role for the estimation of the environmental persistence of chemicals as well as for regulatory purposes (Gartiser et al. 2022; Kowalczyk et al. 2015; Pagga 1997; Painter 1995; Poursat et al. 2019; Strotmann et al. 2023). Microbial toxicity data of chemicals can also be compared with data from classical ecotoxicological tests, using for example fish, daphnia, or algae tests to enable a sound risk assessment in the environment. Furthermore, microbial toxicity tests play an important role in monitoring the influent and effluent quality of biological WWTPs the latter being a direct consequence of the biodegradation activity in the plant. Therefore, also intoxications in the event of technical disruptions of a treatment plant and unsuitable wastewater streams can be effectively monitored (Araujo et al. 2005; Bertanza et al. 2022; Oliveira et al. 2007; Pagga 1983).

Importance and limitations of microbial toxicity tests

The metabolism of bacteria is extremely diverse, including both the use of various energy sources and different final degradation products. The determination of biodegradability is a fundamental task in the assessment of the environmental behavior of anthropogenically produced compounds. If the bacterial metabolism is severely inhibited by toxic substances, biodegradation processes may be impaired. This may not only lead to an enrichment of non-natural substances but may also have severe negative effects in natural environments such as surface waters, sediments, the marine environment, and soils. Moreover, also biotechnological processes which operate with degrading bacteria such as biological sewage treatment plants, anaerobic digesters, or composting sites may be negatively affected by toxic substances which can have far leading unfavorable consequences for the whole environment.

Laboratory toxicity tests with microorganisms are performed to predict a possible inhibition of biodegradation processes in environmental compartments. This helps for example to avoid disruptions in the operation of WWTPs or the deterioration of the self-cleaning ability of surface water, which may occur by the discharge of toxic wastewater or other toxic material. Useful results are obtained if essential aspects of the target environments are considered. Therefore, numerous factors have to be considered in the tests including the composition of the nutrient solution, salt concentration, pH value, temperature, and the oxygen supply. Furthermore, also the test duration, the concentration of the test substance, and the inoculum play a crucial role. Another important aspect is the biodegradation adaptation potential (BAP) of the inoculum, which is often determined by its origin (Strotmann et al. 2023). In order to avoid incorrect assessments of the test results it is advisable to determine possible inhibitory effects in advance or in parallel with biodegradation tests. Some degradation tests require a control assay using the test substance in combination with a well-known easily biodegradable substrate whose degradation should not be affected. Because of the great importance of the inoculum, its origin should be as far as possible comparable to the target compartment. For general statements, activated sludge used in toxicity or biodegradation tests should originate from standard municipal sewage treatment plants, but for monitoring a special plant treating for example chemical wastewater, it should be derived by that very plant. The reason is the better adaptation of the microorganisms in such a treatment plant which may be caused by a higher tolerance to toxic substances together with a higher biodegradation potential. In this ideal case, reliable predictions for the system to be monitored are possible. However, the transfer to other systems, for example, to other treatment plants or natural environments may cause problems. If even the transmission of test data with activated sludge is problematic, the use of pure bacterial cultures can be even more. Test systems with Pseudomonas species for example may be used to determine an intrinsic toxic potential of substances. On the other hand, it has to be considered, that bacteria show a wide metabolic variety depending on the habitat in which they naturally occur. A transfer of these data from a single species test to make predictions for a certain environmental compartment is rather difficult and may cause problems. For this reason, it is not possible to recommend one single microbial inhibition test that fits for all applications. However, a reasonable generalization can be made by using standardized test methods.

Another important point concerns microbial toxicity tests as pre-tests for biodegradation tests. It is advisable to perform such tests before biodegradation tests are carried out in order to know at which concentrations inhibiting effects may occur. The chosen test concentration should not exceed the EC20 value of the toxicity test, which is the concentration that causes an inhibition of 20 percent compared to the unaffected control. This information may also be obtained in parallel test vessels during the degradation tests (inhibitory control assays), where an easily biodegradable substance and the test substance are mixed. If degradation of the easily degradable substance is detected but not for the test substance, one can rule out that toxic effects may play a role. Furthermore, it can be checked if abiotic degradation or elimination from water occurs by adding a well-known, very toxic substance to one of the standardized test mixtures. A loss of the test substance can then only be due to abiotic processes such as evaporation or adsorption. For this purpose, mercury (II) chloride is often used as an inhibitor because of its strong and reproducible toxic effect. On the other hand, its practical usage in laboratories is critically viewed for reasons of a possible environmental pollution by discharging the residues of the tests.

Overview of microbial toxicity tests

Standardized bacterial toxicity tests (OECD/ISO)

Several standardized laboratory methods are available for testing bacterial toxicity, using both mixed cultures such as activated sludge and pure cultures such as Pseudomonas putida (for an overview, see Table 1). These are usually static test systems mainly representing a freshwater environment and which are operated under largely defined conditions including the test volume, the nutrient solution, the way of mixing, or in aerobic tests, the oxygen supply. The test duration ranges from a few minutes to a few hours. For pure cultures, a distinction must also be made between the duration of cultivation, the incubation with the test substance, and the actual measurement duration. Measured variables are usually the inhibition of respiration by the determination of oxygen consumption and the inhibition of growth via turbidity measurements. The main purpose is to predict the inhibition of bacterial growth as well as the inhibition of biodegradation processes. When the focus is on other important bacterial activities in the environment such as the inhibition of nitrification and denitrification, special test methods are required. In addition, anaerobic inhibition tests are applied, where the underlying principle is the inhibition of biogas production by chemical compounds, determined in comparison to a control without a test compound. A test method derived from the marine environment is the luminescent bacteria test, in which the inhibition of the light emission of special bacteria is used as an indication of toxic effects. The disadvantage of this test is that luminescent bacteria are organisms derived from the marine environment, which tolerate high salt concentrations. These bacteria are different to those from limnic systems such as rivers, lakes, or another surface water. It also has to be mentioned that the phenomenon of luminescence is closely coupled to the catabolic energy-producing metabolism, but that the light emission is not a prerequisite for the survival of the bacteria. The great advantage of the method is, however, that it can be performed rapidly without much effort and that it can be easily automatized. Tests with luminescent bacteria and pure bacterial cultures can at least give a reliable indication of the toxic potential of substances and its relative classification. In addition to these short-term tests, it may sometimes be necessary to carry out long-term studies of toxic environmental effects. On the other hand, these tests are much more complex and more time-consuming than simple laboratory tests and need very special test designs.

Table 1 Different OECD/ISO-based microbial toxicity tests

The results of toxicity tests are usually expressed as limit values that are recorded as effective concentrations (ECx, with x representing the degree of inhibition in percent) in mM (mg/L). The beginning of a significant inhibition is for example an inhibition of 20% compared to a control without a test substance. In order to compare the inhibitory effects of different substances, the statistically more reliable EC50 value is the most important value. In order to indicate the concentration that causes a complete inhibition the EC80 value is generally used.

Respiration inhibition tests

This type of bacterial inhibition test is based on the fact that the oxidative degradation of organic compounds is strictly coupled to a utilization of oxygen which acts as a terminal electron acceptor in the respiration chain. The following Eq. (1) gives a general overview of the overall reaction:

$$Carbon\;source+O_2+X\Rightarrow H_2O+CO_2+byproducts$$
(1)

The heterotrophic respiration (HR) is a vital process for energy generation and biomass production under aerobic conditions. Inhibition of this process by toxic compounds has severe consequences on the biodegradation processes of heterotrophic bacteria. Therefore, these types of inhibition tests are widely used before performing biodegradation tests and to monitor wastewater treatment plants. The standardized methods ISO 8192 and OECD 209 are also known as “short-term respiration inhibition test.” The preferred incubation time of ISO 8192 (e.g., used for wastewater samples) is 30 min while that of OECD 209 (e.g., used for chemical substances) is 3 h. But ISO 8192 also states that the incubation time may extended up to 180 min in the case of poorly water-soluble test materials and in special cases even up to 27 h (Gendig et al. 2003). As an inoculum activated sludge is used at a concentration of 1.5 g L−1. For general predictions, it is usually taken from municipal WWTPs. The inoculum may be pre-aerated to degrade excess carbon material. The oxygen concentration is measured with the help of oxygen electrodes. The test results are expressed as EC20, EC50, and EC80 or as a limit value which shows no inhibition. As a reference compound 3,5-dichlorophenol is frequently used. When the total respiration (TR; the sum of heterotrophic respiration and nitrification respiration) is determined the EC50 is at 9.8 mg L−1 (validity range: 2 to 25 mg L−1; ISO 8192 2007). Concerning the heterotrophic respiration, the EC50 is at 20.3 mg L−1 (validity range: 5 to 40 mg L−1 (ISO 8192 2007). As the test is very reliable and shows a good reproducibility it can be regarded as a standard microbial toxicity test for the estimation of a possible toxicity of chemicals. On the other hand, the sensitivity of the test can be increased when using a prolonged incubation time of 27 h (see Table S1). Especially for 2,3-dichlorophenol an increased sensitivity could be stated when using a test over 27 h with municipal sludge. The source of activated sludge can also have a significant influence (see Table S1). In case of uncertainty, it is therefore useful to perform tests with different incubation times and inocula from different sources. This test system is also very reliable when testing the toxicity of wastewaters (Oliveira et al. 2007; Pagga 1981).

Nitrification inhibition test

The process of nitrification is an important natural process that occurs in soil and water. It involves the conversion of ammonia (NH3) or ammonium (NH4+) into nitrite (NO2), and then into nitrate (NO3) by autotrophic bacteria using molecular oxygen as a terminal electron acceptor in order to generate energy for their metabolism. As nitrifying bacteria have very low growth rate constants they can be easily washed out in wastewater treatment plants when they are not retained by adequate means (Tchobanoglous et al. 1991). The inhibition of nitrification can also be considered a suitable indication for any other toxic effects in microbial environments. Principally the test, as normalized by ISO, can be performed in two different ways. Firstly, the nitrification activity and inhibition of nitrification can be determined by an analysis of the decrease of ammonium and the subsequent production of nitrite and nitrate (ISO 9509 2006). Alternatively, the oxygen consumption due to this oxidation process can be determined (ISO 8192 2007). The nitrification respiration can be calculated by the following Eq. (2):

$$NR=TR-HR$$
(2)

where NR is the nitrification respiration, HR is the heterotrophic respiration and TR is the total respiration. Both the nitrification and the heterotrophic degradation of carbon compounds require oxygen. To distinguish between these processes and to selectively detect nitrification inhibition, allylthiourea (ATU) is used, because this substance selectively inhibits nitrification processes. In this way, the estimation of total respiration in an assay without ATU and the estimation of heterotrophic respiration in an assay containing ATU enables the determination of the nitrification respiration (NR). In addition, the results obtained can be verified by an analysis of nitrate formed by the oxidation of ammonium to nitrate via nitrite. In some cases, the bacterial oxidation of ammonium is not complete, but is stopped at the nitrite stage. The microbial background is that the whole nitrification process consists of two single processes, the nitritation (formation of nitrite from ammonia) and the nitration (subsequent formation of nitrate from nitrite). These processes are performed by the two groups of ammonia-oxidizing bacteria and nitrite-oxidizing bacteria. The two reactions of the whole nitrification process can be described by the following Eqs. 3 and 4:

$$2NH_4^++3O_2\Rightarrow2NO_2^-+4H^++2H_2O\rightarrow Formation\;of\;nitrite$$
(3)
$$2NO_2^-+O_2\Rightarrow2NO_3^-\rightarrow Formation\;of\;nitrate$$
(4)

The overall reaction can be described by the following Eq. (5):

$$2NH_4^++4O_2\Rightarrow2NO_3^-+4H^++2H_2O$$
(5)

From Eq. (5), it is obvious that there is a stochiometric proton release which requires sufficient buffering conditions in the test medium. Since this biological process is also very sensitive, nitrification inhibition tests are required to monitor the buffering capacity of the test medium.

In the ISO 9509 test system, the biomass concentration is at 1.5 g L−1, and the incubation time at least 4 h. The oxygen concentration has to be maintained at 4 mg L−1. Sodium hydrogencarbonate (NaHCO3) serves as an inorganic carbon source for the autotrophic nitrifiers. The pH value should be at 7.6. The inoculum is activated sludge from a suitable WWTP or from a special nitrifying enrichment culture. Nitrification activity is determined by chemical analyses of ammonium, nitrite, and nitrate. Reference compounds can be 3,5-dichlorophenol (EC50 = 5.6 mg L−1), 4-nitrophenol (EC50 = 43.3 mg L−1), or ATU (EC50 = 0.38 mg L−1) (ISO 9509 2006). It has to be mentioned that the EC50 values of a sludge from a wastewater treatment plant can be higher than from a nitrifying enrichment culture. Possible reasons are the adsorption processes of the inhibitors to sludge flocs and diffusion processes (ISO 9509 2006).

In the ISO 8192 test, the inhibition of nitrification can be determined by the measurement of the oxygen demand for nitrification processes according to Eq. 2. The EC50 value for 3,5-dichlorophenol determined in a ring test was at 4.6 mg L−1 (validity range: 0.1–10 mg L−1) which is quite similar to the values in the ISO 9509 test system.

Nitrification activity can also be measured using pure bacterial cultures, such as Nitrosomas europaea (NBRC 1429, ATCC 19718). Mizukami-Murata et al. (2023) and Nishigaya et al. (2016) studied the nitrification processes and reported a strong correlation with ISO 9509 and ISO 8192 for different chlorophenols and dichlorophenols showing a clear comparability of the methods. Comparative data published by Pagga et al. (2006) were in the same range, whereas Yuan et al. (2019) reported somewhat lower values in the range of 0.60 mg L−1.

Growth inhibition tests

The growth inhibition test with activated sludge bacteria according to ISO 15522 specifies a method for assessing the potential toxicity of a test material to the growth of mixed aerobic bacteria present in activated sludge. The inhibitory effect is restricted to those microorganisms capable to grow with the chosen organic test medium. The standard outlines the test environment, reagents, apparatus, and the procedure to be followed. The underlying principle of the test is the exposition of mixed bacteria which originate from activated sludge to different concentrations of a test compound. The addition of the test compound to a nutrient mixture occurs in the early exponential phase. Growth is followed by turbidity measurements with a light of wavelength of 530 nm. The test was validated by testing phenol, different chlorophenols, and 2,4-dinitrophenol (Strotmann et al. 1994). In a subsequent ring test, the EC50 for 3,5-dichlorophenol proved to be at 8.1 mg L−1 and for KCN at 12.3 mg L−1 (Strotmann and Pagga 1996). Comparable data for 3,5-dichlorophenol were reported by Yuan et al. (2019) (EC50 = 4.22 mg L−1). The results of this test have a certain limitation, because the bacteria are cultivated on a complex medium containing nutrient broth and an easily degradable carbon source (e.g., sodium acetate) which is quite different from the composition of a real wastewater. Nevertheless, the EC50 values obtained are comparable to those of respiration inhibition tests.

The growth inhibition test with Pseudomonas putida (ISO 10712 1995) specifies a method similar to ISO 15522 for assessing the toxicity of wastewater and water-soluble substances to bacteria, but it uses pure cultures of a representative of heterotrophic microorganisms present in freshwaters. The toxic concentration range of the test substance is first determined in preliminary tests. In the main test, a serial dilution within the relevant concentration range is used with three replicates per concentration and two controls, a negative one without the test substance and a positive one with 3,5-dichlorophenol, which both are run in parallel. A defined bacterial suspension is added to each test vessel and after 16 ± 1 h, the cell concentration is determined by a turbidity measurement at a wavelength of 436 nm. Using this test the EC50 for 3,5-dichlorophenol was at 21.4 mg L−1 and slightly higher than in the ISO 15522 test with mixed bacteria. In former times, the Pseudomonas putida growth inhibition test was often used in Germany for the derivation of water-hazardous classes (WGK) of chemical substances. However, the system is no longer used and, additionally, the Pseudomonas putida strain has been classified into a risk group, which makes handling in laboratories considerably more difficult. It should be noted, that the OECD 201 2006 norm describes a growth inhibition test with algae and cyanobacteria within an incubation period of 72 h under defined lightening conditions. In fact, cyanobacteria (blue-green algae) such as Anabaena flos-aquae or Synechococcus leopoliensis are bacteria, but the test results are interpreted as algae toxicity data.

Luminescent bacteria test

Light-emitting luminescent bacteria are very common in the marine environment. Their light emission is closely coupled to their catabolic metabolism and when it is disturbed by toxic compounds also the light emission decreases. This effect forms the basis of this inhibition test. Suitable bacterial strains are Aliivibrio fischeri (also designated as Photobacterium phosphoreum and Vibrio phosphoreum) and Vibrio qinghaiensis (Zhang et al. 2023). The test system has been standardized in three variations (ISO 11348 Part 1 to 3). The different parts are due to the cultivation method of the bacterial strain (freshly cultivated, frozen, or lyophilized bacteria). When stored at a temperature of − 80 °C the bacteria keep their viability for years and can successfully be reactivated for the test (Strotmann et al. 2020). The performance of the luminescent bacteria test is easy and rapid and can simply be automated and even kits for simple use are commercially available. The luminescent bacteria test can be used to determine toxic effects of chemical compounds ranging from basic and intermediate chemicals to antibiotics and pesticides including fungicides, herbicides, and insecticides. Another important application is the supervision of all kinds of aqueous solutions ranging from drinking water, river, and surface water eluates up to process effluents and landfill leachates. It can also be used to determine the reduction of toxicity in wastewater treatment plants by analyzing the influent and effluent (Abbas et al. 2018; Araujo et al. 2005; Yuan et al. 2019). In some regard, the luminescent bacteria test is an ideal tool due to its short incubation time (5, 30, 60 min). Recently a modification of the original ISO method has been published where especially the cultivation and reactivation medium have been modified so that the performance of the test system is improved (Strotmann et al. 2020). Moreover, in this test, the commonly known reference compound 3,5-dichlorophenol is used. The EC50 values for different incubation times (5, 30, 60 min) are in a small range from 3.52 to 5.00 mg L−1 (Strotmann et al. 1994, 2020). In the ISO 11348 text, the results of a ring test are given where the EC50 is at 5.80 mg L−1. In a comparative study with different chlorophenols, dichlorophenols, and 2,4-dinitrophenol, it was found that 3,4 dichlorophenol was the most toxic (EC50 = 1 mg L−1) whereas phenol and 2.4-dinitrophenol exhibited the lowest toxicity (EC50 = 29 mg L−1 each) (Strotmann and Eglsäer 1995). The luminescent bacteria test is extremely popular and it is used in a large variety of applications ranging from toxicity determination of chemical compounds and mixtures of these up to a wide range of monitoring purposes (Abbas et al. 2018; Mendonca et al. 2009; Menz et al. 2013). Because of its simplicity and versatile application possibilities the luminescent bacteria test is a reliable screening test, which should be combined in certain situations of ambiguity with another microbial toxicity test which is not based on marine bacteria.

There also exists a kinetic version of the luminescent bacteria test (ISO 21338 2010; Lappalainen et al. 1999) which aims at the determination of light emission in sediments and other solids as well as in colored samples. Here, a kinetic measurement of light emission with its peak and its decay is recorded. The relevant parameter is the signal after a certain incubation time versus the height of the peak. Standard incubation times are 5, 15, and 30 min including a peak measurement 5 s after mixing the probe. Reference compounds are 3,5-dichlorophenol (EC50 = 5.1 mg L−1), zinc (II) (EC50 = 3.03 mg L−1), and chromium (VI) (EC50 = 15.3 mg L−1, 30 min) (Lappalainen et al. 2001).

Inhibition controls in biodegradability tests

When there is a suspicion that a chemical substance or wastewater might have inhibitory effects to the inoculum in ready biodegradability tests, OECD 301 1992 recommends to consider additionally inhibitory control vessels, to avoid false negative conclusions. In the inhibition control, the test compound is incubated with a biodegradable reference substance. A reduction of the biodegradation of the reference substance is interpreted as a toxic effect to the inoculum used. If the biodegradation of the reference substance is less than 35% (based on total DOC) or less than 25% based on total theoretical oxygen demand (ThOD) or theoretical carbon dioxide evolution (ThCO2) within 14 days, the test substance is assumed to be inhibitory and the test should be repeated at a lower test concentration (OECD 301, Sect. 25). According to ECHA (European Chemicals Agency) Guidance for assessing potential toxic effects on microorganisms of activated sludge in WWTPs, the concentration tested in the inhibition control may be used for deriving the “no effect concentration” (NOEC) assuming an assessment factor (safety factor) of 10 (ECHA 2023). The fixed limit values are only a rough estimate. Often, a comparison of the calculated and the measured degradation curves of the inhibition control allows a more precise interpretation.

Anaerobic bacteria inhibition test

There are two similar standardized anaerobic inhibition tests, OECD 224 and ISO 13461–1 and ISO 13461–2. These standards specify a screening method for assessing the potential toxicity of substances, mixtures, surface waters, groundwaters, wastewaters, effluents, sludges, or other environmental samples using digested sludge from anaerobic digestion reactors and by determining the production of biogas (carbon dioxide and methane) over a period of up to 3 days. The biogas production is measured by the determination of the overpressure in the test vessels. An inhibition can be calculated by comparing the gas production with a control vessel. The test is able to predict the maximum direct toxic effect and the time to reach this maximum. ISO 13641–1 specifies a general test, while ISO 13641–2 specifies a test for low biomass concentrations. The normal test duration is 48 h, the test temperature is at 35 °C, and 3,5-dichlorophenol is recommended as a reference substance. The mean EC50 determined in a ring test using (ISO 13461, OECD 224) was at 153 mg L−1. This test can also be used with wastewaters and it fits perfectly with a strategy of biotest systems for the evaluation of anaerobic biological treatment of wastewaters (Abwassertechnische Vereinigung ATV-AG 7.5.1 2004; Strotmann et al. 1993b).

Bacterial inhibition tests for the soil compartment

Although the focus of this review is on aqueous systems, it should be mentioned that several bacterial toxicity tests also exist for soil compartments. Their objective is to evaluate potential toxic effects on soil microorganisms involved in biodegradation processes and nutrient cycles. Examples are the Nitrogen Transformation Test (OECD 216 2000) or the Carbon Transformation Test (OECD 217 2000), next to the nitrification inhibition tests according to ISO 15685 or the contact test for solid samples using the dehydrogenase activity of Arthrobacter globiformis (ISO 18187 2023). Another soil quality test estimates the decomposition of organic matter in contaminated soil (ISO 23265 2023).

Non-standardized test systems

There also exist several non-standardized test systems used to determine bacterial toxicity. They include tests on cell membranes as well as on enzymes, tests based on the measurement of metabolites such as adenosintriphosphate (ATP) as well as microcalorimetric methods. Among these, the most widely tests used are enzymatic assays using dehydrogenases and the estimation of ATP luminescence as a measure of active biomass (Dalzell et al. 2002). The purposes range from the determination of toxic effects to the supervision of vital functions in living cells. For a general overview, see Bitton and Dutka (1986) and Dutka and Bitton (1986).

Toxic effects on cell membranes and QSAR models

Organic compounds in the environment may be toxic for living organisms because they accumulate in and disrupt cell membranes and essential metabolic processes such as respiration and cell growth can be inhibited. The toxicity of such compounds often correlates with the logarithm of its partition coefficient between octanol and water (log P). Table 2 provides a summary of this correlation on the basis of the well-investigated chemical classes of (chloro)phenols and n-alkanols. Substances with a log P value between 1 and 5 are, in general, toxic for whole cells (Heipieper et al. 2007; Sikkema et al. 1995). Such chemicals, for example, toxic hydrocarbons, can only be degraded at a low rate. This needs to be taken into consideration for the design of biodegradation tests. This direct correlation is also used for quantitative structure–activity relationship (QSAR) models. Exceptions are compounds with very low water solubility (bioavailability) and those that are charged. Here, the Henderson-Hasselbalch-Equation can be used to calculate the percentage of non-charged substances for the QSAR model (Heipieper et al. 2007, 1994; Sikkema et al. 1995).

Table 2 LogP dependency of EC50 values (in mM and in mg L−1) of phenol/chlorophenols and n-alkanols. Data for growth inhibition test (EC50) with Pseudomonas putida (Heipieper and Martínez 2018)

Toxicity tests using enzymes

Toxicity tests with enzymes can be very reproducible, reliable, and often can be carried out easily. They include enzymes such as dehydrogenases, ATPases, esterases, phosphatases, urease, mixed function oxidases (MFO), aryl hydrocarbon hydroxylases (AHH), and peptidases (L-alanine-aminopeptidase, Dalzell et al. (2002)). For a detailed review, see Bitton and Dutka (1986) and Dutka and Bitton (1986). The test results from enzymatic tests can give information for possible toxic effects but they have the disadvantage that predictions to the real environment are restricted.

Dehydrogenase test

The dehydrogenase test is frequently used for monitoring the biological activity of activated sludge (Bensaid et al. 2000; Pan et al. 2023) and also for the determination of toxicity of chemical compounds (Strotmann et al. 1993a, 1992). It is based on the fact that artificial acceptors take up the reduction equivalents produced in the microbial electron transport system (ETS) and are subsequently reduced. These reduced compounds form colored substances which can be analyzed spectrophotometrically. This system is easy to perform and can be used for various purposes, such as activated sludge (Bitton and Dutka 1986; Pan et al. 2023; Strotmann et al. 1993a, 1992; Yuan et al. 2019), anaerobic degradation and digestion processes (Wang et al. 2022), in soil (Järvan et al. 2014) and even for eukaryotic systems such as algae (Xie et al. 2008). Acceptor molecules for the reduction equivalents are TTC (2,3,5-triphenyltetrazolium chloride), INT (2-(p-iodophenyl)-3-(p-nitrophenyl)-5-phenyltetrazolium chloride), XXT (3′-[1-[(phenylamino)-carbonyl]-3,4-tetrazolium]-bis(4-methoxy-6-nitro) benzenesulfonic acid hydrate), resazurin (sodium salt of 7-hydroxy-3H-phenoxazin-3-on-10-oxid), and methylene blue (Liu 1983; Maurines-Carboneill et al. 1998; McCluskey et al. 2005; Strotmann et al. 1993a). It has to be mentioned that both methylene blue and TTC have a low affinity for electrons. Therefore, the tests using these indicators have to be carried out in the absence of oxygen. When using INT a time-consuming extraction step is necessary prior to spectrophotometry (Anderson et al. 1988). Resazurin has the advantage that no extraction step is necessary and is, therefore, a favorable method for activity determination and toxicity assessment (Liu 1981, 1983).

Concerning the estimation of activity, it was stated that there is not always a positive correlation between respiration activity and dehydrogenase activity in model wastewater treatment plants operated with municipal or industrial wastewater (Strotmann et al. 1993a). On the other hand, Pan et al. (2023) stated a positive correlation between respiration activity, dehydrogenase activity, and ATP content in a sequencing batch reactor (SBR) model using TTC and INT-based dehydrogenase tests. They measured the effects of heavy metal ions such as Cu2+, Cd2+, and Ni2+ and several antibiotics such as sulfamethoxazole, terramycine, and tetracycline. The reliability of the test system was demonstrated as well as the influence of pH activity of different phenolic compounds (Table S2). Furthermore, it could be shown in a shock loading experiment that the uncoupling agent 2.4-dinitrophenol inhibited the dehydrogenase activity whereas it enhanced the respiration activity as expected (Strotmann et al. 1993a). These results show that the dehydrogenase activity is a suitable test system for monitoring purposes as well as for toxicity assessment and can be used to provide additional information to the respiration inhibition test.

Up to now, the dehydrogenase test system has not been standardized with one exception aiming at soil quality, the contact test for solid samples using the dehydrogenase activity of Arthrobacter globiformis (ISO 18187 2023). It could be useful to standardize more variations of the dehydrogenase test in order to broaden the range of microbial toxicity tests for water and physiological parameters.

Sensitivity of microbial toxicity tests

The bacteria toxicity tests presented here are used to determine the toxicity of chemical compounds and wastewaters. However, it has to be kept in mind, that the sensitivity of the different test systems may be different. The most sensitive test is the inhibition of nitrification activity, followed by the luminescent bacteria test. The respiration inhibition test and the growth inhibition test with sewage bacteria are less sensitive as well as the growth inhibition test with Pseudomonas putida and the dehydrogenase test (see Tables 3, 4, and S2). This assumption is in good accordance with data from the literature (Dalzell et al. 2002; Gutiérrez et al. 2002; Yuan et al. 2019). One has to take note that even standardized tests allow certain changes in its performance. For example, a prolongation of the test duration up to 27 h can increase the sensitivity of the respiration inhibition test (Gendig et al. 2003; ISO 8192 2007; ISO 9509 2006; Strotmann et al. 2020). Toxicity tests require effective reference substances in order to control the performance. Reliable results are usually obtained with chlorophenols, especially with 3,5-dichlorophenol. Ionic compounds such as Zn(II) and Cr(VI) give more variable results because physical–chemical interactions such as precipitation and adsorption may influence the test results (Dalzell et al. 2002; Gutiérrez et al. 2002; Strotmann et al. 2020). Mercury compounds would be ideal, but their use is no longer desirable due to environmental reasons.

Table 3 Toxicity of different phenolic compounds in different microbial toxicity tests (modified after Pagga and Strotmann 1999; Strotmann et al. 1994, 1993c). All concentrations are given in mg L−1
Table 4 Compilation of ecotoxicological data and bacterial toxicity data concerning 3,5-dichlorophenol

In the ECHA Guidance, the following order of increasing sensitivities is shown based on the publications of the German Federal Environment Agency UBA (1993), Reynolds et al. (1987), and Ren and Frymier (2003). The sensitivity increases from the respiration inhibition test to the inhibition control assays in biodegradation tests, over the growth inhibition test with Pseudomonas putida and as the most sensitive tests the inhibition of nitrification and the luminescent bacteria test. On the other hand, sensitivity is not the only criteria for selecting appropriate tests for assessing toxicity effects in WWTPs as mentioned above.

When comparing microbial toxicity tests with classical ecotoxicological tests (which are also designated as ecotoxicity tests) such as the microalgae test with Pseudokirchneriella subcapitata (= Scenedesmus capricornutum) and Desmodesmus subspicatus (= Scenedesmus subspicatus), the Daphnia test with Daphnia magna and the fish test with Crassius auratus, Brachydanio rerio, Oryzias latipes, and Platichthys flesus, it is striking that only the sensitivity of the luminescent bacteria test and the nitrification inhibition test is in the same range (see Table 4). Also, in other studies, a good correlation of the data in these test systems has been published (Bringmann and Kühn 1980; Hernando et al. 2003, 2005; Shigeoka et al. 1988). Therefore, the microbial test systems may be suitable as range finders or pre-tests for other ecotoxicological tests with algae, invertebrates, and fish species. This is also beneficial as animal welfare problems are avoided when using microbial toxicity tests. Therefore, the strict separation of microbial toxicity tests from classical ecotoxicological tests systems should be overthought. It also has to be mentioned that sometimes microbial toxicity tests are seen as a part of ecotoxicological tests and sometimes are seen separately. Nevertheless, microbial test systems could be integrated as a useful help in a toolbox or a test battery containing microbial as well as ecotoxicological tests for the determination of toxicity of chemical compounds and for monitoring purposes (Barcelo et al. 2020; Farré and Barcelo 2003).

A promising integrated toolbox: microbial and ecotoxicity tests in combination with biodegradation tests

In order to properly characterize single chemical compounds and mixtures of chemicals a number of studies have been performed where microbial toxicity tests, ecotoxicity tests, and OECD biodegradation tests have been combined. A bibliometric analysis from two databases (Science Direct and PubMed) was carried out in order to analyze studies integrating toxicity tests when assessing the biodegradation of molecules or mixtures of compounds. Figure S1 shows that 74 articles were retained after selection using specific key words. From these 74 articles only 39 studies used OECD biodegradation tests. Table S3 gives a detailed analysis of the biodegradation tests carried out as well as the associated ecotoxicity tests. The main information that emerges shows that microbial and ecotoxicity tests are generally performed on pure compounds or mixtures before biodegradation tests, whereas only few studies (8/40) focus on the possible toxicity of metabolites. A summary of the data analysis is given in Fig. 3. In addition, this figure also illustrates that there is a close connection between microbial toxicity, biodegradation, and classical ecotoxicology, but it also has to be mentioned that ecotoxicological details are beyond the purpose of this review. Although a wide variety of ecotoxicity tests were carried out, the majority concerned microbial toxicity. Most studies (25/39) combined the luminescent bacteria toxicity test using Aliivibrio fischeri with biodegradation tests. This choice is primarily based on the ease of the test set up than on its ecological relevance. In order to assess the ecotoxicological impact of a compound or mixture, it is necessary to combine tests using organisms from different trophic levels. 6 of the 39 studies combined the impact of the compound on a microorganism, a photosynthetic organism, and an invertebrate (Daphnia magna) (Gartiser et al. 2009; Pedrazzani et al. 2012; Stolte et al. 2012; Sagi et al. 2018; Gatidou et al. 2021). Finally, four studies include mutagenicity or genotoxicity tests in order to assess the impact of leachate of materials, effluents (Gartiser et al. 2009, 2017), detergents (Padrazzani et al. 2012), or by-products of antibiotics due to photodegradation (Bergheim et al. 2015). Therefore, it is recommended that the ecotoxicity assay implemented with biodegradation studies should consider the ecosystem affected by the release of the chemical compounds.

Fig. 3
figure 3

Ecotoxicity tests used before (blue) or after (pink) standardized biodegradation tests. A Schematic trophic chain showing the position of each family of ecotoxicity tests selected by the 39 publications identified in Table S4. The pie charts inside indicate the times the tests were used in the publications. B Details of the ecotoxicity tests performed either before or/and after OECD standard biodegradation tests. The Ames test and the UMU test are tests for mutagenicity. (ATP: adenosine triphosphate)

The interplay of microbial toxicity and biodegradation

It is obvious that toxic effects of chemicals can severely affect biodegradation processes and also hinder the adaptation of bacteria to degrade xenobiotic compounds. This may be due to the original substance, by metabolic products which are formed during the biodegradation process, or by a shift of the pH value. For example, during the degradation of 1,2-dichloroethane the metabolites 2-chloroethanol and chloroacetaldehyde are formed, the latter being well known to be rather toxic and mutagenic (Dijk et al. 2003; Janssen et al. 1984, 1995; Strotmann et al. 1990; Strotmann and Röschenthaler 1987). Many bacteria are able to handle these restrictions and degrade not only toxic substances but also toxic metabolites. This may be due to genetic processes by induction of degrading enzymes or by the excretion of toxic metabolites. These enzymatic processes are often designated as detoxification reactions and can include different types of reactions such as dehalogenation, hydrolysis, hydroxylation, dealkylation, reductions of nitro groups, deamination, ether cleavages, conversion of nitriles to amides, and conjugations (Alexander 1994). Besides these enzymatic reactions, also the cultivation conditions of bacteria can positively affect the biodegradation and lower toxic effects. This could be demonstrated when Pseudomonas putida US2 was immobilized and used to degrade 2-chloroethanol which is a metabolite in the degradation pathway of 1,2-dichloroethane. During the biodegradation process, a liberation of protons occurs during a dehalogenation process, which lowers the pH value. It could be demonstrated that the way of immobilization and an effective pH control together with a supplementation with a secondary substrate had a significant influence on the biodegradation efficiency leading to a technical application in a bioreactor (Knippschild and Rehm 1995; Overmeyer and Rehm 1995). Concerning microbial toxicity 2-chloroethanol exhibits a rather low toxicity with an EC50 at 6 g L−1 (74.52 mM) in the luminescent bacteria test (30 and 60 min test, unpublished results). In an OECD 301F biodegradation test 2-chloroethanol could be degraded to an extent of 93% after a 5 to 6 days lag period which is a clear indication for adaptation processes (Reuschenbach et al. 2003). These results show that a low microbial toxicity often positively correlates with a high adaptation potential of the inoculum in a biodegradation test. This is possible because the tolerant species whose degradation potential is already present prevail. Concerning the cyclic nitrogen-containing compound morpholine similar results could be obtained. Here also a low microbial toxicity (15% inhibition at a concentration of 1000 mg L−1 in a respiration inhibition test with not adapted activated sludge, 5% inhibition at a concentration of 100 mg L−1 in a nitrification inhibition test) seems to be a prerequisite for a successful adaptation process. Morpholine showed a 16 days lag period before the onset of biodegradation in an OECD 301F test system. The final extent of biodegradation was rather high at 87 to 89%. Further extended shock loading experiments with morpholine in a continuously operated laboratory scale WWTPs confirmed these results (Strotmann et al. 1993c). All in all, these examples show that a low microbial toxicity is very favorable to adapt inocula to a xenobiotic compound and stress the importance of microbial toxicity tests as pre-tests for biodegradability tests. This fact is also illustrated in Fig. 4 where the close interplay of microbial toxicity tests and biodegradation tests is illustrated which can also have consequences for subsequent engineering and technical implementations.

Fig. 4
figure 4

Flow chart illustrating the interconnection of microbial toxicity tests, ready biodegradation tests (OECD 301 A to F 1992), inherent biodegradation tests (OECD 302 A 1981, OECD 302 B 1992, OECD 302 C 1981, OECD 304 A 1981) and advanced engineering techniques

The objective of biodegradability testing is not only to conclude whether a test substance is biodegraded or not, but also to indicate whether biodegradability is an intrinsic substance property. Inhibition of the inoculum is regarded as a disturbance of possible biodegradation processes and should be avoided within biodegradation tests. Therefore, Annex II of OECD 301 1992 (see also OECD 301 A to F 1992) recommends that substances suspected to be toxic to the inoculum should preferably be tested at concentrations corresponding to 1/10 of the EC50 values obtained in the activated sludge respiration inhibition tests (OECD 209 2010). ISO 10634 2018 provides a guidance for the preparation and treatment of poorly water-soluble organic compounds for the subsequent evaluation of their biodegradability. Here, it is mentioned that by adding an inert carrier material the bioavailability of a test substance may be reduced due to adsorption processes. In this way, possible toxic effects of the test substance to microorganisms may be reduced and an increase of biodegradation is possible. On the other hand, the limited bioavailability of the test substance due to adsorption may also result in a decrease of biodegradation. The reason lies in the limited access of the test substance to the inoculum. Only few publications addressing these controversial effects are available (Gartiser et al. 2023; Nabeoka et al. 2020; Timmer et al. 2019; van Ginkel et al. 2008). The ECHA guidance (2023) states that inhibitory substances could be tested with the lowest test substance concentration possible, which may be achieved using the Closed Bottle Test (OECD 301 D 1992). Unfortunately, the inoculum concentration and the degradation potential are very low in this test and thus counteracting the desired effect. Inhibitory effects on the inoculum in biodegradability test may often be manifested by longer lag-periods, which is obvious by achieving the 10% level. Another option to avoid the influence of toxic effects is the use of several concentrations of the test substance (e.g., 10 and 20 mg L−1 TOC in the OECD 301 B 1992 test or 50 and 100 mg L−1 ThOD in the OECD 301 F 1992 test). All in all, there exist no general rules to reduce inhibitory effects in biodegradation tests. Therefore, the best option is a try and error approach by practical testing.

Characterization of inocula for biodegradation tests and microbial toxicity tests

General considerations

Bacterial inocula play an important role in biodegradation and bacterial toxicity tests. Therefore, it may be helpful to characterize the quality of the inocula used. This can be done by genetic analysis of the bacteria present in the inoculum, but this is often difficult and time-consuming (Forney et al. 2001; Muñoz-Palazon et al. 2018; Prosser et al. 2007; Saunders et al. 2016; Wu et al. 2019; Xia et al. 2010; Zhang et al. 2012). This approach is very promising for scientific reasons, but it will probably not fully describe the physiological properties and degradation capacities of inocula required for practical microbial toxicity and biodegradation tests.

The inocula used in biodegradation tests mainly originate from municipal WWTPs whose microbial composition may be identified to a certain degree and contain a common core of bacteria which can be found in all these plants (Fig. 1) (Brown et al. 2019b, 2019a; Madoni 2011; Maza-Marquez et al. 2018). If toxicity tests are carried out with pure strains, they cannot be representative for the whole biocenosis in natural or technical systems or they are even entirely missing. For example, the marine bacterium Aliivibrio fischeri used in the luminescent bacteria test is not present in WWTPs. Furthermore, a lack of reproducibility of standardized biodegradation tests has been described, which was associated with the difficult control of the inoculum by terms of density and diversity (Davenport et al. 2022; Strotmann et al. 2023; Thouand et al. 2011). However, investigations have shown that it is possible to sufficiently characterize an inoculum before performing a biodegradation test or a toxicity test. The aim is to limit or control this variability by adjusting both the amount and the activity of the inoculum. There are three main current techniques covering three levels of information. The first one is the measurement of cell density consisting of microscopic observation, the determination of the cell biomass (dry matter, optical density, and other methods) (Ahtiainen et al. 2003; Struijs et al. 1995; Thouand and Block 1993; Thouand et al. 1995), the measurement of cell density by optical means (Riedel et al. 2023; Thouand et al. 1995), the cell counting (total and viable cells, cytometry) (Brown et al. 2019a; Goodhead et al. 2014; Thouand and Block 1993; Thouand et al. 1995) and cultivation methods (colony forming units, CFU and most probable number, MPN) (Gartiser et al. 2023; Thouand and Block 1993; Thouand et al. 1995; Vázquez-Rodríguez et al. 2007). The second group consists of phenotyping methods such as the determination of activity levels (ATP, reduction of tetrazolium salts whose transformation state is an activity indicator) (Pan et al. 2023; Thouand and Block 1993; Vázquez-Rodríguez et al. 2007) and physiological profiling (BIOLOG) (Guo et al. 2010; O’Malley 2006; Vázquez-Rodríguez et al. 2007). For example, O’Malley et al. (2006) selected a better inoculum for an OECD 301 1992 test by using BIOLOG GN microplates by comparing their whole community carbon source utilization profile. Now it is even possible to enlarge the comparison by using the Phenotype Microarrays (BIOLOG) with 190 Carbon sources, 95 nitrogen sources, 59 phosphorous sources, and 35 sulfur sources (personal communication). The third group compiles methods to identify cellular diversity such as genotyping and metagenomic methods (denaturing gel electrophoresis, DGGE; 16 s and 18 s rRNA analysis and shotgun methods in order to identify the main microbial communities present. Together with metatranscriptomic, metaproteonomic, and metabolomic methods, this level is more difficult to achieve and only a few publications associated with the evaluation of the biodegradation of substances have used it Goodhead et al. 2014; Poursat et al. 2020). Of all these characterization methods, only the measurement of dry matter is suggested in the standards. We believe that it is now possible to increase the information on the inoculum before carrying out a biodegradation test, at least by using more descriptors. At a minimum dry matter, cell counting and a physiological profile (using the physiological potential of an inoculum, PPI (see below) and BIOLOG) should be used, for this would allow a better comparison between tests or to identify misfunctioning inoculum before starting a biodegradation test.

The physiological potential of an inoculum (PPI): a new integrative concept of inocula characterization based on biodegradation and microbial toxicity test data

Besides the general physiological description of the inoculum with the help of the BIOLOG system also a specific system can be used which is directly aiming at OECD/ISO normalized biodegradation test. Concerning the physiological biodegradation-oriented capacities there are two approaches which can be applied. One is based on the estimation of the biodegradation adaptation potential (BAP) and the other on the resistance of an inoculum to defined toxic compounds which can be designated as a chemical resistance potential (CRP). Therefore, a successful inoculum should meet certain defined criteria concerning the BAP and the CRP. As these two parameters concern the physiology of degrading bacteria they can be combined to a new parameter, the physiological potential of an inoculum (PPI), which can be regarded as a tool to estimate the quality of an inoculum for biodegradation tests.

The BAP is obtained by the determination of the lag period in a biodegradation test with a defined test chemical, which is the period before degradation takes obviously place. It may describe as well the ability of an inoculum to adapt to an unknown compound. BAP tests can be performed with certain chemical compounds, which can only be degraded after a certain lag period during which the bacteria acquire the capability to degrade these compounds. Examples for such compounds are morpholine, NTA, ibuprofen (adaptation periods in the range of 9 to 16 days), 2-chloroethanol, and diethylene glycol (adaptation periods 5 to 8 days). In general, different compounds require certain adaptation capabilities. NTA, morpholine, and ibuprofen are compounds which require a high adaptation potential, whereas 2-chloroethanol and diethylene glycol require a medium adaptation potential. Compounds such as acetate, glycerol, and benzoate (adaptation periods 0 to 1 day) are not useful because they need only a very short time and require a low adaptation potential. Performing such tests, the adaptation potential of an inoculum can be exactly examined and the inoculum characterized. Adaptability can be grouped into three classes, where an inoculum of class 1 (lag period 0 to 2 days) has the lowest adaptation potential, an inoculum of class 2 (lag period > 2 days to 5 days) has a medium adaptation potential, and an inoculum of class 3 (lag period > 5 days) the highest. In all tests, the extent of biodegradation for ready biodegradability (ranging from 60 to > 80%, depending on the test system used) has to be met. It is important to perform these tests in a way that inocula are given the chance to reach the highest-class level. Inocula with a BAP of class 2 to class 3 are assumed to perform better in a biodegradation test than inocula with a BAP of class 1. Therefore, these inocula meet certain quality criteria concerning adaptability and biodegradation patterns. Details to this new concept were described before (Strotmann et al. 2023).

Besides the adaptation potential of an inoculum also the resistance of an inoculum to a toxic reference compound can be used to characterize it in terms of physiological properties and can be designated as chemical resistance potential (CRP). The CRP is based on the determination of an EC50 in bacterial toxicity tests with a well-defined reference compound. Ideally, an inoculum should show a certain resistance against inhibitory compounds. On the other hand, the resistance should not be too distinct. The concept is primarily aiming at inocula from WWTPs. Here the inoculum should be tested in a respiration inhibition test (heterotrophic respiration) and also in a nitrification inhibition test (autotrophic respiration) using for example 3,5-dichlorophenol which is a well-characterized toxic compound. These two test systems are up to now the most relevant test systems to characterize the physiological activity of activated sludge from a WWTP (Yuan et al. 2019). Therefore, a scheme was developed to characterize activated sludge using these test systems. There are different classes concerning the CRP. Class 1 represents a low chemical resistance, class 2 a moderate resistance, and class 3 a high resistance to 3,5-dichlorophenol. Details of the proposed scheme are shown in Table 5.

Table 5 Estimation of the physiological potential of an inoculum (PPI) based on the biodegradation adaptation potential (BAP) and the chemical resistance potential (CRP) based on respiration and nitrification inhibition tests (modified and extended after Strotmann et al. (2023)

A combination of the BAP and CRP of an inoculum would be ideal to characterize the intrinsic physiological potential of inocula (PPI) for biodegradation tests concerning either the adaptation capabilities and the chemical resistance potential. Therefore, quality groups for the physiological potential of an inoculum ranging from a very high to a low physiological potential have to be met (Table 5). An ideal inoculum for biodegradation tests should meet the criteria for a high or medium physiological potential. A very high potential of an inoculum would mean that the bacteria are extremely well adapted. Such an inoculum would not be representative for a realistic estimation of biodegradability under natural conditions. On the other hand, a low PPI would mean that the inoculum would be unsuitable and would deliver unrealistic results in a biodegradation test. Together with the genetic analysis of the inoculum, this concept with physiological parameters might be ideal for further future developments.

A technical application: bacterial toxicity tests for monitoring wastewater treatment plants

Laboratory tests

The use of bacterial toxicity tests is not restricted to characterize chemical compounds or important information for biodegradation tests. It can also be used for monitoring purposes. There are several fields of application for WWTPs, and if aqueous solutions or extracts are available, also for composting plants, soil, and landfills.

The easiest, fastest, and most cost-effective method is the use of the short-term respiration test (ISO 8192 2007). The test performance is the same as described for chemical compounds. Practice showed that the information obtained is sufficient to predict and, therefore, prevent toxic effects on activated sludge in a treatment plant. In order to be able to estimate the usefulness of laboratory tests, they should be compared with studies in technical plants or continuous run test facilities. Extensive investigations showed that the EC20 obtained in short-term respiration tests represents a very safe limit for the prediction of toxic effects by contaminated sewage. The effluent quality does not deteriorate if the EC20 is not exceeded. The results show as well, that in the case of long-term exposure, mixed populations may react by adaptation or degradation of toxic substances and thereby reduce the initial toxic effects (Pagga 1985, 1986; Pagga and Günthner 1981; Pagga et al. 1986; Strotmann and Weisbrodt 1994). Toxic effects that are not directly related to the respiration of the activated sludge such as the disruption of flocculation, may still worsen the effluent quality, but cannot be detected using respirometric methods.

According to the ECHA Guidance for assessing potential toxic effects on microorganisms in WWTPs, the preferred tests are those using a mixed inoculum because these tests assess the performance of the entire microbial community. For this purpose, the activated sludge respiration inhibition test (OECD 209 2010) is the most important test. The nitrification inhibition test (ISO 9509 2006; ISO 15685 2012) covers the performance of nitrifying organisms. Single species tests such as the cell multiplication inhibition test with Pseudomonas putida or tests with Pseudomonas fluorescens or Escherichia coli have a lower relevance for WWTPs (ECHA 2023). It should be noted, that there are also protozoa toxicity tests based for example on ciliates for assessing inhibitory effects on activated sludge, but this is beyond the objectives of this review paper.

The BASF toximeter as a continuous monitoring system

One of the largest biological WWTP worldwide is being operated by BASF SE in Ludwigshafen Germany. The basic data of this plant are summarized in Table S4. The influent, the mixture of all wastewaters of the chemical sites and of two towns, is continuously monitored by a so-called toximeter which is based on continuous respiration measurements.

Basically, this toximeter is a miniaturized wastewater treatment plant with a primary clarification tank, an aeration tank, and a secondary clarification tank, which is operated in the same way as the real technical plant. However, the location is on the factory premises, about 3.5 km away from the WWTP, which results in a response time of around half an hour. In this time, appropriate measures can be taken in case of a toxic load of the wastewater, e.g., the temporal diverting of the wastewater into a collecting basin.

The respiration rate of the activated sludge in the toximeter is assumed to be the same as in the technical plant. It is monitored by a discontinuous intermittent aeration of the aeration tank. If a maximum oxygen concentration of about 3 mg L−1 is reached, the aeration is switched off. The bacteria continue to respire and the oxygen concentration drops. When a minimum value of approximately 0.5 mg L−1 is reached, the aeration process restarts again automatically. In the oxygen depletion phase, the respiration rate is calculated and graphically indicated. A decreasing respiration rate can reliably indicate a toxic shock load, which may damage the activated sludge. Due to the intoxication of the sludge, it is no longer possible to indicate the end of a toxic influent. Therefore, a second device, the toxicontrol unit, is operated simultaneously, which is continuously run with fresh uncontaminated activated sludge from a separate storage unit.

In addition to the respiration measurement, also the dissolved organic carbon (DOC) and other relevant substances are continuously measured in the influent and effluent. The treatment performance determined by the DOC decrease is an additional important criterion for the safe operation of the monitoring plant.

For many years the operation of the respiration-based toximeter provided a safe monitoring, but in the spring of 1984, a serious disruption occurred in the wastewater treatment plant with massive negative consequences for the effluent quality. The treatment process suddenly collapsed and the untreated wastewater was discharged for a certain time directly into the river Rhine. After a few hours, the activated sludge recovered again and its activity proved to be even better than before. The question arose why the toximeter did not warn the operators in good time. Surprisingly, no inhibition of respiration was measured, but, in contrast, a significant stimulation of the respiration activity was observed. Therefore, the aeration in the plant proved to be too weak to provide enough air to treat the influent. There had been no unusual discharges of wastewaters and analytical measurements showed no irregularities in the influent. Subsequent studies such as simulation tests with uncoupling agents like 2,4-dinitrophenol and other different pollutants could not clarify the situation. Therefore, two basic points can be derived from this incident. Firstly, it can be concluded that respiration is an important monitoring parameter, but that not all incidents can be detected with it. A battery of different additional microbial toxicity tests (e.g., nitrification and dehydrogenase activity for an additional monitoring of the activated sludge activity and the luminescent bacteria test for monitoring the influent and effluent) would be useful efforts to detect certain incidents. Secondly, not only a respiration inhibition can cause trouble, but also an unexpected increase of the respiration rate. Therefore, respiration has to be maintained in a certain safe interval, and leaving this interval can be regarded as an incident. But not only chemical substances could have been the reason, but also changes of the biocenosis of the activated sludge organisms. From today’s perspective, after the devastating impact of Corona on the health of humanity, one could also consider the role of bacteriophages, which can damage bacteria. Bacteriophages are present in activated sludge in high concentrations (see Fig. 1). These viruses could have caused a stimulation of metabolic activity of the activated sludge bacteria, something like an “euphorization.” Consequently, this might have caused the proliferation of certain bacterial species that suddenly started to grow much faster and outgrow the other species present. The result was a huge increase in metabolic activity and an enormous need for oxygen, which could no more be supplied. All these factors may have led to a temporary overload of the system and, as a consequence, a decrease of the effluent quality.

Conclusions and future aspects

In this review, the most important microbial toxicity tests which have been normalized by the OECD and ISO are presented. The differences concerning the sensitivity of the test systems and the environmental compartments which they are aiming at are discussed as well as their advantages and disadvantages for a certain use. There is no single test which meets all requirements. It was also shown how important bacterial toxicity tests are for biodegradation testing. Therefore, an integrated concept which includes both, microbial toxicity tests and adequate biodegradation tests is helpful. The concept of the physiological potential of an inoculum (PPI) based on the biodegradation adaptation potential (BAP) and the chemical resistance potential (CRP) may be useful for such a concept. Furthermore, the set-up of a toolbox also containing certain ecotoxicological tests should be considered and might be useful for the near future. But there are two more points of interest concerning future testing. The first one is the potential which lies in the different fields dealing with “omics” (genomics, proteomics, metabolomics, metagenomics) which will gain increasing importance with the further development of modern technology and information processing. Undoubtedly, these technological improvements will also influence the existing norms and also the subsequent regulatory aspects. The second aspect concerns the field of artificial intelligence. In the field of biodegradation, there are successful projects which have been launched and which aim at predicting biodegradation on the basis of chemical structures and already existing or new test results. Artificial intelligence will certainly gain an increasing practical importance also in this field. Therefore, in the near future new insights will certainly arise which will open much more interesting and important aspects and pathing the way for promising future developments.