Introduction

Amide herbicides constitute the second largest proportion of the herbicides used in agriculture (Ding et al. 2011). Amide herbicides are a group of chemicals than can specifically interfere with biosynthesis of fatty acids, proteins and membrane, inhibit α- amylase and protease activities of germinating seeds, and suppress photosynthesis as inhibitors and uncoupling agents of electron chain transport in these plants (Qin et al. 2007; Robin et al. 2017). However, only a small amount of the applied herbicides reaches the target plants while an overwhelmingly larger portion is introduced into the environment (Cui et al. 2012) that adversely affect nontarget species, such as algae (Zhao et al. 2017), fish (Nassar et al. 2021), and earthworm (Li et al. 2021). These organisms occupy different trophic levels which constitute a significant part of the food web. Ecotoxicological risk assessment of such compounds is traditionally performed using standardized tests (e.g., according to OECD and ISO guidelines) which focus on biotoxicity toward some sensitive organisms. However, current methodologies for biotoxicity testing are expensive, time-consuming, laborious and poorly reproducible (Pavan and Worth 2008).

Quantitative Structure Activity relationship (QSAR) is an effective and low-cost alternative technology of biotoxicity testing that can accurately determine biotoxicity by developing mathematical models to establish the quantitative relationship between molecular structure and biotoxicity of chemicals (Kishor et al. 2019), which can also provide scientific insights into biotoxicity mechanisms of these chemicals (Pandey et al. 2020). Several studies have employed QSAR approach to investigate the quantitative structure activity relationship and predict the biotoxicity of pollutants such as natural medicine (Hamadache et al. 2016), herbicides (Gough and Hall 1999; Zakarya et al. 1996), food additive (Valerio et al. 2007), cosmetics (Hamadache et al. 2016), agriculture (Yang et al. 2020) and metal nanomaterials (Sizochenko and Leszczynski 2016). According to OECD Requirements and Guidelines (Netzeva et al. 2005), a valid and effective QSAR model conforms to the following characteristics: (1) a defined endpoint, (2) an unambiguous algorithm, (3) a defined domain of applicability, (4) appropriate measures of goodness-of-fit, robustness and predictability, (5) a mechanistic interpretation if possible. The QSAR models that complied with OECD principles for QSAR validation can be effectively applied into the ecotoxicological risk assessment of these compounds for management and pollution control.

In the present study, we assembled the toxicity concentrations of amide herbicides from Pubchem and Pesticide Properties Database. A series of electrical, thermodynamic, steric and hydrophobic molecular descriptors including EHOMO, ELUMO, αxx, αyy, αzz, μ, qN, Qxx, Qyy, Qzz, qH+, q, Vm, Eth, Cv, Sθ, Et, Hθ, Gθ, and ZPVE were calculated by density-functional theory calculation in EPIWEB4.1 and ORCA software. Five QSAR models toward algae, daphnia, fish, earthworm and avian species were respectively developed using a combination of Multiple Linear Regression (MLR) and Principal Component Analysis (PCA). The quantitative relationship between chemical structure and biotoxicity was then investigated. The main aim of this study were: (1) to develop valid and effective biotoxicity QSAR models of amide herbicides to different organisms; (2) to establish the quantitative relationship between structure and biotoxicity of amide herbicides; (3) to identify the effects of different molecular descriptors on the toxicity of amide herbicides. The results of this study provide an effective and low-cost measure for accurate biotoxicity determination of amide herbicides and give new insights that will helps to understand the biotoxicity mechanisms.

Materials and Methods

Biological Toxicity of Amide Herbicides

Amide herbicides are hazardous to the ecological environment. There is an extensive database on the effects of amide herbicides on ecosystems. Amide herbicides are thought to inhibit the growth, reproduction and development of many terrestrial and aquatic organisms (Coleman et al. 2000; Lunghini et al. 2020). In this study, acute toxicity results (50% effective concentration (EC50), lethal concentration or dose (LC50 or LD50)) of amide herbicides on five organisms (algae, daphnia, fish, earthworm and avian species) were searched and collected as many as possible from online database or other references based on OECD guidelines. For instance, growth inhibition for algae, immobilization inhibition for daphnia, and mortalities and abnormalities appearance/behavior for fish, earthworm, and avian species. In this study, amide herbicides with higher toxicity that have been studied extensively in previous literatures were selected and collected as much as possible from PubChem and Pesticide Properties Database. Finally, the minimum value of toxicity data of 27 amide herbicides was collected. The name, chemical formula, CAS number and molecular weight of these amide herbicides are shown in Table 1.

Table 1 The molecular structure information of amide herbicides

Quantification of Molecular Structure

In order to characterize the molecular structure of these amide herbicides, twenty-one molecular descriptors were calculated to quantify these structures. The molecular descriptors involve one hydrophobic parameters (Octanol–Water Partition Coefficient, logkow), twelve electronic parameters (EHOMO, ELUMO, αxx, αyy, αzz, qN, qH+, q,μ, Qxx, Qyy, and Qzz), seven thermodynamic parameters (Eth, Et, Cv, Sθ, Gθ, Hθ, and ZPVE) and one steric parameters (Vm). The symbols and definitions of these molecular descriptors are shown as Table 2. The molecular descriptors were quantified as follows: Firstly, the molecular structure of each amide herbicide was initially constructed using ChemDraw 19.0 and then optimized by Chem3D software. Secondly, LogKow of amide herbicides was calculated by EPIWEB4.1. Lastly, the rest of the molecular descriptors were calculated with the optimized structure at the B3LYP/6-311G++ (d, p) level using the density-functional theory (DFT) calculation by ORCA software, as described by Micera and Garribba (Giovanni and Eugenio 2011).

Table 2 Symbols and definition of the molecular descriptors

QSAR Model Development

MLR and PCA modeling methods were performed to develop QSAR models for biotoxicity prediction using SPSS26, Eviews7 and StataMP software, as described by Fadilah and Toropova (Fadilah et al. 2018; Toropova et al. 2015). MLR was employed to describe the quantitative linear correlations between the molecular descriptors and the biotoxicity. PCA was used to eliminate multicollinearity between the individual molecular descriptors during modeling. In this study, the collected toxicity data were categorized into five groups (algae, daphnia, fish, earthworm and avian species) and then the QSAR models were developed, respectively. The specific modeling steps were as follows:

Firstly, in the MLR analysis, Ordinary Least Squares Method (OLS) was applied to identify the most important descriptors contributing to the toxicity. F-tests and T tests were employed to eliminate insignificant descriptors in the OLS analysis. If F-tests and T tests can not pass, the regression equation recheck is necessary. According to the stepwise regression results, an initial QSAR model was developed. The statistical quality of QSAR model was evaluated by fitting coefficient (R2) and root-mean-square error (RMSE). R2 and RMSE are defined as Eqs. (12). Higher R2 and lower RMSE are considered to be one of the necessary conditions for QSAR model to meet the OECD standard (Pandey et al. 2020).

$$R^{2} = \frac{{{\text{SSR}}}}{{{\text{SST}}}} = 1 - \frac{{{\text{SSE}}}}{{{\text{SST}}}}$$
(1)
$${\text{RMSE}} = \sqrt {\frac{1}{n}\mathop \sum \limits_{i = 1}^{n} \left( {y_{i} - \widehat{{y_{i} }}} \right)^{2} }$$
(2)

where SSR is sum of squares due to regression, SSE is sum of squares due to error, SST is sum of squares total, \(\widehat{{y_{i} }}\) is the predicted value of the test set, and \(y_{i}\) is the experimental value of the test set.

Then, PCA was performed on the significant molecular descriptors in the MLS analysis to extract principal components of variables and eliminate multicollinearity. In this work, variance inflation factors (VIF) are defined as Eq. (3), which are adopted to evaluate the collinearity of descriptors in the model. If VIF > 10, the regression equation is unstable and recheck is necessary. MLR analysis was then used again on the extracted principal components of descriptors in the PCA, and a new regression equation was built and a QSAR model with eliminated multicollinearity was developed.

$${\text{VIF}} = \frac{1}{{1 - R^{2} }}$$
(3)

where R2 is fitting coefficient of the regression equation.

Lastly, double cross-validation (internal validation and external validation) was conducted on the developed QSAR models for reliable estimation of prediction errors. In this study, leave-one-out method was used for internal validation and external validation to evaluate the reliability and accuracy of the models, as described by Baumann and Baumann (Désirée and Knut 2014). Internal stability of the developed models was evaluated by leave-one-out cross-validation coefficient (Q2LOO). The models’ performance in predictions was evaluated by external validation correlation coefficient (Q2EXT). Q2LOO and Q2EXT are defined as Eq. (45).

$$Q^{2}_{{{\text{LOO}}}} = 1 - \frac{{{\text{PRESS}}}}{{{\text{TSS}}}}$$
(4)
$$Q_{{{\text{EXT}}}}^{2} = 1\frac{{\mathop \sum \nolimits_{i = 1}^{{n_{{{\text{EXT}}}} }} \left( {\widehat{{y_{i} }} - y_{i} } \right)^{2} }}{{\mathop \sum \nolimits_{i = 1}^{{n_{{{\text{EXT}}}} }} \left( {\widehat{{y_{i} }} - \overline{y}_{{{\text{EXT}}}} } \right)^{2} }}$$
(5)

where PRESS is prediction error sum of squares, TSS is sum of squares of deviations of the experimental values, \(\widehat{{y_{i} }}\) is the predicted value of the test set, \(y_{i }\) is the experimental value of the test set,\(\overline{y}_{{{\text{EXT}}}}\) is the mean of the experimental values of the test set.

According to the procedures described above, an effective and accurate QSAR model for biotoxicity prediction was finally established. The biological toxicity (Y) is described with the best combination of the most relevant descriptors used as independent variables (x1, x2xn), as follows (6):

$$Y = \, a_{1} x_{1} + \, a_{2} x_{2} + \, \ldots \, + \, a_{n} x_{n} + \, a_{0}$$
(6)

where a0 is the intercept and a1, a2an, are the regression coefficients.

Biotoxicity Prediction Accuracy Verification of the Developed QASR Models

The predicted biotoxicities by QASR models and the measured biotoxicities of four amide herbicides (benzadox, cyprazole, epronaz, and coconut diethanol amide CDEA), which were randomly selected, were compared to verify the accuracy of the developed QSAR models for biotoxicity prediction. In order to predict the biotoxicities by the developed QSAR models, the four amide herbicides were performed to structural quantification as “Quantification of molecular structure”. Then, the biotoxicities of these amide herbicides were calculated and predicted based the quantified molecular structures and the quantitative relationship between molecular structure and biotoxicity in the developed models. In order to measure the biotoxicities of the four amide herbicides on algae, daphnia, fish, earthworm and avian species, biotoxicity tests were respectively carried out according to the following OECD guidelines: freshwater Alga and Cyanobacteria, Growth Inhibition Test (OECD 201); Daphnia sp., Acute Immobilisation Test (OECD 202); Fish, Acute Toxicity Test (OECD 203); Earthworm, Acute Toxicity Tests (OECD 207); Acute Avian Oral Sequential Toxicity Test (OECD 223).

Results and Discussion

Toxicity of Amide Herbicides

As shown in Table 3, five groups (algae, daphnia, fish, earthworm, and avian species) of acute toxicity data were collected for QSAR model development. The toxicity for each organism species showed significant differences among these amide herbicides. The EC50, LC50 and LD50 concentrations of these amide herbicides were 0.0036–149 mg/L, 0.058–500 mg/L, 0.36–170 mg/L, 0.515–1000 mg/Kg and 180–30000 mg/Kg for algae, daphnia, fish, earthworm and avian species, respectively. The toxicity of amide herbicides followed in the order of algae > daphnia > earthworm > fish > avian species, as reflected by the change ranges and fold changes in EC50, LC50 and LD50 concentrations. The maximum and minimum toxicity was separately observed in algae and avian species, with a EC50 or LD50 concentration of 0.0036 mg/L and 30,000 mg/kg. It can be seen from these results that a great diversity of amide herbicides with large differences in toxicity to a variety of organisms were included in the QSAR model development.

Table 3 Toxicity of amide herbicides on algae, daphnia, fish, earthworm and avian species

Structural Quantification Information of Amide Herbicides

The three-dimensional graphics of the involved amide herbicides are listed in Figure S1 of the supplementary material. In this study, molecular structure characteristics of amide herbicides were quantified and characterized by a series of molecular descriptors, including one hydrophobic parameter, twelve electronic parameters, seven thermodynamic parameters and one steric parameter. These molecular descriptors showed the structural information and properties of different aspects of amide herbicides. For example, EHOMO is an electronic descriptor directly related to the ionization potential, which characterizes the susceptibility of the molecule toward attack by electrophiles (Sun et al. 2013). q is another electronic descriptor characterizing atomic charges, which are connected with the reactive centers activity of a chemical (Niu and Yu 2004). Sθ is a thermodynamic descriptor that is a measure of resistance to thermal disturbance within a compound (Zhu et al. 2010). Cv and Hθ are thermodynamic descriptors that reflect changes in heat and energy within a molecular system (Xi et al. 2006).

In this study, 27 amide herbicides were collected from pervious references in PubChem and Pesticide Properties Database. As reflected by the molecular descriptors in Table S1–S3 of the supplementary material, the involved amide herbicides showed a large difference in hydrophobic, electronic, thermodynamic parameters and steric properties, indicating the molecular structure differs significantly among these amide herbicides compounds. For hydrophobic descriptor, LogKow varied from − 3.3600 to 6.5100, which indicated a great difference for the aqueous solubility and hydrophobicity of these amide herbicides. The variation in electronical descriptors among these amide herbicides ranged from 1.3-fold to 26.8-fold for EHOMO and μ, respectively. For thermodynamic descriptors, Eth, Et, CV, Sθ, Gθ and Hθ were, respectively, − 3288.8321 to − 715.2634, 110.2390–268.5050, 44.2020–116.1480, 115.5790–211.3120 and − 3288.8311 to − 715.2624. The steric descriptor Vm varied extensively, ranged from − 101.8673 to 3306.8850. Maximum and minimum values for these molecular descriptors occurred frequently in allidochlor and saflufenacil. These results supported that amide herbicides with a wide variety of molecular properties were involved in this study.

QSAR Model Development and Accuracy in Toxicity Prediction

Table 4 provides the overall summary of the developed QSAR models for each set of organism groups. Based on the fitting coefficient (R2), root-mean-square error (RMSE) in the MLR and PCA analysis and the double cross-validation coefficients (Q2LOO and Q2EXT) in the model validation procedures, these QSAR models showed good robustness and prediction ability in the toxicity evaluation and prediction of amide herbicides. Less than 5% deviation between the model predicted biotoxicity values and the measured biotoxicity results was identified as accurate QSAR models. The QSAR model for earthworm was better than the other four models for algae, avian species, daphnia, and fish, as indicated by higher R2, Q2LOO, Q2EXT, and lower RMSE in QSAR model development.

Table 4 Summary of the best QSBR model for each of the selected organisms

QSAR Models for Aquatic Organisms

The developed QSAR models for three aquatic organisms (algae, daphnia and fish) are shown in Table 4. According to OECD Requirements and Guidelines, if R2 > 0.6, Q2LOO > 0.6 and Q2EXT > 0.5, the developed models are available, and if Q2LOO > 0.9 and Q2EXT > 0.9, the models are identified as excellent. Additionally, the closer R2 gets to 1, the better the fitting effects of the developed models. Our results supported that all the three models were stable (0.8869 < R2 < 0.9666) and robust (0.7201 < Q2LOO < 0.8634) and showed good performance in the toxicity prediction of amide herbicides (0.5612 < Q2EXT < 0.7676).

For predictive ability tests of the developed QSAR models, the model predicted EC50 or LC50 concentrations of the chosen four amide herbicides (benzadox, cyprazole, epronaz, and CDEA) on the three aquatic organisms (algae, daphnia and fish) was compared with the measured EC50 or LC50 concentrations, as shown in Fig. 1. The results showed that the deviations of the model predicted EC50 or LC50 values and the measure EC50 or LC50 concentrations varied from 0.7 to 3.5, from 1.0 to 4.9, and from 2.6 to 4.6, respectively, for algae, daphnia and fish, all of which were less than 5% deviation. It was supported that the measured EC50 or LC50 concentrations and the model predicted EC50 or LC50 concentrations showed good agreements, indicating the accurate predictive ability of the developed QSAR models for aquatic organisms.

Fig. 1
figure 1

The predictive performance of the developed QSAR models as reflected by the measured and the model predicted EC50 or LC50 values (mg/L or mg/Kg) of four amide herbicides on algae, daphnia, fish, earthworm and avian species. a Cyprazole, b benzadox, c epronaz, d CEDA, e the measured EC50 or LC50 values and the model predicted EC50 or LC50 values of four amide herbicides

QSAR Models for Terrestrial Animals

As shown in Table 4, the QSAR models for two terrestrial animals (earthworm and avian species) were developed, conforming to OECD Requirements and Guidelines (R2 > 0.6, Q2LOO > 0.6 and Q2EXT > 0.5) (Tropsha 2010). Comparatively, the QSAR model for earthworm showed more excellent performance, with high stability (R2 = 0.9700), robustness (Q2LOO = 0.9800), and external predictive ability (Q2EXT = 0.9800). The QSAR model for avian species was stable (R2 = 0.8931), robust (Q2LOO = 0.6821) and showed good predictive ability in toxicity (Q2EXT = 0.6342).

For further accuracy validation in toxicity prediction of the developed models for terrestrial organisms, the model predicted LC50/LD50 concentrations and the measured LC50/LD50 concentrations were compared, as indicated in Fig. 1. Our results indicated that the difference between the model predicted EC50 or LC50 values and the measured EC50 or LC50 results of all the chosen four amide herbicides (benzadox, cyprazole, epronaz, and CDEA) were lower than 5%. The deviations of the model predicted EC50 or LC50 values varied from 2.6 to 4.3 and from 1.9 to 4.6, respectively, for earthworm and avian species. The good consistency between the measured LC50/LD50 concentrations and model predicted LC50/LD50 concentrations supported the excellent predictive potential of the developed two models for terrestrial organisms.

The Effects of the Molecular Descriptors on Biotoxicity

In this study, five validated mathematical QSAR models were established for the toxicity prediction of amide herbicides. The molecular descriptors involved in these models can provide explanations and mechanisms of the toxicity caused by amide herbicides. It is possible to gain some insights into the interrelation of molecular structure and toxicity of amide herbicides through these molecular descriptors, which could provide solid foundation for the toxicity prediction and risk assessment.

The Molecular Descriptors Involved in the Models

Our results showed that electrical, thermodynamic and steric descriptors were included in the developed QSAR models, which made statistically significant contributions to the toxicity of amide herbicides. From the QSAR models for aquatic organisms in Table 4, eleven molecular descriptors, including seven electrical parameters (axx, azz, Qxx, Qyy, qN, EHOMO, and qH+), three thermodynamic parameters (Sθ, Cv, and Hθ), and one steric parameter (Vm), were associated with algal toxicity of amide herbicides. Eight molecular descriptors, including six electrical parameters (μ, axx, ayy, azz, ELUMO and q) and two thermodynamic parameters (ZPVE and Sθ), were related to daphnia toxicity of amide herbicides. Six molecular descriptors, involving two electronic parameters (μ and azz), three thermodynamic parameters (Cv, Sθ, and ZPVE), and one steric parameter (Vm), were connected with fish toxicity of amide herbicides. However, no influence of hydrophobic parameter was found over the aquatic toxicity of amide herbicides. The thermodynamic descriptor Sθ and the electrical descriptor azz were observed to be important molecular parameters affecting the toxicity of amide herbicides on all the three investigated aquatic organisms (daphnia, algae and fish), which respectively accounted for 11–37% and 5–16% of the weight in all the influencing molecular descriptors (Fig. 2).

Fig. 2
figure 2

The effects of molecular descriptors (involved in the developed QSAR models) on the toxicity of amide herbicides on algae, daphnia, fish, earthworm and avian species

There were differences in the molecular descriptors that associated to the toxicity of amide herbicides on different terrestrial animals, compared with aquatic organisms (Table 4). From the molecular descriptors involved in the QSAR models for terrestrial animals, ten molecular descriptors including seven electrical parameters (Qxx, Qyy, q, qN, ELUMO, ayy and azz), two thermodynamic parameters (ZPVE, Hθ) and one steric parameter (Vm) were observed to be associated with the earthworm toxicity of amide herbicides. Eight molecular descriptors, including five electrical parameters (ELUMO, q, qN, Qxx and EHOMO) and three thermodynamic parameters (ZPVE, Sθ and Hθ), were relevant to the toxicity of amide herbicides on avian species. Hθ, ZPVE, Qxx, qN, q and ELUMO were observed to be important molecular descriptors affecting both the toxicity of amide herbicides on both earthworm and avian species. As shown in Fig. 2, the thermodynamic descriptors Hθ and ZPVE accounted for 14–19% and 12% of the weight for all the influencing molecular descriptors. ZPVE was also observed to be an important factor influencing the aquatic toxicity of amide herbicides, which accounted for 24% and 9% of the weight in all the influencing molecular descriptors, respectively, for daphnia and fish. Additionally, Sθ was found to affect the toxicity of terrestrial organisms (2% weight) as well as aquatic organisms.

The Underlying Mechanism of the Structure Descriptors Related to Biotoxicity

The molecular descriptors involved in the developed QSAR models demonstrate the mechanism underlying the toxicity of amide herbicides. Previous studies have reported that physicochemical, electrical, thermodynamic and steric properties are important factors influencing toxicity for many chemicals (such as phenols, organic phosphorus, benzenes, chlorophenols, PCBs (Duchowicz et al. 2008; Zvinavashe et al. 2009), chlorophenols, organophosphorus pesticide and aldehyde (Hadanu et al. 2015). In our study, the obtained results also indicated that the associated descriptors with toxicity of amide herbicides were related to the electrical, thermodynamic and steric properties. Electrical and thermodynamic properties had a larger impact on the toxicity of amide herbicides than steric properties.

Our results supported that the toxicity of amide herbicides was closely related to the molecular polarity of the herbicide molecules, as reflected by the sixteen electrical, thermodynamic and steric descriptors involved in the QSAR models. The electrical molecular descriptors (axx, ayy, azz, qN, q, and μ) and the thermodynamic molecular descriptors (Cv and Sθ) characterize the polarizability properties of molecule. axx, ayy, and azz indicate the weight of polarizability in x, y, and z directions (Yang et al. 2021). qN represents the charge of the most electronegative atom in a compound (Qiu et al. 2013). q is related to atomic charge that influencing the binding of chemicals to the active site as well as the ability to form hydrogen bonds with biological receptors and therefore potentially affecting their toxicity. μ characterizes the average charge separation in a molecular system (Walker et al. 2002), which has negative contribution toward the toxicity as evidenced by the negative regression coefficient in the models of daphnia, fish and earthworm, which is consistent with previous reports. Cv indicates the constant capacity heat capacity of a chemical, which is directly proportional to the molecular polarity and the toxicity (Zuriaga et al. 2019). Sθ is the standard entropy representing the disorder degree of molecular reaction system of a compound which determines the difficulty of chemical reaction and thus has an impact on biological toxicity. Generally, the higher the Sθ value, the greater the biological toxicity (Ding et al. 2009). Previous studies have reported that chemicals might be more toxic with the increase of molecular polarity (Zhang et al. 2011). The polarizability represents the ability of a compound molecule to deform under an applied electric field and affects the interaction of electrons between the compound and atoms or molecules at the reaction site (Su et al. 2010). The higher the polarizability of the compound molecule, the deformation of the compound molecule in the corresponding direction of three-dimensional space will be enhanced, resulting in the increase in molecular polarity and the enhancement of toxicity (Wang et al. 2005).

The obtained results also indicated that the toxicity of amide herbicides was affected by the gain and loss of electrons. Energy of highest occupied molecular orbit (EHOMO) and Energy of lowest unoccupied molecular orbit (ELUMO) are electrical parameters describing energy of molecular orbital of chemicals, which respectively reflect the electron supply and loss capacity of a compound (Zhang et al. 2019). EHOMO and ELUMO relate with the sensitivity of compound molecules to external electrophilic/nucleophilic attack (Jiang et al. 2015). In this study, amide herbicides with larger energy of molecular orbital (higher EHOMO and ELUMO) produced higher toxicity to environmental organisms, which was consistent with previous discoveries (Clare 2004). The greater the EHOMO values of one chemical, the stronger its loss of electrons and reducing power and the greater its binding affinity for receptors and thus the lower its biological toxicities (Sun et al. 2013). The higher the ELUMO value of one chemical, the stronger its gain of electrons and electrophilic ability and thus the greater its biological toxicity (Walker et al. 2002).

The results also showed that the toxicity of amide herbicides was associated with the electrostatic inductions across the herbicides molecules and the organisms. The electronic molecular descriptors (Qxx, Qyy, and qH+) relate to electrostatic inductions. Qxx and Qyy characterizes the non-spherical symmetry of three-dimensional charge distribution, which higher Qxx and Qyy are beneficial to form electrostatic inductions and thus increase the possibility of biological toxicity (Mhin et al. 2002). Our results indicated that Qxx and Qyy has a majority impact on the toxicity of amide herbicides on both aquatic and terrestrial organisms, which was consistent with previous reports (Zhang et al. 2008). qH+ represents the maximum charge of hydrogen atoms in a compound which could influence the electrostatic attraction and thus affect the toxicity of chemicals (Niu and Yu 2004). Our results supported that the increase of qH+ had a positive influence in decreasing the algal toxicity of amide herbicides.

Molecular volume was observed to be another important factor affecting the toxicity of amide herbicides in this study. Vm is Van der Waals volume that expresses the volume of per unit compound molecule, which may influence the biological toxicity by affecting other properties of a chemical, such as water solubility (Liu et al. 2015). It is reported that there is a toxicity threshold Vm in every compound system. The toxicity of a compound molecule enhances with the increase of its Vm but declines when exceeding the toxicity threshold Vm of this compound (Nohair et al. 2009). Our results showed that as Vm of amide herbicides increased, the EC50 value of amide herbicides decreased and the toxicity enhanced.

The water partition coefficient of octanol parameters (LogKow) characterizes the hydrophilic and hydrophobic properties of a chemical and is consistent with its lipid solubility (Wu et al. 2020). LogKow is an important physicochemical parameter affecting the biological toxicity of a chemical, which has been mentioned in many chemicals such as polycyclic aromatic hydrocarbons (PAHs), nitrobenzene, dioxin and phenols (Bellifa and Mekelleche 2016; Ha et al. 2019). However, in this study, no significant effect was observed of LogKow on toxicity of amide herbicides. Other influencing factors such as the actual effective concentration of amide herbicides on the target sites in the organisms may also affect the biotoxicity, which has been verified in previous study (Qiu et al. 2013).

Conclusions

The developed QSAR models showed excellent performance in predicting the biotoxicity of amide herbicides, supporting as an alternative approach to expensive laboratory toxicity tests. The QSAR relationship between electrical, thermodynamic, steric properties and toxicity can be easily interpreted with respect to potential mechanistic explanations of their effects on biotoxicity of amide herbicides. Strong association of electrical descriptors with the biotoxicity suggested electrical descriptors as the best predictive parameters for the biotoxicity prediction of amide herbicides.