Abstract
Toxin–antitoxin (TA) modules are part of most bacteria’s regulatory machinery for stress responses and general aspects of their physiology. Due to the interplay of a long-lived toxin with a short-lived antitoxin, TA modules have also become systems of interest for mathematical modelling. Here we resort to previous modelling efforts and extract from these a minimal model of type II TA system dynamics on a timescale of hours, which can be used to describe time courses derived from gene expression data of TA pairs. We show that this model provides a good quantitative description of TA dynamics for the 11 TA pairs under investigation here, while simpler models do not. Our study brings together aspects of Biophysics with its focus on mathematical modelling and Computational Systems Biology with its focus on the quantitative interpretation of ’omics’ data. This mechanistic model serves as a generic transformation of time course information into kinetic parameters. The resulting parameter vector can, in turn, be mechanistically interpreted. We expect that TA pairs with similar mechanisms are characterized by similar vectors of kinetic parameters, allowing us to hypothesize on the mode of action for TA pairs still under discussion.
Similar content being viewed by others
Introduction
The vast majority of free-living bacteria contain a number of toxin–antitoxin (TA) gene pairs1,2,3,4. The toxin products target key cellular functions inhibiting cell growth and eventually leading to cell death, while the corresponding antitoxin neutralizes the toxin’s effect, thus, forming a TA system whose accurate expression regulation is vital to the survival of the cell5. These TA systems are currently classified in six groups (types I, II, III, IV, V, VI)2 according to the mechanism used by the antitoxin to neutralize the toxin. Types I-III are considered to be well-established TA systems3,6,7,8,9 while types IV-VI consist of newly discovered types10,11,12,13,14. Type II TA systems are the largest and best studied TA system class. Type II antitoxins are proteins. They typically have two domains, one that binds DNA and a second that binds and inhibits the activity of the cognate protein toxin2,3,9. The presence of TA systems is considered to be associated to persistence, i.e. the multidrug tolerance of bacteria, which obviously compromises the effectiveness of antibiotics on many pathogenic bacteria15. It is believed4,15,16 that when antibiotics are applied, a small sub-population of bacteria, called persisters, enters a dormant, non-dividing state and thus are protected from being killed. Experiments have shown a connection between persister formation and the competition between a toxin and its antitoxin inside an E. coli cell. Toxins inhibit cell growth and most antibiotics target the cell during the growth phase. Cells entering this persistent state seem to be immune to antibiotics but this immunity is different from the one obtained through advantageous mutations that result in antibiotic resistance since it is not permanent or inherited17. Knowledge about TA systems in bacteria is still accumulating18. This is true for the discovery of new TA modules19, their classification5,20, their functional roles21,22,23,24 as well as their detailed molecular mechanisms25. Very recently for example, it was discovered that the type II TA system PrpT-PrpA of the Pseudoalteromonas rubra plasmid, directly controls plasmid replication. It seems that the antitoxin PrpA binds to the iterons in the origin of replication (Ori), interfering with the binding of RepB to the Ori and, thus, preventing overreplication of the plasmid26.
In E. coli, there are more than ten well-characterized type II TA systems1. These include relE-relB, yafQ-dinJ, yoeB-yefM, hipA-hipB, yafO-yafN, hicA-hicB, higB-higA, ypjF-yfjZ, mqsR-mqsA, ymcE-gnsA and ydaT-ydaS10,27,28,29,30,31,32,33,34,35,36,37. The genomic location of each of these TA systems is indicated in Fig. 1. It is of considerable practical importance to understand the dynamics of TA systems and several plausible models for TA dynamics and persister formation have been proposed (see, for example38,39,40 and references therein). It is also important that the proposed model predictions are compared to, nowadays available, high-throughput data. In this paper, we present a minimal model for the description of TA type II dynamics in E. coli. The basic characteristics of the minimal model is that it assumes: (a) regulation of toxin and antitoxin production rate by means of a negative feedback through DNA binding of the TA complex (b) toxin induced growth rate modulation. The model’s predictions are compared to the RNA-Seq gene expression data published in41 (see Results and Discussion).
TA dynamics have been of interest to mathematical modelling for a long time. So far, the focus of research has been on the basic dynamical properties of TA modules39,40,42,43 and the synchronization of multiple TA modules in response to environmental stimuli (e.g.,44), rather than the agreement with high-throughput data. For high-throughput data, in particular gene expression patterns, the dominant avenue of research has been to compare these patterns with large-scale regulatory networks or classes of regulatory mechanisms. In the case of bacterial gene regulation, successes have been understanding and experimentally confirming the role of small regulatory devices like feedforward loops45,46, the discovery of an interplay the regulatory network and chromosomal structure47,48,49,50 and the organization of gene expression along the axis from the origin (OriC) to the terminus (Ter) of replication50.
TA systems are often embedded in an intricate network of regulatory processes5 and part of functional regulatory modules51. There is evidence of collective behaviors arising from the interplay between TA systems. Such a model of coupled TA systems has for example been studied in21 and in44. Simple ordinary differential equation (ODE) models of (type-II) TA systems have for example been formulated in21 with an emphasis on coupled systems and the spontaneous switching occurring in stochastic dynamics, in40, where conditional cooperativity of the RelBE system has been studied and its response to environmental stimuli (e.g., nutritional stress), in52, which contains a simplified system capable of excitable dynamics, as well as in39 and44 with a focus on bistability. For type-I TA systems, a mathematical model has been developed in53, offering insight in time scales involved.
Here we study the long-term dynamics of TA pairs in time-resolved RNA-Seq data for E. coli. Our question is, whether the dynamics of all TA pairs in the data can be described by the same model, or whether qualitatively different models have to be assumed for the different TA modules.
Methods
Figure 2 shows a schematic of the basic characteristics of the minimal model of type II TA gene expression. Toxin T and antitoxin A are expressed by neighbouring genes. It is known1,39 that toxins are more stable than the antitoxins, thus, the latter have to be constantly expressed in order to neutralize the toxin effects. The toxin and antitoxin form a complex AT which inhibits toxin and antitoxin production. More complex TA interaction (such as conditional cooperativity39,40 or cooperation between multiple TA systems17) are not included in the minimal model. Moreover, the presence of toxin has an inhibitory effect on the cell growth. This last fact is found to be an essential characteristic of an acceptable minimal model.
We denote the concentration of the antitoxin A with the variable \(y_{1}\), that of the toxin T with \(y_{2}\) and, finally, the concentration of the TA complex AT with \(y_{3}\). The system of ordinary differential equations (ODEs) that describes the system is:
Equation (3) is a standard chemical kinetics equation. We assume that the production rate of the complex \(y_3\) is proportional to the product of the concentrations of \(y_1\) and \(y_2\), thus the term \(k_{3} y_{1} y_{2}\) where \(k_3\) is the respective rate constant. We also assume that the complex degrades to its constituents A and T with a rate constant \(d_3\). To be precise, the rate constants \(d_1, d_2, d_3\) are considered to be a sum of 2 terms due to a. protein degradation (specific destruction by specialized proteins in the cell) and b. dilution (the reduction in concentration due to the increase of cell volume during growth)54. This is the standard way of dealing with cell growth in the mathematical modeling of bacterial gene expression and is adequate in steady-state models. However, in the context of this work, since the abundance of free toxin can directly affect growth rate (and thus dilution), dilution cannot be properly characterized using a fixed number. Thus, the above model and, for that matter all other models in the scientific literature we are aware of, do not fully considered the effect of bacterial growth.
The inhibitory action of the AT complex is modelled through the inclusion of negative feedback terms such as \(k'_{1}/\left( 1 + \frac{y_{3}}{s'_{1}}\right)\) in Eq. (1). The existence of toxin T in the cell reduces all protein production and decreases protein dilution by decreasing cell growth. Thus, the toxin concentration will have an inhibitory impact on the production rates of toxin, antitoxin, and on the cellular growth rate. We introduce an inhibition factor \(1/(b'_{m} y_{2} + 1 )\) in Eqs. (1)–(2). The parameter \(b'_m\) represents the redaction of protein expression due to the presence of toxin molecules. We also assume that growth inhibition will influence the toxin degradation rate, and we introduce a factor \((b'_{c} y_{2} + 1)\) that modulates the toxin degradation rate in Eq. (2), while we assume that the degradation rate of the free antitoxin remains the same. This is in agreement with a recent finding from55 that importantly, although free antitoxin is readily degraded in vivo, antitoxin bound to toxin is protected from proteolysis, preventing release of active toxin.
However, Eqs. (1)–(3), if one includes the unknown initial conditions for the quantities \(y_1, y_2, y_3\) at \(t=0\), contain 13 adjustable parameters. Our aim is to estimate the model parameters using experimental RNA-Seq data obtained from41. These experimental data (10 data points for each toxin antitoxin pair) would render such an estimation problematic, since such a model is structurally unidentifiable56.
In order to reduce the number of adjustable parameters we rescale the unobserved variable \(y_3\) by setting \(y_3 = (k'_2/d_3)z_3\) and rescale the variables \(y_1, y_2\) by the same factor \(\beta = k'_2\), i.e. by setting \(y_1 = k'_2 z_1\) and \(y_2 = k'_2 z_2\). Thus, we arrive at a system of ODEs for the rescaled variables \(z_1, z_2, z_3\) which is:
where the new kinetic constants are related the those in Eqs. (1)–(3) by the relations \(k_1 = k'_1/k'_2, s_1 = d_3 s'_1/k'_2, b_m = b'_m k'_2, k_2 = k'_2 k_3, s_2 = d_3 s'_2/k'_2, b_c = b'_c k'_2\). Moreover, we assume that \(z_1\) and \(z_2\) at time \(t = 0\) are equal to zero and allow the unobserved complex concentration \(z_3(0)\) to be equal to a constant \(c_0\) which will be determined from the fitting of the solution of Eqs. (4)–(6) to the data. Henceforth, we will refer to the above model (Eqs. (4)–(6)) as the Z-model. The model is essentially a rescaled version of the model proposed in39,40 with the additional assumption that the antitoxin bound to toxin is protected from proteolysis.
Our numerical investigations have shown that the Z-model (Eqs. (4)–(6)) is the simplest model able to represent the complete set of the experimental data that we have in our disposal with reasonable accuracy. Omission of any of the above basic ingredients of the model (e.g. setting \(b_m\) and \(b_c\) equal to zero) leads to plausible models, which may describe adequately the time evolution of the concentrations of some TA pairs, but fail to describe the expression of the entire set. It is obvious to the reader that the Z-model and its variants that we examine in this manuscript are deterministic models. We will not deal with the important topic of investigating a stochastic variant of the Z-model through a Monte Carlo approach based on the Gillespie algorithm. Our modeling decision is based on the fact that the RNA Seq data that we will use to fit the model parameters are not single cell sequencing data. As one can see in the detailed description of the experimental data used in this study, each RNA seq “read” represents multi-cell averages on a timescale of hours. Of course for single cell RNA seq experiments a stochastic modelling approach would be more appropriate although admittedly much more difficult. There is, however, important progress in the direction of using stochastic models and the inference of parameter values from noisy data, see for example57. Bulk RNA-Seq data have clear limitations regarding such mechanistic interpretations. When technology advances (see, e.g.58 for an important step in this direction) and time-resolved single cell experiments are readily available, we envision that repeating our analysis could provide further valuable insights. In this case, however, it is known that, on a single cell level, mRNA and protein concentrations do not correlate well59. Repeating our analysis on a single cell level would then require time-resolved proteomics data.
For our analysis we used experimental RNA-Seq data obtained from41 (GEO accession number: GSE65244). The RNA Seq data used here are for the wild-type(wt) strain and obtained after the culture growth in rich medium during the stationary phase. The system of Eqs. (4)–(6) was solved numerically with custom code written in Python using the scipy python module60. Fitting of the numerical solutions of the ODE’s was performed as part of the code using the Nelder-Mead minimization algorithm as implemented in scipy. Since the task of performing fits for all TA pairs and all model variants is quite demanding the code was parallelized using the dask.distributed python module. All numerical simulations were performed on a workstation equipped with 2 Intel Xeon Gold 6140 Processors (72 cpu cores in total).
Results
Figure 3 shows the concentrations of toxin and antitoxin for 11 known TA pairs of E. coli as a function of time. Symbols represent experimental RNA-Seq data obtained from41 (GEO accession number: GSE65244). The above list is exhaustive meaning that it includes all the TA pairs for which there are experimental measures in the dataset. All data have been rescaled (multiplied by the same constant \(c=10^{5}\) in order to avoid numerical errors during the fitting process). Lines are the numerical solutions of the ODE system, Eqs. (4)–(6). The kinetic constants of the system were estimated so that the weighted sum of the squared differences between the experimental data and the model predictions becomes minimum. We calculate weighted least squares since we have to fit two different experimental curves simultaneously whose y-axis values may differ considerably. Thus, we first calculate the mean values for each curve and then the weighted sum of the squared differences. Otherwise, curves with low mean values are practically ignored during the fitting process. Thus, the lines represent the “best” fit of the model to the data. We observe a very good agreement between the model predictions and the experimental data. As mentioned above, we assume that \(z_1\) and \(z_2\) at time \(t = 0\) are equal to zero. This is a rather harsh, and possibly unrealistic, condition to impose. If more data points were available the more natural and appropriate choice would be to use the RNA seq measurements of the earliest available timepoint as our initial conditions. This is indeed the approach we took in our analysis in Appendix B (Supplementary Materials). We should point out, however, that since the same initial condition is imposed to all TA pairs and since there is no indication that the TA systems will exhibit chaotic dynamics—which is known to be rare in chemical systems, requiring rather special conditions—we do not have any reason to expect sensitivity of the dynamics to the initial conditions and, thus, we do not believe that our choice to affect the accuracy of the model. An additional analysis in Appendix B, where a different choice of initial conditions has been adopted, i.e. the average concentration across all measurements, seems to support such a claim.
Figure 4 shows a box plot of the model parameters estimated from the best fit of the ODE system, Eqs. (4–6), to the RNA-Seq data. Each box shows the “dispersion” of eleven values, one per TA pair. We observe a wide distribution of parameter values across the different TA pairs. This is rather common in biological systems, where the kinetic constants of various metabolic reactions can differ by several orders of magnitude. Therefore, the same underlying differential equations lead to quite different dynamics precisely due to the broad range of the kinetic constants. In Appendix A we include a detailed discussion of the estimated covariances and standard deviations of the fitting parameters (see also the attached files in supplementary materials).
Figure 5 shows in a log-linear plot the toxin, antitoxin and TA complex concentrations as a function of time for the 11 known TA pairs of E. coli. Solid lines show the result \(z_1(t)\) of the numerical solution of the ODE system, Eqs. (4)–(6), for the antitoxin. Dashed lines show the corresponding variable \(z_2(t)\) for the toxin. Dotted lines show the corresponding variable \(z_3(t)\) for the TA complex. We observe a variety of different dynamics, but interestingly enough in all cases the complex concentration \(z_3\) seems to be lower than that of both the toxin and the antitoxin. For the majority of cases the antitoxin concentration is higher than that of the toxin. There are, however, exceptions, namely the relB-relE, mqsR-mqsA and the ymcE-gnsA pairs. The ydaT-ydaS pair also exhibits higher toxin expression for the most part of the observation time and only at the final stage the toxin level drop below that of the antitoxin. It is also quite intriguing that the Z-model predicts expression states where the toxin is constantly quite higher than the antitoxin (e.g. ymcE-gnsA) without resorting to the mechanism of conditional cooperativity2,39, although it is quite well-established that certain TA pairs (e.g. the relB-relE pair) exhibit conditional cooperativity and, obviously, such effects are not accounted for in the Z-model.
Next, we are interested in examining simpler versions of the proposed model and assessing their ability to describe the experimental data. We compare the Z-model to 7 simpler (i.e. with less adjustable parameters) variants, which we obtain from Eqs. (4)–(6) by forcing constraints on some of the constants, i.e. by fixing their numerical value or by setting them numerically equal to other constants. We describe these simpler variants below:
-
Model “s1=s2” is obtained by forcing the constants \(s_1\) and \(s_2\) to have the same numerical value.
-
Model “s1=s2 no bm” is obtained by forcing the constants \(s_1\) and \(s_2\) to have the same numerical value and by dropping the \(b_m\) constant, i.e. setting \(b_m = 0\).
-
Model “s1=s2 no bc” is obtained forcing the constants \(s_1\) and \(s_2\) to have the same numerical value and by setting \(b_c = 0\).
-
Model “s1=s2 no bm bc” is obtained by forcing the constants \(s_1\) and \(s_2\) to have the same numerical value and by setting both \(b_m =0\) and \(b_c = 0\).
-
Model “s1!=s2 no bm” is obtained by setting \(b_m = 0\). Note that now constants \(s_1\) and \(s_2\) are allowed to have different numerical values.
-
Model “s1!=s2 no bc” is obtained by setting \(b_c = 0\).
-
Model “no s1 s2 bm bc” is the simplest variant and is obtained from the Z-model ODEs by setting \(s_1=1, s_2=1, b_m =0, b_c = 0\).
Models, where the parameter \(b_m\) is identically zero, do not take into account the reduction of protein expression due to the existence of toxin, while variants, where the parameter \(b_c\) is identically zero, ignore the effect of growth inhibition. Figure 6 shows the minimum values of the objective function (i.e. the sum of weighted squared differences between model predictions and the experimental data) for all TA pairs and for the 7 model variants described above. The objective function values depend on the values of the experimental data which differ considerably between different TA pairs, thus the noticeable difference in the y-axis scales of Fig. 6.
The objective function of the Full Z-model is always lower than that of the variants, as expected. We should also mention that the algorithms (basinhopping in combination with a local Nelder-Mead algorithm) used for the minimization of the objective function are guaranteed to find local, not global, minima. Although we have performed a rather extensive search of the parameter space, there is always the chance that there are sets of parameters that will lead to lower values of the objective function than those reported here. We see that there are TA pairs for which simpler variants are capable of fitting the data with results comparable to those of the Full Z-model. However, the Full Z-model is the appropriate choice if one wants to describe the expression of the entire set of TA pairs.
Since we want to compare models with different numbers of parameters, it might be plausible to examine two widely used model selection criteria, the Akaike Information Criterion (AIC) and the Bayesian Information Criterion (BIC) for the Full Z-model and its seven variants. These are calculated as follows:
where \(\chi ^2\) stands for the sum of the squares of the residuals (i.e., the objective function discussed above), N is the number of data points (common for all model variants) and \(N_{v}\) is the number of adjustable parameters for each model. \(N_{v}\) is different for each variant. The full Z-model has the highest value, i.e. \(N_{v} = 10\). The most appropriate model is considered to be the one with the lower AIC or BIC value since both these criteria penalize the a large \(N_{v}\) number and reward a low objective function. Generally, the Bayesian information criterion is considered the most conservative of the two statistics. Figure 7 shows the AIC and BIC for the “collective” description of the TA gene expression set, i.e. when we describe the complete set of TA-pair with \(N = 10*11 = 110\) data points and \(\chi ^2\) is the sum of the objective functions of all the TA pairs.
Finally, it is helpful to compare the values of the constants that we obtained from the minimal ODE model for the different TA pairs. To this end we may view them as a “vector” characterizing the TA pair and we use an unsupervised learning method, namely a Principal Component Analysis (PCA), a statistical procedure that uses an orthogonal transformation to convert a set of observations of possibly correlated variables into a set of values of linearly uncorrelated variables called principal components61. PCA is routinely applied to experimental measurments directly for reasons of dimensionality reduction. Using PCA, however, to interpret the parameters of a deterministic ODE model consists a novel approach which has been recently used to interpret the parameters of a fractal kinetics SI model of Covid-19 spreading62. Figure 8 shows a plot of the two largest PCA components.
Typically in a PCA plot we try to identify clusters and perceive them as an indication of similar underlying causal behavior. For cluster identification, to avoid subjectivity, we applied a clustering identification algorithm i.e. DBSCAN with parameter \(eps = 0.8\)63. For DBSCAN the number of clusters is not predefined but decided by the algorithm. Here, the clustering algorithm has identified one cluster of 7 TA pairs, namelydinJ-yafQ, relB-relE, yafN-yafO, higA-higB, hipB-hipA, hicB-hicA, and mqsA-mqsR, which form a large central cluster, and four outliers i.e. the three pairs yefM-yoeB, ydaT-ydaS, and ymcE-gnsA, which have a negative PC2 component, and yfjZ-ypjF with relatively large PC1 and PC2 values.
In Table 1 we summarize this distinction between a main cluster and several outliers, together with the associated functional classification of the TA pairs. This distinction can serve as a starting point for comparing this statistical result with the wealth of biological information available for each of these TA modules. For the TA module hipB-hipA for example, the mode of action has been debated over the last years64,65, but is still not clear1. The similarity of estimated parameters to higA-higB, hicB-hicA and other members of the main cluster may be seen as evidence of a functional classification of this TA system as RNA interferases and guide further attempts of functional elucidation, in particular a better understanding of superfamilies of type-II TA systems66.
Appendix B (Supplementary Materials) contains the results for another time-resolved gene expression data set, namely the data from72 which are available at GEO (accession number: GSE131992).
In Appendix C (Supplementary Materials), we present in tabular form the biological information relevant to the members of the clusters identified in Fig. 7 as obtained from The Universal Protein Resource (UniProt), a comprehensive resource for protein sequence and annotation data (https://www.uniprot.org).
Conclusions
We have proposed a minimal model that is able to capture the dynamics of TA systems in E. coli and agrees with experimental high-throughput RNA-Seq data reasonably well. We find that a minimal acceptable model of TA regulation should at least include a negative feedback loop through a TA pair formation and the effect of toxin induced growth modulation. Despite the obvious over-simplifications of the model, e.g. we study each TA pair in isolation, and we do not account for the influence on cell growth due to the remaining toxin proteins, the model is able to replicate a variety of experimental curves.
With the availability of more time-resolved high-quality gene expression data, the description of time courses of systemic components with the help of simple mathematical models can provide an important instrument for the interpretation of such high-throughput data and thus bridge the gap between Theoretical Biology, Statistical Physics and Systems Biology73.
Data availability
The datasets analysed and the custom code used during the current study are available from the corresponding author on reasonable request. They are also available for direct download from Zenodo at https://doi.org/10.5281/zenodo.5162947.
References
Yamaguchi, Y. & Inouye, M. Regulation of growth and death in Escherichia coli by toxin–antitoxin systems. Nat. Rev. Microbiol. 9, 779–790 (2011).
Page, R. & Peti, W. Toxin-antitoxin systems in bacterial growth arrest and persistence. Nat. Chem. Biol. 12, 208–214 (2016).
Pandey, D. P. & Gerdes, K. Toxin-antitoxin loci are highly abundant in free-living but lost from host-associated prokaryotes. Nucleic Acids Res. 33, 966–976 (2005).
Balaban, N. Q., Merrin, J., Chait, R., Kowalik, L. & Leibler, S. Bacterial persistence as a phenotypic switch. Science 305, 1622–1625 (2004).
Harms, A., Brodersen, D. E., Mitarai, N. & Gerdes, K. Toxins, targets, and triggers: An overview of toxin-antitoxin biology. Mol. Cell 70, 768–784 (2018).
Thisted, T., Sørensen, N., Wagner, E. & Gerdes, K. Mechanism of post-segregational killing: Sok antisense RNA interacts with Hok mRNA via its \(5^\prime\)-end single-stranded leader and competes with the \(3^\prime\)-end of Hok mRNA for binding to the mok translational initiation region. EMBO J. 13, 1960–1968 (1994).
Gerdes, K., Nielsen, A., Thorsted, P. & Wagner, E. G. H. Mechanism of killer gene activation. Antisense RNA-dependent RNase III cleavage ensures rapid turn-over of the stable hok, srnB and pndA effector messenger RNAs. J. Mol. Biol. 226, 637–649 (1992).
Brantl, S. & Jahn, N. sRNAs in bacterial type I and type III toxin–antitoxin systems. FEMS Microbiol. Rev. 39, 413–427 (2015).
Pedersen, K. & Gerdes, K. Multiple hok genes on the chromosome of Escherichia coli. Mol. Microbiol. 32, 1090–1102 (1999).
Brown, J. M. & Shaw, K. J. A novel family of Escherichia coli toxin–antitoxin gene pairs. J. Bacteriol. 185, 6600–6608 (2003).
Masuda, H., Tan, Q., Awano, N., Wu, K. P. & Inouye, M. YeeU enhances the bundling of cytoskeletal polymers of MreB and FtsZ, antagonizing the CbtA (YeeV) toxicity in Escherichia coli. Mol. Microbiol. 84, 979–989 (2012).
Wang, X. et al. A new type V toxin–antitoxin system where mRNA for toxin GhoT is cleaved by antitoxin GhoS. Nat. Chem. Biol. 8, 855 (2012).
Wang, X. et al. Type II toxin/antitoxin MqsR/MqsA controls type V toxin/antitoxin GhoT/GhoS. Environ. Microbiol. 15, 1734–1744 (2013).
Aakre, C. D., Phung, T. N., Huang, D. & Laub, M. T. A bacterial toxin inhibits DNA replication elongation through a direct interaction with the \(\beta\) sliding clamp. Mol. Cell 52, 617–628 (2013).
Balaban, N. Q. et al. Definitions and guidelines for research on antibiotic persistence. Nat. Rev. Microbiol. 17, 441–448 (2019).
Sneppen, K., Micheelsen, M. A. & Dodd, I. B. Ultrasensitive gene regulation by positive feedback loops in nucleosome modification. Mol. Syst. Biol. (2008).
Fasani, R. A. & Savageau, M. A. Molecular mechanisms of multiple toxin-antitoxin systems are coordinated to govern the persister phenotype. Proc. Natl. Acad. Sci. 110, E2528–E2537 (2013).
Fraikin, N., Goormaghtigh, F., & Van Melderen, L. Type II toxin–antitoxin systems: Evolution and revolutions. J. Bacteriol.202. (2020).
Leplae, R. et al. Diversity of bacterial type II toxin–antitoxin systems: A comprehensive search and functional analysis of novel families. Nucleic Acids Res. 39, 5513–5525 (2011).
Ghafourian, S., Raftari, M., Sadeghifard, N. & Sekawi, Z. Toxin–antitoxin systems: Classification, biological function and application in biotechnology. Curr. Issues Mol. Biol. 16, 9–14 (2014).
Fasani, R. A. & Savageau, M. A. Unrelated toxin–antitoxin systems cooperate to induce persistence. J. R. Soc. Interface 12, 20150130 (2015).
Gerdes, K. Hypothesis: Type I toxin–antitoxin genes enter the persistence field—A feedback mechanism explaining membrane homoeostasis. Philos. Trans. R. Soc. B Biol. Sci. 371, 20160189 (2016).
Kedzierska, B. & Hayes, F. Emerging roles of toxin-antitoxin modules in bacterial pathogenesis. Molecules 21, 790 (2016).
Massey, S. E. & Mishra, B. Origin of biomolecular games: Deception and molecular evolution. J. R. Soc. Interface 15, 20180429 (2018).
Ruangprasert, A. et al. Mechanisms of toxin inhibition and transcriptional repression by Escherichia coli DinJ-YafQ. J. Biol. Chem. 289, 20559–20569 (2014).
Ni, S. et al. Conjugative plasmid-encoded toxin-antitoxin system PrpT/PrpA directly controls plasmid copy number. Proc. Natl. Acad. Sci. 118(4), e2011577118 (2021).
Yamaguchi, Y., Park, J.-H. & Inouye, M. MqsR, a crucial regulator for quorum sensing and biofilm formation, is a GCU-specific mRNA interferase in Escherichia coli. J. Biol. Chem. 284(42), 28746–28753 (2009).
Takagi, H. et al. Crystal structure of archaeal toxin–antitoxin RelE-RelB complex with implications for toxin activity and antitoxin effects. Nat. Struct. Mol. Biol. 12, 327 (2005).
Zhang, Y., Zhu, L., Zhang, J. & Inouye, M. Characterization of ChpBK, an mRNA interferase from Escherichia coli. J. Biol. Chem. 280, 26080–26088 (2005).
Motiejūnaitė, R., Armalytė, J., Markuckas, A. & Sužiedėlienė, E. Escherichia coli dinJ-yafQ genes act as a toxin–antitoxin module. FEMS Microbiol. Lett. 268, 112–119 (2007).
Prysak, M. H. et al. Bacterial toxin YafQ is an endoribonuclease that associates with the ribosome and blocks translation elongation through sequence-specific and frame-dependent mRNA cleavage. Mol. Microbiol. 71, 1071–1087 (2009).
Kamada, K. & Hanaoka, F. Conformational change in the catalytic site of the ribonuclease YoeB toxin by YefM antitoxin. Mol. Cell 19, 497–509 (2005).
Zhang, Y. & Inouye, M. The inhibitory mechanism of protein synthesis by YoeB, an Escherichia coli toxin. J. Biol. Chem. 284, 6627–6638 (2009).
Keren, I., Shah, D., Spoering, A., Kaldalu, N. & Lewis, K. Specialized persister cells and the mechanism of multidrug tolerance in Escherichia coli. J. Bacteriol. 186, 8172–8180 (2004).
Korch, S. B., Henderson, T. A. & Hill, T. M. Characterization of the hipA7 allele of Escherichia coli and evidence that high persistence is governed by (p) ppGpp synthesis. Mol. Microbiol. 50, 1199–1213 (2003).
Zhang, Y., Yamaguchi, Y. & Inouye, M. Characterization of YafO, an Escherichia coli toxin. J. Biol. Chem. 284, 25522–25531 (2009).
Brown, B. L. et al. Three dimensional structure of the MqsR: MqsA complex: a novel TA pair comprised of a toxin homologous to RelE and an antitoxin with unique properties. PLoS Pathog. 5, e1000706 (2009).
Vandervelde, A., Loris, R., Danckaert, J. & Gelens, L. Computational methods to model persistence. In Methods in Molecular Biology, Methods in Molecular Biology Vol. 1333 (eds Michiels, J. & Fauvart, M.) 207–240 (Springer, 2016).
Cataudella, I., Sneppen, K., Gerdes, K. & Mitarai, N. Conditional cooperativity of toxin–antitoxin regulation can mediate bistability between growth and dormancy. PLoS Comput. Biol. 9, e1003174 (2013).
Cataudella, I., Trusina, A., Sneppen, K., Gerdes, K. & Mitarai, N. Conditional cooperativity in toxin–antitoxin regulation prevents random toxin activation and promotes fast translational recovery. Nucleic Acids Res. 40, 6424–6434 (2012).
Beber, M. E., Sobetzko, P., Muskhelishvili, G. & Hütt, M. T. Interplay of digital and analog control in time-resolved gene expression profiles. EPJ Nonlinear Biomed. Phys. 4, 8 (2016).
Gelens, L., Hill, L., Vandervelde, A., Danckaert, J. & Loris, R. A general model for toxin–antitoxin module dynamics can explain persister cell formation in E. coli. PLoS Comput. Biol. 9, e1003190 (2013).
Nikolic, N. et al. Autoregulation of mazEF expression underlies growth heterogeneity in bacterial populations. Nucleic Acids Res. 46, 2918–2931 (2018).
Tian, C., Semsey, S. & Mitarai, N. Synchronized switching of multiple toxin–antitoxin modules by (p) ppGpp fluctuation. Nucleic Acids Res. 45, 8180–8189 (2017).
Shen-Orr, S. S., Milo, R., Mangan, S. & Alon, U. Network motifs in the transcriptional regulation network of Escherichia coli. Nat. Genet. 31, 64 (2002).
Alon, U. Network motifs: Theory and experimental approaches. Nat. Rev. Genet. 8, 450–461 (2007).
Marr, C., Geertz, M., Hütt, M. T. & Muskhelishvili, G. Dissecting the logical types of network control in gene expression profiles. BMC Syst. Biol. 2, 18 (2008).
Travers, A., Muskhelishvili, G. & Thompson, J. DNA information: From digital code to analogue structure. Philos. Trans. R. Soc. A 370, 2960–2986 (2012).
Sonnenschein, N., Geertz, M., Muskhelishvili, G. & Hütt, M. T. Analog regulation of metabolic demand. BMC Syst. Biol. 5, 40 (2011).
Kosmidis, K., Jablonski, K. P., Muskhelishvili, G. & Hütt, M. T. Chromosomal origin of replication coordinates logically distinct types of bacterial genetic regulation. NPJ Syst. Biol. Appl. 6, 1–9 (2020).
Fang, X. et al. Global transcriptional regulatory network for Escherichia coli robustly connects gene expression to transcription factor activities. Proc. Natl. Acad. Sci. 114, 10286–10291 (2017).
Vet, S., Vandervelde, A., & Gelens L. Excitable dynamics through toxin-induced mRNA cleavage in bacteria. PLoS ONE14 (2019).
Himeoka, Y. & Mitarai, N. Modeling slow-processing of toxin messenger RNAs in type-I toxin–antitoxin systems: Post-segregational killing and noise filtering. Phys. Biol. 16, 026001 (2019).
Alon, U. An Introduction to Systems Biology: Design Principles of Biological Circuits (CRC Press, 2019).
LeRoux, M., Culviner, P. H., Liu, Y. J., Littlehale, M. L. & Laub, M. T. Stress can induce transcription of toxin–antitoxin systems without activating toxin. Mol. Cell (2020).
DiStefano, J. III. Dynamic Systems Biology Modeling and Simulation (Academic Press, 2015).
Cao, Z. & Grima, R. Accuracy of parameter estimation for auto-regulatory transcriptional feedback loops from noisy data. J. R. Soc. Interface 16, 20180967 (2019).
Blattman, S. B., Jiang, W., Oikonomou, P. & Tavazoie, S. Prokaryotic single-cell RNA sequencing by in situ combinatorial indexing. Nat. Microbiol. 5(10), 1192–1201 (2020).
Taniguchi, Y. et al. Quantifying E. coli proteome and transcriptome with single-molecule sensitivity in single cells. Science 329, 533–538 (2010).
Virtanen, P. et al. SciPy 1.0: Fundamental algorithms for scientific computing in python. Nat. Methods 16, 261–272 (2020).
Jolliffe, I. Principal Component Analysis 2nd edn. (Springer, 2002).
Kosmidis, K. & Macheras, P. A fractal kinetics SI model can explain the dynamics of COVID-19 epidemics. PLoS ONE 15, e0237304 (2020).
Schubert, E., Sander, J., Ester, M., Kriegel, H. P. & Xu, X. DBSCAN revisited, revisited: Why and how you should (still) use DBSCAN. ACM TODS 42, 1–21 (2017).
Germain, E., Castro-Roa, D., Zenkin, N. & Gerdes, K. Molecular mechanism of bacterial persistence by HipA. Mol. Cell 52, 248–254 (2013).
Hansen, S., Vulić, M., Min, J., Yen, T. J., Schumacher, M. A., Brennan, R. G., & Lewis, K. Regulation of the Escherichia coli HipBA toxin–antitoxin system by proteolysis. PLoS ONE7 (2012).
Guglielmini, J. & Van Melderen, L. Bacterial toxin–antitoxin systems: Translation inhibitors everywhere. Mobile Genet. Elem. 1, 283–306 (2011).
Armalytė, J., Jurėnaitė, M., Beinoravičiūtė, G., Teišerskas, J. & Sužiedėlienė, E. Characterization of Escherichia coli dinJ-yafQ toxin–antitoxin system using insights from mutagenesis data. J. Bacteriol. 194, 1523–1532 (2012).
Gerdes, K. In Type II Toxin-Antitoxins Loci: The relBE Family, pp. 69–92. Berlin, Heidelberg: Springer Berlin Heidelberg. (2013).
Hurley, J. M. & Woychik, N. A. Bacterial toxin HigB associates with ribosomes and mediates translation-dependent mRNA cleavage at A-rich sites. J. Biol. Chem. 284, 18605–18613 (2009).
Unterholzner, S. J., Poppenberger, B. & Rozhon, W. Toxin–antitoxin systems: Biology, identification, and application. Mobile Genet. Elem. 3, e26219 (2013).
Wei, Y., Zhan, L., Gao, Z., Privé, G. G. & Dong, Y. Crystal structure of GnsA from Escherichia coli. Biochem. Biophys. Res. Commun. 462(1), 1–7 (2015).
Lempp, M. et al. Systematic identification of metabolites controlling gene expression in E. coli. Nat. Commun. 10, 1–9 (2019).
Schureck, M. A. et al. Structural basis of transcriptional regulation by the HigA antitoxin. Mol. Microbiol. 111, 1449–1462 (2019).
Funding
Open Access funding enabled and organized by Projekt DEAL.
Author information
Authors and Affiliations
Contributions
K.K, M-T.H designed research, K.K. performed research, K.K, M-T.H analyzed the results, K.K, M-T.H wrote the manuscript.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Kosmidis, K., Hütt, MT. A minimal model for gene expression dynamics of bacterial type II toxin–antitoxin systems. Sci Rep 11, 19516 (2021). https://doi.org/10.1038/s41598-021-98570-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-021-98570-z
- Springer Nature Limited