Introduction

Recently the mathematical formalism of quantum mechanics and the ideology of operational description of observed data (elaborated by N. Bohr and W. Heisenberg) were successfully applied outside of physics; in psychology, cognitive science, economics (e.g. Khrennikov 2004; Busemeyer et al. 2006; Cheon and Takahashi 2010; Asano et al. 2010, 2011; Basieva et al. 2010), see the monograph (Khrennikov 2010) for the extended list of references. In 2010 the quantum-like behavior and information processing were found in microbiology and genetics (Basieva et al. 2010). We found experimental evidences, see Inada et al. (1996), demonstrating that the genome processes information nonclassically exhibiting behavior similar to the quantum interference. Hence, the powerful methods developed for quantum physics and quantum information, see e.g. Khrennikov (2010), can be applied even to the description of the mechanism of the genetic regulation in cells.

This paper contains “Appendix” with simple introduction to the quantum formalism and quantum information. We want to make this paper readable for mathematically oriented researchers in microbiology and genetics.

Our quantum-like model can be considered as a kind of the operational formalism for microbiology and genetics. Instead of trying to describe processes in a cell in the very detail (e.g., Wanke and Kilian 2009), we propose a formal operator description. Such a description may be useful in situation in which the detailed description of processes is impossible or extremely complicated. The usage of the operational formalism provides a possibility to obtain the main probabilistic predictions without constructing a detailed model.

Escherichia coli (E. coli) is a bacterium which has been studied well in biology. One of interesting phenomena in E. coli’s growth is the gene regulation of glucose/lactose metabolism. The energy for E. coli’s activity is produced by metabolizing sugar: glucose, lactose, etc. When E. coli is incubated in a test tube containing both glucose and lactose, E. coli will digest glucose first and lactose second. Many microbiological studies demonstrated that E. coli has a preference for carbon resources, that is, E. coli likes glucose better than lactose. Such preference is caused by the difference of the metabolism efficiency (or cost) between lactose and glucose (Inada et al. 1996) (since the operon theory was proposed in 1956–1961 (Jacob and Monod 1961), the regulatory system of gene expression of lactose operon has been extensively studied and the molecular mechanism of it has been mostly elucidated including the catabolite repression).

Recently, it was pointed out (Basieva et al. 2010) that the complex microscopic biological dynamics of E. coli’s metabolism violates one of basic law of classical probability theory—the law of total probability (playing a crucial role in Bayesian analysis). It was argued that such complex behavior can be described by models based on non-Kolmogorovian probability theory inspired by the quantum phenomena (Khrennikov 2010) (such models are called quantum-like models). In the present paper we shall show that starting with experimental statistical data it is possible to reconstruct basic operators of the quantum-like operational formalism for glucose/lactose regulation in genome.

An important element of our model is the usage of adaptive dynamics (Ohya 2008; Asano et al. 2007)—dynamics which depend nontrivially on biochemical context (Khrennikov 2010). The quantum formalism provides a possibility of the mathematical description of adaptive (contextual) dynamics. This formalism was very well developed in quantum physics and especially recently in quantum information theory (Ingarden et al. 1997; Khrennikov 2010; Ohya et al. 2011). Therefore, we describe the adaptive dynamics of the gene regulation in cells by theory of operators in complex Hilbert spaces. The notions from quantum information are adapted to the microbiological context. The most important for us are the notions of the information channel and lifting. The second one belongs to the advanced level of quantum information theory and its successful usage in the description of the regulatory gene activity is an important step in the invention of the methods of quantum information theory to microbiology.

We analyze statistical data obtained from experiments (Inada et al. 1996), and we compute the degree of E. coli’s preference within adaptive dynamics. It is known that there are several types of E. coli characterized by the metabolization system. We demonstrate that the same type of E. coli can be described by the well determined operators; we find invariant operator quantities characterizing each type. Such invariant quantities can be calculated from the obtained statistical data. We do not need difficult simulation of complex biochemical systems. We use only simple calculation based on adaptive dynamics and its operational representation. Our quantum-like model can explain E. coli’s adaptive behavior.

Violation of total probability law

In this section, we show that statistical data obtained from experiments on the gene regulation of E. coli metabolism violate the total probability law, which is one of basic laws in classical probability theory; see Basieva et al. (2010) for detail. We modify the probabilistic representation (Basieva et al. 2010) of statistical data from Inada et al. (1996) to operate solely with conditional probabilities (as it is done in the standard applications of the formula of total probability in, e.g., Bayesian analysis).

In order to digest lactose, E. coli must produce the enzyme of β-galactosidase. The amount of its production is regulated by a series of genes, called lactose operon. The activity of E. coli’s lactose operon is estimated by the value of Miller’s Units (MU), which is measured in the experiment of β-galactosidase assay (Inada et al. 1996).

Let us consider the measurement of MU in the different situations: E. coli grows in the test tube containing (1) only lactose; (2) only glucose; (3) both glucose and lactose. Table 1 shows the measured value of MU for the cases (1), (2) and (3). We use the wild type of E. coli (W3110) in data-(I) and -(II). The lactose/glucose concentration in data-(I) is different from that in data-(II). The MU values in the case of (3) are 43 and 64, respectively. It is clear from this result that the activities of β-galactosidase in case of (3) are as week as E. coli has strong preference for glucose.

Table 1 Results of β-galactosidase assay in different situations: E. coli grows in the test tube containing (1) only lactose; (2) only glucose; (3) both glucose and lactose

Such property of sugar metabolism in E. coli has been explained in biology as follows. E. coli cell has various transport systems including lactose permease (LacY) and glucose transporter (PtsG). They recognize their substrates (glucose or lactose). If there is either sugar in the medium, the sugar enters into cells with its concentration dependent manner. Therefore lactose outside the cell can induce lactose operon expression in concentration dependent manner, if there is no glucose outside: Gene expression apparatus recognizes the lactose concentration inside as high enough to induce mRNA synthesis.

On the other hand, if there is glucose outside at the same time with lactose, the cell can recognize glucose and lactose concentrations outside but, glucose transport process itself inhibits the lactose permease activity so the lactose concentration inside decreases. In this situation, the gene expression apparatus can not recognize the lactose concentration inside as high enough to induce mRNA synthesis.

This process can be quantitatively discussed with metabolic networks. However, the metabolic networks can be very large, so that, the construction of a mathematical model of the network and its analysis are a daunting task; most of the kinetic parameters are unavailable. Instead of such detailed model, we here explain an approach of coarse-grained model which include the process of the gene expression in the cell.

We focus on the behavior of one cell. There are many molecules of lactose or glucose around the cell. These molecules sometimes arrive at the surface of the cell, and they enter into the cell through the gate of a transport system. We call this event “E. coli’s detection”. In this experiment, we use two kinds of sugars, so that we consider an event system {LG}: L means the event that E. coli detects a lactose molecule around its cell; G means the E. coli detects a glucose molecule around its cell. We use only two kinds of sugars in this experiment, so that, we assume that P(LG) is equal to one.

Let P(L) and P(G) be probabilities of event L and G, respectively. It is clear that, in the case of (1) where there is only lactose in medium, these probabilities are given by P(L) = 1 and P(G) = 0. In the case of (2), P(L) = 0 and P(G) = 1 are given. In the case of (3), these probabilities are proportional to the concentrations of lactose/glucose. The probabilities for data-I are

$$ P(L)=\frac{0.4}{0.4+0.1}=0.8\quad\hbox{and}\quad P(G)=\frac{0.1}{0.4+0.1}=0.2. $$

The probabilities for data-II are

$$ P(L)=\frac{1.0}{1.0+1.0}=0.5\quad\hbox{and}\quad P(G)=\frac{1.0}{1.0+1.0}=0.5. $$

Let us consider another event system {+, −}: + means the event that E. coli activates its lactose operon, that is, the event that β-galactosidase is produced through the transcription of mRNA from a gene in lactose operon; − means the event that E. coli does not activates its lactose operon.

In the case of (1), the only event L occurs since there are only lactose in the medium. Therefore, we can give the conditional probability P(+|L) with MU values in (1), for example, we can give the probability P(+|L) for data-I as

$$ P(+| L)=\frac{2920}{3000}\approx 0.97 $$

and that for data-II as

$$ P(+| L)=\frac{2132}{2200}\approx 0.97. $$

Similarly, we can give the probability P(+|G) for each data with MU value in (2); P(+|G) for data-I is

$$ P(+| L)=\frac{33}{3000}\approx 0.01 $$

and P(+ |G) for data-II is

$$ P(+| L)=\frac{64}{2200}\approx 0.03. $$

Here, let us discuss the relation among the experiments of (1), (2) and (3). If we assume that the experiment (3) is a simple sum of (1) and (2), then the value of MU (1) × P(L) + MU (2) × P(G) should be equal to MU (3) where MU (n) denotes MU values in the case of experiment (n). However we can easily see that such relation does not hold for each data. We remark that the experiment (3) is not the simple sum of (1) and (2).

In classical probability theory, the following well-known formula of probabilities, called “total probability law”, should hold, Khrennikov (1999).

$$ P(+| L \cup G)=P(+| L)P(L)+P(+| G)P(G) $$
(1)

Here P(+|LG) denotes the probability of the event + if either L or G occurs in the case of (3). However, it is important to note that the probabilities calculated from obtained data do not satisfy the above formula of total probability law; we can calculate the value given by right-hand side of Eq. (1);

$$ P(+| L)P(L)+P(+| G)P(G) =\frac{2920}{3000}\times0.8+\frac{33}{3000}\times 0.2\approx 0.781 $$

for data-(I) and

$$ P(+| L)P(L)+P(+| G)P(G) =\frac{2132}{2200}\times0.5+\frac{64}{2200}\times 0.5\approx 0.499 $$

for data-(II).

Also we can calculate the value given by left-hand side of Eq. (1) in the case of (3). So that, we can give P(+|LG) for data-(I) as

$$ P(+| L \cup G)=\frac{43}{3000}\approx 0.01 $$

and that for data-(II) as

$$ P(+| L \cup G)=\frac{64}{2200}\approx 0.03. $$

These calculated probabilities are shown in Table 2.

Table 2 Probabilities calculated from the obtained data; L means the event that E. coli detects a lactose molecule around its cell; G means the E. coli detects a glucose molecule around its cell; + means the event that E. coli activates his lactose operon, and − means not

The value of left-hand side of Eq. (1) is much smaller than that of right-hand side of Eq. (1) for both data, that is, total probability law of the Eq. (1) is violated.

This violation of total probability law is well over the experimental errors. We can give the real values of MU (1), MU (2) and MU (3) with an error rate δ as MU (1)(1 ± δ), MU (2)(1 ± δ) and MU (3)(1 ± δ), respectively. If we assume that the total probability law is hold, then the following equation should be hold.

$$ MU_{(3)}(1\pm\delta)=MU_{(1)}(1\pm\delta)\times P(L)+MU_{(2)}(1\pm \delta)\times P(G) $$

From this equation, we obtain the error rate δ ≈ 0.959 for data-(I), and error rate δ ≈ 0.889 for data-(II). In general, we may have 20 % or sometimes 30 % standard deviations of the real value. However this calculation shows that over 85 % error rate is needed to satisfy the total probability law.

Here let us consider the opposite situation that E. coli has “no preference for any sugar”. Although such egalitarian E. coli is ideal and unrealistic, he must never distinguish lactose from glucose. Therefore this ideal and unrealistic E. coli must probabilistically produce β-galactosidase in proportion to the concentration of lactose. In this case, the experiment (3) is just simple sum of experiment (1) and (2), and total probability law is satisfied. We remark that the strength of E. coli’s preference is expressed as the difference between left-hand side and right-hand side in total probability law.

New mathematical law computing the probability in adaptive dynamics for glucose effect of E. coli

In this section, we briefly explain how to compute these probabilities with adaptive dynamics and lifting theory, see Ingarden et al. (1997), Ohya et al. (2011), Asano et al. (2007) for the details of lifting theory).

First, let us introduce the initial state \(\rho_0 = \vert x_0 \rangle \langle x_0 \vert \), the density operator in Hilbert space \(\mathcal{H}=\mathbf{C}^{2}\) (the qubit space). The state vector x 0 is written as

$$ \left\vert x_{0}\right\rangle =\frac{1}{\sqrt{2}}\left( \left\vert e_{1}\right\rangle +\left\vert e_{2}\right\rangle \right) =\frac{1}{\sqrt{2} }\left\{ \left(\begin{array}{c} 1\\ 0 \end{array}\right) + \left(\begin{array}{c} 0\\ 1 \end{array}\right) \right\} =\frac{1}{\sqrt{2}} \left(\begin{array}{c} 1\\ 1 \end{array}\right) $$

The basis {e 1 = (0, 1)Te 2 = (1, 0)T} describes the detection of lactose or glucose by E. coli, that is, the events L and G. The initial state ρ0 is the state of E. coli before the detection of the molecules. When the E. coli recognizes these molecules, the following state change occurs;

$$ \rho_{0}\rightarrow\rho_{D}\equiv\frac{D\rho_{0}D^{\ast}}{\hbox{tr} \left(\left\vert D\right\vert ^{2}\rho_{0}\right) }=\left( \begin{array}{cc} \left\vert \alpha\right\vert^{2}&\alpha\beta^{\ast}\\ \alpha^{\ast}\beta& \left\vert\beta\right\vert^{2} \end{array}\right), $$

where D is an operator given by a diagonal matrix

$$ D= \left(\begin{array}{ccl} \alpha & 0\\ 0 & \beta \end{array}\right) $$

with |α|2 + |β|2 = 1. The operator D represents E. coli’s adaptive effect for surroundings (concentration of lactose or glucose), and it is called detection operator. Note that |α|2 and |β|2 are nothing else than the probabilities for the events L and G, that is, P(L) and P(G). The state σ D  ≡ DD * means the distribution of P(L) and P(G). In this sense, the state σ D is derived from the solution concentrations of lactose and glucose. The state determining the activation of the operon in E. coli depends on the detection state ρ D . In our adaptive dynamical model such state is produced by the following state-change

$$ \rho_{D} \rightarrow\rho_{\rm op}\equiv\frac{Q\rho_{D}Q^{\ast} }{\hbox{tr}\left( \left\vert Q\right\vert ^{2}\rho_{D}\right)} =\frac{1}{\left\vert a\alpha+b\beta\right\vert ^{2}+\left\vert c\alpha+d\beta\right\vert ^{2}} \left(\begin{array}{cc} \left\vert a\alpha+b\beta\right\vert ^{2} &\left(a\alpha+b\beta\right) \left(c\alpha+d\beta\right) ^{\ast}\\ \left( a\alpha+b\beta\right)^{\ast}\left( c\alpha+d\beta\right) & \left\vert a\alpha+b\beta\right\vert ^{2} \end{array}\right), $$

where Q is an operator given with complex numbers a, b, c and d as

$$ Q= \left(\begin{array}{cc} a& b\\ c & d \end{array}\right). $$

This operator Q represents the state-change of lactose operon and we call it activation operator. The correlation between the activity of lactose operon and concentrations of lactose/glucose is described as the lifting

$$ {\mathcal{E}}_{D,Q}^{\ast}(\rho) =\Uplambda_{Q}^{\ast}\Uplambda_{D}^{\ast} \rho\otimes\Uplambda_{D}^{\ast}\rho =\rho_{\rm op}\otimes\rho_{D}, $$

with a map (channel) \(\Uplambda_{A}\rho=A\rho A^{\ast}/{\rm tr}\left\vert A\right\vert ^{2}\rho.\) One can define the joint probabilities P(+ ∩ L) and P(+ ∩ G) as

$$ \begin{aligned} P_{D}(+\cap L)&\equiv\hbox{tr}(E_{1}\otimes E_{1}){\mathcal{E}} _{D,Q}^{\ast}(\rho_{0})\;{\rm and}\;P_{D}(-\cap L)\equiv {\rm tr}(E_{2}\otimes E_{1}){\mathcal{E}}_{D,Q}^{\ast}(\rho_{0}),\\ P_{D}(+\cap G)&\equiv\hbox{tr}(E_{1}\otimes E_{2}) {\mathcal{E}} _{D,Q}^{\ast}(\rho_{0})\;{\rm and}\;P_{D}(-\cap G)\equiv {\rm tr}(E_{2}\otimes E_{2}) {\mathcal{E}}_{D,Q}^{\ast}(\rho_{0}), \end{aligned} $$

where E 1 and E 2 are projection operator given by

$$ E_{1}=\left\vert e_{1}\right\rangle \left\langle e_{1}\right\vert = \left(\begin{array}{cc} 1& 0\\ 0 & 0 \end{array}\right),\quad E_{2}=\left\vert e_{2}\right\rangle \left\langle e_{2}\right\vert =\left(\begin{array}{cc} 0 & 0\\ 0 & 1 \end{array}\right). $$

Also one can define conditional probabilities as

$$ P_{D}(+| L)\equiv\hbox{tr}(E_{1}\otimes I){\mathcal{E}}_{D,Q}^{\ast} (E_{1})=\frac{\left\vert a\right\vert ^{2}}{\left\vert a\right\vert ^{2}+\left\vert b\right\vert ^{2}}, $$
(2)
$$ P_{D}(+|G)\equiv\hbox{tr}(E_{1}\otimes I){\mathcal{E}}_{D,Q}^{\ast} (E_{2})=\frac{\left\vert b\right\vert ^{2}}{\left\vert a\right\vert ^{2}+\left\vert b\right\vert ^{2}}, $$
(3)
$$ P_{D}(-|L)\equiv\hbox{tr}(E_{2}\otimes I){\mathcal{E}}_{D,Q}^{\ast} (E_{1})=\frac{\left\vert c\right\vert ^{2}}{\left\vert c\right\vert ^{2}+\left\vert d\right\vert ^{2}}, $$
(4)
$$ P_{D}(-|G) \equiv\hbox{tr}(E_{2}\otimes I){\mathcal{E}}_{D,Q}^{\ast} (E_{2})=\frac{\left\vert d\right\vert ^{2}}{\left\vert c\right\vert ^{2}+\left\vert d\right\vert ^{2}} $$
(5)

and

$$ P_{D}(+| L \cup G)\equiv\hbox{tr}(E_{1}\otimes I){\mathcal{E}}_{D,Q}^{\ast }(\rho_{0})=\frac{\left\vert a\alpha+b\beta\right\vert ^{2}}{\left\vert a\alpha+b\beta\right\vert ^{2}+\left\vert c\alpha+d\beta\right\vert ^{2} }, $$
(6)
$$ P_{D}(-|L \cup G)\equiv\hbox{tr}(E_{2}\otimes I){\mathcal{E}} _{D,Q}^{\ast}(\rho_{0})=\frac{\left\vert c\alpha+d\beta\right\vert ^{2} }{\left\vert a\alpha+b\beta\right\vert ^{2}+\left\vert c\alpha+d\beta \right\vert ^{2}}. $$
(7)

The above definition of the conditional probability is different from classical one. We remark that the following property;

$$ P_{D}(\pm|L)\neq\frac{P_{D}(\pm\cap L)}{P(L)}\, \hbox{and} \, P_{D}(\pm |L)\neq\frac{P_{D}(\pm\cap G)}{P(G)}. $$

Invariant operator quantities in E. coli’s preference obtained from a statistical data

In this section, we show how to find invariant operator quantities in E. coli’s preference.

We give the activation operator Q by the following decomposed form:

$$ Q =AF\\ = \left(\begin{array}{cc} \sqrt{P_{E_{1}}(+| L)} &\sqrt{P_{E_{2}}(+|G)}\\ \sqrt{P_{E_{1}}(-|L)} &\sqrt{P_{E_{2}}(-|G)} \end{array}\right) \left(\begin{array}{cc} k_{L} & 0\\ 0 & e^{{\rm i}\theta}k_{G} \end{array}\right). $$

with real numbers k L k G and θ. Note that the above Q satisfy Eqs. (2)–(5) for any k L k G and θ. From the Eqs. (6) and (7), we can express the conditional probability P D (+ | LG) as

$$ P_{D}(+| L \cup G)=\frac{P_{E_{1}}(+| L)P(L)\sqrt{\frac{k_{L}}{k_{G}}}+P_{E_{2}} (+|G)P(G)\sqrt{\frac{k_{G}}{k_{L}}}+2\delta\cos\theta}{P(L)\sqrt{\frac{k_{L}} {k_{G}}}+P(G)\sqrt{\frac{k_{G}}{k_{L}}}+2\tilde{\delta}\cos\theta} $$
(8)

with

$$ \begin{aligned} \delta&=\sqrt{P_{E_{1}}(+| L)P_{E_{2}}(+|G)P(L)P(G)},\\ \tilde{\delta} & =\sqrt{P(L)P(G)}\left[ \sqrt{P_{E_{1}}(+| L)P_{E_{2}} (+|G)}+\sqrt{P_{E_{1}}(-|L)P_{E_{2}}(-|G)}\right]. \end{aligned} $$

If k L  = k G and θ = π/2, then the total probability law holds;

$$ P_{D}(+| L \cup G)=P_{E_{1}}(+| L)P(L)+P_{E_{2}}(+|G)P(G). $$

However, for general k L k G and θ, the total probability law is violated;

$$ P_{D}(+| L \cup G)\neq P_{E_{1}}(+| L)P(L)+P_{E_{2}}(+|G)P(G). $$

Let us remind that the violation of total probability law means E. coli’s preference. Therefore we can say the values of k L /k G and θ describe E. coli’s preference, which is given as an invariant quantity for the strain of W3110. It is natural that we assume that each strain of E. coli has own invariant quantity which describes E. coli’s character in metabolization.

From data-I and -II of Table 2, we can determine the operators A (I) and A (II), and we can easily confirm that A (I) is approximately same as A (II). The operator of E. coli W3110 is given by

$$ A=\frac{A^{(I)}+A^{(II)}}{2}= \left( \begin{array}{cc} 0.99&0.13\\ 0.17 & 0.99 \end{array}\right). $$

Further we can obtain the values of k L /k G and θ by solving two equations which are given by assigning the probabilities for data-(I) and data(II) to the Eq. (8);

$$ \begin{aligned} \sqrt{\frac{k_{L}}{k_{G}}}&\approx 0.066,\\ \cos\theta & \approx -0.842 \end{aligned} $$

With \(\left\{A,\sqrt{k_{L}/k_{G}},\cos \theta\right\}\) which are shown the above, we can compute the probability P D (+|LG) for any D. Figure 1 shows the value of P D (+|LG) with respect to P(L). As seen in Fig. 1, lactose operon is active only when P(L) is very close to one. We can quantitatively estimate E. coli’s glucose preference for any concentration of lactose/glucose.

Fig. 1
figure 1

Value of P D (+|LG) with respect to P(L)

Conclusion

We found the decomposed form of the activation operator and the new invariant operator quantities A and B for the type W3110 of E. coli. We determine these operators from experimental data with the aid of a simple calculation based on the concept of adaptive dynamics.