Abstract
There exist several phenomena breaking the classical probability laws. The systems related to such phenomena are context-dependent, so that they are adaptive to other systems. In this paper, we present a new mathematical formalism to compute the joint probability distribution for two event-systems by using concepts of the adaptive dynamics and quantum information theory, e.g., quantum channels and liftings. In physics the basic example of the context-dependent phenomena is the famous double-slit experiment. Recently similar examples have been found in biological and psychological sciences. Our approach is an extension of traditional quantum probability theory, and it is general enough to describe aforementioned contextual phenomena outside of quantum physics.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
It is well-known that quantum mechanics describes statistical properties of microscopic phenomena for which classical probability theory seems to be inapplicable. In this paper, we focus on violations of classical probability laws that can be found in quantum theory [40–42]. Our assertion is that the mathematical formalism of quantum theory has some features of non-Kolmogorovian probability theory and it can be used for analysis of statistical properties of some phenomena outside of quantum physics. Actually, authors have pointed out that the violations of the total probability law can be found in experimental data in biology [10], cognitive science [4, 5, 7, 9, 11–13, 15–17, 20, 27–31], and decision-makings in games [7, 8].
First of all, we briefly illustrate the violation of a classical probability law, namely, the total probability law, in the famous double-slit experiment. The probability that a photon is detected at the position x on a photo-sensitive plate is represented as
where ψ 1 and ψ 2 are two wave functions, whose absolute values |ψ k (x)|2 give the distributions of photons which pass through the slits numerated as k=1,2. The term of |ψ 1(x)||ψ 2(x)|cosθ describes the interference effect due to superposition of two wave functions. Let us denote |ψ k (x)|2 by P(x|k). Then the above equation is rewritten as
where P(1)=P(2)=1/2. However, the usual total probability law has the form:
Thus it is violated. The interference term
describes the degree of violation.
For this example, we can propose the following interpretation. Let us consider context “both slits are open”, and denote it by S 1∪2. This context is not a simple (Boolean) sum of two contexts S i , i=1,2, “only ith slit is open”:
In the approach of hidden variables theory, one would try to find a proper common probability space describing all contexts, S 1∪2 and S 1∪S 2. However, this is difficult.Footnote 1 In quantum mechanics, different contexts are distinguished as different states, like, e.g.,
The probabilities denoted by P(x) in Eqs. (1) and (2) are given by \(\langle x \vert \rho_{S_{1\cup 2}}\vert x \rangle \) and \(\langle x \vert \rho_{S_{1} \cup S_{2}}\vert x \rangle \), respectively. We denote these probabilities by \(P_{S_{1\cup 2}}(x)\) and \(P_{S_{1} \cup S_{2}}(x)\) and rewrite Eq. (1) as
The violation of the total probability law comes from the difference between the probabilistic structures of two contexts, S 1∪2 and S 1∪S 2, or more precisely, the two states \(\rho_{S_{1\cup 2}}\) and \(\rho_{S_{1} \cup S_{2}}\).
In this way we can discuss the violation of the total probability law for phenomena outside of quantum physics. Let us consider a simple and intuitive example. Consider the following experiments in the domain of cognitive psychology
We give chocolate to the subjects, and ask them whether it is sweet (C=1) or not (C=2). Then we obtain statistical data determining the probabilities P(C=1) and P(C=2). In another experiment we first give sugar to the (other) subjects before giving chocolate to them. Then we can obtain the probabilities P(S=1), P(S=2), P(C=1|S=1) and P(C=1|S=2). The probability that chocolate is sweet is estimated as
And a naive application of classical probability theory implies that it is equal to P(C=1). However, one can easily see that the value given by Eq. (4) is smaller than P(C=1);
The LHS probability P(C=1) is obtained in context “subjects did not taste sugar before tasting chocolate”. Contexts “a tongue tasted sugar” and “a tongue did not taste sugar” are different. We denote first context by S sug and the latter by S ¬sug . The probabilities of LHS and RHS in the above equation should be replaced by \(P_{S_{\neg sug}}(C=1)\) and \(P_{S_{sug}}(C=1)\). Intuitively, the result of \(P_{S_{\neg sug}}(C=1)\neq P_{S_{sug}}(C=1)\) seems to be now natural. However, to explain this result mathematically, we need a proper probability space which describes the both contexts S ¬sug and S sug . To find such a common probability space, enormous knowledge about the physical and chemical structure of a tongue will be needed, so that it is very difficult to find such space, as in the case of the hidden variables theory. In this paper, another approach is employed. It is based on the mathematical apparatus of quantum information and probability. The states \(\rho_{S_{\neg suger}}\) and \(\rho_{S_{suger}}\) describing two contexts S ¬sug and S sug are constructed. Then the concepts of channel and lifting [3] play an important role.
A channel is a map from states to states. This is a basic mathematical tool of quantum information theory. If a state describes context for a system, a channel operating on the states describes the change of context. The role of lifting is more general than one of channel, see Sect. 2. That is, a lifting sends a state in \(\mathcal{S}(\mathcal{H})\) to a compound state in an expanded space \(\mathcal{S}(\mathcal{H}\otimes \mathcal{K})\). In Sect. 2, we introduce several examples of lifting maps, and in Sect. 3, we point out that the lifting maps are useful to define joint probabilities in two (event) systems. The violation of the total probability law can be mathematically explained by the difference between two states which are provided by lifting maps. The main problem discussed in this paper is how to construct such a lifting map. To do this, we use the ideas of adaptive dynamics (AD) [37]. The basic concept of AD is explained briefly in Sect. 3. The AD-framework presented in this paper generalizes the standard open systems dynamics.Footnote 2
We illustrate usage of the lifting maps to describe mathematically the violation of the total probability law by three examples of cognitive phenomena. The first example is the problem of sweetness, see Sect 5. In Sect. 6, we discuss the system describing metabolism in E. coli. In biology, it is known that E. coli gives preference to metabolism of glucose over one of lactose. Our model evaluates this function of preference in the term of the violation of the total probability law. In Sect. 6, we focus on a problem which has been widely studied in psychology and cognitive science. People frequently make an inference to estimate the probability of certain event. According to experimental analysis of human behavior in cognitive science, there are cases such that human estimation of the probability does not match with classical probability theory. Such inference is called heuristic inference. We assume that a decision-maker using a heuristic inference holds some psychological factor biasing Bayesian inference. The latter is known as a “rational” inference and it is based on classical probability theory. In our approach, a lifting map is used for describing such a psychological effect.
We also remark that application of quantum information theory outside of quantum physics, e.g., for macroscopic biological systems, wakes up again the long debate on a possibility to combine the realistic and quantum descriptions, cf. [6, 22, 23]. At the moment we are not able to present a consistent interpretation for coming applications of quantum information theory outside of quantum physics; we can only keep close to the operational interpretation of quantum information theory, e.g., [18, 19]. In applications of quantum probability outside of physics, the Bayesian approach to quantum probability and information interpretation of the quantum state [14, 21] are the most natural.Footnote 3
2 Lifting Map
In this section we discuss the notion of lifting [3].
Let \(\mathcal{A}\) be C ∗-algebra. The space of states on this algebra is denoted by the symbol \(\mathcal{S}(\mathcal{A})\).
Definition
Let \(\mathcal{A}\), \(\mathcal{B}\) be C ∗-algebras and let \(\mathcal{A} \otimes \mathcal{B}\) be a fixed C ∗-tensor product of \(\mathcal{A}\) and \(\mathcal{B}\). A lifting from \(\mathcal{A}\) to \(\mathcal{A}\otimes \mathcal{B}\) is a weak ∗-continuous map
If \(\mathcal{E}^{\ast }\) is affine and its dual is a completely positive map, we call it a linear lifting; if it maps pure states into pure states, we call it pure.
Remark
Let \(\mathcal{A}\), \(\mathcal{B}\) be the sets of all observables in Hilbert spaces \(\mathcal{H}\), \(\mathcal{K}\); \(\mathcal{A}=\mathcal{O}(\mathcal{H})\), \(\mathcal{B}=\mathcal{O}(\mathcal{K})\). Then, \(\mathcal{E}^{\ast }\) is a lifting from \(\mathcal{S} (\mathcal{H} ) \) to \(\mathcal{S} ( \mathcal{H}\otimes \mathcal{K} )\). The definition of lifting includes that of channel: (1) If \(\mathcal{K}\) is \(\mathbb{C}\), then lifting \(\mathcal{E}^{\ast }\) is nothing but a channel from \(\mathcal{S} ( \mathcal{H} ) \) to \(\mathcal{S} ( \mathcal{H} )\). (2) If \(\mathcal{H}\) is \(\mathbb{C}\), then lifting \(\mathcal{E}^{\ast }\) is a channel from \(\mathcal{S} ( \mathcal{H} ) \) to \(\mathcal{S} ( \mathcal{K} )\).
We present some important examples of liftings.
Example 1
Nondemolition lifting: Lifting from \(\mathcal{S} (\mathcal{H} ) \) to \(\mathcal{S} (\mathcal{H}\otimes \mathcal{K} )\) is called nondemolition for a state \(\rho\in \mathcal{S}(\mathcal{H})\) if ρ is invariant for \(\mathcal{E}^{\ast }\) i.e., if for all \(a \in \mathcal{O}(\mathcal{H})\)
Example 2
Isometric lifting: A transition expectation from \(\mathcal{A}\otimes \mathcal{B}\) to \(\mathcal{A}\) is a completely positive linear map \(\mathcal{E}:\mathcal{A}\otimes \mathcal{B}\rightarrow \mathcal{A}\) satisfying
Let \(V:\mathcal{H}\rightarrow \mathcal{H}\otimes \mathcal{K}\) be an isometry
Then the map
is a transition expectation, and the associated lifting maps a density matrix ρ in \(\mathcal{H}\) into
in \(\mathcal{H}\otimes \mathcal{K}\). Liftings of this type are called isometric. Every isometric lifting is a pure lifting.
Example 3
Compound lifting: Let \(\varLambda ^{\ast }:\mathcal{S}(\mathcal{A}_{1})\rightarrow \mathcal{S}(\mathcal{A}_{2})\) be a channel. For any \(\rho _{1}\in \mathcal{S}(\mathcal{A}_{1})\) in the closed convex hull of the extremal states, fix a decomposition of ρ 1 as a convex combination of extremal states in \(\mathcal{S}(\mathcal{A}_{1})\)
where μ is a Borel measure on \(\mathcal{S}(\mathcal{A}_{1})\) with support in the extremal states, and define
Then \(\mathcal{E}^{\ast }:\mathcal{S}(\mathcal{A}_{1})\rightarrow \mathcal{S} (\mathcal{A}_{1}\otimes \mathcal{A}_{2})\) is a lifting, nonlinear even if Λ ∗ is linear, and it is a nondemolition type.
The most general lifting, mapping \(\mathcal{S}(\mathcal{A}_{1})\) into the closed convex hull of the extremal product states on \(\mathcal{A}_{1}\otimes \mathcal{A}_{2}\) is essentially of this type. This nonlinear nondemolition lifting was first discussed by Ohya to define the compound state and the mutual entropy for quantum information communication [34, 35].
Now we omit the condition that μ is concentrated on the extremal states used in [34]. Therefore once a channel is given, then lifting of the convex product type can be constructed. For example, the von Neumann quantum measurement process is written, in the terminology of lifting, as follows. Having measured a compact observable A=∑ n a n P n (spectral decomposition with ∑ n P n =I) in a state ρ, the state after this measurement will be
and lifting \(\mathcal{E}^{\ast }\) of the convex product type associated to this channel Λ ∗ and to a fixed decomposition of ρ as ρ=∑ n μ n ρ n (\(\rho _{n}\in \mathcal{S}(\mathcal{A}_{1})\)) is given by
3 Adaptive Dynamics and New View to Total Probability Law
The idea of the adaptive dynamics (AD) has implicitly appeared in series of papers [2, 3, 25, 26, 32, 34, 36–39]. The name of the adaptive dynamics was deliberately used in [37]. The AD has two aspects, one of which is the “observable-adaptive” and another is the “state-adaptive”.
The idea of observable-adaptivity comes from studying chaos. Recognition (measurement) of chaos in a phenomenon depends on the choice of the method of structuring of this phenomenon; for example, which scales of time, distance or domain are used by observer. And even generally measurement depends on the choice of the method of structuring of a phenomenon. For example, consider time dependent dynamics; suppose that one studies its discretization based on a time interval τ and another takes ten times of τ, their results can be different. (See the paper [37] in more details.) Examples of the observable-adaptivity are used to understand chaos [32, 36] and to examine the violation of Bell’s inequality, namely the chameleon dynamics proposed by Accardi [1].
The idea of state-adaptivity is implicitly started in constructing a compound state for quantum communication [2, 33–35]. Examples of the state-adaptivity are seen in an algorithm solving NP complete problem [3, 38, 39]. State-adaptivity means that dynamics depends on the state of a system. For instance, in [3], the interaction Hamiltonian used for the computation depends on the state at time t 0 and the state after t>t 0 is changed by this Hamiltonian.
The above concept of AD can be represented mathematically by lifting. Let us introduce lifting from \(\mathcal{S} ( \mathcal{H} ) \) to \(\mathcal{S} ( \mathcal{H}\otimes \mathcal{K} )\) (or lifting from \(\mathcal{S} ( \mathcal{K} ) \) to \(\mathcal{S} ( \mathcal{H}\otimes \mathcal{K} )\)), say \(\mathcal{E}^{\ast }_{\sigma Q}\). Here, σ and Q are a state and an observable belonging to \(\mathcal{S} ( \mathcal{H}\otimes \mathcal{K} )\) and \(\mathcal{B(H)}\otimes \mathcal{B(K)}\), respectively. Lifting \(\mathcal{E}^{\ast }_{\sigma Q}\) is constructed with the aid of σ and Q. We consider the following dynamics.
The initial state ρ is defined in \(\mathcal{S}(\mathcal{H})\) or \(\mathcal{S}(\mathcal{K})\). We call this state change the dynamics adaptive to the state σ and the observable Q or the dynamics adaptive to context S={σ,Q}.
The compound state \(\mathcal{E}_{\sigma Q}^{*}(\rho) =\mathcal{E}_{S}^{*}(\rho)\) describes correlation of an event system of the interest with another event system.
Now consider two “event systems” \(A= \{ a_{k}\in \mathbb{R},E_{k}\in \mathcal{O}(\mathcal{K}) \} \) and \(B= \{ b_{j}\in \mathbb{R}, F_{j}\in \mathcal{O}(\mathcal{H}) \}\), where the sets of {E k } and {F k } are positive operator valued measures (POVMs), i.e., satisfying ∑ k E k =I,∑ k F k =I and E k ,F k >0. We define the joint probability as
Further, the probability P S (a k ) is defined as
As was discussed in the introduction, the violation of the total probability law comes from a difference of two contexts, say S={σ,Q} and \(\tilde{S}=\{\tilde{\sigma}, \tilde{Q}\}\). It is represented as
Generally, if \(S\neq \tilde{S}\), then Δ≠0. In order to discuss the form of Δ mathematically we have to define corresponding liftings.
In the sequels, we shall find proper liftings describing the following three problems: (1) state change of a tongue as the reaction to sweetness; (2) lactose-glucose interference in E. coli growth; (3) Bayesian updating.
4 State Change as Reaction of Tongue to Sweetness
The first problem under investigation is not so sophisticated, but quite common. As was discussed in the introduction, we consider the following cognitive experiment. One takes sugar S or (and) chocolate C and he is asked whether it is sweet or not so. The answers “yes” and “no” are numerically encoded by 1 and 2. Then the basic classical probability law need not be satisfied, that is,
because the LHS P(C=1) will be very close to 1 but the RHS will be less than \(\frac{1}{2}\). Note that the LHS P(C=1) is obtained in context that subjects do not taste sugar; they start directly with chocolate. Contexts “a tongue tasted sugar”, say S sug , and “a tongue did not taste sugar”, say S ¬sug , are different. The probabilities in LHS and RHS in the above equation should be replaced by \(P_{S_{\neg sug}}(C=1)\) and \(P_{S_{sug}}(C=1)\). The problem to be discussed is how to obtain these probabilities mathematically.
Let |e 1〉 and |e 2〉 be the orthogonal vectors describing sweet and non-sweet states, respectively. The initial state of a tongue is neutral such as
where \(x_{0}=\frac{1}{\sqrt{2}} ( \vert e_{1} \rangle +\vert e_{2} \rangle )\). Here we start with the neutral pure state ρ, because we consider experiments with two sweet things. This problem can be described by the Hilbert space \(\mathbb{C}^{2}\), so that |e 1〉 and |e 2〉 can be set as \(\binom{1}{0}\) and \(\binom{0}{1}\), respectively.
When one tastes sugar, the operator corresponding to tasting sugar is mathematically (and operationally) represented as
where |λ 1|2+ |λ 2|2=1. This operator can be regarded as the square root of the sugar state σ S :
Taking sugar, he will taste that it is sweet with the probability |λ 1|2 and non-sweet with the probability |λ 2|2, so |λ 1|2 should be much higher than |λ 2|2 for usual sugar. This comes from the following change of the neutral initial (i.e., non-adaptive) state of a tongue:
This is the state of a tongue after tasting sugar.
This dynamics is similar to the usual expression of the state change in quantum dynamics. The subtle point of the present problem is that just after tasting sugar the state of a tongue is neither ρ S nor ρ. Note here that if we ignore subjectivity (personal features) of one’s tongue, then, instead of the state given by (6), the “just after tasting sugar” state will have the form:
This is the unread objective state as usual in quantum measurement. We can use the above two expressions, which give us the same result for computation of the probabilities for the S-variable.
However, for some time duration, the tongue becomes dull to sweetness (and this is the crucial point of our approach for this example), so the tongue state can be written by means of a certain “exchanging” operator such that
Then similarly as for sugar, when one tastes chocolate, the state will be given by
where the operator C has the form:
with |μ 1|2+ |μ 2|2=1. Common experience tells us that |λ 1|2≥ |μ 1|2≥|μ 2|2≥|λ 2|2 and the first two quantities are much larger than the last two quantities.
As can be seen from the preceding consideration, in this example the “adaptive set” {σ,Q} is the set {S,X,C}. Now we introduce the following nonlinear demolition lifting:
The corresponding joint probabilities are given by
The probability that one tastes sweetness of the chocolate after tasting sugar is
which is \(P_{S_{sug}}(C=1)\). Note that this probability is much less than
which is the probability of sweetness tasted by the neutral tongue ρ. This means that the usual total probability law should be replaced by the adaptive (context dependent) probability law.
5 Activity of Lactose Operon in E. Coli
The lactose operon is a group of genes in E. coli (Escherichia coli), and it is required for the metabolism of lactose. This operon produces β-galactosidase, which is an enzyme to digest lactose into glucose and galactose. There was an experiment measuring the activity of β-galactosidase which E. coli produces in the presence of (I) only 0.4 % lactose, (II) only 0.4 % glucose, or (III) mixture 0.4 % lactose +0.1 % glucose, see [24]. The activity is represented in Miller’s units (enzyme activity measurement condition), and it reaches to 3000 units by full induction. In the cases of (I) and (II), the data of 2920 units and 33 units were obtained. These results make one to expect that the activity in the case (III) will be large, because the number of molecules of lactose is larger than that of glucose. However, the obtained data were only 43 units. This result implies that E. coli metabolizes glucose in the preference to lactose. In biology, this functionality of E. coli have been discussed, and it was known that glucose has a property reducing lactose permease provided by the operon. Apart from such qualitative and biochemical explanation, it will be also necessary to discuss mathematical representation such that the biological activity in E. coli is evaluated quantitatively. In the paper [10], it is pointed out that the activity of E. coli violates the total probability law as shown below, which might come from the preference in E. coli’s metabolism. We will explain this contextual behavior by the formula (5).
We consider two events L and G; L: “E. coli detects a lactose molecule in cell’s environment—to use it for its metabolism” and G: “E. coli detects a glucose molecule”. In the case of (I) or (II), the probability P(L)=1 or P(G)=1 is given. In the case of (III), P(L) and P(G) are calculated as
The events L and G are mutually exclusive. So it can be assumed that P(L)+P(G)=1. Further, we consider the events {+,−} which means that E. coli activates its lactose operon or not. From the experimental data for the cases (I) and (II), the following conditional probabilities are obtained:
In the case (III), if the total probability law were satisfied, the probability P(+) would be computed as
However, from the experimental data, we obtain
so that the total probability law is violated:
This violation is similar to that one in the double-slit experiment which was discussed in the introduction. Context of the case (III), say S L∪G , is different from a simple (Boolean) sum of two contexts, that is, S L ∪S G . We replace the LHS probability of the above equation by \(P_{S_{L\cup G}}(+)\) and replace the RHS probabilities P(+|L) and P(+|G) by \(P_{S_{L}}(+)\) and \(P_{S_{G}}(+)\) respectively;
We now use our mathematical model for computation of the above probabilities, by using the concept of lifting. First, we introduce the initial state ρ=|x 0〉〈x 0| in Hilbert space \(\mathcal{H}=\mathbb{C}^{2}\). The state vector x 0 is written as
The basis {e 1,e 2} denote the detection of lactose or glucose by E. coli, i.e., the events, L and G. In the initial state ρ, the E. coli bacteria has not recognized the existence of lactose and glucose yet. When E. coli recognizes them, the following state change occurs:
where
with |α|2+|β|2=1. Note that |α|2 and |β|2 give the probabilities of the events L and G: P(L) and P(G). The state σ D ≡DD ∗ encodes the probability distribution P(L),P(G):
In this sense, the state σ D represents the chemical solution of lactose and glucose. We call D the detection operator and call ρ D the detection state. The state determining the activation of the operon in E. coli depends on the detection state ρ D . In our operational model, this state is obtained as the result of the following transformation:
where the operator Q is chosen as
We call ρ DQ the activation state for the operon and we call Q the activation operator. (The components a,b,c and d will be discussed later.)
We introduce lifting
by which we can describe the correlation between the activity of lactose operon and the ratio of concentration of lactose and glucose. From the discussion in Sect. 3, the joint probabilities P DQ (L,+) and P DQ (G,+) are given by
The probability P DQ (±) is obtained as P DQ (L,±)+P DQ (G,±).
Let us consider context S L (the case (I)) such that the detection operator D satisfies the condition P(L)=|α|2=1. We denote such D by the symbol D L . The probabilities \(P_{D_{L} Q}(\pm)=P_{S_{L}}(\pm)\) are calculated as
From the experimental results of Eq. (6), these values should be \(\frac{2920}{3000}\) and \(\frac{80}{3000}\). Therefore, we can give the following forms for the parameters a and c.
Here, k L is a certain real number. In a similar way, we consider context S G (the case (II)) and obtain
for the components b and d. To simplify the discussion, hereafter, we assume θ +L =θ −L , θ +G =θ −G and denote \(\mathrm{e}^{\mathrm{i}\theta _{L}}k_{L}\), \(\mathrm{e}^{\mathrm{i}\theta _{G}}k_{G}\) by \(\tilde{k}_{L}\), \(\tilde{k}_{G}\). Then, the operator Q is rewritten to
By using this Q, we calculate the probability \(P_{S_{L\cup G}}(+)\) corresponding to the case (III):
In general, the value of this probability is different from that of \(P_{S_{L} \cup S_{G}}(+)=P_{S_{L}}(+)|\alpha|^{2}+P_{S_{G}}(+)|\beta|^{2}\). The rate \(|\tilde{k}_{L}|/|\tilde{k}_{G}|\) essentially determines the degree of the difference. Recall the experimental data in the case (III). In this case, P(L)=|α|2=0.8>P(G)=|β|2=0.2, but \(P_{S_{L\cup G}}(+)\) is very small. According to our interpretation, it implies that the rate \(|\tilde{k}_{L}|/|\tilde{k}_{G}|\) is very small. In this sense, the operator F = in Eq. (9) specifies the preference in E. coli’s metabolism. We call F the preference operator. Finally note that if α, β are real and \(\tilde{k}_{L}=\tilde{k}_{G}^{\ast }\), the usual total probability law is held.
6 Bayesian Updating Biased by Psychological Factor
The Bayesian updating is an important concept in Bayesian statics, and it is used to describe a process of inference, which is explained as follows. Consider two event systems denoted by S 1={A,B} and S 2={C,D}, where the events A and B are mutually exclusive, and the same holds for C and D. Firstly, a decision-making entity, say Alice, estimates the probabilities P(A) and P(B) for the events A and B, which are called the prior probabilities. The prior probability is sometimes called “subjective probability” or “personal probability”. Further, Alice knows the conditional probabilities P(C|A) and P(C|B) which are obtained from some statistical data. When Alice sees the occurrence of the event C or D in the system S 2, she can change her prior prediction P(A) and P(B) to the following conditional probabilities by Bayes’ rule: When Alice sees the occurrence of C in S 2, she can update her prediction for the event A from P(A) to
When Alice sees the occurrence of D in S 2, she can update her prediction for the event A from P(A) to
In the same way she updates her prediction for the event B. These conditional (updating) probabilities are called the posterior probabilities. The change of prediction is described as “updating” from the prior probabilities P(A),P(B) to the posterior probabilities, and it is called the Bayesian updating.
In the paper [9], we redescribed the process of Bayesian updating in the framework of “quantum-like representation”, where we introduced the following state vector belonging to Hilbert space \(\mathcal{H}= \mathcal{H}_{1}\otimes \mathcal{H}_{2}=\mathbb{C}^{2}\otimes \mathbb{C}^{2}\);
We call this vector the prediction state vector. The set of vectors {|A′〉,|B′〉} is an orthogonal basis on \(\mathcal{H}_{1}\), and {|C′〉,|D′〉} is another orthogonal basis on \(\mathcal{H}_{2}\). The A′, B′, C′ and D′ represent the events defined as
- Event A′ (B′)::
-
Alice judges “the event A(B) occurs in the system S 1.”
- Event C′(D′)::
-
Alice judges “the event C(D) occurs in the system S 2.”
These events are the subjective events (judgments) in Alice’s “mental space” and the vectors |A′〉, |B′〉, |C′〉 and |D′〉 give quantum-like representation of the above judgments. The vector |Φ〉 represents coexistence of these judgments in Alice’s brain. For example, Alice is conscious of |A′〉 with the weight \(\sqrt{P(A^{\prime })}\), and under the condition of the event A′, she sets the weights \(\sqrt{P(C^{\prime }|A^{\prime })}\) and \(\sqrt{P(D^{\prime }|A^{\prime })}\) for the minds |C′〉 and |D′〉. Such an assignment of weights implies that Alice feels causality between S 1 and S 2: The events in S 1 are causes and the events in S 2 are results. The square of \(\sqrt{P(A^{\prime })}\) corresponds to a prior probability P(A) in the Bayesian theory. If Alice knows the objective conditional probabilities P(C|A) and P(C|B), Alice can set the weights of \(\sqrt{P(C^{\prime }|A^{\prime })}\) and \(\sqrt{P(C^{\prime }|B^{\prime })}\) from P(C′|A′)=P(C|A) and P(C′|B′)=P(C|B). If Alice has the prediction state |Φ〉〈Φ|≡ρ and sees the occurrence of the event C in S 2, the event D′ is vanished instantaneously (in her mental representation). This vanishing is represented as the reduction by the projection operator M C′=I⊗|C′〉〈C′|;
The posterior probability P(A|C) is calculated by
where M A′=|A′〉〈A′|⊗I.
The inference based on the Bayesian updating is rational—from the view point of classical probability theory, game theory and economics (the Savage sure thing principle). However, in cognitive psychology and economics one can find extended empirical data showing that sometimes human inference seems to be irrational, see [30] for the review. Typically this happens in contexts such that there are (often hidden) psychological factors disturbing the rational inferences. Our aim is to provide a mathematical description of such an irrational inference; the concept of lifting will be used. Let us introduce lifting from \(S(\mathcal{H})\) to \(S(\mathcal{H}\otimes \mathcal{K})\) by
Here \(\sigma \in S(\mathcal{K})\) represents the state of Alice’s psychological representation of context which is generated when Alice updates her inference. The operator V on \(\mathcal{H} \otimes \mathcal{K}\) is unitary and gives a correlation between the prediction state ρ and the psychological factor σ, in other words, it specifies a psychological affection to the rational inference. We call the state defined by
the prediction state biased from the rational prediction ρ. From this ρ σV , the joint probability is defined as
for the events X′=(A′ or B′) and Y=(C′ or D′), and the biased posterior probability is defined as
In general, the value of P σV (X′|Y′) is different from the original P(X′|Y′) obtained with the aid in the rational (Bayesian) inference.
Notes
In this paper we proceed pragmatically. We do not discuss arguments for and against hidden variables. We want just proceed mathematically. In the literature on quantum foundations it is generally claimed that Kolmogorov description of the double-slit experiment is impossible, cf., however, [19].
Elaboration of such generalized quantum(-like) dynamics is not based on just a rather common wish to consider so general situation as possible. Already the simplest examples from biology, see Sect. 4, demonstrate that for such biological systems the dynamical state change cannot be described in the conventional quantum framework. Elaboration of such a new mathematical apparatus and its application to biology differs this paper from our previous publications [7, 8] in which the standard theory of open dynamical systems was in use. By the same reason we presented AD-theory in the framework of C ⋆-algebras (and not simply complex Hilbert space): in general there are no reasons to expect that the probabilistic structure of all possible biological phenomena can be embedded into complex Hilbert space model of probability.
We point out that by using liftings we operate with entangled quantum states. By the conventional interpretation of quantum mechanics the corresponding probabilistic structure cannot be represented classically, in a Kolmogorov space. However, we again remark that inter-relation between classical and quantum probabilistic descriptions is still actively debated.
References
Accardi, L.: Urne e Camaleonti: Dialogo Sulla Realta, le Leggi del Caso e la Teoria Quantistica. Il Saggiatore, Perugia (1997). English edition: World Scientific (2002); Japanese edition: Makino (2002), Russian edition, Regular and Chaotic dynamics (2002)
Accardi, L., Ohya, M.: Compound channels, transition expectations, and liftings. Appl. Math. Optim. 39, 33–59 (1999)
Accardi, L., Ohya, M.: A stochastic limit approach to the SAT problem. Open Syst. Inf. Dyn. 11, 1–16 (2004)
Accardi, L., Khrennikov, A., Ohya, M.: The problem of quantum-like representation in economy, cognitive science, and genetics. In: Accardi, L., Freudenberg, W., Ohya, M. (eds.) Quantum Bio-Informatics II: from Quantum Information to Bio-Informatics, pp. 1–8. WSP, Singapore (2008)
Accardi, L., Khrennikov, A., Ohya, M.: Quantum Markov model for data from Shafir–Tversky experiments in cognitive psychology. Open Syst. Inf. Dyn. 16, 371–385 (2009)
Allahverdyan, A.E., Balian, R., Nieuwenhuizen, Th.M.: The quantum measurement process in an exactly solvable model. In: Foundations of Probability and Physics-3. American Institute of Physics, Ser. Conference Proceedings, vol. 750, pp. 16–24. Melville, New York (2005)
Asano, M., Ohya, M., Khrennikov, A.: Quantum-like model for decision making process in two players game. Found. Phys. 41(3), 538–548 (2010)
Asano, M., Ohya, M., Tanaka, Y., Khrennikov, A., Basieva, I.: On application of Gorini–Kossakowski–Sudarshan–Lindblad equation in cognitive psychology. Open Syst. Inf. Dyn. 18(1), 55–69 (2011)
Asano, M., Ohya, M., Tanaka, Y., Khrennikov, A., Basieva, I.: Quantum-like representation of Bayesian updating. In: Proceedings of the International Conference on Advances in Quantum Theory. American Institute of Physics, vol. 1327, pp. 57–62 (2011)
Basieva, I., Khrennikov, A., Ohya, M., Yamato, I.: Quantum-like interference effect in gene expression: glucose-lactose destructive interference. Syst. Synth. Biol. (2011). doi:10.1007/s11693-011-9081-8
Busemeyer, J.R., Matthews, M., Wang, Z.: A quantum information processing explanation of disjunction effects. In: Sun, R., Myake, N. (eds.) The 29th Annual Conference of the Cognitive Science Society and the 5th International Conference of Cognitive Science, pp. 131–135. Erlbaum, Mahwah (2006)
Busemeyer, J.B., Wang, Z., Townsend, J.T.: Quantum dynamics of human decision making. J. Math. Psychol. 50, 220–241 (2006)
Busemeyer, J.R., Santuy, E., Lambert-Mogiliansky, A.: Comparison of Markov and quantum models of decision making. In: Bruza, P., Lawless, W., van Rijsbergen, K., Sofge, D.A., Coeke, B., Clark, S. (eds.) Quantum Interaction: Proceedings of the Second Quantum Interaction Symposium, pp. 68–74. College Publications, London (2008)
Caves, C.M., Fuchs, Ch.A., Schack, R.: Quantum probabilities as Bayesian probabilities. Phys. Rev. A 65, 022305 (2002)
Cheon, T., Takahashi, T.: Interference and inequality in quantum decision theory. Phys. Lett. A 375, 100–104 (2010)
Cheon, T., Tsutsui, I.: Classical and quantum contents of solvable game theory on Hilbert space. Phys. Lett. A 348, 147–152 (2006)
Conte, E., Khrennikov, A., Todarello, O., Federici, A., Zbilut, J.P.: Mental states follow quantum mechanics during perception and cognition of ambiguous figures. Open Syst. Inf. Dyn. 16, 1–17 (2009)
D’ Ariano, G.M.: In: Operational Axioms for Quantum Mechanics. Foundations of Probability and Physics-3. American Institute of Physics, Ser. Conference Proceedings, vol. 889, pp. 79–105. Melville, New York (2007)
De Muynck, W.M.: Foundations of Quantum Mechanics, an Empiricists Approach. Kluwer Academic, Dordrecht (2002)
Fichtner, K.-H., Fichtner, L., Freudenberg, W., Ohya, M.: On a quantum model of the recognition process. In: QP-PQ: Quantum Prob. White Noise Analysis, vol. 21, pp. 64–84 (2008)
Fuchs, Ch.A., Schack, R.: A quantum-Bayesian route to quantum-state space. Found. Phys. 41, 345–356 (2011)
Garola, C., Sozzo, S.: The ESR model: a proposal for a noncontextual and local Hilbert space extensions of QM. Europhys. Lett. 86, 20009–20015 (2009)
Garola, C., Sozzo, S.: Generalized observables, Bell’s inequalities and mixtures in the ESR model. Found. Phys. 41, 424–449 (2011)
Inada, T., Kimata, K., Aiba, H.: Mechanism responsible for glucose-lactose diauxie in Escherichia coli challenge to the cAMP model. Genes Cells 1, 293–301 (1996)
Inoue, K., Ohya, M., Sato, K.: Application of chaos degree to some dynamical systems. Chaos Solitons Fractals 11, 1377–1385 (2000)
Inoue, K., Ohya, M., Volovich, I.V.: Semiclassical properties and chaos degree for the quantum baker’s map. J. Math. Phys. 43(1), 734 (2002)
Khrennikov, A.: Open Syst. Inf. Dyn. 11(3), 267–275 (2004)
Khrennikov, A.: Biosystems 84, 225–241 (2006)
Khrennikov, A.: Contextual Approach to Quantum Formalism (Fundamental Theories of Physics). Springer, Berlin (2009)
Khrennikov, A.: Ubiquitous Quantum Structure: from Psychology to Finance. Springer, Berlin (2010)
Khrennikov, A., Haven, E.: Quantum mechanics and violations of the sure-thing principle: the use of probability interference and other concepts. J. Math. Psychol. 53, 378–388 (2009)
Kossakowski, A., Ohya, M., Togawa, Y.: How can we observe and describe chaos? Open Syst. Inf. Dyn. 10(3), 221–233 (2003)
Ohya, M.: Note on quantum proability. Lett. Nuovo Cimento 38(11), 203–206 (1983)
Ohya, M.: On compound state and mutual information in quantum information theory. IEEE Trans. Inf. Theory 29, 770–777 (1983)
Ohya, M.: Some aspects of quantum information theory and their applications to irreversible processes. Rep. Math. Phys. 27, 19–47 (1989)
Ohya, M.: Complexities and their applications to characterization of chaos. Int. J. Theor. Phys. 37(1), 495–505 (1998)
Ohya, M.: Adaptive dynamics and its applications to chaos and NPC problem. In: QP-PQ: Quantum Probability and White Noise Analysis, Quantum Bio-Informatics 2007, vol. 21, pp. 181–216 (2007)
Ohya, M., Volovich, I.V.: New quantum algorithm for studying NP-complete problems. Rep. Math. Phys. 52(1), 25–33 (2003)
Ohya, M., Volovich, I.V.: Mathematical Foundations of Quantum Information and Computation and Its Applications to Nano- and Bio-Systems. Springer, Berlin (2011)
Plotnitsky, A.: Reading Bohr: Physics and Philosophy. Springer, Berlin (2006)
Plotnitsky, A.: Epistemology and Probability: Bohr, Heisenberg, Schrödinger, and the Nature of Quantum-Theoretical Thinking. Springer, Berlin (2009)
Plotnitsky, A.: On the reasonable and unreasonable effectiveness of mathematics in classical and quantum physics. Found. Phys. 41, 466–491 (2011)
Acknowledgements
Two authors (I. B. and A. Kh.) were supported by the grant Quantum Bio-Informatics, Tokyo University of Science (visiting fellowships, 2010, 11, 13); they would like to thank Noboru Watanabe and their coauthors for hospitality.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Asano, M., Basieva, I., Khrennikov, A. et al. Non-Kolmogorovian Approach to the Context-Dependent Systems Breaking the Classical Probability Law. Found Phys 43, 895–911 (2013). https://doi.org/10.1007/s10701-013-9725-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10701-013-9725-5