Abstract
We first present a generalization of the Robertson-Heisenberg uncertainty principle. This generalization applies to mixed states and contains a covariance term. For faithful states, we characterize when the uncertainty inequality is an equality. We next present an uncertainty principle version for real-valued observables. Sharp versions and conjugates of real-valued observables are considered. The theory is illustrated with examples of dichotomic observables. We close with a discussion of real-valued coarse graining.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
One of the basic principles of quantum theory is the Robertson-Heisenberg uncertainty inequality [4, 7]
where A, B are self-adjoint operators and \(\psi \) is a vector state on a Hilbert space. The inequality (1.1) is usually applied to position and momentum operators A, B in which case \(|{\langle {\psi ,[{A,B}]\psi }\rangle }|^2=\hbar ^2\) where \(\hbar \) is Planck’s constant. In this situation, A and B are unbounded operators, but for mathematical rigor we shall only deal with bounded operators. However, our results can be extended to the unbounded case by considering a dense subspace common to the domains of A and B. In this paper, we derive a generalization of (1.1). This generalization applies to mixed states and contains an additional covariance term that results in a stronger inequality.
The main result in Section 2 is an uncertainty principle for observable operators. This principle contains four parts: a commutator term, a covariance term, a correlation term and a product of variances term. This last term is sometimes called a product of uncertainties. In Section 2 we also characterize, for faithful states, when the uncertainty inequality is an equality. Section 3 introduces the concept of a real-valued observable. If \(\rho \) is a state and A is a real-valued observable, we define the \(\rho \)-average, \(\rho \)-deviation and \(\rho \)-variance of A. If B is another real-valued observable, we define the \(\rho \)-correlation and \(\rho \)-covariance of A, B. An uncertainty principle for real-valued observables is given in terms of these concepts. An important role is played by the stochastic operator \(\widetilde{A}\) for A. In Section 3 we also define the sharp version of a real-valued observable and characterize when two real-valued observables have the same sharp version
Section 4 illustrates the theory presented in Section 3 with two examples. The first example considers two dichotomic arbitrary real-valued observables. The second example considers the special case of two noisy spin observables. In this case, the uncertainty inequality becomes very simple. Section 5 discusses real-values coarse graining of observables.
2 Quantum Uncertainty Principle
For a complex Hilbert space H, we denote the set of bounded linear operators by \(\mathcal {L}(H)\) and the set of bounded self-adjoint operators by \(\mathcal {L}_S(H)\). A positive trace-class operator with trace one is a state and the set of states on H is denoted by \(\mathcal {S}(H)\). A state \(\rho \) is faithful if \(\mathrm {tr\,} (\rho C^*C)=0\) for \(C\in \mathcal {L} (H)\) implies that \(C=0\). For \(\rho \in \mathcal {S} (H)\) and \(C,D\in \mathcal {L} (H)\) we define the sesquilinear form \(\langle {C,D}\rangle _\rho =\mathrm {tr\,} (\rho C^*D)\).
Lemma 2.1
(i) If \(C\in \mathcal {L} (H)\), \(\rho \in \mathcal {S} (H)\), then \(\mathrm {tr\,} (\rho C^*)=\overline{\mathrm {tr\,} (\rho C)}\). (ii) The form \(\langle {\bullet ,\bullet }\rangle _\rho \) is a positive semi-definite inner product. (iii) A state \(\rho \) is faithful if and only if \(\langle {\bullet ,\bullet }\rangle _\rho \) is an inner product
Proof
(i) If D is a trace-class operator and \(\left\{ \phi _i\right\} \) is an orthonormal basis for H, we have
Hence,
(ii) Applying (i), we have
Moreover, since \(C^*C\ge 0\) we have \(\langle {C,C}\rangle _\rho =\mathrm {tr\,} (\rho C^*C)\ge 0\). Hence, \(\langle {\bullet ,\bullet }\rangle _\rho \) is a positive semi-definite inner product. (iii) If \(\langle {\bullet ,\bullet }\rangle _\rho \) is an inner product, then
implies \(C=0\) so \(\rho \) is faithful. Conversely, if \(\rho \) is faithful, then
implies \(C=0\) so \(\langle {\bullet ,\bullet }\rangle _\rho \) is an inner product \(\square \)
For \(A\in \mathcal {L}_S(H)\) and \(\rho \in \mathcal {S} (H)\), the \(\rho \)-average (or \(\rho \)-expectation) of A is \(\langle {A}\rangle _\rho =\mathrm {tr\,} (\rho A)\) and \(\rho \)-deviation of A is \(D_\rho (A)=A-\langle {A}\rangle _\rho I\) where I is the identity map on H. If \(A,B\in \mathcal {L}_S(H)\), the \(\rho \)-correlation of A, B is
Although \(\textrm{Cor}_\rho (A,B)\) need not be a real number, it is easy to check that \(\overline{\textrm{Cor}_\rho (A,B)}=\textrm{Cor}_\rho (B,A)\). We say that A and B are uncorrelated if \(\textrm{Cor}_\rho (A,B)=0\). The \(\rho \)-covariance of A, B is \(\Delta _\rho (A,B)=\mathrm {Re\,}\textrm{Cor}_\rho (A,B)\) and the \(\rho \)-variance of A is
It is straightforward to show that
We see from (2.1) that A and B are \(\rho \)-uncorrelated if and only if \(\mathrm {tr\,} (\rho AB)=\langle {A}\rangle _\rho \langle {B}\rangle _\rho \). We say that A and B commute if their commutant \([{A,B}]=AB-BA=0\).
Example 1
In the tensor product \(H_1\otimes H_2\) let \(\rho =\rho _1\otimes \rho _2\in \mathcal {S} (H_1\otimes H_2)\) be a product state and let \(A_1\in \mathcal {L}_S(H_1)\), \(A_2\in \mathcal {L}_S(H_2)\). Then \(A=A_1\otimes I_2\), \(B=I_1\otimes A_2\in \mathcal {L}_S(H_1\otimes H_2)\) are \(\rho \)-uncorrelated because
This shows that A, B are \(\rho \)-uncorrelated for any product state \(\rho \). Of course, \([{A,B}]=0\) in this case. However, there are examples of noncommuting operators that are uncorrelated. For instance, on \(H=\mathbb {C}^2\) let \(\alpha =\left[ \begin{array}{c} 1\\ 0\end{array}\right] \), \(\phi =\left[ \begin{array}{c} 0\\ 1\end{array}\right] \), \(\psi =\frac{1}{\sqrt{2}}\left[ \begin{array}{c} 1\\ 1\end{array}\right] \). With \(\rho =|{\alpha }\rangle \langle {\alpha }|\), \(A=|{\phi }\rangle \langle {\phi }|\), \(B=|{\psi }\rangle \langle {\psi }|\) we have
Hence, A, B are \(\rho \)-uncorrelated. However,
so \([{A,B}]\ne 0\). \(\square \)
We now present our main result.
Theorem 2.2
If \(A,B\in \mathcal {L}_S(H)\) and \(\rho \in \mathcal {S} (H)\), then (i) \(\frac{1}{4}|{\mathrm {tr\,}\left( {\rho [{A,B}]}\right) }|^2+\left[ {\Delta _\rho (A,B)}\right] ^2=|{\textrm{Cor}_\rho (A,B)}|^2\)
(ii) \(\frac{1}{4}|{\mathrm {tr\,}\left( {\rho [{A,B}]}\right) }|^2+\left[ {\Delta _\rho (A,B)}\right] ^2\le \Delta _\rho (A)\Delta _\rho (B)\)
Proof
(i) Applying Lemma 2.1 we have
From (2.2) and (2.4) we obtain
(ii) Applying Lemma 2.1(ii), the form \(\langle {C,D}\rangle _\rho =\mathrm {tr\,}(\rho C^*D)\) is a positive semi-definite inner product. Hence, Schwarz’s inequality holds and we have
\(\square \)
We call Theorem 2.2(i) the uncertainty equation and Theorem 2.2(ii) the uncertainty inequality. Together, they are called the uncertainty principle. Notice that Theorem 2.2(ii) is a considerable strengthening of the usual Robertson-Heisenberg inequality (1.1) since it contains the term \(\left[ {\Delta _\rho (A,B)}\right] ^2\) and it applies to arbitrary states. Thus, even when \([{A,B}]=0\) we still have an uncertainty relation
Lemma 2.3
A state \(\rho \) is faithful if and only if the eigenvalues of \(\rho \) are positive.
Proof
Suppose the eigenvalues \(\lambda _i\) of \(\rho \) are positive with corresponding normalized eigenvectors \(\phi _i\). Then we can write \(\rho =\sum \lambda _i|{\phi _i}\rangle \langle {\phi _i}|\) for the orthonormal basis \(\left\{ \phi _i\right\} \). For any \(A\in \mathcal {L} (H)\) we obtain
Hence, \(\mathrm {tr\,} (\rho A^*A)=0\) implies \(A\phi _i=0\) for all i. It follows that \(A=0\). Conversely, if 0 is an eigenvalue of \(\rho \) and \(\phi \) is a corresponding unit eigenvector, then setting \(P_\phi =|{\phi }\rangle \langle {\phi }|\) we have
But \(P_\phi \ne 0\) so \(\rho \) is not faithful. \(\square \)
Theorem 2.4
If \(\rho \) is faithful. then the following statements are equivalent. (i) The uncertainty inequality of Theorem 2.2(ii) is an equality. (ii) \(D_\rho (B)=\alpha D_\rho (A)\) for \(\alpha \in \mathbb {R}\). (iii) \(B=\alpha A+\beta I\) for \(\alpha ,\beta \in \mathbb {R}\). If one of the conditions holds, then
Proof
(i)\(\Rightarrow \)(ii) If the uncertainty inequality is an equality, then
We can rewrite (2.6) as
Since we have equality in Schwarz’s inequality and \(\langle {\bullet ,\bullet }\rangle _\rho \) is an inner product, it follows that \(D_\rho (B)=\alpha D_\rho (A)\) for some \(\alpha \in \mathbb {C}\). Since \(D_\rho (B)^*=D_\rho (B)\) and \(D_\rho (A)^*=D_\rho (A)\) we conclude that \(\alpha \in \mathbb {R}\). (ii)\(\Rightarrow \)(iii) If \(D_\rho (B)=\alpha D_\rho (A)\) for \(\alpha \in \mathbb {R}\), we have
Hence, letting \(\beta =\langle {B}\rangle _\rho -\alpha \langle {A}\rangle _\rho \) we have \(B=\alpha A+\beta I\). Since \(A,B\in \mathcal {L}_S(H)\) and \(\alpha \in \mathbb {R}\), we have that \(\beta \in \mathbb {R}\). (iii)\(\Rightarrow \)(i) If (iii) holds, then
Hence, \(\beta =\langle {B}\rangle _\rho -\alpha \langle {A}\rangle _\rho \) so that
Thus, (ii) holds and it follows that (2.6) holds and this implies (i). Equation (2.5) holds because (2.6) holds. \(\square \)
Example 2
The simplest faithful state when \(\dim H=n<\infty \) is \(\rho =I/n\). Then \(\langle {A,B}\rangle _\rho =\frac{1}{n}\,\mathrm {tr\,} (A^*B)\) which is essentially the Hilbert-Schmidt inner product \(\langle {A,B}\rangle _{HS}=\mathrm {tr\,} (A^*B)\). In this case for \(A,B\in \mathcal {L}_S(H)\) we have \(\langle {A}\rangle _\rho =\frac{1}{n}\,\mathrm {tr\,} (A)\), \(D_\rho (A)=A-\frac{1}{n}\,\mathrm {tr\,} (A)I\). The other statistical concepts become:
The uncertainty principle is given by:
3 Real-Valued Observables
An effect is an operator \(C\in \mathcal {L}_S(H)\) that satisfies \(0\le C\le I\) [1, 4,5,6]. Effects are thought of as two outcomes yes-no measurements. When the result of measuring C is yes , we say that C occurs and when the result is no , then C does not occur. A real-valued observable is a finite set of effects \(A=\left\{ A_x: x\in \Omega _A\right\} \) where \(\sum \limits _{x\in \Omega _A}A_x=I\) and \(\Omega _A\subseteq \mathbb {R}\) is the outcome space for A. The effect \(A_x\) occurs when the result of measuring A is the outcome x. The condition \(\sum \limits _{x\in \Omega _A}A_x=I\) specifies that one of the possible outcomes of A must occur. An observable is also called a positive operator-valued measure (POVM). We say A is sharp if \(A_x\) is a projection for all \(x\in \Omega _A\) and in this case, A is a projection-valued measure [4, 7]. Corresponding to A we have the stochastic operator \(\widetilde{A}\in \mathcal {L} (H)\) given by \(\widetilde{A} =\sum \limits _{x\in \Omega _A}xA_x\). Notice that we need A to be real-valued in order for \(\widetilde{A}\) to exist.
We now apply the theory presented in Section 2 to real-valued observables. For \(\rho \in \mathcal {S} (H)\), the \(\rho \)-average (or \(\rho \)-expectation) of A is defined by
We interpret \(\mathrm {tr\,} (\rho A_x)\) as the probability that a measurement of A results in the outcome x when the system is in state \(\rho \). Thus, (3.1) says that the \(\rho \)-average of A is the sum of its outcomes times the probabilities these outcomes occur. We define the \(\rho \)-deviation of A by
If A, B are real-valued observables, the \(\rho \)-correlation of A, B is \(\textrm{Cor}_\rho (A,B)=\textrm{Cor}_\rho (\widetilde{A} ,\widetilde{B}\,)\), \(\rho \)-covariance of A, B is \(\Delta _\rho (A,B)=\Delta _\rho (\widetilde{A} ,\widetilde{B}\,)\) and the \(\rho \)-variance of A is \(\Delta _\rho (A)=\Delta _\rho (\widetilde{A}\,)\). Applying (2.1) we obtain
It follows that
and
We also have by (2.4) that
Substituting \(\widetilde{A},\widetilde{B}\) for A, B in Theorem 2.2 gives an uncertainty principle for real-valued observables.
Two observables A, B are compatible (or jointly measurable) if there exists a joint observable \(C_{(x,y)}\), \((x,y)\in \Omega _A\times \Omega _B\), such that \(A_x=\sum \limits _yC_{(x,y)}\), \(B_y=\sum \limits _xC_{(x,y)}\) for all \(x\in \Omega _A\), \(y\in \Omega _B\). If \(\left[ {A_x,B_y}\right] =0\) for all x, y, then A, B are compatible with \(C_{(x,y)}=A_xB_y\) for all \((x,y)\in \Omega _A\times \Omega _B\). However, if A, B are compatible, they need not commute [4]. If A, B are compatible real-valued observables, then
Using (3.2), (3.3), (3.4), (3.5) we can write \(\textrm{Cor}_\rho (A,B),\Delta _\rho (A,B),\Delta _\rho (A),\Delta _\rho (B)\) and \(\mathrm {tr\,}\left( {\rho \left[ {\widetilde{A} ,\widetilde{B}\,}\right] }\right) \) in terms of \(C_{(x,y)}\). Hence, we can express the uncertainty principle in terms of \(C_{(x,y)}\).
If \(A=\left\{ A_x: x\in \Omega _A\right\} \) is a real-valued observable, then \(\widetilde{A}\) has spectral decomposition \(\widetilde{A} =\sum \limits _{i=1}^n\lambda _iP_i\) where \(\lambda _i\in \mathbb {R}\) are the distinct eigenvalues of \(\widetilde{A}\) and \(P_i\) are projections with \(\sum P_i=I\). We call \(\widehat{A} =\left\{ P_i: i=1,2,\ldots ,n\right\} \) the sharp version of A. Then \(\widehat{A}\) is a real-valued observable with outcome space \(\Omega _{\widehat{A}}=\left\{ \lambda _i: i=1,2,\ldots ,n\right\} \) and \(P _{\lambda _i}=P_i\). Since \((\widehat{A}\,)^\sim =\widetilde{A}\), A and \(\widehat{A}\) have the same stochastic operator. It follows that \(\langle {A}\rangle _\rho =\langle {\widehat{A}\,}\rangle _\rho \), \(\Delta _\rho (A)=\Delta _\rho (\widehat{A}\,)\) and if B is another real-valued observable, then \(\textrm{Cor}_\rho (A,B)=\textrm{Cor}_\rho (\widehat{A} ,\widehat{B}\,)\) and \(\Delta _\rho (A,B)=\Delta _\rho (\widehat{A} ,\widehat{B}\,)\).
Lemma 3.1
The following statements are equivalent. (i) \(\widehat{A} =\widehat{B}\). (ii) \(\widetilde{A} =\widetilde{B}\). (iii) \(\langle {A}\rangle _\rho =\langle {B}\rangle _\rho \) for all \(\rho \in \mathcal {S} (H)\).
Proof
(i)\(\Rightarrow \)(ii) If \(\widehat{A} =\widehat{B}\) then
(ii)\(\Rightarrow \)(iii) If \(\widetilde{A} =\widetilde{B}\) then
(iii)\(\Rightarrow \)(i) If \(\langle {A}\rangle _\rho =\langle {B}\rangle _\rho \) for all \(\rho \in \mathcal {S} (H)\), then \(\langle {\widetilde{A}\,}\rangle _\rho =\langle {\widetilde{B}\,}\rangle _\rho \) for all \(\rho \in \mathcal {S} (H)\). It follows that \(\widehat{A} =\widehat{B}\). \(\square \)
Let \(\widetilde{A} =\sum xA_x=\sum \lambda _iP_i\) so \(\widehat{A} =\left\{ P_i: i=1,2,\ldots ,n\right\} \) is a sharp version of A. Let \(B=\left\{ B_x: x\in \Omega _A\right\} \) be the real-valued observable given by \(B_x=\sum \limits _{i=1}^nP_iA_xP_i\). We conclude that A and B have the same sharp version because
so by Lemma 3.1, \(\widehat{A} =\widehat{B}\). We say that B is a conjugate of A. Letting \(C_{ix}=P_iA_xP_i\), we have that
is an observable and \(\sum \limits _iC_{ix}=B_x\), \(\sum \limits _xC_{ix}=P_i\). It follows that B and \(\widehat{A}\) are compatible with joint observable \(\left\{ C_{ix}\right\} \). We say that an observable \(A=\left\{ A_x: x\in \Omega _A\right\} \) is commutative if \(\left[ {A_x,A_y}\right] =0\) for all \(x,y\in \Omega _A\). Notice that if A is sharp, then A is commutative. However, there are many unsharp observables that are commutative.
Theorem 3.2
If A is commutative, then B is conjugate to A if and only if \(B=A\).
Proof
If A is commutative, we show that A is conjugate to A. Since
we have that \(\left[ {\widehat{A} ,A_x}\right] =0\) for all \(x\in \Omega _A\). By the spectral theorem, \(\left[ {A_x,P_i}\right] =0\) for all x, i so \(A_x=\sum P_iA_xP_i\). Therefore, A is conjugate to A. Conversely, suppose A is commutative and B is conjugate to A. Then \(B_x=\sum \limits _iP_iA_xP_i\) for all \(x\in \Omega _A\). As before, we have that \(\left[ {\widehat{A} _x,A_x}\right] =0\) for all \(x\in \Omega _A\) so \(\left[ {A_x,P_i}\right] =0\) for all x, i. Hence,
for all \(x\in \Omega _B=\Omega _A\) so \(B=A\). \(\square \)
Thus, nontrivial conjugates only occur in the nonclassical case where A is noncommutative.
4 More Examples
This section illustrates the theory in Sections 2 and 3 with two examples.
Example 3
A two outcome observable is called a dichotomic observable. Of course, a dichotomic observable is commutative but it need not be sharp. Let \(A=\left\{ A_1,I-A_1\right\} \) be a dichotomic observable with \(\Omega _A=\left\{ 1,-1\right\} \). Then
If \(B=\left\{ B_1,I-B_1\right\} \) is another dichotomic observable with \(\Omega _B=\left\{ 1,-1\right\} \), then
Hence,
and
We also have
We conclude that \(\left[ {\widetilde{A} ,\widetilde{B}\,}\right] =0\) if and only if \(\left[ {A_1,B_1}\right] =0\) and this does not hold in general so \(\widetilde{A},\widetilde{B}\) need not commute. The uncertainty principle becomes
\(\square \)
Example 4
We now consider a special case of Example 3. For \(H\in \mathbb {C}^2\) we define the Pauli matrices
Let \(\mu \in \left[ {0,1}\right] \) and define the dichotomic observable \(A=\left\{ A_1,I-A_1\right\} \), where
and \(\Omega _A=\left\{ 1,-1\right\} \). Similarly, let \(B=\left\{ B_1,I-B_1\right\} \), where
and \(\Omega _B=\left\{ 1,-1\right\} \). We call A and B noisy spin observables along the x and y directions, respectively, with noise parameter \(1-\mu \) [7].
Any state \(\rho \in \mathcal {S} (H)\) has the form \(\rho =\frac{I}{2}(I+\overrightarrow{r}\bullet \overrightarrow{\sigma })\) where \(\overrightarrow{r}\in \mathbb {R} ^3\) with \(\Vert {\overrightarrow{r}}\Vert \le 1\) [1, 2]. This is called the Block sphere representation of \(\rho \) [4, 7]. The eigenvalues of \(\rho \) are \(\lambda _{\pm } =\frac{1}{2}\left( {1\pm \Vert {\overrightarrow{r}}\Vert }\right) \). Then \(\lambda _+=1\), \(\lambda _-=0\) if and only if \(\Vert {\overrightarrow{r}}\Vert =1\) and these are precisely the pure states. Letting \(\sigma _1=\sigma _x\), \(\sigma _2=\sigma _y\), \(\sigma _3=\sigma _z\) we obtain
and
Hence, \(\mathrm {tr\,} (\rho A_1)=\frac{1}{2}(1+r_1\mu )\) and as in Example 3, \(\langle {A}\rangle _\rho =r_1\mu \). Similarly, \(\mathrm {tr\,} (\rho B_1)=\frac{1}{2}(1+r_2\mu )\) and \(\langle {B}\rangle _\rho =r_2\mu \). We also obtain
and it follows from (4.1) that
Therefore, \(\Delta _\rho (A,B)=-r_1r_2\mu ^2\). A straightforward calculation shows that
It follows that
and similarly, \(\Delta _\rho (B)=\mu ^2(1-r_2^2)\).
The commutator term in (4.2) becomes
The covariance term in (4.2) is
and the correlation term in (4.2) is
Finally, the variance term in (4.2) is given by
The inequality in (4.2) reduces to
If \(\mu \ne 0\), (4.3) is equivalent to the inequality
If the commutator term vanishes and \(\mu \ne 0\), the uncertainty inequality becomes
which is equivalent to \(r_1^2+r_2^2\le 1\). If A and B are \(\rho \)-uncorrelated and \(\mu \ne 0\), the uncertainty inequality becomes \(r_3^2\le (1-r_1^2)(1-r_2^2)\) which is equivalent to \(\Vert {\overrightarrow{r}}\Vert ^2\le 1+r_1^2r_2^2\). This inequality and (4.4) are weaker than (4.3). \(\square \)
5 Real-Valued Coarse Graining
Let \(A=\left\{ A_x: x\in \Omega _A\right\} \) be an arbitrary observable. We assume that A is not necessarily real-valued so the outcome space \(\Omega _A\) is an arbitrary finite set. For \(f:\Omega _A\rightarrow \mathbb {R}\) with range \(\mathcal {R} (f)\) we define the real-valued observable f(A) by \(\Omega _{f(A)}=\mathcal {R} (f)\) and for all \(z\in \Omega _{f(A)}\)
We call f(A) a real-valued coarse graining of A [2,3,4]. Then f(A) has stochastic operator
It follows that \(\langle {f(A)}\rangle _\rho =\sum \limits _xf(x)\mathrm {tr\,} (\rho A_x)\) for all \(\rho \in \mathcal {S} (H)\). If B is another observable and \(g:\Omega _B\rightarrow \mathbb {R}\) we have
Moreover, we have the uncertainty inequality
We denote the set of trace-class operators on H by \(\mathcal {T} (H)\). An operation on H is a completely positive, trace reducing, linear map \(\mathcal {O}:\mathcal {T} (H)\rightarrow \mathcal {T} (H)\) [1,2,3,4]. If \(\mathcal {O}\) preserves the trace, then \(\mathcal {O}\) is called a channel. A (finite) instrument is a finite set of operators \(\mathcal {I} =\left\{ \mathcal {I}_x: x\in \Omega _\mathcal {I}\right\} \) such that \(\overline{\mathcal {I}} =\sum \left\{ \mathcal {I}_x: x\in \Omega _\mathcal {I}\right\} \) is a channel [1,2,3,4]. We say that \(\mathcal {I}\) measures an observable A if \(\Omega _\mathcal {I} =\Omega _A\) and \(\mathrm {tr\,}\left[ {\mathcal {I}_x(\rho )}\right] =\mathrm {tr\,} (\rho A_x)\) for all \(x\in \Omega _\mathcal {I}\). It can be shown that \(\mathcal {I}\) measures a unique observable which we denote by \(J(\mathcal {I})\) [2, 3]. Conversely, any observable is measured by many instruments [1,2,3,4]. Corresponding to an operation \(\mathcal {O}\) we have its dual-operation \(\mathcal {O} ^*:\mathcal {L} (H)\rightarrow \mathcal {L} (H)\) defined by \(\mathrm {tr\,}\left[ {\rho \mathcal {O} ^*(C)}\right] =\mathrm {tr\,}\left[ {\mathcal {O} (\rho )C}\right] \) for all \(\rho \in \mathcal {S} (H)\) [2, 3]. It can be shown that \(J(\mathcal {I})_x=\mathcal {I}_x^*(I)\) for all \(x\in \Omega _\mathcal {I}\) where I is the identity operator [2, 3].
As with observables, if \(\mathcal {I}\) is an instrument, and \(f:\Omega _\mathcal {I}\rightarrow \mathbb {R}\) we define the real-valued instrument \(f(\mathcal {I})\) such that \(\Omega _{f(\mathcal {I})}=\mathcal {R} (f)\) and
If \(J(\mathcal {I})=A\), then \(J\left[ {f(\mathcal {I})}\right] =f(A)\) because
for all \(z\in \Omega _{f(A)}=\Omega _{f(\mathcal {I})}\). If \(\mathcal {I}\) is real-valued, we define \(\widetilde{\mathcal {I}}\) on \(\mathcal {L} (H)\) by \(\widetilde{\mathcal {I}}(C)=\sum x\mathcal {I}_x(C)\) and \(\langle {\mathcal {I}}\rangle _\rho =\mathrm {tr\,}\left[ {\widetilde{\mathcal {I}} (\rho )}\right] \). If \(J(\mathcal {I})=A\), then
for all \(\rho \in \mathcal {S} (H)\). We also define \(\Delta _\rho (\mathcal {I})=\Delta _\rho (A)\). It follows that \(\langle {f(\mathcal {I})}\rangle _\rho =\langle {f(A)}\rangle _\rho \), \(\Delta _\rho \left[ {f(\mathcal {I})}\right] =\Delta _\rho \left[ {f(A)}\right] \) and \(f(\mathcal {I})^\sim =\sum f(x)\mathcal {I}_x\).
Let \(A=\left\{ A_x: x\in \Omega _A\right\} \), \(B=\left\{ B_y: y\in \Omega _B\right\} \) be arbitrary observables and suppose \(\mathcal {I}\) is an instrument with \(J(\mathcal {I})=A\). Define the \(\mathcal {I}\)-product observable \(A\circ B\) with \(\Omega _{A\circ B}=\Omega _A\times \Omega _B\) given by \((A\circ B)_{(x,y)}=\mathcal {I}_x(B_y)\) [2, 3]. Then \(A\circ B\) is indeed an observable because
Although \(A\circ B\) depends on \(\mathcal {I}\), we shall not indicate this for simplicity. We interpret \(A\circ B\) as the observable obtained by first measuring A using \(\mathcal {I}\) and then measuring B. If \(f:\Omega _A\times \Omega _B\rightarrow \mathbb {R}\) we obtain the real-valued observable \(f(A,B)=f(A\circ B)\). We then have
If f is a product function \(f(x,y)=g(x)h(y)\) we obtain
We then have the simplification
Hence,
In a similar way we obtain
If A and B are arbitrary observables, we define the observable B conditioned by A to be
where \(\Omega _{B\mid A}=\Omega _B\) [2, 3]. We interpret \((B\mid A)\) as the observable obtained by first measuring A without taking the outcome into account and then measuring B. If B is real-valued we have
We now illustrate the theory of this section with some examples.
Example 5
The simplest example of an instrument is a trivial instrument \(\mathcal {I}_x(\rho )=\omega (x)\rho \) where \(\omega \) is a probability measure on the finite set \(\Omega _\mathcal {I}\). It is clear that \(\mathcal {I}\) measures the trivial observable \(A_x=\omega (x)I\). Let B be an arbitrary observable and let \(f:\Omega _A\times \Omega _B\rightarrow \mathbb {R}\). We then have
We conclude that
Moreover, since
we have that \((B\mid A)=B\). \(\square \)
Example 6
Let \(A=\left\{ A_x: x\in \Omega _A\right\} \) and \(B=\left\{ B_y: y\in \Omega _B\right\} \) be arbitrary observables and let \(\mathcal {H}_x(\rho )=\mathrm {tr\,} (\rho A_x)\alpha _x\), \(\alpha _x\in \mathcal {S} (H)\) be a Holevo instrument [2, 3]. Then \(\mathcal {H}\) measure A because
Since \(\mathcal {H}_x^*(a)=\mathrm {tr\,} (\alpha _xa)A_x\) for all \(x\in \Omega _A\) [2, 3], we have
If \(f:\Omega _A\times \Omega _B\rightarrow \mathbb {R}\), we obtain the real-valued observable
We conclude that
Moreover, we have
Example 7
Let A, B be arbitrary observables and let \(\mathcal {L}\) be the Lüders instrument given by \(\mathcal {L}_x(\rho )=A_x^{1/2}\rho A_x^{1/2}\) [2, 3, 6]. Then
so \(\mathcal {L}\) measures A. Since \(\mathcal {L}_x^*(a)=A_x^{1/2}aA_x^{1/2}\) [2, 3] we have
If \(f:\Omega _A\times \Omega _B\rightarrow \mathbb {R}\), we obtain the real-valued observable
We conclude that
Moreover, we have
References
Busch, P., Grabowski, M., Lahti, P.: Operational Quantum Physics. Springer-Verlag, Berlin (1995)
Gudder, S.: Conditional observables in quantum mechanics. arXiv:quant-ph 2005, 04775 (2020)
Gudder, S.: Quantum instruments and conditioned observables (2020). arXiv preprint arXiv:2005.08117
Heinosaari, T., Ziman, M.: The Mathematical Language of Quantum Theory. Cambridge University Press, Cambridge (2012)
Kraus, K.: States. Effects and Operations. Springer-Verlag, Berlin (1983)
Lüders, G.: Über due Zustandsänderung durch den Messprozess. Ann. Physik. 6, 322–328 (1951)
Nielson, M., Chuang, I.: Quantum Computation and Quantum Information. Cambridge University Press, Cambridge (2000)
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Dedicated to the memory of Richard Greechie (1941–2022).
The author’s cherished friend, long time colleague and collaborator.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Gudder, S. Real-Valued Observables and Quantum Uncertainty. Int J Theor Phys 62, 94 (2023). https://doi.org/10.1007/s10773-023-05342-8
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10773-023-05342-8