Keywords

Mathematics Subject Classification (2000).

1 Introduction

Quantum mechanics is the most successful theory ever devised, by far. It explains everything that we know about matter, atoms, stars, the universe, chemistry, and indeed all physical phenomena that it has been applied to. Moreover, quantum mechanics predicts bizarre phenomena, such as vacuum fluctuations, that have been experimentally observed.

Quantum mechanics is a probability theory. While the probabilistic “results” of quantum mechanics are of the same nature as standard probability theory, for example expectation values and probability densities, the method of calculation is radically different from standard probability theory. Quantum mechanics uses wave functions, operators, and methods which are seemingly totally foreign to standard probability theory.

There have been numerous attempts to formulate quantum mechanics as a standard probability theory. It is fair to say that these attempts have not succeeded. We reverse the question and ask: Since quantum mechanics is certainly the most successful probability theory ever devised, we ask whether standard probability theory has the concepts of wave functions and operators. We emphasize that we are not trying to formulate quantum mechanics as a standard probability theory; quite the contrary, we are trying to see if standard probability theory contains the ideas and methods of quantum probability theory and if it could be formulated in quantum mechanical language [2, 3, 9].

Notation.

Operators will be denoted by bold-face letters and the corresponding random variables by lower case letters. When it is not obvious what random variable the characteristic function and corresponding probability density are referring to, we use the notation M a (θ) and P a (a) where the subscript denotes the random variable. All integrals go from − to or the appropriate range of the variables. Also, it is assumed that eigenfunctions are normalized to one for the discrete case and to a delta function for the continuous case.

2 Characteristic Functions

For a probability density, P(x), the characteristic function, M(θ), is the expectation value of e i θ x

$$\displaystyle{ M(\theta ) =\langle e^{i\theta x}\rangle =\int e^{i\theta x}P(x)\,dx }$$
(1)

and from the characteristic function, one may obtain the probability density by Fourier inversion,

$$\displaystyle{ P(x) = \frac{1} {2\pi }\int \,M(\theta )e^{-i\theta x}d\theta }$$
(2)

The characteristic function is standard in probability theory for many reasons [5, 8]. It is often easier to manipulate probabilistic results by using the characteristic function compared to the probability density function itself. For example, the moments, defined by

$$\displaystyle{ \langle x^{n}\rangle =\int x^{n}P(x)\,dx }$$
(3)

may be obtained from

$$\displaystyle{ \langle x^{n}\rangle = \left. \frac{1} {i^{n}} \frac{d^{n}} {d\theta ^{n}}M(\theta )\,\right \vert _{\theta =0} }$$
(4)

Since differentiation is easier then integration, Eq. (4) is often easier to use than Eq. (3) if indeed we know M(θ). Furthermore the characteristic function is very useful for obtaining probability densities for new variables [6].

The characteristic function is generally complex, but not every complex function is a characteristic function since it has to be derivable from a positive density function. What are necessary and sufficient conditions for a function M(θ) to be a characteristic function? Khinchine solved this problem [7]. A function, M(θ), is a characteristic function if and only if there exists a function, g(x), so that the characteristic function is expressed in the following form [7, 8]

$$\displaystyle{ M(\theta ) =\int g^{{\ast}}(x)g(x+\theta )dx }$$
(5)

If there is such a function, it should be normalized to one, which insures that the corresponding density will integrate to one. While this theorem is fundamental in probability theory, it appears that the significance and properties of the g(x) functions have not been extensively studied. We will argue that they are the “wave functions” of quantum mechanics, and that the generalization of Khinchine’s theorem that we present in Sect. 3 leads to the concept of operators in standard probability theory. We first present our idea for the Khinchine theorem as originally given, Eq. (5), before we give the general result in the next section.

Rewrite Khinchine’s theorem in the following way

$$\displaystyle{ M(\theta ) =\int g^{{\ast}}(x)e^{\theta \frac{d} {dx} }g(x)dx }$$
(6)

where in going from Eqs. (5)–(6) we have used the fact that \(e^{\theta \frac{d} {dx} }\) is the translation operator in that for any function f(x) [11]

$$\displaystyle{ e^{\theta \frac{d} {dx} }f(x) =f(x+\theta ) }$$
(7)

We now insert i as indicated

$$\displaystyle{ M(\theta ) =\int g^{{\ast}}(x)e^{i\theta \left (\frac{1} {i} \frac{d} {dx}\right )}g(x)dx }$$
(8)

and write Eq. (6) as

$$\displaystyle{ M(\theta ) =\int g^{{\ast}}(x)e^{i\theta \mathbf{p}}\,g(x)dx }$$
(9)

where we have defined the operator, p, by

$$\displaystyle{ \mathbf{p} = \frac{1} {i} \frac{d} {dx} }$$
(10)

We now calculate the expectation value by way of Eq. (4). In anticipation of the result we shall use the letter p for the random variables since it will turn out to be momentum and the letter p is standard for momentum. In particular,

$$\displaystyle\begin{array}{rcl} \langle \,p\rangle = \left.\frac{1} {i} \frac{d} {d\theta }M(\theta )\,\right \vert _{\theta =0}& &{}\end{array}$$
(11)
$$\displaystyle\begin{array}{rcl} = \left.\frac{1} {i} \frac{d} {d\theta }\int g^{{\ast}}(x)e^{i\theta \mathbf{p}}g(x)dx\,\right \vert _{\theta =0}& &{}\end{array}$$
(12)
$$\displaystyle\begin{array}{rcl} = \left.\int g^{{\ast}}(x)\left (\frac{1} {i} \frac{\partial } {\partial \theta }\right )e^{i\theta \mathbf{p}}g(x)dx\,\right \vert _{\theta =0}& &{}\end{array}$$
(13)
$$\displaystyle\begin{array}{rcl} = \left.\int g^{{\ast}}(x)\mathbf{p}e^{i\theta \mathbf{p}}g(x)dx\,\right \vert _{\theta =0}& &{}\end{array}$$
(14)

or

$$\displaystyle{ \langle \,p\rangle =\int g^{{\ast}}(x)\mathbf{p}\,g(x)dx =\int g^{{\ast}}(x)\left (\frac{1} {i} \frac{d} {dx}\right )g(x)dx }$$
(15)

This is precisely how one calculates the average momentum in quantum mechanics when the system has the “wave function” g(x) [1, 10]. Therefore we argue that for this case (momentum) the g’s of the Khinchine theorem are the wave functions of quantum mechanics. Note that the g’s are generally complex functions and that the operator p is self-adjoint, as indeed they should be. The reason for self-adjointness will be discussed in Sect. 3

The Probability Density What is the probability density that corresponds to the characteristic function given by Eq. (5)? Again, we use p for the random variable, which is an ordinary variable, and should not be confused with the operator p. Using Eq. (2) we have

$$\displaystyle\begin{array}{rcl} P(\,p) = \frac{1} {2\pi }\int \,M(\theta )e^{-i\theta p}d\theta & &{}\end{array}$$
(16)
$$\displaystyle\begin{array}{rcl} = \frac{1} {2\pi }\iint g^{{\ast}}(x)\,g(x+\theta )e^{-i\theta p}d\theta dx& &{}\end{array}$$
(17)

Making a change of variables

$$\displaystyle{ x^{{\prime}} = x + \theta \qquad \quad dx^{{\prime}} = d\theta }$$
(18)

we have

$$\displaystyle\begin{array}{rcl} P(\,p) = \frac{1} {2\pi }\iint \,g^{{\ast}}(x)\,g(x^{{\prime}})e^{-i(x^{{\prime}}-x)p }dx^{{\prime}}dx& &{}\end{array}$$
(19)
$$\displaystyle\begin{array}{rcl} = \frac{1} {2\pi }\left (\int \,g^{{\ast}}(x)e^{ixp}dx\right )\left (\int \,g(x^{{\prime}})e^{-ix^{{\prime}}p }dx^{{\prime}}\right )& &{}\end{array}$$
(20)

that is

$$\displaystyle{ P(\,p) = \frac{1} {2\pi }\left \vert \int \,g(x)e^{-ixp}dx\right \vert ^{2} }$$
(21)

But this is precisely the probability density of “momentum” in quantum mechanics [1, 10].

Comment Notice that the random variable p is continuous, ranging from − to . That is indeed the case in quantum mechanics, and we say that momentum is not quantized. How quantization for other physical quantities comes in will be clear when we discuss general operators and general random variables in the next section.

3 Generalization of Khinchine’s Theorem

We now generalize Khinchine’s theorem to apply to arbitrary self adjoint operators. M a (θ) is a characteristic function if and only if for a self adjoint operator A there exists the representation

$$\displaystyle{ M_{a}(\theta ) =\int g^{{\ast}}(x)e^{i\theta \mathbf{A}}g(x)dx }$$
(22)

We prove this in Appendix A, “Khinchine Theorem for Operators”. For the expectation value we have, using Eq. (2), that

$$\displaystyle\begin{array}{rcl} \langle a\rangle = \left.\frac{1} {i} \frac{d} {d\theta }M_{a}(\theta )\,\right \vert _{\theta =0}& &{}\end{array}$$
(23)
$$\displaystyle\begin{array}{rcl} = \left.\frac{1} {i} \frac{d} {d\theta }\int \,g^{{\ast}}(x)e^{i\theta \mathbf{A}}\,g(x)dx\,\right \vert _{\theta =0}& &{}\end{array}$$
(24)
$$\displaystyle\begin{array}{rcl} = \left.\int \,g^{{\ast}}(x)\left (\frac{1} {i} \frac{\partial } {\partial \theta }\right )e^{i\theta \mathbf{A}}\,g(x)dx\,\right \vert _{\theta =0}& &{}\end{array}$$
(25)
$$\displaystyle\begin{array}{rcl} = \left.\int \,g^{{\ast}}(x)\mathbf{A}e^{i\theta \mathbf{A}}\,g(x)dx\,\right \vert _{\theta =0}& &{}\end{array}$$
(26)

giving

$$\displaystyle{ \langle a\rangle =\int \, g^{{\ast}}(x)\mathbf{A}\,g(x)\,dx }$$
(27)

This is precisely the standard manner of calculating expectation values in quantum mechanics for a physical quantity associated with the self-adjoint operator A [1, 10].

3.1 Probability Density

We now discuss the probability density that corresponds to the characteristic function given by Eq. (22). Substitute Eq. (22) into Eq. (2) to obtain

$$\displaystyle{ P(a) = \frac{1} {2\pi }\int \,M_{a}(\theta )e^{-i\theta a}d = \frac{1} {2\pi }\iint \,g^{{\ast}}(x)e^{i\theta \mathbf{A}}\,g(x)e^{-i\theta a}dx\,d\theta }$$
(28)

We evaluate Eq. (28) in Appendix B, “Probability Density”. Here we state the result. There are two cases: namely, if we have discrete or continuous random variables. This follows naturally, as we show in the Appendix “Probability Density”. In short, it is the spectrum of the operator A which determines whether the random variables are discrete or continuous. Moreover the random variables are the eigenvalues of the operator.

Continuous case If the spectrum of the operator has continuous eigenvalues we write

$$\displaystyle{ \mathbf{A}u_{a}(x) = au_{a}(x) }$$
(29)

where a and u a (x) are the eigenvalues and corresponding eigenfunctions of the operator A. The probability density as evaluated by way of Eq. (28) is given by

$$\displaystyle{ P(a) = \left \vert c(a)\right \vert ^{2} }$$
(30)

where

$$\displaystyle{ c(a) = \int \,g(x)u_{a}^{{\ast}}(x)dx }$$
(31)

Hence, the random variables are the a s (the eigenvalues) and their range is the range of the eigenvalues.

Discrete case If the spectrum of the operator is discrete, we write

$$\displaystyle{ \ \mathbf{A}u_{n}(x) = a_{n}u_{n}(x) }$$
(32)

then the probability distribution is given by

$$\displaystyle{ P(a) =\sum _{n}\vert c_{n}\vert ^{2}\delta (a - a_{ n}) }$$
(33)

where

$$\displaystyle{ c_{n} = \int \,g(x)u_{n}^{{\ast}}(x)dx }$$
(34)

Notice that the probability density is non-zero only when the random variable, a, is one of the discrete eigenvalues. In this case we have quantization. We may write Eq. (33) as

$$\displaystyle{ P(a_{n}) = \left \vert c_{n}\right \vert ^{2} }$$
(35)

Discussion The probability densities derived above are called the Born rule. We have derived them from the generalization of the Khinchine theorem, Eq. (22). Also, we proved that the random variables are the eigenvalues of the operator A, which is usually just assumed in quantum mechanics.

3.2 Two Ways of Calculating Expectation Values

We have shown, using Eq. (22) that one may calculate expectation values by

$$\displaystyle{ \langle a\rangle =\int \, g^{{\ast}}(x)\mathbf{A}\,g(x)\,dx }$$
(36)

which is the standard quantum mechanical way. However in standard probability theory we calculate expectation values by

$$\displaystyle{ \langle a\rangle = \sum \text{(random variable) } \times \text{ (probability)}\qquad \quad \text{(discrete case)} }$$
(37)

for the discrete case, and by

$$\displaystyle{ \langle a\rangle =\int \text{(random variable) } \times \text{ (probability)}\qquad \quad \text{(continuous case)} }$$
(38)

for the continuous case. Substituting Eqs. (30) and (35) we have

$$\displaystyle\begin{array}{rcl} \langle a\rangle = \sum a_{n}\left \vert \int \,g(x)u_{n}^{{\ast}}(x)dx\right \vert ^{2}\qquad \quad \text{(discrete case)}& &{}\end{array}$$
(39)
$$\displaystyle\begin{array}{rcl} \langle a\rangle =\int a\left \vert \int \,g(x)u_{a}^{{\ast}}(x)dx\right \vert ^{2}da\qquad \quad \text{(continuous case)}& &{}\end{array}$$
(40)

It is well known in quantum mechanics that the two methods are the same, that is that

$$\displaystyle\begin{array}{rcl} \int \,g^{{\ast}}(x)\mathbf{A}\,g(x)\,dx = \sum a_{ n}\left \vert \int \,g(x)u_{n}^{{\ast}}(x)dx\right \vert ^{2}\qquad \quad \text{(discrete case)}& &{}\end{array}$$
(41)
$$\displaystyle\begin{array}{rcl} \int \,g^{{\ast}}(x)\mathbf{A}\,g(x)\,dx =\int a\left \vert \int \,g(x)u_{ a}^{{\ast}}(x)dx\right \vert ^{2}da\qquad \quad \text{(continuous case)}& &{}\end{array}$$
(42)

In Appendix C, “Standard vs. Quantum Manner of Calculating Expectation Values” we show the equivalence for the sake of readers that may not be familiar with the result.

4 Conclusion

We summarize the main results. We have generalized Khinchine’s theorem for a self-adjoint operator A by showing that a function, M a (θ), defined by

$$\displaystyle{ M_{a}(\theta ) =\int \, g^{{\ast}}(x)e^{i\theta \mathbf{A}}\,g(x)dx }$$
(43)

is a proper characteristic function. From Eq. (43) we have shown that the usual rules of quantum probabilities follow. In particular we have shown that:

  1. 1.

    The expected value is

    $$\displaystyle{ \langle a\rangle =\int \, g^{{\ast}}(x)\mathbf{A}\,g(x)\,dx }$$
    (44)
  2. 2.

    The random variables are the eigenvalues of the operator A.

  3. 3.

    If the eigenvalues, a, are continuous, then the probability density associated with the characteristic function is

    $$\displaystyle{ P(a) = \left \vert \int \,g(x)u_{a}^{{\ast}}(x)dx\right \vert ^{2} }$$
    (45)

    where u a (x) are the eigenfunctions.

  4. 4.

    If the eigenvalues, a n , are discrete with corresponding eigenfunctions u n (x), the probability density is given by

    $$\displaystyle{ P(a_{n}) = \left \vert \int \,g(x)u_{n}^{{\ast}}(x)dx\right \vert ^{2} }$$
    (46)

The above items are exactly how one obtains the random variables and probabilities in quantum mechanics. We have derived them from the characteristic function defined by Eq. (43).