1 Introduction

Though not really new,Footnote 1 the (Strong) Free Will Theorem of Conway and Kochen [13, 14] is one of the sharpest and most interesting results that give constraints on determinism. It does so by proving that determinism is incompatible with a number of a priori desirable assumptions, including a small fragment of quantum mechanics (viz. the theory of two epr-correlated spin-one particles), the free choice of settings of an epr-style bipartite experiment involving such particles, and a locality condition called min. The latter has a long pedigree, arguably going back to epr, but it was first stated quite clearly by Bell:Footnote 2

‘The vital assumption is that the result \(B\) for particle 2 does not depend on the setting \(\vec {a}\) of the magnet for particle 1, nor \(A\) on \(\vec {b}\).’ [3, p. 196].

Also a closer study shows that Bell’s Theorem on deterministic hidden variable theories [3] and the (Strong) Free Will Theorem appear to achieve a very similar (if not identical) goal under strikingly similar assumptions, which prompts the question what exactly their mutual relationship is. Curiously, despite the stellar fame of Bell’s paper [3] (which according to Google Scholar had about 8,500 citations as of May 2014) and the considerable attention that also the Free Will Theorem has received (e.g., [16, 21]), as far as we are aware, there has been little research in this precise direction.Footnote 3

Hence the main aim of this paper is to clarify the relationship between Bell’s Theorem [3] and the Free Will Theorem. But in doing so, we will en passant attempt to resolve an issue that has troubled Bell as well as Conway and Kochen, namely the theoretical status of parameter settings. As pointed out by Conway and Kochen [14] themselves, it is odd to assume determinism for the physical system under consideration but not for the experimenters, so that the contradiction that proves their theorem seems almost circular.Footnote 4

In Bell’s later work, there has been a similar tension between the idea that the hidden variables (in the pertinent causal past) should on the one hand include all ontological information relevant to the experiment, but on the other hand should leave Alice and Bob free to choose any settings they like; see especially [28, 32] for a fine analysis of Bell’s dilemma (by some of his greatest supporters).Footnote 5 We will show that in both contexts of Bells’ Theorem (i.e. either deterministic or stochastic) this issue can be resolved in a straightforward way by initially including the settings among the random variables describing the experiment, after which they are ‘liberated’ by suitable independence assumptions.Footnote 6

The plan of our paper is as follows. In Sect. 2 we present a version of Bell’s original theorem [3] that addresses the above issues. As a warm-up for what is to come, in Sect. 3 we extend this version to the spin-one case, followed in Sect. 4 by a reformulation of the Strong Free Will Theorem [14] in the same spirit. Our final Sect. 5 goes beyond our primary goal of finding constraints on determinism, but has been included in order to show that our treatment of parameter settings through random variables also applies to Bell’s later results on stochastic hidden variable theories [6, 8, 10, 15, 22, 27, 31, 37].

Our conclusion is that the Strong Free Will Theorem uses fewer assumptions than Bell’s Theorem [3], as no appeal to probability theory is made. This comes at a price, though. First, in the absence of an Aspect-type experiment using spin-one particles, the former so far lacks the experimental backing of the latter. Second, because of its dependence on the Kochen–Specker Theorem, the Strong Free Will Theorem might lack finite precision robustness, cf. [1, 2, 18], though this threat recently seems to have been obviated [19].

2 Bell’s (1964) Theorem Revisited

The setting of Bell’s Theorem in its simplest form is given by the usual epr-Bohm experiment (with photons) [8], in which Alice and Bob each choose a setting \(A=\alpha \in X_A\) and \(B=\beta \in X_B\), respectively, where \(X_A\) and \(X_B\) are finite sets whose elements are angles in \([0,\pi )\). For the theorem, it is even enough to assume \(X_A=\{\alpha _1,\alpha _2\}\) and \(X_B=\{\beta _1,\beta _2\}\), for suitable \(\alpha _i\) and \(\beta _j\) (see below). Alice and Bob each receive one photon from an epr-correlated pair, and determine whether or not it passes through a polarizer whose principal axis is set at an angle \(\alpha \) or \(\beta \) relative to some reference axis in the plane orthogonal to the direction of motion of the photon pair.Footnote 7 If Alice’s photon passes through she writes down \(F=1|A=\alpha \), and if not she writes \(F=0|A=\alpha \). Likewise, Bob records his result as \(G=1|B=\beta \) or \(G=0|B=\beta \). Repeating this experiment, they determine empirical probabilities \(P_E\) for all possible outcomes through the frequency interpretation of probabilities, which they denote by \(P_E(F=\lambda |A=\alpha )\) and \(P_E(F=\mu |B=\beta )\), or, having got together and compared their results, by \(P_E(F=\lambda ,G=\mu |A=\alpha ,B=\beta )\), where \(\lambda ,\mu \in \{0,1\}\). If the photon pair is prepared in the epr-correlated state \(|\psi _{\hbox {epr}}\rangle =(|0\rangle |0\rangle +|1\rangle |1\rangle )/\sqrt{2}\) (taking into account helicity only), they find (as confirmed by quantum mechanics):Footnote 8

$$\begin{aligned} P_E(F\ne G|A=\alpha ,B=\beta )=\sin ^2(\alpha -\beta ). \end{aligned}$$
(2.1)

The question, then, is whether these probabilities are ‘intrinsic’ or ‘irreducible’, as claimed by mainstream quantum mechanics, or instead are just a consequence of our ignorance. To make this precise, we define the latter case, i.e. determinism, at least in our present context, adding the other assumptions of Bell’s Theorem [3] along the way.

Definition 2.1

In the context of the epr-Bohm experiment (with photons):

  • Determinism means that there is a state space \(X\) with associated functions

    $$\begin{aligned} A: X\rightarrow X_A, B:X\rightarrow X_B, F:X\rightarrow \{0,1\}, G:X\rightarrow \{0,1\}, \end{aligned}$$
    (2.2)

    which completely describe the experiment in the sense that some state \(x\in X\) determines both its settings \(\alpha =A(x),\beta =B(x)\) and its outcome \(\lambda =F(x),\mu =G(x)\).

  • Probability Theory means that the above set \(X\) can be upgraded to a probability space \((X,\Sigma ,P)\), carrying the above functions \(A,B,F,G\) as random variables,Footnote 9 so that the empirical probabilities are reproduced as conditional joint probabilities throughFootnote 10

    $$\begin{aligned} P_E(F=\lambda ,G=\mu |A=\alpha ,B=\beta )= P(F=\lambda ,G=\mu |A=\alpha ,B=\beta ). \end{aligned}$$
    (2.3)

    Furthermore, in terms of a postulated additional random variable \(Z:X\rightarrow X_Z\):

  • Parameter Independence means that \(F=F(A,Z)\) and \(G=G(B,Z)\), in that there are measurable functions \(\hat{F}:X_A \times X_Z\rightarrow \{0,1\}\) and \(\hat{G}:X_B\times X_Z\rightarrow \{0,1\}\) for which \(F(x)=\hat{F}(A(x),Z(x))\) and \(G(x)=\hat{G}(B(x),Z(x))\) (\(P\)-almost everywhere).

  • Freedom means that \((A,B,Z)\) are probabilistically independent relative to \(P\).Footnote 11

Here \(Z\) is the traditional ‘hidden variable’ space that, in the spirit of Bell [6, 28, 32], carries exactly the ‘ontological’ information (including e.g. the photon variables) that is:

  1. (i)

    Sufficiently complete for the outcome of the experiment to depend on \((A,B,Z)\) alone;

  2. (ii)

    Independent of the settings \((A,B)\), in the pertinent probabilistic sense.

These conditions stand (or fall) together: without (ii), i.e., Freedom, one could take \(X_Z=X\) and \(Z=\mathrm {id}\), whereas without (i), \(X_Z\) could be a singleton. Parameter Independence in fact sharpens i), which a priori might have been \(F=F(A,B,Z)\) and \(G=G(A,B,Z)\), to the effect that Alice’s outcome is independent of Bob’s, given \(A\) and \(Z\) (and vice versa) [3].

Our reformulation of Bell’s Theorem [3], then, is as follows.

Theorem 2.2

Determinism, Probability Theory, Parameter Independence, Freedom, and Nature (i.e. the outcome (2.1) of the epr-Bohm experiment) are contradictory.

Proof

Determinism, Probability Theory, and Parameter Independence implyFootnote 12

$$\begin{aligned} P(F=\lambda ,G=\mu |A=\alpha ,B=\beta )=P_{ABZ}(\hat{F}=\lambda ,\hat{G}=\mu |\hat{A}=\alpha ,\hat{B}=\beta ), \end{aligned}$$
(2.4)

where the function \(\hat{A}:X_A \times X_B\times X_Z\rightarrow X_A\) is just projection on the first coordinate, likewise the function \(\hat{B}: X_A \times X_B\times X_Z\rightarrow X_B\) is projection on the second, and \(P_{ABZ}\) is the joint probability on \(X_A \times X_B\times X_Z\) induced by the triple \((A,B,Z)\) and the probability measure \(P\). Similarly, let \(P_Z\) be the probability on \(X_Z\) defined by \(Z\) and \(P\), and define the following random variables on the probability space \((X_Z,\Sigma _Z,P_Z)\):

$$\begin{aligned} \hat{F}_{\alpha }(z)&:= \hat{F}(\alpha ,z);\end{aligned}$$
(2.5)
$$\begin{aligned} \hat{G}_{\beta }(z)&:= \hat{G}(\beta ,z). \end{aligned}$$
(2.6)

Freedom then implies (indeed, is equivalent to the fact) that \(P_{ABZ}\) is given by a product measure on \(X_A \times X_B\times X_Z\) (cf. [24, Lemma 3.10]). A brief computation then yields

$$\begin{aligned} P_{ABZ}(\hat{F}=\lambda ,\hat{G}=\mu |\hat{A}=\alpha ,\hat{B}=\beta )=P_Z(\hat{F}_{\alpha }=\lambda ,\hat{G}_{\beta }=\mu ), \end{aligned}$$
(2.7)

and hence, from (2.4),

$$\begin{aligned} P(F=\lambda ,G=\mu |A=\alpha ,B=\beta )=P_Z(\hat{F}_{\alpha }=\lambda ,\hat{G}_{\beta }=\mu ). \end{aligned}$$
(2.8)

Adding the Nature assumption, i.e. (2.1), then gives the crucial result

$$\begin{aligned} P_Z(\hat{F}_{\alpha }\ne \hat{G}_{\beta })=\sin ^2(\alpha -\beta ). \end{aligned}$$
(2.9)

However, any four \(\{0,1\}\)-valued random variables must satisfy the (‘Boole’) inequality [30]

$$\begin{aligned} P_Z(\hat{F}_{\alpha _1}\ne \hat{G}_{\beta _1})\le P_Z(\hat{F}_{\alpha _1}\ne \hat{G}_{\beta _2})+P_Z(\hat{F}_{\alpha _2}\ne \hat{G}_{\beta _1})+P_Z(\hat{F}_{\alpha _2}\ne \hat{G}_{\beta _2}),\nonumber \\ \end{aligned}$$
(2.10)

which can be proved directly from the axioms of (classical) probability theory. But for suitable values of \((\alpha _1,\alpha _2,\beta _1,\beta _2)\) this inequality is violated by (2.9). Take, for example, \(\alpha _2=\beta _2=3\theta \), \(\alpha _1=0\), and \(\beta _1=\theta \). The inequality (2.10) then assumes the form \(f(\theta )\ge 0\) for \(f(\theta )=\sin ^2(3\theta )+\sin ^2(2\theta )-\sin ^2(\theta )\). But this is false for many values of \(\theta \in [0,2\pi ]\). \(\square \)

As already mentioned, in the usual treatment of Bell’s Theorem (either his deterministic version [3, 8] or his stochastic version [6, 8, 10, 15, 22, 27, 31, 36]), the hidden variable \(\lambda \) corresponds to our \(z\in X_Z\) rather than \(x\in X\). It is the distinction between \(X_Z\) and the ‘super-deterministic’ state space \(X\) that allowed us to give a consistent formulation of Determinism without jeopardizing Freedom. As shown above, this eventually enables one to treat the apparatus settings as parameters rather than as random variables.

3 Bell’s (1964) Theorem for Spin-one

The Free Will Theorem relies on a variation of the epr-Bohm experiment in which \({\mathbb C}^2\) is replaced by \({\mathbb C}^3\); specifically, photons with the helicity degree of freedom only (or electrons with spin only) are replaced by massive spin-one particles. Although such a ‘Free Will Experiment’ has never been performed (though it might be, one day), quantum mechanics gives unambiguous predictions that may be used in lieu of measurement outcomes. Compared to the set-up of the previous section, the following changes are to be made:

  • The settings are now given by \(A=\mathbf {a}\) and \(B=\mathbf {b}\), where \(\mathbf {a}=[\vec {a}_1,\vec {a}_2,\vec {a}_3]\) and \(\mathbf {b}=[\vec {b}_1,\vec {b}_2,\vec {b}_3]\) are frames in \({\mathbb R}^3\), that is, orthonormal bases \((\vec {a}_1,\vec {a}_2,\vec {a}_3)\) etc. in which each unit vector is defined up to a minus sign so that, e.g., \([-\vec {a}_1,\vec {a}_2,-\vec {a}_3]=[\vec {a}_1,\vec {a}_2,\vec {a}_3]\).

  • The outcomes are now given by \(F=\lambda \in X_F\), \(G=\mu \in X_G\), where

    $$\begin{aligned} X_F=X_G=\{(1,1,0), (1,0,1), (0,1,1)\}. \end{aligned}$$
    (3.1)
  • If we write \(F=(F_1,F_2,F_3)\) and \(G=(G_1,G_2,G_3)\), so that e.g. \(F=(1,1,0)\) corresponds to \(F_1=F_2=1, F_3=0\), the relevant outcome of the experiment in the epr-state (defined in terms of the usual spin-1 basis \((|0\rangle ,|\pm 1\rangle )\) of \({\mathbb C}^3\))

    $$\begin{aligned} |\psi _{\hbox {epr}}\rangle =(|-1\rangle |-1\rangle +|0\rangle |0\rangle +|1\rangle |1\rangle )/\sqrt{3}, \end{aligned}$$
    (3.2)

    at least as predicted by quantum mechanics,Footnote 13 is given byFootnote 14

    $$\begin{aligned} P_{QM}(F_i\ne G_j|A=\mathbf {a},B=\mathbf {b})={\frac{2}{3}} \sin ^2\theta _{\vec {a}_i,\vec {b}_j} \,\,\, (i,j=1,2,3). \end{aligned}$$
    (3.3)

    Here \(\theta _{\vec {a},\vec {b}}\) is the angle between \(\vec {a}\) and \(\vec {b}\), so that \(\cos ^2 \theta _{\vec {a},\vec {b}}=\langle \vec {a},\vec {b}\rangle ^2\), cf. (2.1). Note that the right-hand side only depends on \((\vec {a}_i,\vec {b}_j)\) rather than on all six vectors \((\mathbf {a},\mathbf {b})\).

Along the same lines as Theorem 2.2, and subject to analogous definitions,Footnote 15 one proves:

Theorem 3.1

Determinism, Probability Theory, Parameter Independence, Freedom, and Nature (i.e. the outcome (3.3) of the Free Will Experiment) are contradictory.

For future reference, we also record the following consequence of (3.3):Footnote 16

$$\begin{aligned} P_{QM}(F_i=G_j|A_i=B_j)=1. \end{aligned}$$
(3.4)

In other words, if the settings \((\mathbf {a},\mathbf {b})\) have \(\vec {a}_i=\pm \vec {b}_j\), then with probability one the measurements \(F_i\) and \(G_j\) have the same outcomes (i.e. either \(F_i=G_j=0\) or \(F_i=G_j=1\)).

4 The Strong Free Will Theorem Revisited

The Strong Free Will Theorem [14] historically arose as a refinement of the Kochen–Specker Theorem [4, 25], in which the assumption of Non-contextuality in a single-wing experiment on a (massive) spin-one particle was replaced by the assumption of Parameter Independence in the double-wing experiment described in the previous section. In turn, the Kochen–Specker Theorem (like Gleason’s Theorem, from which it follows) freed von Neumann’s no-go result for hidden variable theories [35] from its controversial linearity assumption (see [9] for a balanced discussion). Thus the Strong Free Will Theorem of 2009 may be seen as a finishing touch of the development started by von Neumann in 1932. Ironically, we are now going to place the Strong Free Will Theorem in the Bell tradition, which emphatically arose in opposition (if not hostility) to the work of von Neumann!

Roughly speaking, the Strong Free Will Theorem removes the assumption of Probability Theory from Bell’s Theorem [3] (in our spin-one version, i.e., Theorem 3.1), but in order to achieve this, some of the assumptions now acquire a somewhat different meaning.

Definition 4.1

In the context of the Free Will Experiment of the previous section:

  • Determinism means that there is a state space \(X\) with associated functions

    $$\begin{aligned} A: X\rightarrow X_A, B:X\rightarrow X_B, F:X\rightarrow X_F, G:X\rightarrow X_G, \end{aligned}$$

    where \(X_A=X_B\) is the set of all frames in \({\mathbb R}^3\), and \(X_F=X_G\) is given by (3.1), which completely describe the experiment in the sense that each state \(x\in X\) determines both its settings \(\mathbf {a}=A(x),\mathbf {b}=B(x)\) and its outcome \(\lambda =F(x),\mu =G(x)\). Furthermore, in terms of a postulated additional random variable \(Z:X\rightarrow X_Z\):

  • Parameter Independence means that \(F=F(A,Z)\) and \(G=G(B,Z)\), i.e., for all \(x\in X\) one has \(F(x)=\hat{F}(A(x),Z(x))\) and \(G(x)=\hat{G}(B(x),Z(x))\) for certain functions \(\hat{F}:X_A \times X_Z\rightarrow X_F\), \(\hat{G}:X_B\times X_Z\rightarrow X_G\).

  • Freedom means that \((A,B,Z)\) are independent in the sense that for each \((\mathbf {a},\mathbf {b},z)\in X_A\times X_B\times X_Z\) there is an \(x\in X\) for which \(A(x)=\mathbf {a}\), \(B(x)=\mathbf {b}\), and \(Z(x)=z\).

Thus the main change lies in the Freedom assumption, which simply says that the function \(A\times B\times Z:X\rightarrow X_A\times X_B\times X_Z, x\mapsto (A(x),B(x),Z(x))\), is surjective. The goal of this assumption is to remove any potential dependencies between (or constraints on) the variables \((\mathbf {a},\mathbf {b},z)\), and hence between the physical system Alice and Bob perform their measurements on, and the devices they perform their measurements with.

Also, rather than the probabilistic outcome (3.3) of the Free Will Experiment, we use its corollary (3.4), construed non-probabilistically (i.e., probability one is replaced by deterministic certainty): writing \(\hat{F}=(\hat{F}_1,\hat{F}_2,\hat{F}_3)\) and \(\hat{G}=(\hat{G}_1,\hat{G}_2,\hat{G}_3)\), analogous to \(F\) and \(G\), so that \(\hat{F}_i:X_A \times X_Z\rightarrow \{0,1\}\) and \(\hat{G}_j:X_B \times X_Z\rightarrow \{0,1\}\), Nature reveals that:Footnote 17

$$\begin{aligned} \vec {a}_i=\vec {b}_j\, \Rightarrow \, \hat{F}_i(\vec {a}_1,\vec {a}_2,\vec {a}_3,z)=\hat{G}_j(\vec {b}_1,\vec {b}_2,\vec {b}_3,z). \end{aligned}$$
(4.1)

Our reformulation of the Strong Free Will Theorem [7, 11, 14, 20, 34], then, is as follows.

Theorem 4.2

Determinism, Parameter Independence, Freedom, and Nature (here represented by the outcome (4.1) of the Free Will Experiment) are contradictory.

Proof

The Freedom assumption allows us to treat \((\mathbf {a},\mathbf {b},z)\) as free variables, a fact that will tacitly be used all the time. First, take \(i=j\) in (4.1). This shows that \(\hat{F}_i(\vec {a}_1,\vec {a}_2,\vec {a}_3,z)\) only depends on \((\vec {a}_i,z)\), whilst \(\hat{G}_j(\vec {b}_1,\vec {b}_2,\vec {b}_3,z)\) only depends on \((\vec {b}_j,z)\). Hence we write \(\hat{F}_i(\vec {a}_1,\vec {a}_2,\vec {a}_3,z)=\tilde{F}_i(\vec {a}_i,z)\), etc. Next, taking \(i\ne j\) in (4.1) shows that \(\tilde{F}_1(\vec {a},z)=\tilde{F}_2(\vec {a},z)=\tilde{F}_3(\vec {a},z)\). Consequently, the function \(\hat{F}:X_A \times X_Z\rightarrow X_F\) is given by

$$\begin{aligned} \hat{F}(\vec {a}_1,\vec {a}_2,\vec {a}_3,z)=(\tilde{F}(\vec {a}_1,z),\tilde{F}(\vec {a}_2,z),\tilde{F}(\vec {a}_3,z)), \end{aligned}$$
(4.2)

Combined with its value set (3.1), this shows that for each fixed \(z\), \(\hat{F}\) is a frame function: to each frame \(\mathbf {a}\) it assigns one of the triples in (3.1), in such a way that if two different frames \(\mathbf {a}\) and \(\mathbf {a}'\) overlap in that \(\vec {a}'_i=\vec {a}_j\) for some \(i,j\), then \(\hat{F}_i(\vec {a}'_i,z)=\hat{F}_j(\vec {a}_j,z)\). However, such a function does not exist by the Kochen–Specker Theorem [25, 29]. \(\square \)

Through the proof of the Kochen–Specker Theorem, this proof shows that a suitable finite set of frames will do for \(X_A=X_B\), a simplification that is not available in Theorem 3.1!

5 Bell’s Theorem Revisited

To close, we show that what is usually called Bell’s Theorem [6, 8, 10, 15, 22, 27, 31, 36], in which Determinism is not assumed, may also be reformulated using our treatment of apparatus settings as random variables. We restrict ourselves to generalizing Theorem 2.2; Theorem 3.1 may be adapted to stochastic hidden variables in an analogous way.

Definition 5.1

In the context of the epr-Bohm experiment (with photons):

  • Probability Theory means that there is a probability space \((X,\Sigma ,P)\), carrying random variables (2.2), so that the empirical probabilities are reproduced as conditional joint probabilities through (2.3).

  • Bell-Locality means that there is a fifth random variable \(Z:X\rightarrow X_Z\) for which

    $$\begin{aligned}&P(F=\lambda ,G=\mu |A=\alpha ,B=\beta ,Z=z)= \end{aligned}$$
    (5.1)
    $$\begin{aligned}&P(F=\lambda |A=\alpha ,Z=z)\cdot P(G=\mu |B=\beta ,Z=z). \end{aligned}$$
    (5.2)
  • Freedom means that, for this fifth variable, \(P(Z\!=\!z|A\!=\!\alpha ,B\!=\!\beta )\!=\! P(Z\!=\!z)\).

Theorem 5.2

Probability Theory, Bell-Locality, Freedom, and Nature are contradictory, where Nature is represented through the outcome (2.1) of the epr-Bohm experiment.

Proof

Introduce a new probability space \(\tilde{X}_Z=[0,1]\times [0,1]\times X_Z\), with elements \((s,t,z)\), and probability measure \(d\tilde{P}_Z(s,t,z)=ds\cdot dt\cdot dP_Z(z)\). On \(\tilde{X}_Z\), define random variables

$$\begin{aligned} \tilde{F}_{\alpha }(s,t,z)&= \chi _{[0,P(F=1|A=\alpha ,Z=z)]}(s);\end{aligned}$$
(5.3)
$$\begin{aligned} \tilde{G}_{\beta }(s,t,z)&= \chi _{[0,P(G=1|B=\beta ,Z=z)]}(t), \end{aligned}$$
(5.4)

a move inspired by [36]. Using all assumptions of the theorem, one then finds

$$\begin{aligned} P(F=\lambda ,G=\mu |A=\alpha ,B=\beta )=\tilde{P}_Z(\tilde{F}_{\alpha }=\lambda ,\tilde{G}_{\beta }=\mu ), \end{aligned}$$
(5.5)

cf. (2.8), so that the proof may be completed exactly as in the case of Theorem 2.2. \(\square \)