Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

1 Diverging Accounts of the Casimir Effect

We have been considering the Casimir Effect from a number of different theoretical perspectives. In the light of what we have discussed, and before proceeding any further, it is worth taking the time to ask a simple question: what, then, are we actually talking about? Certainly, the Casimir Effect is an empirically verified phenomenon involving attractive (or repulsive) forces between macroscopic objects that persists even at zero temperature in a vacuum [1, 2].

However, explanations of the phenomenon are not uniformly consistent among theorists. The Casimir force has been described, on the one hand, as an effect resulting from the alteration, by the boundaries, of the zero-point electromagnetic energy [3]. On this account, the force is a property of the vacuum and “clear evidence for the existence of vacuum fluctuations” [4]. On the other hand, the Casimir Effect has also been described as a “force [that] originates in the forces between charged particles” that can be “computed without reference to zero point energies”. According to this alternative account, “The Casimir force is simply the (relativistic, retarded) Van der Waals force between the metal plates” and the phenomenon offers “no evidence that the zero-point energies are real” [5].

This is rather unsatisfactory [68]. We should like to be able to say something clearly (albeit provisionallyFootnote 1) about what this phenomenon subsists in, and what it may imply about the nature of physical reality. However, to adopt a more metaphysical parlance, these popular accounts of the Casimir Effect appear to invoke different ontologies in which a certain metaphysical priority is exchanged between the matter and the fields.

In this chapter, our aim is to side-step the technical details of Casimir physics and reconsider the basic ideas, with the hope of achieving some conceptual clarity. In so doing it will become apparent that these inconsistent interpretations are grounded in theories that fail to offer a consistently quantum-mechanical description of the interaction of the field with matter. There are two sides to this quantum-mechanical coin, but neither appears to be weighted. In the author’s opinion, the proper locus for interpreting the Casimir Effect is the theory of macroscopic quantum electrodynamics, in which the necessary quantization of the electromagnetic field and its coupling to bulk materials receives a canonical and consistently quantum-mechanical treatment.

2 Three Theories of the Casimir Force

2.1 Casimir’s Theory and the Quantum Vacuum

2.1.1 Theoretical Context

We shall focus our efforts on Casimir’s classic thought-experiment. In the standard account of the Casimir Effect, the predicted force occurs between a pair of neutral, parallel conducting plates, separated by a distance \(d\), in vacuum at zero temperature.Footnote 2 The interaction arises due to a disturbance of the vacuum state of the electromagnetic field (in which there are no real photons between the plates) [1]. This is a quantum effect, as classical electrodynamics does not predict a force at zero temperature.

The prescribed procedure may be summarised as follows [3]: take the infinite vacuum energy of the quantized electromagnetic field, with Dirichlet boundary conditions imposed on the field modes,

$$\begin{aligned} E =\frac{1}{2} \sum \hbar \omega , \end{aligned}$$
(3.2.1)

and subtract from it the infinite vacuum energy in free Minkowski space (or with the boundaries infinitely separated), \(E_{\infty }\), having first regularized both quantities \(E \rightarrow E(\xi ), E_{\infty } \rightarrow E_{\infty }(\xi )\) so that the subtraction procedure is well-defined. Once the difference between the two energies has been computed, the regularization is removed, \(\xi \rightarrow 0\), and the result that remains is finite:

$$\begin{aligned} E_{Casimir} = \lim \limits _{\xi \rightarrow 0} \left[ E(\xi ) - E_{\infty }(\xi ) \right] . \end{aligned}$$
(3.2.2)

This is the renormalised Casimir energy, from which we can derive the mechanical force exerted on two parallel plates. For Casimir’s case, in which the mirrors are perfectly reflective for all frequencies, we find the pressure force

$$\begin{aligned} P = - \frac{\hbar c \pi ^{2}}{240 d^{4}}. \end{aligned}$$
(3.2.3)

The astute reader will rightly object that our own procedure for extracting Casimir’s result in Chap. 1 differed slightly from this recipe, and indeed nobody follows the standard prescription precisely for the case of the electromagnetic field, though it has been pedagogically applied to a 1d scalar field [3] where the calculation is somewhat simpler. As we have seen, if we apply a frequency cutoff term \(exp(-\xi \omega / c)\) as the regulariser, we discover an additional divergent term that is not removed by subtracting the so-called background energy [9], which appears to correspond to waves running parallel to the plates.Footnote 3 This extra term has to be discarded also in order to state a finite electromagnetic Casimir energy. Typically it disappears in the course of applying the Euler-MacLaurin formula (e.g. [7, 11, 12]), or as a result of applying some other mathematical trick where the physical meaning is difficult to discern. Suffice it to say that the simple picture of taking the difference between two electromagnetic field energies can be somewhat misleading.

2.1.2 Physical Interpretation

Nevertheless, considered on the basis of an energy mode summation, as employed by Casimir [1], it seems the quantised electromagnetic field in its ground-state with ‘external boundary conditions’ is sufficient to determine a force—an almost matter-free prescription for obtaining the phenomenon in which the interacting bodies become simply topological features of the space [7]. Casimir’s formula, depending solely upon the constants \(\hbar \) and \(c\) and the distance \(d\) between the plates, serves to consolidate this impression [5].

But this interpretation is naive. The vacuum energy, as we have observed, is infinite, and in addition to imposing boundary conditions on the field we apply some form of regularisation to tame the mode summation and permit the subtraction (or extraction) of diverging terms. Although the various mathematical techniques employed to do this often obscure the fact, it is in the procedure of regularisation that some of the properties of matter—in particular, its dispersive behaviour—are permitted to leak into the calculation, albeit rather crudely [9]. Significantly, it is not possible to extract anything meaningful (or measureable) about the Casimir force until they are permitted to do so. Furthermore, when we relax the highly artificial boundary condition of perfect mirrors, as we must in order to predict the Casimir Effect in real materials, we are forced to sum contributions to the Casimir energy over a dispersive material response across the whole mode spectrum, substantially modifying the predicted force. To do this kind of calculation, however, we must abandon the mode summation and adopt a more sophisticated apparatus, like Lifshitz theory. Casimir’s result can still be recovered, but only as a limiting case [11].

2.2 Lifshitz Theory and Stochastic Fluctuations

2.2.1 Theoretical Context

Lifshitz theory, which we discussed in Chap. 2, has proven an important benchmark for the prediction of Casimir forces in more realistic cases, enjoying significant experimental verification [13, 14]. In the context of Lifshitz theory, the Casimir Effect is a result of fluctuating current densities in the two plates [1517]. A force arises from the interaction of the currents through the electromagnetic field that they generate in the cavity. The plates are now treated more realistically as dielectric with frequency-dependent permittivities and permeabilities, and this substantially affects both the size (and, in some cases, the natureFootnote 4) of the predicted force.

The formalism is written in terms of the electromagnetic Green function, which describes the field produced by sources of current within the system (2.2.30), including the stress tensor from which the force is derived (2.3.75, 2.3.76). The stress tensor, however, like the zero-point energy, contains a divergent contribution that must also be regularised.Footnote 5 Typically this is achieved by subtracting a stress \(\sigma _{0}\) calculated using an auxiliary Green function associated with an infinite homogeneous medium [11, 1720], and computing the physical stress in the limit of the point of measurement approaching a point source:

$$\begin{aligned} \sigma _{Casimir} = \lim \limits _{\mathbf {r'} \rightarrow \mathbf {r}} \left[ \sigma (\mathbf {r},\mathbf {r'}) - \sigma _{0}(\mathbf {r},\mathbf {r'}) \right] . \end{aligned}$$
(3.2.4)

One can then determine a finite stress tensor for the system that depends on the dielectric functions of the material at imaginary frequencies (quantities obtained from the dielectric properties for real frequencies by Hilbert transformation). Only then can the force be derived. Both Casimir’s and Lifshitz’ regularizations give identical results in the limiting case of a cavity sandwiched between perfectly reflecting mirrors (2.4.31).

2.2.2 Physical Interpretation

An incautious reading of Lifshitz theory might suggest that the role of the vacuum has been successfully banished from the Casimir Effect. Ontologically, the conditions seem to involve merely the material in the plates and a stochastic source of fluctuations within the material. An electromagnetic field results from the fluctuating currents in the plates, producing a mechanical stress in the same pieces of material that generated it. There is no Hamiltonian in the original formulation of Lifshitz theory [15, 16] and there are no quantized fields. There is therefore no ground-state of the electromagnetic field. To some, this does not even appear to be a quantum-mechanical theory at all [18, 2123].

But the fluctuations in the material that persist even at zero temperature are not a classical phenomenon; they are inserted ‘by hand’ using Rytov’s correlation function. This can be derived from statistical physics, or from fluctuation-dissipation theorem,Footnote 6 and affords the average electromagnetic field that would be present at finite (or zero) temperature [24].

Lifshitz theory is arguably uncommitted to the particulars of a quantum theory of light in material, however, as opposed to a merely stochastic theory of the phenomena, being based rather on the principles of thermodynamics and statistical physics[18, 21]. It embodies at best a minimal treatment of the quantum mechanics that is phenomenologically driven. It is therefore ontologically ambiguous about the role of the vacuum.Footnote 7 A clearer interpretation of the underlying physics cannot be achieved without the vantage point of a more quantum-mechanically consistent position.

2.3 Macroscopic QED and the Polariton Field

2.3.1 Theoretical Context

A recent and more sophisticated formulation of macroscopic quantum electrodynamics than the kind we adopted in Chap. 2 offers a canonical quantum-mechanical treatment of the interaction of light with real materials, without the detailed reference to the microscopic material structure that must defeat any complete treatment of such systems, and without sacrificing quantum-mechanical consistency along with more phenomenologically driven approaches [21]. Significantly, this form of macro-QED is the only canonical method that reproduces and justifies the general Lifshitz theory result for the stress tensor and determines the Casimir energy density inside a medium [16], resolving a long-standing dispute over the form that these expressions should take in this context [18, 20, 25, 26]. It applies with full generality to arbitrary magnetodielectrics, taking full account of the phenomena of dispersion and dissipation. First, in distinction to the shortcuts taken in Chap. 2, an action is formulated in terms of the dynamical variables \(\left\{ \phi ,\mathbf {A}\right\} \), the scalar and vector potentials of the fields, and \(\left\{ \mathbf {X}_{\omega },\mathbf {Y}_{\omega }\right\} \), a pair of oscillator fields incorporating the dissipative behaviour of the material:

$$\begin{aligned} S[\phi ,\mathbf {A},\mathbf {X}_{\omega },\mathbf {Y}_{\omega }] = S_{em}[\phi , \mathbf {A}] + S_{X}[\mathbf {X}_{\omega }] + S_{Y}[\mathbf {Y}_{\omega }] + S_{int}[\phi ,\mathbf {A},\mathbf {X}_{\omega },\mathbf {Y}_{\omega }], \end{aligned}$$
(3.2.5)

where \(S_{em}\) is the free electromagnetic action, \(S_{X}\) and \(S_{Y}\) are the actions for the free reservoir oscillators, and \(S_{int}\) is the interaction part of the action, coupling the electromagnetic fields to the field reservoirs of the material. Maxwell’s equations can be recovered from this action, and canonical quantisation proceeds straightforwardly. As before (2.1.40), a diagonalised Hamiltonian is achieved,Footnote 8

$$\begin{aligned} \hat{H} = \frac{1}{2}\sum _{\lambda = e,m} \int \text {d}^{3}\mathbf {r} \int \limits _{0}^{\infty } \text {d}\omega \, \hbar \omega \, \left( \hat{\mathbf{f }}_{\lambda }(\mathbf r ,\omega ) \cdot \hat{\mathbf{f }}_{\lambda }(\mathbf r ,\omega ) + \hat{\mathbf{f }}_{\lambda }(\mathbf r ,\omega ) \cdot \hat{\mathbf{f }}_{\lambda }(\mathbf r ,\omega ) \right) , \end{aligned}$$
(3.2.6)

where the eigenmodes are bosonic creation and annihilation operators obeying commutation relations

$$\begin{aligned} \left[ \hat{f}_{\lambda i}(\mathbf r ,\omega ), \hat{f}_{\lambda ' j}(\mathbf r ',\omega ') \right]&= \delta _{ij}\delta _{\lambda \lambda '}\delta (\omega -\omega ')\,\delta (\mathbf r -\mathbf r '), \end{aligned}$$
(3.2.7)
$$\begin{aligned} \left[ \hat{f}_{\lambda i}(\mathbf r ,\omega ), \hat{f}_{\lambda ' j}(\mathbf r ',\omega ') \right]&= 0. \end{aligned}$$
(3.2.8)

Charge and density operators for the material in the plates, as well as operators for the electromagnetic field, are then expressed in terms of the creation and annihilation operators of the system.

Casimir forces are caused by the stress-energy of the electromagnetic fields in a state of thermal equilibrium. We require the eigenmodes of the system to be in a thermal mixed quantum state. To determine the Casimir force, we consider the ground-state of the system and compute the electromagnetic part of the energy density or stress tensor [18], where the complete stress-energy-momentum tensor of the system is obtained self-consistently from the application of Noether’s theorem to the original action. The Casimir stress tensor is recovered as the expectation value of the electromagnetic component in thermal equilibrium. At zero-temperature, we recover the zero-point Casimir stress, which has the same form as the more general result for the stress tensor in Lifshitz theory, and from which the Casimir forces in the system can finally be determined [16], once the stress tensor has been regularised.Footnote 9

2.3.2 Physical Interpretation

The canonical theory of macroscopic quantum electrodynamics—at least, at present—seems to offer the richest and most general basis for interpreting Casimir phenomena. In macro-QED the materials and the fields are placed on a more equal footing. Physics necessitates the quantization of the fields, and a consistent account of quantised light in media demands a quantum representation of the relevant properties of the material. In the Hamiltonian for macro-QED, both aspects of the system are quantised and coupled. The quantum of this system—that is, its irreducible unit of excitation—is a type of polariton.

This in turn means that the matter is coupled to the vacuum state of the fields. Under the condition of thermal equilibrium,Footnote 10 it follows that the ground-state of the total system, including both the matter and the fields, is endowed with physical properties. To see this, consider the following: the fluctuating currents in one plate only interact with the currents in the other if they communicate with them, and they communicate through the electromagnetic field. At zero temperature, there are no photons between the plates, on pain of violating thermodynamic equilibrium; the electromagnetic field is therefore in its ground state. At zero temperature, therefore, the currents in the plates can only communicate through the zero-point radiation.

There is another sense in which macro-QED places the matter and the fields on an equal footing: the Lagrangian formulation that underlies the action, in which the fields, the material and their interaction are posited, is acausal in this respect: we could view the matter as producing the fields, or we could view the fields as inducing the currents in the matter; the actual physics of the phenomenon does not prioritise either.Footnote 11

3 Three Ontologies of the Casimir Effect

3.1 Semi-classical Ontologies

Broadly speaking, it is possible to characterise the polarisation of opinion (or preference) concerning the Casimir Effect into two ontologically distinct positions in which a certain metaphysical priority is exchanged:

  1. (I)

    a vacuum-field interpretation: there exists an electromagnetic quantum vacuum field, whose properties are modified by material (or topological) constraints, which gives rise to forces between material bodies.

  2. (II)

    a fluctuating material interpretation: there exist distributions of spontaneously polarised material, whose quantum fluctuations give rise to forces between them through the retarded electromagnetic field.

The first account touts the existence of a fluctuating electromagnetic quantum field in its ground-state—the quantum vacuum, a state that is void of particles or quanta, but has the property of an energy available for doing work (for an example of this view, see [3]). This property of the field is modified by the imposition of material (or topological) constraints. In the case of the two parallel plates, the energy of the vacuum is reduced (made more negative) by the motion of the plates towards each other. The attractive Casimir force that results is thus a consequence of the zero-point energy of the vacuum; that is, the energy associated with the fluctuations of the vacuum.

On the second account, it is claimed that one is not required to invoke electromagnetic quantum fluctuations of the vacuum to explain Casimir phenomena, and the Casimir force is essentially reinterpreted as a giant van der Waals effect (for examples of this approach, see [5, 27]). The material in the plates (as opposed to any field between the plates) is subject to quantum fluctuations. These spontaneous disturbances produce field-generating currents within the plates, which interact through the retarded electromagnetic field they have created. These interactions result in a force between the plates.

The seasoned theorist may put it down to a matter of personal taste as to which of the two approaches is preferable, arguing that either position is empirically adequate [28]. The ingeniously contrived path-integral scattering method developed in [29], for instance, obtains two equivalent representations of the Casimir energy, one in terms of fluctuating fields and the other in terms of fluctuating charges.

However, the theories in which these approaches are typically grounded do not enjoy both the consistency and the generality of macro-QED. For example, in addition to offering a canonical quantum mechanical theory of light in media and determining the general stress tensor, macro-QED is being fruitfully applied to the problem of electromagnetic effects resulting from the motion of dielectrics, including the hotly disputed problem of quantum friction, for which there is now a clear and sophisticated answer taking full account of the phenomena of dispersion and dissipation [30, 31]. Moreover, accounts (I) and (II) are ontologically divided insofar as the first requires a quantum vacuum state with physical properties and the second does not. The van der Waals interpretation does not assign any physical properties to the vacuum. Importantly, neither of these interpretations is grounded in both a general and consistently quantum-mechanical description of the interaction of the field with matter; typically, the quantum mechanical-treatment of the problem falls unevenly on one aspect or the other.

3.2 A Dual-Aspect Quantum Ontology

Let us consider instead this third option:

  1. (III)

    a dual-aspect quantum interpretation : there exists a vacuum state of the coupled system of matter and fields, which determines the ground-state properties of the electromagnetic field, giving rise to a force.

On this interpretation, the Casimir force is fundamentally a property of the coupled system of the matter and the fields, in which the interaction between the plates is mediated by the zero-point fields [18, 21]. This interpretation affirms and denies different tenets of interpretations (I) and (II):

First, in common with (I) rather than (II), there is a vacuum state in (III) which has a physical energy and a role to play in the Casimir Effect. That is, despite the absence of photons between the plates, and thermal vibrations within the plates, the walls of the cavity will still experience an attractive force. Contra (I), but in consort with (II), however, the Casimir Effect does not warrant the assignment of physical meaning to the energy of the vacuum state of the field as a ding-an-sich,Footnote 12 or totting up its modes in an enormous contribution to the cosmological constant (it is typically cut off at the Planck scale) [32, 33]. The Casimir Effect offers no justification for quantizing the plane waves of an infinite homogeneous space (which presupposes no coupling to matter) and reifying the ‘zero-point energy’ obtained. There is no force in that case, nor anything for a force to act on. In the quantisation of light coupled to matter, however, the modes of the field are no longer characterised by plane waves. In other words, it is in this state of interaction that we determine the Casimir energy and measure a Casimir force. Regularisation amounts to drawing a perimeter around this interaction.

This is enough clarification for our purposes. We have described the requisite ontology of the Casimir Effect as ‘dual-aspect’, an appelation that is intended to be sufficiently generous to encompass more detailed accounts within the constraints we have discussed. In some sense, the electromagnetic and material aspects of the system are simultaneously present in the vacuum state. However, it is not without the additional structure involved in the details of their interaction that they contribute any actual properties to the system that make contact with observable reality. This irreducible character of the system, in which the ‘whole’ is more than the sum of its ‘parts’, is formally represented in the Hamiltonian (via the action) through the addition of an interaction term. In seeking a more detailed account, perhaps we may take our cue from Heisenberg, who saw the wave function as neither the description of any actual state of affairs, nor merely a convenient calculating device, but as referring to a kind of potentiality [34], and would presumably have described the vacuum similarly.Footnote 13 Perhaps for others, the language of emergence may prove the more useful in relating the different aspects of this problem. These are conceptual problems that call for further discussion elsewhere.

4 Summary Remarks

In a general and consistently quantum-mechanical theory of light in media, the Casimir Effect may be properly described as a force arising out of the ground-state properties of a polariton field—a coupled, quantised system of dielectric material and electromagnetic fields. In its ground state, the system cannot be separated into material or electromagnetic quanta, since none have been excited. Nevertheless, a Casimir force is predicted between the plates.

In interpreting Casimir’s theory, however, the metaphysical emphasis seems to have fallen either on the electromagnetic field or the fluctuating material in the plates. On the first interpretation (I), a universal vacuum field is postulated, in which the ontological role of matter is (sometimes minimally) acknowledged in the form of boundary conditions and (sometimes unconsciously) in the regularisation process. On the second interpretation (II), the bulk material in the plates is prioritised, in which the fluctuating currents generate a field between the cavity that attracts the plates together. The role of the vacuum is void.

However, neither of these interpretations is entirely adequate, and this may lead to false predictions. For example, in opposition to (I), there is no reason to suppose that the energy of the vacuum state in the absence of any coupling is real, or that the Casimir Effect validates the huge contribution to the cosmological constant that this hypostatisation entails. The Casimir force, ironically, should not be seen through the lens of Casimir’s calculation, which is itself without physical application, being at best the limiting case of a more complicated, more abstract, and more consistently quantum-mechanical theory, in which the effects of matter and light are inextricably intertwined.