In Praise of Clausius Entropy: Reassessing the Foundations of Boltzmannian Statistical Mechanics

Weaver, Christopher Gregory

doi:10.1007/s10701-021-00437-w

In Praise of Clausius Entropy: Reassessing the Foundations of Boltzmannian Statistical Mechanics

Published: 17 May 2021

Volume 51, article number 59, (2021)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Foundations of Physics Aims and scope Submit manuscript

In Praise of Clausius Entropy: Reassessing the Foundations of Boltzmannian Statistical Mechanics

Download PDF

Christopher Gregory Weaver^1,2,3

610 Accesses
4 Citations
Explore all metrics

Abstract

I will argue, pace a great many of my contemporaries, that there's something right about Boltzmann's attempt to ground the second law of thermodynamics in a suitably amended deterministic time-reversal invariant classical dynamics, and that in order to appreciate what's right about (what was at least at one time) Boltzmann's explanatory project, one has to fully apprehend the nature of microphysical causal structure, time-reversal invariance, and the relationship between Boltzmann entropy and the work of Rudolf Clausius.

On the Statistical Viewpoint Concerning the Second Law of Thermodynamics—A Reminder on the Ehrenfests’ Urn Model

Einstein’s Approach to Statistical Mechanics: The 1902–04 Papers

Article 08 September 2016

Thermodynamics is more powerful than the role to it reserved by Boltzmann-Gibbs statistical mechanics

Article 24 October 2014

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

"Die Energie der Welt ist constant. Die Entropie der Welt strebt einem Maximum zu."^{Footnote 1}

-Rudolf Clausius (1822–1888)

"For Boltzmann…the probability calculus was primarily a technique for evading paradox; the mechanical approach to gas theory…exemplified by the H-theorem, was always his fundamental tool, the one to which he returned again and again."^{Footnote 2}

-Thomas Kuhn (1922–1996)

"Let us now turn to the second matter in dispute between us. That the majority of students don't understand philosophy doesn't bother me. But can any two people understand philosophical questions? Is there any sense at all in breaking one's head over such questions? Shouldn't the irresistible pressure to philosophize be compared with the nausea caused by migraine headaches? As if something could still struggle its strangled way out, even though nothing is actually there at all?

My opinion about the high, majestic task of philosophy is to make things clear, in order to finally heal mankind from these terrible migraine headaches. Now, I am one who hopes not to make you angry by my forthrightness, but the first duty of philosophy as love of wisdom is complete frankness. Through my study of Schopenhauer, I am learning Greek ways of thinking again, but piecemeal."^{Footnote 3}

-Ludwig Boltzmann’s (1844–1906) letter to Franz Brentano (Vienna, January 4th, 1905)

1 Introduction

With respect to an empirically successful physical theory T, it is believed that one can use T to acquire approximately true descriptions and explanations of phenomena in the world without appreciating T’s historical development if one has and uses a universally accepted formulation and partial interpretation of T (cf. the remarks in [226], p. 923). In contemporary statistical mechanics (SM) for the classical limit, everyone does what is right in their own eyes (borrowing some wording from Judges 21:25). Unlike contemporary Minkowskian special relativity, or classical Maxwellian electrodynamics, there is no generally agreed upon formulation or approach to non-equilibrium or equilibrium SM. Aside from the (a) Gibbsian approach,^{Footnote 4} there are (b) epistemic and information theoretic approaches including some with and some without the Shannon entropy,^{Footnote 5} (c) Boltzmannian approaches with and without ergodicity that use the Boltzmann entropy,^{Footnote 6} (d) stochastic dynamical approaches that modify the underlying classical microdynamics,^{Footnote 7} (e) the Brussels-Austin School,^{Footnote 8} and (f) the BBGKY Hierarchy approach, a type of chimera that includes both Gibbsian and Boltzmannian ideas.^{Footnote 9}

Scientific realism is the view that most of the unobservables that are essential to our best physical theories exist and that most property attributions to the self-same unobservables expressed in statements essential to our best physical theories are at least approximately true. Given realism, the multifarious ways of formulating and interpreting SM should not be brushed off as harmless. Each formulation and interpretation of SM recommends a distinctive scientific ontology. For example, some theories provide competing characterizations or interpretations of quantities like entropy. The Boltzmannians claim that thermodynamic entropy is the Boltzmann entropy (S_B), an objective property of physical systems whose mathematical representative can change its value over time (q.v., n. 11). For many Gibbsians, S_G(ρ) or the Gibbs entropy is thermodynamic entropy.^{Footnote 10}S_G(ρ) is a constant of motion on the assumed classical Hamiltonian mechanics. It is a time-independent function of a density or probability distribution associated with modal system ensembles. Sometimes interpretations of one and the same formulation of SM can imply different scientific ontologies. For example, different assumptions about the interpretation of probability in one and the same approach recommend non-identical scientific ontologies. One might adopt (a) and yet understand the involved probabilities to be propensities that cause relative frequencies. Or one could remain Gibbsian and yet believe that probabilities in the theory just are frequencies solely (i.e., propensities are removed from the interpretation of the formulation of (a)). Who is right? Some will shout the answer: “That theory or approach which enjoys the most empirical success is the theory that is closest to the truth!” (remember that I am assuming realism).

Suppose the exclaimed answer is correct and that (a)–(f) could somehow be empirically distinguished. Let us further suppose that in point of fact, the deliverances of experimentation and scientific observation privilege (c) the Boltzmannian approach. Like many modern promulgators of (a)–(b) and (d)–(f), defenders of standard Boltzmannian SM (BSM) claim to be in the possession of a unique ideological solidarity with those fathers of modern kinetic theory and statistical mechanics that are James Clerk Maxwell (1831–1879), Ludwig Eduard Boltzmann (1844–1906), and Josiah Willard Gibbs (1839–1903).^{Footnote 11} In light of the dizzying array of approaches, proponents of BSM have tried to distinguish their perspective, not just by pointing to their theory’s empirical success, but also by telling a story (the Standard Story) about how Boltzmann came to affirm a combinatorial characterization of entropy (q.v., n. 11) and a statistical statement of the second law of thermodynamics (q.v., Appendix 1).

According to the Standard Story, from 1866^{Footnote 12} to 1877,^{Footnote 13} Boltzmann hoped to provide a purely mechanical justification of the second law, eventually (as of 1872^{Footnote 14}) relying upon his famous minimum theorem (later called the H-theorem). However, Boltzmann’s efforts were met by the famous reversibility objection articulated by his colleague Johann Josef Loschmidt (1821–1895) in 1876.^{Footnote 15} In 1877,^{Footnote 16} Boltzmann repented and turned to his combinatorial arguments wherein was featured combinatorial entropy (q.v., n. 11) and a statistical understanding of the second law. He subsequently (to quote one renowned historian of physics) turned “his attention to other matters, returning…only occasionally, to add a footnote or two to his earlier expositions, or to comment on some other physicist’s discussion…” (M. J. Klein, Ehrenfest [143], p. 108).^{Footnote 17}

In what follows, I challenge the Standard Story while also providing the beginnings of a Boltzmannian approach that stands in true solidarity with Boltzmann’s corpus. Like Hans Christian Ørsted’s (1777–1851) reason for seeking a discovery of the interaction between electricity and magnetism (his Naturphilosophie),^{Footnote 18} or one of Maxwell’s reasons for preferring a field ontology in electrodynamics (viz., that causes must be spatiotemporally local),^{Footnote 19} or one of Albert Einstein’s (1879–1955) reasons for preferring the Lorentzian spacetime of general relativity to the Minkowski spacetime of special relativity (viz., the action-reaction principle),^{Footnote 20} my Boltzmannian outlook is motivated by a metaphysical thesis, a thesis that is friendly to (what I will show in Sect. 5.1 was) Boltzmann’s aim to mechanically explain^{Footnote 21} the process of entropic increase:

(Causal Collisions (CC)): Within the collisions that are quantified over by the hypothesis of molecular chaos (HMC) (Sect. 8.1) and that produce entropic increase thereby making true the Boltzmann equation (Sect. 4) and H-theorem (Sect. 5) are instances of an obtaining fundamental causal relation that is formally and temporally asymmetric. Particular instances of this fundamental relation in evolutions of thermodynamic systems necessitate one-sided chaos and produce the velocity correlations referenced by the (HMC).

I will detail precisely how (CC) does a surprising amount of explanatory work (it earns its keep) primarily by arguing that it enables one to respond to the reversibility objection without having to endorse Boltzmann’s combinatorial arguments.

2 The Maxwell Distribution

Let’s travel back in time to the year 1859 at a meeting of the British Association for the Advancement of Science in Aberdeen Scotland. Motivated by the 1857 Adams Prize that was announced in March of 1855, James Clerk Maxwell has just completed his studies on the rings of Saturn.^{Footnote 22} Those studies involved the utilization of probabilistic reasoning in physics ([126], p. 128, [203], p. 168) as well as reflection upon complex systems of colliding bodies ([125], p. 25). What is more, that reasoning and reflection primed Maxwell for the development of contributions to, and investigations of the early kinetic theory of gases.^{Footnote 23}

Maxwell’s presentation at the aforementioned 1859 meeting is entitled “Illustrations of the Dynamical Theory of Gases” and it will be published in two parts in The Philosophical Magazine a year later.^{Footnote 24} Maxwell’s exposition of proposition IV (found later in ([171], 22–24) includes a heuristic argument for a particular hypothesis concerning the velocity distribution for gas molecules understood as elastic spheres composing a gas at uniform pressure. A velocity distribution for a gas system is a quantitative description of the molecular velocities enjoyed by the constituents of the gas at particular temperatures. Velocity distributions can give one both the average and most probable molecular speeds of constituents of a gas system at various temperatures. From knowledge of average molecular speeds multifarious phenomenological properties can be inferred.

If we glide forward in time to 1867, we’ll find Maxwell at his family estate (Glenlair House) just before he’d become the first Cavendish Professor of Physics at Cambridge. There Maxwell publishes “On the Dynamical Theory of Gases”^{Footnote 25} in the Royal Society’s Philosophical Transactions after admitting the existence of problems with his 1860 theory of gas diffusion revealed in criticisms from Clausius in [74].^{Footnote 26} That paper sharpens some of his 1860 argumentation in that several of the assumptions of the 1860 work are abandoned in favor of more realistic assumptions, although both the 1860 and 1867 projects maintain the spirit of some earlier correspondence between Maxwell and Sir George Gabriel Stokes (1819–1903).^{Footnote 27}

Contrary to the reigning paradigm of thought at the time (especially in the work of Clausius^{Footnote 28}), Maxwell hypothesized that collisions between gas molecules over time do not produce the same or close to the same velocities for every constituent molecule of a gas system, although the molecular kinetic energies are caused by those collisions to equal or closely approach the same value.^{Footnote 29} Rather, over time, collisions produce a distribution of speeds or velocities. All velocities and positions of the molecular constituents consistent with the conservation laws and the system’s total energy are assumed to be nomologically possible as the system evolves.

For Maxwell, collisions are causal phenomena, as are the processes of physical systems that lead to them.^{Footnote 30} The reason why Maxwell believes collisions are causal is because within such processes forces act and those forces are causes.^{Footnote 31} Maxwell includes in the titles of the 1860 and 1867 projects the term ‘dynamical’. This is purposeful. As in his celebrated paper “A Dynamical Theory of the Electromagnetic Field” published in 1865,^{Footnote 32} Maxwell’s approach is dynamical because he’s trying to account for the motions of bodies by appeal to causal forces, except in the case of gas systems he does not involve the causal influences of fields. In December of 1866, Maxwell wrote to Stokes as follows: “I therefore call the theory a dynamical theory because it considers the motions of bodies as produced by certain forces”.^{Footnote 33}

Maxwell is propelled into his particular way of studying gas systems by reading Clausius's [73] memoir “On the Mean Length of the Paths Described by the Separate Molecules of Gaseous Bodies”.^{Footnote 34} He learns that the key to discerning the properties of gas systems is to look to collisions of the constituents of those molecules, which were (as with Maxwell) causal phenomena in the mind and work of Clausius. Clausius believed that around each gas constituent (or center of gravity) is a “sphere of action”^{Footnote 35} determined by the capacity of such constituents to “drive” one another “asunder” (i.e., to repel one another).^{Footnote 36} When a constituent α approaches another constituent β thereby entering β’s domain of repulsive influence or sphere of action, a rebounding effect results from a repulsive force, and both α and β (because of Newton’s third law of motion) change their velocities (given appropriate inertial masses). The acting repulsive force is causal in that it produces its “effects…at very small distances”.^{Footnote 37} In a manner very much dependent upon Clausius, Maxwell’s [174] work maintained that gas systems attain velocity distributions indicative of thermal equilibrium (q.v., Eqs. (0) and (1) below) because of the collisions of their constituents, where again, the collisions were understood by Maxwell to be the causal mechanisms that produce velocity changes.^{Footnote 38} This is all encoded in the underlying mathematics.^{Footnote 39}

Maxwell provided a quantitative statement of his velocity distribution. That is to say, he wrote down an equation for$f\left(\mathbf{v}\right)$ or the average number of molecular constituents in a gas that enjoy a velocity between two limits ($\mathbf{v}$ and $\mathbf{v}+{d}^{3}\mathbf{v}$) subsequent to a great many collisions between similar gas constituents (Maxwell, Part 1 [171], 22). The 1867 statement of $f\left(\mathbf{v}\right)$ takes the general form $f\left(\mathbf{v}\right)=\alpha {e}^{-\beta {u}^{2}}$(where the velocity $\mathbf{v}$ is a three-vector with three Cartesian components v_x, v_y, and v_z, the distribution function $f\left(\mathbf{v}\right)$ is isotropic,^{Footnote 40} α and β are constants, e is Euler’s number (the base of natural logarithms) such that $e\approx 2.71828$, and u is mean velocity). Or more precisely,

(0)
Maxwell’s Distribution Law (Vector Notation):$f\left(\mathbf{v}\right)\propto {\mathbf{v}}^{2}{e}^{-m{\mathbf{v}}^{2}/2kT}$ This function was said to satisfy the relation $f{(\mathbf{v}}_{1})f{(\mathbf{v}}_{2})=f{(\mathbf{u}}_{1})f({\mathbf{u}}_{2})$ for two distinct gas constituents enjoying respective pre-collision velocities ${(\mathbf{v}}_{1})$ and ${(\mathbf{v}}_{2})$, and post-collision velocities ${(\mathbf{u}}_{1})$ and $({\mathbf{u}}_{2})$.

Maxwell’s actual work (which was without modern vector notation) would affirm,
(1)
Maxwell’s Distribution Law:$f\left(v\right)=(\frac{N}{{\alpha }^{3}{\pi }^\frac{3}{2}}){e}^{-(\frac{{v}^{2}}{{\alpha }^{2}})}$(where $N$ is the number of gas molecules, and α² takes a value that is inversely proportional to the gas’s absolute temperature)^{Footnote 41}

Equation (1) (or (0)) implies that the distribution function is asymptotically Gaussian. It was understood by Maxwell to give a velocity distribution for the molecules of a gas in thermal equilibrium. He would try to show that his distribution is stable in the sense that collisions among molecules would not disrupt or otherwise change the distribution’s applicability to select gases in equilibrium. In other words, Maxwell attempted to quantitatively demonstrate that once gas systems achieve equilibrium status they stay there. His attempt failed.^{Footnote 42} Maxwell’s failure notwithstanding, important justifications of Maxwell’s distribution exist (Brush, vol. 2 [47], pp. 187–188), as do modern versions of his reasoning with suitable fixes ([88], pp. 81–84). We can now claim that for an appropriate restricted set of classical gas systems, Maxwell’s distribution is indeed the correct velocity distribution in that it accurately describes the distribution of velocities for those systems in equilibrium. Important experimental confirmation appears in the work of Nobel Laureate Otto Stern (1888–1969) and the 1927 experimentation of John A. Eldridge (b. 1891).^{Footnote 43}

Maxwell attempted to show that his distribution is the only stable distribution under collisions. To do that he used a collision number over some time period of dynamical evolution of the choice gas system (qq.v., n. 38 and n. 39).^{Footnote 44} How Maxwell acquired his collision number thereby attempting to justify his claim regarding stability appears to be mysterious. Numerous commentators have expressed their inability to get past several obscurities and confusions in Maxwell's [174] argumentation.^{Footnote 45} However, everyone seems to agree that his 1860 and 1867 reasoning made use of other assumptions some of which are probabilistic. I have found at least five. I explicate three of them below leaving the last two assumptions about the nature of collisions for Sect. 8.

(2)
The constituents of gas systems are centers of force and can be regarded as what we now call “Maxwell molecules”, i.e., point-like molecules (for all intents and purposes point-masses) or collections thereof that “move about as a single body”^{Footnote 46} that interact by means of central repulsive forces inversely proportional to the fifth power of the distance between them.^{Footnote 47}
(3)
Every direction of particle rebound subsequent to a binary collision is equally probable.^{Footnote 48}
(4)
If (3), then both ((a) all three velocity components of any involved velocity have independent probability distributions (and) (b) every displacement direction is as likely as every other).

Notice that assumptions (3) and (4) are at least in part about probabilities.^{Footnote 49}

Are (2)-(4) good assumptions? Leaving aside Maxwell’s claim regarding fifth powers, no atomist would baulk at (2). Assumption (4) is proven in [174]. But what about (3)? I shall not appraise it. Maxwell already did. He called it a precarious assumption which he believed put his approach in danger of being altogether unrelated to actual world collisions and interactions.^{Footnote 50} I wish to add only that Maxwell’s justification of (3) rested upon the work of Sir John Herschel (1792–1871), specifically Herschel’s unsigned 1850 review of Adolphe Quetelet’s (1796–1874) work on probability in the 92nd volume of the Edinburgh Review [128].^{Footnote 51} This influence is important because we know that Herschel held an epistemic or Bayesian view of probability, maintaining that probabilities are degrees of belief or credences.^{Footnote 52} Herschel’s understanding of probability seemed to have rubbed off on Maxwell for one can clearly see an allegiance to an epistemic interpretation of probability in Maxwell’s corpus.^{Footnote 53} This should not surprise us. Frequentism was the interpretation of choice in the twentieth century, but Bayesianism reigned supreme in physics during the nineteenth century ([5], p. 629). These matters will become important later.

3 The Maxwell–Boltzmann Distribution

From 1868 to 1871, Boltzmann generalized (1) (i.e., the Maxwell distribution) for gas molecules in such a way that he obtained a velocity distribution for systems of gas molecules with internal and gravitational degrees of freedom (the generalizations eventually captured systems of polyatomic gas molecules).^{Footnote 54},^{Footnote 55} Boltzmann’s main result is called the Maxwell–Boltzmann distribution. While Maxwell's [174] distribution took the form: $f\left(\mathbf{v}\right)=\alpha {e}^{-\beta {u}^{2}}$ (as noted above), Boltzmann's [16] distribution took the form: $f\left(\mathbf{v}\right)=a{e}^{-\beta E}$ where $a$ and $\beta$ stand for constants, and $E$ is energy. It’s more full content reads,

(5)
Maxwell–Boltzmann Distribution: $f\left(v\right)=A{e}^{-h(\frac{1}{2}m{v}^{2}+V[x])}$ (where A is the number of molecules such that that amount normalizes f; h is really just 1/kT in modern notation, k is Boltzmann’s constant)^{Footnote 56} or we could just write: $f\left(v\right)=A{e}^{-\frac{E}{kT}}$ (where E is total energy).^{Footnote 57}

Due to objections from Francis Guthrie (1831–1898), Maxwell would himself try his hand at deriving this generalized distribution in [175] and then again in [177].^{Footnote 58}

Boltzmann tried to prove that any distribution (for the types of gases with which he was concerned) would tend towards (5) (uniqueness), given a change in time, but would later (1898) state in volume two of his Lectures on Gas Theory that he could not actually prove its uniqueness ([35], pp. 313–340).

Although Maxwell did not seem to favor calling $f\left(v\right)$ a probability, both he (subsequent to 1867) and Boltzmann interpreted (5) in such a way that it said that the most highly probable energy of a gas molecule takes a value equal to kT, where it is understood that molecules could take on energies with a great many other values (consistent with the total energy values and conservation laws) because the likelihood of such energy assignments is never zero. That the distribution given in (5) represents an appropriate gas in equilibrium and that it gives the unique distribution for such a gas system is generally agreed upon by even modern practitioners of what we now call classical statistical mechanics. It is therefore a bona fide law of classical theory. Some scholars also maintain that Boltzmann’s derivation of (5) is “impeccable” (at least for the non-polyatomic cases), given that the distribution faithfully represents the speeds of molecules in systems at equilibrium ([64], pp. 88, and see also pp. 283–286).

4 die Fundamentalgleichung

After generalizing the Maxwell distribution so as to obtain the Maxwell–Boltzmann distribution, Boltzmann remarked that “[i]t has thus not yet been demonstrated that whatever the state of the gas may have been at the start, it must always approach the limit discovered by Maxwell”.^{Footnote 59} Boltzmann is here concerned with the missing proof of the uniqueness of the distribution function. In (Boltzmann, Further Studies on the Thermal Equilibrium of Gas Molecules [21]), Boltzmann turned to the task of finding an equation (what we would later call the Boltzmann equation) that tracks the evolution of the velocity distribution over time in irreversible processes so as to help reach the missing proof.^{Footnote 60} For cases involving systems with but one species of particle, the Boltzmann equation reads,

(6)
The Boltzmann Equation or Boltzmann’s Transport Equation: $\frac{\partial f}{\partial t}+\mathbf{a}\left(\frac{\partial f}{\partial \mathbf{v}}\right)+\mathbf{v}\left(\frac{\partial f}{\partial \mathbf{r}}\right)={\frac{\partial f}{\partial t}}_{coll.}$, here f is dependent upon time t, position r, and velocity v, and it represents the distribution function describing the gas system’s state and also how that state evolves, $\mathbf{a}$ represents the accelerations of the particles/molecules between their collisions, and the right-hand side of the equation or ${\frac{\partial f}{\partial t}}_{coll.}$ represents the collision produced rate of change of the distribution function f.^{Footnote 61}

Here is an expression closer to the original work,^{Footnote 62}

(7)
$${{\bf{Early\, Boltzmann\, Equation:}} \frac{\partial f}{\partial t}=\int \mathrm{d}{\mathbf{v}}_{2}\int \{f{(\mathbf{u}}_{1})f{(\mathbf{u}}_{2})-f{(\mathbf{v}}_{1})f({\mathbf{v}}_{2})\}\left|{\mathbf{v}}_{1}-{\mathbf{v}}_{2}\right| d\Omega \sigma \left(\Omega \right)}$$

where $d\Omega \sigma \left(\Omega \right)$ is the differential collision cross section “for a collision in which the relative velocity” after the collision is “in the solid angle $d\Omega$ at $\Omega$ compared to the relative velocity before”.^{Footnote 63} The involved integrals are over every possible scattering angle and every possible velocity ${\mathbf{v}}_{2}$ of the collision partner. Function f is the distribution function, velocities u₁ and u₂ are final (post-collision) velocities, and v₁ and v₂ are initial (pre-collision) velocities.

The Boltzmann equation “completely determines the evolution of the distribution f from its initial value”.^{Footnote 64} It says how “the distribution” changes “in time under the action of the collisions”.^{Footnote 65} And if you can correctly solve for f, then with (7) or some form of (6) you’ll obtain all that’s needed to compute thermodynamic phenomenological properties of the appropriate relevant system. Boltzmann would add that the right side of Eq. (7) vanishes such that $\frac{\partial f}{\partial t}=0$ when the distribution function is Maxwell’s, and all other functions tend toward Maxwell’s (uniqueness). Boltzmann also affirmed that the velocity distribution will cease to change once it becomes the Maxwell distribution (stability or stationarity).^{Footnote 66}

The literature on the Boltzmann equation is immense. It has grown large for several reasons. First, it has numerous beneficial applications and uses in modern physics.^{Footnote 67} You can use it to figure out transport coefficients (hence “transport equation”) for heat conduction, gas interdiffusion, and gas viscosity.^{Footnote 68} And it is utilized in contemporary physics for the study of neutron transport as well as plasma systems. Second, the equation is not time-reversal invariant,^{Footnote 69} and the reason why is usually connected to an assumption of the justification of the equation, viz., the HMC or hypothesis of molecular chaos defined and discussed in Sect. 8 below [228]. Third, given several assumptions, including the supposition that the gas system under evaluation is dilute and that its constituents are approximated as hard shells, the Boltzmann equation was derived from the time-reversal invariant equations of motion in classical mechanics by Oscar Lanford III (1940–2013).^{Footnote 70} There’s some question as to how the irreversibility or asymmetry of the distribution evolution emerges in a way (and this has been demonstrated in [211, 212]) that avoids the reversibility objection of Loschmidt discussed in Sect. 8 below.^{Footnote 71} I will have more to say about time-reversal invariance and the emergence of irreversibility shortly. For now, let us turn our attention to Boltzmann’s minimum theorem (i.e., the H-theorem).

5 The H-Theorem

My discussion in this section shall pertain to monatomic gases.

With the Boltzmann equation in hand, Boltzmann thought himself properly equipped for proving the uniqueness of the Maxwell distribution. One can already see how $\frac{\partial f}{\partial t}$ vanishes given that the distribution function is Maxwell’s because that function satisfies the relation: $f{(\mathbf{v}}_{1})f{(\mathbf{v}}_{2})=f{(\mathbf{u}}_{1})f({\mathbf{u}}_{2})$, as I have already noted. Justifying that conditional is not enough to secure uniqueness. One must also show that if $\frac{\partial f}{\partial t}$ vanishes, then the distribution function must be Maxwell’s. To acquire the needed demonstration, Boltzmann introduced the time-dependent functional H (not to be confused with the Hamiltonian).^{Footnote 73} He defined that functional in terms of the distribution function f.

(8)
$$\mathrm{H}\equiv \int f\mathrm{log}\,f\,d\mathbf{v}$$

On the assumption that the Boltzmann equation is omnitemporally true, and the assumption that the time and velocity dependent function f satisfies the Boltzmann equation, it can be rigorously proven that for any time t, the distribution function f is Maxwellian, just in case, $\frac{d\mathrm{H}}{dt}=0$. On the same assumptions (i.e., f satisfies the Boltzmann equation and that that equation is omnitemporally true) it can also be proven that,

(9)
$$\frac{d\mathrm{H}}{dt} \le 0,\; for\; any\; time\; t$$

But it will turn out that the relevant proofs make use of the hypothesis of molecular chaos (HMC) discussed and defined in Sect. 8 below. This was not realized by Boltzmann until sometime after his 1872 and 1875 work.

The conjunction of the above results is called Boltzmann’s minimum theorem or H-theorem ([226], p. 965), cf. ([63], pp. 137–140). The quantity H is a monotonically decreasing function in time unless the velocity distribution is the Maxwell distribution. And so, the theorem helps secure the uniqueness of the Maxwell distribution.^{Footnote 74} As Boltzmann’s 1896 summary of the H-theorem in his Lectures on Gas Theory stated, “[w]e have shown that the quantity we have called H can only decrease, so that the velocity distribution must necessarily approach Maxwell’s more and more closely”.^{Footnote 75}

The preceding discussion pertains to Boltzmann’s H-theorem for monatomic gases. In his 1872 work, he also tried to prove an H-theorem for polyatomic gases (see also (Boltzmann, On the Thermal Equilibrium of Gases on Which External Forces Act [23])). That proof did not fare well, as Lorentz found a problem with Boltzmann’s derivation.^{Footnote 76} Lorentz notes that part of Boltzmann’s derivation of the Boltzmann equation and the H-theorem is a commitment to the existence of reciprocal collisions. Boltzmann appears to assume that if there exists a collision $\left[A,B\right]\to [{A}^{\prime},{B}^{\prime}]$, then there exists an inverse collision (following Lorentz’s way of characterizing sets of velocities of colliding molecules) that is $\left[{A}^{\prime},{B}^{\prime}\right]\to [A,B]$. Lorentz proves that this assumption is false for polyatomic molecules that are non-spherical.^{Footnote 77} He then provides a simplified version of Boltzmann's [21] proof of the H-theorem for monatomic gases. This streamlined proof is later used by Boltzmann in both his Lectures on Gas Theory [35], and his 1887 response to Peter Guthrie Tait (1831–1901) entitled Über einige Fragen der Kinetischen Gastheorie (On Some Questions about Kinetic Gas Theory). Moreover, modern textbooks often choose to follow Lorentz’s proof for the monatomic case when presenting the derivation of the H-theorem for pedagogical purposes ([88], pp. 328–329, [152], pp. 599. nn. 38–39).

Boltzmann would graciously accept Lorentz’s criticism and provide a follow-up proof that made use of cycles of collisions as that which drives H-decrease in the polyatomic cases (q.v., n. 76). His maneuver is both unrealistic and embraced by no one, save Lorentz.^{Footnote 78}

So, Boltzmann’s attempts at proving an H-theorem for polyatomic gas types had problems. Not even his attempted demonstrations of the H-theorem for the monatomic cases are wholly without problems. A decisive and rigorous proof for the monatomic gas type would have to wait until the 1933 and 1957 work of Torsten Carleman (1892–1949).^{Footnote 79} In addition, Carlo Cercignani (1939–2010) taught us that that demonstration has a cousin yielding an H-theorem for polyatomic gases ([64], p. 96). Both Cercignani and Darrigol have proven an H-theorem for polyatomic gas types (ibid., 287–290; [88], pp. 493–496).^{Footnote 80} Thus, for both monatomic and polyatomic gas types, we have an H-theorem. How should we interpret it?

5.1 Interpreting the H-Theorem: Collisions and Causation

Boltzmann’s proposed mechanical explanations of the second law of thermodynamics characterize would systems of colliding gas molecules as systems whose constituents causally interact. There are four reasons why one should accept this interpretation. First, mechanical explanations of natural phenomena for physicists such as Clausius, Maxwell, and Boltzmann are part of (to quote Christiaan Huygens’s (1629–1695) characterization of the mechanical approach, a characterization alive and well during the nineteenth century) “the true Philosophy, in which one conceives the causes of all natural effects in terms of mechanical motions”.^{Footnote 81} In other words, a mechanical explanation just is one involving a report on causes that are mechanical motions inter alia (qq.v., n. 30, n. 31, and n. 33). In Sect. 2, I detailed how this approach to mechanical explanation shows up in the work of Clausius and Maxwell. As I shall demonstrate in Sect. 6, Boltzmann’s H-theorem is part of his attempt to mechanically explain the second law. It is therefore highly likely that by Boltzmann’s lights, the type of explanation of entropic increase the H-theorem offers is a causal explanation.

Second, in part one of his Lectures on Gas Theory, Boltzmann presents a Lorentz-inspired derivation of the H-theorem. In his discussion of value changes of the H-functional, Boltzmann reports that changes in H over a small period of time are “due to…causes” later noting that the changes result from collisions ([35], p. 50, and see also 51–52). This suggests that for Boltzmann, the process of entropic increase is a causal process and that collisions for Boltzmann are causal phenomena.

Third, although Boltzmann seems dismissive of metaphysics (he calls metaphysics a “spiritual migraine”^{Footnote 82}) and whilst he views physical hypotheses as pictures or images of the world that are not directly corresponding truths about it, Boltzmann does consistently interpret all forces (and so those forces at work in collisions) causally ([38], p. 54). That is to say, he believes that the image of the world supplied by physics depicts the world as a place endowed with causal forces. Commenting on Heinrich Hertz’s (1857–1894) 1887 discovery of a form of electromagnetic radiation (i.e., radio waves), Boltzmann causally interpreted the action of the electromagnetic field. He remarked, “…electric and magnetic forces do not act directly at a distance but are caused by changes of state that are propagated from one volume element to the next at the speed of light” ([38], p. 84). At the May 29th, 1886 meeting of the Imperial Academy of Science, and so well before gravitation would be reduced to spacetime curvature by Einstein in 1915, Boltzmann causally interpreted the gravitational force ([37], p. 17). At the same event, Boltzmann provided a causal characterization of pressure. He said that the molecules involved in thermodynamic systems impinge (or strike) “now more now less strongly, now head on now at an angle” maintaining that when the pressure produced by these impinging molecules is at a point “bigger…we shall at once look for an external cause that moves the molecules to flow preferentially to that point” (ibid., 20).

Some of the strongest evidence for my interpretation of Boltzmann comes from his 1899 Clark University lectures. In them Boltzmann describes the evolution of a gravitating system and says in that context that in general “the cause of motion…we call force”, concluding “that at least in this special case acceleration is the decisive feature of force…namely gravity”. ([37], pp. 127–128). Boltzmann added that:

Kirchhoff rejected the notion that it was the task of science to unravel the true nature of phenomena and to state their first and fundamental metaphysical causes. On the contrary he confined the task of natural science to describing phenomena, a stipulation that he still called a restriction.^{Footnote 83}

Boltzmann would connect Kirchhoff’s view to Hertz’s in his 1899 Munich lecture: “nobody has yet pointed out that a certain idea [apparently the same idea articulated in the lines just quoted] in Kirchhoff’s mechanics if followed to its logical conclusion leads directly to Hertz’s ideas” ([37], p. 89). Boltzmann would both reject Kirchhoff’s view ([38], p. 78) and distinguish his understanding of mechanics from that of Hertz by noting that his approach holds on to causal forces, while Hertz’s 1899 Principles of Mechanics in a New Form abandons them completely.^{Footnote 84} He said that “difficulties arise” for Hertz’s approach “as soon as one wants to represent the most ordinary processes of daily experience involving the action of force”.^{Footnote 85} Again, forces are at work in the collisions referenced by the Boltzmann equation. Therefore, according to Boltzmann, so too is causation.

Fourth, in some of Boltzmann’s notes on natural philosophy put together for a lecture to be given on November 23rd, 1904, Boltzmann said, “[i]t is just its own bad luck that changes in velocity must have a cause”, subsequently committing to a view about the relata of causation, i.e., that “[a] thing cannot be the cause of a thing, but merely of change”.^{Footnote 86} Colliding things produce velocity changes, according to the Boltzmann equation and H-theorem, and so these remarks support my reading. Boltzmann believes that the mechanism of velocity change in the process of entropic increase is a causal mechanism.

We can safely conclude that there’s good evidence from Boltzmann’s Lectures on Gas Theory, Boltzmann’s Lectures on Mechanics, his personal lecture notes, and his public lecture content that all supports the thesis that Boltzmann endorsed a causal approach to mechanistically explaining the second law.

5.2 Interpreting the H-Theorem: Applications and Exceptions

As early as his work in 1872 and 1875, Boltzmann recognized that there could be gas systems that have unique initial conditions such that they do not evolve to the Maxwell distribution. This is because such special systems start out precluding certain velocity and/or position values otherwise consistent with the conservation laws and energy totals. He conjectured that perhaps some constraints on very special systems keep their constituents from realizing all possible values consistent with those laws and totals. As Boltzmann himself put matters when discussing a gas confined to a container, “it is possible that only certain, and not all possible positions and velocities can occur in the course of time (e.g., if they were all initially in a line perpendicular to the vessel walls)”.^{Footnote 87} It is an assumption of the velocity/energy distribution approach of Maxwell and Boltzmann that every nomologically possible velocity be realizable by gas constituents. I believe Boltzmann was keen enough to realize the connection between special velocity precluding initial conditions and H-theorem inapplicability. My opinion is that as Boltzmann developed a mechanistic explanation of the second law, he knew of possible systems to which the H-theorem could not be applied. My reading is most, and not too, charitable. It entails that it did not take the articulation of Loschmidt’s reversibility paradox for Boltzmann to come to the realization that some monatomic gas systems escaped H-theorem application.^{Footnote 88} The principle of charity is not all that can be said for the proposed interpretation. It explains why (to quote Cercignani) Boltzmann, “when answering” Loschmidt’s paradox (discussed in Sect. 8 below):

did not indicate that he had changed his viewpoint, or that he had deepened his understanding of the subject, as a consequence of the reflections caused by the [reversibility] objection that had been raised against him, but acted as if he were simply re-elaborating his old ideas.^{Footnote 89}

The fact that such a report is correct has perplexed Boltzmann scholars.^{Footnote 90} There exists a challenge to render that report consistent with Boltzmann’s judgment that deriving the H-theorem amounts to rigorously proving “that whatever the distribution of live force [kinetic energy] may have been at the beginning [initial time], subsequent to a very long time period it must always approach that [one] found by Maxwell”.^{Footnote 91} The best way to introduce coherence and consistency here is to insist that even in 1872 and 1875 Boltzmann was aware of systems that did not approach Maxwell’s distribution on account of the unique initial conditions they enjoyed (agreeing in part with [7, 235]) though I am not claiming that Boltzmann’s H-theorem project was always statistical in the sense that at least Badino seems to have in mind). The necessity of approaching the Maxwell distribution rests upon the assumptions and antecedent of the H-theorem. As I’ve said several times now, one of the relevant assumptions is that the Boltzmann equation concerns all nomologically possible velocity and position values, i.e., all those that satisfy the laws of conservation ([88], p. 173 although I disagree with Darrigol’s presumption at n. 37). The cases that admit exceptions to the general claim that H always decreases, or that minus-H (where minus-H is proportional to entropy) always increases are cases that prohibit some velocity and position values.

5.2.1 Maxwell’s (But Really Thomson’s) Demon and Loschmidt’s Exorcism

My reading is controversial. Let me add further lines of support. Boltzmann’s proof of the H-theorem appears after the articulation of the Maxwell “demon”^{Footnote 92} case, a case well-known for the trouble it produces for any non-statistical and exceptionless statement of the second law. Maxwell discussed it for the first time in a letter to Tait, dated December 11th, 1867,^{Footnote 93} restating it in several places including the appendix to his 1871 book Theory of Heat. It is there that he supposes that there’s a container filled with air that possesses uniform pressure and temperature (the system is in equilibrium). The container is divided into two sides. The two sides of the container are labeled A and B, and the division between them is wrought by a diaphragm with a large hole in it. Over the hole is a sliding plate with very small mass that is controlled by “a being whose senses are so acute that he can see every molecule of the air, at least when it is near the hole”.^{Footnote 94} Maxwell says the being follows the command: Open the plate over the hole when a molecule possessing “more than the mean velocity” in compartment A moves near the hole (n. 94). This allows for faster molecules to move into compartment B. The plate is to remain closed for all other molecules, although when in compartment B, a slower (i.e., slower than the mean velocity) molecule draws near the opening, the plate is to be opened allowing that molecule to pass from B to A. Maxwell infers that compartment B will begin to enjoy an increase in the mean velocities of its inhabitant molecules, while compartment A will enjoy a decrease of mean velocities. These changes all obtain without the expenditure of work. As a result, “the [non-statistical and exceptionless] second law of thermodynamics is no longer true”,^{Footnote 95} and “[t]he 2^nd law of Thermodynamics has the same degree of truth as the statement that if you throw a tumblerful of water into the sea you cannot get the same water out again”.^{Footnote 96}

Great. There were reasons to abandon a non-statistical and exceptionless statement of the second law before Boltzmann’s proof of the H-theorem. But why should one think that Boltzmann was aware of those reasons when he tried to prove the H-theorem? Loschmidt articulated a Maxwell- “demon”-like case (without a demon) in 1869.^{Footnote 97} Boltzmann would have known of Loschmidt's version of the case since they were colleagues (q.v., n. 97). Indeed, Boltzmann responds to Loschmidt, providing a version of the case that resembles Maxwell’s. He wrote,

When for instance a gas at constant temperature is divided into two halves by a separating wall with a small hole on it, it would be possible to bring in front of the hole a contraption that guides the faster molecules preferably into one half and the slower ones preferably into the other half, which would contradict the second law.^{Footnote 98}

5.2.2 The Reversibility Objection Before Loschmidt

I’ve argued that Boltzmann showed an awareness of the real possibility of the existence of gas systems to which the H-theorem could not be applied. My evidence for this resides beside the articulation of Boltzmann’s original proofs of the H-theorem in his 1875 work. My second reason for maintaining that Boltzmann knew of failures of H-theorem application to even monatomic gas systems had to do with his awareness of a Maxwell “demon” case prior to 1872. I now add that the reversibility objection (q.v., Sect. 8 below) was articulated and probably known to Boltzmann before his proof of the H-theorem. This fact constitutes my third justification for believing that Boltzmann knew of H-theorem inapplicable systems before at least 1875. Notice that all three justifications support the further claim that Boltzmann was aware of nomologically possible violations of the non-statistical version of the second law prior to the 1875 publication of the H-theorem.

The already referenced 1867 letter from Maxwell to Tait included a penciled annotation by Thomson which read, “Very good. Another way [to violate the non-statistical second law of thermodynamics] is to reverse the motion of every particle of the Universe and to preside over the unstable motion thus produced”.^{Footnote 99} The next year Maxwell said there was an apparent conflict between the existence of irreversible processes and the reversibility of all motion.^{Footnote 100} Two years later, Maxwell would write to Strutt arguing that reversing all of the motions (i.e., velocities) of every particle in the universe would “upset the 2^nd law of Thermodynamics”.^{Footnote 101}

Of course, Boltzmann was quite far removed from Scotland and probably did not have access to the correspondence of Maxwell, Tait, or Thomson. Moreover, I can find no evidence that he was aware of the reversibility objection before the publication of [21]. However, there is evidence that Boltzmann knew the work of Thomson, taking it seriously enough to cite it. For example, Boltzmann cited Thomson’s work with Tait on the principle of least action in hydrodynamics at (Boltzmann, “On the Compressive Forces” [19]). In [22], Boltzmann cites Thomson once again, although this time on work related to electricity. We also know that much later, Boltzmann corresponded with Thomson in 1892 and 1893, answering several of Thomson’s objections to Boltzmann’s kinetic theory from energy dissipation.^{Footnote 102} I therefore think it is likely that Boltzmann read Thomson’s reversibility objection in Nature, published in 1874. We know Boltzmann read Nature because Boltzmann published in it several times.^{Footnote 103} Why is this important? Because (again) in 1875, Boltzmann tried to prove his H-theorem in much the same way he did in 1872. However, this time, he’d seek to make his earlier work known to a broader audience ([88], p. 172). His remarks regarding the non-statistical nature of the second law as viewed through the lens of the H-theorem are univocal:

[We] have so far proceeded as follows: [we] have shown that the quantity H cannot increase during the evolution of the state of the gas; wherefrom [we] have concluded that it must be constant in the case of equilibrium since it evidently cannot constantly decrease in this case. I was thus able to derive the definitive equations that lead to the equilibrium distribution of states. This suggests that the value that H takes in the case of equilibrium is the smallest of all the values that H can take in agreement with the conservation of the total number of atoms and the conservation of the total live force.^{Footnote 104}

If Boltzmann was aware of the reversibility objection before 1875, why would he assert the above if reversibility worries should cause him to abandon deterministic or non-statistical statements of minus-H increase over time? I believe the best charitable response to this query should not argue that Boltzmann had a statistical interpretation of the H-theorem all along (contra [7], q.v., my arguments against this in Sect. 5.3 below). Rather, the correct response suggests instead that Boltzmann believed that while the deterministic or non-statistical statement of the second law of thermodynamics admits exceptions, at least some of those exceptions (if not all of them) have to do with systems to which the H-theorem cannot be applied. However, if the H-theorem does apply to a system out of equilibrium, minus-H must increase monotonically for all time until equilibrium is obtained. There it will remain given that the Boltzmann equation holds for all time (anticipating here worries about fluctuations and recurrence about which I will say more in a part two essay). You see, proof of the H-theorem amounts to proof of a deterministic and non-statistical second law suitably restricted to systems that satisfy the antecedent of the H-theorem. Of course, my reading assumes that Boltzmann strongly associates H (or minus-H) with entropy. Let me now say more about that association in Boltzmann’s corpus.

5.3 Interpreting the H-Theorem: $-\mathbf{H}$ and Entropy

Boltzmann’s 1872 interpretation of the H-theorem is given in the following often-quoted passage which I will call PERICOPE,

It has thus been rigorously proved that, whatever may be the initial distribution of kinetic energy, in the course of a very long time it must always necessarily approach the one found by Maxwell [notice that he’s speaking here in terms of energy changes]. The procedure used so far is of course nothing more than a mathematical artifice employed in order to give a rigorous proof of a theorem whose exact proof has not previously been found. It gains meaning by its applicability to the theory of polyatomic gas molecules. There one can again prove that a certain quantity [H] can only decrease as a consequence of molecular motion, or in a limiting case can remain constant. One can also prove that for the atomic motion of a system of arbitrarily many material points there always exists a certain quantity which, in consequence of any atomic motion, cannot increase, and this quantity agrees up to a constant factor with the value found for the well-known integral $\int\frac{dQ}{T}=0$ in my paper on the ‘Analytical proof of the 2nd law, etc.’. We have therefore prepared the way for an analytical proof of the second law in a completely different way from those previously investigated. Up to now the object has been to show that $\int\frac{dQ}{T}=0$ for reversible cyclic processes, but it has not been proved analytically that this quantity is always negative for irreversible processes, which are the only ones that occur in nature. The reversible cyclic process is only an ideal, which one can more or less closely approach but never completely attain. Here, however, we have succeeded in showing that $\int\frac{dQ}{T}$ is in general negative, and is equal to zero only for the limiting case, which is of course the reversible cyclic process (since if one can go through the process in either direction, $\int\frac{dQ}{T}$ cannot be negative).^{Footnote 105}, ^{Footnote 106}

PERICOPE suggests that minus-H is entropy, and that H is equal to minus entropy.^{Footnote 107} It implies that the H-theorem is Boltzmann’s attempt to ground the second law of thermodynamics in mechanics or microdynamics.^{Footnote 108} But why does Boltzmann use minus-H and $\int \frac{\delta Q}{T}$ (which I will call the Clausius integral) interchangeably? The question is perplexing because the entropy of Boltzmann's [21] and 1875 work is that of a closed system, while the Clausius integral has to do with heat Q being exchanged with a system at absolute temperature T. To properly answer this question, we must first understand Rudolf Clausius’s theory of entropy.

6 Clausius Entropy

6.1 The Clausius Integral and Entropy

Clausius associated the entropy of physical systems,at least in certain contexts, with the Clausius integral in his 1865 memoir,^{Footnote 109} where the differential $\delta Q$ is a differential form representative of exchanged heat or infinitesimal heat transfers into the system as part of a reversible process. The source of the heat is some external system enjoying absolute temperature T. For reversible cyclic evolutions, we have:

(10)
$$\oint \frac{\delta Q}{T}=0$$

$\frac{\delta Q}{T}$ is therefore an exact differential. Clausius likewise tried to capture what he understood to be irreversible processes involving systems that evolve from one equilibrium state to another equilibrium state in a manner that could be reversed by means of a reversible transformation. While the relevant transformations in the cycle obtain, the inequality given by (11) holds true:

(11)
$$\int \frac{\delta Q}{T}<0$$

The total entropy of the global system that includes external sources for the needed heat exchange was said to increase in entropy (hence, an irreversible process). The entropic increase on account of a reversible transition of a system from one equilibrium state S₁ to a distinct equilibrium state S₂ is less than the entropy increase on account of an irreversible transition from one equilibrium state S₃ to a distinct equilibrium state S₄.^{Footnote 110} In ([79], p. 5), Clausius would add (12) below understanding the conjunction of it with (10) as an expression of the two fundamental equations of thermodynamics.

(12)
$$\delta Q=dU+\delta W$$

where U is energy^{Footnote 111} and $\delta W$ represents the infinitesimal process of work done while heat is transferred.^{Footnote 112}Q and W are here path dependent quantities, while U, like entropy S, is a path independent state function or property. Clausius would also add that,

(13)
$$\frac{\delta Q}{T}=\frac{dH}{T}+dZ$$

whereH and Z are both state functions (the latter by stipulation really), H being the heat of the body/system although he would famously reduce that notion to vis viva (kinetic energy) thereby reducing it to motion,^{Footnote 113} and Z being disgregation. More on these two quantities soon. Reflect, for now, on the fact that because (13) holds, and because Clausius was strongly associating entropy with $\frac{\delta Q}{T}$, in 1865, Clausius affirmed that:

(14)
$$dS=\frac{dH}{T}+dZ$$

And because (13) holds, we have shown precisely how Clausius strongly associates entropy with heat exchange. Indeed, one popular way of charactering Clausius’s understanding of the second law is as follows:

Heat cannot pass spontaneously from a body of lower temperature to a body of higher temperature.^{Footnote 114}

Clausius himself remarked under a section entitled “New Fundamental Principle concerning Heat”, “Heat cannot, of itself, pass from a colder to a hotter body,”^{Footnote 115} and as early as 1854, Clausius declared that he had found an analytic “expression of the second law” for reversible cyclic processes in Eq. (10).^{Footnote 116} These equations and remarks from Clausius flirt with Maxwell’s view of entropy and heat exchange in his Theory of Heat. Maxwell said, “when there is no communication of heat this quantity [entropy] remains constant, but when heat enters or leaves the body the quantity increases or diminishes”.^{Footnote 117} I believe it is a mistake to attribute Maxwell’s position to Clausius—Clausius’s strong association of entropy with the relevant integral notwithstanding—for two reasons. First, in 1865, Clausius’s statement of the second law referred to the entropy of the universe. The universe, however, is a closed isolated system that does not exchange heat with some other body. Moreover, he says that the entropy of the cosmos increases to a maximum. Not all subsystems of an entropy increasing cosmos such as ours will be in equilibrium. Thus, one cannot explain the increasing entropy of our cosmos by appeal to equilibrium subsystems exchanging heat with their environments.

Second, Maxwell’s statement was experimentally falsified by the Gay–Lussac–Joule experiment,^{Footnote 118} and there are good reasons to believe Clausius was aware of the relevant experimentation, for Clausius was familiar with the work of Joseph Louis Gay-Lussac (1778–1850), having discussed his work several times in The Mechanical Theory of Heat. He was also familiar with Joule’s experimentation, citing his results authoritatively throughout the same work.

6.2 The Complete Nature of Clausius Entropy

What then is the complete and fully general nature of entropy according to Clausius? He clearly had some more general conception, for again, in one place he characterizes the second law of thermodynamics as the principle that “[t]he entropy of the universe tends to a maximum”,^{Footnote 119} christening it (i.e., the law itself) with fundamental status.^{Footnote 120}

Clausius’s seminal work on thermodynamics was his 1864 Abhandlungen über die mechanische Wärmetheorie (Treatises on the Mechanical Theory of Heat) which was later (1876) to become a more “connected whole” under the title The Mechanical Theory of Heat (I have worked and will work with the 1879 English translation).^{Footnote 121} In that work, the entropy of a thermodynamic system is not as fundamental as that system’s energy U suitably understood ([84], p. 195). I include the qualification because in the 1867 (first English edition) of The Mechanical Theory of Heat [80], Clausius agreed with William Rankine (1820–1872), maintaining that small changes of energy are given by the sum of small changes to H and small changes to internal work (I) due to internal molecular forces, and so, $dU=dH+dI$.^{Footnote 122} In the improved and transmuted 1879 edition, U is “[t]he sum of the Vis Viva and of the Ergal…”,^{Footnote 123} but Ergal (J in Clausius’s work) is just potential energy (first coined by Rankine).^{Footnote 124} The type of energy involved here appears to be both kinetic and potential energy. And this notion of energy (U) is one and the same as that in the work of Thomson. Clausius related his conception to Thomson’s as early as 1866 ([79], p. 5).

More should be said because in the 1867 edition of The Mechanical Theory of Heat, and the original work behind it, there exists a more fundamental quantity lurking beneath entropy, viz., disgregation. Disgregation (Z in Clausius’s equations) is a quantity increased by heat. In fact, for Clausius, heat causally produces work as in [61] and it does this by increasing disgregation.^{Footnote 125} Disgregation itself is “the degree in which the molecules of a body are dispersed”.^{Footnote 126} In this earlier edition of Clausius’s major work, entropy, or infinitesimal changes thereof, was/were specified by appeal to, inter alia, disgregation or infinitesimal changes thereof. We already expressed the mathematics that encodes these facts via Eq. (14). As I’ve already said, the quantity H in that equation is the heat in the system, but by this Clausius meant vis viva or kinetic energy of molecular motion, as Maxwell ([176], p. 258) pointed out criticizing Tait’s misreading of Clausius.^{Footnote 127} This treatment (i.e., (14)) of entropy was lambasted by both Maxwell and Tait, the former providing the clearer and more measured response of the two.^{Footnote 128} As I already foreshadowed, Clausius removes disgregation from the later 1879 edition of The Mechanical Theory of Heat. But even if we keep Z in Clausius’s framework, the principle cause of the increase of disgregation is molecular motion responsible for increasing dispersion, and that motion can be understood in terms of kinetic energy. Thus, U (which for Clausius you’ll recall is Vis Viva and Ergal) resides beneath disgregation (i.e., it is more fundamental than disgregation) which is related to entropy in the way (14) suggests.

Having discovered Clausius’s mature view on the status of entropy, energy, and disgregation, in the hierarchy of being, we should now answer the question: What, according to Clausius, is entropy? Entropy is that property of physical systems that tracks (or serves as a measure of) the processes of energy transformation, often, though not always associated with heat exchange ([58], p. 273). He wrote,

We might call ‘S’ the transformational content of the body, just as we termed the magnitude ‘U’ the thermal and ergonal content. But as I hold it better to borrow terms for important magnitudes from the ancient languages so that they may be adopted unchanged in all modern languages, I propose to call the magnitude S, the entropy of the body, from the Greek word $\tau \rho o\pi \eta$, transformation.^{Footnote 129}

For Clausius, there “is a natural bias in the distribution of energy and in the direction which energy changes tend to take. Entropy gives us a measure of this bias in the case of material bodies or systems of bodies”.^{Footnote 130}

6.3 Boltzmann and the Clausius Entropy

There can be no doubt that Boltzmann knew Clausius’s work on entropy, and that his understanding of entropy was, in some places, one and the same as that of Clausius. For example, Boltzmann’s second published paper was “On the Mechanical Significance [Meaning] of the Second Law of Heat Theory”.^{Footnote 131} There, perhaps under the influence of [200], Boltzmann tries to provide an extension of Clausius’s earlier conception of entropy to systems that feature molecules that enjoy periodic motions assuming all the while many aspects of Clausius’s early kinetic theory of gases. That it is Clausius’s concept of entropy that Boltzmann is extending is well justified by the fact that in 1871, Clausius would provide the same extension of his (i.e., Clausius’s) concept to periodic motions [81], realizing after Boltzmann’s rebuke, that Boltzmann’s extension or generalization of Clausius’s concept came before Clausius’s own generalization in [82].^{Footnote 132} Furthermore, $dS=\frac{\delta Q}{T}$ holds in both of the aforementioned 1871 papers by Boltzmann and Clausius. Of course, this is the case in Clausius’s earlier work too.^{Footnote 133}

Later review and correspondence included at least two concessions by Boltzmann. Clausius in [81] accomplished something that Boltzmann in [15] failed to. Clausius provided the more accurate mathematical characterization of entropy, and [15] had ignored changes in the potential.^{Footnote 134} These concessions suggest that Boltzmann took Clausius’s work on entropy quite seriously. But one might now ask:

(Maxwell’s Question): Why would Boltzmann worry about priority in this context [Boltzmann rebuked Clausius for reproducing Boltzmann’s earlier work] when Boltzmann had already begun exploring the Maxwellian distribution-based approach to thermodynamics and statistical mechanics?

I call this Maxwell’s question because in an 1873 letter to Tait, Maxwell would ponder a similar query, expressing his astonishment about the continued interest in mechanical approaches to the second law. He wrote,

It is a rare sport to see those learned Germans contending for the priority of the discovery that the 2nd law of [thermodynamics] is the [Hamilton Principle]… the [Hamilton Principle], the while, soars along in a region unvexed by statistical considerations, while the German Icari flap their waxen wings in nephelococcygia [i.e., cuckoo land] amid those cloudy forms which the ignorance and finitude of human science have invested with the incommunicable attributes of the Queen of Heaven.^{Footnote 135}

The answer to Maxwell’s question is simple. Boltzmann cares about mechanical justifications of the second law and regards his H-theorem as a means whereby he can attain just such a mechanical justification. That is why in PERICOPE he speaks, albeit rather imprecisely and somewhat clumsily, of entropy as if it is represented by the Clausius integral. What Boltzmann is actually attempting to do is connect minus-H with Clausius’s energy transformational notion of entropy, the type of entropy which Clausius believed had a mechanical explanation. Boltzmann’s H-theorem should be viewed as the fulfillment of Clausius’s vision. The H-theorem does in fact provide a mechanical explanation of how energy tends to transform and, more derivatively, how entropy tends toward a maximum.^{Footnote 136}

That Boltzmann is borrowing Clausius’s transformational conception of entropy in his work on the H-theorem is a conclusion other scholars have reached. Olivier Darrigol and Jürgen Renn state,

Boltzmann…noted that the value of $-\mathrm{H}$ corresponding to Maxwell’s distribution was identical to Clausius’s entropy. For other distributions, he proposed to regard this function as an extension of the entropy concept to states out of equilibrium, since it was an ever-increasing function of time.^{Footnote 137}

But I go further than Darrigol and Renn because there is an additional inference to make. Because the H-theorem has to do with Clausius entropy, and Clausius entropy is defined in terms of energy, tracking how energy transforms over time, we can agree with noted historian Stephen G. Brush when he says that “the H-theorem is a microscopic version of the general principle of dissipation of energy proposed by Kelvin in 1852, and reformulated by Clausius in 1865 in the phrase, ‘the entropy of the universe tends to a maximum.’”^{Footnote 138} In other words, we can affirm that according to both Clausius and Boltzmann’s deep conception of entropy, entropy is that quantity which tracks the way energy is changing or transforming over time. This deep conception of entropy understood as a quantity that tracks energy can be used to describe thermodynamic systems both in and out of equilibrium. The H-theorem not only helps facilitate such descriptions it also helps provide a mechanistic explanation of entropic increase and stability after equilibrium is reached.

Understanding Boltzmann’s H-theorem as a theorem about a type of entropy that tracks transformations of energy is not new. Glimmers of that interpretation of Boltzmann appear in the work of Edward P. Culverwell’s (1855–1931) 1890 article on Boltzmann’s kinetic theory. Culverwell explicates Boltzmann’s characterization of a gas in equilibrium as a system enjoying a status which entails that that gas’s “energy is equally distributed among all…[its] degrees of freedom”.^{Footnote 139} Moreover, this precise entropy-energy connection is used in the contemporary practice of thermodynamics. As Klein and Nellis put it in their recent textbook on thermodynamics, “the property entropy is introduced in order to quantify the quality of energy”.^{Footnote 140}

7 The Probabilistic Interpretation of the H-Theorem

Given the reasoning of Sect. 6, it may be difficult to see room for the probability calculus and its accompanying interpretation since the explanation the H-theorem affords is mechanical. According to Sect. 5.1, for Boltzmann (as for Clausius and Maxwell) a mechanical explanation is one that explains features of a gas system SYS by appeal to the causal behavior of subsystems of SYS. As in Maxwell’s statistical mechanics, probability does enter Boltzmann’s reasoning. It does so in a way that manifests an epistemic view of probability. According to Boltzmann, we don’t know the precise state of the molecules of SYS, so we cook up our best understanding of how they are dancing (a statistical distribution law) and then, given that assumption, we look to mechanical features to see how the subsystem’s velocities are changing as they approach the state described by the distribution law. The approach to that state, as well as the mechanism whereby the system remains in that state has directly to do with causal influences among the subsystems of SYS. Appreciating the velocity changes due to causal influences revealed in the collisions that push a non-equilibrium system like SYS toward equilibrium is the means whereby we appreciate how the system’s entropy is changing over time. Our entire methodology is always removed from the precise details about actual world goings-on because the way we are modeling the involved causal influence through velocity change is through equations about how the distribution function itself changes over time. Our best efforts can only ever be approximate, and time has told us (or, from Boltzmann’s perspective, will tell us) that statistical hypotheses coupled with the right equations (i.e., the Boltzmann equation and a statement of the H-functional) bear fruit and aid us in our efforts to save the phenomena.

Obviously, the aforementioned statistical hypotheses include quantitative statements of equations revealing the contents of distribution functions like the Maxwell and Maxwell–Boltzmann distribution laws. That these laws were understood by Boltzmann to give probabilities is evidenced by both Boltzmann’s interpretation and reinterpretation of the Maxwell distribution in (Boltzmann, “Studies on the Equilibrium of Live Force Between Moving Material Points” [16]),^{Footnote 141} and in Boltzmann’s first attempt at proving the H-theorem in 1872. Beside his 1872 attempted proof is an agreement with Maxwell. Probabilistic methods are required (quoting Darrigol’s careful exegesis) “in order to deal with highly irregular processes involving a huge number of molecules. The irregularity and the law of large numbers explain the stability of macroscopic averages”.^{Footnote 142}

Does the admission of probabilistic resources into the H-theorem project mean that there are exceptions to the theorem? That is a tricky question. Theorems are necessarily true, if true. Given that the antecedent of a theorem is satisfied, the consequent is strictly implied. There are no exceptions to the H-theorem in one important sense then. But while Boltzmann does show, for systems satisfying the antecedent of the theorem, that the distribution of a gas system SYS at a time t causally depends (the Boltzmann equation is a deterministic equation) upon its distribution at some prior time, (as I’ve said before) the notion of a distribution itself is statistical or probabilistic.^{Footnote 143} There is no guarantee that nature always gives us systems fit for the assumption that the distribution used is appropriate (q.v., my discussion along these lines earlier in Sect. 5.2), and it is for this reason that I (and more importantly Boltzmann) have already said that there can be systems that do not evolve in the way demanded by the Boltzmann equation and the H-theorem.

7.1 Loschmidt’s Reversibility Objection: Articulated

Perhaps there is a way of more direclty objecting to the use of the H-theorem in attempts to mechanically explain appearances. Recall that the antecedent of the H-theorem is the conjunction that identity (8) holds, the Boltzmann equation is omnitemporally true, and the distribution function f satisfies that equation.^{Footnote 144} We can now appreciate the question: what if we could find a system that satisfied the antecedent of the theorem but which did not have a distribution that tended toward the Maxwell distribution? An objection along these lines was voiced by Loschmidt [169]. His objection (the reversibility objection) began with the correct assumption that the laws of classical mechanics (specifically Hamiltonian mechanics) are time-reversal invariant and that therefore the evolutions involving the increase of the minus-H function can be turned around resulting in evolutions involving a decreasing minus-H function. These minus-H function decreasing evolutions are perfectly consistent with the underlying laws of Hamiltonian mechanics because those laws are time-reversal invariant. If, however, the mechanics drives the minus-H function increase, then how can minus-H decrease be driven by the same mechanics? The reversed evolutions contradict the H-theorem. This is the reversibility paradox.^{Footnote 145}

7.1.1 The H-Theorem Untarnished in Boltzmann’s Eyes

My response to Loschmidt’s famous objection will come later (Sect. 8). For now, I point out that contrary to what the Standard Story would have us believe, after Boltzmann engaged with Loschmidt’s reversibility objection, he continued to positively affirm the H-theorem as a means whereby one achieves insight into the deterministic and exceptionless increase of entropy for systems that satisfy the antecedent of the H-theorem. He continued on in this way late into his career, viewing his H-theorem as the mechanical justification of the second law, even after articulating his combinatorial definition of entropy and his combinatorial arguments for a statistical statement of the second law. The H-theorem was viewed by him to be a more fundamental justification of the second law, one which the combinatorial arguments illustrate.^{Footnote 146} There are four reasons in favor of this understanding of Boltzmann’s work.

First, in Boltzmann’s very reply to Loschmidt, Boltzmann affirms that “the existence of microstates for which the entropy decreases does not contradict the general endeavor to deduce the entropy law from atomistic considerations” ([88], 198 my emphasis). The very section immediately following Boltzmann’s reply to Loschmidt is entitled “Comments on the Mechanical Meaning of the Second Law of Heat Theory”. There Boltzmann invests time and energy discussing the mechanical justification of the second law, never giving up on it.

Second, as late as Boltzmann’s first volume of the Lectures on Gas Theory (1896), Boltzmann says, “if at the beginning of some time interval [the value of the distribution function] is on the average the same at each position in the gas…, the same will hold true at all future times”.^{Footnote 147} And he would add, “the quantity we have called H can only decrease”.^{Footnote 148} In the second volume of the Lectures on Gas Theory published in 1898, after churning through several points in a proof sketch, Boltzmann concluded, “[s]ince the same holds for all other kinds of molecules, and similarly for collisions of different molecules of the same kind with each other, we have proved that in this special case the value of H can only decrease as a result of collisions”.^{Footnote 149} Undoubtedly, throughout Boltzmann’s corpus, the way he views the H-theorem and its implications is “predominantly deterministic”.^{Footnote 150}

Third, when Boltzmann was writing his Lectures on Gas Theory, he stopped midway through. Why did he do this (volume 1 was published in 1896, volume 2 in 1898)? He did it because he thought it necessary to author a treatise on mechanics because his gas theory was/or should be grounded (he believed) in mechanics. So, he published volume one of his treatise on mechanics in 1897. As he says in his Lectures on Gas Theory, the atomistic approach to the physics of matter provides the best mechanical explanation of phenomena ([35], pp. 26–27). This was no isolated supposition in Boltzmann’s more general corpus. As Jungnickel and McCormmach ([136], p. 191) state, "Boltzmann presented mechanics as the foundation of all theoretical physics."^{Footnote 151}

Fourth, Boltzmann’s general physical methodology distinguished between mechanical principles or laws, hypotheses, and the world. The laws are those of classical Hamiltonian mechanics. Hypotheses are principles like the second law of thermodynamics. Laws or mechanical principles are tested by the confirmation or disconfirmation of the hypotheses they entail. Hypotheses are confirmed in conjunction with the mechanical laws from which they follow.

…neither the Theory of Gases nor any other physical theory can be quite a congruent account of facts…Certainly, therefore, Hertz is right when he says: ‘The rigour of science requires, that we distinguish well the undraped figure of nature itself from the gay-coloured vesture with which we clothe it at our pleasure.’ But I think the predilection for nudity would be carried too far if we were to forego every hypothesis. Only we must not demand too much from hypotheses…^{Footnote 152}

He continued:

Every hypothesis must derive indubitable results from mechanically well-defined assumptions by mathematically correct methods. If the results agree with a large series of facts, we must be content, even if the true nature of facts is not revealed in every respect. No one hypothesis has hitherto attained this last end, the Theory of Gases not excepted. But this theory agrees in so many respects with the facts, that we can hardly doubt that in gases certain entities, the number and size of which can roughly be determined, fly about pell-mell. Can it be seriously expected that they will behave exactly as aggregates of Newtonian centres of force, or as the rigid bodies of our Mechanics? And how awkward is the human mind in divining the nature of things, when forsaken by the analogy of what we see and touch directly?^{Footnote 153}

This is a hypothetico-deductive method that includes the humble assertion that nature may not conform perfectly to our hypotheses and mechanical laws. According to this method, hypotheses like the second law must follow from mechanics.

While Boltzmann (Certain Questions [29]) provides me with some ammunition for my reading, the same source could be interpreted as completely taking it away:

It can never be proved from the equations of motion alone, that the minimum function H must always decrease. It can only be deduced from the laws of probability, that if the initial state is not specially arranged for a certain purpose, but haphazard…the probability that H decreases is always greater than that it increases.^{Footnote 154}

This quotation gives my exegetical project the most serious kind of trouble. In it, Boltzmann admits to being unable to recover the H-theorem from mechanical considerations and suggests that the relationship between the antecedent of the theorem and its consequent is a probabilistic relation. I find that this series of remarks contains elements that are false, and worse, nonsensical. Again, the antecedents of theorems entail their consequents, and yet Boltzmann is quite clearly allowing for cases in which the consequent fails to follow from satisfaction of its antecedent. That is nonsensical. In addition, Boltzmann says that the H-theorem does not follow from mechanics. But I have already pointed out how Lanford showed that on the supposition that a choice gas system is dilute and that its constituents are approximated as hard shells (plus some further assumptions), the Boltzmann equation follows from the time-reversal invariant equations of motion in classical mechanics.^{Footnote 155} As Fields medal winner Cédric Villani stated at his 2010 Cambridge University lecture,

Probably the single most important theorem in the [kinetic] theory remains the Lanford theorem from 1973. Lanford rigorously derived the Boltzmann equation from Newtonian mechanics…[for an appropriate domain]…in…[the appropriate limit] you recover the Boltzmann equation…This was the first result showing that you could…get this Boltzmann equation out of the Newton equation[s].^{Footnote 156}

Proofs of the H-theorem itself have been articulated in such a way that they satisfy the standards of rigor in contemporary mathematics (q.v., n. 79).

We should not take the passage quoted above and cited in note 154 too seriously despite how often it is quoted. Boltzmann contradicts it numerous times in his Lectures on Gas Theory, and those lectures are the best source for Boltzmann’s mature thought on thermodynamics and statistical mechanics. Here are the reasons for this:

(a)
In 1900, the minister in Vienna described Boltzmann’s attempt to acquire recognition and leadership status among the community of physicists through the publication of his Lectures (both his lectures on mechanics and those on gas theory) as his "almost morbid ambition".^{Footnote 157}
(b)
Boltzmann believed that experimental physicists were at a disadvantage when compared to theoretical physicists because the latter could publish books rooted in their lectures and thereby present theories, quoting Jungnickel and McCormmach,

…from the perspective of their preferred methods. Boltzmann's published lectures on theoretical physics—covering his favorite parts of it, Maxwell's electromagnetic theory, gas theory, and analytical mechanics—were not syntheses of authoritative writings in the field but his version of theoretical physics.^{Footnote 158}
(c)
There's evidence that Boltzmann believed that the atomic theory was going to fall out of favor and become completely abandoned. One of his reasons for publishing the Lectures on Gas Theory was to produce a historical deposit of the best statement of an atomistic physics of thermodynamics and statistical mechanics (as they pertained to the physics of gases) that he could muster so that when atomic theory was (in his words) "again revived, not too much will have to be rediscovered”.^{Footnote 159}

Points (a)-(c) clearly justify a high view of the Lectures on Gas Theory understood as the best avenue to Boltzmann’s mature thought on statistical mechanics. Interestingly, Boltzmann cites (Boltzmann, On the Relation between the Second Law and Probability Calculus [25]) only once in either of its two volumes. It seemed to have been a theme—not only in Boltzmann’s own corpus but also in the work of his contemporaries—that the H-theorem and mechanical approach take precedence.^{Footnote 160} Consider:

(d)
Outside of the Lectures on Gas Theory and after 1877, there are only five papers/works in which Boltzmann uses the probability calculus, and among these five, only one of them applies the probability calculus to a real-world physical scenario. Among the remaining four papers, two are really just replies, and the last two are summaries of his earlier 1877 work.^{Footnote 161}
(e)
Boltzmann’s combinatorial work was almost entirely ignored by his contemporaries. The standard discussion of the work of both Maxwell and Boltzmann at the end of the nineteenth century was Rev. Henry William Watson’s (1827–1903) A Treatise on the Kinetic Theory of Gases.^{Footnote 162} That work never once cites Boltzmann's 1877 paper in which he presents the two combinatorial arguments. Burbury’s A Treatise on the Kinetic Theory of Gases (Cambridge University Press, 1899) does not discuss Boltzmann’s combinatorial approach. Bryan mentions it in a footnote in his contribution to the Nature debates. And the principal concern of [100] (once an encyclopedia article on statistical mechanics published near the beginning of the twentieth century) was the status of Boltzmann’s H-theorem.^{Footnote 163}
(f)
That Boltzmann’s contemporaries understood him to prefer the mechanical approach to thermodynamics and statistical mechanics can be seen in the synopsis of one of his famous students, viz., Paul Ehrenfest (1880–1933). He wrote,

"Mechanical representations, were the material from which Boltzmann preferred to fashion his creations…He obviously derived intense aesthetic pleasure from letting his imagination play over a confusion of interrelated motions, forces and reactions until the point was reached where they could actually be grasped. This can be recognized at many points in his lectures on mechanics, on the theory of gases, and especially on electromagnetism. In lectures and seminars Boltzmann was never satisfied with just a purely schematic or analytical characterization of a mechanical model. Its structure and its motion were always pursued to the last detail."

^{Footnote 164}

As I’ve already suggested, Boltzmann published numerous replies as part of a mid-1890s discussion of his work in the journal Nature. Discussants included George Hartley Bryan (1864–1928), Burbury, Culverwell, Joseph Larmor (1857–1942), and Watson.^{Footnote 165} That debate took place just after the publication of a new proof^{Footnote 166} of the H-theorem in (H. W. [236], pp. 33–49). Although no one questioned the correctness of Watson’s suitably amended (by Culverwell) proof,^{Footnote 167} many objections and searching questions were raised about how Boltzmann used the H-theorem in his theorizing about irreversible thermodynamic processes and mechanics. In the face of those objections and questions, Boltzmann never once abandons the theorem (even though he could have easily reverted to his 1877 combinatorial and probabilistic approach in which the H-theorem played no essential role).^{Footnote 168}

8 The Reversibility Paradox Answered

In Sects. 2 and 5.1, I showed that Clausius, Maxwell, and Boltzmann thought of collisions as instances of causation that drive entropic increase (i.e., collisions are that which produces the transition from non-equilibrium states to equilibrium states). This fact underwrites the sense in which their way of explaining the second law was mechanical. With respect to Boltzmann and the H-theorem, ensuring minus-H increase requires special types of collisions. Not just any will do. Only collisions with a unique type of built in asymmetry get the job done. I turn now to exploring the full nature of that asymmetry. My exploration will reveal another way in which statistical considerations enter the mechanical explanation of the second law. It will also reveal the solution to the reversibility paradox.

8.1 The Hypothesis of Molecular Chaos

When in 1895, Boltzmann said that the H-theorem only guarantees that it is highly likely that both (i) appropriate non-equilibrium gas systems increase in entropy over time and (ii) that suitable equilibrium gas systems stay in equilibrium, he said this in reply to the reversibility paradox as articulated, not by Loschmidt but by Culverwell. I rejected Boltzmann’s response in Sect. 7.1.1, because it makes both false and nonsensical claims. I showed, in the same section, that his remarks do not reflect the refined and mature views he communicated elsewhere. Am I preparing the way for a non-statistical statement of the second law? No. As in both the work of Maxwell and Boltzmann, my theory will admit probabilistic considerations in at least two places. First, (again) the Maxwell distribution is itself a statistical principle. Second, it was realized during the Nature debates in the mid-1890s that an important assumption—which I will call the hypothesis of molecular chaos (HMC)—about the nature of collisions was required in order for the H-theorem to be applicable to real-world systems.^{Footnote 169} With this virtually everyone (whether mathematicians, historians of physics, philosophers of physics, or physicists themselves) agrees. Disagreement arises over the precise form of the assumption.^{Footnote 170} I maintain that the assumption is directly related to my explanation of how and in what way some systems avoid H-theorem application (q.v., Sect. 5.2). Systems that have very special initial conditions are not guaranteed to be the kind to which the H-theorem is applicable.^{Footnote 171} All positions and velocities consistent with the conservation laws must be allowed early on. One way to help ensure that the system does not begin in some special state is to suppose (and Bryan [53], p. 29) made this explicit), that the molecular constituents of the system are statistically independent in that their motions are not correlated temporally prior to that which produces entropic increase (i.e., collisions). That is to say, HMC states that the pre-collision velocities of two colliding molecules in a gas system of the right kind are statistically independent, and that the post-collision velocities of those same molecules become correlated both after and because of the collision. This one-sided or asymmetric molecular chaos propagates for positive times in the sense that collisions that drive minus-H increase retain this correlation-creating ability throughout the system’s evolution toward equilibrium. When I say that the velocities after collisions are correlated, I shall at least mean that in order to retrieve the probability of the post-(binary) collision trajectory of one of the molecules in the collision, one should conditionalize on the post-(binary) collision trajectory of the other molecule, inter alia and vice versa.

Bryan was not the first to notice the HMC in Boltzmann’s project. Something close to it was recognized by Lorentz in his 1886 correspondence with Boltzmann about time-reversal invariance and the derivation of the Maxwell–Boltzmann distribution for polyatomic gas systems. He stated,

We may assume that in a natural gas the particles have completely irregular positions and phases, or at least that there is no definite relation between the positions that the particles have before time dt [in which a collision of a given kind occurs] and the number of collisions which they will experience [during this time]. In contrast, it is clear that the positions and phases of the particles are not completely irregular with respect to the past collisions, because they result precisely from the latter collisions. Now, if we revert all velocities as you wish to do, we get a state in which the positions and phases are prepared for the forthcoming collisions and therefore complete irregularity no longer holds.^{Footnote 172}

Here Lorentz articulates the idea that before collisions during dt, gas particles are statistically independent and therefore “irregular” with respect to their positions and phases. He likewise affirms that collisions cause those positions and phases to become in some sense regular.

Through some persuasive efforts, Boltzmann came to accept (at least for a substantial period of time) the HMC. He wrote, “[w]e shall therefore, [he concludes in 1896] now explicitly make the assumption that the” pre-collision motions are “molecularly disordered and” remain “so throughout all future time”.^{Footnote 173} Elsewhere Boltzmann criticizes Gustav Kirchhoff’s (1824–1887) derivation of the Maxwell distribution in [140]. The basis of his critical review is that Kirchhoff has not properly assumed the HMC. Boltzmann turns out to be wrong about this, but the fact that he uses the HMC as a criterion for determining the threshold of a good derivation of the Maxwell distribution suggests a high view of the HMC.^{Footnote 174}

The more technically inclined reader will desire a formal presentation of the HMC in the language of mathematics.^{Footnote 175} I will not provide one because there isn’t one. Brilliant mathematicians have given this issue much thought and have concluded that “the physical derivation of the Boltzmann equation is based on the propagation of one-sided chaos, but no one knows how this property should be expressed mathematically…”^{Footnote 176} Call this the (No Mathematics Problem (NMP)). This may strike one as a troubling situation. But matters are worse. The second law is commonly used as part of a solution to the problem of the arrow of time which asks: Why do we perceptually behold irreversible processes when the laws of mechanics governing the micro-constituents of the systems in those processes are time-reversal invariant? If one’s answer in any way relies upon the H-theorem, then one’s answer will invite yet another problem of asymmetry: Why do the binary collisions that produce minus-H increase produce correlations only after those collisions obtain? As Brown et. al. stated, “… there is no reason given as to why the…[HMC]…holds for pre-collision velocities rather than post-collision ones”.^{Footnote 177} Call this the (Chaos Asymmetry Problem (CAP)).

8.2 The Solution at Long Last

My proposed resolution of the reversibility paradox will also serve as the solution to the NMP, and the CAP. The first step of the solution is to understand the HMC as an interpretive postulate about the nature of that which drives minus-H increase, viz., collisions. That the collisions are responsible for entropic increase in Boltzmann’s H-theorem-laden kinetic theory is acknowledged by virtually all scholars.^{Footnote 178} The standard story in kinetic theory is that collisions between molecules in non-equilibrium closed systems drive those systems into equilibrium, and that “equilibrium” writes Thomas Kuhn, “is, by definition, the state in which the distribution is unaffected by collisions”.^{Footnote 179} But now we must ask, if collisions causally produce minus-H increase, then why do collisions among molecules of gas systems in equilibrium fail to increase minus-H even further? Of course, once the system reaches equilibrium it is characterized by the Maxwell distribution, in which case, the functional H vanishes thereby reaching its minimum value (the H-theorem used to be called the minimum theorem). It’s just a mathematical fact that H cannot decrease, and that minus-H cannot increase. But mathematical facts can have metaphysical explanations. That is to say, there exists a reason why once H vanishes, entropy fails to increase, and that reason consists in the fact of energy dissipation. Recall Stephen G. Brush’s point (quoted previously) that “the H-theorem is a microscopic version of the general principle of [the] dissipation of energy proposed by Kelvin in 1852, and reformulated by Clausius in 1865…’”^{Footnote 180} The way energy has transformed and dissipated—remember entropy tracks this energy transformation—has left the system in equilibrium no longer allowing it to further transform.^{Footnote 181} The capability of the system to perform work becomes attenuated. Contemporary thermodynamicists such as Sanford Klein and Gregory Nellis interpret the second law “as a system for assigning quality to energy”. They continued,

Although energy is conserved, the quality of energy is always reduced during energy transformation processes. Lower quality energy is less useful to us in the sense that its capability for doing work has been diminished. The quality of energy is continuously degraded by all real processes; this observation can be expressed in lay terms as ‘running out of energy’.^{Footnote 182}

The energy transformative process is a causal one. That interpretation is plausible for at least two reasons. First, at the heart of the process in thermodynamic or statistical mechanical evolutions are causally efficacious collisions producing entropic increase. Second, kinetic energy is a causal quantity. Rankine said that “actual”, or what, in 1862, Thomson would identify as kinetic energy “is a measurable, transferable, and transformable affection of a substance, the presence of which causes the substance to tend to change its state in one or more respects…”.^{Footnote 183} Modern statements do not differ, as contemporary classical (non-relativistic) physics universally characterizes kinetic energy in terms of work. Changes in kinetic energy $dT$ (where T is not temperature but kinetic energy) are also specified by appeal to work done by net force, or $\mathbf{F}\cdot d\mathbf{r}$.^{Footnote 184} But forces in classical mechanics are causal.^{Footnote 185}

Some think we can forsake forces in a conceptually sophisticated enough classical (non-relativistic) mechanics if we appropriate Hamiltonian mechanics, an energy-based theory. Hamiltonian mechanics is an energy-based approach to the dynamics of classical (non-relativistic) systems because the laws of motion in Hamiltonian mechanics use the Hamiltonian or energy function (and here, I follow [218], pp. 528–531):

(15)
$$\mathcal{H}={\sum }_{i=1}^{n}\left({p}_{i}{\dot{q}}_{i}\right)-\mathcal{L}$$
given that [$i=1,\dots ,n$], and that the system is described by generalized momenta:

(16)
$${p}_{i}=\frac{\partial \mathcal{L}}{\partial {\dot{q}}_{i}}$$

and specified by generalized coordinates:

(17)
$$\mathbf{q}=({q}_{1},\dots ,{q}_{n})$$
Here the Lagrangian $\mathcal{L}$ is a function of $\mathbf{q},$ $\dot{\mathbf{q}}$ (specified below), and time. If the system is isolated, the generalized coordinates stand in a time-independent relationship to the Cartesian or rectangular coordinates tracking the system, and the potential energy of the system is velocity independent,

(18)
$$\mathcal{H}=T+U$$
and generalized momenta as well as generalized velocity can be written (respectively) as:

(19), (20)
$$\mathbf{p}=\left({p}_{1},\dots ,{p}_{n}\right), \dot{\mathbf{q}}={(\dot{q}}_{1},\dots ,{\dot{q}}_{n})$$

Both potential and kinetic energy are analyzed (at least in part) in terms of work (force times displacement). When the Hamiltonian equals kinetic and potential energy, force thereby enters Hamiltonian mechanics. When it is appropriate to specify the Hamiltonian in terms of the Lagrangian $\mathcal{L}$, force enters more indirectly.^{Footnote 186} The Lagrangian $\mathcal{L}$ is (in many appropriate circumstances) equal to $T-U$ (where U is here potential energy and not internal energy). So, the Lagrangian is (in appropriate circumstances) at least in part specified by appeal to the kinetic and potential energy of the system. But again, kinetic and potential energy, even in Hamiltonian mechanics, is, in part, standardly interpreted and analyzed in terms of work. But work is, in part, specified in terms of net force. Thus, forces are indispensable to any plausible interpretation of Hamiltonian mechanics, and therefore causation is as well since forces are causes (Hamilton would agree! Q.v., the end of n. 185).

If our interpretation of classical mechanics is causal, then it admits an asymmetry. Causation is formally asymmetric. How then do I meet the famous reversibility objection in the work of Thomson, Loschmidt, and Culverwell? Recall the gist of that objection. All minus-H increasing evolutions imply the possibility of minus-H decreasing reversed evolutions of an appropriate isolated gas system. This, thought Thomson, Loschmidt and Culverwell, is a consequence of the reversibility of the microdynamics ([89], p. 774). The reversibility of the microdynamics and therefore also the reversibility of the supervening macroscopic evolutions was thought to be a consequence of the reversal of the involved velocities. As Loschmidt wrote, “the entire course of events will be retraced if, at some instant, the velocities of all its parts are reversed”.^{Footnote 187} Or as Thomson put it, “[i]f, then, the motion of every particle of matter in the universe were precisely reversed at any instant, the course of nature would be simply reversed for ever after”.^{Footnote 188} Reversing velocities was deemed naturally possible because the underlying microdynamical equations of motion in Hamiltonian mechanics were correctly thought to be time-reversal invariant.

The idea that you get so much from simple velocity reversal of microconstituents of real-world classical systems is mistaken. Entropic increase as envisioned by the H-theorem-laden kinetic theory is not driven by an underlying microdynamical evolutionary process that is reversible. Am I denying that Hamilton’s equations of motion are time-reversal invariant? No. Recall that those equations are time-reversal invariant only if replacing t with minus-t (being careful to also flip the sign of all odd forms of t such as velocity) allows solutions to be taken to solutions (or as Thomson said, “any solution remains a solution” ([221], p. 441)). I am certainly not denying that. It’s a mathematical fact. However, the equations of motion, once fully interpreted and thereby rendered applicable to real-world classical systems, inform us about unfolding causal processes that possess an asymmetry even in the micro-processes. That asymmetry stems from the causation in collisions, the very engine of entropic increase and so also the source of the asymmetry in the HMC. This source is not directly represented by the mathematics expressing the microdynamical (Hamiltonian) laws and so it is no surprise that the HMC is not directly represented in that mathematics either. The HMC is not part of the formulation of mechanical principles that govern collisions (compare Uffink’s remarks in [226], p. 969). Rather, it is an understanding of how that formalism fits the real world (i.e., it is part of an interpretation of the mechanics).^{Footnote 189} But one might counter: The HMC is about collisions, and we have a classical mathematical collision theory.

8.2.1 Solving the Chaos Asymmetry and No Mathematics Problems

Go back to Maxwell’s “On the Dynamical Theory of Gases” [174]. There, Maxwell assumes that all collisions are elastic (total kinetic energy and momentum are conserved through collisions). This assumption is false for polyatomic molecules, and false for atomic collisions. The latter conjunct holds because in collisions between atoms some kinetic energy is converted into other forms of energy. But set these points aside. As in some modern accounts of classical collision theory, Maxwell accounted for constituent collisions between two arbitrary gas molecules by giving attention to their pre-collision velocities, their post-collision velocities, and those collision “parameters that are necessary to determine the final velocities of the molecules”.^{Footnote 190} As already revealed in preceding discussion (q.v., n. 39), the collision parameters are usually the azimuthal angle $\phi$, and the impact parameter $b$. With respect to binary collisions, the latter is nothing more than two modal entities, viz., the paths the two molecules would travel were they to fail to interact with one another (in the center-of-mass frame). The former is just an angle, viz., the angle that fixes the plane upon which sits the post-collision trajectories of both molecules. The type of interaction involved need not be restricted to a physical contact in a real-world collision because the types of entities interacting are not restricted to or always best approximated by point masses. The interaction may be complex involving various force-types. For example, it may include the exertion of non-contact forces made manifest in attractions or repulsions alongside or with contact forces. But even if the involved force impressions were purely contact forces, the impact parameter would not provide that which is sufficient for fully determining (in the sense of producing) the post-collision velocities. For elastic collisions of molecules of gases (not unlike ideal gases) with the same masses, one can through straightforward mathematical reasoning, determine (in the (epistemic) sense that you can infer or derive) the post-collision velocities of the two colliding molecules from knowledge of the laws of conservation, the impact parameter, the azimuthal angle, and the pre-collision velocities. The sense of determination here is epistemic because it would be obviously shortsighted to judge that because a mathematical fact about the post-collision velocities follows from mathematical facts about conservation, the pre-collision velocities, the azimuthal angle, and the impact parameter, that therefore nothing more in the world metaphysically determines the post-collision velocities when the phenomenon under study is a collision phenomenon involving impact force impression. There was a collision! There was an interaction between the two molecules! What has happened in Maxwell’s treatment (and as we shall see in Boltzmann’s treatment too) is that Maxwell has chosen to model around the interactions, or the intimate details of the impact-laden collisions.^{Footnote 191} What has happened is that Maxwell has utilized a conceptual strategy that Mark Wilson calls physics avoidance [243]. Maxwell sought to model the world using quantitative walk-arounds that enabled his models to escape severe mathematical difficulties.^{Footnote 192}

Boltzmann’s proof of the H-theorem treats collisions much the same way Maxwell modeled them, i.e., by using the conservation laws, plus the “initial value of the kinetic energies ${k}_{1}$ and ${k}_{2}$ of” two colliding molecules, “and by the value ${k{^{\prime}}}_{1}$ of the kinetic energy of the first molecule after the collision”.^{Footnote 193} Boltzmann’s combinatorial argument of 1877 practices a similar type of physics avoidance, except in that context it completely “neglects the contribution to the energy of the system that stems from interactions between the particles”.^{Footnote 194} Followers of Boltzmann who prefer the combinatorial method do not resist the neglect. In fact, the same Boltzmannians provide a means whereby we can empirically distinguish H-theorem-laden statistical mechanics from the more popular modern Boltzmannian approach found in places like [2, 115, 117, 162, 166]. For example, Goldstein et. al. call the entropy discussed in Boltzmann’s combinatorial approach, Boltzmann entropy (S_B). They provide sound justification for separating S_B from the entropy that is minus-H, stating that:

[i]f interaction cannot be ignored, then the H functional does not correspond to the Boltzmann entropy…[w]hen interaction can be ignored there is only kinetic energy, so the Boltzmann macro states based on the empirical distribution alone determine the energy and hence the H functional corresponds to the Boltzmann entropy.^{Footnote 195}

In modern classical mechanical approaches to Boltzmannian statistical mechanics that use an H-theorem and a Boltzmann collision operator Q, impact interactions are avoided or modeled around (as is implied in [230], p. 79). In that context too, collisions are all assumed to be binary, and the involved particles don’t really contact one another, for in that literature binary collisions are processes “in which two particles happen to come very close to each other, so that their respective trajectories are strongly deviated in a very short time”.^{Footnote 196} There is physics avoidance afoot here because modern theoreticians are engaging in modeling walk-arounds.

Ignoring interactive contact collisions between point-like objects is as old as Newtonian mechanics. Newton’s second law says that “[a] change in motion [not a rate] is proportional to the motive force impressed [where the proportionality constant is inertial mass] and takes place along the straight line in which that force is impressed” ([188], p. 416).^{Footnote 197} The masses of the objects to which the second law was intended to apply never equal zero. Even point masses have mass. However, when two point-like objects or point masses collide, their accelerations are obliterated, and as a result the second law fails as there is a force impressed but no resulting acceleration (and not because of a balance rendering the force vector equal to the zero vector). Newton was aware of this problem and saw no application of the second law of motion to contact interactions. Here is Wilson on Newton’s approach to the problem. Note the similarities to the methods of Maxwell and Boltzmann,

Whenever these radii contact one another (we shall only worry about the head on collision case), Newton abandons the requirement that the ‘a’ in ‘F = ma’ must make sense and shifts his focus to the two balls’ incoming stores of linear momentum and kinetic energy (as we now dub them), together with a purely empirical factor called a coefficient of restitution (it governs how much the total kinetic energy budget will diminish post-collision). In effect, this treatment blocks out the crucial interval of time $\Delta t$ where ‘F = ma’ fails to make sense and glues together the incoming and outgoing events exterior to $\Delta t$ through a mixture of conservation principles (conservation of linear momentum) and raw empirics (coefficients of restitution extracted from experiment). Formally, tactics that patch over problematic intervals or regions in this manner are frequently called matched asymptotics.^{Footnote 198}

The problem is not unique to Newton’s formulation of classical mechanics. It reappears in Hamiltonian mechanics. Mathematician Robert Devaney stated,

…specific Hamiltonian systems which arise in applications often suffer singularities as well. By a singularity we mean a point where the differential equation itself is undefined. A typical example of a singularity is a collision between two or more of the point masses in the Newtonian n-body problem. At collision, the differential equation breaks down: the velocities of the particles involved become undefined. A singularity or collision can create havoc among nearby solution curves. Solutions which pass near a singularity may behave in an erratic or unstable manner, and solutions which start out close to one another can end up far apart after passing by a singularity.^{Footnote 199}

That you should care about more than mere positions and post-collision velocities in such cases, and that you should give attention to the interactions during the relevant $\Delta t$ (sometimes this time interval is referred to with the symbol $\Delta {t}^{*}$) was expressed very clearly by Gottfried Wilhelm Leibniz (1646–1716). The fact that you could recover so much while ignoring the details of the evolution during $\Delta t$ was, for Leibniz, “a convenient trick”.^{Footnote 200} Leibniz thought that in order to get to the deep joints of nature, you need to pick up on what’s transpiring during the relevant $\Delta t$. Like Leibniz, I maintain that what one will find there (i.e., in the relevant $\Delta t$) is efficient causation or causal interaction ([165], pp. 139–142).^{Footnote 201}That causation (call it fundamental causation or causation_F) drives the engine of entropic or minus-H increase. It results in correlations (that’s why you can use correlations to find causal interactions), correlations that are one-sided precisely because causation_F is asymmetric. That is to say, obtaining causal_F relations in entropy producing collisions explain the HMC. The Chaos Asymmetry Problem (CAP) has been resolved. The propagating one-sided chaos referenced by the HMC is one-sided because the velocity correlations are the effects of temporarily prior causes in temporally directed obtaining causal_F relations. It is no surprise then that the Boltzmann equation breaks T-symmetry. It does this because the collisions it is about involve obtaining causal_F relations that are temporally asymmetric.

The introduction of causation_F into entropy increasing collisions during the relevant $\Delta ts$ resolves the No Mathematics Problem (NMP) as well. There’s no mathematical representation of the HMC because its source is unrepresented by the formalism and because the correlations HMC references are set down during $\Delta t$. Our best modeling of the collision process walks around those times since its chief concern is recovering post-collision velocities. We can nonetheless point to that best modeling as evidence of the existence of causation_F in the $\Delta ts$ because that modeling, while one step removed from the phenomena, nonetheless recognizes that forces and resulting accelerations obtain so as to get the velocity changes. Applying the time-reversal invariance operation will not change the directions of the forces (the causal structure) nor the directions of the resulting accelerations (though the displacement is reversed). Forces and accelerations are even forms of t.

8.2.2 Solving the Reversibility Paradox

To see the resolution of the reversibility paradox, recognize first that the time-reversal invariance operation in Hamiltonian mechanics is one that is applied to Hamilton's equations of motion (and appropriate deductive consequences thereof). Odd forms of t receive sign changes and solutions are still mapped to solutions. Execution of that operation together with the execution of an appropriate time-translation (so as to help us appreciate a temporally reversed evolution) will not entail that there exists an evolution that satisfies a temporally reversed HMC. This is because the HMC is not a part of the formulation of Hamiltonian mechanics, nor is it a deductive consequence of the equations of motion in Hamiltonian mechanics. Again, HMC is an interpretive hypothesis.

Suppose there's an elastic impact collision C between two molecules (1 and 2) of a monatomic gas system. Molecules 1 & 2 have velocities v₁ and v₂ (respectively) before C. After C, they take velocities u₁ and u₂ (respectively). v₁ does not equal u₁, and v₂ does not equal u₂, and I will assume that molecule 1′s mass is larger than molecule 2′s inertial mass, but not significantly larger. Velocities v₁ and v₂ produce C. C produces post-collision velocities u₁ and u₂. The fact that the accelerations and force impressions in C are not reversed under time-reversal suggests that in the time-reversed evolution, the velocity transitions run from $-{\mathbf{u}}_{1}$ and $-{\mathbf{u}}_{2}$ to $-{\mathbf{v}}_{1}$ and $-{\mathbf{v}}_{2}$. So, in the reversed evolution, you won’t approach the Maxwell distribution precisely because the velocity transitions/changes go in the wrong direction. They go in the wrong direction because of the fundamental causal structure of the evolution. The (one-step-removed) evidence for this resides in the fact that in the reversed evolution, the forces are still pushing in the same directions as the actual world evolution, and the accelerations keep their actual world directions as well. Because the HMC is an interpretive postulate, the time-reversal operation alone will not change its one-sidedness either. Whatever is done with the HMC in the reversed evolution is done by hand. The causal structure of the world must be changed to realize reversed evolutions.^{Footnote 202}

What of the classical possible world w at which monatomic gas systems of the right kind evolve in perfect accord with the models of Maxwell [174] and Boltzmann [21, 23]? At w, will there fail to be monotonic increase of $-H$, if such gas systems begin their evolutions in low entropy states? At w, all binary “collisions” never introduce problems of mathematical singularities because the constituent molecules never meet. My project seeks to causally interpret only those collisions quantified over by the HMC. My central thesis, CC in Sect. 1, made this clear. What I’m recommending is that we understand the HMC as an interpretive postulate about the types of collisions that transpire during the crucial $\Delta ts$. As I’ve argued, both Maxwell and Boltzmann did not include specific reference to such impact collisions in their mathematical models because (on my interpretation) they were employing the conceptual strategy of physics avoidance (hence the NMP). They were able to discover the various distribution laws because the model walk-arounds “do the trick” (as Leibniz would say) of recovering the velocities of gas molecules after the collisions that are walked around.^{Footnote 203} They encounter a reversibility paradox precisely because the surface meaning of their modeling describes systems like those in w, systems whose dynamical evolutions are such that their time-reversal yields a past-directed evolution. In w, $-H$ does not monotonically increase in accordance with the H-theorem. How could it? The evolutions there are completely time-reversable. Nonetheless, there is no violation of the H-theorem there because the HMC (a precondition of the theorem) fails to hold at w. The collisions the HMC quantifies over do not transpire there and so neither does entropic increase of the kind required by the H-theorem. But as soon as we shift to the real world, where monatomic gases like helium (He), argon (Ar), xenon (Xe) and others, evolve in ways featuring real world impact collisions avoided by the Maxwell–Boltzmann modeling but targeted from afar by that modeling nonetheless (and so my project remains true to the spirit of Boltzmann’s work), the HMC becomes part of a true and correct (in the appropriate limit) interpretation of Hamiltonian mechanics being made true by the causal structure of the actual world. It is the contingent causal way the world is that determines the entropic asymmetry described by the H-theorem. It is a consequence of my framework that the proposed interpretation of Hamiltonian mechanics makes a detectable empirical difference. It is to that empirical difference that I now turn.

Is the HMC empirically justified? Yes. It is indirectly justified by all the fruit or empirical success produced by the H-theorem and Boltzmann equation in modern kinetic theory. For example, it should be obvious by now that the H-theorem predicts that if a classical monatomic gas system SYS satisfies certain conditions, then SYS will evolve to thermal equilibrium over a sufficiently long period of time. That is in fact what we observe. More generally, the H-theorem predicts the truth of the second law of thermodynamics for systems that satisfy the antecedent of the theorem. Consequently, in a restricted sense, the theorem “demonstrates the second law of thermodynamics”.^{Footnote 204} Nature’s obedience to the second law is what we observe. In addition, I have already indicated how the Boltzmann equation is used to great benefit in the study of neutron transport, plasma physics, and the kinetic theory of gases (q.v., Sect. 4; and see [63, 85, 239]). What is more, the H-theorem and Boltzmann equation bear much fruit in hydrodynamics as well [217]. The empirical successes of the Maxwell and Maxwell–Boltzmann distributions discussed in the sources at note 43 are also relevant indirect justifications of the HMC. Why believe the above constitutes indirect evidence for the HMC? The HMC “is a fundamental requirement for the application of the Boltzmann kinetic theory, the Boltzmann transport equation, and the presence of Maxwell–Boltzmann statistics”.^{Footnote 205}

There is a recent and more direct justification as well. I call this other justification “more direct” and not “direct” tout court because we are not currently able to directly observe the correlated velocities of gas molecules. However, there are a class of granular media that are low-density media which approximate gas systems (they are called “granular gases” in light of this). G.W. Baxter and J.S. Olafsen gave attention to such systems in 2007. They discovered that these low-density granular systems exhibit molecular chaos, but that once the systems become sufficiently dense (i.e., once there are sufficient enough interactions (this is my gloss)), the velocities of the constituents of the relevant systems become correlated.^{Footnote 206}

9 Conclusion

I have shown that the Standard Story is historically inaccurate. Once Boltzmann discovered the H-theorem it remained front and center in his mind. He always believed that some systems did not experience minus-H increase and he was in possession of reasons for delimiting the second law to a statistical claim well before the publication of Loschmidt’s reversibility objection in 1876. But even after wrestling with that objection, Boltzmann always remained pragmatically committed to the project of mechanically justifying the second law. It is therefore in a truly Boltzmannian spirit that I have tried to resolve the reversibility paradox in a way that remains true to mechanical natural philosophy.

There remains at least one puzzle to solve. How ought the probabilities in the proposed causal Boltzmannian approach to be interpreted? I’ve shown that both Maxwell and Boltzmann favored (at least at one time) epistemic interpretations of the involved probabilities, and I believe that is the best option in this context. Of course, a lot more needs to be said about these epistemic probabilities, but I hope to articulate my opinions about the matter in a part two essay that uses the framework of this project to tackle the famous recurrence objections.

Notes

([78], p. 400).
([155], pp. 70–71).
([38], p. 125).
(For which see the standard presentation in [157]; and cf. the discussions in ([106, 190], pp. 55–88, but especially pp. 84–86), ([222], pp. 155–218, 246–282) and ([226], pp. 992–1005).
See ([134], pp. 4–38, 210–336, [156]; and while [224], pp. 59–70) does seem to be a proponent of an epistemic approach, entropy does not appear to be epistemically or information theoretically interpreted at ibid., 561.
See ([1, 2, 57, 115, 117]); ([114, 116 , 162, 166, 191], pp. 11–79).
See the results and literature discussed and cited in [204], especially Sect. 2. See also the interesting work being done at [205] (SISSA).
See ([197, 198]). Also see the discussions of this approach in ([9, 14, 41, 98, 138]).
See ([64], pp. 261–263), ([65, 66]). Also see the discussions of this approach in ([226], pp. 1034–1038) and ([229], pp. 118–138).
Suppose the real world statistical mechanical system of n-material points (SYS) of interest is in microstate x at an initial time t₀. Call this microstate of SYS, x₀. In both Gibbsian and Boltzmannian SM, x₀ is represented by a point on a 6 N-dimensional phase space $\Gamma$over which is defined the standard Lebesgue measure μ. N is the number of molecular or particle-constituents. The point itself represents the positions and momenta of the micro-constituents of the system. $\Gamma$ will have coarse-grained regions with volumes in the phase space. One could also understand these regions as subsets of $\Gamma$ over which one can define a σ-algebra (as in the explication in [106]. The evolution of SYS from x₀ to some other microstate at a later time, is given by an evolution function $\phi$_t, a measure-preserving flow or phase space orbit that is determined by solutions to the equations of motion (Hamilton’s equations). The later microstate of SYS as fixed by the evolution function is represented by ϕ_t(x₁), and the evolution from x₀ to that subsequent microstate of SYS is itself represented by a curve on $\Gamma$. If we were to imagine the microstate of SYS that is x traveling on the curve from x₀ to x₁, the volumes of the coarse-grained regions would remain the same, since the two approaches in view assume Liouville’s theorem. For details on Liouville’s theorem see ([218], pp. 543–546). Let an ensemble be a hypothetical infinite collection of non-interacting systems with the same structure as SYS, although every member of the ensemble represents a different physical state of SYS. The actual state of SYS is still given by a point x on $\Gamma$, but the ensemble itself can be represented as a cloud of phase space points. A phase space orbit of the cloud from one coarse-grained region to another represents the evolution of the ensemble over time.
That x_t is located in a particular region R of $\Gamma$ at a time t has a probability p_t(R), (Eq. 1.n10): ${p}_{t}\left(R\right)=$ ∫_R $\rho \left(x,t\right)\, {\textit{dx}}$ where $\rho$ is the ensemble, although it can be understood as supplying a probability, viz., the probability of the microstate of the system residing in a particular region of the phase space (or ${p}_{t}\left(R\right)$). We can now use the probability density $\rho \left(x,t\right)$ to define the Gibbs entropy ${S}_{G}(\uprho )$ as follows, (Eq. 2.n10): ${S}_{G}(\uprho )= \int \rho \mathrm{ln}\rho \,dx$ Given that SYS is in thermal equilibrium and that it is exemplifying a macroscopic physical quantity P, we can connect P with f, the latter having a phase average $\langle f\rangle$. (Eq. 3.n10): $\langle f\rangle =$ ∫_x $f\left(x\right)\rho \left(x,t\right)\, {\textit{dx}}$. We can now say that the value of variable f connected with quantity P exemplified by SYS in thermal equilibrium will be its phase average $\langle f\rangle$. The variable f here is then a macroscopic variable. My explication leans on the sources cited in note 4, and I especially lean on ([106], pp. 426–429), cf. ([115], pp. 64–65); and ([224], cf. [218]).
E.g., see the remarks in ([1], p. 76 n. 5 who emphatically references Gibbs), ([113], pp. 39–40) as well as those in [115], Sect. 2 where the authors attempt to connect what is sometimes called Boltzmann’s combinatorial entropy formula (i.e., ${S}_{B}(X)=k\, \mathrm{log}\,vol\Gamma \left(X\right)$ or the Boltzmann entropy of macrostate X of a physical system SYS equals the Boltzmann constant multiplied by the logarithm of the volume of the phase space region representative of X) with what they identify as Clausius entropy. However, their discussion of Clausius entropy leaves much to be desired. They quote Rudolf Clausius’s (1822–1888) statement of the second law (q.v., my n. 1 above) and then discuss textbook presentations of thermodynamic entropy. There is an attempt to show in (ibid.) that the Boltzmannian approach defended there has a distinguished pedigree because its ideas go back to Boltzmann and some of Boltzmann’s ideas about entropy align with or at least can capture some facets of Clausius’s work on thermodynamic entropy.
The aforementioned equation for the Boltzmann entropy was first proffered by Planck, not Boltzmann ([153], p. 61). Naturally enough, it was also Planck who first introduced k (“Boltzmann’s constant”) into physics.
See (Boltzmann, On the Mechanical Significance [Meaning] of the Second Law of Heat Theory [15]).
See (Boltzmann, Comment on Some Problems in Mechanical Heat Theory [24]) wherein Boltzmann responds to Loschmidt.
See (Boltzmann, Further Studies on the Thermal Equilibrium of Gas Molecules [21]).
See ([169], p. 139). It was Boltzmann who in (Boltzmann, Comment on Some Problems in Mechanical Heat Theory [24]), adjusted Loschmidt’s reasoning, turning it into what is now commonly called Loschmidt’s paradox (and here I’m agreeing with [88], 195).
(Boltzmann, On the Relation between the Second Law and Probability Calculus [25]).
Here is Klein’s more complete depiction (as found elsewhere) of what I am calling the Standard Story:

"It was Boltzmann who showed how irreversible behavior could be explained and who obtained an expression for the entropy in terms of the molecular distribution function. Under the pressure of Josef Loschmidt’s criticism of his H-theorem of 1872, Boltzmann constructed a fully statistical explanation of the second law, in which irreversibilty [sic.] was to be understood as the normal evolution of a system into the most probable state, that is, the most probable molecular distribution allowed by its circumstances. Boltzmann reached this fully statistical interpretation of the second law of thermodynamics in 1877. He evidently believed that the problem was settled, that he had explained the essential features of the second law, and he turned his attention to other matters. His later discussions of this problem, in the 90′s, were undertaken only in response to new criticisms, and always consisted of elaborations and more careful restatements of his statistical point of view”. (M.J. Klein, Mechanical [145], 63 emphasis mine).

Klein would add that Boltzmann liked Hermann von Helmholtz’s (1821–1894) attempt to provide a mechanical analogy for thermodynamics in the 1880s. Boltzmann explored the analogy himself. Klein goes so far as to suggest that Boltzmann accepted Helmholtz’s analogy suitably amended (ibid., 70). The shift to analogical considerations in Boltzmann’s thought is not typically part of the Standard Story in the work of contemporary Boltzmannians (e.g., [1, 2], cf. the not so contemporary [100]). See also ([206], pp. 32–44), although Sklar seems to maintain that Boltzmann’s combinatorial view was an attempted probabilistic interpretation of the H-theorem (ibid., 41). That reading is suspect because the functional H (or -H in the H-theorem “does not [always] correspond to the Boltzmann entropy” in Boltzmann’s combinatorial work ([116], p. 28). Sklar does express doubts about acquiring a definitive interpretation of the original literature at ([206], p. 37). Other proponents of at least key parts of the Standard Story include Tim Maudlin, as his view on this matter was presented at the 2019 Foundations of Physics Workshop: A Celebration of David Albert's Birthday at Columbia University under the title “S = k ln (B(W)): Boltzmann Entropy, the Second Law and the Architecture of Hell”. Brown et al. ([44], pp. 185, 187) and Uffink ([226], p. 967) affirm that part of the Standard Story which emphasizes an abandonment of the H-theorem (understood as an exceptionless and deterministic understanding of thermodynamics) in the face of Loschmidt’s reversibility objection. They affirm that Boltzmann replaced the H-theorem and its mechanical approach to justifying the second law with a probabilistic or statistical outlook. This can also be seen in the work of Brown and Myrvold in [43]. They remarked,

“…in his 1895 reply to Culverwell et al., Boltzmann is reiterating the probabilistic position he adopted in his first 1877 paper in response to Loschmidt’s objection to the original form of the H-theorem…from 1877…[a specific] process of equilibration becomes for Boltzmann merely probable…The change in thinking is particularly evident in the treatment of the homogeneity of the gas. For Boltzmann, in 1872, once this condition is achieved it is permanent. But in 1877, he flatly denies such permanence for arbitrary initial states. The understanding of irreversibility has taken on a new form, despite some very misleading remarks by Boltzmann to the contrary. The significance of this shift of reasoning…cannot be overstressed…” (ibid., pp. 26–27 emphasis in the original; these authors include a Sect. (8.2) entitled, “Post-H-theorem Boltzmann: Probability reigns” (ibid., 29)).

Dürr and Teufel stated that Loschmidt’s,

"reversibility objection…led Boltzmann to recognize that his famous H-theorem…which in its first publication claimed irreversible behavior for all initial conditions, was only true for typical initial conditions. Because, as Boltzmann immediately responded, [they have in mind (Boltzmann, On the Relation between the Second Law and Probability Calculus [25])] these bad initial conditions are really very special, more atypical than necessary." ([96], p. 87).

Ben-Menahem and Hemmo have written,

"…Boltzmann’s H-theorem turned out to be inconsistent with the fundamental time-symmetric principles of mechanics. This was the thrust of the reversibility objection raised by Loschmidt…It is at this juncture that probability came to play an essential role in physics. In the face of the reversibility objections, Boltzmann concluded that his H-theorem must be interpreted probabilistically." ([11], pp. 5–6).

We can add to this list a Nobel Laureate, ([203], pp. 244–245). All these thinkers seem to be under the heavy influence of [100]. Indeed, some of them note the influence (e.g., Ben-Menahem and Hemmo state that they “essentially follow the Ehrenfest and Ehrenfest reconstruction of Boltzmann’s ideas in a very schematic way” ([11], 5. n. 6). On this influence of the Ehrenfests, see ([7], p. 354). My turn away from the Standard Story follows (with important differences and departures) ([7, 155], and [235]).
See ([103]).
See ([127], pp. 173–174).
See ([42]).
There are important contemporary studies of mechanism and mechanistic explanation in the work of [111] and within the contributions to [112]. Some of the views expressed in the aforementioned sources can be used to help further develop the sense in which my Boltzmannian approach provides a mechanistic explanation of entropic increase.
See ([182] vol. 1, pp. 438–479) and [51]. The Adams Prize was named after John Couch Adams (1819–1892) who predicted the existence of Neptune in 1845. Of course, Maxwell won the 1857 Adams Prize, but it appears that his essay was the only one submitted for it. See Harman’s note (2) in ([182] vol. 1, 438–439). See also ([170, 181], p. 2).
For an introduction to the ten tenets of modern kinetic theory and some thoughts about how Maxwell contributed to that modern understanding, see ([129], pp. 311–315).
See ([171] and [172]). These two papers are misread by prominent contemporary philosophers of physics. For example, Frigg and Werndl ([105], p. 123) state that Maxwell assumes in his 1860 work that the constituents of gas systems do not interact. They state Maxwell shows how in equilibrium, the relevant gas types are described by the Maxwell–Boltzmann distribution (q.v., Sect. 3. I don’t know how Maxwell could have shown this. The Maxwell–Boltzmann distribution isn’t introduced until 1868. Boltzmann doesn’t propose a distribution law until then. Second, the gas constituents do interact through contacts or impacts in collisions. Even hard spheres can exert impact forces upon one another although they do not attract or exert repulsive forces upon one another. Furthermore, the subsystems would interact even if Maxwell restricted his attention to ideal gases. I make this last point because many seem to believe that ideal gas molecules don’t interact at all (ibid., 127, [104], p. 119). This is false. See footnote 191 below.
See ([174]).
See also ([208], p. 245, 346 n. 22).
See ([180]).
See ([49], p. 22).
He said, “my particles have not all the same velocity, but the velocities are distributed according to the same formula as the errors are distributed in the theory of least squares”. ([180], p. 10); (Maxwell vol. 1 [182], p. 610); (Brush, vol. 1 [46], p. 233).
For the point regarding kinetic energy, see ([88], pp. 301–310).
With respect to collisions, see (Maxwell vol. 1 [182], pp. 380, 405). With respect to velocities, note the draft comments at (Maxwell vol. 1 [182], p. 135), where he says, “[t]he external cause which sustains the motion of agitation in the case of Saturn’s rings is the different velocities…”.
“When the objects are mechanical, or are considered in a mechanical point of view, the causes are still more strictly defined, and are called forces”. (Maxwell vol. 1 [182], p. 378 emphasis mine) Harman adds in note (6) of ibid., “Compare Whewell’s view that the idea of cause construed as force is the ‘fundamental idea’ of mechanics”, subsequently citing ([238], pp. 177–254, 437–494) inter alia. Whewell influenced Maxwell’s work as is evidenced by Maxwell’s “Cambridge 'kinematical' research” approach in (SPM1, pp. 155–229), quoting ([209], p. 305).
In Maxwell's ([175]) demonstration of the generalized Maxwell distribution (i.e., the Maxwell–Boltzmann distribution discussed in Sect. 3 below) Maxwell very clearly invokes forces understood as causal mechanisms that influence the motions of gas molecules and act on systems ([175], pp. 537–538).
([173]).
(Maxwell vol. 2: part 2 [184], p. 291) emphasis mine. After quoting this precise passage, Harman ([126], p.127) adds “[t]his defines the dynamical basis of his theory of gases”. There is, of course, a sense in which (as Maxwell says) we abandon something like mechanical or dynamical descriptions of physical evolutions when we revert to statistical methods [178], p. 339), but that is only because we invoke statistical methods due to our inability to “follow every motion by the calculus”. (ibid.) Following every motion by the calculus is what Maxwell calls “the strict dynamical method” (ibid.).
([73]). We can judge that Maxwell learned from Clausius in the way I’m suggesting on the basis of correspondence between Maxwell and Stokes dated May 30, 1859 (Maxwell vol.1 [182], pp. 606–611).
([73], p. 84).
([73], pp. 82–83).
([73], p. 84). Besides the ‘sphere of action’ and ‘effects’ talk, Clausius also uses terms like ‘influence’. The “molecular forces are of influence in sensibly altering the motion of the molecule” (ibid., 82). My points are not evaded by resorting to the original German publication of 1858.
([193], p. 126). Maxwell thought that the Maxwell distribution is stable under collisions given that the number of a particular set of collisions $d\nu$ equals the number of reciprocal collisions $d{\nu^{\prime}}$. Collisions have pre and post-collision velocities. If there’s a binary collision—Maxwell restricted his reasoning to binary collisions—with pre-collision velocities v₁ and v₂ and post collisions velocities u₁ and u₂, then its reciprocal is the binary collision with pre-collision velocities u₁ and u₂ and post-collision velocities v₁ and v₂. It is not a trivial matter whether there are such reciprocal collisions for any set of existing collisions (q.v., the discussion of Lorentz and Boltzmann in Sect. 5 below). Later on, Maxwell asserted that there are such reciprocal collisions if the colliding objects are perfectly elastic (or perhaps point-like) molecules acting through central forces ([175], p. 537).
Following Darrigol’s ([88], pp. 81–83) reading of Maxwell, restrict the mind’s attention to a gas system S whose point-like molecules influence each other through central forces that only engage in binary elastic collisions. Consider that for Maxwell, there are a number of binary collisions $d\nu$ belonging to a particular collision-type $\sigma$. Suppose that the two colliding molecules are M₁ and M₂ that had pre-collision velocities v₁ and v₂ (respectively) and that took on post-collision velocities u₁ and u₂ (respectively). For Maxwell, whether a collision is of the $\sigma$-type depends upon collision parameters that are the azimuthal angle and the impact parameter ([174], pp. 56–57). The latter consists of the two paths the colliding molecules would have traveled were they to fail to interact with one another (in the center-of-mass reference frame). The azimuthal is the angle that fixes the plane upon which sits the post-collision trajectories of both molecules. Let $f\left(\mathbf{v}\right){\mathrm{d}}^{3}\upsilon$ give the number of molecules per unit volume that enjoy velocities within the ${\mathrm{d}}^{3}\upsilon$ range about velocity $\mathbf{v}$. And let q represent a property of any molecule in S, e.g., kinetic energy or inertial mass. Collisions can and do change the total value of q within a specific velocity element ${\mathrm{d}}^{3}{\upsilon }_{1}$. That change wrought by collisions is encoded by the equation ([88], p. 82):

$ \left( {{\text{Eq}}.{\text{ 1}}.{\text{n38}}} \right):\delta \left[ {q_{1} f\left( {{\mathbf{v}}_{1} } \right){\text{d}}^{3} \upsilon _{1} } \right] = \int_{{{\mathbf{v2}}\sigma }} {(q1^{\prime} - q_{1}) ~dv} $

If we were to suppose that molecule M₁ (or any molecule for that matter) enjoys a velocity within ${\mathrm{d}}^{3}{\upsilon }_{1}$, and that M₁ collides with M₂ (a distinct molecule that enjoys pre-collision velocity v₂), that collision will produce a variation or transmutation of q represented by the difference $\left({q}_{1}^{\prime}-{q}_{1}\right)$ that depends on the collision-type $\sigma$ (which is fixed by the collision parameters) and M₂′s pre-collision velocity v₂ (ibid., 81–83).
That is to say, the function is non-directional.
(Brush, vol. 1 [46], p. 233); ([174], p. 64). Hendrik A. Lorentz (1853–1928) derived the Maxwell distribution function for monatomic gases and showed its stability under collisions in [167].
See the discussions in ([48], p. 62).
See [101, 213, 214], cf. [215]. See the helpful commentary in [4], (Toennies et al. [223]).
([174], 58ff., [175]).
See the comments in (Brush, vol. 2 [47]); (Brush, vol. 1 [47]); ([88], p. 88), ([102], cf. [226], pp. 948–952).
([174], p. 54).
This assumption did not seem to be essential. Maxwell at times allows for a myriad of possible theories of the underlying microconstituents ([126], pp. 126–127), ([174], pp. 54–55), ([208], p. 246) but the actual reasoning does seem to employ (2).
Maxwell claims to have experimentally justified his characterization of the central repulsive forces involved in this assumption ([174], p. 51).
See ([108], pp. 7). This is an assumption of his 1860 work at least. It is still relevant to an assessment of Maxwell’s more mature work in 1867. Why? Because in his 1867 paper, Maxwell proves (4), and (4) references (3).
See on these two assumptions (Brush, vol. 1 [46], p. 186).
[174], p. 62 “this assumption may appear precarious”, ([108], p. 8).
The connection between Herschel and Maxwell has been established by [102]. See also (Brush, vol. 1 [46], pp. 183–189).
Herschel converted John Stuart Mill (1806–1873) to his epistemic view, causing Mill to forsake his objections to Laplace’s Bayesian interpretation of probability ([207, 245], p. 32. n. 18).
([5], pp. 629–630); and see the quotation of Maxwell in ([135], p. 1).
Polyatomic molecules are molecules with more than two atoms that enjoy internal degrees of freedom. They are sometimes described by internal variables that give one their vibrational, rotational, and electronic states ([154], p. 133). Polyatomic molecules therefore have states that are not exhausted by their translational velocities.
(Boltzmann, Studies on the Equilibrium of Live Force Between Moving Material Points 1868); (Boltzmann, On the Thermal Equilibrium Between Polyatomic Gas Molecules 1871); cf. (Boltzmann, Further Studies on Thermal Equilibrium among Gas Molecules 1872). See also the comments in the secondary literature at ([136], p. 61). According to Darrigol, Boltzmann also realized that one of his generalizations of Maxwell’s distribution yields a scientific “approach” that “can be applied to any system of point-atoms whereas Maxwell’s original reasoning applies to gases only”. ([88], p. 8).
(Brush, vol. 1 [46], p. 234; [203], p. 279). The factor ${e}^{-h(\frac{1}{2}m{v}^{2}+V[x])}$ is called the Boltzmann factor. The Maxwell–Boltzmann distribution in more modern discussions is explicitly dubbed a probability density function (PDF) and more commonly expressed as follows (for ideal gases),
(Eq. 1. n. 56):
$$f\left( v \right) = \left( {\frac{m}{{2\pi KT}}} \right)^{{3/2}} 4\pi v^{2} e^{{\left[ { - \frac{{m\left( {V_{x}^{2} + V_{y}^{2} + V_{z}^{2} } \right)}}{{2KT}}} \right]}}$$
See ([161], p. 291).
There are attempts to derive or justify (5) not only in the work of Boltzmann and Maxwell, but also in the work of George Bryan (1864–1928) (who tried to do without certain of Maxwell’s assumptions about collision numbers), Kirchhoff (whose argument is similar to Bryan’s), Lorentz (whose result is limited), and Max Planck (1858–1947) (whose argument rested on considerations having to do with time-reversal invariance). See ([52, 140], pp. 142–148) (see also [141, 167 192], cf. the discussion in [88], pp. 23–24, 323–327; 358–365) who summarizes Boltzmann’s responses to this literature.
In 1894, Boltzmann provided a new derivation of the Maxwell–Boltzmann distribution that did not rely upon any special reasoning or assumptions about collision numbers and that could be extended to polyatomic gas systems (Boltzmann, Application 1894). See the discussion at ([88], pp. 354–355).
Early on (in 1867), Maxwell would say about other types of matter such as polyatomic molecules, that,

"A law of the same general character is probably to be found connecting the temperature of liquid and solid bodies with the energy possessed by their molecules, although our ignorance of the nature of the connexions between the molecules renders it difficult to enunciate the precise form of the law." ([174], p. 54).
See ([119]) and the comments in ([49], pp. 23–24).
“Es ist somit noch nicht bewiesen, daß, wie immer der Zustand des Gases zu Anfang gewesen sein mag, er sich immer dieser von Maxwell gefundenen Grenze nähern muß”. BWA1, 319–320. (Boltzmann, Further Studies on the Thermal Equilibrium of Gas Molecules [21]; cf. [39], p. 266).
Unless I’ve used the translations of others, all translations from the German into English in this work were assisted by the following resources: ([97]); [216], and ([219]), plus some software or program assistance by Google Translate and Microsoft Word German language and spell checker software programs (q.v., the acknowledgments).
On the Boltzmann equation, see ([63, 154, 230]).
([201], p. 52). In (Boltzmann, Further Studies on the Thermal Equilibrium of Gas Molecules [21]), Boltzmann expressed the equation in terms of integrals that give one how the distribution function (understood as an energy and time-dependent function) changes with time. There are many other versions of this equation in Boltzmann’s work. Other forms of expression involve appropriate modifications for various cases in which an external force acts (such as Newtonian gravity) on the evolving system. See (Boltzmann, On the Thermal Equilibrium of Gases on Which External Forces Act [23]).
See ([143], p. 101).
Ibid.
([89], p. 773).
([203], p. 243).
Both Maxwell and Boltzmann had argued in favor of this point prior to 1872 (Brush, vol. 1 [46], p. 237).
([62, 63, 67]). See also ([132, 133, 186, 189]) for important results on the Boltzmann equation.
With very few mistakes (corrected later by Boltzmann), Maxwell [174] had already figured out how to try to compute these coefficients without the Boltzmann equation. His efforts used conservation laws which Darrigol says are “implicitly equivalent to the Boltzmann equation” ([88] p. 12).
See the proof in ([228], pp. 141, 167–168).
See ([158,159,160]; and q.v., Appendix 2. See [212, p. 64 and theorem 4.5) for a rigorous statement of the theorem. In some of the relevant literature on Lanford’s project, what’s shown is that in the Boltzmann-Grad limit and for rarefied gas systems whose molecules are approximated by hard spheres, given smallness of time, that a particular chaos property is exemplified by the choice systems at t₀ (and as a consequence temporally propagates for future times), and some other assumptions, one can move from the BBGKY formulation or hierarchy (of equations) to the Boltzmann equation, itself formulated in terms of a hierarchy (the Boltzmann hierarchy). There are proofs which forsake the smallness of time assumption and replace it with a smallness of norm (or smallness of initial data) assumption. See ([66], pp. 63–84), ([2012], pp. 48–76) and the literature cited therein.
Lanford ([160], p. 75) distinguishes his result from Boltzmann’s H-theorem. I’m interested in defending the latter which uses a different chaos property than that which is assumed in work on Lanford’s theorem. Both chaos properties have a No Mathematics Problem (defined and solved in Sect. 8 and Appendix 2 below).
See the discussions in [66, 226], pp. 1028–1033); and [228]. Given the terminology introduced and defined in Sect. 8, I maintain that Lanford’s project resolves the reversibility objection but does not resolve the Chaos Asymmetry Problem or the No Mathematics Problem. Uffink and Valente ([228], pp. 160–166) argue for something like the former idea, while Villani (230, pp. 95–100) agrees with the latter thesis.
Samuel Burbury (1831–1911) introduced H so as to supplant Boltzmann’s use of E [54]. Boltzmann would subsequently use H in 1895. Some folks have said that Burbury intended to use η or eta so as to follow Josiah Gibbs’s (1839–1903) representation of entropy. That is not true [50], p. 182. first note, ([88], p. 142. n. 8).
See BWA1, 335; (Boltzmann, Further Studies on the Thermal Equilibrium of Gas Molecules [21]. See also [89], p. 773).
([35, p. 55).
See ([45], p. 47), ([64], pp. 154–155), [88], pp. 319–327), [151, 152, 167].
Cercignani and Lampis [68] argue convincingly that the existence or non-existence of reciprocal collisions depends not so much on the shape of the molecules, but upon the nature of the interactions those molecules are involved in.
Interestingly, Boltzmann’s Lorentz-inspired argumentation does not make use of the Boltzmann equation. Rather, it “rests on a direct evaluation of the effect of collisions on the value of the H-function”. ([88], p. 327).
[60, 59]. The latter was published posthumously. I have not read these papers but was made aware of their contents by the discussion in ([64], pp. 96, 273–276). Also see ([231], pp. 4–8) for a proof sketch.
Darrigol’s proof (and compare the proof in [68]) avoids cycles of collisions and discretization techniques. It assumes that the collisions are corresponding collisions. Unlike Boltzmann’s Lorentz-inspired proof for polyatomic gas types, it does make use of the Boltzmann equation.
([131], p. 3). The points I make in this section stand in contrast to the viewpoint adopted in ([7], p. 361). There, Badino argues that “Boltzmann…did not draw a clear-cut line between a mechanistic and a probability-based account of a system’s approach to equilibrium”. (ibid.) The evidence I articulate in the main text that follows shows that Boltzmann thought of mechanistic explanations as special kinds of causal explanations. There is no evidence that he believed causal explanations were provided by his combinatorial approach. As I reveal in Sect. 8.2.1, the combinatorial approach ignores causal interactions while those ignored instances of causation are central to the H-theorem or mechanistic approach. The latter is more fundamental than the former in Boltzmann’s eyes precisely because it says something more directly about the engine of entropic increase, viz., causal collisions. That Boltzmann’s major influences cut a divide between causal mechanistic explanations and statistical ones is revealed in the remarks of Maxwell’s Theory of Heat. There, Maxwell said that we abandon mechanical descriptions or explanations of physical evolutions when we appropriate statistical methods ([178], p. 339).
In several of Boltzmann’s lectures, he uses the locution ‘mechanical cause’. He does this once in an interesting discussion of medical science. There, Boltzmann speaks as if mechanical explanations are causal explanations ([37], p. 133).
([38], p. 144).
([38], p. 78).
You will recall that according to Hertz, hidden masses explain motions. Forces do not. See Boltzmann’s summary at ([37], p. 90).
([38], p. 79). Boltzmann’s own treatment of mechanics in (Boltzmann, Vorlesungen über die Principe der Mechanik [32]) gives us more insight into Boltzmann’s attitude about Hertzian mechanics, for there Boltzmann would quite clearly disapprove of Hertz’s picture (ibid., 1–6; 37–42).
([38], p. 140). Boltzmann’s lecture notes on natural philosophy from 1903 to 1906 were brought together by Ilse Fasol-Boltzmann. There’s some evidence that Boltzmann may not have read these notes verbatim when delivering his lectures. See the comments of John Blackmore at ([38], p. 133).
The German reads,

"Es ist nun möglich, daß nur gewisse, nicht alle möglichen Positionen und Geschwindigkeiten derselben im Verlaufe der Zeit eintreten können (z. B. wenn sie sich zu Anfang alle in einer auf den Gefäßwänden beiderseits senkrechten Geraden befanden)." BWA2, 14. (Boltzmann, On the Thermal Equilibrium of Gases on Which External Forces Act [23]).
For a related point see ([88], p. 171). However, Darrigol adds, “[s]till, there is no reason to think that Boltzmann believes that the H function could fail to decrease in such cases”. (ibid.) The excerpt quoted in n. 87 provides the very reason Darrigol believes is missing.
([64], p. 120).
E.g., it seems to have been entertained before in (Klein, Development [146]).
Emphasis mine. The original German reads as follows:

"Es ist somit strenge bewiesen, daß, wie immer die Verteilung der lebendigen Kraft zu Anfang der Zeit gewesen sein mag, sie sich nach Verlauf einer sehr langen Zeit immer notwendig der von Maxwell gefundenen nähern muß." BWA1, 345. (Boltzmann, Further Studies on the Thermal Equilibrium of Gas Molecules [21]). Q.v., the translation provided by the source at note 105.
Maxwell’s discussion referenced a “finite being” (q.v., n. 93) and did not use the term ‘demon’. It was William Thomson (Lord Kelvin; 1824–1907) who introduced that notion in his ([221], p. 442, also attributing a definition of the term to Maxwell at the footnote on the same page). Maxwell did not approve of the use of this term ([149], p. 215).
(Maxwell, vol. 2: part 1 [183], pp. 328–334, but see specifically 331–332).
(Maxwell, vol. 2: part 2 [183], p. 585).
(ibid.).
(ibid., 583). This is from his December 6th, 1870 letter to John William Strutt (Lord Rayleigh; 1842–1919). He affirms the quoted conclusion after presenting the “demon” case. In the version articulated in that letter, Maxwell says “I do not see why even intelligence might not be dispensed with and the thing [the sliding plate covering the hole] be made self-acting”. (ibid.) See also the April 13th, 1868 letter to Mark Pattison, specifically at (ibid., 366–367) and ([179], pp. 153–154).
For modern studies of Maxwell’s “demon” case, see ([45], pp. 40–41) ([88], pp. 63–64), [92] who notes the historical fact I’m noting here, ([144]; [208], pp. 239–267), ([210], pp. 621–626).
([168]). See ([92], pp. 218–221), (Garber et al. [109], p. 57).
([17], p. 470), as quoted and translated by ([88], p. 182) emphasis mine.
([149], p. 214), cf. ([210], p. 625). Thomson would follow-up on this thought in ([221]).
(Maxwell, vol. 2. Part 1 [183], p. 361).
(Maxwell, vol. 2: Part 2 [184], p. 582).
See the correspondence cited in ([210], pp. 428–429 n. 81).
(Boltzmann, Certain Questions [28]); (Boltzmann, [Reply to Culverwell] [29]); (Boltzmann, Minimum Theorem [30]).
(Boltzmann, “On the Thermal Equilibrium of Gases on Which External Forces Act” [23]); BWA2, 22–23. Taken from the translation work of ([88], p. 175).
([36], p. 117). I will use ‘$\delta Q$’ to mean exchanged heat. It stands in for Boltzmann’s use of ‘$dQ$’. Below I will also use the expression ‘$\delta W$’ to mean exchanged work.
See the discussion of some of the ideas here in ([58], pp. 266–267, and [203], p. 231).
Well, for an ideal gas, entropy equals minus kHV ([93], p. 42).
As Segrè noted, “[u]ltimately, Boltzmann showed that H was the negative of the entropy. He had thus connected thermodynamics with mechanics, but through the roundabout way of the H-theorem”. ([203], p. 243).
[78]. The integral actually first appears in [72]. Q.v., n. 116. My reading of Clausius follows ([58, 64], pp. 80–85), ([88], pp. 42–50, [90], pp. 301–303) in some places, but it also depends heavily upon my own independent assessment of the work of Clausius.
([64], p. 82). See also ([88], pp. 47–48), [187], pp. 1–12).
In contemporary discussions, U represents internal energy. For the idea in Clausius, see ([72, 79, 84, 90], pp. 293–294), and the ensuing discussion in the main text below.
Even in contemporary physics, Eq. (12) is also commonly understood to be a mathematical expression of the first law of thermodynamics ([3], p. S1119; [244], p. 311).
Clausius remarked, “work may transform itself into heat, and heat conversely into work, the quantity of one bearing always a fixed proportion to the other”. ([84], p. 23 emphasis removed). For idealized gases and fluids, dW=pdV. This equation expresses the equivalence of heat and work, the very principle discovered by Julius Robert Mayer (1814–1878) and James Prescott Joule (1818–1889).
“…a motion of the particles does exist, and that heat is the measure of their vis viva”. ([71], p. 4).
([148], p. 207 emphasis removed).
([84], p. 78). He allowed for the passage of heat from cold to warmer bodies so long as there was a simultaneously occurring compensating process.
([72], p. 500),

"Demnach gilt für alle umkehrbaren Kreisprocesse als analytischer Ausdruck des zweiten Hauptsatzes der mechanischen Wärmetheorie die Gleichung (II.) $\int \frac{\delta Q}{T}=0$." (ibid. emphasis in the original).
([179], p. 162). The remark is repeated twice at (ibid., 190 and 191).
See ([58], p. 276, [69], pp. 142–144). The experiment involved a large container with two chambers separated by a diaphragm. The container features only thermally insulated walls cutting off all heat exchange between the container’s contents and the container’s environment. A gas is introduced into one of the chambers, and the diaphragm subsequently released. A free expansion takes place. As the gas expands, evolving adiabatically (no heat exchange!), no work is performed, no temperature change takes place, and yet entropy increases.
From ([78]), as translated by ([58], p. 273 my emphasis).
He calls both the first and second laws of thermodynamics “fundamental laws of the universe” and “fundamental theorems of the mechanical theory of heat” in his 1865 memoir ([78]). See ([58], pp. 272–273) and the citations and quotations therein.
([84]).
([90], pp. 293–294).
([84], p. 20).
([84], p. 11), cf. ([90], pp. 293–294), ([91]).
“…the increase of disgregation is the action by means of which heat performs work…” (Clausius, Application of the Theorem [75], p. 91).
([80], p. 220). Cf. ([142], p. 136). The notion of disgregation has a not too distant cousin in prior work by Rankine. The relevant notion in Rankine’s work is the metamorphic function.
([90], pp. 293–294).
([176]). Boltzmann thinks he has proven the existence of Clausius’s disgregation in (Boltzmann, Analytical Proof of the Second Principle [20]; [88], p. 130).
As quoted and translated by ([58], p. 272). Taken from ([78], p. 353). This understanding does not go away in his later work. See, e.g., ([84], p. 107).
([58], p. 272).
([15]).
([64], pp. 83–84), [88], pp. 108–109). That Boltzmann is extending Clausius’s concept of entropy is the opinion of Olivier Darrigol ([88], p. 70), though he uses the phrase “counterpart to Clausius’s entropy in periodic mechanical systems”.
The idea behind both of Boltzmann's and Clausius's 1871 papers was to extend this relation to systems featuring periodic molecular motion, and so additional terms are expressed via additional equalities. See ([88], p. 108).
See ([88], p. 109) for the details. Darrigol adds, “I agree with Clausius that Boltzmann’s derivation of the second equation implicitly excludes a change in the potential function”. (ibid.).
As quoted by ([88], p. 109). Boltzmann claimed in (Boltzmann, On the Mechanical Significance [Meaning] of the Second Law of Heat Theory [15]) to have derived a Clausiustical entropy law from the principle of least action, hence Maxwell’s reference to Hamilton’s principle.
As further evidence for the claims in the main text, consider the fact that the paper Boltzmann references in PERIOCOPE above is (Boltzmann, Analyatical Proof of the Second Principle [20]). There Boltzmann attempted to specify the entropy of a system that exemplifies what’s called the canonical distribution or $\rho \left(x\right)={e}^{-\beta H(x)}/\int {e}^{-\beta H}d\sigma$ (where $d\sigma$ provides the phase orbit invariant measure on the phase space used to model the system, and where the H(x) here is the phase (all positions and velocities of the atoms in the system) dependent energy of the same system). Here Boltzmann is reaching back to his earlier attempt in (Boltzmann, On the Mechanical Significance [Meaning] of the Second Law of Heat Theory [15]) to provide a mechanical explanation of the second law of thermodynamics. He thought that he could apply the notion of Clausius entropy to gas systems featuring molecules that enjoy periodic motions, subsequently coming to understand that he won’t be able to explain non-periodic gas systems. He follows (and this is further evidence that he’s working with Clausius’s understanding of entropy) (Clausius, Remarks on the Priority Claim of Mr. Boltzmann [82]) in his attempt to derive Clausius’s notion of disgregation. He uses that bit of ideology with its underlying concept to retrieve the accepted minus integral expression for the entropy of a system abiding by the canonical distribution, noting along the way, that one could relate or associate entropy with kinetic and potential energy. Like Clausius, Boltzmann is here plainly describing changes in entropy in terms of transformations of energy, and he is fully embracing not only Clausius’s notion of entropy but also Clausius’s notion of disgregation! See ([88], pp. 128–133, on which I lean) for additional commentary.
([89], p. 773). See also ([136], p. 64), ([155], p. 42); (Klein, Development [146], p. 68).
(Brush vol.1 [46], p. 80).
([86], p. 96). More contemporaries of Boltzmann could be cited.
([148], p. 237).
See ([88], p. 529, [226], Sect. 4.1).
([88], p. 531). But see BWA1, 316–317 for these ideas in Boltzmann.
Agreeing with this nice point in ([88], p. 532).
As will become clear, the theorem must also make use of the HMC (q.v., Sect. 8 for a definition).
See ([169]), (Boltzmann, Comment on Some Problems in Mechanical Heat Theory [24], [88], p. 198).
As Olivier Darrigol said in correspondence,

"If you (and Kuhn) mean that for Boltzmann the combinatorial entropy formula was not primitive and that the Boltzmann equation and the equilibrium theorems were in the end more important, I completely agree. For a couple of years after 1877 he seems to have believed that he had a new way to compute thermodynamical equilibrium with this formula. But he later realized (in 1881) that the formula [was] in fact derived from the better founded microcanonical distribution. In the lectures on gas theory, the combinatorial entropy formula is there only as a ‘mathematical illustration’ of the H function, which is introduced through the Boltzmann equation and the H theorem." (11/19/2019 emphasis mine).

Badino raises an important question that few in the literature have sought to answer, “…if it is true that Boltzmann in 1877 abandoned a strict mechanistic view in favor of a probabilistic one, why did he consistently keep using the 1872 approach in his publications throughout the rest of his life?” ([7], pp. 354–355). I believe that Badino and I would reply that he never abandoned the mechanistic view but our attitudes about how best to understand Boltzmann’s views about mechanics, probability, the H-theorem, and the combinatorial arguments, differ substantially.
As translated and quoted by ([155], p. 45).
([35], p. 55).
([35], p. 421 emphasis mine, see also page 432 where he says, “Hence dH/dt will be negative, and can be zero only when the condition (266) is satisfied for all collisions”.).
([155], p. 57).
Jungnickel and McCormmach would go on to point out that Boltzmann seems to judge that the old mechanical picture was starting to be superseded by a “new atomistic picture” ([136], p. 191). But that picture is not provided by statistical mechanics. It is provided by “modern electron theory”.
(Boltzmann, Certain Questions [29], p. 413).
(Boltzmann, Certain Questions [29], pp. 413–414) emphasis mine. Cf. ([88], p. 373).
(Boltzmann, Certain Questions [29], p. 414).
Again see [158,159,160]. More precisely, what Lanford showed was that in the Boltzmann-Grad limit and for systems approximated by the hard sphere model, given smallness of time, that a particular weak chaos property holds initially, and some other assumptions, one can move from the BBGKY formulation or hierarchy (of equations) to the Boltzmann equation, itself formulated in terms of a hierarchy (the Boltzmann hierarchy). Of course, the BBGKY can be connected to Hamiltonian mechanics. For that, see ([228], pp. 147–150).
Again see Lanford [158,159,160];. But see also my comments on the relevant result in footnote 70.
(Villani, Lecture [233]); (Villani, Lecture Notes [234], slide 17); cf. (Villani, Math Berlin [232]).
As quoted and cited by ([136], p. 188).
ibid., 189. Except for the word ‘his’, the emphasis is mine.
As quoted and translated by ([136], p. 189). See ([33]).
See also (Klein, Mechanical [145], p. 73).
These points are made by ([155], p. 70).
See ([236]). This is the second edition of the work. The first edition was published in 1876.
(M. J. Klein, Paul Ehrenfest [143], p. 122).
(Ehrenfest, Scientific Papers [99], p. 135) as translated by ([145], p. 72). Cf. ([147], p. 166).
The series of arguments and replies were published after the August 1894 meeting of the British Association for the Advancement of Science at the University of Oxford. Boltzmann referred to this meeting as “the unforgettable meeting of the British Association at Oxford” ([35], p. 22). For many of the details on the discussion I lean, not only on my own readings, but also on those in [44], (Brush, Vol. 2 [47], pp. 616–625), [49], ([64], pp. 120–133), ([88], pp. 366–382), ([95]); (Klein, Ehrenfest [143], pp. 110–112).
The proof had a flaw which Culverwell corrected ([88], p. 368).
Watson thought the proof was purely mechanical.
I should add that in (Boltzmann, Certain Questions [29], p. 414), Boltzmann does cite his 1877 combinatorial arguments so as to back the claim that he had already argued that the second law of thermodynamics is a statistical law.
See ([53], p. 29), ([55], p. 78).
I do not believe the necessary assumption is Burbury’s (Condition A) or the Ehrenfests’ [100] Stoßzahlansatz. As I will soon reveal, it will not ultimately matter which characterization you choose, for all believe the necessary assumption about the nature of the involved collisions is asymmetric and all believe the assumption is not part of the laws of Hamiltonian mechanics.
As Villani put it,

"…for most initial configurations, the evolution of the density under the microscopic dynamics is well approximated by the solution to the Boltzmann equation. Of course, this does not rule out the existence of ‘unlikely’ initial configurations for which the solution of the Boltzmann equation is a very bad approximation of the empirical measure." ([230], p. 98).

It is this idea that Boltzmann’s combinatorial arguments are meant to illustrate.
As quoted and annotated by ([88], p. 323). Some maintain that Burbury was the first to point out the HMC assumption, but this is incorrect. In fact, Burbury required that there be a persisting external perturbation that ensures that systems evolve in a manner consistent with the HMC. No one accepted Burbury’s particular way of couching the HMC. Bryan’s citation of Burbury in his [53] work is probably just an attempt to document that the recognition of a related assumption in Boltzmann’s work appears in the work of Burbury. Strictly speaking, Bryan’s diagnosis of the precise content of the assumption was different from the content of Burbury’s diagnosis.
([35], p. 42). See also ibid., 58–59; (Boltzmann, Maxwell's Distribution Again [28], ([64], p. 259, and [155], p. 64). Something like the idea expressed here may even be in the work of Clausius ([100], p. 5).
For more on the Boltzmann-Kirchhoff debate, see ([88], pp. 320–321,360–361).
Sometimes the assumption is said to be equivalent to the claim that the distribution function satisfies: ${f}^{\left(2\right)}\left({\mathbf{v}}_{1},{\mathbf{v}}_{2}\right)=f({\mathbf{v}}_{1})f({\mathbf{v}}_{2})$, where ${f}^{\left(2\right)}$ is the distribution function for a pair of molecules. Here the idea is that the probability of seeing a pair of molecules with velocities v₁ and v₂ (around d³v₁ and d³v₂ respectively) is equal to the product of finding a molecule with v₁ around d³v₁, and a molecule with v₂ around d³v₂ ([57], p. 85), cf. the remarks at ([226], p 1036) on the BBGKY approach. Villani ([230], p. 99) argues that this is not an adequate characterization and that it actually needs to be generalized sufficiently to get the right result. Villani does not know how to do this and worries that it can’t be done. I agree with Villani and rest on his authority.
([230], p. 99). In his well-regarded book, Herbert Spohn remarked, “…the decrease of [the] H-function is linked to instants of molecular chaos. These properties remain a guess” ([212], p. 76) emphasis mine.
([44], p. 181). See the same point in ([196], p. 40).
See also ([44], p. 175), (Brush, vol. 2 [47], pp. 443–444, "the later Maxwell–Boltzmann developments [are] based on consideration of molecular collisions" 619); ([49], pp. 25–26 “Boltzmann proved that…collisions always push f(x,v,t) toward the equilibrium Maxwell distribution” ibid. and see ibid., 22 on the idea in Maxwell), ([57], p. 89 reporting in n. 3 that Jos Uffink agrees), ([88], pp. 321–323 on the idea in Lorentz's thought); the idea is clearly in related work by Kirchhoff, for which see (ibid., 361); ([136], p. 64, with remarks about Boltzmann), (Klein, Ehrenfest [143], p. 100, attributing the view to Boltzmann, see also p. 102); ([174], pp. 62, 64, [203], p. 279, [206], p. 32).
([155], p. 62).
(Brush vol.1 [46], p. 80). It should not surprise us then to see in Boltzmann’s interpretation of the second law as explained by the H-theorem, remnants of Thomson’s (and Clausius’s) idea of energy dissipation. Those remnants show up in Maxwell’s own interpretation (which influenced Boltzmann’s work) of the second law as well. Although energy dissipation in Maxwell’s thought possessed a certain anthropocentric element. See ([209], pp. 303–304 and n. 41), [208], pp. 240–241,247–252); ([210], p. 623). For Maxwell’s actual work, see SPM2, 646.
There are worries about Poincaré recurrence and fluctuations looming. I have answers for those worries too. My explication of them must be left for another project.
([148], p. 2 cf., 350). We do have to be careful not to mix up or confuse energy and exergy. Exergy is also a useful quantity in thermodynamics. It is defined as “the capability to do useful work” (ibid., 351).
([199], p. 106).
In the case of disagreeing angles one affirms: W = F • s cos θ .
The rate of changes of kinetic energy are what’s important, for $T=\frac{1}{2}m{v}^{2}$ (for the single classical point mass) never has an absolute value because the point mass’s velocity or speed will be relative to a reference frame.
This was the opinion of Newton, Leibniz, Huygens, Lagrange, Hamilton, Laplace, Maxwell, Boltzmann, Helmholtz, Gibbs and a great many others. I’ll very briefly focus on Newton and Hamilton because they are the most relevant in this context.

Newton: Newton said that “forces…are the causes and effects of true motions”. ([188], p. 414). The entire purpose of the Principia is given in this statement at the end of the Scholium:

"But in what follows, a fuller explanation will be given of how to determine true motions from their causes, effects, and apparent differences, and, conversely, of how to determine from motions, whether true or apparent, their causes and effects. For this was the purpose for which I composed the following treatise." ([188], p. 415).

Hamilton: Sir William Rowan Hamilton’s (1805–1865) causal mechanics was indebted to Immanuel Kant’s (1724–1804) “Second Analogy of Experience” in the first Critique ([137], pp. 304–316). Like Kant, Hamilton believed that every dynamical evolution had to involve some causality ([123], p. 179). In the first of Hamilton’s two most famous papers on dynamics, “On a General Method in Dynamics” [120, 121], pp. 103–161), Hamilton reasons to what he calls the law of varying action (LVA):
$$\delta V=\sum m\left(\dot{x}\delta x+\dot{y}\delta y+\dot{z}\delta z\right)-\sum m\left(\dot{a}\delta a+\dot{b}\delta b+\dot{c}\delta c\right)+t\delta H$$
(Eq. 1 n. 185): also calling it the “equation of the characteristic function” $V$( [120], p. 252). $V$ “completely determines the mechanical system and gives us its state at any future time once the initial conditions are specified” ([123], p. 186). At the time, the function $V$ was sometimes called the action of the system, hence “law of varying action”. The above statement of the LVA entails that $V$ is a function of the 3n-coordinates for whatever point masses are in the system, and the Hamiltonian $H$. As I point out in the main text above, for conservative systems:
$$H=T+U$$
(Eq. 2 n. 185): Kinetic and potential energy enter the LVA through $H$. Importantly, Hamilton calls $U$ the force-function because it is always associated with a corresponding force ([120], p. 249). In addition, Hamilton explicitly connects variations of $U$ to work done by subsystems ([123], p. 184), and also defines $U$ in terms of a force law ([120], p. 249). For Hamilton, this is how dynamics is causation-laden.
And I do have in mind the Lagrangian and not the Lagrangian density.
([169], p. 139) as quoted and translated by ([64], p. 98).
([221], p. 442).
On this distinction, see ([237], pp. 52–71) and the literature cited therein.
([88], p. 82). A modern account resembling Maxwell’s can be found in ([218], pp. 557–593).
You see this in the way he characterizes collisions. Writing to Stokes in 1859, he said,

"I saw in the Philosophical Magazine…a paper by Clausius on the ‘mean length of path of a particle of air or gas…’…on the hypothesis of the elasticity of gas being due to the velocity of its particles and of their paths being rectilinear except when they come into close proximity to each other, which event may be called a collision." (Maxwell vol.1 [182], p. 606 emphasis mine).

In his 1867 paper he writes,

"In the present paper I propose to consider the molecules of a gas, not as elastic spheres of definite radius, but as small bodies or groups of smaller molecules repelling one another with a force whose direction always passes very nearly through the centres of gravity of the molecules." ([174], emphasis mine).

Quite clearly Maxwell had in mind molecules that interact in other ways besides elastic collisions involving impacts. But as I stated in the main text, real world molecules and particles interact by means of repulsions or attractions plus impacts. For example, there are electron–electron collisions or scatterings, especially at high energy levels, despite coulombic repulsion [163]. In dense plasma recombination phenomena, electron–electron collisions occur. However, these recombinations are not similar to ionic three-body recombination phenomena precisely because of operating Coulombic forces in the former recombination cases [8]. My reader will retort that the molecular or particulate world is a quantum world. Sure. But in the phenomenon of ionization as causally produced by a free electron, the free electron comes in and strikes, thereby impacting, an electron bound to an atom. The energy transferred to the bound electron is greater than the binding energy of the bound electron. Thus, the impact and resulting energy transfer frees the bound electron from the atom. It is true that such a case is captured or explained by quantum physics, however, the scattering involved is elastic and the cross-sections of each electron are the same in both the quantum and classical domains ([150], p. 215). The electron–electron interactions they discuss are cases involving real impact. See the very title of their paper.), cf. [185]. You can therefore “use classical methods for [the] evaluation of the ionization cross-sections of an atomic particle by electron impact” ([150], p. 215 emphasis mine). I can ensure the relevance and accuracy of classical physics for this phenomenon by restricting my discussion to slower electron velocities and non-highly excited atoms. I do this because Hans Bethe (1906–2005) (this point is made by ibid.) showed that with respect to large electron velocities, an additional (beyond the classical) logarithmic factor exists in the cross-section of ionization [12]. The classical method used by Kosarim et. al. adequately accounts for the experimental data.

It is sometimes said that the molecules of ideal gases do not interact at all ([104], p. 119). That is not true. The equation of state for ideal gases (i.e., the ideal gas law) includes the quantity that is pressure. Pressure is force over unit area. If the ideal gas were confined to a container, the molecules would causally produce pressure by interacting with or impacting the boundaries, themselves atomically constituted, of that container. There would fail to exist pressure in the system if there were no such interactions. This is why modern work in thermodynamics assumes that ideal gas molecules do in fact undergo interactions with perfectly elastic and adiabatic boundaries. In fact, ideal particles or molecules can bring about “irreversible work contributions” through transferring momentum with a moving piston by interacting with that piston ([122], pp. 2, 13). The types of interactions that are precluded in the ideal gas case are interactions via repulsions and attractions. How else could an ideal gas reach thermal equilibrium if its velocities never changed as a result of accelerations wrought by impressed (at least impact) forces? Modern theorists are careful to note that “[f]or an ideal gas interactions between all molecules are supposed negligible, other than for establishing thermal equilibrium” ([40], p. 3 emphasis mine). That ideal gas constituents collide with each other thereby impressing impact forces upon each other, is the standard view ([6], p 25, [139], p. 244, [240], p. 351, citations could be multiplied).
I should add that modeling from a distance is also important to Maxwell because when many molecules collide matters become intractable. This is not because we lack the ingenuity to solve the equations appropriately, it is because we do not have the right equations! He wrote, “[w]hen we come to deal with collisions among bodies of unknown number, size, and shape, we can no longer trace the mathematical laws of their motion with any distinctness”. ([170], p. 53) emphasis mine; (SPM1, 354). Garber adds,

"He [Maxwell] concluded by noting the inability of dynamics to address this last problem…Mechanics cannot deal with collisions among many bodies flying around…." ([110], p. 1701) emphasis mine.

For Clausius, collisions and even “impacts” resulting in rebound effects are not instances in which centers of gravity or gas constituents literally come into contact with one another. It was enough for Clausius that the centers enter one another’s spheres of action (q.v., my discussion of Clausius in Sect. 2 above).
([88], p. 139). Boltzmann wrote,

"Das Produkt dieser drei Größen muß noch multipliziert werden mit einem gewissen Proportionalitätsfaktor, von dem man leicht einsieht, daß er unendlich klein, wie dξ sein muß. Derselbe wird im Allgemeinen von der Natur des Zusammenstoßes, also von den, den Zusammenstoß bestimmenden Größen x, $x{^{\prime}}$ und ξ abhängen." BWA1, 324 emphasis mine.

Here Boltzmann clearly states that the nature of the binary collisions is determined by pre-collision kinetic energies and the one post-collision kinetic energy.
(Frigg and Werndl, [107], p.6).
([116], p. 28).
([230], p. 79).
The best discussion of how Newton understood his second law of motion can be found in ([194]), although I would add and emphasize a causal force ontology in Newton’s thought.
([242], p. 69). Some will object. They will note that if Newton’s Principia does anything it provides the correct physics of billiard ball interactions and evolutions. This is not the case ([241], pp. 567–598). As Wilson has said,

"What should be properly said is that Newton and his followers practiced an admirable restraint in their descriptive ambitions, by substituting a crude but reliable walk-around method for a very difficult moving boundary computation. Even today, modern models of impact follow a Newtonian pattern whenever they can get away with it…" ([243], p. 105. n. 13 emphasis mine).
([94], p. 535).
([243], p. 116). See ([164], p. 124). Wilson goes on to point out that Leibniz was at the time concerned with a cut-off method employed by Christiaan Huygens (1629–1695). That cut-off procedure resembles the matched asymptotics of both Newton and modern modeling.
As Wilson’s summary of Leibniz stated,

"…it is only by plowing over these $\Delta {t}^{*}$ events that we can explain the elastic behavior of our original wooded beam in a purist efficient causation manner that speaks of nothing but the pushing and pulling of contacting particles." ([243], p. 117).

I should add that unlike Leibniz I see no room in the temporal intervals for the final causation that is discussed in the context of detailing the importance of “the mutual interactions of bodies” ([165], p. 142).
If you follow the many philosophers of physics who maintain that the crucial asymmetric assumption of Boltzmann’s reasoning is different from the HMC as I have stated it, and that it is, instead something like the Stoßzahlansatz as explicated by the Ehrenfests, then you would do well to note that in ([100], pp. 85. n. 65) a proof-sketch is summarized. The argument shows that the Stoßzahlansatz cannot hold in both the real world and reversed evolutions. Compare the similar stronger argumentation in (Burbury [53], 320 I skip the meat and potatoes and give the thesis and conclusion),

"I said in my first letter on this subject that the condition A [an asymmetric assumption like the HMC], on which, or its equivalent, the proof is based, could not apply to the reversed motion. As that assertion has been questioned, may I confirm it thus?…Boltzmann’s theorem can be applied to both motions only on condition that it has no effect in either."
Why are they able to do the trick? How can time-reversal invariant modeling, modeling which when reversed yields past-directed evolutions, recover descriptions of asymmetric future-directed evolutions? That is a very interesting question, a question which Leibniz believed suggested teleology. I will not delve into this particular matter.
([118], p. 2351).
([10], p. 1). See also [130].
They remarked,

"The relative lack of velocity correlations in the second layer at low densities is evidence of the presence of molecular chaos in this system. The upper layer continues to demonstrate uncorrelated velocities until the density reaches 80%." ([10], p. 4).

They would add that they are unsure of how it is precisely that the correlations obtain in the system, but it seems clear that the interactions play a key role. Why else would density matter?
([104], p. 105). I have changed Frigg’s inequality from greater than or equal to, to just greater than.
Again, for a precise statement of Lanford’s theorem, see ([212], 64 theorem 4.5). Spohn also provides a rigorous statement of the necessary factorization condition.
([228], p. 160).
([158], p. 81).
([160], p. 75) emphasis in the original.
For Annalen der Physik or Annalen der Physik und Chemie (the latter title was used from 1824 to 1899), I cite volume numbers in accord with the norms established by the journal in June of 2010. This footnote pertains to the references section.

Abbreviations

BWAn:: Wissenschaftliche Abhandlungen von Ludwig Boltzmann, edited by Fritz Hasenöhrl (Leipzig: Barth, 1909), vol. n.
SPMn:: The Scientific Papers of James Clerk Maxwell, edited by W.D. Niven. (Cambridge: Cambridge University Press, 1890), vol. n.

References

Albert, D.Z.: Time and Chance. Harvard University Press, Cambridge (2000)
Book MATH Google Scholar
Albert, D.Z.: After Physics. Harvard University Press, Cambridge (2015)
Book Google Scholar
Alonso, A.A., Erik Ydstie, B.: Process systems, passivity and the second law of thermodynamics. Comput. Chem. Eng 2, S1119–S1124 (1996)
Article Google Scholar
Andrews, J.P.: The direct verification of Maxwell’s law of molecular velocities. Science Progress in the Twentieth Century (1919-1933) 23(89), 118–123 (1928)
Google Scholar
Appleby, D.M.: Facts, values and quanta. Found. Phys. 35(4), 627–668 (2005)
Article ADS MathSciNet MATH Google Scholar
Argo, P.E.: The Role of Acoustic-Gravity Waves in Generating Equatorial Ionospheric Irregularities. University of California San Diego Department of Physics, San Diego (1981)
Google Scholar
Badino, M.: Mechanistic slumber vs. statistical insomnia: the early history of Boltzmann’s H-Theorem (1868–1877). Eur. Phys. J. H. 36, 353–378 (2011)
Article Google Scholar
Bates, D.R., Kingston, A.E.: Recombination through electron-electron collisions. Nature 189, 652–653 (1961)
Article ADS Google Scholar
Batterman, R.W.: Randomness and probability in dynamical theories: on the proposals of the Prigogine school. Philos. Sci. 58(2), 241–263 (1991)
Article MathSciNet Google Scholar
Baxter, G.W., Olafsen, J.S.: Experimental evidence for molecular chaos in granular gases. Phys. Rev. Lett. 99, 028001 (2007)
Article ADS Google Scholar
Ben-Menahem, Y., Hemmo, M.: Introduction. In: Ben-Menahem, Y., Hemmo, M. (eds.) Probability in physics, pp. 1–16. Springer-Verlag, Berlin (2012)
Chapter Google Scholar
Bethe, H.: Zur Theorie des Durchgangs schneller Korpuskularstrahlen durch Materie. Ann. Phys. 397, 325–400 (1930)
Article MATH Google Scholar
Bevilacqua, F.: Helmholtz’s Über die Erhaltung der Kraft: The Emergence of a Theoretical Physicist. In: Cahan, D. (ed.) Hermann von Helmholtz and the Foundations of Nineteenth-Century Science, pp. 291–333. University of California Press, Berkeley (1993)
Google Scholar
Bishop, R.C.: Nonequilibrium statistical mechanics Brussels—Austin style. Stud. History Philos. Modern Phys. 35(1), 1–30 (2004)
Article ADS MathSciNet MATH Google Scholar
Boltzmann, L.: Über die mechanische Bedeutung des zweiten Hauptsatzes der Wärmetheorie. Wiener Berichte 53: 195–220. Cited as “On the Mechanical Significance [Meaning] of the Second Law of Heat Theory”. BWA1, pp. 9–33 (1866)
Boltzmann, L.: Studien über das Gleichgewicht der lebendigen Kraft zwischen bewegten materiellen Punkten. Wiener Berichte 58: 517–560. Cited as: “Studies on the Equilibrium of Live Force Between Moving Material Points” BWA1, 49–96 (1868)
Boltzmann, L.: Theorie der Wärme. Fortschr. Phys. 26, 441–504 (1870)
Google Scholar
Boltzmann, L.: Über das Wärmegleichgewicht zwischen mehratomigen Gasmolekülen. Weiner Berichte 63: 397–418. Cited as: “On the Thermal Equilibrium Between Polyatomic Gas Molecules”. BWA1, 237–258 (1871)
Boltzmann, L.: Über die Druckkräfte, weiche auf Ringe wirksam sind, die in bewegte Flüssigkeit tauchen. Journal für die reine und angewandte Mathematik. 1871 (73): 111–134. Cited as: “On the Compressive Forces”. BWA1, 200–227 (1871)
Boltzmann, L.: Analytischer Beweis des zweiten Hauptsatzes der mechanischen Wärmetheorie aus den Sätzen über das Gleichgewicht der lebendigen Kraft. Wiener Berichte 63: 712–732. Cited as: “Analytical Proof of the Second Principle”. BWA1, 288–308 (1871)
Boltzmann, L.: Weitere Studien über das Wärmegleichgewicht unter Gasmolekülen. Wiener Berichte 66: 275–370. Cited as: “Further Studies on Thermal Equilibrium among Gas Molecules”. BWA1, 316–402 (1872)
Boltzmann, L.: Über einige an meinen Versuchen über die elektrostatische Fernwirkung dielektrischer Körper anzubringende Korrektionen. Wiener Berichte 70: 307–341. BWA1, 556–586 (1874)
Boltzmann, L.: Über das Wärmegleichgewicht von Gasen, auf welche äußere Kräfte wirken. Wiener Berichte 72: 427–457. Cited as: “On the Thermal Equilibrium of Gases on Which External Forces Act”. BWA2, 1–30 (1875)
Boltzmann, L.: "Bemerkungen über einige Probleme der mechanischen Wärmetheorie. Wiener Berichte 75: 62–100. Cited as: “Comment on Some Problems in Mechanical Heat Theory” BWA2, 112–148 (1877)
Boltzmann, L.: Über die Beziehung zwischen dem zweiten Hauptsatze der mechanischen Wärmetheorie und der Wahrscheinlichkeitsrechnung respektive den Sätzen über das Wärmegleichgewicht. Wiener Berichte 76: 373–435. Cited as: “On the Relation between the Second Law and Probability Calculus”. BWA2, 164–223 (1877)
Boltzmann, L.: Über einige Fragen der kinetischen Gastheorie”. Wiener Berichte, 96: 891–918. Cited as: “On Some Questions about Kinetic Gas Theory”; BWA3, 293–320. Also published in English as: Ludwig Boltzmann. 1888. “On Some Questions in the Kinetic Theory of Gases. Philos. Mag. 25(153), 81–103 (1887)
Article Google Scholar
Boltzmann, L.: On the Application of the Determinantal Relation to the Kinetic Theory of Polyatomic Gases. Report of the Sixty-Fourth Meeting of the British Association for the Advancement of Science Held at Oxford in August 1894. London: John Murray, 102–106. Cited as: “Application”. BWA3: 520–525 (1894)
Boltzmann, L.: Nochmals das Maxwellsche Verteilungsgesetz der Geschwindigkeiten. Annalen der Physik und Chemie 291 (5): 223–224. Cited as: “Maxwell’s Distribution Again”. BWA3, 532–534 (1895)
Boltzmann, L.: On certain questions of the theory of gases. Nature 51: 413–415. Cited as: “Certain Questions”. BWA3, 535–544 (1895)
Boltzmann, L.: Professor Boltzmann’s letter on the kinetic theory of gases. Nature 51: 581. Cited as: “[Reply to Culverwell]”. BWA3: 545, there appearing as “Erwiderung an Culverwell” (1895)
Boltzmann, L.: On the minimum theorem in the theory of gases. Nature 52: 221. Cited as: “Minimum Theorem”. BWA3, 546 (1895)
Boltzmann, L.: Vorlesungen über die Príncipe der Mechanik. Verlag von Johann Ambrosius Barth, Leipzig (1897)
MATH Google Scholar
Boltzmann, L.: Über die Entwicklung der Methoden der theoretischen Physik in neuerer Zeit. In Ludwig Boltzmann, Populäre Schriften. (Leipzig: Verlag von Johann Ambrosius Barth, 1905), 198–227 (1899)
Boltzmann, L.: Populäre Schriften. Verlag von Johann Ambrosius Barth, Leipzig (1905)
MATH Google Scholar
Boltzmann, L.: Lectures on Gas Theory. Translated by Stephen G. Brush. New York: Dover (1964)
Boltzmann, L.: Further studies on the thermal equilibrium of gas molecules. In Kinetic theory: Volume 2: irreversible processes, edited by Stephen G. Brush, translated by Stephen G. Brush, 88–175. New York: Pergamon Press (1966)
Boltzmann, L.: Theoretical physics and philosophical problems, edited by Brian McGuinness. Translations from German were carried out by Paul Foulkes. Dordrecht-Holland: D. Reidel Publishing (1974)
Boltzmann, L.: Ludwig Boltzmann his later life and philosophy, 1900-1906 book one: a documentary history, edited by John Blackmore. Dordrecht: Springer (1995)
Boltzmann, L.: Further studies on the thermal equilibrium of gas molecules. In Stephen G. Brush, The Kinetic Theory of Gases: An Anthology of Classical Papers with Historical Commentary, translated by Stephen G. Brush, edited by Nancy S. Hall, 262–349. London: Imperial College Press (2003)
Bowler, M.G.: The physics of osmotic pressure. Eur. J. Phys. 38(5), 055102 (2017)
Article MATH Google Scholar
Bricmont, J.: Science of Chaos or Chaos in science? In: Gross, P.R., Levitt, N., Lewis, M.W. (eds.) The Flight from Science and Reason, pp. 131–175. New York Academy of Sciences, New York (1996)
Google Scholar
Brown, H., Lehmkuhl, D.: Einstein, the reality of space, and the action-reaction principle. In: Ghose, P. (ed.) Einstein, Tagore and the Nature of Reality, pp. 9–36. Routledge Publishers, New York (2017)
Google Scholar
Brown, H.R., W. Myrvold.: Boltzmann’s H-Theorem, its Limitations, and the Birth of (Fully) Statistical Mechanics. arXiv:0809.1304v1 [physics.hist-ph] (2008)
Brown, H.R., Myrvold, W., Uffink, J.: Boltzmann’s H-Theorem, its Discontents, and the Birth of Statistical Mechanics. Studies in History and Philosophy of Modern Physics 40(2), 174–191 (2009)
Article ADS MathSciNet MATH Google Scholar
Brush, S.G.: The development of the kinetic theory of gases VIII. Randomness and irreversibility. Archive History Exact Sci 12(1), 1–88 (1974)
Article MathSciNet Google Scholar
Brush, S.G.: The Kind of Motion We Call Heat: A History of the Kinetic Theory of Gases in the 19th Century Book 1: Physics and the Atomists. Amsterdam: North-Holland. Cited as: “Vol. 1” (1976)
Brush, S.G.: The Kind of Motion We Call Heat: A History of the Kinetic Theory of Gases in the 19th Century Book 2. Statistical Physics and Irreversible Processes. Amsterdam: North Holland. Cited as: “Vol. 2” (1976)
Brush, S.G.: Statistical Physics and the Atomic Theory of Matter: From Boyle and Newton to Landau and Onsager. Princeton University Press, Princeton, NJ (1983)
Google Scholar
Brush, S.G.: Gadflies and Geniuses in the History of Gas Theory. Synthese 119(1/2), 11–43 (1999)
Article Google Scholar
Brush, S.G.: The Kinetic Theory of Gases: An Anthology of Classic Papers with Historical Commentary, edited by Nancy S. Hall. London: Imperial College Press (2003)
Brush, S.G., Everitt, C.W.F., Garber, E.: Maxwell on Saturn’s Rings. MIT Press, Cambridge (1983)
Google Scholar
Bryan, G.H.: Report on the present state of our knowledge of thermodynamics part II: the laws of distribution of energy and their limitations. Report of the Sixty-Fourth Meeting of the British Association for the Advancement of Science Held at Oxford in August 1894. London: John Murray, 64–102 (1894)
Bryan, G.H.: The assumptions in Boltzmann’s minimum theorem. Nature 52, 29–30 (1895)
Article MATH Google Scholar
Burbury, S.H.: On some problems in the kinetic theory of gases. Philos. Mag. 30(185), 298–317 (1890)
Article MATH Google Scholar
Burbury, S.H.: Boltzmann’s Minimum Function. Nature 51, 78 (1894)
Article ADS MATH Google Scholar
Burbury, S.H.: Boltzmann’s Minimum Function. Nature 51, 320 (1895)
Article ADS Google Scholar
Callender, C.: The past histories of molecules. In: Beisbart, C., Hartmann, S. (eds.) Probabilities in Physics, pp. 83–113. Oxford University Press, New York (2011)
Chapter Google Scholar
Cardwell, D.S.L.: From Watt to Clausius: The Rise of Thermodynamics in the Early Industrial Age. Cornell University Press, Ithaca (1971)
Google Scholar
Carleman, T.: Problèmes Mathématiques dans la théorie cinétique des gaz. Almqvist & Wiksells Boktryckeri AB, Upssala (1957)
MATH Google Scholar
Carleman, T.: Sur la théorie de l’équation intégrodifférentielle de Boltzmann. Acta Mathematica 60, 91–146 (1933)
Article MathSciNet MATH Google Scholar
Carnot, S.: Reflections on the Motive Power of Fire and Other Papers on the Second Law of Thermodynamics, edited by E. Mendoza. Translated by R.H. Thurston. New York: Dover (1960)
Cercignani, C.: Theory and Application of the Boltzmann Equation. Scottish Academic Press, Edinburgh (1975)
MATH Google Scholar
Cercignani, C.: The Boltzmann Equation and Its Applications. Springer-Verlag, New York (1988)
Book MATH Google Scholar
Cercignani, C.: Ludwig Boltzmann: The Man Who Trusted Atoms. Oxford University Press, New York (1998)
MATH Google Scholar
Cercignani, C., Gerasimenko, V.I., Petrina, D.Y.: Many-Particle Dynamics and Kinetic Equations. Springer, Dordrecht (1997)
Book MATH Google Scholar
Cercignani, C., Illner, R., Pulvirenti, M.: The Mathematical Theory of Dilute Gases. Springer-Verlag, New York (1994)
Book MATH Google Scholar
Cercignani, C., Kremer, G.M.: The Relativistic Boltzmann Equation: Theory and Applications. Springer Basel, Basel (2002)
Book MATH Google Scholar
Cercignani, C., Lampis, M.: On the H-Theorem for Polyatomic Gases. J. Stat. Phys. 26(4), 795–801 (1981)
Article ADS MathSciNet Google Scholar
Cheng, Y.-C.: Macroscopic and Statistical Thermodynamics. Expanded English Edition World Scientific, London (2006)
Google Scholar
Clausius, R.: "Über die bewegende Kraft der Wärme und die Gesetze, welche sich daraus für die Wärmelehre selbst ableiten lassen." Annalen der Physik und Chemie 155: 368–397, 155 (4): 500–524 (1850)
Clausius, R.: “On the Moving Force of Heat, and the Laws Regarding the Nature of Heat itself which are Deducible Therefrom.” Philosophical Magazine. 2 (8): 1–21; Continued at 102–119 in issue 9. This is a translation of Clausius 1850 (1851)
Clausius, R.: Über eine veränderte Form des zweiten Hauptsatzes der mechanischen Wärmetheorie. Annalen der Physik und Chemie 169(12), 481–506 (1854)
Article ADS Google Scholar
Clausius, R.: On the Mean Length of the Paths Described by the Separate Molecules of Gaseous Bodies on the Occurrence of the Molecular Motion: Together with Some Other Remarks Upon the Mechanical Theory of Heat. Phil. Mag. 17(112), 81–91 (1859)
Article Google Scholar
Clausius, R.: “On the Conduction of Heat by Gases.” Philosophical Magazine. 23 (157): 512–534. Cited as: “Conduction” (1862)
Clausius, R.: “On the Application of the Theorem of the Equivalence of Transformations to the Internal Work of a Mass of Matter.” Philosophical Magazine. 24(159): 81–97. Cited as: “Application of the Theorem” (1862)
Clausius, R.: “On the Application of the Theorem of the Equivalence of Transformations to the Internal Work of a Mass of Matter”. Philosophical Magazine. 24(160): 201–213. Part Two of (Clausius Application of the Theorem, 1862) (1862)
Clausius, R.: Abhandlungen über die mechanische Wärmetheorie. Druck and Verlag von Friedrich Vieweg and Son, Braunschweig (1864)
Google Scholar
Clausius, R.: Über verschiedene für die Anwendung bequeme Formen der Hauptgleichungen der mechanischen Wärmetheorie. Annalen der Physik und Chemie 201(7), 353–400 (1865)
Article ADS Google Scholar
Clausius, R.: On the Determination of the Energy and Entropy of a Body. Phil. Mag. 32(213), 1–17 (1866)
Article Google Scholar
Clausius, R.: The Mechanical Theory of Heat with Its Applications to the Steam-Engine and to the Physical Properties of Bodies, edited by T. Archer Hirst with an Introduction by John Tyndall. John Van Voorst and printed by Taylor and Francis, London (1867)
Clausius, R.: “On the Reduction of the Second Axiom of the Mechanical Theory of Heat to General Mechanical Principles”. Philosophical Magazine. 42(279): 161–181. Cited as: “Reduction of the Second Law” (1871)
Clausius, R.: "Bemerkungen zu der Prioritätsreclamation des Hrn. Boltzmann." Annalen der Physik und Chemie 220(10): 265–280. Cited as: “Remarks on the Priority Claim of Mr. Boltzmann” (1871)
Clausius, R.: A Contribution to the History of the Mechanical Theory of Heat. Phil. Mag. 43(284), 106–115 (1872)
Article Google Scholar
Clausius, R.: The Mechanical Theory of Heat, translated by Walter R. Browne. London: Macmillan and Company (1879)
Cohen, E.G.D., Thirring, W. (eds.): The Boltzmann Equation: Theory and Applications. Springer Verlag, Vienna (1973)
Google Scholar
Culverwell, E.P.: Note on Boltzmann’s Kinetic Theory of Gases, and on Sir W. Thomson’s Address to Section A, British Association, 1884. Phil. Mag. 30(182), 95–99 (1890)
Article MATH Google Scholar
Darrigol, O.: Electrodynamics from Ampère to Einstein. Oxford University Press, New York, NY (2000)
MATH Google Scholar
Darrigol, O.: Atoms, Mechanics, and Probability: Ludwig Boltzmann’s Statistico-Mechanical Writings - An Exegesis. Oxford University Press, Oxford (2018)
Book MATH Google Scholar
Darrigol, O., Renn, J.: The Emergence of Statistical Mechanics. In: Buchwald, J.Z., Fox, R. (eds.) The Oxford Handbook of the History of Physics, pp. 765–788. Oxford University Press, New York (2013)
Google Scholar
Daub, E.E.: Atomism and thermodynamics. Isis 58(3), 292–303 (1967)
Article Google Scholar
Daub, E.E.: Probability and thermodynamics: the reduction of the second law. Isis 60(3), 318–330 (1969)
Article MATH Google Scholar
Daub, E.E.: Maxwell’s demon. Stud. Hist. Philos. Sci.: Part A. 1(3), 213–227 (1970)
Article MATH Google Scholar
Davies, P.C.W.: The Physics of Time Asymmetry. University of California Press, Berkeley (1977)
Google Scholar
Devaney, R.L.: Blowing up singularities in classical mechanical systems. Am. Math. Mon. 89(8), 535–552 (1982)
Article MathSciNet MATH Google Scholar
Dias, P.M.C.: "Will someone say exactly what the H-theorem proves?” A study of Burbury’s Condition A and Maxwell’s Proposition II. Arch. Hist. Exact Sci. 46(4), 341–366 (1994)
Article MathSciNet Google Scholar
Dürr, D., Teufel, S.: Bohmian Mechanics: The Physics and Mathematics of Quantum Theory. Springer-Verlag, Berlin (2009)
MATH Google Scholar
Durrell, M., Kohl, K., Loftus, G.: Essential German Grammar. McGraw-Hill, New York (2002)
Google Scholar
Earley, J.E., Sr.: Some philosophical influences on Ilya Prigogine’s statistical mechanics. Found. Chem. 8, 271–283 (2006)
Article Google Scholar
Ehrenfest, P.: Collected Scientific Papers, edited by M.J. Klein. Amsterdam: North-Holland Publication CO. (1959)
Ehrenfest, P., T. Ehrenfest.: The conceptual foundations of the statistical approach in mechanics. Translated by Michael J. Moravcsik. Mineola: Dover Publications (1990)
Eldridge, J.A.: Experimental test of Maxwell’s distribution law. Phys. Rev. 30, 931–935 (1927)
Article ADS Google Scholar
Everitt, C.W.F.: James Clerk Maxwell: Physicist and Natural Philosopher. Charles Scribner Sons, New York (1975)
Google Scholar
Friedman, M.: Kant—Naturphilosophie—Electromagnetism. In: Brain, R.M., Cohen, R.S., Knudsen, O. (eds.) Hans Christian Ørsted and the Romantic Legacy in Science Boston Studies in the Philosophy of Science, vol. 241, pp. 135–158. Springer, Dordrecht (2007)
Google Scholar
Frigg, R.: A field guide to recent work on the foundations of statistical mechanics. In: Rickles, D. (ed.) The Ashgate Companion to Contemporary Philosophy of Physics, pp. 99–196. Ashgate Publishers, London (2008)
Google Scholar
Frigg, R., Werndl, C.: Entropy: a guide for the perplexed. In: Beisbart, C., Hartmann, S. (eds.) Probabilities in Physics, pp. 115–142. Oxford University Press, New York (2011)
Chapter Google Scholar
Frigg, R., Werndl, C.: Statistical mechanics: a tale of two theories. The Monist. 102(4), 424–438 (2019)
Article Google Scholar
Frigg, R., Werndl, C.: forthcoming. Equilibrium in Boltzmannian statistical mechanics. In: Knox, E. and Wilson, A. (eds.), The Routledge Companion to Philosophy of Physics, New York: Routledge Publishers. (2019). http://www.romanfrigg.org/writings/boltzmann_equ_routledge.pdf.11 Accessed 2019.
Garber, E., S.G. Brush, C.W.F. Everitt.: Kinetic theory and the properties of gases: Maxwell’s work in its nineteenth-century context. In: Maxwell on Molecules and Gases, edited by Elizabeth Garber, Stephen G. Brush and C.W.F. Everitt. Cambridge, MA: MIT Press, 1–63 (1986)
Garber, E., S.G. Brush, C.W.F. Everitt.: Introduction. In: Maxwell on heat and statistical mechanics: on “Avoiding All Personal Enquiries” of molecules, edited by Elizabeth Garber, Stephen G. Brush, and C.W.F. Everitt, 29–103. Cambridge, MA: MIT Press. (1995)
Garber, E.: Subjects great and small: Maxwell on Saturn’s rings and kinetic theory. Philos. Trans. R. Soc. A. 366, 1697–1705 (2008)
Article ADS MathSciNet MATH Google Scholar
Glennan, S.: The New Mechanical Philosophy. Oxford University Press, Oxford (2017)
Book Google Scholar
Glennan, S., Illari, P.M. (eds.): The Routledge Handbook of Mechanisms and Mechanical Philosophy. Routledge, New York (2018)
Google Scholar
Goldstein, S.: Boltzmann’s approach to statistical mechanics. In: Bricmont, J., Dürr, D., Galavotti, M.C., Ghirardi, G., Petruccione, F., Zanghì, N. (eds.) Chance in Physics, pp. 39–54. Springer-Verlag, Berlin, Heidelberg (2001)
Chapter Google Scholar
Goldstein, S., Huse, D.A., Lebowitz, J.L., Sartori, P.: On the nonequilibrium entropy of large and small systems. In: Giacomin, G., Olla, S., Saada, E., Spohn, H., Stoltz, G. (eds.) Stochastic Dynamics Out of Equilibrium, pp. 581–596. Springer, Cham (2019)
Chapter MATH Google Scholar
Goldstein, S., Lebowitz, J.L.: On the (Boltzmann) entropy of nonequilibrium systems. Physica D 193, 53–66 (2004)
Article ADS MathSciNet MATH Google Scholar
Goldstein, S., J.L. Lebowitz, R. Tumulka, N. Zanghì.: Gibbs and Boltzmann entropy in classical and quantum mechanics. (2019) June 2. Accessed November 11, 2019. https://arxiv.org/abs/1903.11870.
Goldstein, S., Tumulka, R., Zanghì, N.: Is the hypothesis about a low entropy initial state of the universe necessary for explaining the arrow of time? Phys. Rev. D 94, 023520 (2016)
Article ADS MathSciNet Google Scholar
Gressman, P.T., Strain, R.M.: Sharp anisotropic estimates for the Boltzmann collision operator and its entropy production. Adv. Math. 227(6), 2349–2384 (2011)
Article MathSciNet MATH Google Scholar
Guthrie, J.: Kinetic theory of gases. Nature 8, 67 (1873)
Article ADS Google Scholar
Hamilton, W.R.: On a General Method in Dynamics by Which the Study of the Motions of All Free Systems of Attracting or Repelling Points is Reduced to the Search and Differentiation of One Central Relation, or Characteristic Function. Philos. Trans. R. Soc., 124: 247–308. In (Hamilton 1940, 103–163) (1834)
Hamilton, W. R.: The Mathematical Papers of Sir William Rowan Hamilton: Volume 2: Dynamics, edited by A.W. Conway and A.J. McConnell. Cambridge: Cambridge University Press. (1940)
Hanel, R., Jizba, P.: Time—energy uncertainty principle for irreversible heat engines. Philos. Trans. R. Soc. A 378(2170), 20190171 (2020)
Article ADS MathSciNet MATH Google Scholar
Hankins, T.L.: Sir William Rowan Hamilton. The Johns Hopkins University Press, Baltimore (1980)
MATH Google Scholar
Harman, P.M.: Energy, Force, and Matter: The Conceptual Development of Nineteenth-Century Physics. Cambridge University Press, Cambridge (1982)
Book Google Scholar
Harman, P.M.: Introduction. In: Harman, P.M. (ed.) The Scientific Letters and Papers of James Clerk Maxwell: Volume I 1846–1862, pp. 1–32. Cambridge University Press, Cambridge (1990)
Google Scholar
Harman, P.M.: The Natural Philosophy of James Clerk Maxwell. Cambridge University Press, New York, NY (1998)
MATH Google Scholar
Heimann, P.M.: Maxwell and the modes of consistent representation. Arch. Hist. Exact Sci. 6, 171–213 (1970)
Article MathSciNet Google Scholar
Herschel, J.: Quetelet on probabilities. In: Essays from the Edinburgh and Quarterly Reviews with Addresses and Other Pieces, 365–465. Longman, Brown, Green, Longmans & Roberts: London (1857)
Holton, G., Brush, S.G.: Physics, the Human Adventure: From Copernicus to Einstein and Beyond. Rutgers University Press, New Brunswick (2006)
Google Scholar
Huang, K.: Statistical Mechanics, 2nd edn. Wiley, Hoboken, NJ (1987)
MATH Google Scholar
Huygens, C.: Treatise on Light. Vol. 34, edited by Robert Maynard Hutchins (Editor in Chief), Mortimer Adler (Associate Editor), translated by Silvanus P. Thompson, 546–619. Chicago: University of Chicago Encyclopædia Britannica, Inc. (1952)
Illner, R., Pulvirenti, M.: Global validity of the Boltzmann equation for two- and three-dimensional rare gas in vacuum: erratum and improved result. Commun. Math. Phys. 121, 143–146 (1989)
Article ADS MATH Google Scholar
Illner, R., Shinbrot, M.: The Boltzmann equation: global existence for a rare gas in an infinite vacuum. Commun. Math. Phys. 95, 217–226 (1984)
Article ADS MathSciNet MATH Google Scholar
Jaynes, E.T.: Papers on Probability, Statistics and Statistical Physic. In: Roger D. Rosenkrantz. D. (eds.) Reidel Publishing Company: Dordrecht (1983)
Jeffreys, H.: Theory of Probability, 3rd edn. Clarendon Press, Oxford (1983)
MATH Google Scholar
Jungnickel, C., R. McCormmach.: Intellectual Mastery of Nature: Theoretical Physics from Ohm to Einstein Volume 2 The Now Mighty Theoretical Physics 1870–1925. University of Chicago Press: Chicago, IL (1986)
Kant, I.: Critique of Pure Reason, translated and edited by Paul Guyer and Allen W. Wood. Cambridge University Press: Cambridge (1998)
Karakostas, V.: On the Brussels school’s arrow of time in quantum theory. Philos. Sci. 63(3), 374–400 (1996)
Article MathSciNet Google Scholar
Keeton, C.: Principles of Astrophysics: Using Gravity and Stellar Physics to Explore the Cosmos. Springer-Verlag, New York (2014)
Book MATH Google Scholar
Kirchhoff, G.:Vorlesungen über die Theorie der Wärme, edited by Max Planck. Leipzig: Druck und Verlag von B.G. Teubner (1894)
Kirchhoff, G.: Abhandlungen über mechanische Wärmetheorie. In: Planck. M (ed.) Verlag von Wilhelm Engelmann: Leipzig (1898)
Klein, M.J.: Gibbs on Clausius. Hist. Stud. Phys. Sci. 1, 127–149 (1969)
Google Scholar
Klein, M.J.: Paul Ehrenfest: Volume 1: The Making of a Theoretical Physicist. Amsterdam: North-Holland Publishing Company. Cited as: “Ehrenfest”. (1970)
Klein, M.J.: Maxwell, His Demon, and the Second Law of Thermodynamics. American Scientist. 58: 84–97. Cited as: “Demon”. (1970)
Klein, Martin J. 1973. "Mechanical Explanation at the End of the Nineteenth Century." Centaurus 17 (1): 58–82. Cited as: “Mechanical”.
Klein, M.J.:. The development of Boltzmann's statistical ideas. In: Cohen E.G.D., Thirring W. The Boltzmann Equation: Theory and Applications, 53–106. Vienna: Springer Verlag. Cited as: “Development” (1973)
Klein, M.J.: Boltzmann, monocycles and mechanical explanation. In: Seeger, R.J., Cohen, R.S. (eds.) Philosophical Foundations of Science, pp. 155–175. D. Reidel Publishing Company, Dordrecht (1974)
Chapter Google Scholar
Klein, S., Nellis, G.: Thermodynamics. Cambridge University Press, New York, NY (2012)
MATH Google Scholar
Knott, C.G.: Life and Scientific Work of Peter Guthrie Tait: Supplementing the Two Volumes of Scientific Papers Published in 1898 and 1900. Cambridge University Press, Cambridge (1911)
MATH Google Scholar
Kosarim, A.V., Smirnov, B.M., Capitelli, M., Celiberto, R., Petrella, G., Laricchiuta, A.: Ionization of Excited Nitrogen Molecules by Electron Impact. Chem. Phys. Lett. 414, 215–221 (2005)
Article ADS Google Scholar
Kox, A.J.: The Correspondence Between Boltzmann and H.A. Lorentz. In: Sexl R., Blackmore J. (eds.) Ludwig Boltzmann: Internationale Tagung anlässlich des 75. Jahrestages seines Todes, 5–8 September 1981 Ausgewählte Abhandlungen, 73–86. Akademische Druck: Graz; Friedr Vieweg & Sohn: Braunschweig/Wiesbaden (1982)
Kox, A.J.: H.A. Lorentz’s contributions to kinetic gas theory. Ann. Sci. 47(6), 591–606 (1990)
Article MathSciNet Google Scholar
Kragh, H.: Quantum Generations: A History of Physics in the Twentieth Century. Princeton University Press, Princeton (1999)
Book MATH Google Scholar
Kremer, G.M.: An Introduction to the Boltzmann Equation and Transport Processes in Gases. Springer-Verlag, Berlin (2010)
Book MATH Google Scholar
Kuhn, T.S.: Black-Body Theory and the Quantum Discontinuity 1894–1912. Clarendon Press, Oxford (1978)
Google Scholar
Ladyman, J., Presnell, S., Short, A.J.: the use of the information-theoretic entropy in thermodynamics. Stud. History Philos. Modern. Phys. 39, 315–324 (2008)
Article ADS MathSciNet MATH Google Scholar
Landau, L.D., E.M. Lifshitz.: Statistical Physics. Third Edition, Revised and Enlarged by E.M. Lifshitz and L.P. Pitaevskii. Translated by J.B. Sykes and J. Kearsley. Burlington, MA: Elsevier. (2005)
Lanford III., Oscar, E.: Time evolution of large classical systems. In: Moser, J. (ed.) Dynamical Systems, Theory and Applications: Lecture Notes in Theoretical Physics 38, pp. 1–111. Springer, Berlin (1975)
Google Scholar
Lanford, III., Oscar, E.: On the derivation of the Boltzmann equation. Astérisque 40, 117–137 (1976)
MathSciNet Google Scholar
Lanford, III., Oscar, E.: The hard sphere gas in the Boltzmann-Grad limit. Physica 106A, 70–76 (1981)
Article ADS Google Scholar
Laurendeau, N.M.: Statistical Thermodynamics: Fundamentals and Applications. Cambridge University Press, Cambridge (2005)
Book MATH Google Scholar
Lebowitz, J.L., Maes, C.: Entropy: a dialogue. In: Greven, A., Keller, G., Warnecke, G. (eds.) Entropy, pp. 269–276. Princeton University Press, New Jersey (2003)
MATH Google Scholar
Lee, W.-R., Finkel’stein, A.M., Michaeli, K., Schwiete, G.: Role of electron-electron collisions for charge and heat transport at intermediate temperatures. Phys Rev Res. 2, 013148 (2020)
Article Google Scholar
Leibniz, G.W.: Philosophical Essays. translated by Roger Ariew and Daniel Garber. Indianapolis, IN: Hackett Publishing Company. (1989)
Leibniz, G.W.: Reflections on the Advancement of True Metaphysics and Particularly on the Nature of Substance Explained by Force. In Philosophical Texts, translated and edited by Richard Woolhouse and Richard S. Francks. 139-142 Oxford: Oxford University Press. (1998)
Loewer, B.: Why there is anything except physics. In: Hohwy, J., Kallestrup, J. (eds.) Being Reduced: New essays on Reduction, Explanation, and Causation, pp. 149–163. Oxford University Press, New York (2008)
Chapter Google Scholar
Lorentz, H.A.: Über das Gleichgewicht der lebendigen Kraft unter Gasmolekülen. Wiener Berichte 95, 115–152 (1887)
Google Scholar
Loschmidt, J.: Der zweite Satz der mechanischen Wärmetheorie. Wiener Berichte 59, 395–418 (1869)
MATH Google Scholar
Loschmidt, J.: Über den Zustand des Wärmegleichgewichtes eines Systems von Körpern mit Rücksicht auf die Schwerkraft. Wiener Berichte 73, 128–142 (1876)
Google Scholar
Maxwell, J.C.: On the Stability of the Motion of Saturn’s Rings: An Essay which Obtained the Adams Prize for the Year 1856, in the University of Cambridge. MacMillan and Co, Cambridge (1859)
Google Scholar
Maxwell, J.C.:. Illustrations of the dynamical theory of gases-part I. On the motions and collisions of perfectly elastic spheres. Philos. Mag., 19: 19–32. SPM1, pp. 377–391. (1860)
Maxwell, J.C.: Illustrations of the dynamical theory of gases-part II. On the process of diffusion of two or more kinds of moving particles among one another. Philos. Mag. 20: 21–37. SPM1, pp. 392–409. (1860)
Maxwell, J.C.: A Dynamical Theory of the Electromagnetic Field. Philos. Trans. R. Soc. 155: 459–512. SPM1, 526–597. (1865)
Maxwell, J.C.: On the dynamical theory of gases. Philos. Trans. R. Soc. 157: 49–88. SPM2, pp. 26–78. (1867)
Maxwell, J.C.: On the final state of a system of molecules in motion subject to forces of any kind. Nature 8: 537–538. SPM2, 351–354. (1873)
Maxwell, J.C.: Tait's "Thermodynamics". Nature 17: 257–259. SPM2, 660–671. (1878)
Maxwell, J.C.: On Boltzmann’s theorem on the average distribution of energy in a system of material points. Trans. Cambrid. Philos. Soc. 12: 547–570. SPM2, 713–741. (1879)
Maxwell, J.C.: Theory of Heat. 10th Edition, with corrections and additions by Lord Rayleigh. London: Longmans, Green, and Company. Originally published in 1871. (1891)
Maxwell, J.C.: Theory of Heat. with corrections and additions by Lord Rayleigh. London: Longmans, Green, and Company. (1902)
Maxwell, J.C.: Letter to Sir George Gabriel Stokes, on 30 May, 1859. In Memoir and Scientific Correspondence of the Late Sir George Gabriel Stokes, Vol. 2, edited by Joseph Larmor, 8-11. Cambridge: Cambridge University Press (1907)
Maxwell, J.C.: Maxwell on Saturn’s Rings, edited by Stephen G. Brush, C.W.F. Everitt, and Elizabeth Garber. Cambridge, MA: MIT Press. (1983)
Maxwell, J.C.: The scientific letters and papers of James Clerk Maxwell: Volume I 1846–1862, edited by P.M. Harman, Cambridge: Cambridge University Press. Cited as: “vol. 1”. (1990)
Maxwell, J.C.: The scientific letters and papers of James Clerk Maxwell: Volume II 1862–1873: Part I 1862–1868, edited by P.M. Harman. Cambridge: Cambridge University Press. Cited as: “vol. 2: part 1”. (2009)
Maxwell, J.C.: The scientific letters and papers of James Clerk Maxwell: Volume II 1862–1873: Part II 1869–1873, edited by P.M. Harman. Cambridge: Cambridge University Press. Cited as: “vol. 2: part 2”. (2009)
McCarthy, I.E., Weigold, E.: Electron-atom ionization. Adv. Atomic Mol. Opt. Phys. 27, 201–244 (1990)
Article ADS Google Scholar
Morgenstern, D.: General existence and uniqueness proof for spatially homogeneous solutions of the Maxwell-Boltzmann equation in the case of Maxwellian molecules. Proc. Natl. Acad. Sci. USA 40(8), 719–721 (1954)
Article ADS MathSciNet MATH Google Scholar
Müller, I., Weiss, W.: Entropy and Energy: A Universal Competition. Springer, Berlin (2005)
MATH Google Scholar
Newton, I.: The principia: mathematical principles of natural philosophy. Translated by I. Bernard Cohen, Anne Whitman and Julia Budenz. Berkeley and Los Angeles, CA: University of California Press. (1999)
Nishida, T., Imai, K.: Global solutions to the initial value problem for the nonlinear Boltzmann equation. Publ. Res. Inst. Math. Sci. 12(1), 229–239 (1976)
Article MathSciNet MATH Google Scholar
Peliti, L.: Statistical mechanics in a nutshell. Translated by Mark Epstein. Princeton: Princeton University Press. (2011)
Penrose, R.: Cycles of Time: An Extraordinary New View of the Universe. Vintage Publishers, New York (2012)
MATH Google Scholar
Planck, M.: Über den Beweis des Maxwell’schen Geschwindigkeitsvertheilungsgesetzes unter Gasmolecülen. Ann. Phys. Chem. 291(5), 220–222 (1895)
Article ADS MATH Google Scholar
Porter, T.M.: The Rise of Statistical Thinking 1820–1900. Princeton University Press, Princeton (1986)
Book Google Scholar
Pourciau, B.: Newton’s interpretation of Newton’s second law. Arch. Hist. Exact Sci. 60, 157–207 (2006)
Article MathSciNet MATH Google Scholar
Prevost, A., Egolf, D.A., Urbach, J.S.: Forcing and velocity correlations in a vibrated Granular Monolayer. Phys. Rev. Lett. 89, 084301 (2002)
Article ADS Google Scholar
Price, H.: Time’s Arrow and Archimedes’ Point: New Directions for the Physics of Time. Oxford University Press, New York (1996)
Google Scholar
Prigogine, I.: Chemical kinetics and dynamics. Ann. NY Acad. Sci. Chemical Explanation: Characteristics, Development, Autonomy. 988: 128–132 (2003)
Prigogine, I., Stengers, I.: Order Out of Chaos: Man’s New Dialogue with Nature. Foreword by Alvin Toffler. New York: Bantam Books (1984)
Rankine, W.: The general law of the transformation of energy. Philos. Mag. 5(30), 106–117 (1853)
Article Google Scholar
Rankine, W.: On the second law of thermodynamics. Philos. Mag. 30(203), 241–245 (1865)
Article Google Scholar
Rennie, R. (ed.): Oxford Dictionary of Physics, 7th edn. Oxford University Press, New York (2015)
Google Scholar
Sachs, R.G.: The Physics of Time Reversal. University of Chicago Press, Chicago (1987)
Google Scholar
Segrè, E.: From Falling Bodies to Radio Waves: Classical Physicists and Their Discoveries. Dover Publications, Mineola (1984)
Google Scholar
Seifert, U.: Stochastic thermodynamics, fluctuation theorems and molecular machines. Rep. Prog. Phys. 75, 126001 (2012)
Article ADS Google Scholar
SISSA. Statphys.sissa.it/wordpress/?page_id=2205
Sklar, L.: Physics and Chance: Philosophical Issues in the Foundations of Statistical Mechanics. Cambridge University Press, New York (1993)
Book Google Scholar
Skyrms, B.: Mill’s conversion: the Herschel connection. Philos. Imprint 18(23), 1–24 (2018)
Google Scholar
Smith, C.: The Science of Energy: A Cultural History of Energy Physics in Victorian Britain. University of Chicago Press, Chicago (1998)
Google Scholar
Smith, C.: Force, energy, and thermodynamics. In The Cambridge History of Science: Volume 5 The Modern Physical and Mathematical Sciences, edited by Mary Jo Nye, 289–310. Cambridge: Cambridge University Press. (2003)
Smith, C., Norton Wise, M.: Energy and Empire: A Biographical Study of Lord Kelvin. Cambridge Univerity Press, Cambridge (1989)
Google Scholar
Spohn, H.: Kinetic equations from Hamiltonian dynamics: Markovian limits. Rev. Mod. Phys. 52, 569–615 (1980)
Article ADS MathSciNet Google Scholar
Spohn, H.: Large Scale Dynamics of Interacting Particles. Springer-Verlag, Berlin (1991)
Book MATH Google Scholar
Stern, O.:. Nachtrag zu meiner Abeit: ‘Eine direkte Messung der thermischen Molekulargeschwindigkeit’. Zeitschrift für Physik 3: 417–421 (1920)
Stern, O.: Eine direkte Messung der thermischen Molekulargeschwindigkeit. Zeischrift für Physik. 2: 49–56 (1920)
Stern, O.: The method of molecular rays. Nobel Lecture. December 12th, 1946. https://www.nobelprize.org/uploads/2018/06/stern-lecture.pdf (1946)
Strutz, H.: 501 German Verbs Fully Conjugated in All the Tenses, 3rd edn. Barron’s Educational Series Inc., New York (1998)
Google Scholar
Succi, S., Karlin, I.V., Chen, H.: Colloquium: role of the H theorem in lattice Boltzmann hydrodynamic simulations. Rev. Mod. Phys. 74, 1203 (2002)
Article ADS Google Scholar
Taylor, J.R.: Classical Mechanics. University Science Books, Mill Valley (2005)
MATH Google Scholar
Terrell, P., Schnorr, V., Smith, W.V.A., Breitsprecher, R.: Harper Collins German Unabridged Dictionary, 5th edn. HarperCollins Publishers Inc., New York (2004)
Google Scholar
Thomson, W. (Lord Kelvin): On a universal tendency in nature to the dissipation of mechanical energy. Philos. Mag. 4 (25): 304–306 (1852)
Thomson, W. (Lord Kelvin): The kinetic theory of the dissipation of energy. Nature, 9: 441–444. (1874)
Thorne, K.S., Blandford, R.D.: Modern Classical Physics: Optics, Fluids, Plasmas, Elasticity, Relativity, and Statistical Physics. Princeton University Press, Princeton, NJ (2017)
MATH Google Scholar
Toennies, J.P., Schmidt-Böcking, H., Friedrich, B., Lower, J.C.A.: Otto Stern (1888–1969): the founding father of experimental atomic physics. Ann. Phys. 523(12), 1045–1070 (2011)
Article MATH Google Scholar
Tolman, R.C.: The Principles of Statistical Mechanics. Dover Publications, New York (1979)
MATH Google Scholar
Truesdell, C.: The Tragicomical History of Thermodynamics 1822–1854. Springer-Verlag, New York (1980)
Book MATH Google Scholar
Uffink, J.: Compendium of the foundations of classical statistical physics. In: Butterfield, J., Earman, J. (eds.) Handbook of the Philosophy of Science: Philosophy of Physics Part B, pp. 923–1074. Elsevier, Amsterdam (2007)
Google Scholar
Uffink, J.: Boltzmann's work in statistical physics, The Stanford Encyclopedia of Philosophy (Spring 2017 Edition), Edward N. Zalta (ed.), https://plato.stanford.edu/archives/spr2017/entries/statphys-Boltzmann/.
Uffink, J., Valente, G.: Time's arrow and Lanford's theorem. Séminaire Poincaré XV Le Temps 141–173 (2010)
Uhlenbeck, G.E., Ford, G.W.: Lectures in statistical mechanics. With an appendix on Quantum Statistics of Interacting Particles by E.W. Montroll. Providence, RI: American Mathematical Society (1963)
Villani, C.: A review of mathematical topics in collisional kinetic theory. In: Friedlander, S., Serre, D. (eds.) Handbook of Mathematical Fluid Dynamics, vol. 1, pp. 71–305. Elsevier Science, Amsterdam (2002)
Chapter MATH Google Scholar
Villani, C.: Entropy production and convergence to equilibrium. In: Golse, F., Olla, S. (eds.) Entropy Methods for the Boltzmann Equation: Lecture Notes in Mathematics, pp. 1–70. Springer-Verlag, Berlin (2008)
Google Scholar
Villani, C.: H-theorem and beyond: Boltzmann’s entropy in today’s mathematics. November 15. Accessed 11/10/2019. https://www.math-berlin.de/images/stories/villani-boltzmann-talk.pdf Cited as: “Math Berlin”. (2010)
Villani, C.: Entropy and H theorem: the mathematical legacy of Ludwig Boltzmann. Lecture given at Cambridge University in 2010. https://www.youtube.com/watch?v=dx8Rkek5KrE https://www.newton.ac.uk/seminar/20101115170018004 Cited as: “Lecture”.(2010)
Villani, C.: Entropy and H theorem: the mathematical legacy of Ludwig Boltzmann. Lecture Notes. Accessed 01/26/2020. https://www.newton.ac.uk/files/seminar/20101115170018004-152633.pdf Cited as: “Lecture Notes” (2010)
von Plato, Jan.: Creating modern probability. Cambridge University Press: Cambridge (1994)
Watson, H.W.: A Treatise on the Kinetic Theory of Gases, 2nd edn. Clarendon Press, Oxford (1893)
MATH Google Scholar
Weaver, C.G.: Fundamental Causation: Physics, Metaphysics, and the Deep Structure of the World. Routledge Publishers, New York (2019)
Google Scholar
Whewell, W.: The Philosophy of the Inducive Sciences, Founded Upon Their History:, vol. 2. John W. Parker, London (1840)
MATH Google Scholar
White, R.D., Robson, R.E., Dujko, S., Nicoletopoulos, P., Li, B.: Recent advances in the application of Boltzmann equation and fluid equation methods to charged particle transport in non-equilibrium plasmas. J. Phys. D 42, 194001 (2009)
Article ADS Google Scholar
Wilson, J.D.: College Physics. Prentice Hall, Upper Saddle River (1994)
Google Scholar
Wilson, M.: Wandering Significance: An Essay on Conceptual Behavior. Oxford University Press, Oxford (2006)
Book Google Scholar
Wilson, M.: What is “Classical Mechanics” anyway? In: Batterman, R. (ed.) The Oxford Handbook of the Philosophy of Physics, pp. 43–106. Oxford University Press, New York (2013)
Google Scholar
Wilson, M.: Physics Avoidance: Essays in Conceptual Strategy. Oxford University Press, Oxford (2017)
Book Google Scholar
Wood, P.R.: On the entropy of mixing, with particular reference to its effect on dredge-up during helium shell flashes. Astrophys. J. 248, 311–314 (1981)
Article ADS Google Scholar
Zabell, S.L.: Symmetry and its Discontents: Essays on the History of Inductive Probability. Cambridge University Press, New York (2005)
Book MATH Google Scholar

Download references

Acknowledgments

I thank Olivier Darrigol and Matthew Stanley for their comments on an earlier draft of this paper. I thank Jochen Bojanowski for a little translation help. I presented a version of the paper at the NY/NJ (Metro Area) Philosophy of Science group meeting at NYU in November of 2019. I’d like to especially thank Barry Loewer, Tim Maudlin, and David Albert for their criticisms at that event. Let me extend special thanks to Tim Maudlin for some helpful correspondence on various issues addressed in this paper. While Professor Maudlin and I still disagree, that correspondence was helpful. I also presented an earlier draft of this work to the Department of Physics at the University of Illinois at Urbana-Champaign in January 2020. I thank many of the physics faculty and graduate students for their questions and objections. A comment from Nigel D. Goldenfeld as well as a challenging question from Michael B. Weissman benefited the final product. Finally, I’d like to express an additional special thanks to Olivier Darrigol with whom I corresponded on various issues in Boltzmann scholarship as well as on numerous points made in this paper. I owe a great debt to him for that enlightening correspondence and for his Atoms, Mechanics, and Probability (OUP 2018) from which I learned so much.

Funding

None.

Author information

Authors and Affiliations

Department of Philosophy, University of Illinois at Urbana-Champaign, 200 Gregory Hall, 810 South Wright ST, MC-468, Urbana, IL, 61801, USA
Christopher Gregory Weaver
Department of Physics and Core Faculty in the Illinois Center for Advanced Studies of the Universe, University of Illinois at Urbana-Champaign, 1110 West Green ST, Urbana, IL, 61801, USA
Christopher Gregory Weaver
Center for Philosophy of Science, University of Pittsburgh, 1117 Cathedral of Learning, 4200 Fifth Ave, Pittsburgh, PA, USA
Christopher Gregory Weaver

Authors

Christopher Gregory Weaver
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Christopher Gregory Weaver.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Appendix 1: The Second Law of Thermodynamics in Boltzmannian Statistical Mechanics

(The Second Law of Thermodynamics (SL)): Necessarily, [with respect to “an arbitrary instant t = t₁” and a statistical mechanical system (SYS) at t₁, if SYS’s “Boltzmann entropy…at that time, S_B(t₁), is far below its maximum value”, it will be “highly probable that at any later time t₂” (t₂ > t₁), “we have S_B(t₂) $>$ S_B(t₁)”] and necessarily, [if SYS is at an arbitrary time t₁ in thermal equilibrium, then it will be “highly probable that at any later time t₂” (t₂ > t₁) we have S_B(t₂) $=$ S_B(t₁)].^{Footnote 207}

Appendix 2: Lanford’s Project and the Chaos Asymmetry Problem

Here I’m in broad agreement and am indebted to [228].

Oscar Lanford III realized that in order to solve what I have called the Chaos Asymmetry Problem (CAP) he needed a Hypothesis of Molecular Chaos (HMC) that outstrips the factorization condition used in his result. Thus, I believe that Lanford’s work supports the view that the HMC is not represented by the factorization condition needed for his famous theorem. This supports my judgment that there really is a No Mathematics Problem (NMP).

Consider:

When Lanford derived the Boltzmann equation from classical Hamiltonian mechanics for the Boltzmann-Grad limit and for a rarefied gas approximated by hard spheres, he assumed a factorization condition not unlike that which is stated in footnote 175.^{Footnote 209} However, Lanford perceived that there was something more lurking beneath his time-reversal invariant theorem that supports the time-asymmetric Boltzmann equation and helps represent irreversible entropic increase or equilibration governed by the inequality: $\frac{dH}{dt}\le 0$. We witness irreversible evolutions. We measure non-equilibrium systems and reliably track their march toward equilibrium over time. To save the phenomena, we have to ensure that we secure and use the Boltzmann equation and not the anti-Boltzmann equation (which is the Boltzmann equation with the sign of the relevant collision integral flipped). These two equations are demonstrably inequivalent ([228], pp. 167–168). To acquire the Boltzmann equation, one can use Lanford’s theorem, but one must assume that collision point configurations are incoming and not outgoing (ibid.). Incoming configurations determine a positive collision operator, while outgoing configurations yield the same operator with its sign flipped. It has been shown that even if one applies the time-reversal operation to incoming configurations or representations one does not obtain configurations equivalent to outgoing configurations (ibid., 172, proposition 5). Thus, there is something deeply irreversible obtained by Lanford’s project and the factorization condition is insensitive to it because that condition says nothing about which set of representations or configurations one should choose. The factorization condition works equally well with incoming or outgoing collision phase point representations ([158], p. 88). That is why Lanford himself “consistently stressed that mere factorization is not in itself the explanation of irreversibility”.^{Footnote 210} And that is why Lanford maintained that the:

…inequality $\frac{dH}{dt}\le 0$ shows that the reversibility of the underlying molecular dynamics has been lost in passing to the Boltzmann equation. The irreversibility must have been introduced in the Hypothesis of Molecular Chaos since the rest of the derivation was straightforward mechanics. Indeed, it is not hard to see directly that the Hypothesis of Molecular Chaos is asymmetric in time…One conclusion which must be drawn is that something more is involved in the Hypothesis of Molecular Chaos than simple statistical independence.^{Footnote 211}

The HMC was something beyond the factorization condition, for the factorization condition is itself time-symmetric.

If one focuses on the beautiful mathematical result that is Lanford’s theorem alone one will be unable to save the phenomenon that is irreversible thermodynamic system evolution even if in the appropriate limit. For Lanford, the closest mathematical model of what we seek to save comes not from his theorem but from Boltzmann’s.

None of this [Lanford’s theorem etc.], however, really implies that irreversible behavior must occur in the limiting regime; it merely makes this behavior plausible. For a really compelling argument in favor of irreversibility, it seems to be necessary to rely on some version of Boltzmann’s original proof of the H-theorem.^{Footnote 212}

But as I noted, Boltzmann’s H-theorem requires the HMC as I have presented it. Thus, we may conjoin to the conclusion that (a) Lanford’s project remains burdened by the No Mathematics problem the further conclusion that (b) it cannot meet the Chaos Asymmetry Problem. Uffink and Valente [228] (and it seems Lanford [158, 160]) agree with (b), while agreement with (a) can be found in [230] and perhaps ([212], p. 76).^{Footnote 213}

Rights and permissions

Reprints and permissions

About this article

Cite this article

Weaver, C.G. In Praise of Clausius Entropy: Reassessing the Foundations of Boltzmannian Statistical Mechanics. Found Phys 51, 59 (2021). https://doi.org/10.1007/s10701-021-00437-w

Download citation

Received: 06 October 2020
Accepted: 30 January 2021
Published: 17 May 2021
DOI: https://doi.org/10.1007/s10701-021-00437-w

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

In Praise of Clausius Entropy: Reassessing the Foundations of Boltzmannian Statistical Mechanics

Abstract

Similar content being viewed by others

On the Statistical Viewpoint Concerning the Second Law of Thermodynamics—A Reminder on the Ehrenfests’ Urn Model

Einstein’s Approach to Statistical Mechanics: The 1902–04 Papers

Thermodynamics is more powerful than the role to it reserved by Boltzmann-Gibbs statistical mechanics

1 Introduction

2 The Maxwell Distribution

3 The Maxwell–Boltzmann Distribution

4 die Fundamentalgleichung

5 The H-Theorem

5.1 Interpreting the H-Theorem: Collisions and Causation

5.2 Interpreting the H-Theorem: Applications and Exceptions

5.2.1 Maxwell’s (But Really Thomson’s) Demon and Loschmidt’s Exorcism

5.2.2 The Reversibility Objection Before Loschmidt

5.3 Interpreting the H-Theorem: \(-\mathbf{H}\) and Entropy

6 Clausius Entropy

6.1 The Clausius Integral and Entropy

6.2 The Complete Nature of Clausius Entropy

6.3 Boltzmann and the Clausius Entropy

7 The Probabilistic Interpretation of the H-Theorem

7.1 Loschmidt’s Reversibility Objection: Articulated

7.1.1 The H-Theorem Untarnished in Boltzmann’s Eyes

8 The Reversibility Paradox Answered

8.1 The Hypothesis of Molecular Chaos

8.2 The Solution at Long Last

8.2.1 Solving the Chaos Asymmetry and No Mathematics Problems

8.2.2 Solving the Reversibility Paradox

9 Conclusion

Notes

Abbreviations

References

Acknowledgments

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

Appendices

Appendix 1: The Second Law of Thermodynamics in Boltzmannian Statistical Mechanics

Appendix 2: Lanford’s Project and the Chaos Asymmetry Problem

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation