Nature Abhors a Vacuum: A Simple Rigorous Example of Thermalization in an Isolated Macroscopic Quantum System

Shiraishi, Naoto; Tasaki, Hal

doi:10.1007/s10955-024-03289-6


Published: 07 July 2024

Volume 191, article number 82, (2024)

Journal of Statistical Physics


Abstract

We show, without relying on any unproven assumptions, that a low-density free fermion chain exhibits thermalization in the following (restricted) sense. We choose the initial state as a pure state drawn randomly from the Hilbert space in which all particles are in half of the chain. This represents a nonequilibrium state such that the half chain containing all particles is in equilibrium at infinite temperature, and the other half chain is a vacuum. We let the system evolve according to the unitary time evolution determined by the Hamiltonian and, at a sufficiently large typical time, measure the particle number in an arbitrary macroscopic region in the chain. In this setup, it is proved that the measured number is close to the equilibrium value with probability very close to one. Our result establishes the presence of thermalization in a concrete model in a mathematically rigorous manner. The key to the proof is a new strategy to show that a randomly generated nonequilibrium initial state typically has a large enough effective dimension by using only mild verifiable assumptions. In the present work, we first give a general proof of thermalization based on two assumptions, namely, the absence of degeneracy in energy eigenvalues and a property about the particle distribution in energy eigenstates. We then justify these assumptions in a concrete free-fermion model, where the absence of degeneracy is established by using number-theoretic results. This means that our general result also applies to any lattice gas model in which the above two assumptions are justified. To confirm the potential wide applicability of our theory, we discuss some other models for which the essential assumption about the particle distribution is easily verified, and some non-random initial states whose effective dimensions are sufficiently large.


1 Introduction

Whether the unitary time evolution in an isolated macroscopic quantum system can describe the phenomenon of thermalization or, equivalently, the approach to thermal equilibrium is an essential question in the foundation of statistical mechanics. Since there are several different formulations of thermalization, we shall first make clear what we precisely mean by thermalization in the present work. Consider a many-body quantum system with Hamiltonian $\hat{H}$ and take a pure initial state $|\Phi (0)\rangle $ in which energy is sharply distributed around some value E. We say that the system with this initial state thermalizes if the measurement result of any macroscopic observable $\hat{A}$ in the time-evolved state $|\Phi (t)\rangle $ after sufficiently long and typical time $t>0$ is indistinguishable (with probability very close to one) from the microcanonical average $\langle \hat{A}\rangle ^\textrm{MC}_E$. Note that we are dealing with the outcome of a single quantum mechanical measurement of $\hat{A}$ in the state $|\Phi (t)\rangle $ rather than the quantum mechanical expectation value $\langle \Phi (t)|\hat{A}|\Phi (t)\rangle $ or any other averaged quantities. Therefore, thermalization formulated in this manner guarantees that the result of a single experiment at a sufficiently later time is predicted precisely by equilibrium statistical mechanics.^{Footnote 1}

Our ultimate goal is to rigorously establish the presence of thermalization in the above strong form in a realistic macroscopic quantum system with a realistic nonequilibrium initial state. But this seems to be a formidably difficult problem for the moment. In the present paper, we report a partial result toward the goal, namely, a complete proof that a low-density free fermion chain exhibits thermalization in the above sense but for a restricted class of observables [1].

The study of thermalization in isolated macroscopic quantum systems goes back to the early days of quantum mechanics [2], but considerable progress has been made in the present century partly motivated by modern ultracold atom experiments [3,4,5,6,7]. It is now a general consensus that a sufficiently complex many-body quantum system has the ability to thermalize only by the unitary time evolution [8, 9].

An important theoretical concept in the study of thermalization is the energy eigenstate thermalization hypothesis (ETH). It was first introduced (implicitly) by von Neumann in 1929 [2, 10] as an essential assumption for his quantum ergodic theorem. See [8, 11] for the relation between von Neumann’s ETH and the modern version of ETH proposed in [12, 13]. Another key theoretical concept is a large effective dimension of the initial state. It was first pointed out by Tasaki in 1998 [14] (without explicitly introducing the notion of the effective dimension) that one can show the presence of equilibration if the effective dimension is large enough. It is known that one can prove the presence of thermalization by assuming either (i) some (strong) form of ETH [2, 10, 15,16,17], (ii) some form of ETH and a large enough effective dimension [14, 18,19,20,21], or (iii) an effective dimension almost as large as the total dimension [17, 22].

It is strongly believed that the assumptions in the above scenarios (i), (ii), and (iii) are satisfied in a large class of sufficiently complex quantum systems and their realistic (nonequilibrium) initial states. However, it is extremely difficult, if not impossible, to justify the assumptions rigorously for concrete models. As far as we know, there have been no concrete and nontrivial examples of quantum systems with short-range interaction in which the presence of thermalization was justified according to these scenarios without relying on any unproven assumptions. We note that an example based on a different mechanism is discussed in [23].

It is interesting, on the other hand, that there have been many examples of quantum systems in which the absence of thermalization was rigorously established. A well-known example is an integrable system, where the system relaxes not to the equilibrium state but to a state corresponding to an ensemble characterized by its local integrals of motion. The absence of thermalization in such systems with local integrals of motion is an old established property [24, 25], and has recently been studied in detail in terms of the generalized Gibbs ensemble [26, 27]. Another example is a system with many-body localization: A spin system with many-body localization has random interactions or a random magnetic field, and this randomness prohibits its thermalization as in the case of the Anderson localization [28,29,30,31,32,33]. Recently, a more exotic system was found where most initial states thermalize while some do not. This phenomenon was first observed in experiments of cold atoms [34], and independently of this experiment, a general theoretical framework covering such phenomena was proposed [35, 36]. Later, such phenomena were named quantum many-body scar states, and have attracted interest across a broad range of research fields [37,38,39,40,41,42,43]. Furthermore, it has even been shown that the problem of thermalization is, in general, undecidable [44].

The goal of this paper is to present a nontrivial and rigorous concrete example of thermalization (in a restricted sense) that does not rely on any unproven assumptions. We first develop a general theory of unitary time evolution in a low-density lattice gas that satisfies two crucial assumptions, and establish the presence of thermalization with respect to the number operator for any macroscopic region, assuming that the initial nonequilibrium state is generated randomly. The derivation is based on the above-mentioned scenario (iii), which requires the effective dimension of the initial state to be almost as large as the total Hilbert space dimension. We then prove that the two assumptions are indeed satisfied in the simplest model, namely, the free fermion chain with suitable parameters. Although a free fermion system does not exhibit full-fledged thermalization, i.e., the approach to thermal equilibrium from an arbitrary nonequilibrium state (with almost fixed energy), it does thermalize in our setting where the initial state is sufficiently complex. We should note that we are here using the notion of thermalization in a phenomenological manner, in the sense that we focus only on macroscopically observable features and do not pay attention to microscopic mechanisms. More precisely, thermalization in our example is essentially indistinguishable from that observed in a realistic gas, provided that a macroscopic observer measures only the density of particles in a given region (and the coarse-grained momentum distribution).^{Footnote 2} See Sect. 4 for a related discussion, and [45,46,47] for detailed numerical studies of closely related problems.

It is important to note, however, that our general theory should apply to non-integrable models as well, in which one expects full-fledged thermalization to take place. In fact, the key assumption in our theory is about the particle distribution in energy eigenstates, which may be regarded as a very restrictive form of ETH. The other assumption is the absence of degeneracy in the energy spectrum of the model, which appears highly natural and plausible in complex many-body systems. Interestingly, if we assume the absence of degeneracy, we can justify the first assumption about the particle distribution for a wider class of lattice gas models, including interacting ones. It is an intriguing problem whether one can find non-integrable models in which our assumptions can be fully justified.

Before going into details of our theory, let us state precisely what we can prove for free fermion chains. Consider a system of N fermions on the chain $\{1,\ldots ,L\}$, where we fix the density $\rho =N/L$ and make N and L large. We take the standard Hamiltonian with uniform nearest-neighbor hopping

$$\begin{aligned} \hat{H}=\sum _{x=1}^L\bigl \{e^{i\theta }\,\hat{c}^\dagger _x\hat{c}_{x+1}+e^{-i\theta }\,\hat{c}^\dagger _{x+1}\hat{c}_x\bigr \}, \end{aligned}$$

(1.1)

where the phase $\theta \in \mathbb {R}$ is introduced (artificially) to break the reflection symmetry. See Sect. 3.1 for notations and details. In the most standard model with $\theta =0$, most energy eigenvalues are degenerate because of the reflection symmetry (which brings the wave number k to $-k$). It is likely that the degeneracies are lifted by a nonzero phase $\theta $. We assume that the parameters are properly chosen so that all the energy eigenvalues of $\hat{H}$ are nondegenerate. In fact, we prove in Sect. 3.2 that the model is free from degeneracy under some conditions. For example, it suffices to set $\theta =(4N+2L)^{-(L-1)/2}$ provided that L is an odd prime.
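As a small numerical illustration of the role of $\theta $ (not part of the proof in Sect. 3.2), the following Python sketch lists all N-fermion energies of a tiny chain and reports the smallest level spacing. It assumes periodic boundary conditions, so that the single-particle energies are $2\cos (2\pi n/L+\theta )$ with $n=0,\ldots ,L-1$; the function names and the values $L=7$, $N=2$, $\theta =0.1$ are hypothetical choices made only for this example.

```python
import math
from itertools import combinations

def single_particle_energies(L, theta):
    # Assumed plane-wave spectrum on a periodic chain: 2 cos(2 pi n / L + theta).
    return [2.0 * math.cos(2.0 * math.pi * n / L + theta) for n in range(L)]

def many_body_energies(L, N, theta):
    # Each N-fermion eigenstate fills N distinct modes; its energy is the sum.
    eps = single_particle_energies(L, theta)
    return sorted(sum(eps[n] for n in modes) for modes in combinations(range(L), N))

def min_level_spacing(L, N, theta):
    # Smallest gap between adjacent sorted levels; a value near 0 signals degeneracy.
    levels = many_body_energies(L, N, theta)
    return min(b - a for a, b in zip(levels, levels[1:]))

gap_symmetric = min_level_spacing(7, 2, 0.0)  # reflection-symmetric case
gap_twisted = min_level_spacing(7, 2, 0.1)    # generic nonzero phase
print(gap_symmetric, gap_twisted)
```

With $\theta =0$ the spacing vanishes up to rounding, because the modes n and $L-n$ have equal energy, whereas the nonzero phase opens a visible gap in this small example. A floating-point check of this kind can only suggest the absence of degeneracy; it does not replace the number-theoretic argument.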

We choose the initial state $|\Phi (0)\rangle $ randomly from the subspace of states in which all fermions are in the half-chain $\{1,\ldots ,(L-1)/2\}$. This corresponds to the infinite temperature equilibrium state confined in the half-chain. We then denote by $|\Phi (t)\rangle =e^{-i\hat{H}t}|\Phi (0)\rangle $ the state at time $t>0$. We let $\hat{N}_\textrm{left}$ be the operator that counts the number of fermions on the half-chain $\{1,\ldots ,(L-1)/2\}$. Then, our main result is as follows:

Theorem 1.1

When N (or L) is sufficiently large and $\rho =N/L\le 1/5$, the following is true with probability larger than $1-e^{-(\rho /3)N}$ (where the probability is that for the choice of $|\Phi (0)\rangle $). There exists a sufficiently long time $T>0$ and a subset (a collection of intervals) $G\subset [0,T]$ with $\mu (G)/T\ge 1-e^{-(\rho /4)N}$ (where $\mu (G)$ denotes the total length of the intervals in G) such that if one measures $\hat{N}_\textrm{left}$ in $|\Phi (t)\rangle =e^{-i\hat{H}t}|\Phi (0)\rangle $ at any $t\in G$ the measurement result $N_\textrm{left}$ satisfies

$$\begin{aligned} \Bigl |\frac{N_\textrm{left}}{N}-\frac{1}{2}\Bigr |\le \varepsilon _0(\rho ), \end{aligned}$$

(1.2)

with probability larger than $1-e^{-(\rho /4)N}$ (where the probability is that for quantum measurement). Here we set $\varepsilon _0(\rho )=\sqrt{\frac{3}{2}\rho }$.

The factors $e^{-(\rho /3)N}$ and $e^{-(\rho /4)N}$ are essentially negligible if $\rho N\gg 1$. Then the theorem states that it almost certainly happens that the measurement result of $\hat{N}_\textrm{left}/N$ at a sufficiently large and typical time is close to its equilibrium value 1/2 with precision $\varepsilon _0(\rho )$. Since the measurement result of $\hat{N}_\textrm{left}/N$ is 1 in the initial state $|\Phi (0)\rangle $, this establishes an irreversible behavior (or the approach to thermal equilibrium) with respect to the observable $\hat{N}_\textrm{left}$. We should note that our result is not limited to the single observable $\hat{N}_\textrm{left}$. In fact, the main theorem, Theorem 2.4, is stated for the number operator for any macroscopic region. As we have already stressed, it is crucial that we are dealing with the result of a single projective measurement of $\hat{N}_\textrm{left}$ in the state $|\Phi (t)\rangle $, rather than its quantum mechanical average $\langle \Phi (t)|\hat{N}_\textrm{left}|\Phi (t)\rangle $.

We must note, however, that the precision $\varepsilon _0(\rho )$ in (1.2) is a function of the density $\rho $, and may not be small. One needs to consider a system with low density in order to have high precision. For example, $\rho \simeq 10^{-4}$ for $\varepsilon _0\simeq 10^{-2}$, or $\rho \simeq 0.04$ for $\varepsilon _0=1/4$. This density-dependence of the precision $\varepsilon _0(\rho )$ is a major shortcoming of the present theory, which reflects our strategy to base the theory only on mild verifiable assumptions. We nevertheless stress that our theorem establishes thermalization in a certain sense without relying on any unproven assumptions.
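The numbers quoted above can be reproduced with a few lines of Python. The sketch below only evaluates the formulas stated in Theorem 1.1; the function names and the sample value $N=10^4$ are hypothetical and chosen for illustration.

```python
import math

def precision_half_chain(rho):
    # epsilon_0(rho) = sqrt(3 rho / 2), the precision in Theorem 1.1.
    return math.sqrt(1.5 * rho)

def failure_factors(rho, N):
    # The two exponentially small factors exp(-(rho/3) N) and exp(-(rho/4) N).
    return math.exp(-(rho / 3.0) * N), math.exp(-(rho / 4.0) * N)

for rho in (1e-4, 0.04, 0.2):
    print(rho, precision_half_chain(rho))

print(failure_factors(0.04, 10**4))
```

For $\rho =10^{-4}$ the precision is about $1.2\times 10^{-2}$ and for $\rho =0.04$ it is about 0.245, consistent with the rounded values in the text. At the largest allowed density $\rho =1/5$ the precision is roughly 0.55, so the bound (1.2) is then uninformative.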

The present paper is organized as follows. In Sect. 2, we state our main thermalization theorem for a general lattice gas satisfying two assumptions, namely, Assumptions 2.1 and 2.2. Then in Sect. 3, we prove these two assumptions are indeed satisfied in free fermion chains with suitable parameters.

In Appendix A, we discuss the extension of our general theory to a model in which the energy spectrum is moderately degenerate. In Appendix B, we present two classes of models (one of which includes non-integrable models) in which we can justify Assumption 2.2 about the particle distribution in energy eigenstates, assuming that the energy eigenvalues are nondegenerate. We stress that Assumption 2.2 is indeed an essential nontrivial assumption in our theory. In Appendix C, we present some concrete estimates of the effective dimensions of some non-random initial states in the free fermion chain, and with the help of this estimate, we prove that some non-random initial states indeed thermalize. Finally, in Appendix D, we briefly discuss a possible extension of our result to finite temperature states.

2 General Results

Here we describe general systems of lattice gas, state necessary assumptions, and prove the main low-density thermalization theorem. The new observation about the effective dimension is summarized in Theorem 2.3.

2.1 Setting and Main Assumptions

Let $\Lambda $ be a lattice with L sites, and consider a system of N fermions on $\Lambda $.^{Footnote 3} A typical example is the chain $\Lambda =\{1,\ldots ,L\}$. We take the thermodynamic convention (except in Appendix C), in which we fix the density $\rho $, choose L and N such that $N/L\simeq \rho $, and make L and N sufficiently large. Our results are meaningful in the low-density regime, where $\rho $ is sufficiently small.

Let $\mathcal{H}_\textrm{tot}$ be the Hilbert space of the system with N particles on the lattice $\Lambda $. The dimension $D_\textrm{tot}$ of $\mathcal{H}_\textrm{tot}$ is given by

$$\begin{aligned} D_\textrm{tot}=\left( {\begin{array}{c}L\\ N\end{array}}\right) \sim e^{L\,S(\rho )}, \end{aligned}$$

(2.1)

where the relation $F(L)\sim G(L)$ means

$$\begin{aligned} \lim _{L\uparrow \infty }\frac{1}{L}\log \frac{F(L)}{G(L)}=0, \end{aligned}$$

(2.2)

and

$$\begin{aligned} S(p)=-p\log p-(1-p)\log (1-p), \end{aligned}$$

(2.3)

is the binomial entropy. The final expression in (2.1) comes from the Stirling formula.
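The asymptotic relation in (2.1) can be checked numerically. The following Python sketch compares $(1/L)\log D_\textrm{tot}$ with $S(\rho )$ for growing L; the helper names and the density $\rho =0.1$ are hypothetical, and the log-gamma function is used only to avoid forming huge binomial coefficients.

```python
import math

def binary_entropy(p):
    # S(p) = -p log p - (1 - p) log(1 - p), natural logarithm, as in (2.3).
    if p in (0.0, 1.0):
        return 0.0
    return -p * math.log(p) - (1.0 - p) * math.log(1.0 - p)

def log_binomial(n, k):
    # Logarithm of the binomial coefficient via log-gamma, safe for large n.
    return math.lgamma(n + 1) - math.lgamma(k + 1) - math.lgamma(n - k + 1)

def entropy_gap(L, rho):
    # (1/L) log D_tot minus S(N/L); should shrink as L grows.
    N = round(rho * L)
    return log_binomial(L, N) / L - binary_entropy(N / L)

gaps = [entropy_gap(L, 0.1) for L in (100, 1000, 10000)]
print(gaps)
```

The difference is negative and decays roughly like $(\log L)/L$, which is the subexponential correction that the relation $\sim $ defined in (2.2) ignores.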

We decompose the lattice $\Lambda $ disjointly into two parts as $\Lambda =\Lambda _1\cup \Lambda _2$, where $|\Lambda _1|=(L-1)/2$ and $|\Lambda _2|=(L+1)/2$ when L is odd, and $|\Lambda _1|=|\Lambda _2|=L/2$ when L is even. Throughout the present paper, we denote by |S| the number of elements in a set S. Let $\mathcal{H}_1$ denote the nonequilibrium subspace where all particles are in the sublattice $\Lambda _1$ and hence $\Lambda _2$ is empty. The dimension of $\mathcal{H}_1$ is

$$\begin{aligned} D_1=\left( {\begin{array}{c}\frac{L-1}{2}\\ N\end{array}}\right) \sim e^{(L/2)S(2\rho )}, \end{aligned}$$

(2.4)

where we assumed L is odd (but the result is essentially the same for even L). We denote by $\hat{P}_1$ the projection onto the subspace $\mathcal{H}_1$.

Let $\hat{H}$ be the Hamiltonian of the system. We assume that $\hat{H}$ preserves the particle number, and denote by $|\Psi _j\rangle \in \mathcal{H}_\textrm{tot}$ with $j=1,\ldots ,D_\textrm{tot}$ its normalized eigenstate (with N particles) corresponding to the energy eigenvalue $E_j$. We make two essential assumptions about energy eigenvalues and eigenstates.

Assumption 2.1

The energy eigenvalues $E_1,\ldots ,E_{D_\textrm{tot}}$ of $\hat{H}$ are nondegenerate.

It is believed that the energy eigenvalues of a quantum many-body system are, in general, nondegenerate unless there are special reasons (such as symmetry) that cause degeneracy. In other words, it is likely that accidental degeneracies can always be lifted by adding an appropriate small perturbation to the Hamiltonian. It is, however, not at all easy to turn this intuition into a proof for a concrete class of models. In Sect. 3.2, we shall prove that some free fermion models on a chain are indeed free from degeneracy. See Theorems 3.1 and 3.2.

Assumption 2.2

For any $j=1,\ldots ,D_\textrm{tot}$, the energy eigenstate $|\Psi _j\rangle $ satisfies

$$\begin{aligned} \langle \Psi _j|\hat{P}_1|\Psi _j\rangle \le 2^{-N}. \end{aligned}$$

(2.5)

Here $\langle \Psi _j|\hat{P}_1|\Psi _j\rangle $ is the probability to find all particles in $\Lambda _1$ in the state $|\Psi _j\rangle $. Note that one gets the probability $2^{-N}$ if each particle independently chooses between $\Lambda _1$ and $\Lambda _2$ with probability 1/2. The bound (2.5) is reasonable since the hardcore nature further reduces the probability. We expect the bound (2.5) to hold for a large class of interacting quantum lattice gases, but, for the moment, we are able to prove it for a class of non-interacting fermions (Sect. 3.3 and Appendix B.2) and systems of interacting fermions or hardcore bosons on a double lattice with special symmetry (Appendix B.1).
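The remark that the hardcore constraint lowers the probability below $2^{-N}$ can be illustrated by simple counting. The sketch below is not a check of Assumption 2.2 itself, which concerns individual energy eigenstates; as a stand-in it uses the uniform average over all configurations, for which the probability of finding every particle in $\Lambda _1$ is $D_1/D_\textrm{tot}$. The function name and the sample sizes are hypothetical.

```python
from math import comb

def all_in_left_probability(L, N):
    # Fraction of N-particle hardcore configurations with every particle in Lambda_1.
    size_left = L // 2  # (L-1)/2 for odd L, L/2 for even L
    return comb(size_left, N) / comb(L, N)

for L, N in [(11, 2), (41, 4), (101, 10)]:
    print(L, N, all_in_left_probability(L, N), 2.0 ** (-N))
```

In each case the counting ratio lies below $2^{-N}$, since every factor $(|\Lambda _1|-i)/(L-i)$ in the ratio of binomial coefficients is at most 1/2.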

We also note that Assumption 2.2 is reminiscent of the strong ETH in the sense that it is an assertion about every energy eigenstate. But this is much weaker than the standard ETH since we only require that a single observable, rather than all macroscopic observables, satisfies the inequality (2.5), rather than an equality.

In what follows, we first show that, under Assumption 2.2, a random initial state has an extremely large effective dimension with high probability (Theorem 2.3). Then, by combining Assumption 2.1 and the largeness of effective dimension, we conclude that this initial state thermalizes (Theorem 2.4).

2.2 Initial State and its Effective Dimension

Let $|\Phi (0)\rangle \in \mathcal{H}_\textrm{tot}$ be the normalized initial state of the system. We define the effective dimension $D_\textrm{eff}$ of $|\Phi (0)\rangle $ by

$$\begin{aligned} D_\textrm{eff}=\biggl (\sum _{j=1}^{D_\textrm{tot}}\bigl |\langle \Phi (0)|\Psi _j\rangle \bigr |^4\biggr )^{-1}, \end{aligned}$$

(2.6)

which quantifies the effective number of energy eigenstates that constitute the state $|\Phi (0)\rangle $. It holds in general that $1\le D_\textrm{eff}\le D_\textrm{tot}$. It is known that an initial state whose effective dimension $D_\textrm{eff}$ is almost as large as $D_\textrm{tot}$ generically exhibits thermalization, provided that the energy eigenvalues are nondegenerate. See section 6 of [17]. (See Appendix A for necessary modifications when there are degeneracies.) It is strongly believed that a realistic nonequilibrium initial state of a non-integrable many-body quantum system has an effective dimension almost as large as the total Hilbert space dimension.^{Footnote 4} See [48,49,50] for systematic convincing numerical studies.^{Footnote 5} However, it seems to be formidably difficult to prove this expectation rigorously. The currently available general lower bound for $D_\textrm{eff}$ shows only that it is moderately large [51]. Our major task is to construct an initial state $|\Phi (0)\rangle $ that is far from equilibrium and has a large effective dimension $D_\textrm{eff}$.
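To make the definition (2.6) concrete, here is a minimal Python sketch that evaluates $D_\textrm{eff}$ from a list of overlaps $\langle \Psi _j|\Phi (0)\rangle $. The function name and the three toy overlap vectors (with a hypothetical total dimension of 16) are illustrative assumptions, not states considered in the paper.

```python
def effective_dimension(overlaps):
    # overlaps[j] = <Psi_j|Phi(0)>; assumes sum_j |overlaps[j]|^2 = 1.
    return 1.0 / sum(abs(c) ** 4 for c in overlaps)

D = 16
single_eigenstate = [1.0] + [0.0] * (D - 1)            # one eigenstate only
uniform_spread = [D ** -0.5] * D                       # equal weight on all
two_level = [2 ** -0.5, 1j * 2 ** -0.5] + [0.0] * (D - 2)

print(effective_dimension(single_eigenstate))
print(effective_dimension(uniform_spread))
print(effective_dimension(two_level))
```

The three cases give 1, 16, and 2 (up to rounding), matching the general bounds $1\le D_\textrm{eff}\le D_\textrm{tot}$ and the reading of $D_\textrm{eff}$ as the number of eigenstates that effectively contribute.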

To realize such an initial state with large $D_\textrm{eff}$, we choose $|\Phi (0)\rangle $ randomly from the subspace $\mathcal{H}_1$. To be precise, denoting by $\{|\Xi _j\rangle \}_{j=1,\ldots ,D_1}$ an arbitrary orthonormal basis of $\mathcal{H}_1$, we prepare an initial state as $|\Phi (0)\rangle =\sum _{j=1}^{D_1}c_j|\Xi _j\rangle $, where the coefficients $c_j\in \mathbb {C}$ satisfy $\sum _j|c_j|^2=1$ and are drawn randomly according to the uniform measure on the unit sphere in the $D_1$-dimensional complex space. Such $|\Phi (0)\rangle $ describes a nonequilibrium state such that all particles are confined in the sublattice $\Lambda _1$, while the state restricted to $\Lambda _1$ is in thermal equilibrium at infinite temperature. In this state, the infinite temperature state in $\Lambda _1$ borders a vacuum in $\Lambda _2$. Therefore we can interpret the present initial state as a limiting case of a nonequilibrium state in which two equilibrium states with different pressures are in touch with each other.
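One standard way to draw coefficients uniformly from the unit sphere is to normalize a vector of independent complex Gaussians. The Python sketch below uses this recipe and estimates the average of $|c_1|^4$ by Monte Carlo, which for a unit basis vector should approach the value $2/(D_1(D_1+1))$ appearing in the identity (2.8). The sampling method, the function names, the toy dimension 4, the sample count, and the seed are all assumptions of this example rather than choices made in the paper.

```python
import math
import random

def random_unit_vector(dim, rng):
    # Normalized complex Gaussian vector: uniform on the unit sphere in C^dim.
    v = [complex(rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)) for _ in range(dim)]
    norm = math.sqrt(sum(abs(c) ** 2 for c in v))
    return [c / norm for c in v]

def mean_fourth_moment(dim, samples, seed=0):
    # Monte Carlo estimate of the average of |c_1|^4 over random unit vectors.
    rng = random.Random(seed)
    total = 0.0
    for _ in range(samples):
        total += abs(random_unit_vector(dim, rng)[0]) ** 4
    return total / samples

dim = 4
estimate = mean_fourth_moment(dim, 20000)
exact = 2.0 / (dim * (dim + 1))
print(estimate, exact)
```

The estimate fluctuates around the exact value 0.1 for this toy dimension, with a statistical error of order $10^{-3}$ at 20000 samples.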

We then have the following essential result, which is the main new observation in the present paper.

Theorem 2.3

Suppose that Assumption 2.2 is valid and that $\rho \le 1/5$. Then, for sufficiently large N, one has

$$\begin{aligned} \frac{D_\textrm{tot}}{D_\textrm{eff}}\le e^{\rho N}, \end{aligned}$$

(2.7)

with probability larger than $1-e^{-(\rho /3)N}$.

Here the probability is that for the random choice of the initial state $|\Phi (0)\rangle $. We thus see that, when $\rho $ is small, the effective dimension $D_\textrm{eff}$ is almost as large as $D_\textrm{tot}$ with probability very close to one. We shall see in Sect. 2.3 below that the upper bound (2.7) implies thermalization in a certain sense.

Proof of Theorem 2.3

It is well known (and can easily be shown) that for any $|\Xi \rangle \in \mathcal{H}_1$, one has

$$\begin{aligned} \overline{\bigl |\langle \Phi (0)|\Xi \rangle \bigr |^4}=\frac{2}{D_1(D_1+1)}\,\Vert |\Xi \rangle \Vert ^4, \end{aligned}$$

(2.8)

where the bar on the left-hand side denotes the average over the random choice of $|\Phi (0)\rangle $. See, e.g., [52]. Noting that $\langle \Phi (0)|\Psi _j\rangle =\langle \Phi (0)|\hat{P}_1|\Psi _j\rangle $ and that $\hat{P}_1|\Psi _j\rangle \in \mathcal{H}_1$, we find from (2.6) and (2.8) that

$$\begin{aligned} \overline{D_\textrm{eff}^{-1}} =\sum _{j=1}^{D_\textrm{tot}}\overline{\bigl |\langle \Phi (0)|\hat{P}_1|\Psi _j\rangle \bigr |^4} =\frac{2}{D_1(D_1+1)}\sum _{j=1}^{D_\textrm{tot}}\Vert \hat{P}_1|\Psi _j\rangle \Vert ^4. \end{aligned}$$

(2.9)

By using the assumed bound (2.5), which is written as $\Vert \hat{P}_1|\Psi _j\rangle \Vert ^2\le 2^{-N}$, we find

$$\begin{aligned} \overline{D_\textrm{eff}^{-1}}&\le \frac{2}{D_1(D_1+1)2^N}\sum _{j=1}^{D_\textrm{tot}}\Vert \hat{P}_1|\Psi _j\rangle \Vert ^2 =\frac{2}{D_1(D_1+1)2^N}{\text {Tr}}[\hat{P}_1] =\frac{2}{(D_1+1)2^N}, \end{aligned}$$

(2.10)

where we noted that ${\text {Tr}}[\hat{P}_1]=D_1$. Recalling (2.1) and (2.4), we see that

$$\begin{aligned} D_\textrm{tot}\overline{D_\textrm{eff}^{-1}}\le \frac{2D_\textrm{tot}}{2^ND_1}\sim \exp [L\,S(\rho )-\tfrac{L}{2}S(2\rho )-N\log 2]=e^{g(\rho )L}, \end{aligned}$$

(2.11)

with

$$\begin{aligned} g(\rho )&=S(\rho )-\tfrac{1}{2}S(2\rho )-\rho \log 2 \nonumber \\&=-(1-\rho )\log (1-\rho )+\frac{1-2\rho }{2}\log (1-2\rho ) \nonumber \\&=\frac{\rho ^2}{2}+\frac{\rho ^3}{2}+\frac{7\rho ^4}{12}+\cdots <\frac{2}{3}\rho ^2. \end{aligned}$$

(2.12)

Here the final inequality is verified for $\rho \in [0,1/5]$ with the aid of numerical evaluation. We can rewrite the estimate (2.11) into the bound

$$\begin{aligned} D_\textrm{tot}\overline{D_\textrm{eff}^{-1}}\le \exp [\tfrac{2}{3}\rho ^2L]=\exp [\tfrac{2}{3}\rho N], \end{aligned}$$

(2.13)

provided that L (or N) is sufficiently large. Theorem 2.3 then follows from Markov’s inequality as follows. Let p be the probability that $D_\textrm{tot}D_\textrm{eff}^{-1}$ is larger than $e^{\rho N}$. Then we see $D_\textrm{tot}\overline{D_\textrm{eff}^{-1}}\ge p\,e^{\rho N}$, which, with (2.13), implies $p\le e^{-(\rho /3)N}$. $\square $
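The numerical verification of the inequality in (2.12) is easy to repeat. The following Python sketch evaluates $g(\rho )$ from the second line of (2.12) on a grid in $(0,1/5]$ and compares it with $\frac{2}{3}\rho ^2$ and with the truncated series; the function names and the grid of 1000 points are hypothetical choices, and a grid check is of course only indicative.

```python
import math

def g(rho):
    # Second line of (2.12): -(1-rho) log(1-rho) + ((1-2 rho)/2) log(1-2 rho).
    return -(1 - rho) * math.log(1 - rho) + 0.5 * (1 - 2 * rho) * math.log(1 - 2 * rho)

def g_series(rho):
    # Leading terms of the expansion quoted in (2.12).
    return rho**2 / 2 + rho**3 / 2 + 7 * rho**4 / 12

grid = [0.2 * k / 1000 for k in range(1, 1001)]
bound_holds = all(g(r) < (2.0 / 3.0) * r**2 for r in grid)
print(bound_holds, g(0.2), (2.0 / 3.0) * 0.2**2)
print(g(0.1), g_series(0.1))
```

At the endpoint $\rho =1/5$ one finds $g\approx 0.02527$ against $\frac{2}{3}\rho ^2\approx 0.02667$, so the margin is small there. Since the ratio $g(\rho )/\rho ^2$ grows with $\rho $, the restriction $\rho \le 1/5$ is close to the largest range for which this particular constant works.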

One may prefer a statement for a definite (i.e., non-random) initial state rather than that for (the majority of) random initial states. In Appendix C, we discuss a non-random initial state whose effective dimension almost saturates as in Theorem 2.3.

2.3 Time Evolution and Thermalization

Let us now consider the state obtained from the initial state $|\Phi (0)\rangle $ by the unitary time evolution, i.e.,

$$\begin{aligned} |\Phi (t)\rangle =e^{-i\hat{H}t}|\Phi (0)\rangle =\sum _{j=1}^{D_\textrm{tot}}e^{-iE_jt}|\Psi _j\rangle \langle \Psi _j|\Phi (0)\rangle . \end{aligned}$$

(2.14)

We expect that, for sufficiently large and typical t, the time-evolved state $|\Phi (t)\rangle $ describes (in a certain physical sense) the thermal equilibrium at infinite temperature. See the next subsection.

To examine the property of the state $|\Phi (t)\rangle $, we take an arbitrary subset $\Gamma $ of $\Lambda $ such that $|\Gamma |=\gamma L$, where $\gamma $ is a constant of order 1, and measure the proportion of particles in $\Gamma $. We shall prove that, for sufficiently large and typical time t, the proportion is close to its equilibrium value, $\gamma $, with probability very close to one. This type of statement has been shown in the literature for initial states with extremely large effective dimensions [17, 22], and we follow the standard idea. Our precise statement is as follows.

Theorem 2.4

We fix the (small) density $\rho >0$, and take sufficiently large L and N such that $N/L\simeq \rho $. We consider a system of N particles on the lattice $\Lambda $ such that $|\Lambda |=L$ and let $\hat{H}$ be the Hamiltonian. Suppose that Assumption 2.1 about nondegeneracy is valid and also that the effective dimension $D_\textrm{eff}$ is large enough to satisfy the bound (2.7). (This is guaranteed by Theorem 2.3 to be extremely likely.) Take any $\Gamma \subset \Lambda $ such that $|\Gamma |=\gamma L$, and let $\hat{N}_\Gamma $ be the operator that counts the number of particles in $\Gamma $. Then there exists a constant $T>0$ and a subset (a collection of intervals) $G\subset [0,T]$ with

$$\begin{aligned} \frac{\mu (G)}{T}\ge 1-e^{-(\rho /4)N}, \end{aligned}$$

(2.15)

where $\mu (G)$ is the total length of the intervals in G. Suppose that one performs a measurement of the number operator $\hat{N}_\Gamma $ in the state $|\Phi (t)\rangle $ with arbitrary $t\in G$. Then, with probability larger than $1-e^{-(\rho /4)N}$, the measurement result $N_\Gamma $ satisfies

$$\begin{aligned} \Bigl |\frac{N_\Gamma }{N}-\gamma \Bigr |\le \varepsilon _0(\rho ), \end{aligned}$$

(2.16)

where the precision is given by

$$\begin{aligned} \varepsilon _0(\rho )=\sqrt{6\gamma (1-\gamma )\rho }. \end{aligned}$$

(2.17)

Here the probability is that for the quantum mechanical measurement. Suppose that N is sufficiently large so that $e^{-(\rho /4)N}$ is negligibly small. Then the theorem guarantees that (2.16) almost certainly holds for almost all t in [0, T]. The bound (2.16) states that the observed proportion $N_\Gamma /N$ is close to its equilibrium value, $\gamma $. Recalling that the initial state $|\Phi (0)\rangle $ is a nonequilibrium state in which all particles are in $\Lambda _1$, we have established that the system thermalizes only by means of unitary time evolution (2.14).

We must note, however, that the precision in the relation (2.16) is given by $\varepsilon _0(\rho )$, which is a function of $\rho $ as in (2.17) and may not be small. In fact, we need to make the density $\rho $ sufficiently low to achieve high precision. If one demands that the precision $\varepsilon _0(\rho )$ should be, for example, of order $10^{-2}$ then $\rho $ should be of order $10^{-4}$. This density dependence of the precision and the resulting limitation to dilute gases are the major shortcomings of the present theory, which come from our strategy to base the whole theory on mild verifiable assumptions, namely, Assumptions 2.1 and 2.2.
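The trade-off between density and precision can be quantified by inverting (2.17). The Python sketch below computes the density at which $\varepsilon _0(\rho )$ reaches a target value, and the particle number at which the factor $e^{-(\rho /4)N}$ drops below a chosen threshold. The function names, the target precision $10^{-2}$, and the threshold $10^{-6}$ are hypothetical and serve only as an example.

```python
import math

def precision(rho, gamma):
    # epsilon_0(rho) = sqrt(6 gamma (1 - gamma) rho), as in (2.17).
    return math.sqrt(6.0 * gamma * (1.0 - gamma) * rho)

def density_for_precision(eps, gamma):
    # Invert (2.17): the density at which epsilon_0(rho) equals eps.
    return eps**2 / (6.0 * gamma * (1.0 - gamma))

def particles_for_failure(rho, delta):
    # Smallest integer N with exp(-(rho/4) N) <= delta.
    return math.ceil(4.0 * math.log(1.0 / delta) / rho)

rho_needed = density_for_precision(1e-2, 0.5)
n_needed = particles_for_failure(rho_needed, 1e-6)
print(rho_needed, n_needed)
```

For $\gamma =1/2$ and a target precision of $10^{-2}$, the required density is about $6.7\times 10^{-5}$, and the exceptional probability falls below $10^{-6}$ only for N of order $10^{6}$, so the corresponding chain has L of order $10^{10}$ sites.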

We should also remark that our criterion for thermal equilibrium deals only with the number of particles in an arbitrary macroscopic region. We have proved the presence of thermalization, but only with respect to this rather restricted criterion. This again reflects the limitation arising from our mild assumptions. Although we expect that thermalization for other macroscopic quantities reflecting single-particle properties can be established by a straightforward extension of the present analysis, we are far from treating quantities that involve particle-particle correlations. See the discussion at the end of Sect. 4.

Proof of Theorem 2.4

The proof consists essentially of a combination of standard arguments found in the literature. For $\varepsilon >0$, let $\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}$ denote the projection operator onto the subspace of $\mathcal{H}_\textrm{tot}$ determined by

$$\begin{aligned} \biggl |\frac{\hat{N}_\Gamma }{N}-\gamma \biggr |\ge \varepsilon . \end{aligned}$$

(2.18)

Clearly, the expectation value $\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon _0(\rho )}_\textrm{neq}|\Phi (t)\rangle $ is the probability that the measurement result of $\hat{N}_\Gamma $ in $|\Phi (t)\rangle $ does not satisfy the relation (2.16). From (2.14), we see that

$$\begin{aligned} \langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Phi (t)\rangle =\sum _{j,j'=1}^{D_\textrm{tot}}e^{i(E_j-E_{j'})t} \langle \Phi (0)|\Psi _{j}\rangle \langle \Psi _j|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Psi _{j'}\rangle \langle \Psi _{j'}|\Phi (0)\rangle . \end{aligned}$$

(2.19)

Since we assumed that the energy eigenvalues $E_j$ are nondegenerate, the long-time average of $\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Phi (t)\rangle $ is expressed in terms of a single sum as

$$\begin{aligned} \lim _{T\uparrow \infty }\frac{1}{T}\int _0^Tdt\,\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Phi (t)\rangle&=\sum _{j=1}^{D_\textrm{tot}}\bigl |\langle \Phi (0)|\Psi _{j}\rangle \bigr |^2\langle \Psi _j|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Psi _j\rangle \nonumber \\&\le \sqrt{\biggl (\sum _{j=1}^{D_\textrm{tot}}\bigl |\langle \Phi (0)|\Psi _{j}\rangle \bigr |^4\biggr ) \biggl (\sum _{j=1}^{D_\textrm{tot}}\langle \Psi _j|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Psi _j\rangle ^2\biggr ) }\nonumber \\&\le \sqrt{ D_\textrm{tot}D_\textrm{eff}^{-1}\,\langle \hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}\rangle _\infty }, \end{aligned}$$

(2.20)

where we defined the canonical average at infinite temperature by

$$\begin{aligned} \langle \cdots \rangle _\infty =\frac{{\text {Tr}}_{\mathcal{H}_\textrm{tot}}[\cdots ]}{D_\textrm{tot}}. \end{aligned}$$

(2.21)

In (2.20), the second line follows from the Schwarz inequality, and the final expression follows from (2.6) by noting $\langle \Psi _j|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Psi _j\rangle ^2\le \langle \Psi _j|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Psi _j\rangle $.
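The dephasing step behind the first line of (2.20), and the chain of inequalities that follows, can be illustrated on a toy model. The sketch below is not part of the proof. The four energy levels, the overlaps, and the rank-one projection are hypothetical numbers chosen by us, and the time integral of each term in (2.19) is evaluated in closed form.

```python
import cmath
import math

# Toy data (all hypothetical): four nondegenerate levels with distinct gaps.
energies = [0.0, 1.0, 2.3, 3.9]
c = [0.5, 0.5, 0.5, 0.5]                                  # real overlaps with |Phi(0)>, normalized
v = [x / math.sqrt(30.0) for x in (1.0, 2.0, 3.0, 4.0)]   # P = |v><v| in the energy eigenbasis

def P(j: int, jp: int) -> float:
    """Matrix element <Psi_j|P|Psi_j'> of the rank-one projection."""
    return v[j] * v[jp]

def time_average(T: float) -> float:
    """Average of <Phi(t)|P|Phi(t)> over [0, T], integrating (2.19) term by term."""
    total = 0.0 + 0.0j
    for j, Ej in enumerate(energies):
        for jp, Ejp in enumerate(energies):
            w = Ej - Ejp
            # (1/T) * integral_0^T exp(i w t) dt in closed form
            phase = 1.0 if w == 0.0 else (cmath.exp(1j * w * T) - 1.0) / (1j * w * T)
            total += phase * c[j] * P(j, jp) * c[jp]
    return total.real

n = len(energies)
diagonal = sum(c[j] ** 2 * P(j, j) for j in range(n))          # first line of (2.20)
schwarz = math.sqrt(sum(x ** 4 for x in c) * sum(P(j, j) ** 2 for j in range(n)))
d_eff = 1.0 / sum(x ** 4 for x in c)                           # effective dimension
p_inf = sum(P(j, j) for j in range(n)) / n                     # infinite-temperature average
final = math.sqrt(n / d_eff * p_inf)                           # last line of (2.20)

print(time_average(1e4), diagonal, schwarz, final)
```

Because all energy differences are nonzero, every off-diagonal term decays like $1/T$, so only the single sum survives in the long-time average.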

Below we prove the large-deviation type upper bound

$$\begin{aligned} \langle \hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}\rangle _\infty \le C\exp \Bigl [-\frac{\varepsilon ^2}{3\gamma (1-\gamma )}N\Bigr ] =C\exp \Bigl [-2\rho \Bigl (\frac{\varepsilon }{\varepsilon _0(\rho )}\Bigr )^2N\Bigr ], \end{aligned}$$

(2.22)

with a constant $C>1$, assuming that N is sufficiently large and $\varepsilon $ is sufficiently small. Note that the right-hand side reduces to $Ce^{-2\rho N}$ if we set $\varepsilon =\varepsilon _0(\rho )$. Recalling (2.7), we find that the right-hand side of (2.20) with $\varepsilon =\varepsilon _0(\rho )$ is bounded from above by $\sqrt{C}\,e^{-(\rho /2)N}$. This means that there is sufficiently large T such that the finite-time average satisfies

$$\begin{aligned} \frac{1}{T}\int _0^Tdt\,\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon _0(\rho )}_\textrm{neq}|\Phi (t)\rangle \le e^{-(\rho /2)N}. \end{aligned}$$

(2.23)

To rewrite the obtained relation into the form of Theorem 2.4, we apply Markov’s inequality. We let G be the set of times at which (2.16) is satisfied with probability at least $1-e^{-(\rho /4)N}$:

$$\begin{aligned} G=\bigl \{t\in [0,T]\,\bigl |\,\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon _0(\rho )}_\textrm{neq}|\Phi (t)\rangle \le e^{-(\rho /4)N}\bigr \}. \end{aligned}$$

(2.24)

The property of G stated in the theorem is fulfilled by construction. It remains to verify (2.15) for the above G. For this, it suffices to note that

$$\begin{aligned} \frac{1}{T}\int _0^Tdt\,\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon _0(\rho )}_\textrm{neq}|\Phi (t)\rangle \ge \frac{1}{T}\int _{t\in [0,T]\backslash G}dt\,e^{-(\rho /4)N} =\frac{T-\mu (G)}{T}\,e^{-(\rho /4)N},\nonumber \\ \end{aligned}$$

(2.25)

which, with (2.23), implies the desired (2.15). $\square $
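The Markov-inequality step can be mimicked numerically. In the sketch below, the spiky nonnegative signal is a hypothetical stand-in for $\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon _0(\rho )}_\textrm{neq}|\Phi (t)\rangle $, its mean plays the role of the bound in (2.23), and the square root of the mean plays the role of the threshold $e^{-(\rho /4)N}$ in (2.24). None of these numbers comes from the model.

```python
import math

def bad_time_fraction(samples, threshold):
    """Fraction of sampled times at which the signal exceeds the threshold."""
    return sum(1 for p in samples if p > threshold) / len(samples)

# Hypothetical signal in [0, 1]: small most of the time, with narrow recurrences.
times = [0.01 * n for n in range(100_000)]
signal = [((1.0 + math.cos(t)) / 2.0) ** 40 for t in times]

mean = sum(signal) / len(signal)       # analogue of the time average in (2.23)
threshold = math.sqrt(mean)            # analogue of e^{-(rho/4)N} in (2.24)
fraction = bad_time_fraction(signal, threshold)

# Markov's inequality: the fraction of bad times is at most mean / threshold.
print(fraction, mean / threshold)
```

The same argument as in (2.25) shows that the bad fraction can never exceed the ratio of the mean to the threshold, whatever the nonnegative signal is.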

Derivation of (2.22)

We shall be brief since the derivation is standard and elementary. Let $\hat{P}_M$ be the projection onto the subspace with $\hat{N}_\Gamma =M$. It is clear that

$$\begin{aligned} \hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}=\mathop {\sum _{M}}_{(|M/N-\gamma |\ge \varepsilon )}\hat{P}_M, \end{aligned}$$

(2.26)

and

$$\begin{aligned} \langle \hat{P}_M\rangle _\infty \sim \exp \left[ \gamma L\,S\left( \frac{M}{\gamma L}\right) +(1-\gamma )L\,S\left( \frac{N-M}{(1-\gamma )L}\right) -L\,S \left( \frac{N}{L}\right) \right] . \end{aligned}$$

(2.27)

When $|M/N-\gamma |=\varepsilon $ or, equivalently, $M/N-\gamma =\pm \varepsilon $, the first two arguments of $S(\cdot )$ in the above expression read

$$\begin{aligned} \frac{M}{\gamma L}=\Bigl (1\pm \frac{\varepsilon }{\gamma }\Bigr )\rho ,\quad \frac{N-M}{(1-\gamma )L}=\Bigl (1\mp \frac{\varepsilon }{1-\gamma }\Bigr )\rho . \end{aligned}$$

(2.28)

Since $\langle \hat{P}_M\rangle _\infty $ takes a very sharp maximum around M such that $M/(\gamma L)=\rho $, the probability that $|M/N-\gamma |\ge \varepsilon $ is almost the same as the probability that $|M/N-\gamma |\simeq \varepsilon $. We thus have

$$\begin{aligned} \langle \hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}\rangle _\infty&\sim \max _\pm \exp \left[ \gamma L\,S\left( \left( 1\pm \tfrac{\varepsilon }{\gamma }\right) \rho \right) +(1-\gamma )L\,S\left( \left( 1\mp \tfrac{\varepsilon }{1-\gamma }\right) \rho \right) -L\,S(\rho )\right] \nonumber \\&=\exp \biggl [-\Bigl \{\frac{1}{2}\frac{1}{\gamma (1-\gamma )}\frac{\rho }{1-\rho }\varepsilon ^2+O(\varepsilon ^3)\Bigr \}\,L\biggr ]. \end{aligned}$$

(2.29)

For sufficiently large L and small $\varepsilon $ this is converted into the inequality (2.22). $\square $
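For moderate sizes, the bound (2.22) can be compared with an exact count. The sketch below assumes that $\Gamma $ consists of $\gamma L$ sites and that each site holds at most one particle, so that $\langle \hat{P}_M\rangle _\infty $ is a ratio of binomial coefficients. The function names, the sample parameters, and the choice $C=1$ are ours.

```python
from fractions import Fraction
from math import comb, exp

def neq_weight(L: int, N: int, size_gamma: int, eps: Fraction) -> float:
    """Infinite-temperature weight of {|N_Gamma/N - gamma| >= eps} by exact counting."""
    gamma = Fraction(size_gamma, L)
    bad = 0
    for M in range(0, N + 1):
        # Exact rational comparison avoids floating-point trouble at the boundary.
        if abs(Fraction(M, N) - gamma) >= eps:
            # M particles inside Gamma, N - M particles outside.
            bad += comb(size_gamma, M) * comb(L - size_gamma, N - M)
    return float(Fraction(bad, comb(L, N)))

def bound_2_22(N: int, gamma: float, eps: float, C: float = 1.0) -> float:
    """Right-hand side of (2.22) with a user-chosen constant C."""
    return C * exp(-eps ** 2 * N / (3.0 * gamma * (1.0 - gamma)))

weight = neq_weight(L=2000, N=100, size_gamma=1000, eps=Fraction(1, 5))
print(weight, bound_2_22(N=100, gamma=0.5, eps=0.2))
```

For these sample parameters the exact weight lies below the bound even with $C=1$, as one expects from the slack between the factor $1/2(1-\rho )$ in (2.29) and the factor $1/3$ in (2.22).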

2.4 Nature of the Final State

As we have noted several times, we expect that the state $|\Phi (t)\rangle $ with sufficiently large and typical t represents (with certain limited accuracy) the thermal equilibrium state of the whole system at infinite temperature. Here we briefly explain why the infinite temperature state, rather than a finite temperature state, is the destination of the relaxation process.

Let us assume in general that the Hamiltonian is written as

$$\begin{aligned} \hat{H}=\hat{H}_1+\hat{H}_2+{\varDelta }\hat{H}, \end{aligned}$$

(2.30)

where $\hat{H}_1$ and $\hat{H}_2$ act only on $\Lambda _1$ and $\Lambda _2$, respectively, and ${\varDelta }\hat{H}$ is the interaction Hamiltonian between $\Lambda _1$ and $\Lambda _2$. We assume that $\hat{H}_1$ and $\hat{H}_2$ are almost identical and ${\varDelta }\hat{H}$ is smaller.^{Footnote 6} We shall use the standard convention that $\hat{H}_1|\Phi _\textrm{vac}\rangle =\hat{H}_2|\Phi _\textrm{vac}\rangle ={\varDelta }\hat{H}|\Phi _\textrm{vac}\rangle =0$, where $|\Phi _\textrm{vac}\rangle $ is the state with no particles. Then we see from the energy conservation that

$$\begin{aligned} \langle \Phi (t)|\hat{H}|\Phi (t)\rangle =\langle \Phi (0)|\hat{H}|\Phi (0)\rangle \simeq \langle \Phi (0)|\hat{H}_1|\Phi (0)\rangle \simeq \frac{{\text {Tr}}_{\mathcal{H}_1}[\hat{H}_1]}{{\text {Tr}}_{\mathcal{H}_1}[\hat{1}]}, \end{aligned}$$

(2.31)

where we recalled that $|\Phi (0)\rangle $ is drawn randomly from $\mathcal{H}_1$. In a standard lattice gas model at low density, we expect from extensivity that

$$\begin{aligned} \frac{{\text {Tr}}_{\mathcal{H}_1}[\hat{H}_1]}{{\text {Tr}}_{\mathcal{H}_1}[\hat{1}]}\simeq \frac{{\text {Tr}}_{\mathcal{H}_\textrm{tot}}[\hat{H}]}{{\text {Tr}}_{\mathcal{H}_\textrm{tot}}[\hat{1}]}\simeq N\epsilon _\infty , \end{aligned}$$

(2.32)

where $\epsilon _\infty $ is the energy per particle in the equilibrium state at infinite temperature. We thus see

$$\begin{aligned} \langle \Phi (t)|\hat{H}|\Phi (t)\rangle \simeq N\epsilon _\infty , \end{aligned}$$

(2.33)

i.e., $|\Phi (t)\rangle $ has almost the same energy as the equilibrium state of the whole system at infinite temperature. This is confirmed explicitly for the free fermion chain. In summary, if the initial state $|\Phi (0)\rangle $ has an almost saturating effective dimension, then the state after time evolution $|\Phi (t)\rangle $ represents thermal equilibrium at infinite temperature.

3 Free Fermion on the Chain

In this section, we discuss our concrete example, namely the one-dimensional system of free fermions. We shall show that the model satisfies Assumptions 2.1 and 2.2 if we take a suitable setting.

3.1 Energy Eigenstates and Eigenvalues

We consider the chain $\Lambda =\{1,2,\ldots ,L\}$, where L is odd. We denote the sites as $x,y,\ldots \in \Lambda $. Let $\hat{c}_x$ and $\hat{c}^\dagger _x$ be the annihilation and creation operators, respectively, of the fermion at site $x\in \Lambda $. They satisfy the canonical anticommutation relations $\{\hat{c}_x,\hat{c}_y\}=0$ and $\{\hat{c}_x,\hat{c}^\dagger _y\}=\delta _{x,y}$ for any $x,y\in \Lambda $, where $\{\hat{A},\hat{B}\}=\hat{A}\hat{B}+\hat{B}\hat{A}$. We denote by $|\Phi _\textrm{vac}\rangle $ the state with no particles.

We take the standard Hamiltonian

$$\begin{aligned} \hat{H}=\sum _{x=1}^L\bigl \{e^{i\theta }\,\hat{c}^\dagger _x\hat{c}_{x+1}+e^{-i\theta }\,\hat{c}^\dagger _{x+1}\hat{c}_x\bigr \}, \end{aligned}$$

(3.1)

where we set the hopping amplitude to be unity for convenience. We introduced the artificial phase factor $\theta \in [0,2\pi )$ in order to avoid degeneracy. We impose the periodic boundary condition and identify $\hat{c}_{L+1}$ with $\hat{c}_1$.

The Hamiltonian $\hat{H}$ is readily diagonalized in terms of the plane wave states. Setting the k-space as

$$\begin{aligned} \mathcal{K}=\Bigl \{\frac{2\pi }{L}\nu \,\Bigl |\,\nu =0,\pm 1,\ldots ,\pm \frac{L-1}{2}\Bigr \}, \end{aligned}$$

(3.2)

we define the creation operator

$$\begin{aligned} \hat{a}^\dagger _k=\frac{1}{\sqrt{L}}\sum _{x=1}^Le^{ikx}\,\hat{c}^\dagger _x, \end{aligned}$$

(3.3)

for $k\in \mathcal{K}$. It holds that $\{\hat{a}^\dagger _k,\hat{a}_{k'}\}=\delta _{k,k'}$. One can show from the basic anticommutation relations that

$$\begin{aligned}{}[\hat{H},\hat{a}^\dagger _k]=2\cos (k+\theta )\,\hat{a}^\dagger _k. \end{aligned}$$

(3.4)

Let $\varvec{k}=(k_1,\ldots ,k_N)$ denote a collection of N elements in $\mathcal{K}$ such that $k_j<k_{j+1}$ for $j=1,\ldots ,N-1$, and define

$$\begin{aligned} |\Psi _{\varvec{k}}\rangle =\hat{a}^\dagger _{k_1}\hat{a}^\dagger _{k_2}\cdots \hat{a}^\dagger _{k_N}|\Phi _\textrm{vac}\rangle . \end{aligned}$$

(3.5)

From (3.4) we readily see that $|\Psi _{\varvec{k}}\rangle $ is an energy eigenstate, i.e.,

$$\begin{aligned} \hat{H}|\Psi _{\varvec{k}}\rangle =E_{\varvec{k}}|\Psi _{\varvec{k}}\rangle , \end{aligned}$$

(3.6)

where the energy eigenvalue is

$$\begin{aligned} E_{\varvec{k}}=2\sum _{j=1}^N\cos (k_j+\theta ). \end{aligned}$$

(3.7)

By counting the dimension, we see that these are the only energy eigenstates and eigenvalues.
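The single-particle content of the commutation relation (3.4) is easy to check numerically. On a one-particle wave function $\psi _x$, the Hamiltonian (3.1) acts as $(h\psi )_x=e^{i\theta }\psi _{x+1}+e^{-i\theta }\psi _{x-1}$ with periodic boundary conditions. The following sketch, with function names of our own choosing, verifies that every plane wave with $k\in \mathcal{K}$ is an eigenvector with eigenvalue $2\cos (k+\theta )$.

```python
import cmath
import math

def apply_hopping(psi, theta):
    """Single-particle action of (3.1): (h psi)_x = e^{i theta} psi_{x+1} + e^{-i theta} psi_{x-1}."""
    L = len(psi)
    return [cmath.exp(1j * theta) * psi[(x + 1) % L]
            + cmath.exp(-1j * theta) * psi[(x - 1) % L] for x in range(L)]

def plane_wave(k, L):
    """Normalized plane wave on the sites x = 1, ..., L (list index x - 1)."""
    return [cmath.exp(1j * k * x) / math.sqrt(L) for x in range(1, L + 1)]

def max_residual(L, theta):
    """Largest entry of h psi_k - 2 cos(k + theta) psi_k over all k in (3.2)."""
    worst = 0.0
    for nu in range(-(L - 1) // 2, (L - 1) // 2 + 1):
        k = 2.0 * math.pi * nu / L
        psi = plane_wave(k, L)
        h_psi = apply_hopping(psi, theta)
        eps_k = 2.0 * math.cos(k + theta)
        worst = max(worst, max(abs(a - eps_k * b) for a, b in zip(h_psi, psi)))
    return worst

print(max_residual(L=7, theta=0.3))  # expected to be at the level of rounding errors
```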

3.2 Justification of Assumption 2.1

We prove two theorems for the free fermion chain that justify Assumption 2.1 about the absence of degeneracy in the energy eigenvalues. Note that the free fermion model on the continuous interval always has degenerate many-body energy eigenvalues. The degeneracy cannot be lifted by the flux insertion (which corresponds to the phase $\theta $). The following results on nondegeneracy essentially rely on the lattice nature of the model.

The first theorem rules out the degeneracy for most values of $\theta $.^{Footnote 7}

Theorem 3.1

(Nondegeneracy of $E_{\varvec{k}}$ for most $\theta $) Let L be an arbitrary odd prime and N be an arbitrary integer with $0<N\le L$. Except for a finite number of $\theta \in [0,2\pi )$, one has $E_{\varvec{k}}\ne E_{\varvec{k}'}$ whenever $\varvec{k}\ne \varvec{k}'$, i.e., the energy eigenvalues $E_{\varvec{k}}$ are nondegenerate.

The theorem, in particular, implies that if one draws $\theta $ randomly, then with probability one, all the energy eigenvalues $E_{\varvec{k}}$ are nondegenerate. The second theorem allows one to choose a model free from degeneracy without relying on a probabilistic choice.

Theorem 3.2

(Nondegeneracy of $E_{\varvec{k}}$ for small $|\theta |\ne 0$) Let L be an arbitrary odd prime and N be an arbitrary integer with $0<N\le L$. For any $\theta \ne 0$ such that

$$\begin{aligned} |\theta |\le \frac{1}{(4N+2L)^{(L-1)/2}}, \end{aligned}$$

(3.8)

one has $E_{\varvec{k}}\ne E_{\varvec{k}'}$ whenever $\varvec{k}\ne \varvec{k}'$, i.e., the energy eigenvalues $E_{\varvec{k}}$ are nondegenerate.

One thus knows that the model with, say, $\theta =(4N+2L)^{-(L-1)/2}$ is free from degeneracy.
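For a small prime L, the statement can be confirmed by brute force. The sketch below enumerates all N-particle eigenvalues (3.7) and returns the smallest gap between neighboring levels. The sample values $L=7$, $N=3$ and the function names are ours, and a floating-point check of this kind is only an illustration, not a substitute for the proof.

```python
import math
from itertools import combinations

def spectrum(L, N, theta):
    """All N-particle eigenvalues (3.7), sorted in increasing order."""
    nus = range(-(L - 1) // 2, (L - 1) // 2 + 1)
    return sorted(
        2.0 * sum(math.cos(2.0 * math.pi * nu / L + theta) for nu in occ)
        for occ in combinations(nus, N)
    )

def min_gap(L, N, theta):
    """Smallest difference between adjacent eigenvalues."""
    E = spectrum(L, N, theta)
    return min(b - a for a, b in zip(E, E[1:]))

L, N = 7, 3
theta = (4 * N + 2 * L) ** (-(L - 1) / 2)   # the value suggested after Theorem 3.2

print("theta = 0     :", min_gap(L, N, 0.0))    # degenerate up to rounding
print("theta = small :", min_gap(L, N, theta))  # strictly positive
```

At $\theta =0$ the minimum gap vanishes up to rounding because of the reflection symmetry $k\rightarrow -k$, while at the small nonzero $\theta $ it is tiny but clearly nonzero.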

As we noted after Assumption 2.1, it is expected that the energy eigenvalues of a quantum many-body system are generically nondegenerate when there is no reason (like symmetry) to cause degeneracy. Even for a model of free fermions, we expect that possible degeneracy can be lifted by tuning some parameters, like the flux $\theta $ or the site-dependent potential or hopping amplitude. However, it turns out that demonstrating nondegeneracy rigorously is very difficult in general. That is why we considered an artificial setting where the system size L is a prime number. In this case, the absence of degeneracy can be demonstrated by using number-theoretic results, as we shall see below.

We also note that the absence of degeneracy was rigorously established in a disordered free fermion chain. See Appendix A of [54].

To prove Theorems 3.1 and 3.2, it is convenient to introduce the standard occupation number description. For a given N-tuple $\varvec{k}=(k_1,\ldots ,k_N)$, we define the corresponding occupation numbers $\varvec{n}=(n_{-(L-1)/2},\ldots ,n_{(L-1)/2})$ as

$$\begin{aligned} n_\nu ={\left\{ \begin{array}{ll} 1,&{}\text {if }2\pi \nu /L=k_j \text { for some }j;\\ 0,&{}\text {otherwise}, \end{array}\right. } \end{aligned}$$

(3.9)

where $\nu =0,\pm 1,\ldots ,\pm (L-1)/2$. One clearly has

$$\begin{aligned} \sum _{\nu =-(L-1)/2}^{(L-1)/2}n_\nu =N. \end{aligned}$$

(3.10)

By using the occupation numbers, the energy eigenstate (3.5) and the energy eigenvalue (3.7) are written as

$$\begin{aligned} |\Psi _{\varvec{n}}\rangle =\Biggl (\prod _{\nu =-(L-1)/2}^{(L-1)/2}(\hat{a}^\dagger _{2\pi \nu /L})^{n_\nu }\Biggr )|\Phi _\textrm{vac}\rangle , \end{aligned}$$

(3.11)

and

$$\begin{aligned} E_{\varvec{n}}=2\sum _{\nu =-(L-1)/2}^{(L-1)/2}n_\nu \,\cos \Bigl (\frac{2\pi }{L}\nu +\theta \Bigr ), \end{aligned}$$

(3.12)

respectively. By defining “complex energy” by

$$\begin{aligned} \mathcal{E}_{\varvec{n}}=\sum _{\nu =-(L-1)/2}^{(L-1)/2}n_\nu \,\zeta ^\nu , \end{aligned}$$

(3.13)

with

$$\begin{aligned} \zeta =e^{i 2\pi /L}, \end{aligned}$$

(3.14)

we can express the energy eigenvalue (3.12) as

$$\begin{aligned} E_{\varvec{n}}=2{\text {Re}}[e^{i\theta }\mathcal{E}_{\varvec{n}}]. \end{aligned}$$

(3.15)

Let us state two number theoretic lemmas,^{Footnote 8} which play essential roles in the proof of Theorems 3.1 and 3.2. We recall L is an odd prime, and $\zeta $ is defined as (3.14).

Lemma 3.3

For any $m_1,\ldots ,m_{L-1}\in \mathbb {Z}$ such that $m_\mu \ne 0$ for some $\mu $, one has

$$\begin{aligned} \sum _{\mu =1}^{L-1}m_\mu \,\zeta ^\mu \ne 0. \end{aligned}$$

(3.16)

Here, it is crucial that the sum is from 1 to $L-1$, rather than from 1 to L. Otherwise (3.16) can never be true because $\sum _{\mu =1}^L\zeta ^\mu =0$. The lemma is a straightforward consequence of the classical result by Gauss, known as the irreducibility of the cyclotomic polynomials of prime index. See, e.g., Chapter 12, Section 2 of [56], and also Chapter 13, Section 2 of [57] or section 3.2 of [58].

The following lemma^{Footnote 9} provides an explicit lower bound for $|\sum _{\mu =1}^{L-1}m_\mu \,\zeta ^\mu |$.

Lemma 3.4

For any $m_1,\ldots ,m_{L-1}\in \mathbb {Z}$ such that $m_\mu \ne 0$ for some $\mu $, one has

$$\begin{aligned} \biggl |\sum _{\mu =1}^{L-1}m_\mu \,\zeta ^\mu \biggr |\ge \biggl (\sum _{\mu =1}^{L-1}|m_\mu |\biggr )^{-(L-3)/2}. \end{aligned}$$

(3.17)

Proof

The lemma is proved by using standard facts about the field norm and algebraic integers. See, e.g., [58]. Let $\alpha =\sum _{\mu =1}^{L-1}m_\mu \,\zeta ^\mu \in \mathbb {Z}[\zeta ]\subset \mathbb {Q}[\zeta ]$ and

$$\begin{aligned} \sigma _j(\alpha )=\sum _{\mu =1}^{L-1}m_\mu \,e^{i2\pi j\mu /L}, \end{aligned}$$

(3.18)

be its conjugates, where $j=1,\ldots ,L-1$. Note that $\sigma _1(\alpha )=\alpha $, $\sigma _j(\alpha )=\{\sigma _{L-j}(\alpha )\}^*$, and $|\sigma _j(\alpha )|\le M$ with $M=\sum _{\mu =1}^{L-1}|m_\mu |$. Let $N:\mathbb {Q}[\zeta ]\rightarrow \mathbb {Q}$ denote the field norm of $\mathbb {Q}[\zeta ]$. By definition, we have

$$\begin{aligned} N(\alpha )=\prod _{j=1}^{L-1}\sigma _j(\alpha )=\prod _{j=1}^{(L-1)/2}|\sigma _j(\alpha )|^2. \end{aligned}$$

(3.19)

Since Lemma 3.3 guarantees $\sigma _j(\alpha )\ne 0$ for all j, we see that $N(\alpha )>0$. Note that $\alpha $ is an algebraic integer, and hence so are its conjugates $\sigma _j(\alpha )$ and the norm $N(\alpha )$. It is known that an algebraic integer that is rational must be an integer. Since $N(\alpha )\in \mathbb {Q}$, we see $N(\alpha )\in \mathbb {Z}$ and hence $N(\alpha )\ge 1$. This bound, with (3.19), implies

$$\begin{aligned} |\alpha |^2\ge \biggl (\prod _{j=2}^{(L-1)/2}\bigl |\sigma _j(\alpha )\bigr |^2\biggr )^{-1}\ge \frac{1}{M^{L-3}}. \end{aligned}$$

(3.20)

$\square $
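The lower bound (3.17) can also be tested exhaustively for small primes. The sketch below runs over all nonzero integer vectors $(m_1,\ldots ,m_{L-1})$ with entries bounded by a cutoff and checks the inequality in floating-point arithmetic. The cutoff, the tolerance, and the function name are our choices.

```python
import cmath
import math
from itertools import product

def check_lemma_3_4(L, max_abs):
    """Check (3.17) for all nonzero integer vectors m with |m_mu| <= max_abs."""
    zeta = cmath.exp(2j * math.pi / L)
    powers = [zeta ** mu for mu in range(1, L)]
    for m in product(range(-max_abs, max_abs + 1), repeat=L - 1):
        M = sum(abs(x) for x in m)
        if M == 0:
            continue  # the lemma excludes the zero vector
        alpha = sum(x * z for x, z in zip(m, powers))
        # Small relative tolerance so that cases of equality survive rounding.
        if abs(alpha) < M ** (-(L - 3) / 2) * (1.0 - 1e-9):
            return False
    return True

print(check_lemma_3_4(L=5, max_abs=3), check_lemma_3_4(L=7, max_abs=1))
```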

We are now ready to prove our physics theorems.

Proof of Theorem 3.1

We first show $\mathcal{E}_{\varvec{n}}\ne \mathcal{E}_{\varvec{n}'}$ if $\varvec{n}\ne \varvec{n}'$, where both $\varvec{n}$ and $\varvec{n}'$ are occupation numbers for N particle energy eigenstates. In other words, the complex energy eigenvalues are nondegenerate. From (3.13), we find

$$\begin{aligned} \mathcal{E}_{\varvec{n}}-\mathcal{E}_{\varvec{n}'}=\sum _{\nu =-(L-1)/2}^{(L-1)/2}(n_\nu -n'_\nu )\,\zeta ^\nu . \end{aligned}$$

(3.21)

We claim that there is at least one $\nu $ such that $n_{\nu }-n'_{\nu }=0$. To see this, it suffices to note that the contrary, i.e., $n_\nu =0$, $n'_\nu =1$ or $n_\nu =1$, $n'_\nu =0$ for every $\nu $, implies $L=2N$, while L is odd. Let $\nu _0$ be such $\nu $, i.e., $n_{\nu _0}-n'_{\nu _0}=0$. Noting that the right-hand side of (3.21) does not contain the term proportional to $\zeta ^{\nu _0}$, we rewrite it as

$$\begin{aligned} \mathcal{E}_{\varvec{n}}-\mathcal{E}_{\varvec{n}'}=\zeta ^{\nu _0}\sum _{\mu =1}^{L-1}m_\mu \,\zeta ^\mu , \end{aligned}$$

(3.22)

with $m_\mu =n_{\nu _0+\mu }-n'_{\nu _0+\mu }$, where we used the “periodic boundary condition”, $\nu =\nu +L$, for the index. Since $m_\mu $ is not identically zero (because $\varvec{n}\ne \varvec{n}'$), we see $\mathcal{E}_{\varvec{n}}-\mathcal{E}_{\varvec{n}'}\ne 0$ from Lemma 3.3.

Now, the statement of the theorem is proved easily. Take any $\varvec{n}$ and $\varvec{n}'$ with $\varvec{n}\ne \varvec{n}'$. Since $e^{i\theta }(\mathcal{E}_{\varvec{n}}-\mathcal{E}_{\varvec{n}'})\ne 0$, (3.15) implies that the two energy eigenvalues $E_{\varvec{n}}$ and $E_{\varvec{n}'}$ are degenerate only at two values of $\theta $ for which the real part of $e^{i\theta }(\mathcal{E}_{\varvec{n}}-\mathcal{E}_{\varvec{n}'})$ vanishes. This means that the N-particle energy eigenvalues exhibit degeneracy at most at $D_\textrm{tot}(D_\textrm{tot}-1)$ different values of $\theta $, where we recalled that there are $D_\textrm{tot}$ distinct $\varvec{n}$’s. Except for these finitely many points in the continuous interval $[0,2\pi )$, the Hamiltonian has no degeneracy. $\square $

Proof of Theorem 3.2

Consider the model with $\theta =0$. Because of the reflection symmetry $\cos ((2\pi /L)\nu )=\cos (-(2\pi /L)\nu )$, we see from (3.12) that the energy eigenvalue $E_{\varvec{n}}$ depends only on $n_0$ and $n_\nu +n_{-\nu }$ for $\nu =1,\ldots ,(L-1)/2$. In particular, we get the same energy for $n_\nu =1$, $n_{-\nu }=0$ and $n_\nu =0$, $n_{-\nu }=1$. This means that $E_{\varvec{n}}$ is generally degenerate, and the maximum possible degree of degeneracy is $2^N$. We call such a degeneracy a trivial degeneracy.

We shall show that, in the model with $\theta =0$, there are no degeneracies other than the trivial ones.^{Footnote 10} Take occupation numbers $\varvec{n}$ and $\varvec{n}'$ for N particles such that $n_\nu +n_{-\nu }\ne n'_\nu +n'_{-\nu }$ for some $\nu $ (including $\nu =0$). The energy eigenvalues $E_{\varvec{n}}$ and $E_{\varvec{n}'}$ do not exhibit trivial degeneracy. Since $\zeta ^*=\zeta ^{-1}$, we see from (3.13) and (3.15) that

$$\begin{aligned} E_{\varvec{n}}-E_{\varvec{n}'}=\mathcal{E}_{\varvec{n}}+(\mathcal{E}_{\varvec{n}})^*-\mathcal{E}_{\varvec{n}'}-(\mathcal{E}_{\varvec{n}'})^* =\sum _{\nu =-(L-1)/2}^{(L-1)/2}\tilde{n}_\nu \,\zeta ^\nu , \end{aligned}$$

(3.23)

where we set $\tilde{n}_\nu =n_\nu +n_{-\nu }-n'_\nu -n'_{-\nu }$. Noting that $\sum _{\nu =-(L-1)/2}^{(L-1)/2}\zeta ^\nu =0$, we rewrite (3.23) as

$$\begin{aligned} E_{\varvec{n}}-E_{\varvec{n}'}= \sum _{\nu =-(L-1)/2}^{(L-1)/2}(\tilde{n}_\nu -\tilde{n}_0)\,\zeta ^\nu =\sum _{\mu =1}^{L-1}m_\mu \,\zeta ^\mu , \end{aligned}$$

(3.24)

where

$$\begin{aligned} m_\mu ={\left\{ \begin{array}{ll} \tilde{n}_\mu -\tilde{n}_0,&{}\mu =1,\ldots ,\frac{L-1}{2};\\ \tilde{n}_{\mu -L}-\tilde{n}_0,&{}\mu =\frac{L+1}{2},\ldots ,L-1. \end{array}\right. } \end{aligned}$$

(3.25)

We shall see at the end of the proof that $m_\mu \ne 0$ for some $\mu $. Then, noting that

$$\begin{aligned} \sum _{\mu =1}^{L-1}|m_\mu |\le \sum _{\nu =-(L-1)/2}^{(L-1)/2}\{|\tilde{n}_\nu |+|\tilde{n}_0|\}\le 4N+2L, \end{aligned}$$

(3.26)

we find from Lemma 3.4 that

$$\begin{aligned} |E_{\varvec{n}}-E_{\varvec{n}'}|\ge \frac{1}{(4N+2L)^{(L-3)/2}}. \end{aligned}$$

(3.27)

This, in particular, means that the energy eigenvalues $E_{\varvec{n}}$ and $E_{\varvec{n}'}$ are not degenerate.

We shall now examine the effect of nonzero $\theta $. We make the $\theta $-dependence of the energy eigenvalues explicit by writing $E^{(\theta )}_{\varvec{n}}$ instead of $E_{\varvec{n}}$.

Suppose for some $\varvec{n}\ne \varvec{n}'$ that $E^{(0)}_{\varvec{n}}=E^{(0)}_{\varvec{n}'}$, i.e., ${\text {Re}}\mathcal{E}_{\varvec{n}}={\text {Re}}\mathcal{E}_{\varvec{n}'}$. The two energy eigenvalues exhibit trivial degeneracy. Since $\mathcal{E}_{\varvec{n}}\ne \mathcal{E}_{\varvec{n}'}$ (see the proof of Theorem 3.1 above), we must have ${\text {Im}}\mathcal{E}_{\varvec{n}}\ne {\text {Im}}\mathcal{E}_{\varvec{n}'}$. Recalling that (3.15) implies $E^{(\theta )}_{\varvec{n}}=2\cos \theta \,{\text {Re}}\mathcal{E}_{\varvec{n}}-2\sin \theta \,{\text {Im}}\mathcal{E}_{\varvec{n}}$, we see $E^{(\theta )}_{\varvec{n}}\ne E^{(\theta )}_{\varvec{n}'}$ for any $\theta \ne 0,\pi $. Trivial degeneracies are completely lifted.

Since we have shown that the model is free from trivial degeneracies for $\theta $ with $0<|\theta |<\pi $, we look for a sufficient condition that additional (nontrivial) degeneracy is not generated when $\theta $ is varied slightly from 0. We observe from (3.12) that the resulting change in the energy eigenvalue is bounded as

$$\begin{aligned} |E^{(\theta )}_{\varvec{n}}-E^{(0)}_{\varvec{n}}|&\le 2\sum _{\nu =-(L-1)/2}^{(L-1)/2}n_\nu \,\biggl |\cos \Bigl (\frac{2\pi }{L}\nu +\theta \Bigr )-\cos \Bigl (\frac{2\pi }{L}\nu \Bigr )\biggr | \nonumber \\&<2\sum _{\nu =-(L-1)/2}^{(L-1)/2}n_\nu \,|\theta |=2N\,|\theta |, \end{aligned}$$

(3.28)

for any $\varvec{n}$ such that (3.10) holds. We then find from (3.27) that no additional degeneracy can be generated if

$$\begin{aligned} 2\times 2N\,|\theta |\le \frac{1}{(4N+2L)^{(L-3)/2}}. \end{aligned}$$

(3.29)

This is satisfied if the condition (3.8) in the theorem is valid.

It remains to prove that $m_\mu \ne 0$ for some $\mu $, where $m_\mu $ is defined in (3.25). To this end, we assume $m_\mu =0$ for all $\mu $. First, suppose $\tilde{n}_0=0$. We then have $\tilde{n}_\nu =0$ for all $\nu $, but this contradicts the basic assumption that $n_\nu +n_{-\nu }\ne n'_\nu +n'_{-\nu }$ for some $\nu $. Next, suppose $\tilde{n}_0\ne 0$. We then have $n_\nu +n_{-\nu }-n'_\nu -n'_{-\nu }=\tilde{n}_0\ne 0$ for any $\nu \ne 0$. But this implies $\sum _{\nu \ne 0}n_\nu -\sum _{\nu \ne 0}n'_\nu =\frac{L-1}{2}\tilde{n}_0$, which clearly contradicts the constraint on the total particle number, i.e., $\sum _{\nu }n_\nu =\sum _{\nu }n'_\nu =N$. $\square $

3.3 Justification of Assumption 2.2

We shall demonstrate that Assumption 2.2 about the particle distribution in the energy eigenstates is valid in the present free fermion chain. As in section 2.1, we disjointly decompose the chain $\Lambda =\{1,\ldots ,L\}$ as $\Lambda =\Lambda _1\cup \Lambda _2$ with $|\Lambda _1|=(L-1)/2$ and $|\Lambda _2|=(L+1)/2$. An obvious choice is $\Lambda _1=\{1,2,\ldots ,(L-1)/2\}$, but any subset will work similarly.

Let us decompose the creation operator $\hat{a}^\dagger _k$ defined in (3.3) as

$$\begin{aligned} \hat{a}^\dagger _k=\hat{b}^\dagger _{1,k}+\hat{b}^\dagger _{2,k}, \end{aligned}$$

(3.30)

where

$$\begin{aligned} \hat{b}^\dagger _{\alpha ,k}=\frac{1}{\sqrt{L}}\sum _{x\in \Lambda _\alpha }e^{ikx}\,\hat{c}^\dagger _x, \end{aligned}$$

(3.31)

with $\alpha =1,2$. Note that $\{\hat{b}^\dagger _{1,k},\hat{b}_{1,k'}\}$ with $k\ne k'$ is not necessarily vanishing. From (3.5), we obviously have

$$\begin{aligned} \hat{P}_1|\Psi _{\varvec{k}}\rangle =\hat{b}^\dagger _{1,k_1}\hat{b}^\dagger _{1,k_2}\ldots \hat{b}^\dagger _{1,k_N}|\Phi _\textrm{vac}\rangle , \end{aligned}$$

(3.32)

and hence

$$\begin{aligned} \langle \Psi _{\varvec{k}}|\hat{P}_1|\Psi _{\varvec{k}}\rangle&=\langle \Phi _\textrm{vac}|\hat{b}_{1,k_N}\ldots \hat{b}_{1,k_2}\hat{b}_{1,k_1} \hat{b}^\dagger _{1,k_1}\hat{b}^\dagger _{1,k_2}\ldots \hat{b}^\dagger _{1,k_N}|\Phi _\textrm{vac}\rangle \nonumber \\&\le \Vert \hat{b}_{1,k_1}\hat{b}^\dagger _{1,k_1}\Vert \,\langle \Phi _\textrm{vac}|\hat{b}_{1,k_N}\ldots \hat{b}_{1,k_2} \hat{b}^\dagger _{1,k_2}\ldots \hat{b}^\dagger _{1,k_N}|\Phi _\textrm{vac}\rangle . \end{aligned}$$

(3.33)

Here we used the basic property $\langle \Psi |\hat{A}|\Psi \rangle \le \Vert \hat{A}\Vert \langle \Psi |\Psi \rangle $ of the operator norm of an arbitrary operator $\hat{A}$. Noting that

$$\begin{aligned} \Vert \hat{b}_{1,k}\hat{b}^\dagger _{1,k}\Vert \le \frac{1}{2}, \end{aligned}$$

(3.34)

for any $k\in \mathcal{K}$ (as we shall show below), we get the desired bound (2.5) by repeatedly using (3.33).

To estimate the norm $\Vert \hat{b}_{1,k}\hat{b}^\dagger _{1,k}\Vert $, we first note by an explicit calculation that $\{\hat{b}_{1,k}, \hat{b}^\dagger _{1,k}\}=p$ with $p=(L-1)/(2L)\le 1/2$. Then by noting that $(\hat{b}_{1,k}\hat{b}^\dagger _{1,k})^2=p\,\hat{b}_{1,k}\hat{b}^\dagger _{1,k}$, we see that the self-adjoint operator $\hat{b}_{1,k}\hat{b}^\dagger _{1,k}$ has eigenvalues 0 and p. This means $\Vert \hat{b}_{1,k}\hat{b}^\dagger _{1,k}\Vert =p$, which implies (3.34).

It is clear that the above justification of Assumption 2.2 applies to a much more general class of free fermion systems. The only requirement is that there is a decomposition corresponding to (3.30) of the creation operators for single-particle energy eigenstates with the property (3.34). See Appendix B.2 for a class of examples.
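The resulting bound $\langle \Psi _{\varvec{k}}|\hat{P}_1|\Psi _{\varvec{k}}\rangle \le 2^{-N}$ can be checked directly for small systems. The sketch below relies on the standard fact that the squared norm of $\hat{b}^\dagger _{1,k_1}\cdots \hat{b}^\dagger _{1,k_N}|\Phi _\textrm{vac}\rangle $ equals the determinant of the Gram matrix with entries $\{\hat{b}_{1,k_i},\hat{b}^\dagger _{1,k_j}\}=L^{-1}\sum _{x\in \Lambda _1}e^{i(k_j-k_i)x}$. The choice $\Lambda _1=\{1,\ldots ,(L-1)/2\}$, the sample sizes, and all function names are ours.

```python
import cmath
import math
from itertools import combinations

def det(matrix):
    """Determinant of a complex square matrix by Gaussian elimination with pivoting."""
    a = [row[:] for row in matrix]
    n = len(a)
    result = 1.0 + 0.0j
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) == 0.0:
            return 0.0 + 0.0j
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            result = -result  # a row swap flips the sign
        result *= a[col][col]
        for r in range(col + 1, n):
            f = a[r][col] / a[col][col]
            for cc in range(col, n):
                a[r][cc] -= f * a[col][cc]
    return result

def momenta(L):
    """The k-space (3.2) for odd L."""
    return [2.0 * math.pi * nu / L for nu in range(-(L - 1) // 2, (L - 1) // 2 + 1)]

def prob_all_in_half(L, ks):
    """<Psi_k|P_1|Psi_k> as the Gram determinant of plane waves restricted to Lambda_1."""
    sites = range(1, (L - 1) // 2 + 1)  # Lambda_1 = {1, ..., (L-1)/2}
    gram = [[sum(cmath.exp(1j * (kj - ki) * x) for x in sites) / L for kj in ks]
            for ki in ks]
    return det(gram).real

L, N = 11, 3
worst = max(prob_all_in_half(L, ks) for ks in combinations(momenta(L), N))
print(worst, 2.0 ** (-N))
```

The diagonal entries of the Gram matrix all equal $p=(L-1)/(2L)$, so Hadamard's inequality gives the same bound $p^N\le 2^{-N}$ as the repeated use of (3.33).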

4 Discussion

We developed in Sect. 2 a general theory for the thermalization in low-density lattice gases. Under the two essential assumptions, namely, Assumption 2.2 about the particle distribution in energy eigenstates and Assumption 2.1 about nondegeneracy of energy eigenvalues, we have shown that the system exhibits thermalization (in a restricted sense) when the initial state is drawn randomly from the Hilbert space $\mathcal{H}_1$ in which all particles are confined in the half-lattice $\Lambda _1$. The essential observation, which is summarized in Theorem 2.3, is that Assumption 2.2 implies the lower bound (2.7) on the effective dimension of the initial state. Combined with standard arguments, the lower bound implies the desired statement about thermalization.

Then, in Sect. 3, we justified Assumptions 2.1 and 2.2 for a class of free fermion chains without relying on any unproven assumptions. Free fermion models, which have infinitely many conserved quantities, are often referred to as examples of systems that fail to thermalize. One might then be puzzled to see that we have established thermalization in free fermion chains. The essential point is in the choice of the initial state $|\Phi (0)\rangle $. In a non-interacting fermion model with translation invariance, for example, the momentum distribution does not change under the unitary time evolution. Thus the system never thermalizes if it starts from a state with non-thermal momentum distribution. In our case, the momentum distribution is thermal from the beginning because the initial state is chosen to be a thermal state but with particles confined in the sublattice $\Lambda _1$. In the language of generalized Gibbs ensembles, our generalized ensemble with the random initial state is characterized by local integrals of motion taking the same values as the equilibrium ones.

Although free fermion chains are the only examples in which we can fully justify our two assumptions, we stress that our general theory should apply to much more general models, most of which are non-integrable. Non-integrable models are believed to exhibit robust thermalization from an arbitrary realistic nonequilibrium initial state. When applied to such models, our thermalization theorem is expected to describe a partial aspect of thermalization exhibited by the model. We might say that our theory is general and broad enough to cover not only full-fledged thermalization in non-integrable systems but also (rather trivial) thermalization in free fermion chains.

As we have already discussed after Assumption 2.1, it is believed that energy eigenvalues are nondegenerate in a generic non-integrable model. Therefore, let us focus on Assumption 2.2, which asserts that the probability of finding all particles in the sublattice $\Lambda _1$ does not exceed $2^{-N}$ in any energy eigenstate as in (2.5). By accepting the assumption of nondegeneracy as a plausible one, we have two additional classes of models in which we can prove (2.5) as presented in Appendix B.

Assumption 2.2 is reminiscent of the (strong) ETH in the sense that we postulate that every energy eigenstate exhibits more or less uniform particle distributions. Although we are able to prove the bound (2.5) only for limited models, we expect that it is valid for all (or for a great majority of) energy eigenstates of a generic macroscopic quantum system. The bound does not hold, for example, in a state where a macroscopic number of particles form a big cluster and move together, but such a state cannot be an energy eigenstate of a model with short-range interactions. We, in particular, note that the average of the probability $\langle \Psi _j|\hat{P}_1|\Psi _j\rangle $ over all the energy eigenstates is

$$\begin{aligned} \frac{1}{D_\textrm{tot}}\sum _{j=1}^{D_\textrm{tot}}\langle \Psi _j|\hat{P}_1|\Psi _j\rangle =\frac{1}{D_\textrm{tot}}{\text {Tr}}[\hat{P}_1]=\frac{D_1}{D_\textrm{tot}}\sim 2^{-N}e^{-(L/2)\rho ^2}, \end{aligned}$$

(4.1)

and is much smaller than $2^{-N}$.
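The asymptotic form in (4.1) is easy to compare with exact binomial coefficients. The sketch below assumes an even L with $|\Lambda _1|=L/2$, so that $D_1=\binom{L/2}{N}$ and $D_\textrm{tot}=\binom{L}{N}$. The sample values of L and N and the function names are ours.

```python
from math import comb, exp

def ratio_exact(L, N):
    """D_1 / D_tot with D_1 = C(L/2, N) and D_tot = C(L, N); L even is assumed."""
    return comb(L // 2, N) / comb(L, N)

def ratio_asymptotic(L, N):
    """Right-hand side of (4.1): 2^{-N} exp(-(L/2) rho^2) with rho = N / L."""
    rho = N / L
    return 2.0 ** (-N) * exp(-(L / 2.0) * rho ** 2)

L, N = 10_000, 100
print(ratio_exact(L, N), ratio_asymptotic(L, N))
```

At low density the two expressions agree closely, and the extra factor $e^{-(L/2)\rho ^2}=e^{-\rho N/2}$ is what makes the average much smaller than $2^{-N}$ for macroscopic N.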

In Sect. 2, we only discussed thermalization in the sense that the ratio of the number of particles in a macroscopic region $\Gamma $ approaches its equilibrium value $\gamma $. It is, however, clear from the proof that our method automatically extends to other criteria for thermal equilibrium. Let $\hat{P}_\textrm{neq}$ be the projection operator onto the nonequilibrium subspace of $\mathcal{H}_\textrm{tot}$ determined by a certain criterion for thermal equilibrium. If the canonical expectation value of $\hat{P}_\textrm{neq}$ at infinite temperature satisfies

$$\begin{aligned} \langle \hat{P}_\textrm{neq}\rangle _\infty \le e^{-\kappa N}=e^{-\kappa \rho \,L}, \end{aligned}$$

(4.2)

with a constant $\kappa $ such that $\kappa >\rho $, then we can prove, exactly as in Theorem 2.4, that the expectation value $\langle \Phi (t)|\hat{P}_\textrm{neq}|\Phi (t)\rangle $ is extremely small for sufficiently large typical t, i.e., the system exhibits thermalization. Although we do not go into details, we expect that the assumption (4.2) is valid if one defines $\hat{P}_\textrm{neq}$ to be the projection onto the space where the total energy in a macroscopic region differs considerably from its expectation value in $\langle \cdot \rangle _\infty $.

We note, however, that if one employs a criterion of thermal equilibrium that involves, say, particle-particle correlation, then the assumption (4.2) with $\kappa >\rho $ for the corresponding nonequilibrium projection is never valid. This means that our theorem is simply powerless. This shortcoming is related to the limitation to low densities and reflects the limitation of our approach, which reflects our strategy to base the theory on mild assumptions.

Data Availability Statement

All data and information relevant to this study are presented in the paper.

Notes

It is a common misconception that the prediction of equilibrium statistical mechanics should always be compared with an averaged quantity in the corresponding physical system. In fact, the law of large numbers guarantees that the statistical mechanical expectation value accurately predicts the outcome of a single measurement in the equilibrium state, provided that both the system and the quantity to be measured are macroscopic.
Researchers who emphasize microscopic mechanisms may not call the process thermalization since everything is governed by free particle dynamics. Our point is to focus only on macroscopically observable phenomena, assuming that the observer has no access to microscopic mechanisms.
All the results in Sect. 2 are also valid for a system of hardcore bosons.
To be precise this is true only when the final state represents the equilibrium state at infinite temperature (as in our case). In general, if the initial state $|\Phi (0)\rangle $ has energy close to E then the effective dimension is believed to be close to the dimension of the corresponding energy shell, i.e., the Hilbert space spanned by energy eigenstates whose eigenvalues are close to E. One can argue, although very heuristically, that a large effective dimension is a consequence of (a strong form of) ETH. Consider a system described by a short-ranged translation-invariant Hamiltonian $\hat{H}$ and assume that ETH is valid. For simplicity, we take the initial state $|\Phi (0)\rangle $ to be a translation invariant product state. (We assume $|\Phi (0)\rangle $ is not an eigenstate of $\hat{H}$.) Then $|\Phi (0)\rangle $ has energy distribution peaked around some value E. Let $|\Psi _j\rangle $ be the eigenstate of $\hat{H}$ with eigenvalue $E_j$. Since ETH asserts that energy eigenstates with close eigenvalues are similar to each other, it is reasonable to assume that the overlap $|\langle \Phi (0)|\Psi _j\rangle |^2$ is almost independent of j as long as $E_j\simeq E$. This implies that $D_\textrm{eff}$ is almost identical to the dimension of the energy shell around E.
We note that the diagonal entropy $S_\textrm{d}$ studied in these works is believed to be related to the effective dimension as $D_\textrm{eff}\sim \exp [S_\textrm{d}]$.
${\varDelta }\hat{H}$ may not be small in the class of models considered in Appendix B.
The theorem was proved by one of us in [53]. See also Proposition 10.1 in [17] for a similar statement for a slightly complicated model.
See [55] for elementary proofs of the two lemmas.
We learned the lemma and its proof from Wataru Kai and Kazuaki Miyatani.
As is suggested by this conclusion, one can prove, by using essentially the same argument, the absence of degeneracy in certain open fermion chains with a suitable boundary potential.
In this trivial model, the energy eigenvalues for a pair of sites (x, 1) and (x, 2) are either zero (when there are no particles), $\pm s_x+w_x$ (when there is one particle), or $2w_x+u_x$ (when there are two particles). The total energy eigenvalues are the sums of these eigenvalues and are nondegenerate if we choose $s_x$, $w_x$, and $u_x$ properly.
A non-optimal but simple example of a Golomb ruler is obtained by taking N such that $L=2^N-1$ is a (Mersenne) prime, and setting $x_j=2^{j-1}$ for $j=1,\ldots ,N$. In this case, the particle density $\rho \simeq N/2^N$ is exponentially small in N.
In a Golomb ruler, $x_{j}-x_{Q(j)}-x_{P'(j)}+x_{Q'(j)}=0\mod L$ holds only if (1) $j=P'(j)$ and $Q(j)=Q'(j)$, or (2) $j=Q(j)$ and $P'(j)=Q'(j)$. Now we decompose the set $\{1,2,\ldots , N\}$ into two subsets, A and B, where (1) holds in A and (2) holds in B. Then, $P'$, Q, $Q'$ can be expressed in the form of $P'=\textrm{id}^A\oplus \pi ^A$, $Q=\pi ^B\oplus \textrm{id}^B$, and $Q'=\pi ^A\oplus \pi ^B$, where $\pi ^A$ and $\pi ^B$ represent permutations on A and B, respectively. With the above form of permutations, we easily see $(-1)^{P'QQ'}=1$ if $x_{j}-x_{Q(j)}-x_{P'(j)}+x_{Q'(j)}=0\mod L$ holds for any j.
A derangement is a permutation in which no entry stays at the original position.
The floor function $\lfloor x\rfloor $ is the largest integer less than or equal to x.

References

There is a 27-minute video that explains the main results of the present work: https://youtu.be/eUIX8WdJftc
von Neumann, J.: Beweis des Ergodensatzes und des $H$-Theorems in der neuen Mechanik, Z. Phys. 57, 30 (1929); English translation (by R. Tumulka), Proof of the Ergodic Theorem and the H-Theorem in Quantum Mechanics, The European Phys. J. H 35 201–237 (2010). arxiv:1003.2133
Kinoshita, T., Wenger, T., Weiss, D.S.: A quantum Newton’s cradle. Nature 440, 900 (2006)
Trotzky, S., Chen, Y.-A., Flesch, A., McCulloch, I.P., Schollwöck, U., Eisert, J., Bloch, I.: Probing the relaxation towards equilibrium in an isolated strongly correlated one-dimensional Bose gas. Nat. Phys. 8, 325 (2012). arxiv:1101.2659
Gring, M., Kuhnert, M., Langen, T., Kitagawa, T., Rauer, B., Schreitl, M., Mazets, I., Adu Smith, D., Demler, E., Schmiedmayer, J.: Relaxation and Prethermalization in an Isolated Quantum System. Science 337, 1318 (2012). arxiv:1112.0013
Langen, T., Erne, S., Geiger, R., Rauer, B., Schweigler, T., Kuhnert, M., Rohringer, W., Mazets, I.E., Gasenzer, T., Schmiedmayer, J.: Experimental observation of a generalized Gibbs ensemble. Science 348, 207 (2015). arxiv:1411.7185
Kaufman, A.M., Tai, M.E., Lukin, A., Rispoli, M., Schittko, R., Preiss, P.M., Greiner, M.: Quantum thermalization through entanglement in an isolated many-body system. Science 353, 794 (2016). arxiv:1603.04409
D’Alessio, L., Kafri, Y., Polkovnikov, A., Rigol, M.: From quantum chaos and eigenstate thermalization to statistical mechanics and thermodynamics, Adv. Phys. 65, 239–362 (2016). arxiv:1509.06411
Gogolin, C., Eisert, J.: Equilibration, thermalisation, and the emergence of statistical mechanics in closed quantum systems. Rep. Prog. Phys. 79, 056001 (2016). arxiv:1503.07538
Goldstein, S., Lebowitz, J.L., Tumulka, R., Zanghì, N.: Long-time behavior of macroscopic quantum systems: Commentary accompanying the English translation of John von Neumann’s 1929 article on the quantum ergodic theorem. European Phys. J. H 35, 173–200 (2010). arxiv:1003.2129
Rigol, M., Srednicki, M.: Alternatives to Eigenstate Thermalization. Phys. Rev. Lett. 108, 110601 (2012). arxiv:1108.0928
Deutsch, J.M.: Quantum statistical mechanics in a closed system. Phys. Rev. A 43, 2046 (1991)
Srednicki, M.: Chaos and quantum thermalization. Phys. Rev. E 50, 888 (1994)
Tasaki, H.: From Quantum Dynamics to the Canonical Distribution: General Picture and a Rigorous Example. Phys. Rev. Lett. 80, 1373–1376 (1998). arxiv:cond-mat/9707253
Goldstein, S., Lebowitz, J.L., Mastrodonato, C., Tumulka, R., Zanghì, N.: On the Approach to Thermal Equilibrium of Macroscopic Quantum Systems. Phys. Rev. E 81, 011109 (2010). arxiv:0911.1724
Reimann, P.: Generalization of von Neumann’s Approach to Thermalization, Phys. Rev. Lett. 115, 010403 (2015). arxiv:1507.00262
Tasaki, H.: Typicality of thermal equilibrium and thermalization in isolated macroscopic quantum systems, J. Stat. Phys. 163, 937–997 (2016). arxiv:1507.06479
Reimann, P.: Foundation of Statistical Mechanics under Experimentally Realistic Conditions. Phys. Rev. Lett. 101, 190403 (2008). arxiv:0810.3092
Linden, N., Popescu, S., Short, A.J., Winter, A.: Quantum mechanical evolution towards thermal equilibrium. Phys. Rev. E 79, 061103 (2009). arxiv:0812.2385
Reimann, P., Kastner, M.: Equilibration of isolated macroscopic quantum systems, New J. Phys. 14, 043020 (2012). http://iopscience.iop.org/1367-2630/14/4/043020
Reimann, P.: Equilibration of Isolated Macroscopic Quantum Systems under Experimentally Realistic Conditions. Phys. Scr. 86, 058512 (2012). arxiv:1210.5821
Goldstein, S., Hara, T., Tasaki, H.: The approach to equilibrium in a macroscopic quantum system for a typical nonequilibrium subspace, preprint (2014). arxiv:1402.3380
Shiraishi, N.: Analytic model of thermalization: Quantum emulation of classical cellular automata, Phys. Rev. E 97, 062144 (2018). arxiv:1709.06315
Grad, H.: Statistical mechanics, thermodynamics, and fluid dynamics of systems with an arbitrary number of integrals. Pure and Applied Mathematics 5, 455 (1952)
Jancel, R.: Foundations of Classical and Quantum Statistical Mechanics. Pergamon Press, UK (1969)
Cazalilla, M.A.: Effect of Suddenly Turning on Interactions in the Luttinger Model. Phys. Rev. Lett. 97, 156403 (2006)
Rigol, M., Dunjko, V., Yurovsky, V., Olshanii, M.: Relaxation in a Completely Integrable Many-Body Quantum System: An Ab Initio Study of the Dynamics of the Highly Excited States of 1D Lattice Hard-Core Bosons. Phys. Rev. Lett. 98, 050405 (2007). arxiv:cond-mat/0604476
Basko, D.M., Aleiner, I.L., Altshuler, B.L.: Metal-insulator transition in a weakly interacting many-electron system with localized single-particle states. Ann. Phys. 321, 1126 (2006). arxiv:cond-mat/0506617
Pal, A., Huse, D.A.: Many-body localization phase transition. Phys. Rev. B 82, 174411 (2010). arxiv:1010.1992
Serbyn, M., Papić, Z., Abanin, D.A.: Local Conservation Laws and the Structure of the Many-Body Localized States. Phys. Rev. Lett. 111, 127201 (2013). arxiv:1305.5554
Friesdorf, M., Werner, A.H., Brown, W., Scholz, V.B., Eisert, J.: Many-Body Localization Implies that Eigenvectors are Matrix-Product States. Phys. Rev. Lett. 114, 170505 (2015). arxiv:1409.1252
Nandkishore, R., Huse, D.A.: Many body localization and thermalization in quantum statistical mechanics. Ann. Rev. Cond. Matt. Phys. 6, 15 (2015). arxiv:1404.0686
Imbrie, J.Z.: On Many-Body Localization for Quantum Spin Chains. J. Stat. Phys. 163, 998 (2016). arxiv:1403.7837
Bernien, H., Schwartz, S., Keesling, A., Levine, H., Omran, A., Pichler, H., Choi, S., Zibrov, A.S., Endres, M., Greiner, M., Vuletic, V., Lukin, M.D.: Probing many-body dynamics on a 51-atom quantum simulator. Nature 551, 579 (2017). arxiv:1707.04344
Shiraishi, N., Mori, T.: Systematic Construction of Counterexamples to Eigenstate Thermalization Hypothesis. Phys. Rev. Lett. 119, 030601 (2017). arxiv:1702.08227
Mori, T., Shiraishi, N.: Thermalization without eigenstate thermalization hypothesis after a quantum quench. Phys. Rev. E 96, 022153 (2017). arxiv:1707.05921
Turner, C.J., Michailidis, A.A., Abanin, D.A., Serbyn, M., Papić, Z.: Weak ergodicity breaking from quantum many-body scars. Nature Physics 14, 745 (2018). arxiv:1711.03528
Moudgalya, S., Rachel, S., Bernevig, B.A., Regnault, N.: Exact Excited States of Non-Integrable Models. Phys. Rev. B 98, 235155 (2018). arxiv:1708.05021
Moudgalya, S., Regnault, N., Bernevig, B.A.: Entanglement of exact excited states of Affleck-Kennedy-Lieb-Tasaki models: Exact results, many-body scars, and violation of the strong eigenstate thermalization hypothesis. Phys. Rev. B 98, 235156 (2018). arxiv:1806.09624
Lin, C.-J., Motrunich, O.I.: Exact Quantum Many-Body Scar States in the Rydberg-Blockaded Atom Chain. Phys. Rev. Lett. 122, 173401 (2019). arxiv:1810.00888
Ho, W.W., Choi, S., Pichler, H., Lukin, M.D.: Periodic Orbits, Entanglement, and Quantum Many-Body Scars in Constrained Models: Matrix Product State Approach. Phys. Rev. Lett. 122, 040603 (2019). arxiv:1807.01815
Shiraishi, N.: Connection between quantum-many-body scars and the AKLT model from the viewpoint of embedded Hamiltonians. J. Stat. Mech. 083103 (2019). arxiv:1904.05182
Serbyn, M., Abanin, D.A., Papić, Z.: Quantum Many-Body Scars and Weak Breaking of Ergodicity. Nat. Phys. 17, 675 (2021). arxiv:2011.09486
Shiraishi, N., Matsumoto, K.: Undecidability in quantum thermalization, Nature Comm. 12, 5084 (2021). https://www.nature.com/articles/s41467-021-25053-0
Rigol, M., Muramatsu, A., Olshanii, M.: Hard-core bosons on optical superlattices: Dynamics and relaxation in the superfluid and insulating regimes, Phys. Rev. A 74, 053616 (2006). arxiv:cond-mat/0612415
Rigol, M., Fitzpatrick, M.: Initial state dependence of the quench dynamics in integrable quantum systems. Phys. Rev. A 84, 033640 (2011). arxiv:1107.5811
Pandey, S., Bhat, J.M., Dhar, A., Goldstein, S., Huse, D.A., Kulkarni, M., Kundu, A., Lebowitz, J.L.: Boltzmann entropy of a freely expanding quantum ideal gas, J. Stat. Phys. 190, article number 142, (2023). arxiv:2303.12330
Santos, L.F., Polkovnikov, A., Rigol, M.: Entropy of Isolated Quantum Systems after a Quench. Phys. Rev. Lett. 107, 040601 (2011). arxiv:1103.0557
Rigol, M.: Quantum Quenches in the Thermodynamic Limit. Phys. Rev. Lett. 112, 170601 (2014). arxiv:1401.2160
Rigol, M.: Fundamental Asymmetry in Quenches Between Integrable and Nonintegrable Systems, Phys. Rev. Lett. 116, 100601 (2016). arxiv:1511.04447
Farrelly, T., Brandao, F.G.S.L., Cramer, M.: Thermalization and Return to Equilibrium on Finite Quantum Lattice Systems, Phys. Rev. Lett. 118, 140601 (2017). arxiv:1610.01337
Ullah, N.: Invariance hypothesis and higher correlations of Hamiltonian matrix elements, Nuc. Phys. 58, 65 (1964). https://www.sciencedirect.com/science/article/pii/002955826490522X
Tasaki, H.: The approach to thermal equilibrium and “thermodynamic normality” — An observation based on the works by Goldstein, Lebowitz, Mastrodonato, Tumulka, and Zanghi in 2009, and by von Neumann in 1929, (unpublished note 2010). arxiv:1003.5424
Abdul-Rahman, H., Stolz, G.: A uniform area law for the entanglement of eigenstates in the disordered XY chain, J. Math. Phys. 56, 121901 (2015). arxiv:1505.02117
Tasaki, H.: Two number-theoretic theorems (that we found useful for quantum physics) and their elementary proofs, YouTube video (2023). https://youtu.be/YrCoBv0acgs
Tignol, J.-P.: Galois’ Theory of Algebraic Equations (second edition), (World Scientific, 2015)
Ireland, K., Rosen, M.: A Classical Introduction to Modern Number Theory, Graduate Texts in Mathematics. Springer, Berlin (1990)
Stewart, I., Tall, D.: Algebraic number theory and Fermat’s last theorem. Chapman and Hall, USA (2020)
Mathworld, Golomb ruler. https://mathworld.wolfram.com/GolombRuler.html
Singer, J.: A Theorem in Finite Projective Geometry and Some Applications to Number Theory. Trans. Am. Math. Soc. 43, 377–385 (1938). https://www.jstor.org/stable/1990067
Erdös, P., Turán, P.: On a problem of Sidon in additive number theory and some related problems, J. London Math. Soc. 16, 212–215 (1941). https://londmathsoc.onlinelibrary.wiley.com/doi/abs/10.1112/jlms/s1-16.4.212
Mathworld, derangement. https://mathworld.wolfram.com/Derangement.html
Teufel, S., Tumulka, R., Vogel, C.: Time Evolution of Typical Pure States from a Macroscopic Hilbert Subspace, J. Stat. Phys. 190, 69 (2023). arxiv:2210.10018

Acknowledgements

It is a pleasure to thank Shelly Goldstein, Takashi Hara, Shu Nakamura, and Marcos Rigol for useful discussions. We also thank Shin Nakano for his patient guidance in number theory, and Wataru Kai and Kazuaki Miyatani for letting us know of Lemma 3.4 and its proof. N.S. was supported by JSPS Grants-in-Aid for Early-Career Scientists No. JP19K14615, and H.T. by JSPS Grants-in-Aid for Scientific Research No. 22K03474.

Author information

Authors and Affiliations

Faculty of Arts and Sciences, University of Tokyo, 3-8-1 Komaba, Meguro-ku, Tokyo, Japan
Naoto Shiraishi
Department of Physics, Gakushuin University, Mejiro, Toshima-ku, Tokyo, 171-8588, Japan
Hal Tasaki

Corresponding author

Correspondence to Hal Tasaki.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Additional information

Communicated by Anatoli Polkovnikov.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

A: Models With Degenerate Energy Eigenvalues

Our general discussion in Sect. 2 is based on the crucial assumption, Assumption 2.1, that all the energy eigenvalues are nondegenerate. Here we shall see how one can treat models in which the degree of degeneracy is at most $d_\textrm{max}$. We find that our thermalization results remain valid as long as $d_\textrm{max}$ is not too large. Unfortunately, we do not know of any examples where a nontrivial upper bound for the degree of degeneracy is known.

Let $E_j$ with $j=1,\ldots ,N_\textrm{el}$ be the distinct energy eigenvalues. We denote by $|\Psi _{j,\ell }\rangle $ with $\ell =1,\ldots ,d_j$ the energy eigenstates corresponding to $E_j$, where $d_j$ is the degree of degeneracy of $E_j$. We assume that the collection of $|\Psi _{j,\ell }\rangle $ with all j, $\ell $ forms an orthonormal basis of $\mathcal{H}_\textrm{tot}$.

We first examine the discussion in Sect. 2.2 about the effective dimension. A straightforward generalization of the definition (2.6) of the effective dimension is

$$\begin{aligned} D_\textrm{eff}=\biggl (\sum _{j=1}^{N_\textrm{el}}\sum _{\ell =1}^{d_j}\bigl |\left\langle \Phi (0)|\Psi _{j,\ell }\right\rangle \bigr |^4\biggr )^{-1}. \end{aligned}$$

(A.1)

When energy eigenvalues are degenerate, however, it is convenient to employ the definition

$$\begin{aligned} \widetilde{D}_\textrm{eff}=\biggl (\sum _{j=1}^{N_\textrm{el}}\langle \Phi (0)|\hat{P}_j|\Phi (0)\rangle ^2\biggr )^{-1}, \end{aligned}$$

(A.2)

where $\hat{P}_j=\sum _{\ell =1}^{d_j}|\Psi _{j,\ell }\rangle \langle \Psi _{j,\ell }|$ is the projection onto the space corresponding to the energy eigenvalue $E_j$. Clearly, (A.2) reduces to the original (A.1) when there is no degeneracy. To evaluate (A.2), we note that

$$\begin{aligned} \langle \Phi (0)|\hat{P}_j|\Phi (0)\rangle ^2&=\sum _{\ell ,\ell '=1}^{d_j}\bigl |\left\langle \Phi (0)|\Psi _{j,\ell }\right\rangle \bigr |^2\,\bigl |\left\langle \Phi (0)|\Psi _{j,\ell '}\right\rangle \bigr |^2 \nonumber \\&\le \frac{1}{2}\sum _{\ell ,\ell '=1}^{d_j}\Bigl (\bigl |\left\langle \Phi (0)|\Psi _{j,\ell }\right\rangle \bigr |^4+\bigl |\left\langle \Phi (0)|\Psi _{j,\ell '}\right\rangle \bigr |^4\Bigr ) \nonumber \\&=d_j\sum _{\ell =1}^{d_j}\bigl |\left\langle \Phi (0)|\Psi _{j,\ell }\right\rangle \bigr |^4, \end{aligned}$$

(A.3)

where we noted $ab\le (a^2+b^2)/2$ to get the second line. We thus find

$$\begin{aligned} \widetilde{D}_\textrm{eff}^{-1}\le \sum _{j=1}^{N_\textrm{el}}d_j\sum _{\ell =1}^{d_j}\bigl |\left\langle \Phi (0)|\Psi _{j,\ell }\right\rangle \bigr |^4 \le d_\textrm{max}\sum _{j=1}^{N_\textrm{el}}\sum _{\ell =1}^{d_j}\bigl |\left\langle \Phi (0)|\Psi _{j,\ell }\right\rangle \bigr |^4 =\frac{d_\textrm{max}}{D_\textrm{eff}} \end{aligned}$$

(A.4)

where $d_\textrm{max}=\max _jd_j$.
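
The chain of inequalities (A.4) can be checked numerically in a toy setting. The minimal sketch below draws a random normalized state, treats the weights `p[j][l]` as stand-ins for $|\langle \Phi (0)|\Psi _{j,\ell }\rangle |^2$, and evaluates both (A.1) and (A.2). The degeneracy pattern, the random amplitudes, and the function name are hypothetical choices made only for illustration.

```python
import random


def effective_dimensions(degeneracies, seed=0):
    """Return (D_eff, D_eff_tilde) for a random normalized state.

    degeneracies[j] plays the role of d_j; the weights p[j][l] stand for
    |<Phi(0)|Psi_{j,l}>|^2 and are generated at random (an assumption made
    purely for illustration).
    """
    rng = random.Random(seed)
    amplitudes = [
        [complex(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(d)]
        for d in degeneracies
    ]
    norm_sq = sum(abs(a) ** 2 for level in amplitudes for a in level)
    p = [[abs(a) ** 2 / norm_sq for a in level] for level in amplitudes]

    inv_d_eff = sum(w**2 for level in p for w in level)    # definition (A.1)
    inv_d_eff_tilde = sum(sum(level) ** 2 for level in p)  # definition (A.2)
    return 1.0 / inv_d_eff, 1.0 / inv_d_eff_tilde


degeneracies = [1, 3, 2, 4, 1, 2]  # hypothetical d_j
d_max = max(degeneracies)
d_eff, d_eff_tilde = effective_dimensions(degeneracies)

# (A.4): 1 / D_eff_tilde <= d_max / D_eff, i.e. D_eff <= d_max * D_eff_tilde.
print(d_eff, d_eff_tilde, d_max * d_eff_tilde)
```

In this sketch one also finds $\widetilde{D}_\textrm{eff}\le D_\textrm{eff}$, since grouping weights within a degenerate level can only increase the sum of squares, and the two definitions coincide when every $d_j$ equals one.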

Suppose that (2.5) in Assumption 2.2 is valid for the energy eigenstates $|\Psi _{j,\ell }\rangle $. Then Theorem 2.3 guarantees the crucial lower bound (2.7) for $D_\textrm{eff}$ defined as (A.1). We thus find from (A.4) that

$$\begin{aligned} \frac{D_\textrm{tot}}{\widetilde{D}_\textrm{eff}}\le d_\textrm{max}\,e^{\rho N}. \end{aligned}$$

(A.5)

We see that $\widetilde{D}_\textrm{eff}$ is large provided that $d_\textrm{max}$ is not too large. Note that the degeneracy does not essentially change the behavior of the effective dimension if $d_\textrm{max}$ grows subexponentially in N.

We move on to the discussion in Sect. 2.3 about the time evolution. Taking into account the degeneracy, the expression (2.14) for the time evolution reads

$$\begin{aligned} |\Phi (t)\rangle =e^{-i\hat{H}t}|\Phi (0)\rangle =\sum _{j=1}^{N_\textrm{el}}e^{-iE_jt}\hat{P}_j|\Phi (0)\rangle =\sum _{j=1}^{N_\textrm{el}}e^{-iE_jt}\,|\widetilde{\Psi }_j\rangle \, \sqrt{\langle \Phi (0)|\hat{P}_j|\Phi (0)\rangle }, \end{aligned}$$

(A.6)

where we defined

$$\begin{aligned} |\widetilde{\Psi }_j\rangle =\frac{\hat{P}_j|\Phi (0)\rangle }{\Vert \hat{P}_j|\Phi (0)\rangle \Vert }. \end{aligned}$$

(A.7)

Correspondingly, (2.20) is modified as

$$\begin{aligned} \lim _{T\uparrow \infty }\frac{1}{T}\int _0^Tdt\,\langle \Phi (t)|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Phi (t)\rangle&=\sum _{j=1}^{N_\textrm{el}}\langle \Phi (0)|\hat{P}_j|\Phi (0)\rangle \langle \widetilde{\Psi }_j|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\widetilde{\Psi }_j\rangle \nonumber \\&\le \sqrt{ \biggl (\sum _{j=1}^{N_\textrm{el}}\langle \Phi (0)|\hat{P}_j|\Phi (0)\rangle ^2\biggr ) \biggl (\sum _{j=1}^{N_\textrm{el}}\langle \widetilde{\Psi }_j|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\widetilde{\Psi }_j\rangle ^2\biggr ) } \nonumber \\&\le \sqrt{ \biggl (\sum _{j=1}^{N_\textrm{el}}\langle \Phi (0)|\hat{P}_j|\Phi (0)\rangle ^2\biggr ) \biggl (\sum _{j=1}^{N_\textrm{el}}\sum _{\ell =1}^{d_j}\langle \Psi _{j,\ell }|\hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}|\Psi _{j,\ell }\rangle ^2\biggr ) } \nonumber \\&\le \sqrt{ D_\textrm{tot}\widetilde{D}_\textrm{eff}^{-1}\,\langle \hat{P}^{\Gamma ,\varepsilon }_\textrm{neq}\rangle _\infty }. \end{aligned}$$

(A.8)

Therefore the rest of the discussion remains valid if we replace $D_\textrm{eff}$ with $\widetilde{D}_\textrm{eff}$.

B: Models Satisfying Assumption 2.2

In this Appendix, we present two classes of models in which we can prove Assumption 2.2 about the particle distribution (under suitable assumptions about nondegeneracy). If we could also justify Assumption 2.1 about the nondegeneracy of energy eigenvalues, we would have further rigorous examples of thermalization. Unfortunately, we still do not know how nondegeneracy can be proved, although we believe it to be highly plausible.

The first class of models is that of interacting fermions on a specific class of lattices, while the second class is that of free fermions on arbitrary lattices with $\mathbb {Z}_2$ symmetry.

B.1. Interacting Fermions on Double-Lattice

First, we discuss a class of lattice gas models on a double lattice with special symmetry. In these models, we can easily verify the bound (2.5) for any energy eigenstate corresponding to a nondegenerate energy eigenvalue. See Lemma B.1 below. This means that Assumption 2.2 about the particle distribution is automatically valid if Assumption 2.1 about nondegeneracy of energy eigenvalues is valid. The class of models in fact contains many non-trivial interacting models for which we generally expect that energy eigenvalues are nondegenerate. We thus expect that the present class contains many examples in which our thermalization theorem, Theorem 2.4, is valid. Unfortunately, we are not able to prove nondegeneracy in concrete models, except for trivial decoupled models. See the discussion at the end of the present subsection.

We shall describe the class of models and state the basic observation, i.e., Lemma B.1. Although we here describe models of fermions for notational simplicity, extensions to hardcore bosons or quantum spin systems are trivial.

Let $\Lambda _0$ be a lattice with L/2 sites, and $\Lambda _1$ and $\Lambda _2$ be copies of $\Lambda _0$. Sites in $\Lambda _1$ and $\Lambda _2$ are denoted as (x, 1) and (x, 2), respectively, with $x\in \Lambda _0$. We consider a model of fermions on the whole lattice $\Lambda =\Lambda _1\cup \Lambda _2$.

We assume that the Hamiltonian $\hat{H}$ conserves the total particle number and is invariant under the exchange of two sites (x, 1) and (x, 2) for each $x\in \Lambda _0$. The latter is a highly nontrivial (and artificial) assumption, which enables us to prove the desired bound (2.5) easily. To be more precise we define for each $x\in \Lambda _0$ the unitary operator $\hat{U}_x$ that swaps (x, 1) and (x, 2). It is defined by $\hat{U}_x|\Phi _\textrm{vac}\rangle =|\Phi _\textrm{vac}\rangle $, $\hat{U}_x\,\hat{c}_{(x,1)}\hat{U}_x^\dagger =\hat{c}_{(x,2)}$, $\hat{U}_x\,\hat{c}_{(x,2)}\hat{U}_x^\dagger =\hat{c}_{(x,1)}$, and $\hat{U}_x\,\hat{c}_{(y,\nu )}\hat{U}_x^\dagger =\hat{c}_{(y,\nu )}$ for $y\ne x$ and $\nu =1,2$. Note that $(\hat{U}_x)^2=\hat{1}$. Our symmetry assumption is that $[\hat{U}_x,\hat{H}]=0$ for any $x\in \Lambda _0$.

If we restrict ourselves to models with standard particle-hopping and two-body interactions, the most general Hamiltonian takes the form

$$\begin{aligned} \hat{H}&=\mathop {\sum _{x,y\in \Lambda _0}}_{(x\ne y)}\bigl \{t_{x,y}(\hat{c}^\dagger _{(x,1)}+\hat{c}^\dagger _{(x,2)})(\hat{c}_{(y,1)}+\hat{c}_{(y,2)}) +\frac{v_{x,y}}{2}(\hat{n}_{(x,1)}+\hat{n}_{(x,2)})(\hat{n}_{(y,1)}+\hat{n}_{(y,2)})\bigr \} \nonumber \\&\qquad +\sum _{x\in \Lambda _0}\bigl \{s_x(\hat{c}^\dagger _{(x,1)}\hat{c}_{(x,2)}+\hat{c}^\dagger _{(x,2)}\hat{c}_{(x,1)})+w_x(\hat{n}_{(x,1)}+\hat{n}_{(x,2)})+u_x\,\hat{n}_{(x,1)}\hat{n}_{(x,2)}\bigr \}, \end{aligned}$$

(B.1)

where $t_{x,y}=(t_{y,x})^*\in \mathbb {C}$, $v_{x,y}=v_{y,x}\in \mathbb {R}$, and $s_x,w_x,u_x\in \mathbb {R}$. We defined the number operator by $\hat{n}_{(x,\sigma )}=\hat{c}^\dagger _{(x,\sigma )}\hat{c}_{(x,\sigma )}$.

Here is the basic observation in the present appendix.

Lemma B.1

Let $|\Psi \rangle $ be the normalized eigenstate corresponding to a nondegenerate energy eigenvalue of $\hat{H}$. Then we have

$$\begin{aligned} \langle \Psi |\hat{P}_1|\Psi \rangle \le 2^{-N}, \end{aligned}$$

(B.2)

which is the same as (2.5).

Proof

For a fixed particle number N, we define the basis states of the model by

$$\begin{aligned} |\Xi _{S_1,S_2}\rangle =\Bigl (\prod _{x\in S_1}\hat{c}^\dagger _{(x,1)}\Bigr )\Bigl (\prod _{x\in S_2}\hat{c}^\dagger _{(x,2)}\Bigr )|\Phi _\textrm{vac}\rangle , \end{aligned}$$

(B.3)

where $S_1$ and $S_2$ are arbitrary subsets of $\Lambda _0$ such that $|S_1|+|S_2|=N$. Take any normalized eigenstate $|\Psi \rangle $ of $\hat{H}$ and expand it in the above basis as

$$\begin{aligned} |\Psi \rangle =\mathop {\sum _{S_1,S_2\subset \Lambda _0}}_{(|S_1|+|S_2|=N)}\psi _{S_1,S_2}|\Xi _{S_1,S_2}\rangle , \end{aligned}$$

(B.4)

where $\psi _{S_1,S_2}\in \mathbb {C}$ are coefficients which satisfy $\sum |\psi _{S_1,S_2}|^2=1$. The symmetry of the Hamiltonian and the nondegeneracy imply $\hat{U}_x|\Psi \rangle =\pm |\Psi \rangle $ for any $x\in \Lambda _0$. This means that the expansion coefficients satisfy

$$\begin{aligned} |\psi _{S,\emptyset }|=|\psi _{S\backslash S',S'}|, \end{aligned}$$

(B.5)

for any $S\subset \Lambda _0$ with $|S|=N$ and any $S'\subset S$. We thus have

$$\begin{aligned} |\psi _{S,\emptyset }|^2=\frac{1}{2^N}\sum _{S'\subset S}|\psi _{S\backslash S',S'}|^2, \end{aligned}$$

(B.6)

which, when summed over S, yields

$$\begin{aligned} \mathop {\sum _{S\subset \Lambda _0}}_{(|S|=N)}|\psi _{S,\emptyset }|^2=\frac{1}{2^N}\mathop {\sum _{S\subset \Lambda _0}}_{(|S|=N)}\sum _{S'\subset S}|\psi _{S\backslash S',S'}|^2\le \frac{1}{2^N}\mathop {\sum _{S_1,S_2\subset \Lambda _0}}_{(|S_1|+|S_2|=N)}|\psi _{S_1,S_2}|^2=\frac{1}{2^N}. \end{aligned}$$

(B.7)

Noting that the left-hand side is $\langle \Psi |\hat{P}_1|\Psi \rangle $, we get (B.2). $\square $

The model considered here is generally non-integrable, and we expect that its energy eigenvalues are nondegenerate. It is desirable to find models in which the absence of degeneracy can be proved rigorously.

Unfortunately, the only case in which we can prove nondegeneracy is a trivial decoupled model with $t_{x,y}=v_{x,y}=0$ for all $x,y\in \Lambda _0$. We readily see that the energy eigenvalues are nondegenerate if $s_x$, $w_x$, and $u_x$ with $x\in \Lambda _0$ are chosen to be different from each other.^{Footnote 11} Therefore we can fully justify our main theorem, Theorem 2.4, for the model, but we should note that the result is trivial. In the initial state $|\Phi (0)\rangle $, each pair of sites (x, 1) and (x, 2) is either empty or occupied by one particle at (x, 1). The time evolution then takes place independently for each pair of sites. If there is a particle in a pair, then a superposition of two states with a particle at (x, 1) and at (x, 2) is generated. This, when viewed macroscopically, results in thermalization. We must say that there is nothing interesting in this observation.
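
The nondegeneracy of the decoupled model can be illustrated by brute-force enumeration. The minimal sketch below uses the single-pair energies $0$, $w_x\pm s_x$, and $2w_x+u_x$ and forms all total energies as sums over pairs. The integer parameters $(s_x,w_x,u_x)=(4^x,\,2\cdot 4^x,\,-2\cdot 4^x)$ are a hypothetical choice, not taken from the paper, made so that every total energy is a distinct base-4 number and the check can be done in exact arithmetic. The function names are likewise illustrative.

```python
from itertools import product


def pair_levels(s: int, w: int, u: int):
    """(particle number, energy) for one decoupled pair of sites (x,1), (x,2)."""
    return [(0, 0), (1, w - s), (1, w + s), (2, 2 * w + u)]


def spectrum_by_particle_number(params):
    """Group total energies of the decoupled model by total particle number."""
    sectors = {}
    for choice in product(*(pair_levels(*p) for p in params)):
        n_total = sum(n for n, _ in choice)
        e_total = sum(e for _, e in choice)
        sectors.setdefault(n_total, []).append(e_total)
    return sectors


# Hypothetical integer parameters (s_x, w_x, u_x) = (4^x, 2*4^x, -2*4^x):
# the pair energies become {0, 1, 3, 2} * 4^x, so every total energy is a
# distinct base-4 number and nondegeneracy can be checked exactly.
M = 5  # number of sites in Lambda_0
params = [(4**x, 2 * 4**x, -2 * 4**x) for x in range(M)]
sectors = spectrum_by_particle_number(params)

nondegenerate = all(len(es) == len(set(es)) for es in sectors.values())
print(nondegenerate)

# Counterexample: identical parameters on every pair give degeneracies.
uniform = spectrum_by_particle_number([(1, 2, -2)] * M)
degenerate = any(len(es) != len(set(es)) for es in uniform.values())
print(degenerate)
```

The second part of the sketch shows why some care in choosing the parameters is needed: with the same values on every pair, the one-particle sector alone already contains repeated energies.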

B.2. Free Fermions With $\mathbb {Z}_2$ Symmetry

Next, we discuss a class of free fermion models in which the bound (2.5) for the particle distribution, and hence Assumption 2.2 can be justified. We here follow the strategy outlined at the end of Sect. 3.3 and justify the inequality (3.34) for the fermion operators corresponding to single-particle energy eigenstates.

Let $\Lambda $ be an arbitrary lattice, and consider the most general free fermion Hamiltonian

$$\begin{aligned} \hat{H}=\sum _{x,y\in \Lambda }t_{x,y}\,\hat{c}^\dagger _x\hat{c}_y, \end{aligned}$$

(B.8)

where the hopping amplitude satisfies $t_{x,y}=(t_{y,x})^*\in \mathbb {C}$. Note that the diagonal element $t_{x,x}\in \mathbb {R}$ represents the single-body potential.

We assume that the model has $\mathbb {Z}_2$ symmetry in the sense that there is a one-to-one map $p:\Lambda \rightarrow \Lambda $ such that $p^2=\textrm{id}$, and that the Hamiltonian is invariant under the transformation p, i.e., $t_{p(x),p(y)}=t_{x,y}$ for any $x,y\in \Lambda $. We also assume that $\Lambda $ is disjointly decomposed as $\Lambda =\Lambda _1\cup \Lambda _2$ and that $p(\Lambda _1)\subset \Lambda _2$.

As an example, consider the chain $\Lambda =\{1,\ldots ,L\}$ with odd L, and let p be the inversion $p(x)=L+1-x$. Then the decomposition with $\Lambda _1=\{1,\ldots ,(L-1)/2\}$ and $\Lambda _2=\{(L+1)/2,\ldots ,L\}$ satisfies the above property.

Let $\varvec{\psi }=(\psi _x)_{x\in \Lambda }$ be a normalized single-particle energy eigenstate. To be precise, it satisfies the Schrödinger equation $\epsilon \,\psi _x=\sum _{y\in \Lambda }t_{x,y}\psi _y$ for all $x\in \Lambda $ with the (single-particle) energy eigenvalue $\epsilon $. Let us further assume that the energy eigenvalue $\epsilon $ is nondegenerate. Then, with respect to the symmetry transformation p, the corresponding wave function $\varvec{\psi }$ is either symmetric, i.e., $\psi _{p(x)}=\psi _x$ for all $x\in \Lambda $, or antisymmetric, i.e., $\psi _{p(x)}=-\psi _x$ for all $x\in \Lambda $. We then see that

$$\begin{aligned} \sum _{x\in \Lambda _1}|\psi _x|^2=\frac{1}{2}\sum _{x\in \Lambda _1}\bigl (|\psi _x|^2+|\psi _{p(x)}|^2\bigr ) \le \frac{1}{2}\sum _{x\in \Lambda }|\psi _x|^2=\frac{1}{2}, \end{aligned}$$

(B.9)

where we noted that $p(\Lambda _1)\subset \Lambda \backslash \Lambda _1$.
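
The bound (B.9) can be checked explicitly in the chain example with inversion symmetry. The minimal sketch below assumes a specific model that is not singled out in the text, namely the uniform nearest-neighbor open chain, whose single-particle eigenstates are $\psi _x=\sqrt{2/(L+1)}\,\sin (\pi kx/(L+1))$ with $k=1,\ldots ,L$ and are nondegenerate. The function names and the value L = 21 are illustrative.

```python
from math import pi, sin, sqrt


def open_chain_eigenstate(L: int, k: int):
    """k-th eigenstate (k = 1..L) of the uniform nearest-neighbor open chain.

    psi_x = sqrt(2 / (L + 1)) * sin(pi * k * x / (L + 1)) for x = 1..L.
    This specific hopping model is an assumption chosen for illustration.
    """
    return [sqrt(2 / (L + 1)) * sin(pi * k * x / (L + 1)) for x in range(1, L + 1)]


def weight_on_lambda1(psi):
    """Sum of |psi_x|^2 over Lambda_1 = {1, ..., (L - 1) / 2} for odd L."""
    L = len(psi)
    return sum(abs(a) ** 2 for a in psi[: (L - 1) // 2])


L = 21  # odd chain length, so that inversion p(x) = L + 1 - x fixes one site
weights = [weight_on_lambda1(open_chain_eigenstate(L, k)) for k in range(1, L + 1)]
print(max(weights))  # (B.9) predicts a value not exceeding 1/2
```

Each of these eigenstates is symmetric or antisymmetric under $p(x)=L+1-x$, so the weight on $\Lambda _1$ equals one half of the total weight minus one half of the weight on the central site, which is consistent with (B.9).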

Let $\hat{a}^\dagger _{\varvec{\psi }}=\sum _{x\in \Lambda }\psi _x\,\hat{c}^\dagger _x$ be the creation operator of the state $\varvec{\psi }$. It can be decomposed as $\hat{a}^\dagger _{\varvec{\psi }}=\hat{b}^\dagger _{1,\varvec{\psi }}+\hat{b}^\dagger _{2,\varvec{\psi }}$ with $\hat{b}^\dagger _{1,\varvec{\psi }}=\sum _{x\in \Lambda _1}\psi _x\,\hat{c}^\dagger _x$ and $\hat{b}^\dagger _{2,\varvec{\psi }}=\sum _{x\in \Lambda _2}\psi _x\,\hat{c}^\dagger _x$. This corresponds to the decomposition (3.30). We also see from (B.9) that $\hat{b}^\dagger _{1,\varvec{\psi }}$ satisfies $\Vert \hat{b}_{1,\varvec{\psi }}\,\hat{b}^\dagger _{1,\varvec{\psi }}\Vert =\sum _{x\in \Lambda _1}|\psi _x|^2\le 1/2$, which corresponds to the desired (3.34).

We now assume that the single-particle energy eigenvalues $\epsilon _1,\ldots ,\epsilon _{|\Lambda |}$ are all nondegenerate, and denote by $\hat{a}^\dagger _j$ the creation operator of the single-particle energy eigenstate corresponding to $\epsilon _j$. Then the foregoing discussion shows that each $\hat{a}^\dagger _j$ is decomposed as (3.30), and the operator for the sublattice $\Lambda _1$ satisfies the bound (3.34). Repeating the derivation in Sect. 3.3, we see that an N-body energy eigenstate of the form

$$\begin{aligned} |\Psi \rangle =\hat{a}^\dagger _{j_1}\cdots \hat{a}^\dagger _{j_N}|\Phi _\textrm{vac}\rangle , \end{aligned}$$

(B.10)

satisfies the desired bound (2.5).

Interestingly, it was only necessary to assume the nondegeneracy of single-particle energy eigenvalues to prove the desired bound (2.5) in this model. To ensure the presence of thermalization, we have to assume further that N-body energy eigenvalues are nondegenerate. It is rather likely that degeneracy is absent in a sufficiently complex free fermion model, but we do not know how to justify the claim. We also note that the p-symmetry may not be exact. It can be violated by a small perturbation as long as the bound (2.5) remains valid.

C: Effective Dimensions of Some Initial Particle Configurations in the Free Fermion Chain

In the main text, the initial state $|\Phi (0)\rangle $ is drawn randomly from the small Hilbert space $\mathcal{H}_1$. Conceptually speaking, it may be desirable to consider the time evolution starting from a non-random simple initial state. Here we again treat free fermion chains and examine the effective dimensions of some initial states in which particles have definite positions.

In Sect. C.2, we observe that the initial state where particles are arranged in a periodic manner has an effective dimension that is large but not large enough to guarantee thermalization. This observation suggests that a random initial state is mandatory in a free fermion model if we demand the effective dimension to be extremely large. This is very likely to be a common property for integrable models. In a non-integrable model, on the other hand, it is believed that even a regular initial state generally has an effective dimension almost as large as the total dimension.

In Sect. C.3, we consider an artificial class of initial configurations (Golomb ruler configurations) and show that the corresponding effective dimensions are almost as large as the total dimension. This leads to a statement about thermalization with a non-random initial state. In this class of models, however, the particle density inevitably tends to zero according to $\rho \sim N^{-1}$ as the particle number grows.

C.1. General Formula for $D_\textrm{eff}$

We consider the free fermion chain as defined in section 3. Let the initial particle configuration be $\varvec{x}=(x_1,x_2,\ldots ,x_N)$ with $x_j\in \{1,\ldots ,L\}$ such that $x_j<x_{j+1}$ for $j=1,\ldots ,N-1$, and define the corresponding N fermion state as

$$\begin{aligned} |\Phi _{\varvec{x}}\rangle =\hat{c}^\dagger _{x_1}\hat{c}^\dagger _{x_2}\cdots \hat{c}^\dagger _{x_N}|\Phi _\textrm{vac}\rangle . \end{aligned}$$

(C.1)

We set $|\Phi _{\varvec{x}}\rangle $ as the initial state $|\Phi (0)\rangle $. Then we see from (2.6) that the effective dimension is given by

$$\begin{aligned} D_\textrm{eff}^{-1}=\sum _{\varvec{k}\in \tilde{\mathcal{K}}_N}\bigl |\langle \Phi _{\varvec{x}}|\Psi _{\varvec{k}}\rangle \bigr |^4, \end{aligned}$$

(C.2)

where $\tilde{\mathcal{K}}_N=\{(k_1,\ldots ,k_N)\,|\,k_j<k_{j+1}\}\subset \mathcal{K}^N$. (The k-space $\mathcal{K}$ is defined in (3.2).) Noting that (3.3) implies $\{\hat{c}_x,\hat{a}^\dagger _k\}=e^{ikx}/\sqrt{L}$, we see

$$\begin{aligned} \langle \Phi _{\varvec{x}}|\Psi _{\varvec{k}}\rangle =\langle \Phi _\textrm{vac}|\hat{c}_{x_N}\cdots \hat{c}_{x_1}\hat{a}^\dagger _{k_1}\cdots \hat{a}^\dagger _{k_N}|\Phi _\textrm{vac}\rangle =L^{-N/2}\sum _P(-1)^P\prod _{j=1}^Ne^{ik_jx_{P(j)}}, \end{aligned}$$

(C.3)

where the summation is over all possible N! permutations P of $\{1,\ldots ,N\}$ and $(-1)^P$ is the signature of P. It is useful to regard $\varvec{k}$ in the above expression as an element in $\mathcal{K}^N$ rather than its physical subspace $\tilde{\mathcal{K}}_N$. This replacement is justified since $\bigl |\langle \Phi _{\varvec{x}}|\Psi _{\varvec{k}}\rangle \bigr |$ is invariant under any permutations of $k_1,\ldots ,k_N$ and equals zero if $k_j=k_{j'}$ for some $j\ne j'$. We can thus rewrite (C.2) as

$$\begin{aligned} D_\textrm{eff}^{-1}=\frac{1}{N!}\sum _{\varvec{k}\in \mathcal{K}^N}\bigl |\langle \Phi _{\varvec{x}}|\Psi _{\varvec{k}}\rangle \bigr |^4. \end{aligned}$$

(C.4)

This rewriting is useful since one can now sum independently over $k_1,\ldots ,k_N\in \mathcal{K}$.
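For very small systems, (C.2) can also be evaluated by brute force, which gives a reference value for the closed formulas derived in this appendix. The sketch below uses the fact that the signed sum over permutations in (C.3) is the determinant of the matrix with entries $e^{ik_jx_\ell }$. It assumes that $\mathcal{K}$ consists of the L momenta $2\pi n/L$ (the definition (3.2) is not reproduced here); a common shift of all momenta multiplies each column of the matrix by a phase and therefore does not change the modulus of the determinant. The function name is hypothetical.

```python
import itertools
import numpy as np

def effective_dimension(x, L):
    """Brute-force D_eff of |Phi_x> from (C.2), with overlaps computed as in (C.3)."""
    x = np.asarray(x, dtype=float)
    N = len(x)
    ks = 2 * np.pi * np.arange(L) / L                      # ASSUMPTION: k-space without phase shift
    inv_d_eff = 0.0
    for idx in itertools.combinations(range(L), N):        # ordered momenta k_1 < ... < k_N
        mat = np.exp(1j * np.outer(ks[list(idx)], x))      # mat[j, l] = exp(i k_j x_l)
        overlap = np.linalg.det(mat) / L ** (N / 2)        # signed sum over permutations
        inv_d_eff += abs(overlap) ** 4
    return 1.0 / inv_d_eff

print(effective_dimension([2, 4], 4))  # close to 4 for this two-particle example
```

The cost grows as $\binom{L}{N}$ determinants, so this is only meant for checking small cases.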

From (C.3), we see that

$$\begin{aligned} \bigl |\langle \Phi _{\varvec{x}}|\Psi _{\varvec{k}}\rangle \bigr |^2&= \frac{1}{L^N}\sum _{P,Q}(-1)^{PQ}\prod _{j=1}^Ne^{ik_j(x_{P(j)}-x_{Q(j)})} \nonumber \\&=\frac{N!}{L^N} +\frac{1}{L^N}\mathop {\sum _{P,Q}}_{(P\ne Q)}(-1)^{PQ}\prod _{j=1}^Ne^{ik_j(x_{P(j)}-x_{Q(j)})}, \end{aligned}$$

(C.5)

and

$$\begin{aligned} \bigl |\langle \Phi _{\varvec{x}}|\Psi _{\varvec{k}}\rangle \bigr |^4=C_1+C_2(\varvec{k})+C_3(\varvec{k}), \end{aligned}$$

(C.6)

with

$$\begin{aligned} C_1=\Bigl (\frac{N!}{L^N}\Bigr )^2,\quad C_2(\varvec{k})=\frac{2N!}{L^{2N}}\mathop {\sum _{P,Q}}_{(P\ne Q)}(-1)^{PQ}\prod _{j=1}^Ne^{ik_j(x_{P(j)}-x_{Q(j)})},\end{aligned}$$

(C.7)

$$\begin{aligned} C_3(\varvec{k})=\frac{1}{L^{2N}}\mathop {\sum _{P,Q,P',Q'}}_{(P\ne Q,\,P'\ne Q')}(-1)^{PQP'Q'} \prod _{j=1}^Ne^{ik_j\{(x_{P(j)}-x_{Q(j)})-(x_{P'(j)}-x_{Q'(j)})\}}. \end{aligned}$$

(C.8)

We shall evaluate the sum (C.4) by using the decomposition (C.6). Clearly

$$\begin{aligned} \frac{1}{N!}\sum _{\varvec{k}\in \mathcal{K}^N}C_1=\frac{N!}{L^N}. \end{aligned}$$

(C.9)

The remaining sums are evaluated by using the standard formula

$$\begin{aligned} \sum _{k\in \mathcal{K}}e^{ikx}={\left\{ \begin{array}{ll} L,&{}x=0\mod L;\\ 0,&{}\text {otherwise}, \end{array}\right. } \end{aligned}$$

(C.10)

where $x\in \mathbb {Z}$. Note that in the expression for $C_2(\varvec{k})$ in (C.7), one has $x_{P(j)}-x_{Q(j)}\ne 0$ for at least one j because $P\ne Q$. We thus see

$$\begin{aligned} \frac{1}{N!}\sum _{\varvec{k}\in \mathcal{K}^N}C_2(\varvec{k})=0. \end{aligned}$$

(C.11)

The sum of $C_3(\varvec{k})$ is generally nonzero and can be evaluated as

$$\begin{aligned} \frac{1}{N!}\sum _{\varvec{k}\in \mathcal{K}^N}C_3(\varvec{k})&= \frac{1}{L^NN!}\mathop {\sum _{P,Q,P',Q'}}_{(P\ne Q,\,P'\ne Q')}(-1)^{PQP'Q'}\nonumber \\&\quad \cdot \prod _{j=1}^N\chi [(x_{P(j)}-x_{Q(j)})-(x_{P'(j)}-x_{Q'(j)})=0\mod L], \end{aligned}$$

(C.12)

where the characteristic function is defined as $\chi [\text {true}]=1$ and $\chi [\text {false}]=0$. Let us write the right-hand side of (C.12) as $\mathcal{S}_{\varvec{x}}/L^N$. From (C.4), (C.6), (C.9), (C.11), and (C.12), we see that the effective dimension of the initial state $|\Phi (0)\rangle =|\Phi _{\varvec{x}}\rangle $ is given by

$$\begin{aligned} D_\textrm{eff}=\frac{L^N}{N!+\mathcal{S}_{\varvec{x}}}. \end{aligned}$$

(C.13)

Our main task is to evaluate the sum $\mathcal{S}_{\varvec{x}}$ defined in (C.12) for a given particle configuration $\varvec{x}$. For later convenience we sum over P in (C.12) (and write $P^{-1}Q$, $P^{-1}P'$, and $P^{-1}Q'$ as Q, $P'$, and $Q'$, respectively) to rewrite the expression as

$$\begin{aligned} \mathcal{S}_{\varvec{x}}=\mathop {\sum _{Q,P',Q'}}_{(Q\ne \textrm{id},\,P'\ne Q')}(-1)^{QP'Q'} \prod _{j=1}^N\chi [x_{j}-x_{Q(j)}-x_{P'(j)}+x_{Q'(j)}=0\mod L]. \nonumber \\ \end{aligned}$$

(C.14)
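The sum (C.14) can be evaluated directly for a handful of particles by enumerating all triples of permutations. The following sketch does this and then applies (C.13). It is an illustration only: the function names are hypothetical, positions are passed as a tuple, and the cost grows as $(N!)^3$.

```python
import itertools
from math import factorial

def perm_sign(perm):
    """Signature of a permutation, from the parity of its number of inversions."""
    n = len(perm)
    inversions = sum(1 for i in range(n) for j in range(i + 1, n) if perm[i] > perm[j])
    return -1 if inversions % 2 else 1

def s_x(x, L):
    """Evaluate the sum (C.14) for the configuration x on a chain of length L."""
    N = len(x)
    perms = list(itertools.permutations(range(N)))
    identity = tuple(range(N))
    total = 0
    for Q in perms:
        if Q == identity:
            continue
        for P1 in perms:            # plays the role of P'
            for Q1 in perms:        # plays the role of Q'
                if P1 == Q1:
                    continue
                if all((x[j] - x[Q[j]] - x[P1[j]] + x[Q1[j]]) % L == 0 for j in range(N)):
                    total += perm_sign(Q) * perm_sign(P1) * perm_sign(Q1)
    return total

def d_eff_from_formula(x, L):
    """Effective dimension from (C.13)."""
    return L ** len(x) / (factorial(len(x)) + s_x(x, L))

print(s_x((2, 4), 4), d_eff_from_formula((2, 4), 4))
```

For the two-particle configuration $\varvec{x}=(2,4)$ on $L=4$ sites, only the two triples with $Q$ equal to the transposition contribute, each with sign $+1$.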

C.2. $D_\textrm{eff}$ in Periodic Configurations

First, we consider regular particle configurations with a period $p=1,2,\ldots $. Fix p, and choose the chain length L and the particle number N such that $L=pN$. We consider the initial particle distribution given by

$$\begin{aligned} x_j=pj, \end{aligned}$$

(C.15)

for $j=1,\ldots ,N$.

Then (C.14) is computed as

$$\begin{aligned} \mathcal{S}_{\varvec{x}}&=\mathop {\sum _{Q,P',Q'}}_{(Q\ne \textrm{id},\,P'\ne Q')}(-1)^{QP'Q'} \prod _{j=1}^N\chi [pj-{pQ(j)}-{pP'(j)}+{pQ'(j)}=0\mod L] \nonumber \\&=\mathop {\sum _{Q,P',Q'}}_{(Q\ne \textrm{id},\,P'\ne Q')}(-1)^{QP'Q'} \prod _{j=1}^N\chi [j-{Q(j)}-{P'(j)}+{Q'(j)}=0\mod N], \end{aligned}$$

(C.16)

which depends only on N and is independent of L and p. Thus, we can evaluate the above sum by employing a convenient choice of p. Fortunately, the sum becomes trivial for $p=1$, so we compute it in that case. A fermion system with $L=N$, which is fully filled, has a one-dimensional Hilbert space and hence $D_\textrm{eff}=1$. We then see from (C.13) that $\mathcal{S}_{\varvec{x}}=N^N-N!$. Recalling that $\mathcal{S}_{\varvec{x}}$ is independent of L, we get a remarkably simple result

$$\begin{aligned} D_\textrm{eff}=\Bigl (\frac{L}{N}\Bigr )^N=e^{-(\rho \log \rho )L}, \end{aligned}$$

(C.17)

for any L and N (such that $L=pN$), where $\rho =1/p$ is the particle density. We thus see that the effective dimension grows exponentially with the system size L, as expected. But it turns out that it is not large enough. Combining (C.17) with (2.1), we see

$$\begin{aligned} \frac{D_\textrm{tot}}{D_\textrm{eff}}\sim e^{\{-(1-\rho )\log (1-\rho )\}L}=e^{\{\rho +O(\rho ^2)\}L}, \end{aligned}$$

(C.18)

and hence $D_\textrm{eff}$ is considerably smaller compared with the total dimension $D_\textrm{tot}$. This conclusion is consistent with the numerical result in [46]. We conclude that our strategy of the proof of Theorem 2.4 is ineffective for this initial state. Interestingly, it was found numerically in [45, 46] that the free fermion chain with similar initial states exhibits thermalization in some sense.
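To get a feeling for the size of the gap in (C.18), one can compare the exact value of $L^{-1}\log (D_\textrm{tot}/D_\textrm{eff})$, computed from $D_\textrm{tot}=\binom{L}{N}$ and (C.17), with the asymptotic exponent $-(1-\rho )\log (1-\rho )$. The sketch below is only an illustration, with hypothetical function names; it assumes $p\ge 2$ so that $\rho <1$.

```python
from math import comb, log

def exact_exponent(p, N):
    """(1/L) log(D_tot / D_eff) for the periodic configuration x_j = p j with L = p N."""
    L = p * N
    log_d_tot = log(comb(L, N))   # D_tot = binom(L, N)
    log_d_eff = N * log(p)        # D_eff = (L/N)^N = p^N, see (C.17)
    return (log_d_tot - log_d_eff) / L

def asymptotic_exponent(p):
    """Exponent -(1 - rho) log(1 - rho) in (C.18), with rho = 1/p."""
    rho = 1 / p
    return -(1 - rho) * log(1 - rho)

for N in (10, 100, 1000):
    print(N, exact_exponent(4, N), asymptotic_exponent(4))
```

The difference between the two exponents is a Stirling correction of order $(\log L)/L$, so the exact value approaches the asymptotic one from below as N grows at fixed p.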

C.3. $D_\textrm{eff}$ in Golomb-Ruler Configurations

We next discuss a class of particle configurations for which the effective dimension $D_\textrm{eff}$ is easily evaluated and turns out to be almost as large as the total dimension $D_\textrm{tot}$. In these settings, however, the particle density inevitably approaches zero as N gets larger.

A sequence of natural numbers $\varvec{x}=(x_1,\ldots ,x_N)$ is called a Golomb ruler [59] if for any $j\ne k$, one has $x_j-x_k=x_\ell -x_m$ only when $j=\ell $ and $k=m$. The periodic boundary counterpart (in which one replaces the condition $x_j-x_k=x_\ell -x_m$ by $x_j-x_k=x_\ell -x_m \mod L$) is called a modular Golomb ruler. The optimal (minimum) system size L of a modular Golomb ruler for given N is $L=N(N-1)+1$, since $x_j-x_k$ with $j\ne k$ must take $N(N-1)$ distinct nonzero values modulo L. The optimal configuration, if it exists, is called a perfect difference set. Interestingly, perfect difference sets are proven to exist if $N-1$ is a prime power $p^n$ [60].

We set the configuration of N particles as a modular Golomb ruler $\varvec{x}=(x_1,\ldots ,x_N)$ ($x_1<x_2<\cdots <x_N$). Such a configuration can be obtained from an ordinary Golomb ruler: by taking $x_1=1$ and choosing the system size L as a prime such that $L\ge 2x_N-1$, we see that for any $j\ne k$, one has $x_j-x_k=x_\ell -x_m\mod L$ only when $j=\ell $ and $k=m$ (i.e., $\varvec{x}$ is a modular Golomb ruler). A nontrivial and asymptotically optimal example^{Footnote 12} of a Golomb ruler can be found in [61], where the following sequence

$$\begin{aligned} x_j=1+2N(j-1)+\{(j-1)^2\mod N\}, \end{aligned}$$

(C.19)

for $j=1,\ldots ,N$ with a prime $N>2$ is shown to be a Golomb ruler. Since $1+2N(N-1)\le x_N\le 1+2N(N-1)+N-1$, the aforementioned construction gives the chain length $L\simeq 4N^2$ and the particle density $\rho \simeq (4N)^{-1}$. Note that the optimal (densest) Golomb ruler has density $\rho =O(N^{-1})=O(L^{-1/2})$, and thus the above construction is asymptotically optimal.
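The construction (C.19) and the choice of a prime $L\ge 2x_N-1$ are easy to check by direct enumeration for small N. The sketch below is an illustration with hypothetical helper names; the trial-division `next_prime` is only adequate for small arguments.

```python
def ruler(N):
    """Marks x_1, ..., x_N of (C.19); N is meant to be an odd prime."""
    return [1 + 2 * N * (j - 1) + ((j - 1) ** 2 % N) for j in range(1, N + 1)]

def is_golomb(x, modulus=None):
    """True if all differences x_j - x_k (j != k) are distinct, optionally modulo `modulus`."""
    seen = set()
    for a in x:
        for b in x:
            if a == b:
                continue
            d = a - b if modulus is None else (a - b) % modulus
            if d == 0 or d in seen:
                return False
            seen.add(d)
    return True

def next_prime(n):
    """Smallest prime >= n, by trial division."""
    while n < 2 or any(n % q == 0 for q in range(2, int(n ** 0.5) + 1)):
        n += 1
    return n

N = 5
x = ruler(N)
L = next_prime(2 * x[-1] - 1)   # prime chain length with L >= 2 x_N - 1
print(x, L, is_golomb(x), is_golomb(x, modulus=L), N / L)
```

The modular property follows because the positive differences are at most $x_N-1$, while the negative ones, reduced modulo L, are at least $L-(x_N-1)\ge x_N$, so the two families cannot collide.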

We shall fix an arbitrary initial particle configuration $\varvec{x}$ that forms a modular Golomb ruler and evaluate its effective dimension. We first bound the sign factor in (C.14) as $(-1)^{QP'Q'}\le 1$ to get

$$\begin{aligned} \mathcal{S}_{\varvec{x}}\le \mathop {\sum _{Q,P',Q'}}_{(Q\ne \textrm{id},\,P'\ne Q')} \prod _{j=1}^N\chi [x_{j}-x_{Q(j)}-x_{P'(j)}+x_{Q'(j)}=0\mod L]. \end{aligned}$$

(C.20)

In fact, it can be shown that this is an equality^{Footnote 13}, but the upper bound is enough for our purpose.

Let us fix a permutation $Q\ne \textrm{id}$, and examine the conditions for $\prod _{j=1}^N\chi [\cdots ]=1$, i.e., $x_{j}-x_{Q(j)}-x_{P'(j)}+x_{Q'(j)}=0\mod L$ for all $j=1,\ldots ,N$. If j is such that $Q(j)\ne j$, the condition for $\varvec{x}$ implies $P'(j)=j$ and $Q'(j)=Q(j)$. We see there is no choice for $P'(j)$ and $Q'(j)$. If $j=Q(j)$, on the other hand, the only requirement is $P'(j)=Q'(j)$. There is some freedom for choosing $P'(j)$ and $Q'(j)$.

Let $n_Q$ be the number of j such that $Q(j)=j$. Since $Q\ne \textrm{id}$, we see $n_Q=0,1,\ldots ,N-2$. From the above consideration, we see that there are $n_Q!$ choices for $P'$ (and thus $Q'$) for fixed Q. We thus find

$$\begin{aligned} (\text {RHS of } (C.20))=\mathop {\sum _{Q}}_{(Q\ne \textrm{id})}n_Q!=\sum _{n=0}^{N-2}n!\,\mathcal{N}(n), \end{aligned}$$

(C.21)

where $\mathcal{N}(n)$ is the number of $Q\ne \textrm{id}$ such that $n_Q=n$. The value of $\mathcal{N}(n)$ is computed explicitly as

$$\begin{aligned} \mathcal{N}(n)=\left( {\begin{array}{c}N\\ n\end{array}}\right) d_{N-n}, \end{aligned}$$

(C.22)

where $d_m$ is the m-th de Montmort number (also known as the m-th derangement number or the subfactorial of m). The de Montmort number $d_m$ counts the number of derangements^{Footnote 14} on m elements. Fortunately, the de Montmort number $d_m$ is explicitly computed as

$$\begin{aligned} d_m=\left\lfloor \frac{m!+1}{e}\right\rfloor \end{aligned}$$

(C.23)

with the floor function^{Footnote 15}$\lfloor \cdot \rfloor $ [62]. This expression, with (C.20) and (C.21), leads to a simple upper bound
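Both (C.22) and (C.23) can be confirmed by enumeration for small arguments. The following sketch, with hypothetical function names, computes $d_m$ from the standard recurrence $d_m=(m-1)(d_{m-1}+d_{m-2})$ and counts permutations by their number of fixed points. The floor formula is evaluated in floating point, so the comparison is only meaningful for moderate m.

```python
import itertools
from math import comb, e, factorial, floor

def derangements(m):
    """d_m from the recurrence d_m = (m - 1)(d_{m-1} + d_{m-2}) with d_0 = 1, d_1 = 0."""
    a, b = 1, 0                          # d_0, d_1
    for i in range(2, m + 1):
        a, b = b, (i - 1) * (a + b)
    return a if m == 0 else b

def count_by_fixed_points(N):
    """counts[n] = number of permutations of N elements with exactly n fixed points."""
    counts = [0] * (N + 1)
    for perm in itertools.permutations(range(N)):
        counts[sum(perm[j] == j for j in range(N))] += 1
    return counts

print([derangements(m) for m in range(1, 8)])
print([floor((factorial(m) + 1) / e) for m in range(1, 8)])
print(count_by_fixed_points(5))
```

Note that (C.23) is used here only for $m=N-n\ge 2$; at $m=0$ the floor expression gives 0, whereas $d_0=1$ by convention.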

$$\begin{aligned} \mathcal{S}_{\varvec{x}}\le \sum _{n=0}^{N-2} \frac{N!}{e}\biggl (1+\frac{1}{(N-n)!}\biggr ) =\frac{N!}{e}\biggl (N-1+\sum _{m=2}^N\frac{1}{m!}\biggr ) \le \frac{N!}{e}(N-3+e). \end{aligned}$$

(C.24)

Substituting this into (C.13), we can bound the effective dimension from below as

$$\begin{aligned} D_\textrm{eff}\ge \frac{eL^N}{(N+2e-3)N!}. \end{aligned}$$

(C.25)

Thus the ratio between the total dimension and the effective dimension is bounded as

$$\begin{aligned} \frac{D_\textrm{tot}}{D_\textrm{eff}}\le \frac{(N+2e-3)\,L!}{e(L-N)!\,L^N}\le \frac{N+2e-3}{e}. \end{aligned}$$

(C.26)

Note that $D_\textrm{tot}=\left( {\begin{array}{c}L\\ N\end{array}}\right) $ is approximated by $(L/N)^N$ when $N\ll L$, and hence grows super-exponentially in N. (If we take the initial configuration (C.19) then $D_\textrm{tot}\sim (4N)^N$.) This means that $D_\textrm{eff}$ satisfying (C.26) is extremely close to $D_\textrm{tot}$.
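The chain of bounds (C.21) to (C.26) can be evaluated exactly for small systems. The sketch below computes the sum in (C.21), checks it against (C.24), and compares the resulting bound on $D_\textrm{tot}/D_\textrm{eff}$ with the closed form in (C.26). The function names are hypothetical, and the sample pairs $(N,L)$ are those produced by (C.19) together with the smallest prime $L\ge 2x_N-1$ for $N=5$ and $N=7$.

```python
from math import comb, e, factorial

def derangements(m):
    """d_m from the recurrence d_m = (m - 1)(d_{m-1} + d_{m-2}) with d_0 = 1, d_1 = 0."""
    a, b = 1, 0
    for i in range(2, m + 1):
        a, b = b, (i - 1) * (a + b)
    return a if m == 0 else b

def golomb_sum(N):
    """Right-hand side of (C.21): sum over n = 0..N-2 of n! * binom(N, n) * d_{N-n}."""
    return sum(factorial(n) * comb(N, n) * derangements(N - n) for n in range(N - 1))

def ratio_bounds(N, L):
    """Bound on D_tot/D_eff from the exact sum (C.21), and the closed-form bound (C.26)."""
    from_sum = comb(L, N) * (factorial(N) + golomb_sum(N)) / L ** N
    closed_form = (N + 2 * e - 3) / e
    return from_sum, closed_form

for N, L in ((5, 83), (7, 173)):
    print(N, L, ratio_bounds(N, L))
```

In both cases the ratio stays of order N, which is negligible compared with $D_\textrm{tot}$ itself.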

As we have stressed, such a large effective dimension is expected in a non-integrable quantum many-body system, but not in an integrable system. Here we have a large $D_\textrm{eff}$ in a free fermion model because of the artificial Golomb-ruler configuration. But recall that this choice is possible only in the extremely low density $\rho =O(N^{-1})$.

The above observation about the large effective dimension leads to a statement about thermalization. Take a sufficiently large and arbitrary prime N and a prime L such that $L\ge 2x_N-1$ with $x_N$ given by (C.19). We consider the system of N fermions on the chain $\{1,\ldots ,L\}$ with the Hamiltonian (3.1). We take the phase factor $\theta $ for which the energy eigenvalues (3.7) are nondegenerate. (See Lemma 3.1.)

For simplicity we restrict our observable only to the particle number in the left half of the chain, i.e.,

$$\begin{aligned} \hat{N}_\textrm{left}=\sum _{j=1}^{(L+1)/2}\hat{n}_j. \end{aligned}$$

(C.27)

The equilibrium value is of course $\langle \hat{N}_\textrm{left}\rangle _\infty =N/2$. Let the initial state be $|\Phi (0)\rangle =|\Phi _{\varvec{x}}\rangle =\hat{c}^\dagger _{x_1}\cdots \hat{c}^\dagger _{x_N}|\Phi _\textrm{vac}\rangle $, where the configuration $x_1,\ldots ,x_N$ is given by (C.19). Since

$$\begin{aligned} \frac{\hat{N}_\textrm{left}}{N}|\Phi (0)\rangle =|\Phi (0)\rangle , \end{aligned}$$

(C.28)

the initial state is highly nonequilibrium with respect to the quantity $\hat{N}_\textrm{left}/N$.

Then by using the large deviation type estimate

$$\begin{aligned} \biggl \langle \hat{P}\biggl [ \Bigl |\frac{\hat{N}_\textrm{left}}{N}-\frac{1}{2}\Bigr |\ge \varepsilon \biggr ]\biggr \rangle _\infty \le e^{-(4\varepsilon ^2/3)N}, \end{aligned}$$

(C.29)

which follows from (2.22), and the standard argument (as in the proof of Theorem 2.4), we can prove the following.

Theorem C.1

For any $\varepsilon >0$, there exists a constant $T>0$ and a subset (a collection of intervals) $G\subset [0,T]$ with

$$\begin{aligned} \frac{\mu (G)}{T}\ge 1-e^{-(\varepsilon ^2/4)N}, \end{aligned}$$

(C.30)

where $\mu (G)$ is the total length of the intervals in G. Suppose that one performs a measurement of the number operator $\hat{N}_\textrm{left}$ in the state $|\Phi (t)\rangle $ with arbitrary $t\in G$. Then, with probability larger than $1-e^{-(\varepsilon ^2/4)N}$, the measurement result $N_\textrm{left}$ satisfies

$$\begin{aligned} \Bigl |\frac{N_\textrm{left}}{N}-\frac{1}{2}\Bigr |\le \varepsilon , \end{aligned}$$

(C.31)

i.e., the state is found in thermal equilibrium.

Thus thermalization starting from a deterministic initial state has been established without any unproven assumptions. Here one can choose the precision $\varepsilon >0$ arbitrarily. But in order to make the factor $e^{-(\varepsilon ^2/4)N}$ negligibly small, one must take N large enough, which means that the density becomes lower.
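The trade-off between precision and density can be made concrete with a small calculation. The sketch below finds the smallest integer N for which the factor $e^{-(\varepsilon ^2/4)N}$ drops below a tolerance `delta` (a symbol introduced here only for illustration), and the corresponding density estimate $\rho \simeq (4N)^{-1}$ of the construction (C.19). It ignores the further requirements of Theorem C.1 that N be prime and sufficiently large, so the number is only a lower threshold.

```python
from math import ceil, exp, log

def min_particle_number(eps, delta):
    """Smallest integer N with exp(-(eps**2 / 4) * N) <= delta."""
    return ceil(4 * log(1 / delta) / eps ** 2)

eps, delta = 0.1, 1e-6
N = min_particle_number(eps, delta)
print(N, exp(-(eps ** 2 / 4) * N), 1 / (4 * N))
```

For $\varepsilon =0.1$ and a tolerance of $10^{-6}$, this already requires several thousand particles.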

D: Possible Extension to the Finite Temperature Situation

Throughout the present paper, we only focused on situations where the initial and the final states correspond to infinite temperature thermal states. See, in particular, Sect. 2.4. We believe that our results can be extended to finite temperature settings with extra technical efforts. Although we do not elaborate on the extension, we here briefly discuss the setting and essential steps in the proof.

We consider the free fermion Hamiltonian (1.1), (3.1). Decompose the Hamiltonian as in (2.30), where we choose $\Lambda _1$ as the half-chain $\{1,\ldots ,(L-1)/2\}$. It follows that $\Vert {\varDelta }\hat{H}\Vert =h_0$ is independent of the system size. Denote by $|\tilde{\Psi }_j\rangle \in \mathcal{H}_1$ the normalized eigenstate of $\hat{H}_1$ with eigenvalue $\tilde{E}_j$. For the energy density $u\in (-2,0)$ and the energy width ${\varDelta }u>0$, we define the nonequilibrium microcanonical energy shell by

$$\begin{aligned} \mathcal{H}_1^u=\textrm{span}\bigl \{|\tilde{\Psi }_j\rangle \,\Bigl |\, \bigl |\tfrac{\tilde{E}_j}{N}-u\bigr |\le {\varDelta }u\bigr \}\subset \mathcal{H}_1. \end{aligned}$$

(D.1)

Noting that $\hat{H}_2|\tilde{\Psi }_j\rangle =0$, we observe

$$\begin{aligned} \langle \tilde{\Psi }_j|(\hat{H}-\tilde{E}_j)^2|\tilde{\Psi }_j\rangle =\langle \tilde{\Psi }_j|\{(\hat{H}_1-\tilde{E}_j)+{\varDelta }\hat{H}\}^2|\tilde{\Psi }_j\rangle =\langle \tilde{\Psi }_j|({\varDelta }\hat{H})^2|\tilde{\Psi }_j\rangle \le (h_0)^2, \end{aligned}$$

(D.2)

which implies that $|\tilde{\Psi }_j\rangle $ is (with a minor error when N is large) a superposition of $|\Psi _k\rangle $ such that $|E_k-\tilde{E}_j|\lesssim h_0$. We thus find that any state $|\Phi (0)\rangle \in \mathcal{H}_1^u$ and its time-evolution $|\Phi (t)\rangle =e^{-i\hat{H}t}|\Phi (0)\rangle $ belong (again, with minor errors when N is large) to the standard microcanonical energy shell

$$\begin{aligned} \mathcal{H}_\textrm{tot}^u=\textrm{span}\bigl \{|\Psi _j\rangle \,\Bigl |\, \bigl |\tfrac{E_j}{N}-u\bigr |\le {\varDelta }u'\bigr \}\subset \mathcal{H}_\textrm{tot}, \end{aligned}$$

(D.3)

with ${\varDelta }u'>{\varDelta }u$.

In the finite temperature setting, we choose the initial state $|\Phi (0)\rangle $ randomly and uniformly from the nonequilibrium energy shell $\mathcal{H}^u_1$. The goal is to show that Theorem 2.4 (with suitable modifications of constants) is valid for the time-evolved state $|\Phi (t)\rangle $.

Recalling that $|\Phi (0)\rangle $ (essentially) belongs to $\mathcal{H}^u_\textrm{tot}$, our strategy for the proof will be to properly replace $\mathcal{H}_1$ and $\mathcal{H}_\textrm{tot}$ in the original proof with $\mathcal{H}^u_1$ and $\mathcal{H}^u_\textrm{tot}$, respectively. Let us see how the proof of the most important estimate of the effective dimension, Theorem 2.3, is modified. Interestingly, a small modification is sufficient. Denoting by $\hat{P}^u_1$ the projection onto $\mathcal{H}^u_1$, and by $D_1^u$ the dimension of $\mathcal{H}^u_1$, we find

$$\begin{aligned} \overline{D_\textrm{eff}^{-1}}&=\sum _{j=1}^{D_\textrm{tot}}\overline{\bigl |\langle \Phi (0)|\hat{P}^u_1|\Psi _j\rangle \bigr |^4} =\frac{2}{D^u_1(D^u_1+1)}\sum _{j=1}^{D_\textrm{tot}}\Vert \hat{P}^u_1|\Psi _j\rangle \Vert ^4 \nonumber \\&\le \frac{2}{D^u_1(D^u_1+1)}\sum _{j=1}^{D_\textrm{tot}}\Vert \hat{P}_1|\Psi _j\rangle \Vert ^2\,\Vert \hat{P}^u_1|\Psi _j\rangle \Vert ^2 \nonumber \\&\le \frac{2}{D^u_1(D^u_1+1)2^N}{\text {Tr}}[\hat{P}^u_1]=\frac{2}{(D^u_1+1)2^N}, \end{aligned}$$

(D.4)

which is a faithful extension of the key inequality (2.10). The analog of Theorem 2.3 is proved if we properly estimate the ratio $D^u_\textrm{tot}/D^u_1$. Another nontrivial (but technical) step for the proof of the desired extension of Theorem 2.4 is the derivation of the large-deviation upper bound (2.22) for the microcanonical average.
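The second equality in (D.4) rests on a standard property of uniformly random unit vectors: if $|\Phi \rangle $ is drawn uniformly from a D-dimensional space and $|v\rangle $ is a fixed unit vector in that space, then the average of $|\langle \Phi |v\rangle |^4$ equals $2/\{D(D+1)\}$. For the unnormalized vector $\hat{P}^u_1|\Psi _j\rangle $ this gives the factor $\Vert \hat{P}^u_1|\Psi _j\rangle \Vert ^4$. The Monte Carlo sketch below illustrates the identity; the sample size, the seed, and the function name are arbitrary choices, and the uniform distribution is generated by normalizing complex Gaussian vectors.

```python
import numpy as np

def mean_fourth_power(dim, samples=200_000, seed=0):
    """Monte Carlo estimate of the average of |<Phi|v>|^4 over random unit vectors Phi."""
    rng = np.random.default_rng(seed)
    z = rng.normal(size=(samples, dim)) + 1j * rng.normal(size=(samples, dim))
    z /= np.linalg.norm(z, axis=1, keepdims=True)   # uniformly distributed unit vectors
    return np.mean(np.abs(z[:, 0]) ** 4)            # v is taken as the first basis vector

for dim in (2, 4, 8):
    print(dim, mean_fourth_power(dim), 2 / (dim * (dim + 1)))
```

By unitary invariance of the distribution, taking $|v\rangle $ as a basis vector loses no generality.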

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

About this article

Cite this article

Shiraishi, N., Tasaki, H. Nature Abhors a Vacuum: A Simple Rigorous Example of Thermalization in an Isolated Macroscopic Quantum System. J Stat Phys 191, 82 (2024). https://doi.org/10.1007/s10955-024-03289-6

Received: 02 November 2023
Accepted: 07 June 2024
Published: 07 July 2024
DOI: https://doi.org/10.1007/s10955-024-03289-6
