The Wasserstein Distance of Order 1 for Quantum Spin Systems on Infinite Lattices

De Palma, Giacomo; Trevisan, Dario

doi:10.1007/s00023-023-01340-y

The Wasserstein Distance of Order 1 for Quantum Spin Systems on Infinite Lattices

Original Paper
Published: 24 June 2023

Volume 24, pages 4237–4282, (2023)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Annales Henri Poincaré Aims and scope Submit manuscript

The Wasserstein Distance of Order 1 for Quantum Spin Systems on Infinite Lattices

Download PDF

303 Accesses
2 Citations
2 Altmetric
Explore all metrics

Abstract

We propose a generalization of the Wasserstein distance of order 1 to quantum spin systems on the lattice $\mathbb {Z}^d$, which we call specific quantum $W_1$ distance. The proposal is based on the $W_1$ distance for qudits of De Palma et al. (IEEE Trans Inf Theory 67(10):6627–6643, 2021) and recovers Ornstein’s $\bar{d}$-distance for the quantum states whose marginal states on any finite number of spins are diagonal in the canonical basis. We also propose a generalization of the Lipschitz constant to quantum interactions on $\mathbb {Z}^d$ and prove that such quantum Lipschitz constant and the specific quantum $W_1$ distance are mutually dual. We prove a new continuity bound for the von Neumann entropy for a finite set of quantum spins in terms of the quantum $W_1$ distance, and we apply it to prove a continuity bound for the specific von Neumann entropy in terms of the specific quantum $W_1$ distance for quantum spin systems on $\mathbb {Z}^d$. Finally, we prove that local quantum commuting interactions above a critical temperature satisfy a transportation-cost inequality, which implies the uniqueness of their Gibbs states.

Entanglement Bounds in the XXZ Quantum Spin Chain

Article 05 June 2020

Singular continuous Cantor spectrum for magnetic quantum walks

Article 05 February 2020

A Note on the Spectral Gap of the Fredrickson–Andersen One Spin Facilitated Model

Article Open access 04 November 2020

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

Let $\mathcal {X}$ be a finite set endowed with the distance D and let $\mu $ and $\nu $ be probability distributions on $\mathcal {X}$. A coupling between $\mu $ and $\nu $ is a probability distribution on two copies of $\mathcal {X}$ with marginal distributions equal to $\mu $ and $\nu $, respectively. The theory of optimal mass transport considers $\mu $ and $\nu $ as distributions of a unit amount of mass, and any coupling $\pi $ prescribes a plan to transform the distribution $\mu $ into the distribution $\nu $, i.e., $\pi (x,y)$ is the amount of mass that is moved from x to y for any $x,\,y\in \mathcal {X}$. Assuming that the cost of moving a unit of mass from x to y is equal to D(x, y), the cost of the coupling $\pi $ is equal to $\mathbb {E}_{(X,Y)\sim \pi }D(X,Y)$. The Monge–Kantorovich distance between $\mu $ and $\nu $ is given by the minimum cost among all the couplings between $\mu $ and $\nu $ [1,2,3]. Such distance is also called earth mover’s distance or Wasserstein distance of order 1, often shortened to $W_1$ distance. The exploration of the theory of optimal mass transport has led to the creation of an extremely fruitful field in mathematical analysis, with applications ranging from differential geometry and partial differential equations to machine learning [3,4,5,6].

The Hamming distance constitutes a natural choice for the distance D when $\mathcal {X}$ is a set of finite strings over an alphabet. The $W_1$ distance with respect to the Hamming distance is called Ornstein’s $\bar{d}$-distance and was first considered in [7], together with its extension to stationary stochastic processes. Originally introduced as a tool for the classification of stationary processes in ergodic theory, since then it has found further applications in probability theory, such as the statistical estimation of processes [8,9,10,11,12], information theory, such as coding theorems for a large class of discrete noisy channels with memory and rate distortion theory [13, 14] and recently also machine learning, as a peculiar case of Wasserstein auto-encoders [15, 16].

Ref. [17] proposed a generalization of the $W_1$ distance to the space of the quantum states of a finite set of qudits or spins, called quantum $W_1$ distance. The generalization is based on the notion of neighboring quantum states. Two quantum states of a finite set of qudits are neighboring if they coincide after discarding one qudit. The quantum $W_1$ distance proposed in Ref. [17] is the distance induced by the maximum norm that assigns distance at most 1 to any couple of neighboring states. Such quantum $W_1$ distance recovers Ornstein’s $\bar{d}$-distance in the case of quantum states diagonal in the canonical basis and inherits most of its properties. The quantum $W_1$ distance has found several applications in quantum information theory. In the context of statistical mechanics of quantum spin systems, a connection with quantum speed limits [18] has been found. Furthermore, transportation-cost inequalities have been proved, which upper bound the square of the quantum $W_1$ distance between a generic quantum state and the Gibbs state of a local quantum commuting Hamiltonian with the relative entropy between the same states [19]. Such inequalities have been applied to prove the equivalence between the microcanonical and the canonical ensembles of quantum statistical mechanics [19] and to prove limitations of variational quantum algorithms [20, 21]. Moreover, the quantum $W_1$ distance has been applied to quantify the complexity of quantum circuits [22]. In the context of quantum state tomography, the quantum $W_1$ distance has been employed as quantifier of the quality of the learned quantum state, and the transportation-cost inequalities have led to an efficient algorithm to learn Gibbs states of local quantum commuting Hamiltonians [23, 24]. In the context of quantum machine learning, the quantum $W_1$ distance has been employed as cost function of the quantum version of generative adversarial networks [25,26,27,28]. Furthermore, the quantum $W_1$ distance has been applied in the context of differential privacy of a quantum computation [29, 30]. Finally, the quantum $W_1$ distance has been extended to general composite systems [31], which include the case of a finite tensor product of $C^*$ algebras, but also provides a way to define a quantum $W_1$ distance between quantum channels.

1.1 Our Contribution

In this paper, we propose a generalization of the $W_1$ distance to quantum spin systems on the lattice $\mathbb {Z}^d$ [32,33,34,35] based on the quantum $W_1$ distance of Ref. [17]. Quantum spin systems on infinite lattices play a key role in quantum statistical mechanics since they provide a model to study the thermodynamic limit of infinite size of the system. Such limit is necessary to define phase transitions and to identify the properties of the system that are independent on boundary effects and boundary conditions, and more generally to make a clear distinction between the local and the global properties of the system.

We define the specific quantum $W_1$ distance between two translation-invariant states as the limit of the distance between their marginal states on an hypercube divided by the volume of the hypercube for the volume of the hypercube tending to infinity (Definition 4.1). Contrarily to the trace distance, the specific quantum $W_1$ distance has an intensive nature that makes it suitable to capture the closeness of states that are locally similar but become perfectly distinguishable globally, such as Gibbs states at close but different temperatures. We provide in Definition 4.2 an equivalent definition of the specific quantum $W_1$ distance that does not require the limit. We propose a generalization of the Lipschitz constant to quantum interactions on $\mathbb {Z}^d$ (Definition 5.2), and we prove in Theorem 6.1 that the specific quantum $W_1$ distance and the Lipschitz constant are mutually dual.

We prove in Proposition 7.1 that the specific quantum $W_1$ distance recovers Ornstein’s $\bar{d}$-distance in the case of quantum states whose marginal states on a finite number of spins are all diagonal in the canonical basis. We prove in Proposition 8.1 a Poincaré inequality stating that for any product state, the variance of the local Hamiltonians associated with an interaction grows linearly with the volume. In Theorem 8.1, we prove a Gaussian concentration inequality for the maximally mixed state of a finite set of spins, and we apply it in Corollary 8.1 to prove an upper bound to the pressure of a quantum interaction on $\mathbb {Z}^d$ in terms of its Lipschitz constant.

In Theorem 9.1, we prove a continuity bound for the von Neumann entropy in terms of the $W_1$ distance. The bound applies to quantum systems made by a finite number of spins or qudits and states that the difference between the von Neumann entropy of any two quantum states divided by the number of spins is upper bounded by a universal function of the ratio between the $W_1$ distance and the number of spins. The bound of Theorem 9.1 contains only intensive quantities, and thanks to this property we apply it to prove a continuity bound for the specific von Neumann entropy in terms of the specific quantum $W_1$ distance (Theorem 10.1). Theorem 9.1 improves [17, Theorem 1], which is a weaker continuity bound for the von Neumann entropy in terms of the $W_1$ distance. Contrarily to the bound of Theorem 9.1, the bound of [17, Theorem 1] cannot be expressed in terms of only intensive quantities, and therefore such bound would not be sufficient to prove a continuity bound for the specific von Neumann entropy. Besides the applications to quantum spin systems, Theorem 9.1 can be useful in quantum Shannon theory in the context of rate-distortion theory, which addresses the problem of determining the maximum compression rate of a quantum state if a certain level of distortion in the recovered state is allowed [36,37,38,39,40,41,42,43].

In the remainder of the paper, we apply our $W_1$ distance to study the statistical mechanics of quantum spin systems on infinite lattices. We propose a definition of $w_1$-Gibbs state as a translation-invariant state such that the $W_1$ distance between its marginal state on a hypercube and the Gibbs state of the local Hamiltonian on the same hypercube scales sublinearly with the volume of the hypercube (Definition 11.1). If an interaction admits a $w_1$-Gibbs state, then such state is unique (Proposition 11.1) and is an equilibrium state of the interaction (Proposition 11.2) in the sense of Kubo–Martin–Schwinger [33]. In Sect. 12, we consider transportation-cost inequalities for interactions on the quantum spin lattice $\mathbb {Z}^d$. Such inequalities imply the uniqueness of the Gibbs state of the interaction (Theorem 12.1) and a continuity bound for the specific entropy in terms of the specific relative entropy with respect to the Gibbs state (Proposition 12.2). Finally, we prove that transportation-cost inequalities are satisfied by interactions that contain only terms acting on single spins (Corollary 12.1) and geometrically local commuting interactions above a critical temperature (Theorem 12.2 and Theorem 12.3).

The paper is structured as follows. In Sect. 2, we introduce quantum spin systems on the lattice $\mathbb {Z}^d$ and in Sect. 3 we present the quantum $W_1$ distance and the quantum Lipschitz constant of Ref. [17]. In Sect. 4 and Sect. 5, we generalize the quantum $W_1$ distance and the quantum Lipschitz constant, respectively, to quantum spin systems on the lattice $\mathbb {Z}^d$. In Sect. 6, we prove the duality between the specific quantum $W_1$ distance and the Lipschitz constant and in Sect. 7 we prove that the specific quantum $W_1$ distance that we propose recovers Ornstein’s $\bar{d}$-distance. In Sect. 8, we prove the quantum Poincaré and Gaussian concentration inequalities for product states. In Sect. 9, we prove the continuity bound for the von Neumann entropy in terms of the $W_1$ distance, and in Sect. 10, we prove the continuity bound for the specific entropy in terms of the specific quantum $W_1$ distance. In Sect. 11, we introduce the notion of $w_1$-Gibbs state. In Sect. 12, we present and prove the transportation-cost inequalities for Gibbs states. We conclude in Sect. 13 presenting some perspective applications of this work. “Appendix A” recalls some relevant properties of the quantum $W_1$ distance. “Appendix B” contains some auxiliary proofs, and “Appendix C” contains the proof of the auxiliary lemmas.

1.2 Related Approaches

Several quantum generalizations of optimal transport distances have been proposed. One line of research by Carlen, Maas, Datta and Rouzé [44,45,46,47,48,49,50] defines a quantum Wasserstein distance of order 2 from a Riemannian metric on the space of quantum states based on a quantum analog of a differential structure. Exploiting their quantum differential structure, Refs. [46, 47, 51] also define a quantum generalization of the Lipschitz constant and of the Wasserstein distance of order 1. Alternative definitions of quantum Wasserstein distances of order 1 based on a quantum differential structure are proposed in Refs. [52,53,54,55]. Refs. [56,57,58] propose quantum Wasserstein distances of order 1 based on a distance between the vectors of the canonical basis.

Another line of research by Golse, Mouhot, Paul and Caglioti [59,60,61,62,63,64,65,66,67,68,69] arose in the context of the study of the semiclassical limit of quantum mechanics and defines a family of quantum Wasserstein distances of order 2 built on a quantum generalization of couplings. Such distances have been generalized to von Neumann algebras [70,71,72].

Ref. [73] proposes another quantum Wasserstein distance of order 2 based on couplings, with the property that each quantum coupling is associated to a quantum channel. The relation between quantum couplings and quantum channels in the framework of von Neumann algebras has been explored in [74]. The problem of defining a quantum Wasserstein distance of order 1 through quantum couplings has been explored in Ref. [75].

The quantum Wasserstein distance between two quantum states can be defined as the classical Wasserstein distance between the probability distributions of the outcomes of an informationally complete measurement performed on the states, which is a measurement whose probability distribution completely determines the state. This definition has been explored for Gaussian quantum systems with the heterodyne measurement in Refs. [76,77,78].

2 Quantum Spin Systems on Infinite Lattices

In this section, we introduce the setting of quantum spin systems on infinite lattices and fix the notation for the paper. For more details, the reader is encouraged to consult the books [32,33,34,35].

2.1 Algebra and States

We associate to each $x\in \mathbb {Z}^d$ the single-spin Hilbert space $\mathcal {H}_x = \mathbb {C}^q$. Let $\mathcal {F}_{\mathbb {Z}^d}$ be the collection of all the finite subsets of $\mathbb {Z}^d$. We associate to each $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ the Hilbert space

$$\begin{aligned} \mathcal {H}_\Lambda = \bigotimes _{x\in \Lambda }\mathcal {H}_x. \end{aligned}$$

(2.1)

For each $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, we denote with $\mathfrak {U}_\Lambda $ the algebra of the linear operators acting on $\mathcal {H}_\Lambda $ equipped with the operator norm, which we denote with $\Vert \cdot \Vert _\infty $. For any $\Lambda '\subseteq \Lambda $, $\mathfrak {U}_{\Lambda '}$ can be canonically identified with a subalgebra of $\mathfrak {U}_\Lambda $. This identification will always be implicit.

We denote with $\mathcal {O}_\Lambda \subset \mathfrak {U}_\Lambda $ the set of the self-adjoint linear operators acting on $\mathcal {H}_\Lambda $, and with $\mathcal {O}^T_\Lambda \subset \mathcal {O}_\Lambda $ the set of the traceless self-adjoint linear operators acting on $\mathcal {H}_\Lambda $. We denote with $\mathcal {S}_\Lambda \subset \mathcal {O}_\Lambda $ the set of the quantum states acting on $\mathcal {H}_\Lambda $, i.e., the positive semidefinite linear operators with unit trace, and with $\textrm{Tr}_\Lambda $ the partial trace over $\mathcal {H}_\Lambda $. We say that $\rho \in \mathcal {S}_\Lambda $ is a product state if there exists a collection of states $\left\{ \rho _x\in \mathcal {S}_x\right\} _{x\in \Lambda }$ such that

$$\begin{aligned} \rho = \bigotimes _{x\in \Lambda }\rho _x. \end{aligned}$$

(2.2)

Some results of this paper do not require the lattice structure of $\mathbb {Z}^d$ and apply to generic finite spin systems. If $\Lambda $ is a generic finite set, we still define $\mathcal {H}_\Lambda $, $\textrm{Tr}_\Lambda $, $\mathfrak {U}_\Lambda $, $\mathcal {O}_\Lambda $, $\mathcal {O}_\Lambda ^T$ and $\mathcal {S}_\Lambda $ as above.

The strictly local algebra of the spin lattice $\mathbb {Z}^d$ is

$$\begin{aligned} \mathfrak {U}_{\mathbb {Z}^d}^{loc} = \bigcup _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\mathfrak {U}_\Lambda , \end{aligned}$$

(2.3)

and is equipped with the norm inherited from the operator norm of each $\mathfrak {U}_\Lambda $. The quasi-local algebra $\mathfrak {U}_{\mathbb {Z}^d}$ is the completion of $\mathfrak {U}_{\mathbb {Z}^d}^{loc}$ with respect to such norm, which we still denote with $\Vert \cdot \Vert _\infty $. For any (not necessarily finite) $\Gamma \subseteq \mathbb {Z}^d$, we define

$$\begin{aligned} \mathfrak {U}_\Gamma = \overline{\bigcup _{X\in \mathcal {F}_{\mathbb {Z}^d},\,X\subseteq \Gamma }\mathfrak {U}_X}\subseteq \mathfrak {U}_{\mathbb {Z}^d}, \end{aligned}$$

(2.4)

where the closure is with respect to the $\Vert \cdot \Vert _\infty $ norm in $\mathfrak {U}_{\mathbb {Z}^d}$. When $\Gamma $ is finite or $\Gamma =\mathbb {Z}^d$, (2.4) is consistent with the previous definitions. We denote with $\mathcal {O}_\Gamma $ the set of the self-adjoint elements of $\mathfrak {U}_\Gamma $.

A quantum state $\rho $ of the spin lattice $\mathbb {Z}^d$ is a positive linear functional on $\mathfrak {U}_{\mathbb {Z}^d}$ with $\rho (\mathbb {I})=1$. We denote the set of the quantum states of $\mathbb {Z}^d$ with $\mathcal {S}_{\mathbb {Z}^d}$. Analogously, for any (not necessarily finite) $\Gamma \subseteq \mathbb {Z}^d$, a quantum state $\rho $ of $\Gamma $ is a positive linear functional on $\mathfrak {U}_\Gamma $ with $\rho (\mathbb {I})=1$. We denote with $\mathcal {S}_\Gamma $ the set of the quantum states of $\Gamma $. If $\Gamma $ is finite, this definition is consistent with the previous one by setting for any $A\in \mathfrak {U}_\Gamma $

$$\begin{aligned} \rho (A) = \textrm{Tr}_\Gamma \left[ \rho \,A\right] . \end{aligned}$$

(2.5)

Let $\Gamma '\subseteq \Gamma \subseteq \mathbb {Z}^d$ and let $\rho \in \mathcal {S}_\Gamma $. The marginal state $\rho _{\Gamma '}$ of $\rho $ on $\Gamma '$ is the restriction of $\rho $ on $\mathfrak {U}_{\Gamma '}$. If $\Gamma $ is finite, we have

$$\begin{aligned} \rho _{\Gamma '} = \textrm{Tr}_{\Gamma \setminus \Gamma '}\rho . \end{aligned}$$

(2.6)

Since $\mathfrak {U}_{\mathbb {Z}^d}^{loc}$ is dense in $\mathfrak {U}_{\mathbb {Z}^d}$ by construction, any $\rho \in \mathcal {S}_{\mathbb {Z}^d}$ is completely determined by the collection of its marginal states $(\rho _\Lambda )_{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}$. We say that $\rho \in \mathcal {S}_{\mathbb {Z}^d}$ is a product state if for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ the marginal state $\rho _\Lambda $ is a product state.

We associate to each $x\in \mathbb {Z}^d$ the translation operator $\tau _x$, which is the automorphism of $\mathfrak {U}_{\mathbb {Z}^d}$ that sends $\mathfrak {U}_\Gamma $ to $\mathfrak {U}_{\Gamma +x}$ for any $\Gamma \subseteq \mathbb {Z}^d$. With some abuse of notation, we denote with $\tau _x$ also the automorphism of $\mathcal {S}_{\mathbb {Z}^d}$ such that for any $\rho \in \mathcal {S}_{\mathbb {Z}^d}$ and any $A\in \mathfrak {U}_{\mathbb {Z}^d}$ we have

$$\begin{aligned} (\tau _x\rho )(\tau _x A) = \rho (A). \end{aligned}$$

(2.7)

With some further abuse of notation, for any $\Gamma \subseteq \mathbb {Z}^d$ we denote with $\tau _x$ also the isomorphism between $\mathcal {S}_\Gamma $ and $\mathcal {S}_{\Gamma +x}$ such that (2.7) holds for any $\rho \in \mathcal {S}_\Gamma $ and any $A\in \mathfrak {U}_\Gamma $. We say that $\rho \in \mathcal {S}_{\mathbb {Z}^d}$ is translation invariant if $\tau _x\rho = \rho $ for any $x\in \mathbb {Z}^d$. We denote with $\mathcal {S}_{\mathbb {Z}^d}^I\subset \mathcal {S}_{\mathbb {Z}^d}$ the set of the translation-invariant quantum states of $\mathbb {Z}^d$.

For each $a\in \mathbb {N}_+^d$, we denote with $\Lambda _a$ the box

$$\begin{aligned} \Lambda _a = \left\{ x\in \mathbb {Z}^d: -a \le x < a\right\} ,\qquad \left| \Lambda _a\right| = \prod _{i=1}^d 2a_i, \end{aligned}$$

(2.8)

where inequalities between vectors hold for each component. Given a sequence $\left( a^{(n)}\right) _{n\in \mathbb {N}}\subset \mathbb {N}_+^d$, we say that $a^{(n)}\rightarrow \infty $ if $a^{(n)}_i\rightarrow \infty $ for any $i=1,\,\ldots ,\,d$.

Definition 2.1

(Trace distance). The trace distance on $\mathcal {S}_{\mathbb {Z}^d}$ is the distance induced by the norm on $\mathfrak {U}_{\mathbb {Z}^d}$: For any $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}$,

$$\begin{aligned} T(\rho ,\sigma ) = \frac{1}{2}\sup \left\{ \left| \rho (A) - \sigma (A)\right| :A\in \mathfrak {U}_{\mathbb {Z}^d},\,\left\| A\right\| _\infty \le 1\right\} . \end{aligned}$$

(2.9)

Proposition 2.1

The trace distance on $\mathcal {S}_{\mathbb {Z}^d}$ is the supremum of the trace distances between the marginal states: For any $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}$,

$$\begin{aligned} T(\rho ,\sigma ) = \frac{1}{2}\sup _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\left\| \rho _\Lambda - \sigma _\Lambda \right\| _1, \end{aligned}$$

(2.10)

where $\Vert \cdot \Vert _1$ denotes the trace norm on $\mathfrak {U}_\Lambda $ given by

$$\begin{aligned} \left\| A\right\| _1 = \textrm{Tr}_\Lambda \sqrt{A^\dag A},\qquad A\in \mathfrak {U}_\Lambda . \end{aligned}$$

(2.11)

Proof

See Sect. B.1. $\square $

Definition 2.2

(Specific entropy [33, Proposition 6.2.38]). The von Neumann entropy of a quantum state $\rho $ acting on a finite-dimensional Hilbert space is [79,80,81]

$$\begin{aligned} S(\rho ) = - \textrm{Tr}\left[ \rho \ln \rho \right] . \end{aligned}$$

(2.12)

The specific entropy of $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$ is the entropy per site in the limit of infinite volume:

$$\begin{aligned} s(\rho ) = \lim _{a\rightarrow \infty }\frac{S(\rho _{\Lambda _a})}{\left| \Lambda _a\right| } = \inf _{a\in \mathbb {N}_+^d}\frac{S(\rho _{\Lambda _a})}{|\Lambda _a|}. \end{aligned}$$

(2.13)

Definition 2.3

(Specific relative entropy [82]). The relative entropy [79,80,81] between the quantum states $\rho $ and $\sigma $ acting on a finite-dimensional Hilbert space is

$$\begin{aligned} S(\rho \Vert \sigma ) = -\textrm{Tr}\left[ \rho \left( \ln \rho - \ln \sigma \right) \right] . \end{aligned}$$

(2.14)

The specific relative entropy between the states $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$ is the relative entropy per site in the limit of infinite volume:

$$\begin{aligned} s(\rho \Vert \sigma ) = \lim _{a\rightarrow \infty }\frac{S(\rho _{\Lambda _a}\Vert \sigma _{\Lambda _a})}{\left| \Lambda _a\right| }, \end{aligned}$$

(2.15)

whenever the limit exists.

Remark 2.1

$s(\rho \Vert \sigma )=0$ does not imply $\rho =\sigma $. Indeed, let $\rho _\Lambda = |0\rangle \langle 0|^{\otimes \Lambda }$ and $\sigma _\Lambda = \frac{1}{2}\left( |0\rangle \langle 0|^{\otimes \Lambda } + |1\rangle \langle 1|^{\otimes \Lambda }\right) $ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$. Then, $S(\rho _\Lambda \Vert \sigma _\Lambda ) =\ln 2$ and $s(\rho \Vert \sigma )=0$.

2.2 Interactions

An interaction $\Phi $ is a collection of observables $\left( \Phi (\Lambda )\right) _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}$, where $\Phi (\Lambda )\in \mathcal {O}_\Lambda $ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$. We can associate to $\Phi $ the formal Hamiltonian

$$\begin{aligned} H^\Phi _{\mathbb {Z}^d} = \sum _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\Phi (\Lambda ). \end{aligned}$$

(2.16)

In general, the series (2.16) does not converge not even weakly, and $H^\Phi _{\mathbb {Z}^d}$ cannot be defined as an element of $\mathfrak {U}_{\mathbb {Z}^d}$.

We can define for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ the local Hamiltonian on $\Lambda $ with open boundary conditions

$$\begin{aligned} H_\Lambda ^\Phi = \sum _{X\subseteq \Lambda }\Phi (X) \in \mathcal {O}_\Lambda . \end{aligned}$$

(2.17)

An interaction $\Phi $ is translation invariant if $\Phi (\Lambda +x) = \tau _x(\Phi (\Lambda ))$ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ and any $x\in \mathbb {Z}^d$. For $r>0$, we denote with $\mathcal {B}_{\mathbb {Z}^d}^r$ the set of translation-invariant interactions satisfying

$$\begin{aligned} \Vert \Phi \Vert _r = \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}}e^{r\left( \left| \Lambda \right| -1\right) }\left\| \Phi (\Lambda )\right\| _\infty < \infty . \end{aligned}$$

(2.18)

The specific energy observable of $\Phi \in \mathcal {B}_{\mathbb {Z}^d}^r$ is

$$\begin{aligned} E_\Phi = \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\frac{\Phi (\Lambda )}{\left| \Lambda \right| } \in \mathcal {O}_{\mathbb {Z}^d} \end{aligned}$$

(2.19)

and satisfies [33, Proposition 6.2.39]

$$\begin{aligned} \lim _{a\rightarrow \infty }\frac{1}{\left| \Lambda _a\right| }\left\| H^\Phi _{\Lambda _a} - \sum _{x\in \Lambda _a}\tau _x E_\Phi \right\| _\infty = 0. \end{aligned}$$

(2.20)

Therefore, for any $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$ we have that $\rho (E_\Phi )$ is equal to the average energy per site of $\rho $ in the limit of infinite volume:

$$\begin{aligned} \lim _{a\rightarrow \infty }\frac{\rho \left( H^\Phi _{\Lambda _a}\right) }{\left| \Lambda _a\right| } = \rho \left( E_\Phi \right) . \end{aligned}$$

(2.21)

2.3 Gibbs States

Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$. For any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, the local Gibbs state of $\Phi $ on $\Lambda $ with open boundary conditions is the Gibbs state of $H^\Phi _\Lambda $:

$$\begin{aligned} \omega ^\Phi _\Lambda = \frac{e^{-H^\Phi _\Lambda }}{\textrm{Tr}_\Lambda e^{-H^\Phi _\Lambda }} \in \mathcal {S}_\Lambda . \end{aligned}$$

(2.22)

Since the temperature can always be reabsorbed in the interaction, in the whole paper we set it to one.

Remark 2.2

The states $\left( \omega ^\Phi _\Lambda \right) _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}$ defined in (2.22) are not necessarily the marginal states of a single global state $\omega \in \mathcal {S}_{\mathbb {Z}^d}$.

The pressure of $\Phi \in \mathcal {B}_{\mathbb {Z}^d}^r$ is [33, Theorem 6.2.40]

$$\begin{aligned} P(\Phi ) = \lim _{a\rightarrow \infty }\frac{\ln \textrm{Tr}_{\Lambda _a} e^{-H^\Phi _{\Lambda _a}}}{\left| \Lambda _a\right| } = \sup _{\rho \in \mathcal {S}^I_{\mathbb {Z}^d}}\left( s(\rho ) - \rho (E_\Phi )\right) . \end{aligned}$$

(2.23)

The states $\rho \in \mathcal {S}^I_{\mathbb {Z}^d}$ that achieve the supremum in (2.23) are called equilibrium states of $\Phi $ and satisfy the Kubo–Martin–Schwinger condition [33]. We denote with $\mathcal {S}_{eq}(\Phi )$ the set of such states. For any $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$, $\mathcal {S}_{eq}(\Phi )$ is nonempty, convex and compact with respect to the trace distance.

3 The Quantum $W_1$ Distance

Ref. [17] introduced the following generalization of the Wasserstein distance of order 1 and of the Lipschitz constant to quantum systems made by a finite number of spins. Since Ref. [17] does not require the lattice structure of $\mathbb {Z}^d$, here $\Lambda $ denotes a fixed generic finite set. The quantum $W_1$ distance is based on the notion of neighboring quantum states. The states $\rho ,\,\sigma \in \mathcal {S}_\Lambda $ are neighboring if there exists $x\in \Lambda $ such that $\textrm{Tr}_x\rho = \textrm{Tr}_x\sigma $. The quantum $W_1$ norm is the maximum norm that assigns distance at most 1 to each couple of neighboring states:

Definition 3.1

($W_1$ norm). Let $\Lambda $ be a finite set. We define for any $\Delta \in \mathcal {O}_\Lambda ^T$

$$\begin{aligned} \left\| \Delta \right\| _{W_1} = \frac{1}{2}\min \left\{ \sum _{x\in \Lambda }\left\| \Delta ^{(x)}\right\| _1: \Delta ^{(x)}\in \mathcal {O}^T_\Lambda ,\;\textrm{Tr}_x \Delta ^{(x)} = 0,\; \sum _{x\in \Lambda }\Delta ^{(x)} = \Delta \right\} .\nonumber \\ \end{aligned}$$

(3.1)

The quantum $W_1$ distance on $\mathcal {S}_\Lambda $ is the distance induced by the quantum $W_1$ norm: For any $\rho ,\,\sigma \in \mathcal {S}_\Lambda $,

$$\begin{aligned} W_1(\rho ,\sigma ) = \left\| \rho - \sigma \right\| _{W_1}. \end{aligned}$$

(3.2)

Definition 3.2

(Lipschitz constant). Let $\Lambda $ be a finite set. We define the dependence of $H\in \mathcal {O}_\Lambda $ on the site $x\in \Lambda $ as

$$\begin{aligned} \partial _x H = 2\min _{H_{\Lambda \setminus x}\in \mathcal {O}_{\Lambda \setminus x}}\left\| H - H_{\Lambda \setminus x}\right\| _\infty . \end{aligned}$$

(3.3)

The quantum Lipschitz constant of $H\in \mathcal {O}_\Lambda $ is

$$\begin{aligned} \left\| H\right\| _L = \max _{x\in \Lambda }\partial _xH. \end{aligned}$$

(3.4)

Proposition 3.1

([17, Proposition 8]). The quantum $W_1$ norm and the quantum Lipschitz constant are mutually dual, i.e., for any $\Delta \in \mathcal {O}_\Lambda ^T$ we have

$$\begin{aligned} \left\| \Delta \right\| _{W_1} = \max \left\{ \textrm{Tr}_\Lambda \left[ \Delta \,H\right] :H\in \mathcal {O}_\Lambda ,\,\left\| H\right\| _L\le 1\right\} . \end{aligned}$$

(3.5)

4 The Quantum $W_1$ Distance for Infinite Lattices

In this section, we extend the quantum Wasserstein distance of order 1 of Ref. [17] to the quantum states of the spin lattice $\mathbb {Z}^d$. As for the entropy and the relative entropy, we define a specific quantum $W_1$ distance, which we denote with $w_1$, as the $W_1$ distance per site in the limit of infinite volume:

Definition 4.1

(Specific quantum $W_1$ distance). For any $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$ we define

$$\begin{aligned} w_1(\rho ,\sigma ) = \lim _{a\rightarrow \infty }\frac{\left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| }. \end{aligned}$$

(4.1)

Remark 4.1

We define the specific quantum $W_1$ distance only for translation-invariant states since the limit (4.1) may not exist for generic states in $\mathcal {S}_{\mathbb {Z}^d}$.

Proposition 4.1

The limit in (4.1) exists for any $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$ and is equal to

$$\begin{aligned} w_1(\rho ,\sigma ) = \sup _{a\in \mathbb {N}_+^d}\frac{\left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{W_1}}{|\Lambda _a|}. \end{aligned}$$

(4.2)

Moreover, $w_1$ is a distance on $\mathcal {S}_{\mathbb {Z}^d}^I$.

Proof

For any $a\in \mathbb {N}_+^d$, let

$$\begin{aligned} f(a) = \left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{W_1}. \end{aligned}$$

(4.3)

Given $a\in \mathbb {N}_+^d$, $k\in \mathbb {N}$ and $i\in \left\{ 1,\,\ldots ,\,d\right\} $, let

$$\begin{aligned} b=\left( a_1,\,\ldots ,\,a_i + k,\,\ldots ,\,a_d\right) ,\qquad c=\left( a_1,\,\ldots ,\,k,\,\ldots ,\,a_d\right) . \end{aligned}$$

(4.4)

We have

$$\begin{aligned} \Lambda _b = \left( \Lambda _a - k\,e_i\right) \cup \left( \Lambda _c + a_i\,e_i\right) , \end{aligned}$$

(4.5)

where $e_i$ is the i-th vector of the canonical basis of $\mathbb {R}^d$. Then, we get from Proposition A.3 and from the translation invariance of $\rho $ and $\sigma $

$$\begin{aligned} f(b) \ge \left\| \rho _{\Lambda _a - k e_i} - \sigma _{\Lambda _a - k e_i}\right\| _{W_1} + \left\| \rho _{\Lambda _c + a_i e_i} - \sigma _{\Lambda _c + a_i e_i}\right\| _{W_1} = f(a) + f(c).\nonumber \\ \end{aligned}$$

(4.6)

Then, f is superadditive in each variable, and we have from the multidimensional Fekete’s lemma Lemma C.1

$$\begin{aligned} w_1(\rho ,\sigma ) = \lim _{a\rightarrow \infty }\frac{f(a)}{\left| \Lambda _a\right| } = \sup _{a\in \mathbb {N}_+^d}\frac{f(a)}{\left| \Lambda _a\right| }. \end{aligned}$$

(4.7)

The nonnegativity and the triangle inequality for $w_1$ follow from the nonnegativity and the triangle inequality for $W_1$, respectively. Let $w_1(\rho ,\sigma ) = 0$. Then, (4.2) implies

$$\begin{aligned} \left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{W_1} = 0 \qquad \forall \,a\in \mathbb {N}_+^d, \end{aligned}$$

(4.8)

i.e., $\rho _{\Lambda _a} = \sigma _{\Lambda _a}$ for any $a\in \mathbb {N}_+^d$. Let $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, and let us choose $a\in \mathbb {N}_+^d$ such that $\Lambda \subseteq \Lambda _a$. Then,

$$\begin{aligned} \rho _\Lambda = \textrm{Tr}_{\Lambda _a\setminus \Lambda }\rho _{\Lambda _a} = \textrm{Tr}_{\Lambda _a\setminus \Lambda }\sigma _{\Lambda _a} = \sigma _\Lambda , \end{aligned}$$

(4.9)

hence $\rho = \sigma $. $\square $

The specific quantum $W_1$ distance is always upper bounded by the trace distance:

Proposition 4.2

For any $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$, we have

$$\begin{aligned} w_1(\rho ,\sigma ) \le T(\rho ,\sigma ). \end{aligned}$$

(4.10)

Moreover, for any $a\in \mathbb {N}_+^d$ we have

$$\begin{aligned} \left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _1 \le 2\left| \Lambda _a\right| w_1(\rho ,\sigma ). \end{aligned}$$

(4.11)

Proof

We have

$$\begin{aligned} w_1(\rho ,\sigma ) = \sup _{a\in \mathbb {N}_+^d}\frac{\left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{W_1}}{|\Lambda _a|} \overset{\mathrm {(a)}}{\le } \sup _{a\in \mathbb {N}_+^d}\frac{\left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _1}{2} \overset{\mathrm {(b)}}{\le } T(\rho ,\sigma ),\quad \end{aligned}$$

(4.12)

where (a) follows from Proposition A.1 and (b) follows from Proposition 2.1. From Proposition A.1, we have for any $a\in \mathbb {N}_+^d$

$$\begin{aligned} w_1(\rho ,\sigma ) \ge \frac{\left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{W_1}}{|\Lambda _a|} \ge \frac{\left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{1}}{2\left| \Lambda _a\right| }. \end{aligned}$$

(4.13)

The claim follows. $\square $

The specific quantum $W_1$ distance admits an equivalent definition, which directly generalizes Definition 3.1 to infinite lattices and does not require the limit over hypercubes.

Definition 4.2

We define for any $\sigma $, $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$,

$$\begin{aligned} \left\| \rho - \sigma \right\| _{w_1}&= \inf \Bigg \{ c\ge 0 : \exists \, \rho ', \, \sigma ' \in \mathcal {S}_{\mathbb {Z}^d} :\; \rho '_{\mathbb {Z}^d \setminus 0} = \sigma '_{\mathbb {Z}^d\setminus 0}\,,\nonumber \\ \rho _\Lambda - \sigma _\Lambda&= c \sum _{x \in \Lambda } (\tau _x\rho ')_{\Lambda } - (\tau _x\sigma ')_{\Lambda }\quad \forall \, \Lambda \in \mathcal {F}_{\mathbb {Z}^d}\Bigg \}\,. \end{aligned}$$

(4.14)

Let us collect some basic properties of the quantity defined above.

Proposition 4.3

The infimum in (4.14) is attained for some $c \ge 0$, $\rho ', \sigma '\in \mathcal {S}_{\mathbb {Z}^d}$. Moreover, given sequences $(\rho _n)_{n \in \mathbb {N}}$, $(\sigma _n)_{n \in \mathbb {N}} \subseteq \mathcal {S}_{\mathbb {Z}^d}^I$ weakly converging, respectively, toward $\rho $ and $\sigma $, then

$$\begin{aligned} \Vert \rho - \sigma \Vert _{w_1} \le \liminf _{ n \rightarrow \infty } \Vert \rho _n - \sigma _n\Vert _{w_1}\,. \end{aligned}$$

(4.15)

Proof

Both statements follow from the weak sequential compactness of $\mathcal {S}_{\mathbb {Z}^d}$, together with the fact that for every $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, the restriction map on states $\rho \mapsto \rho _{\Lambda }$ is weakly continuous. Considering a sequence $(c_n,\rho '_n,\sigma _n')_{n \in \mathbb {N}}$ such that $\lim _n c_n = \Vert \rho - \sigma \Vert _{w_1}$ and, using compactness to extract converging subsequences, assume that $\lim _n \rho _n' = \rho '$, $\lim _n \sigma _n' = \sigma '$. By continuity, we deduce that c, $\rho '$, $\sigma '$ satisfy the conditions in (4.14), hence they are minimizers. A similar argument gives (4.15). $\square $

The rest of this section is devoted to showing the equivalence between Definition 4.1 and Definition 4.2.

Theorem 4.1

For $\sigma $, $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$, we have

$$\begin{aligned} w_1(\rho , \sigma ) = \Vert \rho - \sigma \Vert _{w_1}\,. \end{aligned}$$

(4.16)

We split the proof into several intermediate results. We begin with the following upper bound.

Lemma 4.1

(Upper bound). For $\sigma $, $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$, we have

$$\begin{aligned} w_1(\rho , \sigma ) \le \Vert \rho - \sigma \Vert _{w_1} \,. \end{aligned}$$

(4.17)

Proof

Given $c\ge 0$, $\rho '$, $\sigma ' \in \mathcal {S}_{\mathbb {Z}^d}$ as in the right-hand side of (4.14), for any $a \in \mathbb {N}^d_+$, we write the identity

$$\begin{aligned} \rho _{\Lambda _a} - \sigma _{\Lambda _a} = \sum _{x \in \Lambda _a} c \left( \rho ^{(x)}_{\Lambda _a} - \sigma ^{(x)}_{\Lambda _a}\right) \,, \end{aligned}$$

(4.18)

where we define, for $x \in \Lambda _a$, the states $\rho ^{(x)}_{\Lambda _a} = (\tau _x \rho ')_{\Lambda _a}$ and $\sigma ^{(x)}_{\Lambda _a} = (\tau _x \sigma ')_{\Lambda _a}$. Using (2.6), it follows that

$$\begin{aligned} \textrm{Tr}_{x} \rho ^{(x)}_{\Lambda _a} = \textrm{Tr}_{x} \left[ (\tau _x \rho ')_{\Lambda _a} \right] = (\tau _x \rho ')_{\Lambda _a \setminus x} = \tau _x\rho '_{(\Lambda _a-x) \setminus 0} \,, \end{aligned}$$

(4.19)

and similarly with $\sigma ^{(x)}_{\Lambda _a}$, so that $\textrm{Tr}_{x} \rho ^{(x)}_{\Lambda _a} = \textrm{Tr}_{x} \sigma ^{(x)}_{\Lambda _a}$ for every $x \in \Lambda _a$. Therefore, by definition of $W_1$ on $\mathcal {S}_{\Lambda _a}$, we have the inequality

$$\begin{aligned} \Vert \rho _{\Lambda _a} - \sigma _{\Lambda _a} \Vert _{W_1} \le c | \Lambda _a | \,. \end{aligned}$$

(4.20)

Dividing both sides by $|\Lambda _a|$ and letting $a \rightarrow \infty $, we deduce $w_1(\rho , \sigma ) \le c$, hence (4.17). $\square $

To establish the lower bound, we consider a periodic approximation of the marginal states over a box $\Lambda _a$. We write, for any $a \in \mathbb {N}_+^d$ and $k \in \mathbb {Z}^d$,

$$\begin{aligned} 2ak =(2a_ik_i)_{i=1}^d\,. \end{aligned}$$

(4.21)

Proposition 4.4

(Periodic approximation). For $\sigma $, $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$, and $a \in \mathbb {N}_+^d$, define ${\tilde{\rho }}^a$, ${\tilde{\sigma }}^a \in \mathcal {S}_{\mathbb {Z}^d}^I$ as follows:

$$\begin{aligned} {\tilde{\rho }}^a = \frac{1}{|\Lambda _a|} \sum _{x \in \Lambda _a} \tau _x \bigotimes _{k \in \mathbb {Z}^d} \tau _{2ak} \rho _{\Lambda _a}\,, \qquad {\tilde{\sigma }}^a = \frac{1}{|\Lambda _a|} \sum _{x \in \Lambda _a} \tau _x \bigotimes _{k \in \mathbb {Z}^d} \tau _{2ak} \sigma _{\Lambda _a}\,.\nonumber \\ \end{aligned}$$

(4.22)

Then, we have

$$\begin{aligned} \Vert {\tilde{\rho }}^{a} - {\tilde{\sigma }}^{a} \Vert _{w_1} \le \frac{ \Vert \rho _{\Lambda _a} - \sigma _{\Lambda _a} \Vert _{W_1}}{|\Lambda _a|} \,. \end{aligned}$$

(4.23)

Proof

We introduce first some notation. For disjoint sets $R, S \subseteq \mathbb {Z}^d$, write

$$\begin{aligned} \alpha _{R, S} = \left( \bigotimes _{k \in R} \tau _{2ak} \rho _{\Lambda _a}\right) \otimes \left( \bigotimes _{k \in S}\tau _{2ak} \sigma _{\Lambda _a}\right) \,, \end{aligned}$$

(4.24)

which we further simplify to $\alpha _R = \alpha _{R,S}$ whenever $S = \mathbb {Z}^d {\setminus } R$. With the above notation, we have

$$\begin{aligned} \tilde{\rho }^a = \frac{1}{|\Lambda _a|} \sum _{x \in \Lambda _a} \tau _x \alpha _{\mathbb {Z}^d}\,, \qquad \tilde{\sigma }^a = \frac{1}{|\Lambda _a|} \sum _{x \in \Lambda _a} \tau _x \alpha _{\emptyset }\,. \end{aligned}$$

(4.25)

For $x \in \Lambda _a$, let $c_{x} \ge 0$ and $\rho ^{(x)}_{\Lambda _a}, \sigma ^{(x)}_{\Lambda _a} \in \mathcal {S}_{\Lambda _a}$ be such that

$$\begin{aligned} \textrm{Tr}_x \rho ^{(x)}_{\Lambda _a} = \textrm{Tr}_x \sigma ^{(x)}_{\Lambda _a}\;, \qquad \rho _{\Lambda _a} - \sigma _{\Lambda _a} = \sum _{x \in \Lambda _a} c_{x} \left( \rho ^{(x)}_{\Lambda _a} - \sigma ^{(x)}_{\Lambda _a}\right) , \end{aligned}$$

(4.26)

and introduce the states

$$\begin{aligned} {\tilde{\rho }}^{(x)} = \alpha _{R_d, S_d} \otimes \rho ^{(x)}_{\Lambda _a} \,, \qquad {\tilde{\sigma }}^{(x)} = \alpha _{R_d, S_d} \otimes \sigma ^{(x)}_{\Lambda _a}\,, \end{aligned}$$

(4.27)

where $R_d, S_d$ are disjoint sets with $R_d \cup S_d = \mathbb {Z}^d{\setminus } 0$, to be specified in (4.41) below (their precise definition will be relevant only later). Notice that

$$\begin{aligned} \textrm{Tr}_x \tilde{\rho }^{(x)} = \alpha _{R_d, S_d}\otimes \textrm{Tr}_x \rho ^{(x)}_{\Lambda _a} = \alpha _{R_d, S_d}\otimes \textrm{Tr}_x \sigma ^{(x)}_{\Lambda _a} = \textrm{Tr}_x \tilde{\sigma }^{(x)} \,, \end{aligned}$$

(4.28)

hence, for every $z \in \mathbb {Z}^d$, $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ with $x+ z \notin \Lambda $,

$$\begin{aligned} \left( \tau _{z} \tilde{\rho }^{(x)}\right) _\Lambda = \left( \tau _{z} \tilde{\sigma }^{(x)}\right) _\Lambda \,, \end{aligned}$$

(4.29)

Moreover,

$$\begin{aligned} \sum _{x \in \Lambda _a} c_x \left( \tilde{\rho }^{(x)} - \tilde{\sigma }^{(x)} \right)&= \alpha _{R_d, S_d}\otimes \sum _{x \in \Lambda _a} c_x \left( \rho ^{(x)}_{\Lambda _a} - \sigma ^{(x)}_{\Lambda _a} \right) = \alpha _{R_d ,S_d} \otimes \left( \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right) \nonumber \\&= \alpha _{R_d \cup 0} -\alpha _{R_d}\, . \end{aligned}$$

(4.30)

We assume that $c = \sum _{x \in \Lambda _a} c_x >0$, otherwise (4.26) yields $\rho _{\Lambda _a} = \sigma _{\Lambda _a}$ hence $\tilde{\rho }^{a} = \tilde{\sigma }^{a}$ and (4.23) holds since $\Vert {\tilde{\rho }}^a - {\tilde{\sigma }}^a\Vert _{w_1} = 0$, by choosing $c=0$ and any $\rho ' =\sigma ' \in \mathcal {S}_{\mathbb {Z}^d}$. Therefore, letting $p_x = c_x/c$, we define the states

$$\begin{aligned} \rho ' = \sum _{x \in \Lambda } p_x \tau _{-x} \tilde{\rho }^{(x)}\,, \quad \sigma ' = \sum _{x \in \Lambda } p_x \tau _{-x} \tilde{\sigma }^{(x)}\,. \end{aligned}$$

(4.31)

By (4.28), we have the identity

$$\begin{aligned} \textrm{Tr}_0 \rho ' = \sum _{x \in \Lambda } p_x \textrm{Tr}_0 \tau _{-x} \tilde{\rho }^{(x)} = \sum _{x \in \Lambda } p_x\tau _{-x} \textrm{Tr}_x \tilde{\rho }^{(x)} = \sum _{x \in \Lambda } p_x \tau _{-x}\textrm{Tr}_x \tilde{\sigma }^{(x)} = \textrm{Tr}_0 \sigma '\,.\nonumber \\ \end{aligned}$$

(4.32)

If we prove that, for every $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$,

$$\begin{aligned} \tilde{\rho }^a_{\Lambda } - \tilde{\sigma }^a_{\Lambda } = c \sum _{y \in \Lambda }\left( (\tau _y \rho ')_{\Lambda } - (\tau _y \sigma ')_{\Lambda }\right) \,, \end{aligned}$$

(4.33)

then

$$\begin{aligned} \Vert \tilde{\rho }^a - \tilde{\sigma }^a\Vert _{w_1} \le c\,, \end{aligned}$$

(4.34)

and (4.23) follows. To show (4.33), we write explicitly

$$\begin{aligned} c \sum _{y \in \Lambda } \left( (\tau _{y} \rho ')_{\Lambda } - (\tau _{y} \sigma ')_{\Lambda }\right)&= \sum _{y \in \Lambda } \sum _{x \in \Lambda _a} c_x \left( ( \tau _{y-x} \tilde{\rho }^{(x)})_{\Lambda } - ( \tau _{y-x} \tilde{\sigma }^{(x)})_{\Lambda } \right) \nonumber \\ {}&= \sum _{x \in \Lambda _a} c_x \sum _{y \in \Lambda } ( \tau _{y-x} \tilde{\rho }^{(x)})_{\Lambda } - ( \tau _{y-x} \tilde{\sigma }^{(x)})_{\Lambda } \nonumber \\&= \sum _{x \in \Lambda _a} c_x \sum _{z \in \Lambda -x } ( \tau _{z} \tilde{\rho }^{(x)})_{\Lambda } - ( \tau _{z} \tilde{\sigma }^{(x)})_{\Lambda } \, . \end{aligned}$$

(4.35)

where the last line follows letting $z = y-x$. Using (4.29), we extend the summation over $z \in \Lambda '$, for any $\Lambda ' \in \mathcal {F}_{\mathbb {Z}^d}$ such that

$$\begin{aligned} \Lambda '\supseteq \bigcup _{x \in \Lambda _a} ( \Lambda -x)\,. \end{aligned}$$

(4.36)

Then, exchanging again the order of summation and using (4.30),

$$\begin{aligned} \sum _{x \in \Lambda _a} c_x \sum _{z \in \Lambda ' } ( \tau _{z} \tilde{\rho }^{(x)} )_{\Lambda } - ( \tau _{z} \tilde{\sigma }^{(x)})_{\Lambda }&= \sum _{z \in \Lambda ' } \sum _{x \in \Lambda _a} c_x \left( ( \tau _{z} \tilde{\rho }^{(x)} )_{\Lambda } - ( \tau _{z} \tilde{\sigma }^{(x)})_{\Lambda }) \right) \end{aligned}$$

(4.37)

$$\begin{aligned}&= \sum _{z \in \Lambda ' } (\tau _z \alpha _{R_d \cup 0})_{\Lambda } - ( \tau _z \alpha _{R_d})_{\Lambda } \end{aligned}$$

(4.38)

We now specify the sets $\Lambda '$ and $R_d$ in such a way that the above summation is telescopic and yields (4.33). First, we let

$$\begin{aligned} \Lambda ' = \bigcup _{k \in \Lambda _b} \left( \Lambda _a + 2ak\right) \,, \end{aligned}$$

(4.39)

with $b \in \mathbb {N}^d_+$ sufficiently large so that (4.36) holds. Then, recalling (4.25), to obtain (4.33) it is sufficient to prove that

$$\begin{aligned} \sum _{k \in \Lambda _b } (\tau _{2ak} \alpha _{R_d \cup 0})_{\Lambda } - ( \tau _{2ak} \alpha _{R_d})_{\Lambda } = (\alpha _{\mathbb {Z}^d})_{\Lambda } - (\alpha _\emptyset )_{\Lambda }\,. \end{aligned}$$

(4.40)

The following recursive definition for the subsets $R_d \subseteq \mathbb {Z}^d\setminus 0$ serves exactly this purpose. We let

$$\begin{aligned} R_1 = \mathbb {Z}_-\,, \; R_{d} = \left( \mathbb {Z}^{d-1} \times \mathbb {Z}_-\right) \cup \left( R_{d-1} \times 0 \right) \,, \end{aligned}$$

(4.41)

so that

$$\begin{aligned} R_d \cup \{ 0 \} = R_d+ e_1\,, \end{aligned}$$

(4.42)

where we write $e_i \in \mathbb {Z}^d$ for the natural basis vectors, for $i=1, \ldots , d$ (see Fig. 1).

We decompose the right-hand side in (4.40) as a double summation, over $k_1$ and $k_{\setminus 1} = (-b_1, k_2, \ldots , k_{d})$, so that, for fixed $k_{\setminus 1}$, we find a telescopic sum

$$\begin{aligned} \sum _{0 \le k_1< 2 b_1} \tau _{2ak_{\setminus 1}} \alpha _{R_d + e_1} - \tau _{2ak_{\setminus 1}} \alpha _{R_d}&= \sum _{0 \le k_1 < 2b_1} \tau _{2ak_{\setminus 1}} \alpha _{R_d + (k_1+1)e_1 } - \tau _{2ak_{\setminus 1}} \alpha _{R_d+k_1 e_1 } \nonumber \\&= \tau _{2ak_{\setminus 1}} \alpha _{R_d + 2b_1 e_1 } - \tau _{2ak_{\setminus 1}} \alpha _{R_d}\, . \end{aligned}$$

(4.43)

Since $\Lambda \subseteq \Lambda '$, and

$$\begin{aligned} (R_d + 2b_1 e_1 + k_{\setminus 1}) \cap \Lambda _b = (R_d + e_{2} + k_{\setminus 1}) \cap \Lambda _b\,, \end{aligned}$$

(4.44)

(see Fig. 2), it follows that

$$\begin{aligned} (\tau _{2ak_{\setminus 1}} \alpha _{R_d + 2b_1 e_1})_{\Lambda }- (\tau _{2ak_{\setminus 1}} \alpha _{R_d})_{\Lambda } = (\tau _{2a k_{\setminus 1}} \alpha _{R_d + e_{2}})_{\Lambda } - (\tau _{2ak_{\setminus 1}} \alpha _{R_d})_{\Lambda }\,. \nonumber \\ \end{aligned}$$

(4.45)

We further proceed decomposing the summation upon $k_2$ and

$$\begin{aligned} k_{\setminus 1,2} = (-b_1, -b_2, k_3, \ldots , k_d), \end{aligned}$$

(4.46)

so that we obtain a similar telescopic sum. By iterating the same argument up to summation over $k_d$, we eventually conclude that

$$\begin{aligned} \sum _{k \in \Lambda _b } (\tau _{2ak} \alpha _{R_d \cup 0})_{\Lambda } - ( \tau _{2ak} \alpha _{R_d})_{\Lambda } = (\tau _{ - 2a b} \alpha _{R_d + 2b_d e_d})_{\Lambda } - (\tau _{ - 2a b} \alpha _{R_d})_{\Lambda }\,, \nonumber \\ \end{aligned}$$

(4.47)

which gives (4.40) since

$$\begin{aligned} (R_d+2b_d e_d - 2 ab) \cap \Lambda _b = \Lambda _b\,, \; (R_d - 2 ab) \cap \Lambda _b = \emptyset \,, \end{aligned}$$

(4.48)

thus the proof is completed. $\square $

Using the above construction, we establish the following lower bound, hence completing the proof of Theorem 4.1.

Lemma 4.2

(Lower bound). For $\sigma $, $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$, we have

$$\begin{aligned} w_1(\rho , \sigma ) \ge \Vert \rho - \sigma \Vert _{w_1}\,. \end{aligned}$$

(4.49)

Proof

With the notation of Proposition 4.4, we argue that the states $\tilde{\rho }^a$ weakly converge to $\rho $. For any fixed $b \in \mathbb {N}^d_+$, if $x \in \Lambda _a$ is such that $\Lambda _b \subseteq \Lambda _a+x$, then

$$\begin{aligned} \left( \tau _x \bigotimes _{k \in \mathbb {Z}^d} \tau _{2ak} \rho _{\Lambda _a} \right) _{\Lambda _b} = \rho _{\Lambda _b}\,. \end{aligned}$$

(4.50)

Therefore, denoting by $G_a\subseteq \Lambda _a$ the set of such x’s, we write

$$\begin{aligned} \tilde{\rho }^a_{\Lambda _b} = \frac{|G_a|}{|\Lambda _a|} \rho _{\Lambda _b} + \sum _{x \in \Lambda _a\setminus G_a} \left( \tau _x \bigotimes _{k \in \mathbb {Z}^d} \tau _{2ak} \rho _{\Lambda _a} \right) _{\Lambda _b} \end{aligned}$$

(4.51)

Since $|G_a| = |\Lambda _a|- |\Lambda _b|$, it follows that, as $a \rightarrow \infty $, $\tilde{\rho }^a_{\Lambda _b}$ converge in $\mathcal {S}_{\Lambda _b}$ toward $\rho _{\Lambda _b}$. This holds for any $b \in \mathbb {N}^d_+$, hence we obtain the desired weak convergence in $\mathcal {S}_{\mathbb {Z}^d}$ of $\tilde{\rho }^a$ toward $\rho $. By (4.15), Proposition 4.4 and Proposition 4.1 we have the inequalities

$$\begin{aligned} \Vert \rho - \sigma \Vert _{w_1}&\le \liminf _{a \rightarrow \infty } \Vert \tilde{\rho }^a - \tilde{\sigma }^a \Vert _{w_1} \nonumber \\&\le \limsup _{a \rightarrow \infty } \Vert \tilde{\rho }^a - \tilde{\sigma }^a \Vert _{w_1} \le \sup _{a \in \mathbb {N}^d_+} \frac{ \Vert \rho _{\Lambda _a} - \sigma _{\Lambda _a} \Vert _{W_1}}{|\Lambda _a|} = w_1(\rho , \sigma )\, , \end{aligned}$$

(4.52)

and the proof of (4.49) is completed. $\square $

As a consequence of the above argument, we also obtain that the periodic approximations always converge with respect to the specific quantum $W_1$ distance.

Corollary 4.1

With the notation of Proposition 4.4, we have

$$\begin{aligned} \lim _{a \rightarrow \infty } w_1( \tilde{\rho }^a, \tilde{\sigma }^a) = w_1(\rho , \sigma )\,. \end{aligned}$$

(4.53)

Proof

In (4.52), all inequalities must be equalities, hence the limit. $\square $

5 The Quantum Lipschitz Constant for Infinite Lattices

In this section, we extend the definition of quantum Lipschitz constant of Ref. [17] to interactions on the quantum spin lattice $\mathbb {Z}^d$.

The first step toward defining a Lipschitz constant for interactions is to extend to $\mathcal {O}_{\mathbb {Z}^d}$ the definition of dependence on a site:

Definition 5.1

For any $H\in \mathcal {O}_{\mathbb {Z}^d}$ and any $x\in \mathbb {Z}^d$, we define

$$\begin{aligned} \partial _x H = 2\inf _{A\in \mathcal {O}_{\mathbb {Z}^d\setminus x}}\left\| H - A\right\| _\infty . \end{aligned}$$

(5.1)

Proposition 5.1

For any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, any $H\in \mathcal {O}_\Lambda $ and any $x\in \Lambda $, (3.3) and (5.1) are equivalent.

Proof

See Sect. B.2. $\square $

We can now define the Lipschitz constant for interactions:

Definition 5.2

(Lipschitz constant). We define the Lipschitz constant of $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ as the dependence of the formal Hamiltonian $H^\Phi _{\mathbb {Z}^d}$ on the site 0:

$$\begin{aligned} \left\| \Phi \right\| _L = \partial _0 \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}} \Phi (\Lambda ), \end{aligned}$$

(5.2)

where the series converges absolutely in the $\Vert \cdot \Vert _\infty $ norm.

The Lipschitz constant of $\Phi $ is also equal to the dependence on a fixed site of the local Hamiltonian on a region in the limit of infinite volume:

Proposition 5.2

For any $\Phi \in \mathcal {B}_{\mathbb {Z}^d}^r$ and any $x\in \mathbb {Z}^d$, we have

$$\begin{aligned} \lim _{a\rightarrow \infty }\partial _xH^\Phi _{\Lambda _a} = \left\| \Phi \right\| _L. \end{aligned}$$

(5.3)

Proof

We have

$$\begin{aligned} \left| \left\| \Phi \right\| _L - \partial _xH^\Phi _{\Lambda _a}\right|&\overset{\mathrm {(a)}}{=} \left| \left\| \Phi \right\| _L - \partial _0 H^\Phi _{\Lambda _a-x}\right| = \left| \left\| \Phi \right\| _L - \partial _0 \sum _{0\in \Lambda \subseteq \Lambda _a-x}\Phi (\Lambda )\right| \nonumber \\&\overset{\mathrm {(b)}}{\le } \partial _0 \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}:\Lambda \not \subseteq \Lambda _a-x} \Phi (\Lambda ) \le 2\sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}:\Lambda \not \subseteq \Lambda _a-x}\left\| \Phi (\Lambda )\right\| _\infty \,, \end{aligned}$$

(5.4)

where (a) follows from the translation invariance of $\Phi $ and (b) follows since $\partial _0$ is a seminorm. Since

$$\begin{aligned} \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\left\| \Phi (\Lambda )\right\| _\infty \le \left\| \Phi \right\| _r < \infty , \end{aligned}$$

(5.5)

we have

$$\begin{aligned} \lim _{a\rightarrow \infty }\left| \left\| \Phi \right\| _L - \partial _xH^\Phi _{\Lambda _a}\right| \le 2\lim _{a\rightarrow \infty }\sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}:\Lambda \not \subseteq \Lambda _a-x}\left\| \Phi (\Lambda )\right\| _\infty = 0. \end{aligned}$$

(5.6)

The claim follows. $\square $

5.1 Physical Equivalence

Different interactions may give rise to the same formal Hamiltonian. Such interactions are called physically equivalent. The concept of physical equivalence between interactions was formally introduced in [83,84,85]. We adopt the definition of [86, Section 4.7] and [82, 87]. The reader can find more details in [88, Section 2.4.6].

Definition 5.3

The interaction $\Phi ,\,\Psi \in \mathcal {B}^r_{\mathbb {Z}^d}$ are physically equivalent if any of the following equivalent conditions holds:

1.
The local Hamiltonians of $\Phi $ and $\Psi $ normalized by the number of sites differ only by a constant in the limit of infinite volume:
$$\begin{aligned} \lim _{a\rightarrow \infty }\left\| \frac{H^{\Phi }_\Lambda - H^{\Psi }_\Lambda }{\left| \Lambda _a\right| } - \omega (E_{\Phi -\Psi })\,\mathbb {I}\right\| _\infty = 0, \end{aligned}$$
(5.7)
where $\omega \in \mathcal {S}^I_{\mathbb {Z}^d}$ is the uniform distribution, i.e., $\omega _\Lambda = \frac{\mathbb {I}_\Lambda }{q^{|\Lambda |}}$ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$.
2.
$\Phi $ and $\Psi $ generate the same time evolution: For any $A\in \mathfrak {U}_{\mathbb {Z}^d}$ and any $t\in \mathbb {R}$, we have
$$\begin{aligned} \lim _{a\rightarrow \infty } \left\| e^{i H^\Phi _{\Lambda _a} t}\,A\,e^{-i H^\Phi _{\Lambda _a} t} - e^{i H^\Psi _{\Lambda _a} t}\,A\,e^{-i H^\Psi _{\Lambda _a} t}\right\| _\infty = 0. \end{aligned}$$
(5.8)
3.
For any $A\in \mathfrak {U}^{loc}_{\mathbb {Z}^d}$, we have
$$\begin{aligned} \sum _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}} \left[ \Phi (\Lambda ) - \Psi (\Lambda ),\,A\right] = 0. \end{aligned}$$
(5.9)

Proposition 5.3

The interaction $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ is physically equivalent to the null interaction iff $\left\| \Phi \right\| _L = 0$.

Proof

Let $\Phi $ be physically equivalent to the null interaction. Let
$$\begin{aligned} K = \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\Phi (\Lambda ), \end{aligned}$$
(5.10)
where the series converges absolutely in the $\Vert \cdot \Vert _\infty $ norm. We have for any $A\in \mathfrak {U}_0$
$$\begin{aligned} 0 = \sum _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}} \left[ \Phi (\Lambda ),\,A\right] = \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\left[ \Phi (\Lambda ),\,A\right] = \left[ K,\,A\right] , \end{aligned}$$
(5.11)
therefore $K\in \mathcal {O}_{\mathbb {Z}^d}\setminus 0$ and $\left\| \Phi \right\| _L = \partial _0 K = 0$.
Let $\left\| \Phi \right\| _L = 0$. Let K be as in (5.10). We have
$$\begin{aligned} 2\inf _{H\in \mathcal {O}_{\mathbb {Z}^d\setminus 0}}\left\| K - H\right\| _\infty = \partial _0 K = \left\| \Phi \right\| _L = 0, \end{aligned}$$
(5.12)
therefore there exists a sequence $\left( K^{(n)}\right) _{n\in \mathbb {N}}\subset \mathcal {O}_{\mathbb {Z}^d\setminus 0}$ such that
$$\begin{aligned} \lim _{n\rightarrow \infty }\left\| K - K^{(n)}\right\| _\infty = 0. \end{aligned}$$
(5.13)
Then, $K\in \mathcal {O}_{\mathbb {Z}^d\setminus 0}$, and for any $A_0\in \mathfrak {U}_0$ we have
$$\begin{aligned} \sum _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\left[ \Phi (\Lambda ),\,A_0\right] = \left[ K,\,A_0\right] = 0. \end{aligned}$$
(5.14)
Let us prove that for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ and any $A_\Lambda \in \mathfrak {U}_\Lambda $ we have
$$\begin{aligned} \sum _{X\in \mathcal {F}_{\mathbb {Z}^d}}\left[ \Phi (X),\,A_\Lambda \right] = 0 \end{aligned}$$
(5.15)
by induction on the size of $\Lambda $. We have already proved the claim for $|\Lambda |=1$. Let us fix $x\in \Lambda $. By linearity, we can assume that $A_\Lambda = A_x\otimes A_{\Lambda \setminus x}$ with $A_{\Lambda \setminus x}\in \mathfrak {U}_{\Lambda \setminus x}$. We have from the inductive hypothesis
$$\begin{aligned}{} & {} \sum _{X\in \mathcal {F}_{\mathbb {Z}^d}}\left[ \Phi (X),\,A_x\otimes A_{\Lambda \setminus x}\right] = \sum _{X\in \mathcal {F}_{\mathbb {Z}^d}}\left( \left[ \Phi (X),\,A_x\right] A_{\Lambda \setminus x} \right. \nonumber \\{} & {} \quad \left. + A_x\left[ \Phi (X),\,A_{\Lambda \setminus x}\right] \right) = 0.\nonumber \\ \end{aligned}$$
(5.16)
The claim follows.

$\square $

6 Duality for the $w_1$ Distance

Using Theorem 4.1, we prove the following dual formulation for the specific quantum $W_1$ distance.

Theorem 6.1

(Duality). For $\rho $, $\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$, we have

$$\begin{aligned} w_1(\rho , \sigma ) = \sup \left\{ \rho (E_{\Phi }) - \sigma (E_{\Phi }): \Phi \in \mathcal {B}_{\mathbb {Z}^d}^r, \, \left\| \Phi \right\| _L\le 1\right\} \,. \end{aligned}$$

(6.1)

The result extends Proposition 3.1 to infinite spin systems, but unlike the finite dimensional case, in general there is no reason for the supremum in (6.1) to be attained in $\mathcal {B}_{\mathbb {Z}^d}^r$.

Proof

Let $\Phi \in \mathcal {B}_{\mathbb {Z}^d}^r$ with $\Vert \Phi \Vert _L \le 1$, and let $c \ge 0$, $\rho '$, $\sigma ' \in \mathcal {S}_{\mathbb {Z}^d}$ be as in (4.14). For $a \in \mathbb {N}^d_+$ and $\Lambda \subseteq \Lambda _a \in \mathcal {F}_{\mathbb {Z}^d}$, we have

$$\begin{aligned} \rho ( \Phi (\Lambda ) ) - \sigma ( \Phi (\Lambda ) )&= c \sum _{x \in \Lambda _a} \tau _x\rho '( \Phi (\Lambda )) - \tau _x\rho '( \Phi (\Lambda )) \nonumber \\&= c \sum _{x \in \Lambda _a} \rho '( \Phi (\Lambda -x) ) - \sigma '( \Phi (\Lambda -x) ) \, . \end{aligned}$$

(6.2)

If $x \notin \Lambda $, then $0 \notin \Lambda -x$, hence $\rho '( \Phi (\Lambda -x) ) = \sigma '( \Phi (\Lambda -x) )$ because $\rho '_{\Lambda -x} = \sigma '_{\Lambda -x}$. Therefore, the sum above can be restricted upon $x \in \Lambda $, i.e., $0 \in \Lambda -x$. We then sum upon $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ and make a change of variable $\Lambda ' = \Lambda - x$, obtaining

$$\begin{aligned} \sum _{0 \in \Lambda \subseteq \Lambda _a } \frac{ \rho ( \Phi (\Lambda ) ) - \sigma (\Phi (\Lambda )) }{|\Lambda |}&= c \sum _{0 \in \Lambda \subseteq \Lambda _a } \sum _{x \in \Lambda } \frac{ \rho '( \Phi (\Lambda -x) ) - \sigma '( \Phi (\Lambda -x) ) }{|\Lambda |} \nonumber \\&= c \sum _{0 \in \Lambda ' \in \mathcal {F}_{\mathbb {Z}^d} } \left( \rho '( \Phi (\Lambda ') ) - \sigma '( \Phi (\Lambda ') )\right) \frac{ g_a(\Lambda ') }{|\Lambda '|} , \end{aligned}$$

(6.3)

where $g_a(\Lambda ')$ denotes the number of pairs $(x,\Lambda )$ with $x \in \Lambda \subseteq \Lambda _a$, $0 \in \Lambda $, such that $\Lambda ' = \Lambda -x$. Notice that the above is a finite sum, since we may restrict upon $\Lambda ' \subseteq \Lambda _{2a}$, otherwise $g_a(\Lambda ') = 0$. Moreover, for every such pair $(x,\Lambda )$, it must be $x \in -\Lambda '$, since $0 \in \Lambda $. Therefore, for every $\Lambda ' \in \mathcal {F}_{\mathbb {Z}^d}$ with $0 \in \Lambda '$,

$$\begin{aligned} 0 \le \frac{ g_a(\Lambda ') }{|\Lambda '|} \le 1\,, \qquad \text {and} \qquad \lim _{a \rightarrow \infty } \frac{g_a(\Lambda ')}{|\Lambda '|} =1\,, \end{aligned}$$

(6.4)

since every pair $(x, \Lambda )$ with $x \in - \Lambda '$ and $\Lambda =x+\Lambda '$ satisfies $x \in \Lambda \subseteq \Lambda _a$ if a is sufficiently large. Therefore, by the dominated convergence theorem for series, we deduce that

$$\begin{aligned} \lim _{a \rightarrow \infty } \sum _{0 \in \Lambda ' \in \mathcal {F}_{\mathbb {Z}^d} } \rho '( \Phi (\Lambda ') ) \frac{ g_a(\Lambda ') }{|\Lambda '|} = \sum _{0 \in \Lambda ' \in \mathcal {F}_{\mathbb {Z}^d} } \rho '( \Phi (\Lambda ') )\,, \end{aligned}$$

(6.5)

and similarly for $\sigma '$. The left-hand side in (6.3) converges to $\rho (E_\Phi ) - \sigma (E_{\Phi })$ as $a \rightarrow \infty $, hence we obtain the identity

$$\begin{aligned} \rho (E_\Phi ) - \sigma (E_{\Phi }) = c \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \rho '( \Phi (\Lambda ) ) - \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \sigma '( \Phi (\Lambda ) ). \end{aligned}$$

(6.6)

Given $A \in \mathcal {O}_{\mathbb {Z}^d\setminus 0}$, since $\rho '_{\mathbb {Z}^d{\setminus } 0} = \sigma '_{\mathbb {Z}^d{\setminus } 0}$, we have $\rho '( A ) = \sigma '(A)$, thus

$$\begin{aligned} \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \rho '( \Phi (\Lambda ) ) - \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \sigma '( \Phi (\Lambda ) )&= \rho '\left( \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \Phi (\Lambda ) - A \right) \nonumber \\&\quad - \sigma '\left( \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \Phi (\Lambda ) - A \right) \end{aligned}$$

(6.7)

$$\begin{aligned}&\le 2 \left\| \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \Phi (\Lambda ) - A \right\| _\infty \, . \end{aligned}$$

(6.8)

Being $A \in \mathcal {O}_{\mathbb {Z}^d\setminus 0}$ arbitrary, we deduce the inequality

$$\begin{aligned} \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \rho '( \Phi (\Lambda ) ) - \sum _{0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d} } \sigma '( \Phi (\Lambda ) ) \le \Vert \Phi \Vert _{L} \le 1\,, \end{aligned}$$

(6.9)

which from (6.6) gives

$$\begin{aligned} \sup \left\{ \rho (E_{\Phi }) - \sigma (E_{\Phi }): \Phi \in \mathcal {B}_{\mathbb {Z}^d}^r, \, \left\| \Phi \right\| _L\le 1\right\} \le c\,. \end{aligned}$$

(6.10)

Recalling that c and $\rho '$, $\sigma '$ are chosen as in (4.14), we deduce

$$\begin{aligned} \sup \left\{ \rho (E_{\Phi }) - \sigma (E_{\Phi }): \Phi \in \mathcal {B}_{\mathbb {Z}^d}^r, \, \left\| \Phi \right\| _L\le 1\right\} \le \Vert \rho - \sigma \Vert _{w_1}, \end{aligned}$$

(6.11)

i.e., inequality $\ge $ holds in (6.1).

For the converse inequality, given any $H \in \mathcal {O}_{\Lambda _a}$ with $\Vert H\Vert _L \le 1$, we define the translation-invariant interaction

$$\begin{aligned} \Phi ^H(\Lambda ) = \frac{ \tau _x H}{|\Lambda _a|} \end{aligned}$$

(6.12)

if $\Lambda = \Lambda _a + x$ for some $x \in \mathbb {Z}^d$, $\Phi ^H(\Lambda ) = 0$ otherwise. Notice that $\Phi ^H \in \mathcal {B}_{\mathbb {Z}^d}^r$ and

$$\begin{aligned} \Vert \Phi ^H \Vert _L \le \sum _{x \in \Lambda _a} \frac{ \partial _0 \tau _{-x} H}{|\Lambda _a|} \le \sum _{x \in \Lambda _a} \frac{ \partial _{x} H}{|\Lambda _a|} \le \Vert H\Vert _L \le 1\,. \end{aligned}$$

(6.13)

Since $\rho $, $\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$, we have $\rho (\tau _xH) = \rho (H)$, $\sigma (\tau _x H) = \sigma (H)$, hence

$$\begin{aligned} \rho (E_{\Phi ^H}) - \sigma (E_{\Phi ^H}) = \sum _{x \in \Lambda _a} \frac{ \rho ( \tau _{-x} H ) - \sigma (\tau _{-x} H)}{|\Lambda _a|^2} = \frac{ \rho (H) - \sigma (H)}{|\Lambda _a|}\,. \end{aligned}$$

(6.14)

The duality for the quantum $W_1$ distance on the finite lattice $\Lambda _a$ yields

$$\begin{aligned} \frac{ \Vert \rho _{\Lambda _a}- \sigma _{\Lambda _a} \Vert _{W_1} }{|\Lambda _a|}&= \sup \left\{ \rho (E_{\Phi ^H}) - \sigma (E_{\Phi ^H}): H \in \mathcal {O}_{\Lambda _a} , \, \left\| H\right\| _L\le 1\right\} \nonumber \\&\le \sup \left\{ \rho (E_{\Phi }) - \sigma (E_{\Phi }): \Phi \in \mathcal {B}_{\mathbb {Z}^d}^r , \, \left\| \Phi \right\| _L\le 1\right\} \, . \end{aligned}$$

(6.15)

Letting $a \rightarrow \infty $, we obtain inequality $\le $ in (6.1), hence the thesis. $\square $

7 Recovery of Ornstein’s $\bar{d}$-Distance

As in the finite dimensional case, the specific quantum $W_1$ distance between states recovers Ornstein’s $\bar{d}$-distance, when restricted to diagonal states in the canonical basis, i.e., $\rho \in \mathcal {S}_{\mathbb {Z}^d}$ such that, for every $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, $\rho _{\Lambda }$ is diagonal in the basis $( |x \rangle \langle x |)_{x \in [q]^{\Lambda }}$.

There is indeed a correspondence between probability measures $\mu $ on $[q]^\mathbb {Z^d}$ and such states, defined by mapping $\mu $ to the diagonal state $\rho \in \mathcal {S}_{\mathbb {Z}^d}$ such that, for every $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$,

$$\begin{aligned} \rho _{\Lambda } = \sum _{x \in [q]^{\Lambda }} \mu _{\Lambda }(x)\, | x \rangle \langle x|\,, \end{aligned}$$

(7.1)

where $\mu _{\Lambda }$ denotes the marginal of $\mu $ on $\Lambda $. Since states are determined by their collection of marginals, (7.1) completely determines $\rho $.

The correspondence is clearly invertible, arguing similarly on the space of probability measures $[q]^\mathbb {Z^d}$. With a slight abuse of notation, we write $|x \rangle \langle x| \in \mathcal {S}_{\mathbb {Z}^d}$ for the diagonal state corresponding to the Dirac probability measure concentrated at $x \in [q]^{\mathbb {Z}^d}$, so that one can also write

$$\begin{aligned} \rho = \int _{[q]^{\mathbb {Z}^d} } | x \rangle \langle x|\,\textrm{d} \mu (x)\,, \end{aligned}$$

(7.2)

where integration is in the sense of Pettis (also called weak integral).

Given two shift-invariant (i.e., stationary) probability measures $\mu $, $\nu $ on the infinite product space $[q]^{\mathbb {Z}^d}$, Ornstein’s $\bar{d}$-distance [7, 13] is defined as

$$\begin{aligned} \bar{d} (\mu , \nu ) = \sup _{a \in \mathbb {N}^d_+} \frac{W_{1}\left( \mu _{\Lambda _a}, \nu _{\Lambda _a}\right) }{|\Lambda _a|}\,, \end{aligned}$$

(7.3)

where, for $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, $W_{1}$ denotes the classical optimal transport distance with Hamming cost on $[q]^{\Lambda }$, i.e.,

$$\begin{aligned} W_{1}\left( \mu _{\Lambda }, \nu _{\Lambda } \right) = \min _{\pi \in \mathcal {C}(\mu _{\Lambda }, \nu _{\Lambda })} \sum _{x, y \in [q]^{\Lambda }} h(x,y) \pi (x,y), \end{aligned}$$

(7.4)

with $\mathcal {C}(\mu _{\Lambda }, \nu _{\Lambda })$ being the set of couplings between the probability distributions $\mu _{\Lambda }$, $\nu _{\Lambda }$, and

$$\begin{aligned} h(x, y) = \left| \left\{ i \in \Lambda \,: \, x_i \ne y_i \right\} \right| \,. \end{aligned}$$

(7.5)

This distance is usually defined only in the case $d=1$, but the extension to $d \ge 1$ is straightforward and informally discussed already in [7, Appendix 4].

Proposition 7.1

Given stationary probability measures $\mu $, $\nu $ on $[q]^{\mathbb {Z}^d}$, let $\rho $, $\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$ denote the associated diagonal states,

$$\begin{aligned} \rho = \int _{[q]^{\mathbb {Z}^d}} | x \rangle \langle x| \textrm{d} \mu (x), \quad \sigma = \int _{[q]^{\mathbb {Z}^d}} | x \rangle \langle x| \textrm{d} \nu (x)\,. \end{aligned}$$

(7.6)

Then, we have

$$\begin{aligned} w_1(\rho , \sigma ) = \bar{d} (\mu , \nu )\,. \end{aligned}$$

(7.7)

Proof

For every $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, we have, by [17, Proposition 5],

$$\begin{aligned} \left\| \rho _{\Lambda } - \sigma _{\Lambda } \right\| _{W_1} = W_1\left( \mu _{\Lambda }, \nu _{\Lambda } \right) . \end{aligned}$$

(7.8)

Choosing $\Lambda = \Lambda _a$, for $a \in \mathbb {N}^d_+$, dividing by $|\Lambda _a|$ and letting $a \rightarrow \infty $ yields the thesis. $\square $

Ornstein’s $\bar{d}$-distance (7.3) can be equivalently defined [13, Theorem 1] as

$$\begin{aligned} {\bar{d}}\left( \mu , \nu \right) = \min _{\pi \in \mathcal {C}^I(\mu , \nu )} \sum _{x, y \in [q]} h(x,y) \pi _0(x,y)\,, \end{aligned}$$

(7.9)

where $\mathcal {C}^I(\mu ,\nu )$ denotes the set of stationary couplings between the probability distributions $\mu $, $\nu $, and $\pi _0$ is the marginal density of $\pi $ at 0. Definition 4.2 together with Theorem 4.1 provides a similar characterization for quantum spin systems, where stationary couplings are replaced in (4.14) by representations of the difference the states as series of translates. In fact, if the states $\rho $, $\sigma $ are diagonal, we can also restrict minimization in (4.14) to diagonal states $\rho '$, $\sigma '$, corresponding to probability measures $\mu '$, $\nu '$, obtaining the following further equivalent representation of Ornstein’s distance.

Corollary 7.1

Given stationary probability measures $\mu $, $\nu $ on $[q]^{\mathbb {Z}^d}$, we have

$$\begin{aligned}&\bar{d} (\mu , \nu )= \min \Bigg \{ c\ge 0 : \exists \, \mu ', \, \nu ' \text { probability measures on } [q]^{\mathbb {Z}^d} :\; \mu '_{\mathbb {Z}^d \setminus 0} = \nu '_{\mathbb {Z}^d\setminus 0}\,,\nonumber \\&\quad \quad \quad \mu _\Lambda - \nu _\Lambda = c \sum _{x \in \Lambda } (\tau _x \mu ')_{\Lambda } - (\tau _x \nu ')_{\Lambda }\quad \forall \, \Lambda \in \mathcal {F}_{\mathbb {Z}^d}\Bigg \}\,. \end{aligned}$$

(7.10)

To our knowledge, duality for Orstein’s $\bar{d}$-distance is not explicitly discussed in the literature. A result can be obtained directly from Theorem 6.1 for diagonal states, simply noticing that the supremum may run among interactions $\Phi $ such that each $\Phi (\Lambda )$ is also diagonal, i.e., corresponding to a function

$$\begin{aligned} f(\Lambda ): [q]^{\mathbb {Z}^d} \rightarrow \mathbb {R}\,, \end{aligned}$$

(7.11)

depending only on the coordinates in $\Lambda $. Let us denote by $\mathcal {B}^{r, {\text {diag}}}_{\mathbb {Z}^d}$ the set of translation invariant diagonal interactions satisfying (2.18). The Lipschitz constant of $f \in \mathcal {B}^{r, {\text {diag}}}_{\mathbb {Z}^d}$ coincides with the oscillation of the function on $[q]^{\mathbb {Z}^d}$,

$$\begin{aligned} x \mapsto \sum _{ 0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}} f(\Lambda )(x) \end{aligned}$$

(7.12)

with respect to the 0-th coordinate, i.e.,

$$\begin{aligned} \Vert f \Vert _{L} = \sup \left\{ \sum _{ 0 \in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\left( f(\Lambda ) (x) - f(\Lambda ) (y) \right) \,: \, x, y \in [q]^{\mathbb {Z}^d}, x_k=y_k \text { for every } k \in \mathbb {Z}^d \setminus 0 \right\} \,.\nonumber \\ \end{aligned}$$

(7.13)

The specific energy of f is identified with the function on $[q]^{\mathbb {Z}^d}$,

$$\begin{aligned} x \mapsto e_f(x) = \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\frac{f(\Lambda )(x)}{\left| \Lambda \right| }\,. \end{aligned}$$

(7.14)

With this notation, Theorem 6.1 yields the following result.

Corollary 7.2

Given stationary probability measures $\mu $, $\nu $ on $[q]^\mathbb {Z^d}$, we have

$$\begin{aligned} \bar{d}(\mu ,\nu ) = \sup \left\{ \int _{[q]^{\mathbb {Z}^d}} e_f(x) d \mu (x)- \int _{[q]^{\mathbb {Z}^d}} e_f(x) \textrm{d} \nu (x): f \in \mathcal {B}_{\mathbb {Z}^d}^{r,{\text {diag}}}, \, \left\| f \right\| _L\le 1\right\} \,.\nonumber \\ \end{aligned}$$

(7.15)

8 Quantum Concentration Inequalities

8.1 Poincaré Inequality

In this section, we prove the following quantum Poincaré inequality stating that for any interaction $\Phi $, the variance of the local Hamiltonian on $\Lambda _a$ on a product state scales at most linearly with the volume of $\Lambda _a$ in the limit $a\rightarrow \infty $, and the proportionality constant is upper bounded by the square of the Lipschitz constant of $\Phi $:

Proposition 8.1

(Poincaré inequality). Let $\omega \in \mathcal {S}_{\mathbb {Z}^d}$ be a product state. Then, for any interaction $\Phi \in \mathcal {B}_{\mathbb {Z}^d}^r$ we have

$$\begin{aligned} \limsup _{a\rightarrow \infty }\frac{\textrm{Var}_{\omega _{\Lambda _a}}H^\Phi _{\Lambda _a}}{\left| \Lambda _a\right| } \le \left\| \Phi \right\| _L^2. \end{aligned}$$

(8.1)

8.1.1 Proof of Proposition 8.1

The proof of Proposition 8.1 is based on its counterpart for quantum spin systems on finite lattices:

Proposition 8.2

(Quantum Poincaré inequality [20, Lemma F.1]). Let $\Lambda $ be a finite set, and let $\omega \in \mathcal {S}_\Lambda $ be a product state. Then, for any $H\in \mathcal {O}_\Lambda $ we have

$$\begin{aligned} \textrm{Var}_\omega H = \textrm{Tr}\left[ \omega \left( H - \textrm{Tr}\left[ \omega \,H\right] \mathbb {I}\right) ^2\right] \le \sum _{x\in \Lambda }\left( \partial _x H\right) ^2. \end{aligned}$$

(8.2)

The key step in the proof of Proposition 8.1 is the following proposition, which states that the Lipschitz constant of $\Phi $ is equal to the quadratic average of the dependence of the local Hamiltonians of $\Phi $ on the sites in the limit of infinite volume:

Proposition 8.3

Let $\Phi \in \mathcal {B}_{\mathbb {Z}^d}^r$. Then,

$$\begin{aligned} \lim _{a\rightarrow \infty }\frac{1}{\left| \Lambda _a\right| }\sum _{x\in \Lambda _a}\left( \partial _x H^\Phi _{\Lambda _a}\right) ^2 = \left\| \Phi \right\| _L^2. \end{aligned}$$

(8.3)

Proof

We fix $\epsilon >0$. Let $N\in \mathbb {N}$ and $0\in \Lambda _1\in \mathcal {F}_{\mathbb {Z}^d},\,\ldots ,\,0\in \Lambda _N\in \mathcal {F}_{\mathbb {Z}^d}$ such that

$$\begin{aligned} \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d},\,\Lambda \ne \Lambda _1,\,\ldots ,\,\Lambda _N}\left\| \Phi (\Lambda )\right\| _\infty < \epsilon . \end{aligned}$$

(8.4)

Let $a\in \mathbb {N}_+^d$ such that $\Lambda _1\cup \ldots \cup \Lambda _N\subseteq \Lambda _a$ and

$$\begin{aligned} \left| \partial _0 H^\Phi _{\Lambda _{a}} - \left\| \Phi \right\| _L\right| < 2\,\epsilon . \end{aligned}$$

(8.5)

We have

$$\begin{aligned} \sum _{0\in \Lambda \in \mathcal {F}_{\mathbb {Z}^d},\,\Lambda \not \subseteq \Lambda _a}\left\| \Phi (\Lambda )\right\| _\infty < \epsilon . \end{aligned}$$

(8.6)

Let us fix $x\in \mathbb {Z}^d$. For any $b\in \mathbb {N}_+^d$ with $b\ge a \pm x$, we have $\Lambda _a\subseteq \Lambda _b-x$ and

$$\begin{aligned} \left| \partial _x H^\Phi _{\Lambda _b} - \partial _0 H^\Phi _{\Lambda _a}\right|&\overset{\mathrm {(a)}}{=} \left| \partial _0 H^\Phi _{\Lambda _b-x} - \partial _0 H^\Phi _{\Lambda _a}\right| \overset{\mathrm {(b)}}{\le } \partial _0\left( H^\Phi _{\Lambda _b-x} - H^\Phi _{\Lambda _a}\right) = \partial _0\sum _{\Lambda \subseteq \Lambda _b-x,\,\Lambda \not \subseteq \Lambda _a}\Phi (\Lambda ) \nonumber \\&= \partial _0\sum _{0\in \Lambda \subseteq \Lambda _b-x,\,\Lambda \not \subseteq \Lambda _a}\Phi (\Lambda ) \le 2\sum _{0\in \Lambda \subseteq \Lambda _b-x,\,\Lambda \not \subseteq \Lambda _a}\left\| \Phi (\Lambda )\right\| _\infty < 2\,\epsilon \,, \nonumber \\ \end{aligned}$$

(8.7)

where (a) follows from the translation invariance of $\Phi $ and (b) follows since $\partial _0$ is a seminorm. We then have

$$\begin{aligned} \left| \partial _x H^\Phi _{\Lambda _{b}} - \left\| \Phi \right\| _L\right| < 4\,\epsilon . \end{aligned}$$

(8.8)

Let $b>a$, such that any $x\in \Lambda _{b-a}$ satisfies $b\ge a\pm x$. We have from Lemma C.4

$$\begin{aligned} \sum _{x\in \Lambda _b}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2&= \sum _{x\in \Lambda _{b-a}}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2 + \sum _{x\in \Lambda _b\setminus \Lambda _{b-a}}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2\nonumber \\&\le \left| \Lambda _{b-a}\right| \left( \left\| \Phi \right\| _L + 4\,\epsilon \right) ^2 + 4\left( \left| \Lambda _b\right| - \left| \Lambda _{b-a}\right| \right) \left\| \Phi \right\| _r^2\,, \end{aligned}$$

(8.9)

therefore

$$\begin{aligned} \limsup _{b\rightarrow \infty } \frac{1}{\left| \Lambda _b\right| }\sum _{x\in \Lambda _b}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2&\le \limsup _{b\rightarrow \infty }\left( \frac{\left| \Lambda _{b-a}\right| }{\left| \Lambda _b\right| }\left( \left\| \Phi \right\| _L + 4\,\epsilon \right) ^2 + 4\left( 1 - \frac{\left| \Lambda _{b-a}\right| }{\left| \Lambda _b\right| }\right) \left\| \Phi \right\| _r^2\right) \nonumber \\&=\left( \left\| \Phi \right\| _L + 4\,\epsilon \right) ^2\,. \end{aligned}$$

(8.10)

Since $\epsilon $ is arbitrary, we get

$$\begin{aligned} \limsup _{b\rightarrow \infty } \frac{1}{\left| \Lambda _b\right| }\sum _{x\in \Lambda _b}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2 \le \left\| \Phi \right\| _L^2. \end{aligned}$$

(8.11)

$$\begin{aligned} \sum _{x\in \Lambda _b}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2 \ge \sum _{x\in \Lambda _{b-a}}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2 \ge \left| \Lambda _{b-a}\right| \left( \left\| \Phi \right\| _L - 4\,\epsilon \right) ^2, \end{aligned}$$

(8.12)

and

$$\begin{aligned} \liminf _{b\rightarrow \infty } \frac{1}{\left| \Lambda _b\right| }\sum _{x\in \Lambda _b}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2 \ge \liminf _{b\rightarrow \infty }\frac{\left| \Lambda _{b-a}\right| }{\left| \Lambda _b\right| }\left( \left\| \Phi \right\| _L - 4\,\epsilon \right) ^2 = \left( \left\| \Phi \right\| _L - 4\,\epsilon \right) ^2. \nonumber \\ \end{aligned}$$

(8.13)

Since $\epsilon $ is arbitrary, we get

$$\begin{aligned} \liminf _{b\rightarrow \infty } \frac{1}{\left| \Lambda _b\right| }\sum _{x\in \Lambda _b}\left( \partial _x H^\Phi _{\Lambda _b}\right) ^2 \ge \left\| \Phi \right\| _L^2. \end{aligned}$$

(8.14)

The claim follows. $\square $

We can now conclude the proof of Proposition 8.1. We have from Proposition 8.2 and Proposition 8.3

$$\begin{aligned} \limsup _{a\rightarrow \infty }\frac{\textrm{Var}_{\omega _{\Lambda _a}}H^\Phi _{\Lambda _a}}{\left| \Lambda _a\right| } \le \limsup _{a\rightarrow \infty }\frac{1}{\left| \Lambda _a\right| }\sum _{x\in \Lambda _a}\left( \partial _x H^\Phi _{\Lambda _a}\right) ^2 = \left\| \Phi \right\| _L^2. \end{aligned}$$

(8.15)

The claim follows.

8.2 Gaussian Concentration Inequality

In this section, we prove the following Gaussian concentration inequality (Theorem 8.1) for quantum spin systems on finite lattices and apply it to prove an upper bound to the pressure of an interaction in terms of its Lipschitz constant (Corollary 8.1).

Theorem 8.1

(Gaussian concentration inequality). Let $\Lambda $ be a finite set and let $\omega \in \mathcal {S}_\Lambda $ be a product state with full support. Then, for any $H\in \mathcal {O}_\Lambda $ we have

$$\begin{aligned} \ln \textrm{Tr}_\Lambda e^{H + \ln \omega } \le \textrm{Tr}_\Lambda \left[ \omega \,H\right] + \frac{1}{2}\sum _{x\in \Lambda }\left( \partial _x H\right) ^2. \end{aligned}$$

(8.16)

Remark 8.1

Ref. [17] proved the following Gaussian concentration inequality:

Theorem 8.2

([17, Theorem 3]). Let $\Lambda $ be a finite set and let $\omega = \frac{\mathbb {I}_\Lambda }{q^{|\Lambda |}}\in \mathcal {S}_\Lambda $ be the uniform distribution. Then, for any $H\in \mathcal {O}_\Lambda $ we have

$$\begin{aligned} \ln \textrm{Tr}_\Lambda e^{H + \ln \omega } \le \textrm{Tr}_\Lambda \left[ \omega \,H\right] + \frac{\left| \Lambda \right| }{8}\left\| H\right\| _L^2. \end{aligned}$$

(8.17)

Upon replacing the constant $\frac{1}{8}$ by $\frac{1}{2}$, the inequality (8.17) is implied by (8.16).

Proof

We will prove the claim by induction on the size of $\Lambda $. For $\Lambda =\emptyset $ equality holds in (8.16). Let us fix $x\in \Lambda $, and let $\Lambda _0 = \Lambda \setminus x$. Let $H_{\Lambda _0}\in \mathcal {O}_{\Lambda _0}$ such that

$$\begin{aligned} \partial _x H = 2\left\| H - H_{\Lambda _0}\right\| _\infty . \end{aligned}$$

(8.18)

We have

$$\begin{aligned} \left\| H - \textrm{Tr}_x\left[ \omega _x\,H\right] \right\| _\infty{} & {} = \left\| H - H_{\Lambda _0} - \textrm{Tr}_x\left[ \omega _x\left( H - H_{\Lambda _0}\right) \right] \right\| _\infty \nonumber \\{} & {} \le 2\left\| H - H_{\Lambda _0}\right\| _\infty = \partial _x H. \end{aligned}$$

(8.19)

Using the inequality

$$\begin{aligned} e^t \le \frac{\sinh a}{a}\,t + e^\frac{a^2}{2},\qquad |t|\le a, \end{aligned}$$

(8.20)

we get

$$\begin{aligned} e^{H - \textrm{Tr}_x\left[ \omega _x H\right] } \le \frac{\sinh \partial _x H}{\partial _x H}\left( H - \textrm{Tr}_x\left[ \omega _x\,H\right] \right) + e^\frac{\left( \partial _x H\right) ^2}{2}, \end{aligned}$$

(8.21)

therefore

$$\begin{aligned} \textrm{Tr}_x\left[ \omega _x\,e^{H - \textrm{Tr}_x\left[ \omega _x H\right] }\right] \le e^\frac{\left( \partial _x H\right) ^2}{2}. \end{aligned}$$

(8.22)

For any $y\in \Lambda _0$, let $H_{\Lambda {\setminus } y}\in \mathcal {O}_{\Lambda {\setminus } y}$ such that

$$\begin{aligned} \partial _y H = 2\left\| H - H_{\Lambda \setminus y}\right\| _\infty . \end{aligned}$$

(8.23)

We have

$$\begin{aligned} \partial _y \textrm{Tr}_x\left[ \omega _x H\right] \le 2\left\| \textrm{Tr}_x\left[ \omega _x H\right] - \textrm{Tr}_x\left[ \omega _x H_{\Lambda \setminus y}\right] \right\| _\infty \le 2\left\| H - H_{\Lambda \setminus y}\right\| _\infty = \partial _y H.\nonumber \\ \end{aligned}$$

(8.24)

We then have

$$\begin{aligned} \ln \textrm{Tr}_\Lambda e^{H + \ln \omega }&= \ln \textrm{Tr}_\Lambda \exp \left( H - \textrm{Tr}_x\left[ \omega _x H\right] + \ln \omega _x + \textrm{Tr}_x\left[ \omega _x H\right] + \ln \omega _{\Lambda _0}\right) \nonumber \\&\overset{\mathrm {(a)}}{\le } \ln \int _0^\infty \textrm{Tr}_\Lambda \left[ e^{\textrm{Tr}_x\left[ \omega _x H\right] + \ln \omega _{\Lambda _0}} \left( \omega _x^{-1}+t\right) ^{-1} e^{H - \textrm{Tr}_x\left[ \omega _x H\right] }\left( \omega _x^{-1}+t\right) ^{-1}\right] \textrm{d}t\nonumber \\&= \ln \textrm{Tr}_{\Lambda _0}\left[ e^{\textrm{Tr}_x\left[ \omega _x H\right] + \ln \omega _{\Lambda _0}}\,\textrm{Tr}_x\left[ e^{H - \textrm{Tr}_x\left[ \omega _x H\right] }\int _0^\infty \left( \omega _x^{-1}+t\right) ^{-2}\textrm{d}t\right] \right] \nonumber \\&= \ln \textrm{Tr}_{\Lambda _0}\left[ e^{\textrm{Tr}_x\left[ \omega _x H\right] + \ln \omega _{\Lambda _0}}\,\textrm{Tr}_x\left[ \omega _x\,e^{H - \textrm{Tr}_x\left[ \omega _x H\right] }\right] \right] \nonumber \\&\overset{\mathrm {(b)}}{\le } \frac{\left( \partial _x H\right) ^2}{2}+ \ln \textrm{Tr}_{\Lambda _0}e^{\textrm{Tr}_x\left[ \omega _x H\right] + \ln \omega _{\Lambda _0}}\nonumber \\&\overset{\mathrm {(c)}}{\le } \frac{\left( \partial _x H\right) ^2}{2} + \textrm{Tr}_\Lambda \left[ \omega \,H\right] + \frac{1}{2}\sum _{y\in \Lambda _0}\left( \partial _y \textrm{Tr}_x\left[ \omega _x H\right] \right) ^2\nonumber \\&\overset{\mathrm {(d)}}{\le } \textrm{Tr}_\Lambda \left[ \omega \,H\right] + \frac{1}{2}\sum _{y\in \Lambda }\left( \partial _y H\right) ^2\,, \end{aligned}$$

(8.25)

where (a) follows from the Golden–Thompson inequality with three matrices [89], (b) from (8.22), (c) from the inductive hypothesis and (d) from (8.24). The claim follows. $\square $

Corollary 8.1

Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ and let $\omega \in \mathcal {S}^I_{\mathbb {Z}^d}$ be the uniform distribution, i.e., $\omega _\Lambda = \frac{\mathbb {I}_\Lambda }{q^{|\Lambda |}}$ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$. Then,

$$\begin{aligned} P(\Phi ) \le \ln q + \frac{\left\| \Phi \right\| _L^2}{2} - \omega (E_\Phi ). \end{aligned}$$

(8.26)

Proof

We have

$$\begin{aligned} P(\Phi )&= \lim _{a\rightarrow \infty }\frac{\ln \textrm{Tr}_{\Lambda _a} e^{-H^\Phi _{\Lambda _a}}}{\left| \Lambda _a\right| } \overset{\mathrm {(a)}}{\le } \ln q + \lim _{a\rightarrow \infty }\left( \frac{1}{2\left| \Lambda _a\right| }\sum _{x\in \Lambda _a}\left( \partial _x H^\Phi _{\Lambda _a}\right) ^2 - \frac{\omega \left( H^\Phi _{\Lambda _a}\right) }{\left| \Lambda _a\right| }\right) \nonumber \\&\overset{\mathrm {(b)}}{=} \ln q + \frac{\left\| \Phi \right\| _L^2}{2} - \omega (E_\Phi )\,, \end{aligned}$$

(8.27)

where (a) follows from Theorem 8.1 and (b) from Proposition 8.3 and (2.21). The claim follows. $\square $

9 $W_1$ Continuity of the von Neumann Entropy

In this section, we prove the following continuity bound of the von Neumann entropy with respect to the quantum $W_1$ distance:

Theorem 9.1

($W_1$ continuity of the von Neumann entropy). Let $\Lambda $ be a finite set. For any $\rho ,\,\sigma \in \mathcal {S}_\Lambda $, we have

$$\begin{aligned} \frac{\left| S(\rho ) - S(\sigma )\right| }{\left| \Lambda \right| } \le h_2\left( \frac{\left\| \rho -\sigma \right\| _{W_1}}{\left| \Lambda \right| }\right) + \frac{\left\| \rho - \sigma \right\| _{W_1}}{\left| \Lambda \right| }\ln \left( q^2-1\right) . \end{aligned}$$

(9.1)

Theorem 9.1 generalizes to the quantum setting the following continuity bound of the Shannon entropy with respect to the classical $W_1$ distance:

Theorem 9.2

($W_1$ continuity of the Shannon entropy [90, Proposition 8]). Let $\Lambda $ be a finite set. For any two probability distributions $\mu ,\,\nu $ on $[q]^\Lambda $, we have

$$\begin{aligned} \left| S(\mu ) - S(\nu )\right| \le \left| \Lambda \right| h_2\left( \frac{W_1(\mu ,\nu )}{\left| \Lambda \right| }\right) + W_1(\mu ,\nu )\ln \left( q-1\right) . \end{aligned}$$

(9.2)

The quantum continuity bound (9.1) is identical to the classical bound (9.2) upon replacing q by $q^2$. Such replacement is necessary, since the von Neumann entropy does not always satisfy the classical bound (9.2) [17].

Remark 9.1

Ref. [17] proved the following weaker continuity bound for the von Neumann entropy in terms of the $W_1$ distance:

Theorem 9.3

([17, Theorem 1]). Let $\Lambda $ be a finite set. For any $\rho ,\,\sigma \in \mathcal {S}_\Lambda $,

$$\begin{aligned} \left| S(\rho ) - S(\sigma )\right| \le g\left( \left\| \rho -\sigma \right\| _{W_1}\right) + \left\| \rho - \sigma \right\| _{W_1}\ln \left( q^2\left| \Lambda \right| \right) , \end{aligned}$$

(9.3)

where for any $t\ge 0$

$$\begin{aligned} g(t) = \left( t+1\right) \ln \left( t+1\right) - t\ln t. \end{aligned}$$

(9.4)

Due to the term $\ln \left| \Lambda \right| $, the bound (9.3) does not have the right scaling with respect to $\left| \Lambda \right| $ to prove a continuity bound for the specific entropy in terms of the specific quantum $W_1$ distance. On the contrary, Theorem 9.1 will be crucial in the proof of such a bound, which will be the subject of Sect. 10.

9.1 Proof of Theorem 9.1

The proof of Theorem 9.1 is based on the following notion of distance operator:

Definition 9.1

(Distance operator [91, Section 2], [92, Definition 15]). Let $\mathcal {V}$ be a subspace of $\mathcal {H}_\Lambda $. For any $k=0,\,\ldots ,\,\left| \Lambda \right| $, we define the fattening $\mathcal {V}_k$ of $\mathcal {V}$ of radius k as the span of the linear operators acting on at most k sites applied to a vector in $\mathcal {V}$:

$$\begin{aligned} \mathcal {V}_k = \textrm{span}\left\{ O|\psi \rangle :|\psi \rangle \in \mathcal {V},\;O\in \mathfrak {U}_X:X\subseteq \Lambda ,\;|X|\le k\right\} , \end{aligned}$$

(9.5)

such that

$$\begin{aligned} \mathcal {V} = \mathcal {V}_0 \subseteq \ldots \subseteq \mathcal {V}_{\left| \Lambda \right| } = \mathcal {H}_\Lambda . \end{aligned}$$

(9.6)

We define the distance operator of $\mathcal {V}$ as the linear operator $H_\mathcal {V}\in \mathcal {O}_\Lambda $ that has eigenvalue k on $\mathcal {V}_k\cap \mathcal {V}_{k-1}^\perp $ for each $k=0,\,\ldots ,\,\left| \Lambda \right| $.

The following Proposition 9.1 provides the link between the distance operator and the $W_1$ distance:

Proposition 9.1

Let $\mathcal {V}$ be a subspace of $\mathcal {H}_\Lambda $, and let $\rho ,\,\sigma \in \mathcal {S}_\Lambda $ such that the support of $\sigma $ is contained in $\mathcal {V}$. Then,

$$\begin{aligned} \left\| \rho - \sigma \right\| _{W_1} \ge \textrm{Tr}_\Lambda \left[ \rho \,H_\mathcal {V}\right] . \end{aligned}$$

(9.7)

Proof

Since $\textrm{Tr}_\Lambda \left[ \sigma \,H_\mathcal {V}\right] =0$, it is sufficient to prove that $\left\| H_\mathcal {V}\right\| _L \le 1$. For any $k=0,\,\ldots ,\,\left| \Lambda \right| $, let $\Pi _k$ be the orthogonal projector onto $\mathcal {V}_k$, such that

$$\begin{aligned} H_\mathcal {V} = \sum _{k=0}^{\left| \Lambda \right| }\left( \mathbb {I} - \Pi _k\right) . \end{aligned}$$

(9.8)

For any $x\in \Lambda $, let

$$\begin{aligned} \mathcal {V}_{k,x} = \textrm{span}\left\{ O|\psi \rangle :|\psi \rangle \in \mathcal {V},\;O\in \mathfrak {U}_X:X\subseteq \Lambda ,\;|X|\le k,\;x\in X\right\} , \end{aligned}$$

(9.9)

and let $\Pi _{k,x}$ be the orthogonal projector onto $\mathcal {V}_{k,x}$. We have $\mathcal {V}_{k-1} \subseteq \mathcal {V}_{k,x} \subseteq \mathcal {V}_k$, therefore

$$\begin{aligned} 0 \le \Pi _k - \Pi _{k,x} \le \Pi _k - \Pi _{k-1}. \end{aligned}$$

(9.10)

The subspace $\mathcal {V}_{k,x}$ is invariant with respect to the action of any unitary operator $U\in \mathfrak {U}_x$. Then, $\Pi _{k,x}$ commutes with any such U, and therefore $\Pi _{k,x}\in \mathcal {O}_{\Lambda \setminus x}$. Then,

$$\begin{aligned} \partial _xH_\mathcal {V}&= \partial _x\sum _{k=0}^{\left| \Lambda \right| } \left( \mathbb {I} - \Pi _k\right) = \partial _x\sum _{k=0}^{\left| \Lambda \right| } \Pi _k = \partial _x\sum _{k=0}^{\left| \Lambda \right| }\left( \Pi _k - \Pi _{k,x}\right) \nonumber \\&\overset{(\textrm{a})}{\le } \left\| \sum _{k=0}^{\left| \Lambda \right| } \left( \Pi _k - \Pi _{k,x}\right) \right\| _\infty \left\| \sum _{k=0}^{\left| \Lambda \right| } \left( \Pi _k - \Pi _{k-1}\right) \right\| _\infty \overset{(\textrm{b})}{=} 1\,, \end{aligned}$$

(9.11)

where (a) follows from Lemma C.2 and (b) follows observing that $\Pi _k - \Pi _{k-1}$ is the orthogonal projector onto $\mathcal {V}_k\cap \mathcal {V}_{k-1}^\perp $. The claim follows. $\square $

We first prove Theorem 9.1 when $\sigma $ is proportional to an orthogonal projector:

Proposition 9.2

Let $\mathcal {V}$ be a subspace of $\mathcal {H}_\Lambda $, let $\Pi $ be the associated orthogonal projector and let

$$\begin{aligned} \sigma = \frac{\Pi }{\dim \mathcal {V}}. \end{aligned}$$

(9.12)

Then, for any $\rho \in \mathcal {S}_\Lambda $ we have

$$\begin{aligned} S(\rho ) - \ln \dim \mathcal {V} \le \left| \Lambda \right| h_2\left( \frac{\left\| \rho -\sigma \right\| _{W_1}}{\left| \Lambda \right| }\right) + \left\| \rho - \sigma \right\| _{W_1}\ln \left( q^2-1\right) . \end{aligned}$$

(9.13)

Proof

For any $0\le t \le 1$, let

$$\begin{aligned} \phi (t) = h_2(t) + t\ln \left( q^2-1\right) , \end{aligned}$$

(9.14)

and let

$$\begin{aligned} \left\| \rho - \sigma \right\| _{W_1} = \left| \Lambda \right| w. \end{aligned}$$

(9.15)

The claim becomes

$$\begin{aligned} S(\rho ) \le \ln \dim \mathcal {V} + \left| \Lambda \right| \phi (w). \end{aligned}$$

(9.16)

$\phi $ is increasing in $\left[ 0,1-\frac{1}{q^2}\right] $ and decreasing in $\left[ 1-\frac{1}{q^2},1\right] $ with

$$\begin{aligned} \phi (0) = 0,\qquad \phi \left( 1-\tfrac{1}{q^2}\right) = \ln q^2,\qquad \phi (1) = \ln \left( q^2-1\right) . \end{aligned}$$

(9.17)

Let $0<w^*<1-\frac{1}{q^2}$ satisfy

$$\begin{aligned} \phi (w^*) = \ln q. \end{aligned}$$

(9.18)

If $w\ge w^*$, the claim is trivial. Indeed, if $w^*\le w \le 1-\frac{1}{q^2}$ we have

$$\begin{aligned} \phi (w) \ge \phi (w^*) = \ln q, \end{aligned}$$

(9.19)

while if $1-\frac{1}{q^2}<w\le 1$ we have

$$\begin{aligned} \phi (w) \ge \phi (1) = \ln \left( q^2-1\right) \ge \ln q. \end{aligned}$$

(9.20)

We can then assume $w<w^*$.

Let $H_{\mathcal {V}}$ be the distance operator of $\mathcal {V}$. For any $k=0,\,\ldots ,\,\left| \Lambda \right| $, let $\mathcal {W}_k$ be the eigenspace of $H_{\mathcal {V}}$ with eigenvalue k, let $P_k$ be the orthogonal projector onto $\mathcal {W}_k$, and let $p_k = \textrm{Tr}_\Lambda \left[ \rho \,P_k\right] $ be the probability that a measurement of $H_\mathcal {V}$ on $\rho $ has outcome k. Let

$$\begin{aligned} \tilde{\rho } = \sum _{k=0}^{\left| \Lambda \right| }p_k\,\frac{P_k}{\dim \mathcal {W}_k}. \end{aligned}$$

(9.21)

We have

$$\begin{aligned} 0 \le S(\rho \Vert \tilde{\rho }) = S(\tilde{\rho }) - S(\rho ). \end{aligned}$$

(9.22)

For any $x\in \left\{ 0,\ldots ,q^2-1\right\} ^{\Lambda }$, let H(x) be the number of components of x that are different from 0, and for any $k=0,\,\ldots ,\,\left| \Lambda \right| $, let

$$\begin{aligned} D_k = \left| H^{-1}(k)\right| = \left| \left\{ x\in \left\{ 0,\ldots ,q^2-1\right\} ^{\Lambda }:H(x)=k\right\} \right| . \end{aligned}$$

(9.23)

Let X be a random variable with values in $\left\{ 0,\ldots ,q^2-1\right\} ^{\Lambda }$ distributed as follows. Let the probability distribution of H(X) be p, and for any $k=0,\,\ldots ,\,\left| \Lambda \right| $, let the probability distribution of X conditioned on $H(X)=k$ be uniform, such that the probability of $x\in \left\{ 0,\ldots ,q^2-1\right\} ^{\Lambda }$ is

$$\begin{aligned} \mathbb {P}(X=x) = \frac{p_{H(x)}}{D_{H(x)}}. \end{aligned}$$

(9.24)

Since H(X) has the same probability distribution as $H_\mathcal {V}$ measured on $\rho $, we have

$$\begin{aligned} \mathbb {E}\,H(X) = \textrm{Tr}_\Lambda \left[ \rho \,H_{\mathcal {V}}\right] =: \left| \Lambda \right| u. \end{aligned}$$

(9.25)

By the maximum entropy principle, the Shannon entropy of X is upper bounded by the Shannon entropy of the Gibbs distribution of H with average energy $\left| \Lambda \right| u$:

$$\begin{aligned} S(X) \le \left| \Lambda \right| \phi (u). \end{aligned}$$

(9.26)

We then have

$$\begin{aligned} S(\rho )&\overset{\mathrm {(a)}}{\le } S(\tilde{\rho }) = \sum _{k=0}^{\left| \Lambda \right| } p_k\ln \frac{\dim \mathcal {W}_k}{p_k} \overset{\mathrm {(b)}}{\le } \sum _{k=0}^{\left| \Lambda \right| } p_k\ln \frac{D_k\dim \mathcal {V}}{p_k} = \ln \dim \mathcal {V} + S(X)\nonumber \\&\overset{\mathrm {(c)}}{\le } \ln \dim \mathcal {V} + \left| \Lambda \right| \phi (u)\,, \end{aligned}$$

(9.27)

where (a) follows from (9.22), (b) from Lemma C.3 and (c) from (9.26). We have from Proposition 9.1

$$\begin{aligned} w \ge \frac{\textrm{Tr}_\Lambda \left[ \rho \,H_{\mathcal {V}}\right] }{\left| \Lambda \right| } = u, \end{aligned}$$

(9.28)

hence

$$\begin{aligned} \phi (u) \le \phi (w). \end{aligned}$$

(9.29)

The claim follows. $\square $

Without loss of generality, we can assume $S(\rho )\ge S(\sigma )$. For any $k\in \mathbb {N}$ and any $\delta >0$, let $P_{k,\delta }$ be the $\delta $-typical projector of $\sigma ^{\otimes k}$, i.e., the orthogonal projector on the sum of the eigenspaces of $\sigma ^{\otimes k}$ with eigenvalues contained in $\left[ e^{-k\left( S(\sigma )+\delta \right) },e^{-k\left( S(\sigma )-\delta \right) }\right] $. $P_{k,\delta }$ satisfies [79,80,81, Section 5.5]

$$\begin{aligned}{} & {} \sigma ^{\otimes k} \ge e^{-k\left( S(\sigma )+\delta \right) }\,P_{k,\delta }, \end{aligned}$$

(9.30a)

$$\begin{aligned}{} & {} \liminf _{k\rightarrow \infty }\frac{\ln \textrm{Tr}_\Lambda P_{k,\delta }}{k}\ge S(\sigma ) - \delta . \end{aligned}$$

(9.30b)

The property (9.30a) implies

$$\begin{aligned}{} & {} \frac{\ln \textrm{Tr}_\Lambda P_{k,\delta }}{k} \le S(\sigma ) + \delta , \end{aligned}$$

(9.31a)

$$\begin{aligned}{} & {} \frac{1}{k}\,S\left( \left. \frac{P_{k,\delta }}{\textrm{Tr}_\Lambda P_{k,\delta }}\right\| \sigma ^{\otimes k}\right) \le S(\sigma ) + \delta - \frac{\ln \textrm{Tr}_\Lambda P_{k,\delta }}{k}. \end{aligned}$$

(9.31b)

We have from (9.31a) and Proposition 9.2

$$\begin{aligned} S(\rho ) - S(\sigma ) \le \frac{S\left( \rho ^{\otimes k}\right) - \ln \textrm{Tr}_\Lambda P_{k,\delta }}{k} + \delta \le \left| \Lambda \right| \phi \left( \frac{\left\| \rho ^{\otimes k} - \frac{P_{k,\delta }}{\textrm{Tr}_\Lambda P_{k,\delta }}\right\| _{W_1}}{k\left| \Lambda \right| }\right) + \delta . \nonumber \\ \end{aligned}$$

(9.32)

We have

$$\begin{aligned} \frac{\left\| \rho ^{\otimes k} - \frac{P_{k,\delta }}{\textrm{Tr}_\Lambda P_{k,\delta }}\right\| _{W_1}}{k\left| \Lambda \right| }&\le \frac{\left\| \rho ^{\otimes k} - \sigma ^{\otimes k}\right\| _{W_1}}{k\left| \Lambda \right| } + \frac{\left\| \sigma ^{\otimes k} - \frac{P_{k,\delta }}{\textrm{Tr}_\Lambda P_{k,\delta }}\right\| _{W_1}}{k\left| \Lambda \right| } \nonumber \\&\overset{\mathrm {(a)}}{\le } w + \sqrt{\frac{2}{k}\,S\left( \left. \frac{P_{k,\delta }}{\textrm{Tr}_\Lambda P_{k,\delta }}\right\| \sigma ^{\otimes k}\right) }\nonumber \\&\overset{\mathrm {(b)}}{\le } w + \sqrt{2\left( S(\sigma ) + \delta - \frac{\ln \textrm{Tr}_\Lambda P_{k,\delta }}{k}\right) }\,, \end{aligned}$$

(9.33)

where (a) follows from Proposition A.3 and Proposition C.1, and (b) follows from (9.31b). We get from (9.30b)

$$\begin{aligned} \limsup _{k\rightarrow \infty }\frac{\left\| \rho ^{\otimes k} - \frac{P_{k,\delta }}{\textrm{Tr}_\Lambda P_{k,\delta }}\right\| _{W_1}}{k\left| \Lambda \right| } \le w + 2\sqrt{\delta }. \end{aligned}$$

(9.34)

We then get from (9.32)

$$\begin{aligned} S(\rho ) - S(\sigma ) \le \left| \Lambda \right| \phi \left( w + 2\sqrt{\delta }\right) + \delta , \end{aligned}$$

(9.35)

and the claim follows taking the limit $\delta \rightarrow 0$.

10 $w_1$ Continuity of the Specific Entropy

A fundamental consequence of Theorem 9.1 is the following continuity bound for the specific entropy in terms of the specific quantum $W_1$ distance:

Corollary 10.1

($w_1$ continuity of the specific entropy). The specific entropy satisfies the following continuity bound with respect to the specific quantum $W_1$ distance: For any $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}^I$, we have

$$\begin{aligned} \left| s(\rho ) - s(\sigma )\right| \le h_2\left( w_1(\rho ,\sigma )\right) + w_1(\rho ,\sigma )\ln \left( q^2-1\right) , \end{aligned}$$

(10.1)

where

$$\begin{aligned} h_2(t) = -t\ln t - \left( 1-t\right) \ln \left( 1-t\right) ,\qquad 0\le t\le 1 \end{aligned}$$

(10.2)

is the binary entropy function.

Proof

The claim follows from Theorem 9.1: We have

$$\begin{aligned} \left| s(\rho ) - s(\sigma )\right|&= \lim _{a\rightarrow \infty }\frac{\left| S\left( \rho _{\Lambda _a}\right) - S\left( \sigma _{\Lambda _a}\right) \right| }{\left| \Lambda _a\right| }\nonumber \\&\le \lim _{a\rightarrow \infty }\left( h_2\left( \frac{\left\| \rho _{\Lambda _a}-\sigma _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| }\right) + \frac{\left\| \rho _{\Lambda _a} - \sigma _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| }\ln \left( q^2-1\right) \right) \nonumber \\&= h_2\left( w_1(\rho ,\sigma )\right) + w_1(\rho ,\sigma )\ln \left( q^2-1\right) \,. \end{aligned}$$

(10.3)

$\square $

11 $w_1$-Gibbs States

We define the $w_1$-Gibbs states of the interaction $\Phi $ as the translation-invariant states whose marginal states have a $W_1$ distance from the local Gibbs states of $\Phi $ that scales sublinearly with the volume:

Definition 11.1

($w_1$-Gibbs state). Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$. We define for any $\rho \in \mathcal {S}^I_{\mathbb {Z}^d}$ the specific quantum $W_1$ distance between $\rho $ and $\Phi $ as the limit of the $W_1$ distance per site between the marginals of $\rho $ and the local Gibbs states of $\Phi $:

$$\begin{aligned} w_1(\rho ,\Phi ) = \limsup _{a\rightarrow \infty }\frac{\left\| \rho _{\Lambda _a} - \omega ^\Phi _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| }. \end{aligned}$$

(11.1)

We say that the state $\omega \in \mathcal {S}^I_{\mathbb {Z}^d}$ is a $w_1$-Gibbs state of $\Phi $ if $w_1(\omega ,\Phi ) = 0$.

We also define for any $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ and any $\rho \in \mathcal {S}^I_{\mathbb {Z}^d}$ the specific relative entropy between $\rho $ and $\Phi $ as the limit of the relative entropy per site between the marginals of $\rho $ and the local Gibbs states of $\Phi $:

$$\begin{aligned} s(\rho \Vert \Phi ) = \lim _{a\rightarrow \infty }\frac{S\left( \rho _{\Lambda _a}\left\| \omega ^\Phi _{\Lambda _a}\right. \right) }{\left| \Lambda _a\right| } = P(\Phi ) - s(\rho ) + \rho (E_\Phi ). \end{aligned}$$

(11.2)

We have $s(\rho \Vert \Phi )\ge 0$, with equality iff $\rho \in \mathcal {S}_{eq}(\Phi )$.

An interaction can have at most one $w_1$-Gibbs state:

Proposition 11.1

(Uniqueness of the $w_1$-Gibbs state). Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ have a $w_1$-Gibbs state $\omega \in \mathcal {S}^I_{\mathbb {Z}^d}$. Then, for any $\rho \in \mathcal {S}^I_{\mathbb {Z}^d}$ we have

$$\begin{aligned} w_1(\rho ,\Phi ) = w_1(\rho ,\omega ). \end{aligned}$$

(11.3)

In particular, $\Phi $ can have at most one $w_1$-Gibbs state.

Proof

We have

$$\begin{aligned} \left| w_1(\rho ,\Phi ) - w_1(\rho ,\omega )\right|&= \left| \limsup _{a\rightarrow \infty }\frac{\left\| \rho _{\Lambda _a} - \omega ^\Phi _{\Lambda _a}\right\| _{W_1} - \left\| \rho _{\Lambda _a} - \omega _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| }\right| \nonumber \\&\le \limsup _{a\rightarrow \infty }\frac{\left\| \omega _{\Lambda _a} - \omega ^\Phi _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| } = w_1(\omega ,\Phi ) = 0\,. \end{aligned}$$

(11.4)

If also $\rho $ is a $w_1$-Gibbs state of $\Phi $, we have

$$\begin{aligned} w_1(\rho ,\omega ) = w_1(\rho ,\Phi ) = 0, \end{aligned}$$

(11.5)

hence $\rho = \omega $. The claim follows. $\square $

If an interaction admits a $w_1$-Gibbs state, then such state is also an equilibrium state:

Proposition 11.2

We have for any $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ and any $\rho \in \mathcal {S}^I_{\mathbb {Z}^d}$

$$\begin{aligned} s(\rho \Vert \Phi ) \le h_2(w_1(\rho ,\Phi )) + w_1(\rho ,\Phi )\left( \ln \left( q^2-1\right) + 2\left\| \Phi \right\| _r\right) . \end{aligned}$$

(11.6)

In particular, if $\Phi $ has a $w_1$-Gibbs state $\omega \in \mathcal {S}^I_{\mathbb {Z}^d}$, then $\omega \in \mathcal {S}_{eq}(\Phi )$.

Proof

We have

$$\begin{aligned} s(\rho \Vert \Phi )&= \lim _{a\rightarrow \infty }\frac{S\left( \rho _{\Lambda _a}\left\| \omega ^\Phi _{\Lambda _a}\right. \right) }{\left| \Lambda _a\right| } = \lim _{a\rightarrow \infty }\frac{S\left( \omega ^\Phi _{\Lambda _a}\right) - S(\rho _{\Lambda _a})+\textrm{Tr}_{\Lambda _a}\left[ \left( \rho _{\Lambda _a} - \omega ^\Phi _{\Lambda _a}\right) H^\Phi _{\Lambda _a}\right] }{\left| \Lambda _a\right| }\nonumber \\&\overset{\mathrm {(a)}}{\le } \lim _{a\rightarrow \infty }\left( h_2\left( \frac{\left\| \rho _{\Lambda _a} - \omega ^\Phi _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| }\right) + \frac{\left\| \rho _{\Lambda _a} - \omega ^\Phi _{\Lambda _a}\right\| _{W_1}}{\left| \Lambda _a\right| }\left( \ln \left( q^2-1\right) + 2\left\| \Phi \right\| _r\right) \right) \nonumber \\&= h_2(w_1(\rho ,\Phi )) + w_1(\rho ,\Phi )\left( \ln \left( q^2-1\right) + 2\left\| \Phi \right\| _r\right) \,, \end{aligned}$$

(11.7)

where (a) follows from Theorem 9.1 and (C.16). The claim follows. $\square $

12 Quantum Transportation-Cost Inequalities

Definition 12.1

(TCI). The interaction $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ satisfies a Transportation-Cost Inequality (TCI) with constant $c>0$ if the square of the specific quantum $W_1$ distance with respect to $\Phi $ is upper bounded by $\frac{c}{2}$ times the specific relative entropy with respect to $\Phi $, i.e., if for any $\rho \in \mathcal {S}^I_{\mathbb {Z}^d}$ we have

A fundamental consequence of (TCI) is the uniqueness of the equilibrium state of $\Phi $:

Proposition 12.1

(Uniqueness of the equilibrium state). Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ satisfy (TCI). Then, $\Phi $ has a unique equilibrium state, which is a $w_1$-Gibbs state.

Proof

Let $\omega \in \mathcal {S}_{eq}(\Phi )$. From (TCI), we have

$$\begin{aligned} w_1(\omega ,\Phi )^2 \le \frac{c}{2}\,s(\omega \Vert \Phi ) = 0, \end{aligned}$$

(12.1)

therefore $\omega $ is a $w_1$-Gibbs state of $\Phi $. Since the $w_1$-Gibbs state is unique, the equilibrium state is unique, too. $\square $

Another property of the interactions satisfying (TCI) is the following upper bound to the variation of the specific entropy in terms of the specific relative entropy:

Proposition 12.2

Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ satisfy (TCI) and let $\omega \in \mathcal {S}^I_{\mathbb {Z}^d}$ be its unique equilibrium state. Let $w^*$ be as in (9.18). Then, for any $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$ such that

$$\begin{aligned} s(\rho \Vert \Phi ) \le \frac{2\,{w^*}^2}{c} \end{aligned}$$

(12.2)

we have

$$\begin{aligned} \left| s(\rho ) - s(\omega )\right| \le h_2\left( \sqrt{\frac{c}{2}\,s(\rho \Vert \Phi )}\right) + \sqrt{\frac{c}{2}\,s(\rho \Vert \Phi )}\ln \left( q^2-1\right) . \end{aligned}$$

(12.3)

Proof

From Theorem 12.1, we have that $\omega $ is a $w_1$-Gibbs state of $\Phi $. We then have

$$\begin{aligned} \left| s(\rho ) - s(\omega )\right|&\overset{\mathrm {(a)}}{\le } h_2(w_1(\rho ,\omega )) + w_1(\rho ,\omega )\ln \left( q^2-1\right) \nonumber \\&\overset{\mathrm {(b)}}{\le } h_2\left( \sqrt{\frac{c\,s(\rho \Vert \Phi )}{2}}\right) + \sqrt{\frac{c\,s(\rho \Vert \Phi )}{2}}\ln \left( q^2-1\right) \,, \end{aligned}$$

(12.4)

where (a) follows from Theorem 10.1 and (b) from Proposition 11.1 and (TCI). The claim follows. $\square $

In the following, we will prove that (TCI) is satisfied by interactions containing only single-site terms (Sect. 12.1) and local commuting interactions at high temperature (Sect. 12.2).

12.1 Product States

The simplest setting where (TCI) holds is when the interaction contains only terms acting on single spins and the associated Gibbs state is a product state. Ref. [17] proved the following TCI for product states on finite lattices:

Theorem 12.1

(Quantum Marton’s Transportation Inequality [17, Theorem 2]). Let $\Lambda $ be a finite set and let $\sigma \in \mathcal {S}_\Lambda $ be a product state. Then, for any $\rho \in \mathcal {S}_\Lambda $ we have

$$\begin{aligned} \left\| \rho - \sigma \right\| _{W_1}^2 \le \frac{\left| \Lambda \right| }{2}\,S(\rho \Vert \sigma ). \end{aligned}$$

(12.5)

Theorem 12.1 implies the following TCI for product states on $\mathbb {Z}^d$:

Corollary 12.1

(TCI for product states). Let $\omega \in \mathcal {S}_{\mathbb {Z}^d}^I$ be a product state. Then, for any $\rho \in \mathcal {S}_{\mathbb {Z}^d}^I$ we have

$$\begin{aligned} w_1(\rho ,\omega )^2 \le \frac{1}{2}\,s(\rho \Vert \omega ). \end{aligned}$$

(12.6)

Therefore, any $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ that contains only terms acting on single spins (i.e., such that $\Phi (\Lambda )=0$ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ with $|\Lambda |\ge 2$) satisfies (TCI) with $c=1$.

Proof

The claim (12.6) follows from Theorem 12.1: We have

$$\begin{aligned} w_1(\rho ,\omega )^2 = \lim _{a\rightarrow \infty }\frac{\left\| \rho _{\Lambda _a} - \omega _{\Lambda _a}\right\| _{W_1}^2}{\left| \Lambda _a\right| ^2} \le \lim _{a\rightarrow \infty }\frac{S(\rho _{\Lambda _a}\Vert \omega _{\Lambda _a})}{2\left| \Lambda _a\right| } = \frac{s(\rho \Vert \omega )}{2}. \end{aligned}$$

(12.7)

Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ contain only terms acting on single sites. We have for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$

$$\begin{aligned} \omega ^\Phi _{\Lambda } = \bigotimes _{x\in \Lambda }\frac{e^{-\Phi (x)}}{\textrm{Tr}_xe^{-\Phi (x)}} = \bigotimes _{x\in \Lambda }\omega ^\Phi _x, \end{aligned}$$

(12.8)

therefore there exists a product state $\omega \in \mathcal {S}^I_{\mathbb {Z}^d}$ such that $\omega _\Lambda = \omega ^\Phi _\Lambda $ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$. We have $w_1(\rho ,\omega ) = w_1(\rho ,\Phi )$ and $s(\rho \Vert \omega ) = s(\rho \Vert \Phi )$, therefore (12.6) implies (TCI) with $c=1$. The claim follows. $\square $

12.2 Local Commuting Interactions at High Temperature

A more general setting where (TCI) can be proved is the case of geometrically local commuting interactions, where each spin interacts with a finite number of spins and all the terms of the interaction mutually commute.

Ref. [19] proved the following TCI for Gibbs states of local commuting interactions on a finite lattice employing a quantum generalization of Ollivier’s coarse Ricci curvature [93, 94]:

Theorem 12.2

(High-temperature TCI for local commuting interactions [19, Theorem 4 and Proposition 9]). Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ be geometrically local and commuting, i.e., each spin interacts with at most N spins where

$$\begin{aligned} N = \left| \bigcup _{0\in X\in \mathcal {F}_{\mathbb {Z}^d}:\,\Phi (X)\ne 0} X\right| < \infty , \end{aligned}$$

(12.9)

and $\left[ \Phi (X),\,\Phi (Y)\right] =0$ for any $X,\,Y\in \mathcal {F}_{\mathbb {Z}^d}$. Let

$$\begin{aligned} M= & {} \inf _{t\ge 0}\left( \left( e^{\left\| \Phi \right\| _r}+1\right) \sqrt{1+t^2}\left\| \Phi \right\| _r q^{\frac{3+\sqrt{1+t^2}}{2}} e^{\left\| \Phi \right\| _r\left( 2+\frac{\sqrt{1+t^2}}{2}\right) }\right. \nonumber \\{} & {} \left. + 2\left\| \Phi \right\| _r e^{2\left\| \Phi \right\| _r} + 4\,e^{-\pi t}\right) , \end{aligned}$$

(12.10)

and let us assume that

$$\begin{aligned} \kappa = 1 - \left( 2N-1\right) \left( N-1\right) M > 0. \end{aligned}$$

(12.11)

Then, for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ and any $\rho \in \mathcal {S}_{\Lambda }$ we have

$$\begin{aligned} \left\| \rho - \omega ^\Phi _{\Lambda }\right\| _{W_1}^2 \le \frac{2\,N^2\left| \Lambda \right| }{\left( 1-e^{-\kappa }\right) ^2} \, S\left( \rho \left\| \omega ^\Phi _{\Lambda }\right. \right) . \end{aligned}$$

(12.12)

In particular, $\Phi $ satisfies (TCI) with

$$\begin{aligned} c = \frac{4\,N^2}{\left( 1-e^{-\kappa }\right) ^2}. \end{aligned}$$

(12.13)

Remark 12.1

Choosing in (12.10)

$$\begin{aligned} t = \frac{\ln \frac{1}{\left\| \Phi \right\| _r}}{\pi + \frac{\ln q}{2}}, \end{aligned}$$

(12.14)

we get $M \le O\left( \left\| \Phi \right\| _r\right) $ for $\left\| \Phi \right\| _r\rightarrow 0$.

Another strategy to prove TCIs for quantum spin systems on a finite lattice is to prove that suitable local quantum Markov semigroups that have the Gibbs state as unique fixed point satisfy a modified logarithmic Sobolev inequality [19, 95,96,97], which states that the semigroup contracts exponentially the relative entropy with respect to the Gibbs state. Ref. [97] proved that above a critical temperature, the Gibbs states of commuting nearest-neighbor interactions satisfy a modified logarithmic Sobolev inequality. Exploiting this result, Ref. [19] proved the following TCI for such Gibbs states:

Theorem 12.3

(High-temperature TCI for nearest-neighbor interactions [19, Theorem 5]). Let $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ be a nearest-neighbor interaction, i.e., $\Phi (\Lambda )=0$ for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ that contains at least two sites that are not neighboring. Then, there exists a critical inverse temperature $\beta _c>0$ such that for any $0\le \beta <\beta _c$ there exists $c_\beta >0$ such that for any $a\in \mathbb {N}_+^d$ we have

$$\begin{aligned} \left\| \rho _{\Lambda _a} - \omega ^{\beta \Phi }_{\Lambda _a}\right\| _{W_1}^2 \le \frac{c_\beta \left| \Lambda _a\right| }{2} \, S\left( \rho _{\Lambda _a}\left\| \omega ^{\beta \Phi }_{\Lambda _a}\right. \right) . \end{aligned}$$

(12.15)

In particular, $\beta \,\Phi $ satisfies (TCI) with $c=c_\beta $.

Corollary 12.2

All the interactions satisfying the hypotheses either of Theorem 12.2 or of Theorem 12.3 have a unique equilibrium state.

Remark 12.2

The uniqueness of the equilibrium states for all the interactions $\Phi \in \mathcal {B}^r_{\mathbb {Z}^d}$ such that $r>\log q$ and $\left\| \Phi \right\| _r < \frac{1}{2q}$ has been proved in [98].

13 Perspectives

In this paper, we have proposed a specific Wasserstein distance of order 1 for quantum spin systems on infinite lattices. We expect the proposed distance to be a powerful tool in the study of the statistical mechanics of quantum spin systems, quantum dynamical systems, and tomography of quantum states:

1.
The specific quantum $W_1$ distance can be employed to study the diameter of the set of the equilibrium states of an interaction close to a thermal phase transition. Above the critical temperature, the equilibrium state is unique and the diameter is zero, while below the critical temperature the diameter is strictly positive. The limit of the diameter of the set of the equilibrium states as the temperature tends from below to the critical value, and in particular whether such limit is zero or strictly positive, can be employed to characterize the phase transition.
2.
In [7], Ornstein proposed the $\bar{d}$-distance as a natural metric for the classification of stochastic processes and singled out a large class (the so-called B-processes), containing, e.g., all mixing Markov processes, such that a fundamental isomorphism theorem holds: two processes are isomorphic if and only if their entropies coincide. The isomorphism here is in the sense of dynamical systems, i.e., a measurable and invertible transformation mapping one probability measure to the other. The specific quantum $W_1$ distance could provide a useful analytical tool toward establishing analogous results in the setting of quantum dynamical systems [99].
3.
The statistical problem of estimating a stationary ergodic process, in a given family, from the observation of a single sample path of length n can be quantitatively addressed using Ornstein’s $\bar{d}$-distance. In [8], it is proved that an empirical block scheme, i.e., the product probability naturally obtained from the observed frequencies on sliding window of length k, converges in the $\bar{d}$-distance, as n grows, toward the target process, provided that it is a B-process and k grows at least logarithmically with respect to n. Exploring the quantum analogue of this and related results, e.g., for discrimination between two sampled processes [9], may extend the scope of the recent works [23, 24] on tomography of a quantum state and stimulate novel approaches, particularly when the number of accessible independent copies is extremely constrained.

References

Monge, G.: Mémoire sur la théorie des déblais et des remblais. Histoire de l’Académie Royale des Sciences de Paris (1781)
Kantorovich, L.V.: On the translocation of masses. In Dokl. Akad. Nauk. USSR (NS) 37, 199–201 (1942)
MathSciNet MATH Google Scholar
Vershik, A.M.: Long history of the Monge–Kantorovich transportation problem. Math. Intell. 35(4), 1–9 (2013)
MathSciNet MATH Google Scholar
Villani, C.: Optimal Transport: old and new, volume 338. Springer Science & Business Media (2008)
Ambrosio, L., Gigli, N., Savaré, G.: Gradient Flows: in Metric Spaces and in the Space of Probability Measures. Springer Science & Business Media (2008)
Peyré, G., Cuturi, M.: Computational optimal transport: with applications to data science. Found. Trends® Mach. Learn. 11(5–6), 355–607 (2019)
MATH Google Scholar
Ornstein, D.S.: An application of ergodic theory to probability theory. Ann. Probab. 1(1), 43–58 (1973)
MathSciNet MATH Google Scholar
Ornstein, D.S., Weiss, B.: How sampling reveals a process. Ann. Probab. 18(3), 905–930 (1990)
MathSciNet MATH Google Scholar
Ornstein, D.S., Shields, P.C.: The d-recognition of processes. Adv. Math. 104(2), 182–224 (1994)
MathSciNet MATH Google Scholar
Fernandez, R., Galves, A.: Markov approximations of chains of infinite order. Bull. Braz. Math. Soc. 33(3), 295–306 (2002)
MathSciNet MATH Google Scholar
Csiszár, I., Talata, Z.: On rate of convergence of statistical estimation of stationary ergodic processes. IEEE Trans. Inf. Theory 56(8), 3637–3641 (2010)
MathSciNet MATH Google Scholar
O’Connor, K., McGoff, K., Nobel, A.B: Estimation of stationary optimal transport plans (2021). arXiv preprint arXiv:2107.11858
Gray, R.M., Neuhoff, D.L., Shields, P.C.: A generalization of Ornstein’s d distance with applications to information theory. Ann. Probab. 315–328 (1975)
Gray, R.M.: Entropy and Information Theory. Springer Science & Business Media (2011)
Choi, Y., Won, J.-H.: Ornstein auto-encoders. In IJCAI, pp. 2172–2178 (2019)
Choi, Y., Lee, S., Won, J.-H.: Learning from nested data with ornstein auto-encoders. In: International Conference on Machine Learning, pp. 1943–1952. PMLR (2021)
De Palma, G., Marvian, M., Trevisan, D., Lloyd, S.: The quantum wasserstein distance of order 1. IEEE Trans. Inf. Theory 67(10), 6627–6643 (2021)
MathSciNet MATH Google Scholar
Hamazaki, R.: Speed limits for macroscopic transitions. PRX Quantum 3, 020319 (2022)
ADS Google Scholar
De Palma, G., Rouzé, C.: Quantum concentration inequalities. Ann. Henri Poincaré 23, 3391–3429 (2022)
ADS MathSciNet MATH Google Scholar
De Palma, G., Marvian, M., Rouzé, C., França, D.S.: Limitations of variational quantum algorithms: a quantum optimal transport approach. PRX Quantum 4, 010309 (2023)
Google Scholar
Chou, C.-N., Love, P.J., Sandhu, J.S., Shi, J.: Limitations of local quantum algorithms on random MAX-k-XOR and Beyond. In: Bojańczyk, M., Merelli, E., Woodruff, D.P. (eds), 49th International Colloquium on Automata, Languages, and Programming (ICALP 2022), vol. 229 of Leibniz International Proceedings in Informatics (LIPIcs), pp. 41:1–41:20, Dagstuhl, Germany (2022). Schloss Dagstuhl – Leibniz-Zentrum für Informatik
Li, L., Bu, K., Koh, D.E., Jaffe, A., Lloyd, S.: Wasserstein Complexity of Quantum Circuits (2022). arXiv:2208.06306
Rouzé, C., França, D.S.: Learning quantum many-body systems from a few copies (2021). arXiv:2107.03333
Maciejewski, F.B., Puchała, Z., Oszmaniec, M.: Exploring quantum average-case distances: proofs, properties, and examples. IEEE Trans. Inf. Theory 1–1 (2023)
Kiani, B.T., De Palma, G., Marvian, M., Liu, Z.-W., Lloyd, S.: Learning quantum data with the quantum earth mover’s distance. Quantum Sci. Technol. 7(4), 045002 (2022)
ADS Google Scholar
Herr, D., Obert, B., Rosenkranz, M.: Anomaly detection with variational quantum generative adversarial networks. Quantum Sci. Technol. 6(4), 045004 (2021)
ADS Google Scholar
Anschuetz, E.R., Kiani, B.T.: Quantum variational algorithms are swamped with traps. Nat. Commun. 13(1), 7760 (2022)
ADS Google Scholar
Coyle, B.: Machine learning applications for noisy intermediate-scale quantum computers. Ph.D. thesis, University of Edinburgh (2022)
Hirche, C., Rouzé, C., França, D.S.: Quantum differential privacy: an information theory perspective (2022). arXiv:2202.10717
Angrisani, A., Doosti, M., Kashefi, E.: Differential privacy amplification in quantum and quantum-inspired algorithms (2022). arXiv:2203.03604
Duvenhage, R., Mapaya, M.: Quantum Wasserstein distance of order 1 between channels (2022). arXiv:2210.03483
Bratteli, O., Robinson, D.W.: Operator Algebras and Quantum Statistical Mechanics I: C*- and W*-Algebras. Symmetry Groups. Decomposition of States. Theoretical and Mathematical Physics. Springer, Berlin (2013)
Google Scholar
Bratteli, O., Robinson, D.W.: Operator Algebras and Quantum Statistical Mechanics II: Equilibrium States Models in Quantum Statistical Mechanics. Theoretical and Mathematical Physics. Springer, Berlin (2013)
MATH Google Scholar
Naaijkens, P.: Quantum Spin Systems on Infinite Lattices: A Concise Introduction. Lecture Notes in Physics. Springer International Publishing (2017)
Alicki, R., Alicki, I.T.P.A.R., Fannes, M.: Quantum Dynamical Systems. Oxford University Press (2001)
Barnum, H.: Quantum rate-distortion coding. Phys. Rev. A 62(4), 042309 (2000)
ADS MathSciNet Google Scholar
Devetak, I., Berger, T.: Quantum rate-distortion theory for IID sources. In: Proceedings of 2001 IEEE International Symposium on Information Theory (IEEE Cat. No. 01CH37252), pp. 276. IEEE (2001)
Devetak, I., Berger, T.: Quantum rate-distortion theory for memoryless sources. IEEE Trans. Inf. Theory 48(6), 1580–1589 (2002)
MathSciNet MATH Google Scholar
Chen, X.-Y., Wang, W.-M.: Entanglement information rate distortion of a quantum Gaussian source. IEEE Trans. Inf. Theory 54(2), 743–748 (2008)
MathSciNet MATH Google Scholar
Datta, N., Hsieh, M.-H., Wilde, M.M.: Quantum rate distortion, reverse Shannon theorems, and source-channel separation. IEEE Trans. Inf. Theory 59(1), 615–630 (2013)
MathSciNet MATH Google Scholar
Datta, N., Hsieh, M.-H., Wilde, M.M., Winter, A.: Quantum-to-classical rate distortion coding. J. Math. Phys. 54(4), 042201 (2013)
ADS MathSciNet MATH Google Scholar
Wilde, M.M., Datta, N., Hsieh, M.-H., Winter, A.: Quantum rate-distortion coding with auxiliary resources. IEEE Trans. Inf. Theory 59(10), 6755–6773 (2013)
MathSciNet MATH Google Scholar
Salek, S., Cadamuro, D., Kammerlander, P., Wiesner, K.: Quantum rate-distortion coding of relevant information. IEEE Trans. Inf. Theory 65(4), 2603–2613 (2018)
MathSciNet MATH Google Scholar
Carlen, E.A., Maas, J.: An analog of the 2-Wasserstein metric in non-commutative probability under which the Fermionic Fokker-Planck equation is gradient flow for the entropy. Commun. Math. Phys. 331(3), 887–926 (2014)
ADS MathSciNet MATH Google Scholar
Carlen, E.A., Maas, J.: Gradient flow and entropy inequalities for quantum Markov semigroups with detailed balance. J. Funct. Anal. 273(5), 1810–1869 (2017)
MathSciNet MATH Google Scholar
Carlen, E.A., Maas, J.: Non-commutative calculus, optimal transport and functional inequalities in dissipative quantum systems. J. Stat. Phys. 178(2), 319–378 (2020)
ADS MathSciNet MATH Google Scholar
Rouzé, C., Datta, N.: Concentration of quantum states from quantum functional and transportation cost inequalities. J. Math. Phys. 60(1), 012202 (2019)
ADS MathSciNet MATH Google Scholar
Datta, N., Rouzé, C.: Relating relative entropy, optimal transport and Fisher information: a quantum HWI inequality. Ann. Henri Poincaré 21, 2115–2150 (2020)
ADS MathSciNet MATH Google Scholar
Van Tan, V., Hasegawa, Y.: Geometrical bounds of the irreversibility in Markovian systems. Phys. Rev. Lett. 126, 010601 (2021)
MathSciNet Google Scholar
Wirth, M.: A dual formula for the noncommutative transport distance. J. Stat. Phys. 187(2), 1–18 (2022)
MathSciNet MATH Google Scholar
Gao, L., Junge, M., LaRacuente, N.: Fisher information and logarithmic Sobolev inequality for matrix-valued functions. Ann. Henri Poincaré 21(11), 3409–3478 (2020)
ADS MathSciNet MATH Google Scholar
Chen, Y., Georgiou, T.T., Ning, L., Tannenbaum, A.: Matricial Wasserstein-1 distance. IEEE Control Syst. Lett. 1(1), 14–19 (2017)
MathSciNet Google Scholar
Ryu, E.K., Chen, Y., Li, W., Osher, S.: Vector and matrix optimal mass transport: theory, algorithm, and applications. SIAM J. Sci. Comput. 40(5), A3675–A3698 (2018)
MathSciNet MATH Google Scholar
Chen, Y., Georgiou, T.T., Tannenbaum, A.: Matrix optimal mass transport: a quantum mechanical approach. IEEE Trans. Autom. Control 63(8), 2612–2619 (2018)
MathSciNet MATH Google Scholar
Chen, Y., Georgiou, T.T., Tannenbaum, A.: Wasserstein geometry of quantum states and optimal transport of matrix-valued measures. In: Emerging Applications of Control and Systems Theory, pp. 139–150. Springer (2018)
Agredo, J.: A Wasserstein-type distance to measure deviation from equilibrium of quantum Markov semigroups. Open Syst. Inf. Dyn. 20(02), 1350009 (2013)
MathSciNet MATH Google Scholar
Agredo, J.: On exponential convergence of generic quantum Markov semigroups in a Wasserstein-type distance. Int. J. Pure Appl. Math. 107(4), 909–925 (2016)
Google Scholar
Ikeda, K.: Foundation of quantum optimal transport and applications. Quantum Inf. Process. 19(1), 25 (2020)
ADS MathSciNet MATH Google Scholar
Golse, F., Mouhot, C., Paul, T.: On the mean field and classical limits of quantum mechanics. Commun. Math. Phys. 343(1), 165–205 (2016)
ADS MathSciNet MATH Google Scholar
Caglioti, E., Golse, F., Paul, T.: Towards optimal transport for quantum densities. Annali Scuola Normale Superiore-Classe di Scienze, pp. 49–4, (2022)
Golse, F.: The quantum N-body problem in the mean-field and semiclassical regime. Philos. Trans. R. Soc. A Math. Phys. Eng. Sci. 376(2118), 20170229 (2018)
ADS MathSciNet MATH Google Scholar
Golse, F., Paul, T.: The Schrödinger equation in the mean-field and semiclassical regime. Arch. Ration. Mech. Anal. 223(1), 57–94 (2017)
MathSciNet MATH Google Scholar
Golse, F., Paul, T.: Wave packets and the quadratic Monge–Kantorovich distance in quantum mechanics. C.R. Math. 356(2), 177–197 (2018)
MathSciNet MATH Google Scholar
Caglioti, E., Golse, F., Paul, T.: Quantum optimal transport is cheaper. J. Stat. Phys. 181(1), 149–162 (2020)
ADS MathSciNet MATH Google Scholar
Friedland, S., Eckstein, M., Cole, S., Życzkowski, K.: Quantum Monge–Kantorovich problem and transport distance between density matrices. Phys. Rev. Lett. 129, 110402 (2022)
ADS MathSciNet Google Scholar
Cole, S., Eckstein, M., Friedland, S., Życzkowski, K.: Quantum Optimal Transport (2021). arXiv:2105.06922
Duvenhage, R.: Optimal quantum channels. Phys. Rev. A 104, 032604 (2021)
ADS MathSciNet Google Scholar
Bistroń, R., Eckstein, M., Życzkowski, K.: Monotonicity of a quantum 2-Wasserstein distance. J. Phys. A Math. Theor. 56(9), 095301 (2023)
ADS MathSciNet MATH Google Scholar
Van Tan, V., Saito, K.: Thermodynamic unification of optimal transport: thermodynamic uncertainty relation, minimum dissipation, and thermodynamic speed limits. Phys. Rev. X 13, 011013 (2023)
Google Scholar
Duvenhage, R.: Quadratic Wasserstein metrics for von Neumann algebras via transport plans. J. Oper. Theory 88(2), 289–308 (2022)
MathSciNet MATH Google Scholar
Duvenhage, R.: Wasserstein distance between noncommutative dynamical systems (2021). arXiv:2112.12532
Duvenhage, R., Skosana, S., Snyman, M.: Extending quantum detailed balance through optimal transport (2022). arXiv preprint arXiv:2206.15287
De Palma, G., Trevisan, D.: Quantum optimal transport with quantum channels. Ann. Henri Poincaré 22(10), 3199–3234 (2021)
ADS MathSciNet MATH Google Scholar
Duvenhage, R., Snyman, M.: Balance between quantum Markov semigroups. Ann. Henri Poincaré 19(6), 1747–1786 (2018)
ADS MathSciNet MATH Google Scholar
Agredo, J., Fagnola, F.: On quantum versions of the classical Wasserstein distance. Stochastics 89(6–7), 910–922 (2017)
MathSciNet MATH Google Scholar
Życzkowski, K., Slomczynski, W.: The Monge distance between quantum states. J. Phys. A Math. Gen. 31(45), 9095 (1998)
ADS MathSciNet MATH Google Scholar
Życzkowski, K., Slomczynski, W.: The Monge metric on the sphere and geometry of quantum states. J. Phys. A Math. Gen. 34(34), 6689 (2001)
ADS MathSciNet MATH Google Scholar
Bengtsson, I., Życzkowski, K.: Geometry of Quantum States: An Introduction to Quantum Entanglement. Cambridge University Press (2017)
Nielsen, M.A., Chuang, I.L.: Quantum Computation and Quantum Information: 10th Anniversary Edition. Cambridge University Press (2010)
Wilde, M.M.: Quantum Information Theory. Cambridge University Press, 2 edition (2017)
Holevo, A.S.: Quantum Systems, Channels, Information: A Mathematical Introduction. Texts and Monographs in Theoretical Physics. De Gruyter (2019)
Jakšić, V., Pillet, C.-A., Tauber, C.: Approach to equilibrium in translation-invariant quantum systems: some structural results. Annales Henri Poincaré (2023). https://doi.org/10.1007/s00023-023-01281-6
Griffiths, R.B., Ruelle, D.: Strict convexity (“continuity’’) of the pressure in lattice systems. Commun. Math. Phys. 23(3), 169–175 (1971)
ADS MathSciNet Google Scholar
Roos, H.: Strict convexity of the pressure: a note on a paper of R. B. Griffiths and D. Ruelle. Commun. Math. Phys. 36(4), 263–276 (1974)
ADS MathSciNet Google Scholar
Israel, R.B.: Convexity in the Theory of Lattice Gases. Princeton Series in Physics. Princeton University Press (2015)
Ruelle, D.: Thermodynamic Formalism: The Mathematical Structure of Equilibrium Statistical Mechanics. Cambridge Mathematical Library. Cambridge University Press, Cambridge (2004)
MATH Google Scholar
Jakšić, V., Pillet, C.-A., Tauber, C.: A note on adiabatic time evolution and quasi-static processes in translation-invariant quantum systems. Ann. Henri Poincaré (2023). https://doi.org/10.1007/s00023-023-01282-5
Van Enter, A.C.D., Fernández, R., Sokal, A.D.: Regularity properties and pathologies of position-space renormalization-group transformations: scope and limitations of Gibbsian theory. J. Stat. Phys. 72(5), 879–1167 (1993)
ADS MathSciNet MATH Google Scholar
Lieb, E.H.: Convex trace functions and the Wigner–Yanase–Dyson conjecture. Adv. Math. 11(3), 267–288 (1973)
MathSciNet MATH Google Scholar
Polyanskiy, Y., Yihong, W.: Wasserstein continuity of entropy and outer bounds for interference channels. IEEE Trans. Inf. Theory 62(7), 3992–4002 (2016)
MathSciNet MATH Google Scholar
Osborne, T., Winter, A.: A quantum generalisation of Talagrand’s inequality. Tobias J. Osborne’s Research Notes. https://tjoresearchnotes.wordpress.com/2009/02/13/a-quantum-generalisation-of-talagrands-inequality (2009)
Eldar, L., Harrow, A.W.: Local Hamiltonians whose ground states are hard to approximate. In: 2017 IEEE 58th Annual Symposium on Foundations of Computer Science (FOCS), pp. 427–438 (2017)
Ollivier, Y.: Ricci curvature of Markov chains on metric spaces. J. Funct. Anal. 256(3), 810–864 (2009)
MathSciNet MATH Google Scholar
Gao, L., Rouzé, C.: Ricci curvature of quantum channels on non-commutative transportation metric spaces (2021). arXiv:2108.10609
Bardet, I., Capel, Á., Gao, L., Lucia, A., Pérez-García, D., Rouzé, C.: Entropy decay for Davies semigroups of a one dimensional quantum lattice (2021). arXiv:2112.00601
Bardet, I., Capel, Á., Gao, L., Lucia, A., Pérez-García, D., Rouzé, C.: Rapid thermalization of spin chain commuting Hamiltonians. Phys. Rev. Lett. 130, 060401 (2023)
ADS MathSciNet Google Scholar
Capel, Á., Rouzé, C., França, D.S.: The modified logarithmic Sobolev inequality for quantum spin systems: classical and commuting nearest neighbour interactions (2020). arXiv:2009.11817
Fröhlich, J., Ueltschi, D.: Some properties of correlations of quantum lattice systems in thermal equilibrium. J. Math. Phys. 56(5), 053302 (2015)
ADS MathSciNet MATH Google Scholar
Alicki, R., Fannes, M.: Quantum Dynamical Systems. Oxford University Press, Oxford (2001)
Capobianco, S.: Multidimensional cellular automata and generalization of Fekete’s lemma. Discrete Math. Theor. Comput. Sci. 10(3) (2008)

Download references

Acknowledgements

We thank Emily Beatty for useful suggestions to improve the presentation of the proof of Theorem 9.1. GDP has been supported by the HPC National Centre for HPC, Big Data and Quantum Computing—Proposal code CN00000013, CUP J33C22001170001, funded within PNRR—Mission 4—Component 2 Investment 1.4. GDP is a member of the “Gruppo Nazionale per la Fisica Matematica (GNFM)” of the “Istituto Nazionale di Alta Matematica “Francesco Severi” (INdAM)”. DT is a member of the INdAM group “Gruppo Nazionale per l’Analisi Matematica, la Probabilità e le loro Applicazioni (GNAMPA)” and was partially supported by the INdAM-GNAMPA project 2022 “Temi di Analisi Armonica Subellittica.”

Author information

Authors and Affiliations

Department of Mathematics, University of Bologna, Bologna, Italy
Giacomo De Palma
Department of Mathematics, University of Pisa, Pisa, Italy
Dario Trevisan

Authors

Giacomo De Palma
View author publications
You can also search for this author in PubMed Google Scholar
Dario Trevisan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Giacomo De Palma.

Additional information

Communicated by David Pérez-García.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Appendices

Properties of the Quantum $W_1$ Distance

Proposition A.1

([17, Proposition 2]). For any finite set $\Lambda $ and any $\Delta \in \mathcal {O}_\Lambda ^T$, we have

$$\begin{aligned} \frac{1}{2}\left\| \Delta \right\| _1 \le \left\| \Delta \right\| _{W_1} \le \frac{\left| \Lambda \right| }{2}\left\| \Delta \right\| _1. \end{aligned}$$

(A.1)

Proposition A.2

([17, Proposition 5]). Let $\Lambda '\subseteq \Lambda $ be finite sets. Then, for any $\Delta \in \mathcal {O}_\Lambda ^T$ such that $\textrm{Tr}_{\Lambda '}\Delta =0$ we have

$$\begin{aligned} \left\| \Delta \right\| _{W_1} \le \frac{q^2-1}{q^2}\left| \Lambda '\right| \left\| \Delta \right\| _1. \end{aligned}$$

(A.2)

Proposition A.3

(Superadditivity [17, Proposition 4]). The quantum $W_1$ distance is superadditive in general and additive for product states, i.e., for any two disjoint finite sets $\Lambda ,\,\Lambda '$ and any $\rho ,\,\sigma \in \mathcal {S}_{\Lambda \Lambda '}$ we have

$$\begin{aligned} \left\| \rho - \sigma \right\| _{W_1} \ge \left\| \rho _\Lambda - \sigma _\Lambda \right\| _{W_1} + \left\| \rho _{\Lambda '} - \sigma _{\Lambda '}\right\| _{W_1}, \end{aligned}$$

(A.3)

and for any $\rho _\Lambda ,\,\sigma _\Lambda \in \mathcal {S}_\Lambda $ and any $\rho _{\Lambda '},\,\sigma _{\Lambda '}\in \mathcal {S}_{\Lambda '}$ we have

$$\begin{aligned} \left\| \rho _\Lambda \otimes \rho _{\Lambda '} - \sigma _\Lambda \otimes \sigma _{\Lambda '}\right\| _{W_1} = \left\| \rho _\Lambda - \sigma _\Lambda \right\| _{W_1} + \left\| \rho _{\Lambda '} - \sigma _{\Lambda '}\right\| _{W_1}. \end{aligned}$$

(A.4)

Auxiliary Proofs

1.1 Proof of Proposition 2.1

Proposition

(2.1) The trace distance on $\mathcal {S}_{\mathbb {Z}^d}$ is the supremum of the trace distances between the marginal states: For any $\rho ,\,\sigma \in \mathcal {S}_{\mathbb {Z}^d}$,

$$\begin{aligned} T(\rho ,\sigma ) = \frac{1}{2}\sup _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\left\| \rho _\Lambda - \sigma _\Lambda \right\| _1, \end{aligned}$$

(B.1)

where $\Vert \cdot \Vert _1$ denotes the trace norm on $\mathfrak {U}_\Lambda $ given by

$$\begin{aligned} \left\| A\right\| _1 = \textrm{Tr}_\Lambda \sqrt{A^\dag A},\qquad A\in \mathfrak {U}_\Lambda . \end{aligned}$$

(B.2)

Proof

Since $\mathfrak {U}_{\mathbb {Z}^d}^{loc}$ is dense in $\mathfrak {U}_{\mathbb {Z}^d}$, we have

$$\begin{aligned} 2\,T(\rho ,\sigma )&= \sup _{A\in \mathfrak {U}_{\mathbb {Z}^d}^{loc}:\Vert A\Vert _\infty \le 1}\left| \rho (A) - \sigma (A)\right| = \sup _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\sup _{A\in \mathfrak {U}_\Lambda :\Vert A\Vert _\infty \le 1}\left| \rho (A) - \sigma (A)\right| \nonumber \\&= \sup _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\sup _{A\in \mathfrak {U}_\Lambda :\Vert A\Vert _\infty \le 1}\left| \textrm{Tr}_\Lambda \left[ \left( \rho _\Lambda - \sigma _\Lambda \right) A\right] \right| = \sup _{\Lambda \in \mathcal {F}_{\mathbb {Z}^d}}\left\| \rho _\Lambda - \sigma _\Lambda \right\| _1\,. \end{aligned}$$

(B.3)

The claim follows. $\square $

1.2 Proof of Proposition 5.1

Proposition

(5.1) For any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$, any $H\in \mathcal {O}_\Lambda $ and any $x\in \Lambda $, (3.3) and (5.1) are equivalent.

Proof

Let

$$\begin{aligned} \partial _x H = 2\min _{A\in \mathcal {O}_{\Lambda \setminus x}}\left\| H - A\right\| _\infty ,\qquad \tilde{\partial }_x H = 2\inf _{A\in \mathcal {O}_{\mathbb {Z}^d\setminus x}}\left\| H - A\right\| _\infty . \end{aligned}$$

(B.4)

We clearly have $\tilde{\partial }_x H \le \partial _x H$. Let $\omega _{\mathbb {Z}^d{\setminus }\Lambda }\in \mathcal {S}_{\mathbb {Z}^d{\setminus }\Lambda }$ be the uniform distribution on $\mathbb {Z}^d\setminus \Lambda $, and let $\Psi _\Lambda :\mathfrak {U}_{\mathbb {Z}^d}\rightarrow \mathfrak {U}_\Lambda $ be the completely positive unital linear map such that for any $A\in \mathfrak {U}_{\mathbb {Z}^d}$ and any $\rho _\Lambda \in \mathcal {S}_\Lambda $

$$\begin{aligned} \textrm{Tr}_\Lambda \left[ \rho _\Lambda \,\Psi _\Lambda (A)\right] = (\omega _{\mathbb {Z}^d\setminus \Lambda }\otimes \rho _\Lambda )(A). \end{aligned}$$

(B.5)

Let $A\in \mathcal {O}_{\mathbb {Z}^d\setminus x}$. We have for any $\rho _\Lambda \in \mathcal {S}_\Lambda $ and any unitary operator $U_x\in \mathfrak {U}_x$

$$\begin{aligned} \textrm{Tr}_\Lambda \left[ \rho _\Lambda \,U_x^\dag \,\Psi _\Lambda (A)\,U_x\right]&= \textrm{Tr}_\Lambda \left[ U_x\,\rho _\Lambda \,U_x^\dag \,\Psi _\Lambda (A)\right] = \left( \omega _{\mathbb {Z}^d\setminus \Lambda }\otimes U_x\,\rho _\Lambda \,U_x^\dag \right) (A)\nonumber \\&= (\omega _{\mathbb {Z}^d\setminus \Lambda }\otimes \rho _\Lambda )\left( U_x^\dag \,A\,U_x\right) = (\omega _{\mathbb {Z}^d\setminus \Lambda }\otimes \rho _\Lambda )(A)\nonumber \\ {}&= \textrm{Tr}_\Lambda \left[ \rho _\Lambda \,\Psi _\Lambda (A)\right] \,, \end{aligned}$$

(B.6)

therefore $U_x^\dag \,\Psi _\Lambda (A)\,U_x = \Psi _\Lambda (A)$, hence $\Psi _\Lambda (A)\in \mathcal {O}_{\Lambda \setminus x}$. We then have

$$\begin{aligned} \partial _x H \le 2\left\| H - \Psi _\Lambda (A)\right\| _\infty = 2\left\| \Psi _\Lambda (H-A)\right\| _\infty \le 2\left\| H-A\right\| _\infty , \end{aligned}$$

(B.7)

where the last inequality follows since $\Psi _\Lambda $ is completely positive and unital. We then have $\partial _x H \le \tilde{\partial }_x H$. The claim follows. $\square $

Auxiliary Lemmas

Lemma C.1

(Multidimensional Fekete’s lemma [100]). Let $f:\mathbb {N}_+^d\rightarrow \mathbb {R}$ be superadditive with respect to each variable, i.e.,

$$\begin{aligned} f(x_1,\,\ldots ,\,x_i+t,\,\ldots ,\,x_d) \ge f(x_1,\,\ldots ,\,x_i,\,\ldots ,\,x_d) + f(x_1,\,\ldots ,\,t,\,\ldots ,\,x_d)\nonumber \\ \end{aligned}$$

(C.1)

for any $x_1,\,\ldots ,\,x_d,\,t\in \mathbb {N}$ and any $i=1,\,\ldots ,\,d$. Then,

$$\begin{aligned} \lim _{x\rightarrow \infty }\frac{f(x)}{x_1\ldots x_d} = \sup _{x\in \mathbb {N}_+^d}\frac{f(x)}{x_1\ldots x_d}. \end{aligned}$$

(C.2)

Lemma C.2

Let $H\in \mathcal {O}_\Lambda $ be positive semi-definite. Then, for any $x\in \Lambda $,

$$\begin{aligned} \partial _x H \le \left\| H\right\| _\infty . \end{aligned}$$

(C.3)

Proof

We have

$$\begin{aligned} -\frac{\left\| H\right\| _\infty }{2}\,\mathbb {I} \le H - \frac{\left\| H\right\| _\infty }{2}\,\mathbb {I} \le \frac{\left\| H\right\| _\infty }{2}\,\mathbb {I}, \end{aligned}$$

(C.4)

therefore

$$\begin{aligned} \partial _x H \le 2\left\| H - \frac{\left\| H\right\| _\infty }{2}\,\mathbb {I}\right\| _\infty \le \left\| H\right\| _\infty . \end{aligned}$$

(C.5)

The claim follows. $\square $

Proposition C.1

Let $\Lambda _1,\,\ldots ,\,\Lambda _k$ be k copies of the finite set $\Lambda $. Then, for any $\rho \in \mathcal {S}_{\Lambda _1\ldots \Lambda _k}$ and any $\sigma \in \mathcal {S}_\Lambda $ we have

$$\begin{aligned} \left\| \rho - \sigma ^{\otimes k}\right\| _{W_1}^2 \le 2k\left| \Lambda \right| ^2\,S\left( \rho \left\| \sigma ^{\otimes k}\right. \right) . \end{aligned}$$

(C.6)

Proof

The proof follows the same lines as the proof of [17, Theorem 2]. We have

$$\begin{aligned} \left\| \rho - \sigma ^{\otimes k}\right\| _{W_1}&\le \sum _{i=1}^k\left\| \sigma ^{\otimes \left( i-1\right) }\otimes \rho _{\Lambda _i\ldots \Lambda _k} - \sigma ^{\otimes i}\otimes \rho _{\Lambda _{i+1}\ldots \Lambda _k}\right\| _{W_1} \nonumber \\&\overset{\mathrm {(a)}}{\le } \left| \Lambda \right| \sum _{i=1}^k\left\| \rho _{\Lambda _i\ldots \Lambda _k} - \sigma \otimes \rho _{\Lambda _{i+1}\ldots \Lambda _k}\right\| _1 \nonumber \\&\overset{\mathrm {(b)}}{\le } \left| \Lambda \right| \sum _{i=1}^k\sqrt{2\,S\left( \rho _{\Lambda _i\ldots \Lambda _k}\left\| \sigma \otimes \rho _{\Lambda _{i+1}\ldots \Lambda _k}\right. \right) } \nonumber \\&= \left| \Lambda \right| \sum _{i=1}^k\sqrt{2\left( S(\rho _{\Lambda _i}) + S(\rho _{\Lambda _{i+1}\ldots \Lambda _k}) - S(\rho _{\Lambda _i\ldots \Lambda _k}) + S(\rho _{\Lambda _i}\Vert \sigma )\right) } \nonumber \\&\overset{\mathrm {(c)}}{\le } \left| \Lambda \right| \sqrt{2k\sum _{i=1}^k\left( S(\rho _{\Lambda _i}) + S(\rho _{\Lambda _{i+1}\ldots \Lambda _k}) - S(\rho _{\Lambda _i\ldots \Lambda _k}) + S(\rho _{\Lambda _i}\Vert \sigma )\right) } \nonumber \\&=\left| \Lambda \right| \sqrt{2k}\sqrt{\sum _{i=1}^k\left( S(\rho _{\Lambda _i}) + S(\rho _{\Lambda _i}\Vert \sigma )\right) - S(\rho )} =\left| \Lambda \right| \sqrt{2k\,S\left( \rho \left\| \sigma ^{\otimes k}\right. \right) }\,. \end{aligned}$$

(C.7)

(a) follows from Proposition A.2 observing that

$$\begin{aligned} \textrm{Tr}_{\Lambda _i}\left[ \sigma ^{\otimes \left( i-1\right) }\otimes \rho _{\Lambda _i\ldots \Lambda _k} - \sigma ^{\otimes i}\otimes \rho _{\Lambda _{i+1}\ldots \Lambda _k}\right] = 0; \end{aligned}$$

(C.8)

(b) follows from Pinsker’s inequality; (c) follows from the concavity of the square root. The claim follows. $\square $

Lemma C.3

We have

$$\begin{aligned} \dim \mathcal {W}_k \le D_k\dim \mathcal {V}. \end{aligned}$$

(C.9)

Proof

Let $A_0,\,\ldots ,\,A_{q^2-1}$ be a basis of $\mathbb {C}^{q\times q}$ with $A_0 = \mathbb {I}$. For any $x\in \left\{ 0,\,\ldots ,\,q^2-1\right\} ^{\Lambda }$, let

$$\begin{aligned} A_x = \bigotimes _{i\in \Lambda }A_{x_i}, \end{aligned}$$

(C.10)

where each $A_{x_i}$ acts on the site i. We have

$$\begin{aligned} \mathcal {W}_k \subseteq \textrm{span}\left\{ A_x|\psi \rangle :|\psi \rangle \in \mathcal {V},\,H(x)\le k\right\} . \end{aligned}$$

(C.11)

We also have

$$\begin{aligned} \mathcal {W}_{k-1} \subseteq \textrm{span}\left\{ A_x|\psi \rangle :|\psi \rangle \in \mathcal {V},\,H(x)\le k-1\right\} , \end{aligned}$$

(C.12)

and since $\mathcal {W}_k\perp \mathcal {W}_{k-1}$, we have

$$\begin{aligned} \mathcal {W}_k \subseteq \textrm{span}\left\{ A_x|\psi \rangle :|\psi \rangle \in \mathcal {V},\,H(x) = k\right\} . \end{aligned}$$

(C.13)

Therefore,

$$\begin{aligned} \dim \mathcal {W}_k \le \left| H^{-1}(k)\right| \dim \mathcal {V}. \end{aligned}$$

(C.14)

The claim follows. $\square $

Lemma C.4

Let $\Phi \in \mathcal {B}_{\mathbb {Z}^d}^r$. Then, for any $\Lambda \in \mathcal {F}_{\mathbb {Z}^d}$ and any $x\in \Lambda $ we have

$$\begin{aligned} \partial _x H^\Phi _\Lambda \le 2\left\| \Phi \right\| _r, \end{aligned}$$

(C.15)

and

$$\begin{aligned} \left\| H^\Phi _\Lambda \right\| _L \le 2\left\| \Phi \right\| _r. \end{aligned}$$

(C.16)

Proof

We have

$$\begin{aligned} \partial _x H^\Phi _\Lambda\le & {} 2\sum _{x\in X \subseteq \Lambda }\left\| \Phi (X)\right\| _\infty \overset{\mathrm {(a)}}{=} 2\sum _{0\in X \subseteq \Lambda -x}\left\| \Phi (X)\right\| _\infty \nonumber \\\le & {} 2\sum _{0\in X \in \mathcal {F}_{\mathbb {Z}^d}}\left\| \Phi (X)\right\| _\infty \le 2\left\| \Phi \right\| _r, \end{aligned}$$

(C.17)

where (a) follows from the translation invariance of $\Phi $. The claim follows. $\square $

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

De Palma, G., Trevisan, D. The Wasserstein Distance of Order 1 for Quantum Spin Systems on Infinite Lattices. Ann. Henri Poincaré 24, 4237–4282 (2023). https://doi.org/10.1007/s00023-023-01340-y

Download citation

Received: 07 November 2022
Accepted: 15 June 2023
Published: 24 June 2023
Issue Date: December 2023
DOI: https://doi.org/10.1007/s00023-023-01340-y

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

The Wasserstein Distance of Order 1 for Quantum Spin Systems on Infinite Lattices

Abstract

Similar content being viewed by others

Entanglement Bounds in the XXZ Quantum Spin Chain

Singular continuous Cantor spectrum for magnetic quantum walks

A Note on the Spectral Gap of the Fredrickson–Andersen One Spin Facilitated Model

1 Introduction

1.1 Our Contribution

1.2 Related Approaches

2 Quantum Spin Systems on Infinite Lattices

2.1 Algebra and States

Definition 2.1

Proposition 2.1

Proof

Definition 2.2

Definition 2.3

Remark 2.1

2.2 Interactions

2.3 Gibbs States

Remark 2.2

3 The Quantum \(W_1\) Distance

Definition 3.1

Definition 3.2

Proposition 3.1

4 The Quantum \(W_1\) Distance for Infinite Lattices

Definition 4.1

Remark 4.1

Proposition 4.1

Proof

Proposition 4.2

Proof

Definition 4.2

Proposition 4.3

Proof

Theorem 4.1

Lemma 4.1

Proof

Proposition 4.4

Proof

Lemma 4.2

Proof

Corollary 4.1

Proof

5 The Quantum Lipschitz Constant for Infinite Lattices

Definition 5.1

Proposition 5.1

Proof

Definition 5.2

Proposition 5.2

Proof

5.1 Physical Equivalence

Definition 5.3

Proposition 5.3

Proof

6 Duality for the \(w_1\) Distance

Theorem 6.1

Proof

7 Recovery of Ornstein’s \(\bar{d}\)-Distance

Proposition 7.1

Proof

Corollary 7.1

Corollary 7.2

8 Quantum Concentration Inequalities

8.1 Poincaré Inequality

Proposition 8.1

8.1.1 Proof of Proposition 8.1

Proposition 8.2

Proposition 8.3

Proof

8.2 Gaussian Concentration Inequality

Theorem 8.1

Remark 8.1

Theorem 8.2

Proof

Corollary 8.1

Proof

9 \(W_1\) Continuity of the von Neumann Entropy

Theorem 9.1

Theorem 9.2

Remark 9.1