Abstract
We continue the investigation, started in Jakšić et al. in (J. Stat. Phys. 166:926–1015, 2017), of a network of harmonic oscillators driven out of thermal equilibrium by heat reservoirs. We study the statistics of the fluctuations of the heat fluxes flowing between the network and the reservoirs in the nonequilibrium steady state and in the large time limit. We prove a large deviation principle for these fluctuations and derive the fluctuation relation satisfied by the associated rate function.
We’re sorry, something doesn't seem to be working properly.
Please try refreshing the page. If that doesn't work, please contact support so we can address the problem.
Avoid common mistakes on your manuscript.
1 Introduction
Fluctuation Relations (FRs for short) describe universal features of the statistical properties of physical systems. The first instance of such a relation goes back to 1905 and the celebrated work of Einstein on Brownian motion. Despite a few early occurrences in the literatureFootnote 1, it is only after the works [17, 19, 25, 26] that FRs became a major research direction in nonequilibrium statistical mechanics, see, e.g., [11, 12, 29, 40, 41] on the theoretical side and [16, 52] on the experimental one. See also [23, 24, 50] for more mathematically oriented introductions to the subject and the reviews [34, 36, 46, 51] for more exhaustive references. In this work, we shall adhere to the somewhat restrictive but mathematically precise perspective advocated by Gallavotti and Cohen in [25, 26], and call FR a universal—i.e., model independent—symmetry property of the rate function describing the large deviations of some distinguished observable of a physical system in a steady state in the large time limit. As explained in [17, 22, 36, 40], such FRs, sometimes coined detailed FRs in the physics literature, provide extensions of the well-known Green–Kubo and Onsager relations of linear response theory to the nonequilibrium regime.
While FRs in smooth chaotic dynamical systems on compact phase space are now pretty well understood, their status is more problematic in the presence of singularities and for general deterministic or stochastic dynamical system with non-compact phase space. As observed through the study of specific models, FRs may only have a limited domain of validity and/or acquire non-universal features in such circumstances, see [6, 21, 45, 48, 53].
-
From the physical point of view, there are issues related to the proper choice of the relevant observable, as discussed in [37]. While most studies concern entropy production, work and heat transfer in time-dependent protocols involving a single heat reservoir, there is also some interests in investigating individual heat fluxes in multi-reservoir systems.
-
From the mathematical point of view, the problems are often related to the failure of standard approaches to the derivation of a large deviation principle (see the series [30]– [33] and [43]), or due to the technical difficulties met in applying the contraction principle, as in [5, Section 3.4].
In order to reach a better understanding of FRs we need to further investigate simple models which allow for a clean mathematical treatment. Networks of oscillators [20, 42] are among the simplest candidates. While some progresses have been achieved in our understanding of the non-equilibrium dynamics of networks of anharmonic oscillators (see, e.g., the recent works [9, 10]), a complete picture of FRs for these systems seems to be still out of reach of currently available techniques (to the best of our knowledge, the only partial results can be found in [48] for chains of oscillators). The circumstances are much more favorable to networks of harmonic oscillators [16, 38]. Despite being very special, the latter provide effective models for a wide range of systems and processes, from macroscopic electrical circuits [27, 52] to the microscopic dynamics of protein [1, 28], including the motion of mesoscopic colloidal particles [35, 53]. A novel control-theoretic approach to stochastically driven harmonic networks has been developed in [37]. There, FRs were obtained for various quantities related to the entropy produced by a general harmonic network driven out of equilibrium by thermal forcing (see Eq. (2.11) below for a typical result). The purpose of the present work is to continue these investigations, following the same control-theoretic strategy, and focusing on the individual energy currents flowing between the network and its environment.
Let us briefly describe the settings of [37] which will be used in this work. We focus on a collection, indexed by a finite set \(\mathscr {I}\), of one-dimensional harmonic oscillators with position and momentum coordinates \(q=(q_i)_{i\in \mathscr {I}}\) and \(p=(p_i)_{i\in \mathscr {I}}\). The Hamiltonian of this system is the quadratic form
where \(|\,\cdot \,|\) denotes the Euclidean norm and \(\kappa \) an automorphism of \(\mathbb {R}^\mathscr {I}\).
Besides the conservative harmonic forces deriving from this Hamiltonian, a subset of the oscillators, indexed by \({\partial \mathscr {I}}\subset \mathscr {I}\), is acted upon by thermal reservoirs. The latter are described by Langevin forces
where \(\vartheta _i>0\) denotes the temperature of the \(i^\text {th}\)–reservoir, \(\gamma _i>0\) the rate at which energy is dissipated in this reservoir and \(\dot{w}_i\) is a standard white noise. We interpret the workFootnote 2
performed by the Langevin force \(f_i\) during the time interval [0, t] as the amount of heat injected in the network by the \(i^\text {th}\)–reservoir during this period. We denote by \(\Xi =\mathbb {R}^{\partial \mathscr {I}}\) the vector space where the heat currents \(\Phi (t)=(\Phi _i(t))_{i\in {\partial \mathscr {I}}}\) take their values and write the associated Euclidean inner product as \(\langle \xi ,\Phi \rangle =\sum _{i\in {\partial \mathscr {I}}}\xi _i\Phi _i\).
The main results of the present work concern the statistics of the \(\Xi \)–valued process \(\{\Phi (t)\}_{t\ge 0}\) induced by the stationary Markov process generated by the system of stochastic differential equations
with appropriate initial conditions. More precisely, and under a controllability condition which ensures the existence and uniqueness of an invariant measure for this Markov process:
-
We identify a subspace \(\mathscr {L}\subset \Xi \) characterized by the fact that for \(\xi \in \mathscr {L}\) one has
$$\begin{aligned} \langle \xi ,\Phi (t)\rangle =\mathscr {Q}_\xi (q(t),p(t))-\mathscr {Q}_\xi (q(0),p(0)) \end{aligned}$$(1.3)where \(\mathscr {Q}_\xi \) is a quadratic form which is a first integral of the harmonic network. Applying a general result of [7] allows us to describe the asymptotics of (1.3) in the limit \(t\rightarrow \infty \). Besides a large deviation principle (LDP for short) for the fluctuations of order t of this quantity, we also get the explicit form of its limiting distribution, which has full support. This is in sharp contrast with what would happen if \(\mathscr {Q}_\xi \) was a bounded function on the phase space: the right-hand side of (1.3)—often coined a “boundary term” in the physics literature—would have no order t fluctuations and its limiting law would have compact support.
-
We then focus on the component \(\Phi (t)^\perp \) of the heat flux orthogonal to \(\mathscr {L}\). We show that it has fluctuations of order t satisfying a local LDP whose rate function I is the (partial) Legendre transform of a real analytic function g for which we provide several explicit representations. In particular, we connect g to the spectral properties of a finite dimensional matrix and the domain of validity of the LDP to some associated algebraic Riccati equation. Both functions g and I satisfy a FR.
-
We derive a simple sufficient condition, in terms of the solutions to the above mentioned Riccati equation, which ensures that our LDP and the associated FR hold globally. We also provide several examples where our condition is fulfilled. This shows that there is a regime where the components of the heat flux along the subspace \(\mathscr {L}\) are responsible for the failure of the global FR for the entropy production observed in [37]. Our examples show, however, that our sufficient condition does not survive strong thermal forcing.
-
In cases where the LDP for \(\Phi (t)^\perp \) holds globally without our sufficient condition being satisfied, we show that the rate function only satisfies the universal FR on a proper subset of \(\Xi \) which is again described in terms of the solutions to the Riccati equation.
The remaining parts of the paper are organized as follows. In Sect. 2, we introduce a general class of harmonic networks driven out of thermal equilibrium by heat reservoirs. We describe the stochastic processes generated by their nonequilibrium dynamics and, within this probabilistic framework, we identify the fluxes of energy flowing between the network and the heat reservoirs. Then, we briefly recall some results of [37] on the fluctuations of the entropy produced by the networks in a nonequilibrium steady state: a large deviation principle and the associated FRs. Finally, we sketch a naive argument which will motivate the approach followed in this work.
In Sect. 3, we formulate our main results on the fluctuations of the heat fluxes in a nonequilibrium steady state of the network. Under a natural controllability assumption, we provide an explicit formula for the large time asymptotics of the cumulant generating function of these fluxes. Then, we describe the resulting local LDP for the fluctuations of the heat fluxes and the associated FRs. Finally, under an additional assumption on the network, we provide a global LDP with its associated FRs.
In Sect. 4, we provide some specific examples to which our results apply. The final Section 5 collects all the proofs of our results.
2 The Model
2.1 Setup
In order to set up the notation to be used in the sequel, we briefly recall the general framework of [37], referring the reader to this paper for more details.
Notations and Conventions. Let E and F be real or complex Hilbert spaces. L(E, F) denotes the set of (continuous) linear operators \(A:E\rightarrow F\) and \(L(E)=L(E,E)\). For \(A\in L(E,F)\), \(A^*\in L(F,E)\) denotes the adjoint of A, \(\Vert A\Vert \) its operator norm, \(\mathop {\mathrm {Ran}}\nolimits A\subset F\) its range and \(\mathop {\mathrm {Ker}}\nolimits A\subset E\) its kernel. We denote the spectrum of \(A\in L(E)\) by \(\mathop {\mathrm {sp}}\nolimits (A)\). A is non-negative (resp. positive), written \(A \ge 0\) (resp. \(A> 0\)), if it is self-adjoint and \(\mathop {\mathrm {sp}}\nolimits (A)\in [0,\infty [\) (resp. \(\mathop {\mathrm {sp}}\nolimits (A)\subset ]0,\infty [\)). We write \(A\ge B\) whenever \(A-B\ge 0\). A pair \((A,Q)\in L(E)\times L(F,E)\) is said to be controllable if the smallest A-invariant subspace of E containing \(\mathop {\mathrm {Ran}}\nolimits Q\) is E itself. Denoting by \(\mathbb {C}_{\mp }\) the open left/right half-plane, \(A\in L(E)\) is said to be stable/anti-stable whenever \(\mathop {\mathrm {sp}}\nolimits (A)\subset \mathbb {C}_{\mp }\).
We consider the harmonic network described in the Introduction. The configuration space \(\mathbb {R}^\mathscr {I}\) is endowed with its Euclidean structure and the phase space \(\Gamma =\mathbb {R}^\mathscr {I}\oplus \mathbb {R}^\mathscr {I}\) is equipped with its canonical symplectic structure. On these spaces, \(|\ \ |\) and \(\,\cdot \,\) denote the Euclidean norm and inner product, respectively. Recall that the Euclidean inner product of the space \(\Xi =\mathbb {R}^{\partial \mathscr {I}}\) is written \(\langle \xi ,\Phi \rangle =\sum _{i\in {\partial \mathscr {I}}}\xi _i\Phi _i\).
Convention. We identify \(\xi \in \Xi \) with the element of \(L(\Xi )\) defined by
In particular, whenever we write inequalities involving such \(\xi \), they are always to be interpreted as operator inequalities.
Introducing the linear map \(\iota \in L(\mathbb {R}^{\partial \mathscr {I}},\mathbb {R}^\mathscr {I})\) defined by
we set
The internal energy of the network then writes \(h(x)=\frac{1}{2}|x|^2\), and its dynamics is described by the following system of Itô stochastic differential equations
w denoting a standard \(\Xi \)–valued Wiener process. The solution of the Cauchy problem associated to (2.2), with initial condition \(x(0)=x_0\), can be written explicitly as
This relation defines a family of \(\Gamma \)–valued Markov processes indexed by the initial condition \(x_0\in \Gamma \). The generator of this degenerate diffusion process is given by
where \(B=QQ^*\). We shall denote \(\mathbb {P}_{x_0}\) the probability measure induced on the path space \(C([0,\infty [,\Gamma )\) and by \(\mathbb {E}_{x_0}\) the corresponding expectation functional. Given a probability measure \(\nu \) on \(\Gamma \), we further set \(\mathbb {P}_\nu =\int \mathbb {P}_x\nu (\mathrm {d}x)\) and define similarly \(\mathbb {E}_\nu \).
For later references, we note the following structural relations
Moreover, denoting by \(\theta \) the time-reversal involution \((p,q)\mapsto (-p,q)\) of \(\Gamma \),
Setting
we also have
In the sequel, we shall further assume the Kalman condition
(C) The pair (A, Q) is controllable.
We recall (see [37, Theorem 3.2] and references therein) that under this assumption the process (2.3) admits a unique invariant measure \(\mu \), the centered Gaussian measure on \(\Gamma \) with covariance
which is also characterized as the unique solution of the Lyapunov equation \(AM+MA^*+B=0\).
Remark
If the environment is in thermal equilibrium at temperature \(T_0>0\), i.e., if \(\vartheta =T_0 I_{\Xi }\), then \(M=T_0I_\Gamma \) and \(\mu \) is the Gibbs measure \(\mu (\mathrm {d}x)\propto \mathrm {e}^{-h(x)/T_0}\mathrm {d}x\).
2.2 Heat Fluxes and Entropy Production
Following [37], we shall interpret the work (1.2) performed by the Langevin force during the time interval [0, t] as the amount of heat injected in the network by the \(i^\text {th}\)–reservoir during this period. In [37], a LDP and extended fluctuation relations were proven for the total amount of entropy dissipated in the reservoirs, i.e., the entropy produced by the network
Let us briefly recall these results.
Assuming the Kalman condition (C), the family \(\{\mathfrak {S}(t)\}_{t\ge 0}\) satisfies a global LDP with a good rate function \(I:\mathbb {R}\rightarrow [0,+\infty ]\), i.e., for any Borel set \(S\subset \mathbb {R}\), one has
where \(\dot{S}\) and \({\bar{S}}\) denote respectively the interior and the closure of S. The mean entropy production rate
exists and is non-negative. Whenever \(\mathrm {ep}>0\), the rate function satisfies the FR
for \(|s|\le \mathrm {ep}\). However, this universal relation fails for \(|s|>\mathrm {ep}\) where, instead, a model dependent extended fluctuation relation holds (see [37, Section 3.7]).
Our aim here is to derive a LDP and FRs for the individual heat fluxes \(\Phi (t)=(\Phi _i(t))_{i\in {\partial \mathscr {I}}}\). To motivate our approach, let us sketch a naive argument leading to the desired results.
The Gärtner-Ellis theorem (see, e.g., [15] or [14]) is a well-known route to the LDP. To follow it, one has to show the existence and some regularity properties of the large time limit
of the cumulant generating function
A simple calculation yields the expression
By Itô calculus, one has
where
is a deformation of the Markov generator (2.4), the matrices \(A_\xi \) and \(C_\xi \) being given by
A naive application of Girsanov formula yields
and in particular
Given the specific form of \(L_\xi \) and the fact that it generates a positivity preserving semigroup, it is natural to seek an eigenvector \(\Psi _\xi \) to its dominant eigenvalue \(\lambda _{\xi }=\max \{\mathfrak {R}\lambda \,|\,\lambda \in \mathop {\mathrm {sp}}\nolimits (L_{\xi })\}\) in the Gaussian form
A simple calculation shows that the eigenvalue problem splits into the following algebraic Riccati equation for a symmetric matrix X,
and the relation
where \(X_\xi \) denotes the maximalFootnote 3 solution of (2.16). From the structural relations (2.5)–(2.7) and the fact that \([\xi ,\vartheta ]=0\) one easily deduces that the formal adjoint \(L^*_\xi \) of \(L_\xi \) is given by
where the map \(\Theta \) is defined by \(\Theta f=f\circ \theta \).
Assuming \(L_\xi \) to have a non-vanishing spectral gap, we obtain
as \(t\rightarrow \infty \), and hence
provided the prefactor
is finite and positive.Footnote 4 The fluctuation relation
then follows from (2.17).
Assuming the limiting cumulant generating function \(\xi \mapsto \lambda _\xi \) to be everywhere differentiable on \(\Xi \), the Gärtner-Ellis theorem yields the LDP
where \(\dot{F}/{\bar{F}}\) denotes the interior/closure of the Borel set \(F\subset \Xi \), the rate function being given by the Legendre transform
Relation (2.19) thus translates into the FR
where we recognize, in the right-hand side, the entropy production rate corresponding to the heat flux \(\varphi \).
There are several issues with the above formal derivation. In particular, we can’t expect the fluctuation relation (2.20) (resp. (2.19)) to hold for all values of \(\varphi \in \Xi \) (resp. for all values of \(\xi \in \Xi \)). Indeed, by the contraction principle, the validity of (2.20) for all \(\varphi \in \Xi \) would entail the validity of (2.11) for all \(s\in \mathbb {R}\), in contradiction with the above mentioned result of [37]. The main contribution of the present work is to provide a rigorous proof of a large deviation principle for heat fluxes, an explicit formula for the rate function I and a description of the domain of validity of the universal relation (2.20).
3 Main results
3.1 The Limiting Cumulant Generating Function
In this paragraph, we first formulate the generalized detailed balance relation (see [18, Section 4.3] and [5, Section 2.2]) which plays a central role in our analysis. Then we state our main result on the large time asymptotics of the cumulant generating function (2.13).
Proposition 3.1
Given \(\xi \in \Xi \), we shall write \({\widetilde{\xi }}\rhd \xi \) whenever \({\widetilde{\xi }}\in L(\Gamma )\) is self-adjoint and satisfies
To such a \({\widetilde{\xi }}\), we associate the quadratic forms
and
and the measure \(\mu _{{\widetilde{\xi }}}\) on \(\Gamma \) defined by
Then, the following assertions hold:
-
(1)
\(\mu _{\widetilde{\xi }}\) and \(\sigma _{\widetilde{\xi }}\) satisfy
$$\begin{aligned} \mu _{\widetilde{\xi }}\circ \theta =\mu _{\widetilde{\xi }},\qquad \sigma _{\widetilde{\xi }}\circ \theta =-\sigma _{\widetilde{\xi }}. \end{aligned}$$ -
(2)
Denote by \(L^{\widetilde{\xi }}\) the formal adjoint of the Markov generator (2.4) w.r.t. the inner product of the Hilbert space \(L^2(\Gamma ,\mu _{\widetilde{\xi }})\). Then the generalized detailed balance relation
$$\begin{aligned} \Theta L^{\widetilde{\xi }}\Theta =L+\sigma _{\widetilde{\xi }}\end{aligned}$$holds.
-
(3)
There exists \({\widetilde{\xi }}\in L(\Gamma )\) satisfying (3.1) and such that \(\Sigma _{\widetilde{\xi }}=0\) iff
$$\begin{aligned} \mathrm {e}^{-\mathscr {Q}_{\widetilde{\xi }}(x)}L_\eta \mathrm {e}^{\mathscr {Q}_{\widetilde{\xi }}(x)}=L_{\eta +\xi } \end{aligned}$$holds for all \(\eta \in \Xi \). Moreover, under Condition (C), such a \({\widetilde{\xi }}\), if it exists, is unique and satisfies
$$\begin{aligned} \mathop {\mathrm {sp}}\nolimits ({\widetilde{\xi }})=\mathop {\mathrm {sp}}\nolimits (\xi ). \end{aligned}$$ -
(4)
The functional (2.14) can be written as
$$\begin{aligned} \langle \xi ,\Phi (t)\rangle =\mathscr {Q}_{\widetilde{\xi }}(x(t))-\mathscr {Q}_{\widetilde{\xi }}(x(0))+\int _0^t\sigma _{\widetilde{\xi }}(x(s))\mathrm {d}s. \end{aligned}$$
Given the structure of the map Q and the diagonal nature of \(\xi \), the existence of \({\widetilde{\xi }}\in L(\Gamma )\) satisfying (3.1) is obvious. Apart from Part (3), the proof of the previous proposition is identical to the elementary proof of Proposition 3.5 in [37] and we omit it. The first statement in (3) follows from an explicit calculation. One easily checks that \(\Sigma _{\widetilde{\xi }}=0\) is equivalent to \([A,{\widetilde{\xi }}]=0\), which implies that \({\widetilde{\xi }}A^nQ=A^nQ\xi \) for any \(n\ge 0\). Thus, Condition (C) immediately yields the uniqueness of \({\widetilde{\xi }}\). Similarly, one deduces from the relation \(({\widetilde{\xi }}-z)^{-1}A^nQ=QA^n(\xi -z)^{-1}\), obviously valid for \(z\in \mathbb {C}\setminus \mathbb {R}\), that \({\widetilde{\xi }}\) and \(\xi \) have the same spectrum.
Proposition 3.2
Assume that Condition (C) holds.
-
(1)
For \(\xi \in \Xi \) and \(\omega \in \mathbb {R}\), the operator
$$\begin{aligned} E_\xi (\omega )=Q^*(A^*-\mathrm {i}\omega )^{-1}\Sigma _{\widetilde{\xi }}(A+i\omega )^{-1}Q \end{aligned}$$(3.2)is self-adjoint on the complexification of \(\Xi \) and does not depend on the choice of \(\,{\widetilde{\xi }}\rhd \xi \). Moreover, the map \(\mathbb {R}\times \Xi \ni (\omega ,\xi )\mapsto E_\xi (\omega )\) is continuous.
-
(2)
The set
$$\begin{aligned} \mathscr {D}=\bigcap _{\omega \in \mathbb {R}}\{\xi \in \Xi \,|\,I-E_{\xi }(\omega )>0\} \end{aligned}$$is open, convex, centrally symmetric around the point \((2\vartheta )^{-1}\) and contains
$$\begin{aligned} \mathscr {D}_0=\{\xi \in \Xi \,|\,0<\xi <\vartheta ^{-1}\}. \end{aligned}$$Its lineality spaceFootnote 5 is given by
$$\begin{aligned} \mathscr {L}=\bigcap _{\omega \in \mathbb {R}}\{\xi \in \Xi \,|\,E_\xi (\omega )=0\} =\{\xi \in \Xi \,|\,\Sigma _{\widetilde{\xi }}=0\text { for some } {\widetilde{\xi }}\rhd \xi \}, \end{aligned}$$(3.3)and in particular \(\varvec{1}=(1,1,\ldots ,1)\in \mathscr {L}\).
-
(3)
The function
$$\begin{aligned} g(\xi )=-\int _{-\infty }^{+\infty }\log \det (I-E_\xi (\omega ))\frac{\mathrm {d}\omega }{4\pi }, \end{aligned}$$is convex and real analytic on \(\mathscr {D}\). It is centrally symmetric w.r.t. the point \((2\vartheta )^{-1}\) and translation invariant in the direction \(\mathscr {L}\), i.e.,
$$\begin{aligned} g(\vartheta ^{-1} - \xi )=g(\xi )=g(\xi +\eta ) \end{aligned}$$(3.4)for all \(\xi \in \mathscr {D}\) and \(\eta \in \mathscr {L}\). In particular, \(g(0)=g(\vartheta ^{-1})=0\).
-
(4)
One has
$$\begin{aligned} \mathscr {L}=\nabla g(\mathscr {D})^\perp , \end{aligned}$$and the following alternative holds: Either \(\mathscr {D}=\mathscr {L}=\Xi \) and g vanishes identically, or \(\mathscr {L}^\perp \not =\{0\}\) and g is strictly convex on the section \(\mathscr {S}=\mathscr {D}\cap \mathscr {L}^\perp \), the closure of \(\mathscr {S}\) being a compact convex subset of \(\mathscr {L}^\perp \).
-
(5)
For \(\xi \in \Xi \), define
$$\begin{aligned} K_{\xi }=\begin{bmatrix} -A_\xi&QQ^*\\ C_\xi&A^*_\xi \\ \end{bmatrix}, \end{aligned}$$(3.5)where \(A_{\xi }\) and \(C_{\xi }\) are given by (2.15). Then \(\mathscr {D}\) is the connected component of the point \(\xi =0\) in the set
$$\begin{aligned} \{\xi \in \Xi \,|\,\mathop {\mathrm {sp}}\nolimits (K_{\xi })\cap \mathrm {i}\mathbb {R}=\emptyset \}. \end{aligned}$$Moreover, for any \((\omega ,\xi )\in \mathbb {R}\times \mathscr {D}\) one has
$$\begin{aligned} \det (K_{\xi }-\mathrm {i}\omega )=|\det (A+\mathrm {i}\omega )|^2\det (I-E_{\xi }(\omega )). \end{aligned}$$ -
(6)
The function g has a bounded continuous extension to the closed set \(\,\overline{\mathscr {D}}\) which is given by
$$\begin{aligned} g(\xi )=\frac{1}{4}\mathop {\mathrm {tr}}\nolimits (Q\vartheta ^{-1}Q^*) -\frac{1}{4}\sum _{\lambda \in \mathop {\mathrm {sp}}\nolimits (K_\xi )}|\mathfrak {R}\lambda |m_\lambda , \end{aligned}$$(3.6)where \(m_\lambda \) denotes the algebraic multiplicity of \(\lambda \in \mathop {\mathrm {sp}}\nolimits (K_\xi )\).
-
(7)
For any finite \(\xi _0\in \partial \mathscr {D}\) one has
$$\begin{aligned} \lim _{\mathscr {D}\ni \xi \rightarrow \xi _0}|\nabla g(\xi )|=\infty . \end{aligned}$$Thus, setting \(g(\xi )=+\infty \) for \(\xi \in \Xi \setminus \overline{\mathscr {D}}\) yields an essentially smooth, essentially strictly convex, closed, proper convex function \(g:\Xi \rightarrow ]-\infty ,+\infty ]\).
-
(8)
For all \(\xi \in \overline{\mathscr {D}}\) the Riccati equation (2.16) has a maximal self-adjoint solution \(X_\xi \). The map \(\xi \mapsto X_\xi \) is continuous and concave on \(\overline{\mathscr {D}}\), and
$$\begin{aligned} g(\xi )=-\frac{1}{2}\mathop {\mathrm {tr}}\nolimits (Q^*(X_\xi -{\widetilde{\xi }}) Q). \end{aligned}$$(3.7)Moreover, setting \(D_\xi =A_\xi -BX_\xi \), the pair \((D_\xi ,Q)\) is controllable and \(\mathop {\mathrm {sp}}\nolimits (D_\xi )=\mathop {\mathrm {sp}}\nolimits (K_\xi )\cap \overline{\mathbb {C}_-}\).
The lineality subspace \(\mathscr {L}\) is related to conservation laws of the harmonic network. Indeed, under its Hamiltonian dynamics, the network evolves according to \(x_t=\mathrm {e}^{t\Omega }x_0\) and hence
for \(\xi \in \mathscr {L}\). It follows that, for any \(\xi \in \mathscr {L}\), the quadratic form \(\mathscr {Q}_{\widetilde{\xi }}\) is a first integral of the Hamiltonian flow. In particular the direction \(\varvec{1}\in \mathscr {L}\) and the invariance \(g(\xi +\lambda \varvec{1})=g(\xi )\) is related to the conservation of the total energy of the network, \(h=\mathscr {Q}_I\). This symmetry of the cumulant generating function of currents was already described, in the quantum setting, in [2], see also [8] for a detailed discussion.
It follows from [37, Theorem 3.13] that the entropy production rate of the network (2.10) is related to the function g by
Thus, \(\mathrm {ep}=0\) whenever the alternative \(\mathscr {L}=\Xi \) in Part (4) of Proposition 3.2 holds. In the following we shall avoid trivialities assuming, without further notice, that \(\mathrm {ep}>0\) and hence \(\mathscr {L}^\perp \not =\{0\}\) and
with \(\mathscr {S}=\mathscr {D}\cap \mathscr {L}^\perp \).
By Proposition 3.2 (8) the functions
are continuous and respectively convex/concave on \(\overline{\mathscr {D}}\). The function g of the preceding proposition is related to the limiting cumulant generating function (2.12) by the following
Proposition 3.3
Under Assumption (C) one has
where (compare this with the right-hand side of (2.18))
is a bounded, open, convex subset of \(\,\mathscr {D}\) such that
In particular, \(\mathscr {D}_\infty \) contains a neighborhood of 0.
3.2 Fluctuations of Conserved Quantities
As mentioned above, each \(\xi \in \mathscr {L}\) is associated to a first integral \(\mathscr {Q}_{\widetilde{\xi }}\) of the harmonic network. In this section, we briefly focus on these conserved quantities. Since \(\xi _\lambda =\xi +\lambda \varvec{1}\in \mathscr {L}\) and \({\widetilde{\xi }}_\lambda ={\widetilde{\xi }}+\lambda I>0\) for \(\lambda \in \mathbb {R}\) large enough, there is no loss of generality in assuming that \(\mathscr {Q}_{\widetilde{\xi }}\ge 0\). It follows from Proposition 3.1 (4) and [7, Proposition 2.2] that the law of
under \(\mathbb {P}_\mu \) converges towards a variance-gamma distribution
with density
whereFootnote 6\(m=2|\mathscr {I}|\), \(N={\widetilde{\xi }}^{1/2}M{\widetilde{\xi }}^{1/2}\), \(\sigma \) is the Lebesgue measure on the unit sphere \(S^{m-1}\) of \(\Gamma \), and K denotes a modified Bessel function [54]. As mentioned in the Introduction, the variance-gamma distribution as full support on \(\mathbb {R}\). Moreover, the above convergence is accompanied by a LDP: for any open set \(O\subset \mathbb {R}\),
with the rate function
This applies, in particular, to the fluctuations of the total energy which was the primary concern in [7].
3.3 A Local Fluctuation Theorem
While the results of the previous section quantify departures from the conservation laws, the main results of this paper deal with the component of the heat currents fluctuations which do not violate these conservation laws.
Recall that \(\mathscr {S}=\mathscr {D}\cap \mathscr {L}^\perp \) is the (precompact, convex) base of the cylinder \(\mathscr {D}\subset \Xi \). Denoting by \(\Pi \in L(\Xi )\) the orthogonal projection on \(\mathscr {L}^\perp \), setting \(\mathscr {S}_\infty =\Pi \mathscr {D}_\infty \) and defining the function \(I:\mathscr {L}^\perp \rightarrow [0,+\infty [\) by
a direct application of the Gärtner-Ellis theorem yields the following
Theorem 3.4
Assume that Condition (C) holds. Then, under the law \(\mathbb {P}_\mu \), the family \(\{\Pi \Phi (t)\}_{t\ge 0}\) satisfies a local LDP with the good rate function I given by (3.9), i.e., for any Borel set \(F\subset \mathscr {L}^\perp \), one has
where \(\dot{F}\) and \({\bar{F}}\) denote respectively the interior and the closure of F and
Moreover, for \(\varphi \in \mathscr {F}_0=\{\nabla g(\xi )\,|\,\xi \in \mathscr {S}_\infty \text { and } \vartheta ^{-1}-\xi \in \mathscr {S}_\infty \}\supset \nabla g(\mathscr {D}_0)\), the fluctuation relation
holds.
3.4 A Global Fluctuation Theorem
To improve on Theorem 3.4 and obtain a global LDP on \(\mathscr {L}^\perp \), we impose a further condition on the network:
(R)\( \displaystyle \min _{\xi \in \partial \mathscr {S}}\left( \Lambda _+(\xi )-\Lambda _-(\xi )\right) >0. \)
Since \(\Lambda _+-\Lambda _-\) is a concave function of \(\xi \) on \(\mathscr {S}\), Condition (R) ensures that it is positive on \(\mathscr {S}\).
Theorem 3.5
Assume that Conditions (C) and (R) hold. Then, under the law \(\,\mathbb {P}_\mu \), the family \(\{\Pi W(t)\}_{t\ge 0}\) satisfies a global LDP with the good rate function I given by (3.9), i.e., for any Borel set \(F\subset \mathscr {L}^\perp \), one has
where \(\dot{F}\) and \({\bar{F}}\) denote respectively the interior and the closure of F. Moreover, the fluctuation relation
holds for all \(\varphi \in \mathscr {L}^\perp \).
Remark
We stress that Theorem 3.5 only gives sufficient conditions for the global validity of the LDP. We conjecture that Condition (C) alone is sufficient for (3.11) to hold for all Borel sets \(F\subset \mathscr {L}^\perp \) (with the rate function I given by (3.9)). However, we were not able to prove this claim, and in particular we are not aware of any general result in the theory of large deviations which would imply it. We leave this conjecture as an interesting open problem.
Discussion
-
1.
The quantity \(-\langle \vartheta ^{-1},\varphi \rangle \) is the entropy production rate associated with a given heat current \(\varphi \in \mathscr {L}^\perp \) (compare with (2.9)). Thus, the FR (3.12) implies that current fluctuations \(\varphi \) with negative entropy production rate are exponentially suppressed, as \(t\rightarrow \infty \), relative to the opposite fluctuation \(-\varphi \) (which has positive entropy production rate).
-
2.
We note that, according to the results of [37], the entropy production (2.9) never satisfies the FR (2.11) globally (i.e., for all \(s\in \mathbb {R}\)). This is in sharp contrast with the component of the heat flux along \(\mathscr {L}^\perp \). Indeed, we will see in the next section that a global FR is possible in this case.
-
3.
Suppose that, in view of the above conjecture, Condition (R) being violated, the LDP (3.11) holds for all Borel sets \(F\subset \mathscr {L}^\perp \). It follows that the FR (3.12) does not hold for all \(\varphi \in \mathscr {L}^\perp \) but is replaced by an extended—i.e., non-universal—FR on the unbounded set
$$\begin{aligned} \{\varphi \in \mathscr {L}^\perp \,|\,\text {either }\varphi \not \in \nabla g(\mathscr {S}_\infty ) \text { or }-\varphi \not \in \nabla g(\mathscr {S}_\infty )\}, \end{aligned}$$as illustrated in Figure 3, below. The graph of the rate function in the region \(\mathscr {L}^\perp \setminus \nabla g(\mathscr {S}_\infty )\) has the peculiar form of a ruled surface. More precisely, for \(\xi \in \partial \mathscr {S}_\infty \) let \(\varphi _0=\nabla g(\xi )\) and denote by \(\eta \) the exterior normal to \(\mathscr {S}_\infty \) at \(\xi \). Then, one has \(I(\varphi _0+\lambda \eta )=I(\varphi _0)+\lambda \eta \cdot \xi \) for all \(\lambda >0\). This behavior generalizes to the multivariate case the affine character of the rate function of scalar observables (heat, entropy production,...) found in [3, 13, 21, 37, 45, 53]. In these circumstances, the rate function is convex on \(\mathscr {L}^\perp \) but fails to be strictly convex on the complement of \(\nabla g(\mathscr {S}_\infty )\). As in the scalar cases mentioned above, the somewhat striking consequence of these extended FRs is an increase of the probability ratio for current fluctuations with negative to positive entropy production rate.
-
4.
From a mathematical perspective, deviations from the FR (3.12) are due to the lack of essential smoothness of the limiting cumulant generating function (3.8). In the setting of the present work, this can be traced back to the divergence of a pre-exponential factor in the asymptotic expansion of the cumulant generating function (the term \(d_t\) in (5.13) below). The analysis of [44, 45] suggests that this phenomenon may remain relevant beyond the harmonic/Gaussian setting, however we are not aware of any rigorous result in this direction.
-
5.
From the physical point of view, the circumstances leading to the failure of the global FR (3.12) are still not well understood. However, the examples below tend to indicate that the strength of the thermal drive (and hence large currents) is one determinant factor.
4 Examples
Observe that Eq. (3.2) implies that the matrix \(E_\xi (\omega )\) and thence the function \(g(\xi )\) are invariant under the simultaneous rescaling
with \(\lambda >0\). Furthermore, one easily checks that, under the same rescaling, the maximal solution to the Riccati equation (2.16) obeys \(X_\xi \mapsto \lambda X_\xi \), so that \(\Lambda _\pm (\xi )\mapsto \lambda \Lambda _\pm (\xi )\). Consequently, without losing in generality, we shall fix the average temperature according to
in all our examples, denoting temperature ratios by \([\vartheta _1:\vartheta _2:\cdots ]\). For systems out of thermal equilibrium, we shall also use a special system of cartesian coordinates on the space \(\mathscr {L}^\perp \): we set its origin at the orthogonal projection of the symmetry center \(\left( (2\vartheta _i)^{-1}\right) _{i\in {\partial \mathscr {I}}}\) on \(\mathscr {L}^\perp \), and chose the first basis vector along the same direction. Finally, we note that in all the examples below, it is straightforward to check that Condition (C) is satisfied and that \(\mathscr {L}=\mathbb {R}\varvec{1}\). We shall therefore concentrate our discussions on the validity of Condition (R).
4.1 A Lozenge Network
As a first example of numerical exploitation of our scheme, we investigate some properties of the Lozenge network of Fig. 1. With \(|\mathscr {I}|=4\) and \(|{\partial \mathscr {I}}|=3\), the parameters of the model are given by
Consider first the case of thermal equilibrium: [1 : 1 : 1]. The mean heat fluxes vanish, \({\bar{\varphi }}=0\). From the right pane of figure 1, which shows the functions \(\mathscr {S}\ni \xi \rightarrow \Lambda _\pm (\xi )\), one infers that Condition (R) is verified so that, by Theorem 3.5, the global LDP (3.11) holds with a rate function I satisfying the FR (2.20) on \(\mathscr {L}^\perp \).
By continuity, Condition (R) persists sufficiently near thermal equilibrium so that the same conclusions hold there. Figure 2 shows the “spectral gap” \(\Lambda _+-\Lambda _-\) on the boundary of the set \(\mathscr {S}\) for different temperature ratios. It appears that this gap eventually closes (i.e., takes non-positive values) when the temperature differences become large. We conclude that the FR (2.20) breaks down in this regime. This is illustrated on Figure 3 where the rate function I and the anomalous fluctuation function \(\Delta (\varphi )=I(\varphi )-I(-\varphi )-\vartheta ^{-1}\cdot \varphi \) are plotted for the temperature ratios [1 : 2 : 64].
4.2 A Triangular Network
Our second example is the triangular network already considered in [37] and illustrated on the left pane of Fig. 4. Here we have \(|\mathscr {I}|=6\), \(|{\partial \mathscr {I}}|=3\) and the parameters are
In thermal equilibrium one finds that \(\mathscr {S}\) is the disk of radius \(\sqrt{3}/2\) centered at 0 on \(\mathscr {L}^\perp \). The spectral gap \(\Lambda _+-\Lambda _-\) is open, as seen on the right pane of Figure 4. Hence, here again, Theorem 3.5 applies: the global LDP (3.11) and the FR (2.20) hold on \(\mathscr {L}^\perp \) near equilibrium. The spectral gap on the boundary \(\partial \mathscr {S}\) is plotted in Figure 5 for various temperature ratios \([\vartheta _1:\vartheta _2:\vartheta _3]\). One observes a similar behavior as in our first example.
4.3 A Heat Pump Network
Our last example is the heat pump network of [20], see Figure 6. With \(|\mathscr {I}|=6\) and \(|{\partial \mathscr {I}}|=4\), the parameters:
were chosen in [20] in such a way that the mean steady heat current between the vertices 5 and 6 vanishes while the heat flows from the hot reservoir to the cold one on the left side, and from the cold to the hot one on the right side. Thus, the right side of the device acts as a heat pump. On the right pane of Figure 6 we plot the two functions \(\Lambda _\pm \) on the boundary \(\partial \mathscr {S}\).Footnote 7 Condition (R) is satisfied, so that the global large deviation principle (3.11) and the FR (2.20) hold on \(\mathscr {L}^\perp \) near this non-equilibrium heat pump regime.
As shown in Fig. 7, here again the spectral gap closes as the temperatures differences increase.
5 Proofs
5.1 Proof of Proposition 3.2
In order to prove the proposition, we shall need the following two Lemmas.
Lemma 5.1
Let A and B be linear operators on a finite dimensional Hilbert space and \(\mathbb {C}=V_-\cup \ell \cup V_+\) a partition of the plane into two open half-planes \(V_{\pm }\) and a separating line \(\ell \). Assume that \((\mathop {\mathrm {sp}}\nolimits (A)\cup \mathop {\mathrm {sp}}\nolimits (B))\cap \ell =\emptyset \) and that A and B have the same number of repeated eigenvalues in \(V_+\). Denote by \(A_\pm \) and \(B_\pm \) the parts of A and B corresponding to their spectra in \(V_\pm \). Let K be a compact set such that \((\mathop {\mathrm {sp}}\nolimits (A)\cup \mathop {\mathrm {sp}}\nolimits (B))\cap V_+\subset K\subset V_+\). Then the following holds:
-
(1)
The function \(f(z)=\log \det \left( (z-A)^{-1}(z-B)\right) \) is analytic in \(V_+\setminus K\).
-
(2)
For any Jordan curve \(\gamma \) in \(V_+\setminus K\) “enclosing” K
$$\begin{aligned} \oint _\gamma f(z)\frac{\mathrm {d}z}{2\pi \mathrm {i}}=-\oint _\gamma zf'(z)\frac{dz}{2\pi \mathrm {i}} =\mathop {\mathrm {tr}}\nolimits (A_+-B_+). \end{aligned}$$(5.1)
Proof
(1) By assumption we can enumerate the repeated eigenvalues of A and B in such a way that
with
In terms of these eigenvalues, we have
The Möbius transformation \(z\mapsto \frac{z-b}{z-a}\) maps the interior of the complement of any open neighborhood of the line segment joining a to b to a simply connected open subset of \(\mathbb {C}\setminus \{0\}\). It follows that the function \(\log \frac{z-b}{z-a}\) is analytic on the complement of any neighborhood of the segment joining a to b. Thus, the functions \(z\mapsto \log \frac{z-\mu _j^+}{z-\lambda _j^+}\) and \(z\mapsto \log \frac{z-\mu _i^-}{z-\lambda _i^-}\) are analytic in \(V_+\setminus K\) and so is f(z).
(2) The first identity in (5.1) now follows from integration by parts. Finally, noticing that
the second identity follows from the Riesz formula
\(\square \)
Lemma 5.2
Let A and B be as in the previous Lemma, where \(\ell \) is the imaginary axis and \(V_+\) the right half-plane. Then
In particular, if the spectra of A and B are symmetric w.r.t. \(\ell \), then \(\mathop {\mathrm {tr}}\nolimits (A_-)=-\overline{\mathop {\mathrm {tr}}\nolimits (A_+)}\) and similarly for B, so
where \(m_\lambda \) denotes the algebraic multiplicity of the eigenvalue \(\lambda \).
Proof
Denote by \(\gamma _R\) the positively oriented boundary of the intersection of the disk of radius R centered at 0 with the right half-plane. Applying the previous Lemma and observing that f is analytic in a neighborhood of \(\ell \) we get, for R large enough,
To evaluate the second integral on the right-hand side we note that, as \(R\rightarrow \infty \),
so
and
It follows that
and we conclude that
The last two statements follow from elementary calculations. \(\square \)
We now turn to the proof of Proposition 3.2
(1) Set \(R(\omega )=\vartheta ^{-1}Q^*(A+\mathrm {i}\omega )^{-1}Q\). Assumption (C) implies that A is stable (see (2.5)) so that the map \(\mathbb {R}\ni \omega \mapsto R(\omega )\in L(\Xi )\) is continuous. Using the identities
a simple calculation yields that for any \({\widetilde{\xi }}\rhd \xi \in \Xi \) one has
with \(\zeta =\vartheta ^{1/2}\xi \vartheta ^{1/2}\). All the stated properties immediately follow.
(2) It will be convenient to introduce \(U(\omega )=I+R(\omega )\) and to rescale \(\xi \) by setting \(\zeta =(\xi _i\vartheta _i)_{i\in {\partial \mathscr {I}}}\). With this change of variable
and
It immediately follows that \(\mathscr {D}_0\subset \mathscr {D}\). Moreover, since \(I-F_\zeta (\omega )\) is an affine function of \(\zeta \), \(\mathscr {D}\) is convex.
Using (2.5), elementary calculations (see [37, Section 5.5]) show that, for any \(\omega \in \mathbb {R}\), \(U(\omega )^{-1}=U(-\omega )\) and \(|\det (U(\omega ))|=1\). The first relation allows us to derive
and the second one yields that \(\zeta \in \mathscr {D}\Longleftrightarrow I-\zeta \in \mathscr {D}\), which shows that \(\mathscr {D}\) is centrally symmetric around the point I / 2. To show that it is open, we now argue that its complement \(\mathscr {D}^\mathrm {c}\) is closed. Indeed, \(\xi \in \mathscr {D}^\mathrm {c}\) iff there exists \(\omega \in \mathbb {R}\) such that \(E_\xi (\omega )\) has an eigenvalue \(\lambda \ge 1\). Thus, if \(\xi _n\) is a sequence in \(\mathscr {D}^\mathrm {c}\) which converges to \(\xi \), there exist \(\omega _n\in \mathbb {R}\), \(\lambda _n\ge 1\) and unit vectors \(u_n\) such that \(E_{\xi _n}(\omega _n)u_n=\lambda _n u_n\). Given the fact that
one concludes that \(|\omega _n|\le C^{1/2}-1\) and \(\lambda _n\in [1,C]\). Thus the sequence \((\xi _n,\omega _n,\lambda _n,u_n)\) has a convergent subsequence with limit \((\xi ,\omega ,\lambda ,u)\) satisfying \(E_{\xi }(\omega )u=\lambda u\), \(\lambda \ge 1\) and \(u\not =0\). Hence, \(\xi \in \mathscr {D}^\mathrm {c}\) and consequently \(\mathscr {D}^\mathrm {c}\) is closed.
If the map \(\omega \mapsto E_\eta (\omega )\) vanishes identically on \(\mathbb {R}\), then it follows from the identity \(I-E_{\xi +\lambda \eta }(\omega )=I-E_{\xi }(\omega )\) that \(\mathscr {D}+\lambda \eta \subset \mathscr {D}\) for all \(\lambda \in \mathbb {R}\). Reciprocally, if the later condition holds, then for all \(\xi \in \mathscr {D}\) and all \(\lambda >0\), one has
Letting \(\lambda \rightarrow \infty \) we deduce that \(E_\eta (\omega )\) vanishes identically. Note that the later condition is satisfied for any \(\eta \) such that there exists \({\widetilde{\eta }}\rhd \eta \) with \(\Sigma _{\widetilde{\eta }}=0\). This is in particular the case for \(\eta =\varvec{1}\) and \({\widetilde{\eta }}=I_\Gamma \). Reciprocally, since it follows from Condition (C) that the set \(\{(A+\mathrm {i}\omega )^{-1}Qu\,|\omega \in \mathbb {R},u\in \mathbb {C}^{\partial \mathscr {I}}\}\) is total in \(\mathbb {C}^{\mathscr {I}}\), one concludes that \(E_\xi (\omega )=0\) for all \(\omega \in \mathbb {R}\) implies \(\Sigma _{\widetilde{\xi }}=0\) for any \({\widetilde{\xi }}\rhd \xi \).
(3) Consider first the map \(\mathscr {D}\ni \eta \mapsto -\log \det (I-F_\eta (\omega ))\) for fixed \(\omega \in \mathbb {R}\). The previous discussion clearly implies that it is real analytic. Its convexity follows from an elementary calculation which yields that
for \(\eta \in \mathscr {D}\) and \(z\in \mathbb {C}^{\partial \mathscr {I}}\). From (5.2) one further deduces that \(F_\eta (\omega )=O(\omega ^{-2})\) as \(|\omega |\rightarrow \infty \), locally uniformly in \(\eta \in \mathscr {D}\). It follows that
is convex and real analytic on \(\mathscr {D}\). The identity (5.3) leads to
and in particular \(\det (I-F_{\varvec{1}}(\omega ))=1\). This proves the first equality in (3.4). The second one follows from (3.3) and the linearity of the map \(\eta \mapsto F_\eta \).
(4) The second equality in (3.4) implies that \(\eta \cdot \nabla g(\xi )=0\) for all \(\xi \in \mathscr {D}\) and all \(\eta \in \mathscr {L}\). To establish the reciprocal property, note that since \(\mathscr {D}\) is open, for any \(\xi \in \mathscr {D}\) and any \(\eta \in \nabla g(\mathscr {D})^\perp \) there exists \(\epsilon >0\) such that \(\xi +\alpha \eta \in \mathscr {D}\) for \(|\alpha |<\epsilon \). It follows that the function \(\alpha \mapsto g(\xi +\alpha \eta )\) is constant in a real neighborhood of 0 and hence extends by analyticity to the constant function on the line \(\xi +\mathbb {R}\eta \). Since, by Part (7), g is singular on \(\partial \mathscr {D}\), it follows that \(\xi +\mathbb {R}\eta \subset \mathscr {D}\), i.e., \(\eta \in \mathscr {L}\).
Consequently, g vanishes identically whenever \(\mathscr {L}=\Xi \). In the opposite case, the calculation in Part (3) gives that the Hessian of g satisfies
for non-zero \(\eta \not \in \mathscr {L}\). It follows that the restriction of \(g''(\xi )\) to \(\mathscr {L}^\perp \) is positive definite which implies that the restriction of g to \(\mathscr {S}\) is strictly convex. To show that the closure of \(\mathscr {S}\) is compact, let us assume that \(\mathscr {S}\) is unbounded. Since \(\mathscr {S}\) is convex and centrally symmetric w.r.t. the orthogonal projection \(\xi _0\) of \((2\vartheta )^{-1}\) onto \(\mathscr {L}^\perp \), it follows that for some non-vanishing \(\xi \in \mathscr {L}^\perp \) one has \(\xi _0+\lambda \xi \in \mathscr {S}\) for all \(\lambda \in \mathbb {R}\), i.e.,
for all \(\omega \in \mathbb {R}\). Letting \(|\lambda |\rightarrow \infty \) yields that \(\xi \in \mathscr {L}\) which contradicts the fact that \(0\not =\xi \in \mathscr {L}^\perp \).
(5) We start with some simple consequences of Condition (C). For a short introduction to the necessary elementary material, we refer the reader to [39, Section 4]. Since \(A_\xi =A+Q\xi Q^*\), the pair \((A_\xi ,Q)\) is controllable for all \(\xi \). The relation \(A^*_\xi =-A_{\vartheta ^{-1}-\xi }\) shows that the same is true for the pair \((A^*_\xi ,Q)\). Thus, one has
for all \(\xi \). This implies that if \(Q^*u=0\) and \((A_\xi -z)u=0\) or \((A^*_\xi -z)u=0\), then \(u=0\), i.e., no eigenvector of \(A_\xi \) or \(A^*_\xi \) can live in \(\mathop {\mathrm {Ker}}\nolimits Q^*\). Assume now \(z\in \mathop {\mathrm {sp}}\nolimits (A_\xi )\) and let \(u\ne 0\) be a corresponding eigenvector. Since
taking the real part of \(\langle u,(A_\xi -z)u\rangle =0\) we infer
Thus, controllability of \((A_\xi ,Q)\) implies that for \(\pm (\xi -(2\vartheta )^{-1})>0\) one has \(\mathop {\mathrm {sp}}\nolimits (A_\xi )\subset \mathbb {C}_\pm \) and in particular \(\mathop {\mathrm {sp}}\nolimits (A_\xi )\cap \mathrm {i}\mathbb {R}=\emptyset \). Hence, for \(\xi >(2\vartheta )^{-1}\) and \(\omega \in \mathbb {R}\), Schur’s complement formula yields
where we have set
One easily checks that \(r_\xi (\omega )=r_0(\omega )(I+\xi r_0(\omega ))^{-1}\) from which a simple calculation gives
Inserting the last identity into the right-hand side of (5.4) and using the fact that
we obtain
Both sides of this identity being polynomials in \(\xi \), it extends to all \(\xi \in \Xi \). It follows that
By continuity of the function \(\xi \mapsto \displaystyle \min _{\omega \in \mathbb {R}}\det (I-E_\xi (\omega ))\), \(\mathscr {D}\) is the connected component of the point \(\xi =0\) in the left-hand side of this identity.
(6) For \(\xi \in \Xi \), \(K_\xi \) is \(\mathbb {R}\)-linear on the real vector space \(\Gamma \oplus \Gamma \). Thus, its spectrum is symmetric w.r.t. the real axis. Observing that \(J K_\xi +K_\xi ^*J=0\), where J is the unitary operator
we conclude that the spectrum of \(K_\xi \) is also symmetric w.r.t. the imaginary axis. Assume now that \(\xi \in \mathscr {D}\). Since the eigenvalues of \(K_\xi \) are continuous functions of \(\xi \), \(K_\xi \) and \(K_0\) have the same number of repeated eigenvalues in the left/right half-plane. From (5.5) we deduce
and Lemma 5.2 allows us to conclude that
Since \(\mathop {\mathrm {sp}}\nolimits (K_0)=\overline{\mathop {\mathrm {sp}}\nolimits (A)}\cup \mathop {\mathrm {sp}}\nolimits (-A)\) and A is stable, we have
and (3.6) follows for \(\xi \in \mathscr {D}\). Since the eigenvalues of \(K_\xi \) are continuous functions of \(\xi \in \Xi \), this relation extends to \(\xi \in \overline{\mathscr {D}}\). The boundedness of this extension follows from the translation invariance along \(\mathscr {L}\) and the precompactness of \(\mathscr {S}\).
(7) The idea of the proof is that \(\xi _0\in \partial \mathscr {D}\) iff at least one eigenvalue of \(E_{\xi _0}(\omega )\) reaches its global maximum 1 at some \(\omega _0\in \mathbb {R}\). Since \(E_{\xi _0}(\omega )\) is a real analytic function of \(\omega \), the function \(\mathop {\mathrm {tr}}\nolimits \left( (1-E_{\xi _0}(\omega ))^{-1}\right) \) has a pôle at \(\omega =\omega _0\), and since this function is non-negative the order of this pôle must be even. Consequently,
For \(\xi \in \mathscr {D}\), a simple calculation and Cauchy–Schwarz inequality yield
Since \(0\not \in \partial \mathscr {D}\), it suffices to show that the integral on the right-hand side diverges to \(+\infty \) as \(\xi \rightarrow \xi _0\in \partial \mathscr {D}\). Let us fix \(\xi _0\in \partial \mathscr {D}\) and set \(V_\delta =\{\xi \in \mathscr {D}\,|\,|\xi -\xi _0|<\delta \}\). Elementary considerations show that for sufficiently small \(\delta >0\) and sufficiently large \(M>0\) there exists a constant C such that
for any \(\xi \in V_\delta \). Making \(\delta \) smaller and M larger if necessary, we can assume that \(J_\xi (\omega )=(I-E_{\xi -\xi _0}(\omega ))^{-1/2}\) satisfies \(1/\sqrt{2}\le J_\xi (\omega )\le \sqrt{2}\) for \((\omega ,\xi )\in [-M,M]\times V_\delta \). Writing
and observing that this implies, in particular, that \(I-J_\xi (\omega )E_{\xi _0}(\omega )J_\xi (\omega )>0\), we derive
By Fatou’s lemma
and by the above argument, the last integral is \(+\infty \).
(8) The existence and uniqueness of the maximal solutions of Eq. (2.16) as well as the stated properties of \(D_\xi \) follow from [39, Theorems 7.3.7 and 7.5.1], Part (5), and the relation \(D_\xi =A+Q(\xi Q^*-Q^*X_\xi )\). It further follows from (3.6) and the symmetries of \(\mathop {\mathrm {sp}}\nolimits (K_\xi )\) discussed in the proof of Part (5) that
The proof of Proposition 3.2 is complete.
5.2 Proof of Proposition 3.3
5.2.1 Some Properties of the Algebraic Riccati Equations (2.16)
In order to prove Proposition 3.3 we shall need some properties of the algebraic Riccati equation
This is the purpose of the following proposition which provides a generalization of [37, Proposition 5.5]. In the sequel, whenever we mention a solution of (5.6), we always mean a self-adjoint\(X\in L(\Gamma )\) such that \(\mathscr {R}_\xi (X)=0\). We say that such a solution X is maximal (resp. minimal) if any other solution \(X'\in L(\Gamma )\) satisfies \(X'\le X\) (resp. \(X'\ge X\)).
Proposition 5.3
Assume that Condition (C) holds.
-
(1)
For \(\xi \in \mathscr {D}\) the Riccati equation (5.6) has a unique maximal solution \(X_\xi \) and a unique minimal solution \(-\theta X_{\vartheta ^{-1} - \xi }\theta \). Moreover, the matrix
$$\begin{aligned} D_\xi =A_\xi -BX_\xi \end{aligned}$$is stable and satisfies
$$\begin{aligned} Y_\xi =X_\xi +\theta X_{\vartheta ^{-1}-\xi }\theta >0. \end{aligned}$$ -
(2)
If \(\xi _0\in \partial \mathscr {D}\) is finite, then the non-tangential limit
$$\begin{aligned} X_{\xi _0}=\lim _{\mathscr {D}\ni \xi \rightarrow \xi _0} X_{\xi } \end{aligned}$$exists and is the maximal solution of the corresponding limiting Riccati equation \(\mathscr {R}_{\xi _0}(X)=0\).
-
(3)
The function \(\mathscr {D}\ni \xi \mapsto X_\xi \in L(\Gamma )\) is real analytic and concave. Moreover, \(X_\xi <0\) for \(\xi <0\), \(X_\xi >0\) for \(\xi \) in the convex hull of the set \(\mathscr {D}_0\cup \{\xi \in \mathscr {D}\,|\,\xi >\vartheta ^{-1}\}\), \(X_0=0\) and \(X_{\vartheta ^{-1}}=\theta M^{-1}\theta \).
-
(4)
For any \(\xi \in \overline{\mathscr {D}}\) and \(\eta \in \mathscr {L}\) one has
$$\begin{aligned} X_{\xi +\eta }=X_\xi +{\widetilde{\eta }}. \end{aligned}$$ -
(5)
For \(t>0\), set
$$\begin{aligned} M_{\xi ,t}=\int _0^t\mathrm {e}^{sD_\xi }B\mathrm {e}^{sD^*_\xi }\mathrm {d}s>0. \end{aligned}$$Then, for all \(\xi \in \overline{\mathscr {D}}\) one has
$$\begin{aligned} \lim _{t\rightarrow \infty }M^{-1}_{\xi ,t}=\inf _{t>0}M^{-1}_{\xi ,t}=Y_\xi \ge 0, \end{aligned}$$and \(\ker (Y_\xi )\) is the spectral subspace of \(D_\xi \) corresponding to its imaginary eigenvalues.
-
(6)
Set \(\Delta _{\xi ,t}=M^{-1}_{\xi ,t}-Y_\xi \). For all \(\xi \in \overline{\mathscr {D}}\), one has
$$\begin{aligned} \mathrm {e}^{tD^*_\xi }M^{-1}_{\xi ,t}\mathrm {e}^{tD_\xi }=\theta \Delta _{\vartheta ^{-1}-\xi ,t}\theta , \end{aligned}$$(5.7)and
$$\begin{aligned} \lim _{t\rightarrow \infty }\frac{1}{t}\log \det (\Delta _{\xi ,t})=4g(\xi )-\mathop {\mathrm {tr}}\nolimits (Q\vartheta ^{-1} Q^*). \end{aligned}$$In particular, for \(\xi \in \mathscr {D}\), \(\Delta _{\xi ,t}\rightarrow 0\) exponentially fast as \(t\rightarrow \infty \).
-
(7)
Let \(\widetilde{D}_\xi =\theta D_{\vartheta ^{-1}-\xi }\theta \). Then
$$\begin{aligned} Y_\xi \mathrm {e}^{t\widetilde{D}^*_\xi }=\mathrm {e}^{tD^*_\xi }Y_\xi \end{aligned}$$for all \(\xi \in \overline{\mathscr {D}}\) and \(t\in \mathbb {R}\).
-
(8)
For \(\xi \in \mathscr {D}\) and \(\eta \in \Xi \)
$$\begin{aligned} \eta \cdot \nabla g(\xi )=\frac{1}{2}\mathop {\mathrm {tr}}\nolimits \left( \Sigma _{\widetilde{\eta }}Y_\xi ^{-1}\right) \end{aligned}$$
Proof
We refer to [39] for a detailed introduction to algebraic Riccati equations (see also the Appendix in [37] for a summary of the necessary basic facts). Our proof is similar to that of [37, Proposition 5.5]. The Hamiltonian matrix \(K_{\xi }\) associated to the Riccati equation (5.6) is given by Eq. (3.5). Let \(\mathscr {H}\) be the complex Hilbert space \(\mathbb {C}\Xi \oplus \mathbb {C}\Xi \) on which \(K_{\xi }\) acts. The operator
acts unitarily on \(\mathscr {H}\). We have already observed in the proof of Proposition 3.2 (6) that for \(\xi \in \Xi \), the spectrum of \(K_{\xi }\) is symmetric w.r.t. the real axis and the imaginary axis. The time-reversal covariance relations
which follow easily from the definitions of the operators \(A_\xi \), B, \(C_\xi \), further yield \(\Theta K_\xi -K_{\vartheta ^{-1} -\xi }\Theta =0\) which implies
Let \(\mathscr {H}_- (K_\xi )\) be the spectral subspace of \(K_\xi \) for the part of its spectrum in the open left half-plane \(\mathbb {C}_-\).
(1) By Proposition 3.2 (5), \(\mathop {\mathrm {sp}}\nolimits (K_\xi )\cap \mathrm {i}\mathbb {R}=\emptyset \) for \(\xi \in \mathscr {D}\) and the existence and uniqueness of the maximal and minimal solutions of the Riccati equation (5.6) follow from [39, Theorems 7.3.7 and 7.5.1]. The relation between minimal and maximal solutions is a consequence of the Relations (5.8) which imply that
By [39, Theorems 7.5.1], the maximal solution \(X_\xi \) is related to the spectral subspace \(\mathscr {H}_-(K_\xi )\) by
moreover, \(\mathop {\mathrm {sp}}\nolimits (D_\xi )=\mathop {\mathrm {sp}}\nolimits (K_\xi )\cap \mathbb {C}_-\).
\(Y_\xi =X_\xi +\theta X_{\vartheta ^{-1} - \xi }\theta \) is called the gap of Eq. (5.6). As the difference between its maximal and minimal solutions, it is obviously non-negative. It further has the remarkable property that for any solution X, \(\ker (Y_\xi )\) is the spectral subspace of \(A_\xi - BX\) for the part of its spectrum in \(\mathrm {i}\mathbb {R}\) [39, Theorem 7.5.3]. Since \(\mathop {\mathrm {sp}}\nolimits (D_\xi )\subset \mathbb {C}_-\), we must have \(Y_\xi >0\).
(2) Let \(\xi _0\in \partial \mathscr {D}\) be finite and \(\eta \not =0\) be non-tangential to \(\partial \mathscr {D}\) at \(\xi _0\). Set \(\xi _t=\xi _0-t\eta \). W.l.o.g. we may assume that \(\xi _1\in \mathscr {D}\). The function
is concave and its first derivative vanishes at \(t=1\). Hence, it is monotone non-decreasing. We claim that the set \(\{ X_\xi \, |\,\xi \in \mathscr {D},|\xi |<r\}\) is bounded in \(L(\Gamma )\) for any finite r. It thus follows that
exists. By continuity, one has \(\mathscr {R}_{\xi _0}(X)=0\) and \(\mathop {\mathrm {sp}}\nolimits (A_{\xi _0}-BX)\subset \overline{\mathbb {C}}_-\), and it follows from [39, Theorem 7.5.1] that X is the maximal solution of the limiting Riccati equation. In particular, the non-tangential limit exists (i.e., does not depend on the direction).
To prove our claim, we first derive a bound on \({\hat{X}}_\xi =Q^*X_\xi Q\). Using \((Q^*X_\xi Q)^2\le \Vert Q\Vert ^2 Q^*X_\xi ^2Q\), one easily deduces from (5.6) and Cauchy-Schwarz inequality
where \(a_\xi \) and \(b_\xi \) are locally bounded functions of \(\xi \). Solving the resulting quadratic inequality yields that \(\mathop {\mathrm {tr}}\nolimits ({\hat{X}}_\xi ^2)\), and hence \(\mathop {\mathrm {tr}}\nolimits (Q^*X_\xi ^2Q)\) are locally bounded as functions of \(\xi \). Rewriting (5.6) as the Lyapunov equation
and using the fact that A is stable, we get
It follows that for any \(T\in L(\Gamma )\)
from which one easily concludes that \(\Vert X_\xi \Vert \) is locally bounded.
(3) The spectral projection of \(K_{\xi }\) for the part of its spectrum in \(\mathbb {C}_+\) can be written as
Since \(\mathscr {D}\ni \xi \mapsto P_\xi \) is real analytic by regular perturbation theory, \(Y_\xi ^{-1}\) and \(X_\xi Y_\xi ^{-1}\) are real analytic function of \(\xi \in \mathscr {D}\). The same holds for \(Y_\xi \) and \(X_\xi =X_\xi Y_\xi ^{-1} Y_\xi \).
Invoking the implicit function theorem and using the stability of \(D_\xi \), one easily computes derivatives of the map \(\mathscr {D}\ni \xi \mapsto X_\xi \). The first derivative is the linear map
where, as usual, we identify \(\eta \in \Xi \) with the corresponding diagonal matrix in \(L(\Xi )\) and \({\widetilde{\eta }}\rhd \eta \). The second derivative is the quadratic form
and concavity follows from the obvious fact that \(X''_\xi [\eta ]\le 0\).
To prove the inequalities let us rewrite the Riccati equation (5.6) as a Lyapunov equation
and recall that, as established in the proof of Proposition 3.2 (5), \(\mp A_\xi \) is stable for \(\pm (\xi -(2\vartheta )^{-1})>0\). Thus, we have
and since \(C_\xi \le 0\) for \(|\xi -(2\vartheta )^{-1}|\ge (2\vartheta )^{-1}\), we conclude that \(X_\xi \le 0\) for \(\xi <0\) and \(X_\xi \ge 0\) for \(\xi >\vartheta ^{-1}\). The controllability of \((\mp A_\xi ,Q)\) yields that these inequalities for \(X_\xi \) are strict. Writing (5.6) as
the stability of \(D_\xi \) gives
and since \(C_\xi \ge 0\) for \(\xi \in \mathscr {D}_0\), we can conclude that \(X_\xi \ge 0\) for such \(\xi \). The controllability of \((D_\xi ,Q)\) again yields the strict inequality. The concavity of the map \(\xi \mapsto X_\xi \) implies that the subset of all \(\xi \in \mathscr {D}\) such that \(X_\xi >0\) is convex, so that it contains the convex hull of \(\mathscr {D}_0\cup \{\xi \in \mathscr {D}\,|\,\xi >\vartheta ^{-1}\}\).
From
we deduce \(X_0=0\). To prove the last assertion, starting from Eq. (5.6) and invoking Relations (5.8) one shows that \(\widehat{M}=\theta X_{\vartheta ^{-1}}^{-1}\theta \) satisfies the Lyapunov equation \(A\widehat{M}+\widehat{M}A^*+B=0\). Since A is stable, this equation has a unique solution given by (2.8), hence \(\widehat{M}=M\).
(4) A simple calculation yields
for any \(X\in L(\Gamma )\) and \(\xi ,\eta \in \Xi \). Thus, \(\mathscr {R}_{\xi +\eta }(X_\xi +{\widetilde{\eta }})=\Sigma _{\widetilde{\eta }}\) and since \(A_{\xi +\eta }-B(X_\xi +{\widetilde{\eta }})=D_\xi \), we conclude that whenever \(\Sigma _{\widetilde{\eta }}=0\) one has \(X_{\xi +\eta }=X_\xi +{\widetilde{\eta }}\).
(5)–(7) The proof follows line by line the one of the corresponding Parts of [37, Proposition 5.5].
(8) Upon differentiating Eq. (3.7) one gets
Further, using (5.10) leads to
and the result now follows from Part (5). \(\square \)
5.2.2 Proof of Proposition 3.3
By Proposition 5.3, for \(\xi \in \overline{\mathscr {D}}\), we have \(A=D_\xi +QQ^*(X_\xi -{\widetilde{\xi }})\) with \({\widetilde{\xi }}\rhd \xi \), and we can rewrite the equation of the motion (2.2) as
where
Let \(Z_\xi (t)\) be the stochastic exponential of the local martingale
Combining Eq. (5.6) with the relations \({\widetilde{\xi }}QQ^*=QQ^*{\widetilde{\xi }}=Q\xi Q^*\) and \({\widetilde{\xi }}QQ^*{\widetilde{\xi }}=Q\xi ^2Q^*\), we derive
and we can write the quadratic variation of \(\eta _\xi \) as
Itô calculus and Proposition 3.1 (4) give
with
and, taking (3.7) into account,
The proof of the following Lemma is identical to the one of [37, Lemma 5.7], and we omit it.
Lemma 5.4
For \(\xi \in \overline{\mathscr {D}}\), the process
is a \(\mathbb {P}_x\)-martingale for all \(x\in \Gamma \).
Applying Girsanov theorem, we conclude that \(\{w_\xi (t)\}_{t\in [0,\tau ]}\) is a standard Wiener process under the law \(\mathbb {Q}^\tau _{\xi ,\mu }[\,\cdot \,]=\mathbb {E}_\mu [Z_\xi (\tau )\,\cdot \,]\). It follows that the finite-time cumulant generating function can be written as
for \(\xi \in \overline{\mathscr {D}}\), i.e.,
where
and \(Q_\xi ^t\) is the Markov semigroup associated with the SDE (5.11). From the explicit solution
to this SDE we easily obtain the representation
Setting
and \(\mathscr {D}_t=\{\xi \in \mathscr {D}\,|\,N_{\xi ,t}>0\}\), an elementary calculation then leads to
Schur’s complement formula and Proposition 5.3 (5–6) lead to the factorization
where
\(\Delta _{\xi ,t}\), as defined in Proposition 5.3 (6), and \(\widetilde{\Delta }_{\xi ,t}=\mathrm {e}^{tD_\xi ^*}(X_\xi +X_\xi (\theta X_{\vartheta ^{-1}-\xi }\theta +\Delta _{\xi ,t})^{-1}X_\xi )\mathrm {e}^{tD\xi }\) are strictly positive for \(t>0\) and vanish exponentially as \(t\rightarrow \infty \). Thus, for \(\xi \in \mathscr {D}\),
and setting
one has
It follows that, for all \(\xi \in \mathscr {D}_\infty \), the limit
is finite and positive, which yields the first part of (3.8). To deal with the second part, we note that whenever \(\xi \in \Xi \setminus \overline{\mathscr {D}_\infty }\), then either \(\xi \in \Xi \setminus \overline{\mathscr {D}}\) or \(\xi \in \overline{\mathscr {D}}\setminus \overline{\mathscr {D}_\infty }\). In the latter case, either \(X_\xi +\theta X_{\vartheta ^{-1}}\theta \) or \(X_{\vartheta ^{-1}-\xi }\) has a negative eigenvalue and the matrix \(N_{\xi ,t}\) loses its positiveness as \(t\rightarrow \infty \). It follows that \(d_t(\xi )=+\infty \) and hence \(g_t(\xi )=+\infty \) for large enough t. For \(\xi \in \Xi \setminus \overline{\mathscr {D}}\), applying [37, Lemma 5.8] to the functions \(f_t(\alpha )=g_t(\alpha \xi )\) yields the desired result.
Finally, we note that the continuity and concavity of the map \(\mathscr {D}\ni \xi \mapsto X_\xi \) imply that \(\mathscr {D}_\infty \) is an open convex subset of \(\mathscr {D}\). The positiveness of \(N_\xi \) for \(\xi \in \overline{\mathscr {D}_0}\setminus \{\vartheta ^{-1}\}\) is a consequence of its continuity and proves the last statement, concluding the proof of Proposition 3.3.
5.3 Proof of Theorem 3.4
Since \(X_0=0\) and \(X_{\vartheta ^{-1}}=\theta M^{-1}\theta >0\), it follows from the continuity of the map \(\overline{\mathscr {D}}\ni \xi \mapsto X_\xi \) that the open set \(\mathscr {D}_\infty \) contains 0. By Proposition 3.3, \(\mathscr {D}_\infty \) is the interior of the essential domain of the limiting cumulant generating function (3.8). The Gärtner-Ellis theorem thus implies that the LDP upper bound in (3.10) hold for any Borel \(F\subset \mathscr {L}^\perp \), with the rate function
Moreover, the corresponding lower bound holds for any subset F of the set \(\mathscr {F}\) of exposed points of this function. Let us set \(\mathscr {E}=\nabla g(\mathscr {S}_\infty )\). We have to show that \(\mathscr {F}=\mathscr {E}\).
By Proposition 3.2 (3+4+6) , for all \(\varphi \in \mathscr {L}^\perp \) one has
It follows from Proposition 3.2 (4+7) and [47, Theorem 26.5] that, as a function on \(\mathscr {L}^\perp \), J is the Legendre conjugate of the restriction of g to \(\mathscr {S}\). In particular, it is strictly convex and differentiable on \(\mathscr {L}^\perp \). Moreover, \(\nabla g:\mathscr {S}\rightarrow \mathscr {L}^\perp \) is a homeomorphism whose inverse is \(\nabla J:\mathscr {L}^\perp \rightarrow \mathscr {S}\). Since \(\mathscr {D}_\infty \) is open and convex, so is \(\mathscr {S}_\infty \), and its image \(\nabla g(\mathscr {S}_\infty )=\nabla g(\mathscr {D}_\infty )=\mathscr {E}\) is open and connected. We note that
for \(\varphi \in \mathscr {L}^\perp \). For \(\varphi =\nabla g(\xi )\in \mathscr {E}\) one has
i.e., I and J coincide on \(\mathscr {E}\). In particular, I is strictly convex on any convex subset of \(\mathscr {E}\). Suppose that \(\varphi \in \mathscr {E}\) is not an exposed point of I. Since \(\varphi =\nabla g(\xi )\) with \(\xi \in \mathscr {S}_\infty \), there exists \(\psi \in \mathscr {L}^\perp \) such that \(\psi \not =\varphi \) and \(I(\psi )=I(\varphi )+\xi \cdot (\psi -\varphi )\). Invoking convexity, one shows that \(I(\psi _\lambda )=I(\varphi )+\xi \cdot (\psi _\lambda -\varphi )\) with \(\psi _\lambda =\lambda \psi +(1-\lambda )\varphi \) and \(\lambda \in [0,1]\), which contradicts the strict convexity of I in a convex neighborhood of \(\varphi \).
Whenever both \(\pm \varphi \in \mathscr {E}\), we have \(\varphi =\nabla g(\xi )\) and \(-\varphi =\nabla g(\vartheta ^{-1}-\xi )\) with \(\xi \in \mathscr {S}_\infty \) and \(\vartheta ^{-1}-\xi \in \mathscr {S}_\infty \) , and thus,
Finally, we note that \(\mathscr {F}_0=\{\nabla g(\xi )\,|\,\xi \in \mathscr {D}_\infty \text { and }\vartheta ^{-1}-\xi \in \mathscr {D}_\infty \}\). Proposition 5.3 (3) implies that \(X_\xi >0\) and \(X_{\vartheta ^{-1}-\xi }>0\) for \(\xi \in \mathscr {D}_0\), and hence that \(\nabla g(\mathscr {D}_0)\subset \mathscr {F}_0\).
5.4 Proof of Theorem 3.5
By Theorem 3.4 it suffices to show that, under Condition (R), one has \(\nabla g(\mathscr {S}_\infty )=\mathscr {L}^\perp \). By Proposition 5.3 (4), for any \(\xi \in \mathscr {S}\) and any \(\eta \in \mathscr {L}\) one has
so that
Similarly, from
we deduce that
It follows that
and hence
Thus, under Condition (R), \(\nabla g(\mathscr {S}_\infty )=\nabla g(\mathscr {S})=\nabla g(\mathscr {D})=\mathscr {L}^\perp \).
Notes
The integral there is to be taken in Itô’s sense.
\(X_\xi \) is maximal whenever \(X\le X_\xi \) for all self-adjoint X such that \(\mathscr {R}_\xi (X)=0\).
Here, we used the fact that the steady state covariance M satisfies \(M^{-1}=\theta X_{\vartheta ^{-1}}\theta \).
The lineality space of a convex set \(\mathscr {C}\subset \mathbb {R}^n\) is the set of vectors \(y\in \mathbb {R}^n\) such that \(x+\lambda y\in \mathscr {C}\) for all \(x\in \mathscr {C}\) and all \(\lambda \in \mathbb {R}\), see [47]
Recall that M, given in (2.8), is the covariance of the invariant measure \(\mu \).
In Figures 6 and 7 the set \(\partial \mathscr {S}\) is mapped to the closed unit disk by first mapping \(\partial \mathscr {S}\) to the unit sphere and then mapping the point with spherical coordinates \((\varphi ,\theta )\in [0,2\pi ]\times [0,\pi ]\) on this sphere to the point \(\frac{\theta }{\pi }(\cos \varphi ,\sin \varphi )\) of the plane.
References
Atilgan, A.R., Durrell, S.R., Jernigan, R.L., Demirel, M.C., Keskin, O., Bahar, I.: Anisotropy of fluctuation dynamics of proteins with an elastic network model. Biophys. J. 80, 505–515 (2001)
Andrieux, D., Gaspard, P., Monnai, T., Tasaki, S.: The fluctuation theorem for currents in open quantum systems. New J. Phys. 11, 043014 (2009)
Baiesi, M., Jacobs, T., Maes, C., Skantzos, N.S.: Fluctuation symmetries for work and heat. Phys. Rev. E 74, 021111 (2006)
Bochkov, G.N., Kuzovlev, YuE: General theory of thermal fluctuations in nonlinear systems. Sov. Phys. JETP 45, 125–130 (1977)
Bodineau, T., Lefevere, R.: Large deviations of lattice Hamiltonian dynamics coupled to stochastic thermostats. J. Stat. Phys. 133, 1–27 (2008)
Bonetto, F., Gallavotti, G., Giuliani, A., Zamponi, F.: Chaotic hypothesis, fluctuation theorem and singularities. J. Stat. Phys. 123, 39–54 (2006)
Benoist, T., Jakšić, V., Pillet, C.-A.: Energy statistics in open harmonic networks. J. Stat. Phys. 168, 1016–1030 (2017)
Benoist, T., Panati, A., Pautrat, Y.: Heat conservation and fluctuations for open quantum systems in the Two-Time Measurement picture. Preprint arXiv:1810.09999
Cuneo, N., Eckmann, J.-P.: Controlling general polynomial networks. Commun. Math. Phys. 328, 1255–1274 (2014)
Cuneo, N., Eckmann, J.-P., Hairer, M., Rey-Bellet, L.: Non-equilibrium steady states for networks of oscillators. Electronic J. Probab. 23, 1–28 (2018)
Chetrite, R., Gawȩdzki, K.: Fluctuation relations for diffusion processes. Commun. Math. Phys. 282, 469–518 (2008)
Crooks, G.E.: Nonequilibrium measurements of free energy differences for microscopically reversible Markovian systems. J. Stat. Phys. 90, 1481–1487 (1998)
Cohen, E.G.D., van Zon, R.: Extended heat-fluctuation theorems for a system with deterministic and stochastic forces. Phys. Rev. E 69, 056121 (2004)
den Hollander, F.: Large Deviations. Fields Institute Monographs. AMS, Providence (2000)
Dembo, A., Zeitouni, O.: Large Deviations. Techniques and Applications. Springer, Berlin (1998)
Douarche, F., Joubaud, S., Garnier, N.B., Petrosyan, A., Ciliberto, S.: Work fluctuation theorems for harmonic oscillators. Phys. Rev. Lett. 97, 140603 (2006)
Evans, D.J., Cohen, E.G.D., Morriss, G.P.: Probability of second law violation in shearing steady flows. Phys. Rev. Lett. 71, 2401–2404 (1993)
Eckmann, J.-P., Pillet, C.-A., Rey-Bellet, L.: Non-equilibrium statistical mechanics of anharmonic chains coupled to two heat baths at different temperatures. Commun. Math. Phys. 201, 657–697 (1999)
Evans, D.J., Searles, D.J.: Equilibrium microstates which generate second law violating steady states. Phys Rev. E 50, 1645–1648 (1994)
Eckmann, J.-P., Zabey, E.: Strange heat flux in (an)harmonic networks. J. Stat. Phys. 114, 515–523 (2004)
Farago, J.: Injected power fluctuations in Langevin equation. J. Stat. Phys. 107, 781–803 (2002)
Gallavotti, G.: Extension of Onsager’s reciprocity to large fields and the chaotic hypothesis. Phys. Rev. Lett. 77, 4334–4337 (1996)
Gallavotti, G.: New methods in nonequilibrium gases and fluids. Open Syst. Inf. Dyn. 6, 101–136 (1999)
Gallavotti, G.: Statistical Mechanics. A Short Treatise. Springer, Berlin (2000)
Gallavotti, G., Cohen, E.G.D.: Dynamical ensembles in nonequilibrium statistical mechanics. Phys. Rev. Lett. 74, 2694–2697 (1995)
Gallavotti, G., Cohen, E.G.D.: Dynamical ensembles in stationary states. J. Stat. Phys. 80, 931–970 (1995)
Granger, L., Mehlis, J., Roldán, É., Ciliberto, S., Kantz, H.: Fluctuation theorem between non-equilibrium states in an RC circuit. New J. Phys. 17, 065005 (2015)
Haliloglu, T.B., Erman, B.: Gaussian dynamics of folded proteins. Phys. Rev. Lett. 79, 3090–3093 (1997)
Jarzynski, C.: A nonequilibrium equality for free energy differences. Phys. Rev. Lett. 78, 2690–2693 (1997)
Jakšić, V., Nersesyan, V., Pillet, C.-A., Shirikyan, A.: Large deviations from a stationary measure for a class of dissipative PDEs with random kicks Commun. Pure Appl. Math. 68, 2108–43 (2015)
Jakšić, V., Nersesyan, V., Pillet, C.-A., Shirikyan, A.: Large deviations and mixing for dissipative PDEs with unbounded random kicks. Nonlinearity 31, 540–596 (2018)
Jakšić, V., Nersesyan, V., Pillet, C.-A., Shirikyan, A.: Large deviations and Gallavotti-Cohen principle for dissipative PDEs with rough noise. Commun. Math. Phys. 336, 131–170 (2015)
Jakšić, V., Nersesyan, V., Pillet, C.-A., Shirikyan, A.: Large deviations and entropy production in viscous fluid flows. Preprint arXiv:1902.03278
Jakšić, V., Ogata, Y., Pautrat, Y., Pillet, C.-A.: Entropic fluctuations in quantum statistical mechanics-an introduction. In: Fröhlich, J., Salmhofer, M., Mastropietro, V., De Roeck, W., Cugliandolo, L.F. (eds.) Quantum Theory from Small to Large Scales. Oxford University Press, Oxford (2012)
Jop, P., Petrosyan, A., Ciliberto, S.: Work and dissipation fluctuations near the stochastic resonance of a colloidal particle. EPL 81, 50005 (2008)
Jakšić, V., Pillet, C.-A., Rey-Bellet, L.: Entropic fluctuations in statistical mechanics I. Classical dynamical systems. Nonlinearity 24, 699–763 (2011)
Jakšić, V., Pillet, C.-A., Shirikyan, A.: Entropic fluctuations in thermally driven harmonic networks. J. Stat. Phys. 166, 926–1015 (2017)
Kundu, A., Sabhapandit, S., Dhar, A.: Large deviations of heat flow in harmonic chains. J. Stat. Mech. 2011, P03007 (2011)
Lancaster, P., Rodman, L.: The Algebraic Riccati Equation. Clarendon Press, Oxford (1995)
Lebowitz, J.L., Spohn, H.: A Gallavotti-Cohen-type symmetry in the large deviation functional for stochastic dynamics. J. Stat. Phys. 95, 333–365 (1999)
Maes, C.: The fluctuation theorem as a Gibbs property. J. Stat. Phys. 95, 367–392 (1999)
Maes, C., Netočný, K., Verschuere, M.: Heat conduction networks. J. Stat. Phys. 111, 1219–1244 (2003)
Nersesyan, V.: Large deviations for the Navier-Stokes equations driven by a white-in-time noise. To appear in Ann. H. Lebesgue (2019)
Nickelsen, D., Engel, A.: Asymptotics of work distributions: the pre-exponential factor. Eur. Phys. J. B 82, 207–218 (2011)
Rákos, A., Harris, R.J.: On the range of validity of the fluctuation theorem for stochastic Markovian dynamics. J. Stat. Mech. 2011, P05005 (2008)
Rondoni, L., Mejía-Monasterio, C.: Fluctuations in non-equlibrium statistical mechanics: models, mathematical theory, physical mechanisms. Nonlinearity 20, 1–37 (2007)
Rockafellar, R.T.: Convex Analysis. Princeton University Press, Princeton (1972)
Rey-Bellet, L., Thomas, L.E.: Fluctuations of the entropy production in anharmonic chains. Ann. Henri Poincaré 3, 483–502 (2002)
Ruelle, D.: Statistical Mechanics: Rigorous Results (3rd edition). Mathematical Physics Monograph Series. Benjamin, Reading, MA (1977)
Ruelle, D.: Smooth dynamics and new theoretical ideas in nonequilibrium statistical mechanics. J. Stat. Phys. 95, 393–468 (1999)
Seifert, U.: Stochastic thermodynamics, fluctuation theorems and molecular machines. Rep. Prog. Phys. 75, 126001 (2012)
van Zon, R., Ciliberto, S., Cohen, E.G.D.: Power and heat fluctuation theorems for electric circuits. Phys. Rev. Lett. 92, 130601 (2004)
Visco, P.: Work fluctuations for a Brownian particle between two thermostats. J. Stat. Mech. 2006, 6006 (2006)
Watson, G.N.: A Treatise on the Theory of Bessel Functions, 2nd edn. Cambridge University Press, Cambridge (1966)
Acknowledgements
This research was supported by the Agence Nationale de la Recherche (ANR) through the grant NONSTOPS (ANR-17-CE40-0006) and the CNRS collaboration grant Fluctuation theorems in stochastic systems. The work of C.-A.P. has been carried out in the framework of the Labex Archimède (ANR-11-LABX-0033) and of the A*MIDEX project (ANR-11-IDEX-0001-02), funded by the “Investissements d’Avenir” French Government program managed by the ANR. Parts of this work were performed during the visits of M.D. and M.H. at the University of Toulon and of C.-A.P. at the University of Sfax. We thank the CPT, the University of Toulon and the Mathematics Department of Sfax for their hospitality and support. M.H. and C.-A.P. are also grateful to the Centre de Recherches Mathématiques de l’Université de Montréal for its hospitality and the Simons foundation and CNRS for their support during their stay in Montréal in the fall 2018.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Damak, M., Hammami, M. & Pillet, CA. A Detailed Fluctuation Theorem for Heat Fluxes in Harmonic Networks Out of Thermal Equilibrium. J Stat Phys 180, 263–296 (2020). https://doi.org/10.1007/s10955-019-02398-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10955-019-02398-x