Abstract
We introduce a meta logarithmic-Sobolev (log-Sobolev) inequality for the Lindbladian of all single-mode phase-covariant Gaussian channels of bosonic quantum systems and prove that this inequality is saturated by thermal states. We show that our inequality provides a general framework to derive information theoretic results regarding phase-covariant Gaussian channels. Specifically, by using the optimality of thermal states, we explicitly compute the optimal constant \(\alpha _p\), for \(1\le p\le 2\), of the p-log-Sobolev inequality associated with the quantum Ornstein–Uhlenbeck semigroup. Prior to our work, the optimal constant was only determined for \(p=1\). Our meta log-Sobolev inequality also enables us to provide an alternative proof for the constrained minimum output entropy conjecture in the single-mode case. Specifically, we show that for any single-mode phase-covariant Gaussian channel \(\Phi \), the minimum of the von Neumann entropy \(S\big (\Phi (\rho )\big )\) over all single-mode states \(\rho \) with a given lower bound on \(S(\rho )\) is achieved at a thermal state.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Quantum Gaussian channels are prototype noise models for the transmission of quantum information in current quantum communication technologies and information processing. In particular, they model transmission channels for sending quantum data encoded in bosonic systems through optical fibers and free space. Despite several developments in the past decades, there are still wide open conjectures regarding information theoretic properties of these channels. These conjectures are important not only due to the significance of quantum Gaussian channels in quantum information science, but also as quantum generalizations of some influential results in functional analysis.
In functional analysis and information theory, the optimality of Gaussian functions and Gaussian distributions has been established for various optimization problems involving Gaussian kernels or Gaussian channels. To name a few results, it is shown in [7] that only Gaussian functions saturate Gross’s celebrated logarithmic-Sobolev (log-Sobolev) inequality [25]. Also, it is shown in [36] that “Gaussian kernels have only Gaussian maximizers.” In information theory, it is shown that the optimal input distribution for Gaussian broadcast channels is Gaussian [20]. More generally, various inequalities including Brascamp–Lieb inequalities, the Loomis–Whitney inequality, the Prékopa–Leindler inequality, sharp form of Young’s inequality for convolution of functions as well as the entropy power inequality are tight for Gaussian functions and distributions; see [1] and references therein. Thus, it is tempting to verify the validity of these results in the non-commutative case for quantum Gaussian channels.
Single-mode quantum Gaussian channels have been classified in [30]; see also [31]. Among these classes are the three important physical classes of attenuator, amplifier and additive-noise channels, which have two main features. First, these channels are phase-covariant meaning, that they are covariant with respect to the group of phase operators, and second, they form quantum Markov semigroups [28]. These two features make these channels suitable candidates for generalizing some of the aforementioned properties of Gaussian kernels in functional analysis to the quantum case. In particular, as we show in this paper, these channels satisfy a new inequality that is saturated by Gaussian states, which we refer to as meta log-Sobolev inequality and provides a general framework for deriving quantum information-theoretic results regarding bosonic systems.
One application of our meta log-Sobolev inequality is to compute the optimal constants of the p-log-Sobolev inequalities for a semigroup of attenuator channels, called the quantum (bosonic) Ornstein–Uhlenbeck semigroup. This semigroup is of particular interest as it resembles its classical counterpart when acting on certain operators; see [9, Equation (7.5)]. The classical Ornstein–Uhlenbeck semigroup is an important example of a Markov semigroup, which can be understood as the convolution of an input distribution by a Gaussian one. Its associated log-Sobolev inequality takes the form
where the entropy \({{\textrm{Ent}}}_2(f) = {\mathbb {E}}[f^2\log f^2] - {\mathbb {E}}[f^2]\log {\mathbb {E}}[f^2]\) and the 2-norm \(\Vert f\Vert _2={\mathbb {E}}[f^2]^{1/2}\) are defined with respect to the standard normal distribution. This inequality was first established by Gross in his seminal work [25].
Given the log-Sobolev inequality (1) for the classical Ornstein–Uhlenbeck semigroup and the development of the theory of hypercontractivity for quantum semigroups [39], it is natural to ask if the quantum Ornstein–Uhlenbeck semigroup is hypercontractive and satisfies a log-Sobolev inequality.Footnote 1 The 2-log-Sobolev inequality for the quantum Ornstein–Uhlenbeck semigroup takes the form
Here, \(\sigma _\beta =(1-e^{-\beta }) e^{-\beta a^\dagger a}\) is a thermal (and then Gaussian) state with parameter \(\beta >0\), \([{\textbf{a}}, X]={\textbf{a}}X-X{\textbf{a}}\) is the commutator of the annihilation operator with X, and in the special case that \(\rho = \Big (\sigma ^{\frac{1}{4}}_\beta X \sigma _\beta ^{\frac{1}{4}}\Big )^2\) is a quantum state
Moreover, \(\alpha _2\) is called the 2-log-Sobolev constant whose optimal value, depending on \(\beta \), is one of our main problems of study in this work.
Using the spectral properties of the Lindbladian of the quantum Ornstein–Uhlenbeck semigroup established in [9], a bound on the 2-log-Sobolev constant \(\alpha _2\) is derived in [6]. This bound confirms the hypercontractivity of the quantum Ornstein–Uhlenbeck semigroup, yet it seems far from being optimal. Moreover, the bound of [6] works only in the single-mode case and is not clear to satisfy the so called tensorization property.
Although 2-log-Sobolev inequalities (assuming some regularity conditions [35, 39]) imply hypercontractivity inequalities, for a more refined characterization of the latter inequalities we need to establish generalizations of the above 2-log-Sobolev inequality for all values of \(1\le p\le 2\). These generalizations, called p-log-Sobolev inequalities, are all equivalent and ignorant of the parameter p for the classical Ornstein–Uhlenbeck semigroup, yet they differ in the quantum case.Footnote 2
The 1-log-Sobolev (also called the modified log-Sobolev) inequality for the quantum Ornstein–Uhlenbeck semigroup is first studied in [34] where it is examined for Gaussian states and it is conjectured that Gaussian states are optimal for the 1-log-Sobolev inequality. This conjecture is first proven in [8]. An alternative proof of this inequality and an extension are also derived in [12].
Our meta log-Sobolev inequality is also related to computing the classical capacity of attenuators, amplifiers and additive-noise channels, i.e., finding the maximum rate of reliable transmission of classical data through these channels. Since these channels are phase-covariant, their classical capacity is related to their minimum output entropy, i.e., the minimum entropy of their output states; see, e.g., [31]. It was a long-standing open problem that the minimum output entropy of these channels is additive and is achieved over Gaussian states [21, 23, 33]. This conjecture was answered in the affirmative in [22]. Extensions of this result have also been proven for which we refer to [18, 32] and references therein.
Generalizing the aforementioned minimum output entropy conjecture, the constrained minimum output entropy (CMOE) conjecture states that the minimum output entropy of the above phase-covariant Gaussian channels over input states with a given entropy is achieved by Gaussian states [27]. More precisely, the CMOE conjecture states that if \(\Phi \) is a single-mode phase-covariant Gaussian channel, which by the above discussion is either attenuator, amplifier or additive-noise, then for any m-mode state \(\rho \) we have
where \(\tau \) is a single-mode thermal state satisfying \(S(\tau ) = \frac{1}{\,}m S(\rho )\). This conjecture is in particular related to the problem of computing the capacity of quantum Gaussian broadcast channels [27]. Using tools developed in [14] the CMOE conjecture is proven in [15, 16] in the special case of \(m=1\). It is also proven for arbitrary m in the parameter regime that the channel \(\Phi \) becomes entanglement breaking [11].
Some other conjectures regarding the optimality of Gaussian states have also been proposed including an entropy photon-number inequality [13, 26] and a sharp Young’s inequality for the beam-splitter [18]. For more details on quantum Gaussian optimizer conjectures in quantum information theory we refer to [18].
1.1 Our Contributions
In this paper, we introduce a meta log-Sobolev inequality for phase-covariant Gaussian quantum channels. In the Schrödinger picture, the Lindbladian of the semigroups of attenuator, amplifier and additive-noise channels mentioned above, takes the form
where \(\nu _0, \nu _1\ge 0\) are non-negative constants and \(\{X, Y\} = XY+YX\) denotes anti-commutator. Then, given any \(\omega \ge 0\), we define the function
over single-mode states \(\rho \). Later, we will see that for the choice of
and \(p=2\), the first term in \(\Upsilon (\rho )\) resembles the Dirichlet form appearing on the right hand side of (2). In this case, \({{\textrm{Ent}}}_{p, \sigma _\beta }(X)=D(\rho \Vert \sigma _\beta )\) can be written in terms of the other two terms in \(\Upsilon (\rho )\). Thus, computing the infimum of \(\Upsilon (\rho )\) is useful in obtaining the optimal log-Sobolev constant \(\alpha _p\).
Our main result, which we call a meta log-Sobolev inequality, states that optimal states in the minimization of \(\Upsilon (\rho )\) are thermal, i.e.,
for any state \(\rho \) of a single-mode bosonic system.
By using (4), we turn the problem of computing the optimal p-log-Sobolev constant \(\alpha _p\) into an optimization problem over thermal states that are characterized by a single real parameter. We explicitly compute the optimal constant for the quantum Ornstein–Uhlenbeck semigroup for any \(1\le p\le 2\):
This result for \(p=1\) recovers the modified log-Sobolev inequality of [8], and for \(p=2\) is an improvement over the bound of [6].
We also study the quantum Ornstein–Uhlenbeck semigroup in the multimode case. It is well known that log-Sobolev constants for classical Markov semigroups have the tensorization property (see, e.g., [38]), meaning that considering the m-fold tensor product of the semigroup, the log-Sobolev constants of the resulting semigroup are the same as those of the original one. This property is not known to hold for arbitrary quantum Markov semigroups. Thus, we also consider the m-mode version of (2):
where \(X>0\) runs over m-mode operators, and \({{\widehat{\alpha }}}_{2}\) is the 2-log-Sobolev constant for the m-fold tensor product semigroup. We show that for any \(m>1\),
where \(\alpha _2\) is given by (5).
To prove our result (6), we follow the approach of [6]. We use properties of the spectrum of the generator \({\mathcal {L}}\) of the quantum Ornstein–Uhlenbeck semigroup to reduce the problem for arbitrary m-mode operators X to operators that are diagonal in the number basis. To this end, we prove an entropic inequality which might be of independent interest. Next, restricting to diagonal operators, we obtain an essentially classical Markov semigroup that is known to satisfy the tensorization property. Thus, the problem for m-mode states is reduced to that of single-mode states for which we have already established the optimal log-Sobolev constant in (5). We note that, although, our first step of the proof is inspired by [6], our penalty term that compensates for the reduction to diagonal states is smaller than that of [6]. Moreover, as mentioned above we compute the exact value of the 2-log-Sobolev constant for diagonal states while [6] derives only a lower bound. Thus, our estimate on the 2-log-Sobolev inequality is not just an m-mode generalization of [6], but a twofold improvement thereof.
Our meta log-Sobolev inequality (4) provides a general framework that can be applied in other settings as well. In this paper, using (4) we also give an alternative proof of the CMOE conjecture (3) in the case of \(m=1\).
Outline of the paper The rest of this paper is organized as follows. In Sect. 2, we review the class of single-mode phase-covariant Gaussian channels. We argue that these channels form semigroups and compute their Lindbladians. Section 3 is devoted to our meta log-Sobolev inequality (4) and its proof. The log-Sobolev constants for the quantum Ornstein–Uhlenbeck semigroup are computed in Sect. 4. Also, our proof of the CMOE conjecture is given in Sect. 5. Final remarks are discussed in Sect. 6, and some detailed computations are left for the appendices.
2 Phase-Covariance Gaussian Channels
A single-mode bosonic continuous-variable system is described by the annihilation operator \({\textbf{a}}\) and it hermitian conjugate, the creation operator \({\textbf{a}}^\dagger \). These operators satisfy the bosonic commutation relation
Fock states \(\{|n\rangle :\, n\ge 0\}\), also called number states, form an orthonormal basis for the corresponding Hilbert space and are eigenvectors of \({\textbf{a}}^\dagger {\textbf{a}}\) called the number operator: \({\textbf{a}}^\dagger {\textbf{a}}|n\rangle =n|n\rangle \). We indeed have \({\textbf{a}}|n\rangle = \sqrt{n}|n-1\rangle \) and \({\textbf{a}}^\dagger |n\rangle = \sqrt{n+1}|n+1\rangle \). Single-mode quantum states \(\rho \) are density operators (\(\rho \succeq 0\) and \({{\textrm{tr}}}(\rho )=1\)) acting on this Hilbert space.
All states that we consider in this paper are assumed to be physical and hence have finite mean photon number. More precisely, for a pure state \(\rho =|\psi \rangle \langle \psi |\) we assume that \(|\psi \rangle \) belongs to the domain of \({\textbf{a}}\) and \(\langle \psi |{\textbf{a}}^\dagger {\textbf{a}}|\psi \rangle = \Vert {\textbf{a}}|\psi \rangle \Vert ^2 < +\infty \), and more generally for a mixed state with eigen-decomposition \(\rho = \sum _{j=0}^\infty \lambda _j |\psi _j\rangle \langle \psi _j|\) we assume that \(|\psi _j\rangle \), for any j with \(\lambda _j>0\), belongs to the domain of \({\textbf{a}}\) and \(\sum _j \lambda _j \Vert {\textbf{a}}|\psi _j\rangle \Vert ^2 < +\infty \). The latter expression is indeed equal to \({{\textrm{tr}}}({\textbf{a}}\rho {\textbf{a}}^\dagger )\), yet we often write it as \({{\textrm{tr}}}(\rho {\textbf{a}}^\dagger {\textbf{a}})\), the conventional notation for the mean photon number, considering the following formal definition for the expectation value of an operator in quantum mechanics.Footnote 3 For any positive semidefinite operator X, we formally define \({{\textrm{tr}}}(\rho X) = \sum _j \lambda _j \Vert X^{1/2} |\psi _j\rangle \Vert ^2\) if \(|\psi _j\rangle \) belongs to the domain of \(X^{1/2}\) for all j with \(\lambda _j>0\), and let \({{\textrm{tr}}}(\rho X) = +\infty \) otherwise.Footnote 4 Also, for a self-adjoint operator X with decomposition \(X=X_+-X_-\) where both \(X_+, X_-\) are positive semidefinite, we define \({{\textrm{tr}}}(\rho X) = {{\textrm{tr}}}(\rho X_+) - {{\textrm{tr}}}(\rho X_-)\).
Quantum states can be represented in terms of displacement (Weyl) operators \(D_\xi =\exp (\xi {\textbf{a}}^\dagger -{{\bar{\xi }}}{\textbf{a}})\) with \(\xi \in {\mathbb {C}}\) as
where \(\chi _\rho (\xi )={{\textrm{tr}}}(\rho D_\xi )\) is called the characteristic function [5]. The characteristic function is the Fourier transform of the Wigner function [29].
A quantum state is called Gaussian if its characteristic and equivalently Wigner functions are Gaussian. Therefore, Gaussian states can be simply described in terms of the first-order and second-order moments of their Wigner function. Thermal states are an important class of Gaussian states. A thermal state with parameter \(\beta >0\) that is proportional to the inverse of temperature, is diagonal in the Fock basis and is given by
The characteristic function of this thermal state equals
At zero temperature (\(\beta =\infty \)), we obtain the vacuum state \(\sigma _\infty =|0\rangle \langle 0|\).
The evolution of quantum systems, in general, is described by quantum channels \(\Phi \) that are linear completely positive and trace-preserving superoperators. Quantum channels that transform Gaussian states into Gaussian states are known as Gaussian channels [3, 31, 33, 42]. Gaussian channels that are unitary are called Gaussian unitaries.
A single-mode Gaussian channels is called phase-covariantFootnote 5 [22, 24] if it satisfies
for any state \(\rho \) and \(\theta \in [0,2\pi )\), where \(U_\theta =e^{i\theta \, {\textbf{a}}^\dagger {\textbf{a}}}\) is the phase-rotation unitary. By the classification of single-mode Gaussian channels [30, 31], a single-mode phase-covariant Gaussian channel can be described in terms of its action on the characteristic function. For any such channel there are parameters \(\gamma , \lambda \ge 0\) such that
where the complete positivity condition implies \(\gamma \ge |1-\lambda |\) [33]. The channel \(\Phi \) is called quantum limited if \(\gamma =|1-\lambda |\). By using the characteristic function of thermal states (9) in (11), one can verify that phase-covariant Gaussian channels transform thermal states to thermal states.
Single-mode phase-covariant Gaussian channels consist of three classes of attenuator channels corresponding to \(0\le \lambda <1\), additive-noise channels corresponding to \(\lambda =1\), and amplifier channels corresponding \(1<\lambda \). These are important physical channels in describing dynamics of continuous-variable quantum systems.
A crucial property of single-mode phase-covariant Gaussian channels is that they admit semigroup structures [23, 28]. Specifically, any such channel can be written as \(\Phi _{t_0}=e^{-t_0{\mathcal {L}}}\) for some \(t_0\ge 0\) where \(\big \{\Phi _{t} = e^{-t{\mathcal {L}}}:\, t\ge 0\big \}\) is a semigroup of single-mode phase-covariant Gaussian channels. We show in Appendix A that the corresponding Lindbladian \({\mathcal {L}}\) of such a semigroup takes the form
where \(\{X, Y\} = XY+YX\) denotes the anti-commutator, and \(\nu _0, \nu _1\ge 0\) are parameters determining the semigroup. We argue in Appendix A that three ranges for the parameters \(\nu _0, \nu _1\) give the three classes of single-mode phase-covariant channels: \(\nu _1>\nu _0\) corresponds to attenuator channels, \(\nu _0=\nu _1\) corresponds to additive-noise channels, and \(\nu _1<\nu _0\) corresponds to amplifier channels.
Attenuator channels, in general, can be physically modeled by applying a beam splitter unitary with the transmissivity of \(0\le \lambda \le 1\) on the system and an auxiliary system in the thermal state \(\sigma _\beta \), and then tracing out the second subsystem:
The characteristic function of the output state is given by
Choosing the transmissivity parameter \(\lambda _t=e^{-2ct}\) as a function of time, where \(c>0\) is some constant, we obtain a semigroup of attenuation channels. The generator of this semigroup given in (12) has parameters \(\nu _0 = c \left( \coth (\beta /2)-1\right) \) and \(\nu _1 =c \left( \coth (\beta /2)+1\right) \). Note that for the special choice of \(c=\sinh (\beta /2)\) we have \(\nu _0=e^{-\beta /2}\) and \(\nu _1=e^{\beta /2}\). This semigroup of attenuator channels is sometimes called the quantum Ornstein–Uhlenbeck semigroup.
Amplifier channels can be described by replacing the beam splitter unitary in the above model by a two-mode squeezing unitary to get
where \(\lambda \ge 1\) is the squeezing parameter. In this case, the relation between input and output characteristic functions becomes
Choosing \(\lambda _t=e^{2ct}\) with \(c>0\) to be a function of time, we obtain a semigroup with the generator corresponding to parameters \(\nu _0=c\left( \coth (\beta /2)+1\right) \) and \(\nu _1=c\left( \coth (\beta /2)-1\right) \) in (12).
Additive-noise channels can be modeled by applying a displacement operator whose parameter is chosen at random according to a Gaussian probability distribution:
This channel in terms of characteristic functions can be written as
Again, by setting \(\gamma _t=2ct\) with \(c>0\) we obtain a semigroup whose generator given by (12) has parameters \(\nu _0=\nu _1=c\). This semigroup is sometimes called the quantum heat semigroup.
We emphasize that by the above discussion any single-mode phase-covariant channel (11) can be viewed as a member of one of the above three semigroups. The point is that, the above choices of parameters \(\lambda _t, \gamma _t\) cover all the valid ranges of \(\lambda , \gamma \) in (11).
In this paper, we also consider m-mode bosonic systems, described by m pairs of annihilation and creation operators \(\{{\textbf{a}}_1, {\textbf{a}}_1^\dagger , \dots , {\textbf{a}}_m, {\textbf{a}}_m^\dagger \}\) satisfying \([{\textbf{a}}_{i}, {\textbf{a}}_j]=0\) and \([{\textbf{a}}_{i}, {\textbf{a}}_j^\dagger ]=\delta _{i,j}\). Vectors of the associated tensor product Hilbert space can be expressed in terms of m-mode number states \(\{|n_1,\dots ,n_m\rangle :\, n_1,\dots , n_m\ge 0\}\). Also, the characteristic function for an m-mode state \(\rho \) is given by \(\chi _\rho (\xi )={{\textrm{tr}}}\big (\rho D_{\xi }\big )\), where \(\xi =(\xi _1,\dots ,\xi _m)\in {\mathbb {C}}^m\) and \(D_\xi = D_{\xi _1}\otimes \dots \otimes D_{\xi _m}\). As in the single-mode case, physical multimode states \(\rho \) considered in this paper have finite mean photon number: \({{\textrm{tr}}}(\rho H_m)<+\infty \), where \(H_m=\sum _{j=1}^m {\textbf{a}}_j^\dagger {\textbf{a}}_j\) is the m-mode number operator. The definitions of Gaussian channels and unitaries can be extended to the multimode case. Any Gaussian unitary can be decomposed into displacement operators, multimode passive transformations, which preserve the mean photon number, and single-mode squeezing transformations; see [41] for more details.
3 Meta Log-Sobolev Inequality
As discussed in the previous section, all the Lindbladians associated with single-mode phase-covariant channels have the form \({\mathcal {L}}=\nu _0{\mathcal {L}}_{0} + \nu _1 {\mathcal {L}}_1\) for some \(\nu _0, \nu _1\ge 0\) where
Let \(\langle \cdot , \cdot \rangle \) denote the Hilbert–Schmidt inner product:
Also, let \({\mathcal {L}}^*\) be the adjoint of the generator \({\mathcal {L}}\) with respect to this inner product: \(\langle X, {\mathcal {L}}(Y)\rangle = \langle {\mathcal {L}}^*(X), Y\rangle \). Indeed, \({\mathcal {L}}^*\) is the generator in the Heisenberg picture given by
Let \(p\ge 1\) and \({\hat{p}}\) be the Hölder conjugate of p given byFootnote 6
Then, for any single-mode quantum state \(\rho \) define
where \(\omega \ge 0\) is a fixed parameter and \(S(\rho )= -{{\textrm{tr}}}(\rho \log \rho )\) is the von Neumann entropy of the state. The factor \({\hat{p}}\) in the first term of \(\Upsilon (\rho )\) is for the sake of normalization in the limiting case of \(p\rightarrow 1^+\). Indeed, since \({\mathcal {L}}^*(I)=0\), we have
Thus, by convention for \(p=1\) we let \({\hat{p}}\, \big \langle {\mathcal {L}}\big (\rho ^{1/p}\big ), \rho ^{1/{\hat{p}}}\big \rangle = \langle {\mathcal {L}}(\rho ), \log \rho \rangle \).
A straightforward computation verifies that
We may think of this equation as the starting definition of \(\Upsilon (\rho )\). In Appendix B, we show that if \(\rho \) has a finite mean photon number, then all the terms in the above equation are finite and \(\Upsilon (\rho )\) given by (17) is well defined.
One of the main technical contributions of our work is that the infimum of \(\Upsilon (\rho )\) over states \(\rho \) is achieved at single-mode thermal states. To this end, it would be beneficial to compute \(\Upsilon (\rho )\) for thermal states. Using (17), for a single-mode thermal state,Footnote 7
where we put \(0<x=e^{-\beta }<1\) in (8), we have
Optimizing over the choice of the thermal state \(\tau \), we define
Theorem 1
(Meta log-Sobolev inequality). For any \(\nu _0, \nu _1, \omega \ge 0\) and \( p\ge 1\) define \(\Upsilon (\rho )\) by (16). Let \(\eta _{{{{\textrm{th}}}}}\) be the infimum of \(\Upsilon (\tau )\) over thermal states as in (19). Then, for any single-mode quantum state \(\rho \) with finite mean photon number we have
The proof of this theorem is broken into two steps:
-
(i)
The first step is to reduce the problem for arbitrary states \(\rho \) to states that are diagonal in the Fock basis. To this end, we use ideas developed in [14]. We show that, fixing the eigenvalues of \(\rho \) and rotating its eigen-basis, we can obtain a diagonal state \({{\widehat{\rho }}}\) that satisfies \(\Upsilon (\rho )\ge \Upsilon ({{\widehat{\rho }}})\). We then conclude that diagonal states are sufficient when minimizing \(\Upsilon (\cdot )\).
-
(ii)
In the second step, we show that the optimal diagonal states are thermal. The proof idea in this step is extracted from tensorization type arguments. Tensorization was first used by Gross [25] for proving his celebrated log-Sobolev inequality. The idea in [25] is that by the central limit theorem, an expectation value with respect to a Gaussian distribution can be understood as the limit \(n\rightarrow +\infty \) of the expectation value of some lifted function on the product space \(\{0,1\}^n\) with respect to a product probability measure. Here, being interested in thermal states, our distributions of reference are geometric distributions, and geometric distributions can be understood in terms of the first success in a sequence of independent Bernoulli trials. Thus, it is natural to employ a tensorization type argument in order to reduce the optimization over general single-mode diagonal states to thermal ones. In our proof, we do not directly refer to Bernoulli trials, yet our intuition on why and how it works is really rooted in a tensorization argument as described here.
Proof
(i) In the first step of the proof, we show that for any state \(\rho \), there is a state \({{\widehat{\rho }}}\) that is diagonal in the Fock basis and satisfies \(\Upsilon (\rho )\ge \Upsilon ({{\widehat{\rho }}})\).
Let
be the eigen-decomposition of \(\rho \) where \(\lambda _0 \ge \lambda _1\ge \cdots \ge 0\) are the eigenvalues of \(\rho \) and \(\{|\psi _n\rangle :\, n\ge 0\}\) is its eigen-basis. In this case, there are \(r_k=\lambda _k-\lambda _{k+1}\ge 0\) such that
where \(P_k = \sum _{n=0}^{k} |\psi _n\rangle \langle \psi _n|\) is the projection operator on the first \(k+1\) eigenvectors of \(\rho \). Now we define
where \(\Pi _k = \sum _{n=0}^{k} |n\rangle \langle n|\) is the projection operator on the first \(k+1\) vectors in the Fock basis. We note that by definition, \({{\widehat{\rho }}} = \sum _{k=0}^n \lambda _k |n\rangle \langle n|\) is diagonal in the Fock basis and shares the same eigenvalues with \(\rho \). Therefore, they have the same entropy
Next, we note that the infimum of \({{\textrm{tr}}}(Q_k {\textbf{a}}^\dagger {\textbf{a}})\) over projectors \(Q_k\) with rank \(k+1\), equals the sum of the first \(k+1\) smallest eigenvalues of \({\textbf{a}}^\dagger {\textbf{a}}\), and is achieved by \(Q_k = \Pi _k\). This implies that the mean photon number of \(\rho \) is lower bounded by that of \({{\widehat{\rho }}} \):
We also have
We note that \(\rho ^{1/p} = \sum _n \lambda _n^{1/p} |\psi _n\rangle \langle \psi _n|\) with \(\lambda _0^{1/p}\ge \lambda _1^{1/p}\ge \cdots \ge 0\). Then, there are \(r_{p,k} \ge 0\) such that \(\rho ^{1/p} = \sum _k r_{p, k} P_k\). Similarly, there are \(r_{{\hat{p}}, k}\ge 0\) such that \(\rho ^{1/{\hat{p}}} = \sum _k r_{{\hat{p}}, k} P_k\). We also have \({{\widehat{\rho }}}^{1/p} = \sum _k r_{p, k} \Pi _k\) and \({{\widehat{\rho }}}^{1/{\hat{p}}} = \sum _k r_{{\hat{p}}, k} \Pi _k\). Therefore, we have
where in the last inequality we use \(P_k, P_\ell \preceq I\). Thus, using the commutation relation \([{\textbf{a}}, {\textbf{a}}^\dagger ]= 1\), we get
Repeating the same computation for \({{\widehat{\rho }}}\), we observe that the support of \({\textbf{a}}\Pi _\ell {\textbf{a}}^\dagger \) is included in the support of \(\Pi _k\) if \(k\ge \ell \). This implies \({{\textrm{tr}}}\big ( \Pi _k {\textbf{a}}\Pi _\ell {\textbf{a}}^\dagger \big ) = {{\textrm{tr}}}\big ( {\textbf{a}}\Pi _\ell {\textbf{a}}^\dagger \big )\). Similarly, if \(k<\ell \), then the support of \({\textbf{a}}^\dagger \Pi _k{\textbf{a}}\) is inside the support of \(\Pi _\ell \), which implies \({{\textrm{tr}}}\big ( \Pi _k {\textbf{a}}\Pi _\ell {\textbf{a}}^\dagger \big )={{\textrm{tr}}}\big ( {\textbf{a}}^\dagger \Pi _k {\textbf{a}}\Pi _\ell \big ) = {{\textrm{tr}}}\big ( \Pi _k {\textbf{a}}{\textbf{a}}^\dagger \big )\). Therefore, we have
By using the same argument, we also have
Putting (21)–(24) together and using the fact that \(\nu _0, \nu _1, \omega \ge 0\) we conclude that \(\Upsilon (\rho )\ge \Upsilon ({{\widehat{\rho }}})\) for \(p>1\). The same inequality for \(p=1\) is obtained by taking the limit \(p\rightarrow 1^+\). Therefore, to prove \(\Upsilon (\rho )\ge \eta _{{{{\textrm{th}}}}}\) in Theorem 1, it suffices to restrict to diagonal states in the Fock basis.
(ii) We now, in the second step, show that the optimal diagonal states that minimize \(\Upsilon (\rho )\) are thermal. Let
be the eigen-decomposition of \(\rho \). A straightforward computation yields
By the definition of \(\eta _{{{\textrm{th}}}}\) in (19), for any \(0\le x\le 1\) we haveFootnote 8
For any \(\ell \ge 0\) let
We note that \(s_\ell = \lambda _\ell + s_{\ell +1}\) and \(0\le x_\ell \le 1\). Therefore,
Multiplying both sides by \(s_\ell \) and summing over \(\ell \), we obtain
We compute and estimate each term in the above equation as follows. First, we have
Second, by applying Hölder’s inequality, we obtain
Similarly, we obtain
We also have
Finally, the last term of (26) becomes
where in the last line we use \(s_0=\sum _n \lambda _n=1\) and \(\lim _{k\rightarrow +\infty } s_k=0\). Using these equations in (26) and compared to (25) we arrive at \(\Upsilon (\rho )\ge \eta _{{{\textrm{th}}}}\).
The proof for \(p=1\) is similar; we only need to replace the Hölder inequality in (27) with
that is derived from Hölder’s inequality by taking an appropriate limit.Footnote 9\(\square \)
We remark that in part (ii) of the above proof, when the diagonal state \(\rho \) is thermal, the parameter \(x_\ell \) is independent of \(\ell \). Moreover, in this case, the Hölder inequality applied in (27) is tight.
Note that our meta log-Sobolev inequality can be viewed as a generalization of a log-Sobolev inequality introduced in [17, Theorem 3.1] which holds only for the generator of the quantum-limited attenuator channel.
3.1 Meta Log-Sobolev Inequality for Multimode States
For some applications, it is useful to consider the function \(\Upsilon (\cdot )\) in the multimode case and prove a generalization of Theorem 1 for multimode quantum states. In this subsection, we establish such a generalization but for the special cases of multimode states that can be prepared by applying a Gaussian unitary on multimode states that are diagonal in the Fock basis, which include all multimode Gaussian states, and multimode states that can be prepared by applying a passive Gaussian unitary on any product state.
For any m-mode quantum state \(\rho \) with finite mean photon number define
where as before \(\nu _{0}, \nu _{1},\omega \ge 0\) and \({\mathcal {L}}_j = \nu _{0}{\mathcal {L}}_{0, j}+\nu _{1}{\mathcal {L}}_{1, j}\) is the Lindbladian acting on the j-th mode. Note that we still use \(\Upsilon (\rho )=\Upsilon _{1}(\rho )\) for the single-mode case. To establish our results on the m-mode generalizations of Theorem 1, we first state a lemma.
Lemma 2
For any m-mode state \(\rho \) the followings hold:
-
(i)
Let \(\rho ' = D_{ \xi } \rho D_\xi ^\dagger \) where \(D_\xi = D_{\xi _1}\otimes \dots \otimes D_{\xi _m}\) is an m-mode displacement operator. Suppose that \({{\textrm{tr}}}(\rho {\textbf{a}}_j)=0\) for all j. Then, \(\Upsilon _m(\rho ')\ge \Upsilon _m(\rho )\).
-
(ii)
Let \(\rho ' = U\rho U^\dagger \) where U is a passive transformation, i.e., U is a Gaussian unitary that commutes with \(H_m=\sum _j {\textbf{a}}_j^\dagger {\textbf{a}}_j\). Then, \(\Upsilon _m(\rho ')= \Upsilon _m(\rho )\).
-
(iii)
Let \(\rho ' = S_r\rho S_r^\dagger \) where \(S_r = e^{\frac{1}{2}\sum _j r_j({\textbf{a}}_j^2 - ({\textbf{a}}_j^\dagger )^2)}\) is a squeezing transformation. Suppose that \({{\textrm{tr}}}\big (\rho {\textbf{a}}_j^2\big )={{\textrm{tr}}}\big ( \rho ^{1/p} {\textbf{a}}_j \rho ^{1/{\hat{p}}} {\textbf{a}}_j\big )=0\) for all j. Then, \(\Upsilon _m(\rho ')\ge \Upsilon _m(\rho )\).
Proof
(i) We note that \(D_\xi ^\dagger {\textbf{a}}_j D_\xi = {\textbf{a}}_j + \xi _j\). Therefore, \(D_\xi ^\dagger {\textbf{a}}_j^\dagger {\textbf{a}}_j D_\xi = {\textbf{a}}_j^\dagger {\textbf{a}}_j + \xi _j {\textbf{a}}_j^\dagger + {{\bar{\xi }}}_j {\textbf{a}}_j + |\xi _j|^2\). Moreover,
where for the second equality we use the assumption \({{\textrm{tr}}}(\rho {\textbf{a}}_j)=0\). We similarly have
Therefore, we have
By using similar computations, we can also show that \(\big \langle {\mathcal {L}}_{1,j}(\rho '^{1/p}), \rho '^{1/{\hat{p}}}\big \rangle = \big \langle {\mathcal {L}}_{1,j}(\rho ^{1/p}), \rho ^{1/{\hat{p}}}\big \rangle \). On the other hand, \(S(\rho ') = S(D_\xi \rho D_\xi ^\dagger ) = S(\rho )\). Putting these together, we find that
(ii) For any passive transformation U, there is an \(m\times m\) unitary matrix \((u_{jk})_{j,k}\) such that \(U^\dagger {\textbf{a}}_j U = \sum _{k} u_{jk} {\textbf{a}}_{k}\). Employing this relation yields
where we used \(\sum _{j}u_{jk}{\bar{u}}_{jk'}=\delta _{k,k'}\). Applying similar computations, one can also verify that \(\big \langle {\mathcal {L}}_{b,j}(\rho '^{1/p}), \rho '^{1/{\hat{p}}}\big \rangle = \big \langle {\mathcal {L}}_{b,j}(\rho ^{1/p}), \rho ^{1/{\hat{p}}}\big \rangle \) for \(b\in \{0,1\}\). Therefore, using these relations together with \(S(\rho ') = S(\rho )\), we obtain \(\Upsilon _m(\rho ')=\Upsilon _m(\rho )\).
(iii) The proof of this part is more involved. First, we note that \(S_r^\dagger {\textbf{a}}_j S_r = \cosh (r_j){\textbf{a}}_j - \sinh (r_j){\textbf{a}}_j^\dagger \), which using \(\cosh ^2(r_j) - \sinh ^2(r_j)=1\) implies
Using these relations and applying the assumption on \(\rho \), we find that
Let \(\rho =\sum _k \lambda _k|\psi _k\rangle \langle \psi _k|\) be the eigen-decomposition of \(\rho \). Then, by Young’s inequality we have
Therefore, we can see that
and similarly \( \big \langle {\mathcal {L}}_{1,j}\big (\rho '^{1/p}\big ), \rho '^{1/{\hat{p}}}\big \rangle \ge \big \langle {\mathcal {L}}_{1,j}\big (\rho ^{1/p}\big ), \rho ^{1/{\hat{p}}}\big \rangle \). Moreover, once again using (29) we have
We also have \(S(\rho ')=S(\rho )\). Putting these together we arrive at \(\Upsilon _m(\rho ')\ge \Upsilon _m(\rho )\). \(\square \)
Theorem 3
For any \(\nu _0, \nu _1, \omega \ge 0\) let \(\eta _{{{\textrm{th}}}}\) be given by (19). Then, for any multimode state \(\rho '=U_{\textrm{G}}\, \rho U_{\textrm{G}}^\dagger \) where \(U_{\textrm{G}}\) is an m-mode Gaussian unitary, we have
assuming that one of the following conditions is satisfied:
-
(a)
\(\rho =\rho _{1}\otimes \cdots \otimes \rho _{m}\) is a product state and \(U_G\) is passive.
-
(b)
\(\rho =\rho _{{\textrm{diag}}}\) is diagonal in the Fock basis.
Proof
We first note that by the definition of \(\Upsilon _m(\cdot )\) and using Theorem 1 we have
Then, (a) follows from part (ii) of Lemma 2. To prove (b), we use the Bloch-Messiah decomposition to express the Gaussian unitary \(U_G\) as \(U_{\textrm{G}}=D_\xi U S_r V\), where \(D_\xi \) is a multimode displacement operator, U, V are passive Gaussian transformations, and \(S_r\) is a product of single-mode squeezing transformations [41]. Therefore, the multimode state can be written as
where \(\rho _{{\textrm{diag}}}\) is diagonal in the Fock basis. We use this expression to prove (30) in the following steps:
-
By Theorem 1, the meta log-Sobolev inequality holds for all single-mode diagonal states. Then, applying a standard tensorization argument in the classical case, i.e., using the chain rule and the concavity of the entropy function (see, e.g., [38, Proposition 3.7]), we find that \(\Upsilon _m(\rho _{\textrm{diag}})\ge \eta _{{{{\textrm{th}}}}}\). Then, by part (ii) of Lemma 2, we have \(\Upsilon _m(V \rho _{{\textrm{diag}}} V^\dagger )=\Upsilon _m(\rho _{{\textrm{diag}}})\ge \eta _{{{\textrm{th}}}}\).
-
Next we use part (iii) of Lemma 2 to establish (30) for \(S_r V \rho _{{\textrm{diag}}} V^\dagger S_r^\dagger \). To this end, we need to verify the required assumptions \({{\textrm{tr}}}\big (V \rho _{{\textrm{diag}}} V^\dagger {\textbf{a}}_j^2\big )={{\textrm{tr}}}\big ( (V \rho _{{\textrm{diag}}} V^\dagger )^{1/p} {\textbf{a}}_j (V \rho _{{\textrm{diag}}} V^\dagger )^{1/{\hat{p}}} {\textbf{a}}_j\big )=0\). We note that \({{\textrm{tr}}}\big (V \rho _{{\textrm{diag}}} V^\dagger {\textbf{a}}_j^2\big ) = {{\textrm{tr}}}\big (\rho _{{\textrm{diag}}} (V^\dagger {\textbf{a}}_j V)^2\big )\) and \(V^\dagger {\textbf{a}}_j V= \sum _{k} v_{jk} {\textbf{a}}_{k}\) is a linear combination of \({\textbf{a}}_{k}\)’s. Moreover, \({{\textrm{tr}}}(\rho _{{\textrm{diag}}} {\textbf{a}}_k{\textbf{a}}_{k'})=0\) for any \(k, k'\) simply because \(\rho _{{\textrm{diag}}}\) is diagonal in the Fock basis. Therefore, we have \({{\textrm{tr}}}\big (V \rho _{{\textrm{diag}}} V^\dagger {\textbf{a}}_j^2\big )=0\). By the same argument, we also have \({{\textrm{tr}}}\big ( (V \rho _{{\textrm{diag}}} V^\dagger )^{1/p} {\textbf{a}}_j (V \rho _{{\textrm{diag}}} V^\dagger )^{1/{\hat{p}}} {\textbf{a}}_j\big )=0\). Hence, \(\Upsilon _m(S_r V \rho _{{\textrm{diag}}} V^\dagger S_r^\dagger )=\Upsilon _m(V \rho _{{\textrm{diag}}} V^\dagger )=\Upsilon _m(\rho _{{\textrm{diag}}})\ge \eta _{{{\textrm{th}}}}\).
-
Once again, using part (ii) of Lemma 2 we find that (30) holds for \(U S_r V \rho _{{\textrm{diag}}} V^\dagger S_r^\dagger U^\dagger \) since we have already verified it for \(S_r V \rho _{{\textrm{diag}}} V^\dagger S_r^\dagger \) and U is a passive transformation.
-
Finally, we note that \(V^\dagger S_r^\dagger U^\dagger {\textbf{a}}_j U S_r V\) is a linear combination of \({\textbf{a}}_k\)’s and \({\textbf{a}}_k^\dagger \)’s. Moreover, \({{\textrm{tr}}}(\rho _{{\textrm{diag}}} {\textbf{a}}_k) ={{\textrm{tr}}}(\rho _{{\textrm{diag}}} {\textbf{a}}_k^\dagger )=0\). Therefore, \({{\textrm{tr}}}(U S_r V \rho _{{\textrm{diag}}} V^\dagger S_r^\dagger U^\dagger {\textbf{a}}_j)=0\) for all j. Thus, by part (i) of Lemma 2, inequality (30) holds for \(\rho = D_\xi U S_r V \rho _{{\textrm{diag}}} V^\dagger S_r^\dagger U^\dagger D_\xi ^\dagger \). \(\square \)
Notice that Theorem 3 implies \(\Upsilon _m(\rho )\ge \eta _{{{{\textrm{th}}}}}\) for all multimode Gaussian states. The point is that by Williamson’s theorem any Gaussian state can be transformed into a tensor product of thermal states using a Gaussian unitary, and thermal states are diagonal in the Fock basis [41].
4 Log-Sobolev Inequalities for the Quantum Ornstein–Uhlenbeck Semigroup
As discussed in Sect. 2, the semigroup of attenuator channels is sometimes called the quantum (bosonic) Ornstein–Uhlenbeck semigroup.Footnote 10 In this section, we explicitly compute the optimal p-log-Sobolev constant for this semigroup for any \(1\le p\le 2\), and derive a quantum variant of the celebrated log-Sobolev inequality of Gross [25]. Due to the equivalence of log-Sobolev inequalities and hypercontractivity inequalities for quantum Markov semigroups [2, 35, 39], our results provide the optimal hypercontractivity inequalities for the quantum Ornstein–Uhlenbeck semigroup.
We need to develop some notations to present our results. Let \(\Phi _t=e^{-t{\mathcal {L}}}\) be the quantum attenuator channel with
where \(\beta >0\) is some parameter. The adjoint of the channel with respect to the Hilbert–Schmidt inner product is denoted by \(\Phi _t^*\) and describes the evolution in the Heisenberg picture. Then, \(\Phi _t^*=e^{-t{\mathcal {L}}^*}\) whereFootnote 11
The semigroup \(\{\Phi _t:\, t\ge 0\}\) has a fixed point which we denote by \(\sigma =\sigma _\beta \):
Then, it is natural to define a weighted inner product with respect to this state:
where
We emphasize that the weighted inner product \(\langle \cdot , \cdot \rangle _\sigma \) should not be confused with the Hilbert–Schmidt inner product \(\langle \cdot , \cdot \rangle \) that has no subscript. This inner product induces the 2-norm
where \(|Y| = \sqrt{Y^\dagger Y}\). It can be verified that \({\mathcal {L}}=\Gamma _\sigma \circ {\mathcal {L}}^*\circ \Gamma _\sigma ^{-1}\) and that \({\mathcal {L}}^*\) is self-adjoint with respect to this weighted inner product. Thus, we may consider the Dirichlet form associated with \({\mathcal {L}}^*\) given by
Here, \(p\ge 1\) and the case of \(p=1\) is understood in the limit as
Now, a p-log-Sobolev inequality with parameters \(c\ge 0\) takes the form:
where \(D(\rho \Vert \sigma )={{\textrm{tr}}}(\rho \log \rho ) - {{\textrm{tr}}}(\rho \log \sigma )\) is Umegaki’s relative entropy. The optimal constant c for which the above inequality holds is usually denoted by \(\alpha _p\), i.e.,
where the infimum is taken over all states with finite mean photon number. We note that, as it will become clear in the proof of the following theorem, similar to \(\Upsilon (\rho )\), \({\mathcal {E}}_p(\rho )\) is well defined for any state \(\rho \) with finite mean photon number.
Theorem 4
For any \(\beta >0\), the p-log-Sobolev constant of the quantum Ornstein–Uhlenbeck semigroup is given by
if \(p>1\) and \(\alpha _1 = \lim _{p\rightarrow 1^+}\alpha _p = \frac{1}{2} \sinh (\beta /2)\).
Proof
We note that \(\sigma ^s{\textbf{a}}= e^{s\beta } {\textbf{a}}\sigma ^s\) for every s, and \(\sigma \) commutes with \({\textbf{a}}^\dagger {\textbf{a}}\). Then, for a single-mode state \(\rho \) we have
where \({\mathcal {L}}_0, {\mathcal {L}}_1\) are given in (15) and
On the other hand, since \(\sigma = (1-e^{-\beta }) e^{-\beta {\textbf{a}}^\dagger {\textbf{a}}}\), we have
Putting these together, we find that for \(\alpha _p\) given by (33) we have
where \(\Upsilon (\rho )\) is given in (16) with \(\nu _0 = \frac{\nu '_0}{\alpha _p}\), \(\nu _1=\frac{\nu '_1}{\alpha _p}\) and \(\omega =\frac{\omega '}{\alpha _p}-\beta =0\). Therefore, to prove the theorem we can use Theorem 1 to conclude that it suffices to show that for any thermal state \(\tau \) we have
where \(\alpha _p\) is given by (33), and it is the best possible such constant.
By the above computations, we have
Here, for convenience we use a new parametrization for thermal states:
Using this, we have
Therefore, we get
We also have
where \(d(y^2\Vert x^2) = y^2\log y^2 + (1-y^2)\log (1-y^2) -y^2\log x^2-(1-y^2)\log (1-x^2) \) is the binary relative entropy function. Hence, (35) is equivalent to \(\phi (x, y)\ge 0\) for all \(0<x, y<1\) where \(x^2=e^{-\beta }\) and
Now the idea is to fix y and think of \(\phi (x, y)\) as a function of x. It is shown in Appendix C that
We argue that this derivative vanishes on the interval (0, 1) only if \(x=y\). To this end, we use \(\log t< t-1\) for \(1\ne t\in \big \{x^{\frac{2}{p}}, x^{\frac{2}{{\hat{p}}}}\big \}\) to conclude that for \(0<x<1\),
Therefore, \( \frac{{\textrm{d}}}{{\textrm{d}}x}\phi (x, y)\) is nonzero for \(0<x\ne y<1\) and its sign depends on whether \(x>y\) or \(x<y\). In fact, if \(0<x_1<y<x_2<1\), then
Thus, the minimum of \(\phi (x, y)\), as a function of x, is achieved at \(x=y\) and we have \(\phi (y, y)=0\). This means that \(\phi (x, y)\ge 0\) for all \(0<x, y<1\), and (35) holds for any thermal state \(\tau \). Also, the limiting case of \(y\rightarrow 1^-\) confirms that the constant \(\alpha _p\) in (35) is optimal. \(\square \)
We remark that some authors define the quantum Ornstein–Uhlenbeck semigroup by considering arbitrary parameters \(\nu _0, \nu _1\) satisfying \(\nu _1>\nu _0>0\), while here we study only the case of \(\nu _0=\nu _1^{-1}=e^{-\beta /2}\). Nevertheless, by a rescaling argument all the log-Sobolev constants of these semigroups can also be computed using Theorem 4.
Extending the definition of the weighted 2-norm, for any \(p\ge 1\) we may define
We let \(L_p(\sigma )\) to be the closure of the space of bounded operators under this norm.
The following corollary is a consequence of the above theorem, and [39, Theorem 3.8] (see also [2, Theorem 11]).
Corollary 5
Let \(\Phi _{t} = e^{-t{\mathcal {L}}}\) where \({\mathcal {L}}\) is given by (3138). Also, let \(\Phi ^*_{t} = e^{-t{\mathcal {L}}^*}\). Then, for any \(1< q\le p< +\infty \) and operator \(X\in L_q(\sigma )\) we have
where \(\alpha _2= \frac{4}{\beta }\sinh ^2(\beta /4)\).
Proof
It is well known that it suffices restrict to operators that are positive semidefinite (see, e.g., [2]). For \(p\ge q\) define \(t(p):= \frac{1}{4\alpha _2} \log \frac{p-1}{q-1}\) and note that \(t(q)=0\). For a positive semidefinite operator X let \(X_p:= \Phi _{t(p)}^*(X)\) and define \(f(p):=\Vert X_p\Vert _{p, \sigma } - \Vert X\Vert _{q, \sigma }\). Our goal is to show that \(f(p)\le 0\) for any \(p\ge q\). To this end, since \(f(q)=0\), it suffices to verify that \(f'(p)\le 0\). Computing the derivative (see the details in the proof of [2, Theorem 11]) we find that
where
It is shown in Appendix D that \( \alpha _p\ge \alpha _2\) for all p. Then, the negativity of \(f'(p)\) follows from the log-Sobolev inequality of Theorem 4.
For the above argument, we need to make sure that \(\Phi _{t}^*(X)\) is a well-defined operator. To this end, we notice that the quantum Ornstein–Uhlenbeck semigroup satisfies the Feller property with respect to the algebra of compact operators [9, Theorem 5.1]. In particular, \(\Phi _t^*(X)\) is a well-defined compact operator if X is compact. Thus, as also done in the proof of [39, Theorem 3.8], in the above argument we may restrict to compact operators X, and after proving the inequality for such an operator, generalize to arbitrary \(X\in L_q(\sigma )\) by a continuity argument. We also note that since \(\sigma \) is Gaussian, when X is compact and then bounded, the corresponding density operator \(\rho _p\) in the above argument has a finite mean photon number. Thus, the log-Sobolev inequality of Theorem 4 can indeed be employed. \(\square \)
4.1 Log-Sobolev Inequality for Multimode States
It is well known that log-Sobolev constants for classical Markov semigroups satisfy the tensorization property, meaning that for generators \({\mathcal {K}}_1, {\mathcal {K}}_2\) of two classical Markov semigroups we have \(\alpha _p({\mathcal {K}}_1\otimes I + I\otimes {\mathcal {K}}_2)= \min \{\alpha _p({\mathcal {K}}_1), \alpha _p({\mathcal {K}}_2)\}\), where \({\mathcal {K}}_1\otimes {\mathcal {I}} + {\mathcal {I}}\otimes {\mathcal {K}}_2\) is the generator of the tensor product semigroup \(\{e^{-t{\mathcal {K}}_1}\otimes e^{-t{\mathcal {K}}_2}:\, t\ge 0\}\). This tensorization property is known only for some special quantum Markov semigroups; see [2] and references therein for more details. Thus, a natural question is whether the tensorization property holds for the quantum Ornstein–Uhlenbeck semigroup. To explore this problem we first develop some notations.
For parameters \(\beta _1, \dots , \beta _m>0\) let
be the generator of an m-mode quantum Markov semigroup where
Then, the quantum channels
form an m-mode semigroup. The fixed point of this semigroup is
and its corresponding Dirichlet form is equal to
Now, as in the single mode case we are interested in the p-log-Sobolev constant
Assuming that the optimal states in the above optimization problem are Gaussian, the following corollary is an evidence for the tensorization property.
Corollary 6
Suppose that \(\beta _1=\cdots =\beta _m= \beta \). Then, for any m-mode Gaussian state \(\rho \) we have
where \(\alpha _{p} = \frac{p{\hat{p}}}{4\beta } e^{\beta /2}\big (1-e^{-\beta /p}\big )\big (1-e^{-\beta /{\hat{p}}}\big )\).
Proof
This is a direct consequence of Theorem 3 as well as Theorem 4 and its proof. \(\square \)
Our next goal is to prove a lower bound on \({{\widehat{\alpha }}}_2\) which probably is not optimal, yet may be useful for some applications. To this end, it is useful to consider a correspondence between density operators \(\rho \) and positive semidefinite operators X as follows:
Then, we note that \(\Vert X\Vert _{2, {{\widehat{\sigma }}}}^2 = {{\textrm{tr}}}(\rho )=1\). Indeed, \(\Gamma _{{{\widehat{\sigma }}}}^{-1/2}\) is an isometric embedding of the space of Hilbert–Schmidt operators into \(L_2({{\widehat{\sigma }}})\), the Hilbert space of operators equipped with norm \(\Vert \cdot \Vert _{2, {{\widehat{\sigma }}}}\). With this correspondence, we have
From this equation, we realize that the spectrum of \({{\widehat{{\mathcal {L}}}}}^*\) is a relevant object in the study of the Dirichlet form. Here, we consider \({{\widehat{{\mathcal {L}}}}}^*\) as an operator acting on the Hilbert space \(L_2({{\widehat{\sigma }}})\). We note that, since \({{\widehat{\sigma }}}\) is a Gaussian state, any operator X that is a polynomial of operators \({\textbf{a}}_j, {\textbf{a}}_j^\dagger \) belongs to \(L_2({{\widehat{\sigma }}})\). Letting \({\mathcal {P}}\) be the space of these polynomial operators, we find that indeed \({\mathcal {P}} \subset L_2({{\widehat{\sigma }}})\) is a dense subset. Moreover, by definition, \({{\widehat{{\mathcal {L}}}}}^*\) leaves this subspace invariant: \({{\widehat{{\mathcal {L}}}}}^*({\mathcal {P}})\subseteq {\mathcal {P}}\). Thus, the domain of \({{\widehat{{\mathcal {L}}}}}^*\) contains the dense subset \({\mathcal {P}}\) [9]. In the following proposition, we summarize Theorem 7.2 of [9] regarding the spectrum of \({{\widehat{{\mathcal {L}}}}}^*\) acting on \(L_{2}({{\widehat{\sigma }}})\), and for the sake of completeness present its main proof idea in Appendix E.Footnote 12
Proposition 7
[9]. Consider \({\widehat{{\mathcal {L}}}}^*\) as an operator acting on the Hilbert space \(L_2({{\widehat{\sigma }}})\) equipped with norm \(\Vert \cdot \Vert _{2, {{\widehat{\sigma }}}}\). For \(z_1, \dots , z_m\in {\mathbb {C}}\) with \(|z_j|=1\) define the quadrature operator \({\textbf{q}}_{j,z_j} = \frac{1}{\sqrt{2}} (z_j{\textbf{a}}_j^\dagger + {{\bar{z}}}_j{\textbf{a}}_j)\). Also, for any j, let \(\{h_{j,k_j}(t):\, k_j\ge 0\}\) be the set of Hermite polynomials specified by
Then, for any tuple \(k=(k_1, \dots , k_m)\) of non-negative integers, the operator \(V_k:=\bigotimes _{j=1}^m h_{j,k_j}({\textbf{q}}_{j,z_j})\) is an eigenvector of \({{\widehat{{\mathcal {L}}}}}^*\) with eigenvalue \(\sum _{j=1}^m\sinh (\beta _j/2)k_j\). Moreover, these operators (and their appropriate linear combinations) are all the eigenvectors of \({{\widehat{{\mathcal {L}}}}}^*\).
For any \(\ell =(\ell _1, \dots , \ell _m) \in {\mathbb {Z}}^m\), let \({\mathcal {F}}_\ell \subset L_2({{\widehat{\sigma }}})\) be the space of operators X such that \(\Vert X\Vert _{2, {{\widehat{\sigma }}}}<+\infty \) and for any \(n=(n_1, \dots , n_m)\) their matrix entries in the Fock basis satisfy
Then, any operator X with \(\Vert X\Vert _{2, {{\widehat{\sigma }}}}<+\infty \) can be decomposed as
Following [6] we call (40) the diagonal decomposition of X. A crucial property of the diagonal decomposition is that \({\mathcal {F}}_\ell \)’s are orthogonal subspaces, and \(\langle X_\ell , X_{\ell '}\rangle _{{{\widehat{\sigma }}}}=0\) if \(\ell \ne \ell '\). This is a consequence of the fact that \({{\widehat{\sigma }}}\) is diagonal in the Fock basis.
Corollary 8
[9]. For any \(\ell \in {\mathbb {Z}}^m\) the subspace \({\mathcal {F}}_\ell \) is invariant under \({{\widehat{{\mathcal {L}}}}}^*\), and the eigenvalues of \({{\widehat{{\mathcal {L}}}}}^*\) restricted to \({\mathcal {F}}_\ell \) are contained in \(\big \{\sum _j \sinh (\beta _j/2)k_j:~ k_j\ge |\ell _j| \big \}\). In particular, for any \(X_\ell \in {\mathcal {F}}_\ell \) we have
Proof
The fact that \({\mathcal {L}}^*({\mathcal {F}}_\ell )\subseteq {\mathcal {F}}_\ell \) is easily verified using the definition of \({{\widehat{{\mathcal {L}}}}}^*\). Therefore, considering the diagonal decomposition \(V_k= \sum _\ell V_{k, \ell }\) of the eigenvector \(V_k\) defined in Proposition 7, we find that if \(V_{k, \ell }\ne 0\), then it is also an eigenvector of \({{\widehat{{\mathcal {L}}}}}^*\) with the same eigenvalue as that of \(V_k\). On the other hand, by definition, \(V_k\) is a polynomial of degree \(k_j\) in terms of \({\textbf{a}}_j\) and \({\textbf{a}}_j^\dagger \). This means that we have \(V_{k, \ell }=0\) if \(|\ell _j|>k_j\) for some j. We also note that since by Proposition 7, the closure of the span of \(V_k\)’s is the whole space \(L_2({{\widehat{\sigma }}})\), the closure of the span of operators \(V_{k, \ell }\) equals \({\mathcal {F}}_\ell \). Indeed, \({\mathcal {F}}_\ell = \overline{\text {span}\{V_{k, \ell }:\, k_j\ge |\ell _j|, \forall j\}}\). Putting these together the desired result is implied. \(\square \)
To state our next lemma, it is convenient to extend the definition of the entropy function for operators that are not necessarily normalized. For any positive operator \(X=\Gamma _{{{\widehat{\sigma }}}}^{-1/2}(\rho ^{1/2})\) where \(\rho \) is a density operator, we define
Extending this definition to non-normalized operators satisfying \(\Vert X\Vert _{2, {{\widehat{\sigma }}}}<+\infty \), we let
The significance of this entropy function is in its connection to the derivative of \(p\mapsto \Vert X\Vert _{p, {{\widehat{\sigma }}}}\) at \(p=2\) for which we refer to [2, 35, 39].
Lemma 9
Let X be a positive operator satisfying \({{\textrm{tr}}}\big (\rho H_m \big )<+\infty \) where \( \rho = \Gamma _{{{\widehat{\sigma }}}}^{1/2}(X)^2 \) and \(H_m=\sum _{j=1}^m {\textbf{a}}_j^\dagger {\textbf{a}}_j\) is the m-mode number operator. Then, for any vector \(w=(w_\ell )_\ell \) of positive real numbers we have
where \(X_\ell \)’s are given by the diagonal decomposition \(X= \sum _\ell X_\ell \) with \(X_\ell \in {\mathcal {F}}_\ell \). Also, \(I_{2,2}(X_\ell ) = \Gamma _{{{\widehat{\sigma }}}}^{-1/2}\big (\big |\Gamma _{{{\widehat{\sigma }}}}^{1/2}(X_\ell ) \big |\big )\) and \(\Vert w\Vert _2^2 = \sum _\ell w_\ell ^2\).
Proof
Fix a finite subset \(S\subset {\mathbb {Z}}^m\) satisfying \(-\ell \in S\) for all \(\ell \in S\). For some technical reason, in the following we restrict to the subspace of operators X satisfying \(X_\ell =0\) if \(\ell \notin S\). Later, we will relax this assumption.
For any \(z\in {\mathbb {C}}\) define the map \(T_z\) by
where as before \(X=\sum _\ell X_\ell \) is the diagonal decomposition of X. Note that by the above assumption this sum is indeed a finite sum and over \(\ell \in S\). By the orthogonality of the subspaces \({\mathcal {F}}_\ell \) for any \(t\in {\mathbb {R}}\) we have
Next, recall the definition of norm \(\Vert \cdot \Vert _{p, {{\widehat{\sigma }}}}\) given by (37). Using the triangle and Cauchy–Schwarz inequalities, for any \(t\in {\mathbb {R}}\) we have
For any \(1\le p\le +\infty \) define
It is well known and can be easily verified that \(\Vert \cdot \Vert _{2, p, {{\widehat{\sigma }}}}\) satisfies the triangle inequality and is really a norm. Moreover, these norms form an interpolation family [40]. Now, with this notation, (42) and (43) imply that
Therefore, by the interpolation inequality [37], we have
This means that
We note that this inequality turns into an equality for \(p=2\). Therefore, we must have
Computing the derivative of both sides by [2, Proposition 3] and using the fact that \(X_{\ell }^\dagger = X_{-\ell }\), the desired inequality is obtained.
Now we relax the assumption that \(X_\ell \) is nonzero only for finitely many \(\ell \). Assume that \(X= \Vert X\Vert _{2, {{\widehat{\sigma }}}} \Gamma _{{{\widehat{\sigma }}}}^{-1/2}\big (\rho ^{1/2}\big )\) where \(\rho \) is a quantum state with finite mean photon number, in which case
Let \(\Pi _k\) be the projection on the span of basis vectors \(|n_1, \dots , n_m\rangle \) satisfying \(n_1+\cdots + n_m\le k\). Let \(X^{(k)} = \Pi _k X \Pi _k\) and consider the quantum state \(\rho ^{(k)}=\Vert X^{(k)}\Vert ^{-2}_{2, {{\widehat{\sigma }}}} \Gamma _{{{\widehat{\sigma }}}}^{-1/2}\big (X^{(k)}\big )^2\). Observe that \(X_\ell ^{(k)}\) is nonzero only for finitely many values of \(\ell \). Then, by the above argument (41) holds for \(X^{(k)}\). Taking the limit of \(k\rightarrow +\infty \), clearly \(\big \Vert X_\ell ^{(k)}\big \Vert _{2, {{\widehat{\sigma }}}}\) tends to \(\Vert X_\ell \Vert _{2, {{\widehat{\sigma }}}}\), which also implies that the 2-norm of \(X^{(k)}\) tends to that of X. For the entropy terms, it can be verified that the mean photon number of \(\rho ^{(k)}\) tends to that of \(\rho \). This implies that \({{\textrm{tr}}}\big (\rho ^{(k)} \log {{\widehat{\sigma }}}\big )\) tends to \({{\textrm{tr}}}(\rho \log {{\widehat{\sigma }}})\). Next, using the assumption \({{\textrm{tr}}}(\rho H_m)<+\infty \), we can apply the continuity bound for the entropy function [43, Lemma 18] to conclude that \(S\big (\rho ^{(k)}\big )\) tends to \(S(\rho )\). Therefore, \({{\textrm{Ent}}}_{2, {{\widehat{\sigma }}}}\big (X^{(k)}\big )\rightarrow {{\textrm{Ent}}}_{2, {{\widehat{\sigma }}}}(X)\) and similarly \({{\textrm{Ent}}}_{2, {{\widehat{\sigma }}}}\big (I_{2,2}\big (X^{(k)}_\ell \big ) \rightarrow {{\textrm{Ent}}}_{2, {{\widehat{\sigma }}}}\big (I_{2,2}(X_\ell )\) as \(k\rightarrow +\infty \). Putting these together the desired inequality for X is implied. \(\square \)
We remark that Lemma 9 holds beyond the diagonal decomposition considered above and works for any decomposition of the space of operators into orthogonal subspaces. Thus, this lemma is of independent interest and may find other applications.
Theorem 10
For any \(\beta _1, \dots , \beta _m>0\) we have
where \(\beta _{\min } = \min _j \beta _j\) and \(\alpha _{2, \min } = \min _j \frac{4\sinh ^2(\beta _j/4)}{\beta _j} = \frac{4\sinh ^2(\beta _{\min }/4)}{\beta _{\min }}\).
Proof
We need to show that
for any X satisfying \({{\textrm{tr}}}\big (\Gamma _{{{\widehat{\sigma }}}}^{1/2}(X)^2 H_m\big )<+\infty \). We use Lemma 9 for the choice of \(w_\ell =e^{-\frac{c}{2} |\ell |}\) where \(c>0\) and \(|\ell | = \sum _{j=1}^m |\ell _j|\). We have \(\log \Vert w\Vert _2^2 - \log w_\ell ^2 = c|\ell | +m\log \frac{e^c+1}{e^c-1}\) and
Now, since \(X_\ell \in {\mathcal {F}}_\ell \), by Corollary 8 we have
Next, using the fact that \(X_\ell \in {\mathcal {F}}_\ell \), it is easily verified that the operator \(I_{2,2}(X_\ell )\) is diagonal. On the other hand, the restriction of \({{\widehat{{\mathcal {L}}}}}^*\) to diagonal operators is essentially a classical Markov semigroup for which the tensorization property holds. Thus, the 2-log-Sobolev constant of \({{\widehat{{\mathcal {L}}}}}^*\) restricted to diagonal operators equals the minimum of the 2-log-Sobolev constants of \({\mathcal {L}}^*_j\)’s restricted to (single-mode) diagonal operators. The latter quantity is computed in Theorem 4. Putting these together, we conclude that
Using the above two inequalities in (45), we find that
It is shown in [2, Lemma 23] that for any operator Y,
Therefore, using \(X_\ell ^\dagger = X_{-\ell }\), the above inequalities imply
Next, by Proposition 7 the generator \({{\widehat{{\mathcal {L}}}}}^*\) has the spectral gap \(\sinh (\beta _{\min }/2)\). Thus, by [39, Theorem 4.2] the above inequality implies
Finally, letting \(c=\log (2m+1 )\) and using \(\log (1+\frac{1}{\,}m)\le \frac{1}{\,}m\) the desired inequality (44) is obtained. \(\square \)
5 Proof of the CMOE Conjecture
In this section, we present an alternative proof of the CMOE conjecture for single-mode phase-covariant Gaussian channels, which is first established in [15, 16].
Theorem 11
(CMOE Conjecture). For any single-mode phase-covariant Gaussian channel \(\Phi \), and any quantum state \(\rho \) with finite mean photon number we have
where \(\tau \) is a single-mode thermal state satisfying \(S(\rho ) = S(\tau )\).
To prove this theorem, we first state a consequence of Theorem 1.
Theorem 12
Let \(\{\Phi _t:\, t\ge 0\}\) denote a semigroup of single-mode phase-covariant Gaussian channels. Then, for any \(\alpha > 0\) we have
where the infimum is taken over all quantum states \(\rho \) with finite mean photon number, and \(\eta _{{{\textrm{th}}}}(\alpha )\) is the infimum of the same function restricted to thermal states.
Proof of Theorem 12
Let \({\mathcal {L}}\) be the Lindbladian of the semigroup \(\{\Phi _t:\, t\ge 0\}\) so that \(\Phi _t=e^{-t{\mathcal {L}}}\). We have
Then, the claim follows from Theorem 1 for \(p=1\) and appropriate choices of \(\nu _0, \nu _1\ge 0\) and \(\omega =0\). \(\square \)
We need yet another technical ingredient to prove Theorem 11. For \(0<x<1\), let
be a thermal state with parameter x. Then, letting \(\Phi _t= e^{-t{\mathcal {L}}}\) with \({\mathcal {L}}=\nu _0{\mathcal {L}}_0+\nu _1{\mathcal {L}}_1\) and using (48), we find that the objective function in (47) for thermal states equals
This equation can also be derived by taking the limit of \(p\rightarrow 1^+\) in (18).
Assume that \(\nu _0>0\) and as before let \(\alpha >0\). Then, we have \(\lim _{x\rightarrow 0^+ }f_\alpha (x) = \lim _{x\rightarrow 1^- }f_\alpha (x) = +\infty \). This means that the minimum of \(f_\alpha (x)\) is attained at roots of \(f'_\alpha (x)\). We have
Therefore, \(f'_\alpha (x)=0\) iff \(g(x)=\alpha \) where
Computing the derivative of g(x), we have
Then, using \(\log x\le x-1\) and \(-\log x= \log \frac{1}{x}\le \frac{1}{x}-1\) we find that \(g'(x)\le 0\) for \(0<x<1\), and that g(x) is monotone decreasing in this interval. Moreover, since \(\nu _0>0\), we have \(\lim _{x\rightarrow 0^+} g(x)=+\infty \) and \(\lim _{x\rightarrow 1^-} g(x) = 0\). This means that \(g(x)\ge 0\) for all \(0<x<1\), and g(x) takes any positive value in this interval exactly once. As a conclusion, \(g(x)=\alpha \) has exactly one solution in (0, 1) for any \(\alpha >0\). We conclude that if \(\alpha >0\), the minimum of \(f_\alpha (x)\) is attained at the unique root of \(f'_\alpha (x)\). More importantly, for any \(0<x_0<1\), there is \(\alpha _{0}=g(x_0)\) such that the minimum of \(f_{\alpha _{0}}(x)\) is attained at \(x=x_0\).
Proof of Theorem 11
As a phase-covariant Gaussian channel, we have \(\Phi =\Phi _{t_1}\) for some phase-covariant Gaussian semigroup \(\{\Phi _t:\, t\ge 0\}\) and some \(t_1>0\).Footnote 13 Let \({\mathcal {L}}=\nu _0{\mathcal {L}}_0+\nu _1{\mathcal {L}}_1\) be the Lindbladian of this semigroup. In order to apply the above computations, by taking the limit of \(\nu _0\rightarrow 0^+\) we assume with no loss of generality that \(\nu _0>0\). The point is that if \(S\big (\Phi _{t_1}(\rho )\big )\ge S(\Phi _{t_1}(\tau ))\) holds for any \(\nu _0>0\), then by the continuity of von Neumann entropy [43, Lemma 18], it holds for \(\nu _0=0\) as well.
Suppose that \(S\big (\Phi _{t_1}(\rho )\big )< S(\Phi _{t_1}(\tau _{x_0}))\) where \(\tau =\tau _{x_0}\) is the thermal state in the statement of the theorem with parameter \(x_0\) as in (49). We note that by assumption \(S(\Phi _{t}(\rho ))= S(\Phi _{t}(\tau _{x_0}))\) for \(t=0\). Thus, we may define
Then, by continuity we have \(S(\Phi _{t_0}(\rho ))= S(\Phi _{t_0}(\tau _{x_0}))\) and
Recall that \(\Phi _t(\tau _{x_0})\) is a thermal state for any \(t>0\). Let \(x_t\) be the parameter of this thermal state: \(\tau _{x_t} = \Phi _t(\tau _{x_0})\). We note that \(t\mapsto x_t\) is smooth. Then, by the above discussions, there is a smooth function \(t\mapsto \alpha _t\) with \(\alpha _t>0\) such that
Employing Theorem 12, for any \(t_0\le t\le t_1\) we obtain
Taking the integral of both sides yields
We note that \(S(\Phi _{t_0}(\rho ))= S(\tau _{x_{t_0}})\), so this inequality is in contradiction with (50). This implies that the starting inequality \(S\big (\Phi _{t_1}(\rho )\big )< S(\Phi _{t_1}(\tau _{x_0}))\) does not hold. \(\square \)
Generalization of the CMOE conjecture to the multimode case is an open problem in general. However, generalizing the proof idea of Theorem 11 and using Theorem 3, we can prove the conjecture for a special class of multimode states that can be prepared by applying a passive Gaussian unitary \(U_G\) on an arbitrary product state, \(\rho =U_G \rho _{1}\otimes \rho _{2}\otimes \cdots \otimes \rho _{m} U_G^\dagger \).
6 Conclusion
In this paper, we introduced a meta log-Sobolev inequality, formalized by Theorem 1, that provides a general information theoretic framework to study phase-covariant Gaussian quantum channels. In general, this inequality and our technical proof, particularly the tensorization argument that we used in the second step, are of independent interest. However, by using this result, we have derived new optimal bounds in the context of p-log-Sobolev inequalities for the semigroup of attenuation channels.
Specifically, in Theorem 4, we explicitly computed the optimal p-log-Sobolev constant of the quantum Ornstein–Uhlenbeck semigroup for any \(1\le p\le 2\). We also in Theorem 10 established a bound on the 2-log-Sobolev constant in the multimode case that scales logarithmically with the number of modes. We conjecture that the log-Sobolev constants in the multimode case match those of the single-mode case and the tensorization property holds in general. We showed in Corollary 6 that this conjecture holds assuming that the optimal multimode states are Gaussian.
Quantum log-Sobolev inequalities are also defined for \(0\le p< 1\) and are related to reverse hypercontractivity inequalities [2, 10]. An interesting open problem is to compute the p-log-Sobolev constant for this range of parameter, which we leave it for future works.
The proof of our multimode log-Sobolev inequality is based on an entropic inequality in Lemma 9 which is of independent interest. Using this inequality, one might be able to prove log-Sobolev inequalities for the tensor product of other quantum Markov semigroups. We leave exploration of such applications of Lemma 9 for future works.
Our results have novel consequences even in the fully classical case, i.e., when restricting to states \(\rho \) that are diagonal in the Fock basis. In particular, as pointed out in [6], restricting the quantum Ornstein–Uhlenbeck semigroup to diagonal states, we obtain a classical birth-death process whose log-Sobolev constants were not known prior to our work (except in the case of \(p=1\)). Moreover, our results have consequences for the classical operation of thinning on the space of integer-valued random variables, for which we refer to [18] and references therein.
We also presented an alternative proof of the CMOE conjecture in Theorem 11 in the single-mode case. This conjecture in the multimode case is left as a major open problem. Our results contribute to the resolution of Gaussian optimizer conjectures in the quantum case [18]. These conjectures state the optimality of Gaussian states for certain optimization problems regarding Gaussian channels. Theorem 1 establishes this conjecture for the quite general function \(\Upsilon (\rho )\) in the single-mode case. Generalizing this theorem for multimode states can resolve the CMOE conjecture in the multimode case. To this end, ideas developed in [19] might be useful. We leave further exploration of this problem for future works.
Data availability
Data sharing is not applicable to this article as no data sets were generated or analyzed during the current study.
Notes
Amplifier and additive-noise channels are not hypercontractive in the usual sense.
The parameter p is also relevant in the study of log-Sobolev inequalities for finite classical Markov semigroups; see [38] and references therein.
Note that although \({{\textrm{tr}}}({\textbf{a}}\rho {\textbf{a}}^\dagger )\) may be well defined and finite, \(\rho {\textbf{a}}^\dagger {\textbf{a}}\) may not be a trace class operator. For more details on the expectation value of unbounded operators in quantum mechanics we refer to [4].
Note that \(\Vert {\textbf{a}}|\psi \rangle \Vert =\Vert |{\textbf{a}}^\dagger {\textbf{a}}|^{1/2}|\psi \rangle \Vert \), so this definition matches the one for the number operator.
Phase-covariant channels are also known as gauge-covariant channels.
If \(p=1\), we let \({\hat{p}}=+\infty \).
Here, we take the limits \(x\rightarrow 0^+\) and \(x\rightarrow 1^-\) to include the boundary values.
This inequality can also be derived from the non-negativity of the Kullback–Leibler divergence.
The quantum Ornstein–Uhlenbeck semigroup restricted to a certain subspace of operators resembles the classical Ornstein–Uhlenbeck semigroup [9, Equation (7.5)].
In the literature of log-Sobolev inequalities usually \({\mathcal {L}}\) is the generator in the Heisenberg picture and \({\mathcal {L}}^*\) is the generator in the Schrödinger picture. Here, we change the notation since we started with channels in the Schrödinger picture in previous sections.
In fact in [9], the spectrum of \(\Gamma _{{{\widehat{\sigma }}}}^{1/2}\circ {{\widehat{{\mathcal {L}}}}}^*\circ \Gamma _{{{\widehat{\sigma }}}}^{-1/2}\) as an operator acting on the space of Hilbert–Schmidt operators is computed.
If \(t_1=0\), then \(\Phi \) is the identity channel and there is nothing to prove.
References
Anantharam, V., Jog, V., Nair, C.: Unifying the Brascamp–Lieb inequality and the entropy power inequality. IEEE Trans. Inf. Theory 68(12), 7665–7684 (2022). https://doi.org/10.1109/TIT.2022.3192913
Beigi, S., Datta, N., Rouzé, C.: Quantum reverse hypercontractivity: its tensorization and application to strong converses. Commun. Math. Phys. 376(2), 753–794 (2020). https://doi.org/10.1007/s00220-020-03750-z
Braunstein, S.L., van Loock, P.: Quantum information with continuous variables. Rev. Mod. Phys. 77, 513–577 (2005). https://doi.org/10.1103/RevModPhys.77.513
Busch, P., Lahti, P., Pellonpää, J.-P., Ylinen, K.: Quantum Measurement, vol. 23. Springer, Berlin (2016). https://doi.org/10.1007/978-3-319-43389-9
Cahill, K.E., Glauber, R.J.: Density operators and quasiprobability distributions. Phys. Rev. 177, 1882–1902 (1969). https://doi.org/10.1103/PhysRev.177.1882
Carbone, R., Sasso, E.: Hypercontractivity for a quantum Ornstein–Uhlenbeck semigroup. Probab. Theory Relat. Fields 140(3), 505–522 (2008). https://doi.org/10.1007/s00440-007-0073-2
Carlen, E.A.: Superadditivity of Fisher’s information and logarithmic Sobolev inequalities. J. Funct. Anal. 101(1), 194–211 (1991). https://doi.org/10.1016/0022-1236(91)90155-X
Carlen, E.A., Maas, J.: Gradient flow and entropy inequalities for quantum Markov semigroups with detailed balance. J. Funct. Anal. 273(5), 1810–1869 (2017). https://doi.org/10.1016/j.jfa.2017.05.003
Cipriani, F., Fagnola, F., Lindsay, J.M.: Spectral analysis and feller property for quantum Ornstein–Uhlenbeck semigroups. Commun. Math. Phys. 210, 85–105 (2000). https://doi.org/10.1007/s002200050773
Cubitt, T., Kastoryano, M., Montanaro, A., Temme, K.: Quantum reverse hypercontractivity. J. Math. Phys. 56(10), 102204 (2015). https://doi.org/10.1063/1.4933219
De Palma, G.: New lower bounds to the output entropy of multi-mode quantum Gaussian channels. IEEE Trans. Inf. Theory 65(9), 5959–5968 (2019). https://doi.org/10.1109/TIT.2019.2914434
De Palma, G., Huber, S.: The conditional entropy power inequality for quantum additive noise channels. J. Math. Phys. 59(12), 122201 (2018). https://doi.org/10.1063/1.5027495
De Palma, G., Mari, A., Giovannetti, V.: A generalization of the entropy power inequality to bosonic quantum systems. Nat. Photonics 8(12), 958–964 (2014). https://doi.org/10.1038/nphoton.2014.252
De Palma, G., Trevisan, D., Giovannetti, V.: Passive states optimize the output of bosonic gaussian quantum channels. IEEE Trans. Inf. Theory 62(5), 2895–2906 (2016). https://doi.org/10.1109/TIT.2016.2547426
De Palma, G., Trevisan, D., Giovannetti, V.: Gaussian states minimize the output entropy of one-mode quantum gaussian channels. Phys. Rev. Lett. 118(16), 160503 (2017). https://doi.org/10.1103/PhysRevLett.118.160503
De Palma, G., Trevisan, D., Giovannetti, V.: Gaussian states minimize the output entropy of the one-mode quantum attenuator. IEEE Trans. Inf. Theory 63(1), 728–737 (2017). https://doi.org/10.1109/TIT.2016.2621748
De Palma, G., Trevisan, D., Giovannetti, V.: The one-mode quantum-limited gaussian attenuator and amplifier have gaussian maximizers. Ann. Henri Poincaré 19, 2919–2953 (2018). https://doi.org/10.1007/s00023-018-0703-5
De Palma, G., Trevisan, D., Giovannetti, V., Ambrosio, L.: Gaussian optimizers for entropic inequalities in quantum information. J. Math. Phys. 59(8), 081101 (2018). https://doi.org/10.1063/1.5038665
Gao, L., Rouzé, C.: Complete entropic inequalities for quantum Markov Chains. Arch. Ration. Mech. Anal. 245, 183–238 (2022). https://doi.org/10.1007/s00205-022-01785-1
Geng, Y., Nair, C.: The capacity region of the two-receiver gaussian vector broadcast channel with private and common messages. IEEE Trans. Inf. Theory 60(4), 2087–2104 (2014). https://doi.org/10.1109/TIT.2014.2304457
Giovannetti, V., Guha, S., Lloyd, S., Maccone, L., Shapiro, J.H.: Minimum output entropy of bosonic channels: a conjecture. Phys. Rev. A 70(3), 032315 (2004). https://doi.org/10.1103/PhysRevA.70.032315
Giovannetti, V., Holevo, A.S., Garcia-Patron, R.: A solution of gaussian optimizer conjecture for quantum channels. Commun. Math. Phys. 334, 1553–1571 (2015). https://doi.org/10.1007/s00220-014-2150-6
Giovannetti, V., Holevo, A.S., Lloyd, S., Maccone, L.: Generalized minimal output entropy conjecture for one-mode gaussian channels: definitions and some exact results. J. Phys. A Math. Theor. 43(41), 415305 (2010). https://doi.org/10.1088/1751-8113/43/41/415305
Giovannetti, V., Holevo, A.S., Mari, A.: Majorization and additivity for multimode bosonic gaussian channels. Theor. Math. Phys. 182, 284–293 (2015). https://doi.org/10.1007/s11232-015-0262-6
Gross, L.: Logarithmic Sobolev inequalities. Am. J. Math. 97(4), 1061–1083 (1975). https://doi.org/10.2307/2373688
Guha, S., Erkmen, B.I., Shapiro, J.H.: The entropy photon-number inequality and its consequences. In: 2008 Information Theory and Applications Workshop, pp. 128–130 (2008). https://doi.org/10.1109/ITA.2008.4601037
Guha, S., Shapiro, J.H., Erkmen, B.I.: Classical capacity of bosonic broadcast communication and a minimum output entropy conjecture. Phys. Rev. A 76(3), 032303 (2007). https://doi.org/10.1103/PhysRevA.76.032303
Heinosaari, T., Holevo, A.S., Wolf, M.M.: The semigroup structure of gaussian channels. Quantum Inf. Comput. 10(7), 619–635 (2010). https://doi.org/10.26421/QIC10.7-8-4
Hillery, M., O’Connell, R., Scully, M., Wigner, E.: Distribution functions in physics: fundamentals. Phys. Rep. 106(3), 121–167 (1984). https://doi.org/10.1016/0370-1573(84)90160-1
Holevo, A.S.: One-mode quantum gaussian channels: structure and quantum capacity. Probl. Inf. Transm. 43, 1–11 (2007). https://doi.org/10.1134/S0032946007010012
Holevo, A.S.: Quantum Systems, Channels, Information: A Mathematical Introduction. De Gruyter, Berlin (2013). https://doi.org/10.1515/9783110273403
Holevo, A.S.: Gaussian optimizers and the additivity problem in quantum information theory. Russ. Math. Surv. 70(2), 331 (2015). https://doi.org/10.1070/RM2015v070n02ABEH004949
Holevo, A.S., Werner, R.F.: Evaluating capacities of bosonic gaussian channels. Phys. Rev. A 63(3), 032312 (2001). https://doi.org/10.1103/PhysRevA.63.032312
Huber, S., König, R., Vershynina, A.: Geometric inequalities from phase space translations. J. Math. Phys. 58(1), 012206 (2017). https://doi.org/10.1063/1.4974224
Kastoryano, M.J., Temme, K.: Quantum logarithmic Sobolev inequalities and rapid mixing. J. Math. Phys. 54(5), 052202 (2013). https://doi.org/10.1063/1.4804995
Lieb, E.H.: Gaussian kernels have only gaussian maximizers. Invent. Math. 102, 179–208 (1990). https://doi.org/10.1007/BF01233426
Lunardi, A.: Interpolation Theory, vol. 16. Springer, Berlin (2018). https://doi.org/10.1007/978-88-7642-638-4
Mossel, E., Oleszkiewicz, K., Sen, A.: On reverse hypercontractivity. Geom. Funct. Anal. 23(3), 1062–1097 (2013). https://doi.org/10.1007/s00039-013-0229-4
Olkiewicz, R., Zegarlinski, B.: Hypercontractivity in noncommutative \(l_p\) spaces. J. Funct. Anal. 161(1), 246–285 (1999). https://doi.org/10.1006/jfan.1998.3342
Pisier, G.: Non-commutative Vector Valued \(p\)-spaces and Completely \(p\)-summing Map. Number 247. Astérisque (1998)
Serafini, A.: Quantum Continuous Variables: A Primer of Theoretical Methods. CRC Press, Boca Raton (2017). https://doi.org/10.1201/9781315118727
Weedbrook, C., Pirandola, S., García-Patrón, R., Cerf, N.J., Ralph, T.C., Shapiro, J.H., Lloyd, S.: Gaussian quantum information. Rev. Mod. Phys. 84, 621–669 (2012). https://doi.org/10.1103/RevModPhys.84.621
Winter, A.: Tight uniform continuity bounds for quantum entropies: conditional entropy, relative entropy distance and energy constraints. Commun. Math. Phys. 347, 291–313 (2016). https://doi.org/10.1007/s00220-016-2609-8
Acknowledgements
SB is thankful to Reza Seyyedali for several inspiring discussions. This work is supported by the NRF grant NRF2021-QEP2-02-P05 and the Ministry of Education, Singapore, under the Research Centres of Excellence program.
Author information
Authors and Affiliations
Contributions
Both authors have contributed equally to the work.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no potential Conflict of interest in relation to the study in this paper.
Additional information
Communicated by David Pérez-García.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Appendices
Generators of Phase-covariant Gaussian Channels
In this appendix, we show that any single-mode phase-covariant channel given by (11) is a member of a semigroup whose Lindbladian equals (12).
Let us consider the family of channels \(\{\Phi _t:\, t\ge 0\}\) corresponding to parameters \(\{(\lambda _t, \gamma _t):\, t\ge 0\}\) in (11):
Suppose that \(\{\Phi _t:\, t\ge 0\}\) is a semigroup which in particular means that \(\Phi _0\) is the identity channel corresponding to parameters \(\gamma _0=0\) and \(\lambda _0=1\). Then, the generator of this semigroup is determined by
Using (51), we compute the derivative
where \(\lambda '_0=\frac{{\textrm{d}}\lambda _t}{{\textrm{d}}t}\big |_{t=0}\), \(\gamma '_0=\frac{{\textrm{d}}\gamma _t}{{\textrm{d}}t}\big |_{t=0}\). Here, we use \(\frac{{\textrm{d}}}{{\textrm{d}}\lambda } D_{\lambda \delta } = (\xi {\textbf{a}}^\dagger - {{\bar{\xi }}} {\textbf{a}}) D_{\lambda \xi }\) in the first line, and the well-known equations \([{\textbf{a}}, D_\xi ] =\xi D_\xi \) and \([{\textbf{a}}^\dagger , D_\xi ] ={{\bar{\xi }}} D_\xi \) in the second line. Replacing the characteristic functions in the above equation by their corresponding operators yields
where we define
Therefore, if the channels defined by (51) form a semigroup, then the corresponding Lindbladian is equal to
We use this equation to obtain the generator of the three classes of phase-covariant Gaussian channels.
1.1 Attenuator Channels
The attenuator channel can be physically modeled by overlapping the input state and a thermal state \(\sigma _\beta \) on a beam splitter with transmissivity \(\lambda \):
where the 2-mode unitary operator \(U_{\text {BS},\lambda }\) can be described by
Using these relations, the characteristic function of the two-mode state \(U_{\text {BS},\lambda } (\rho \otimes \sigma _\beta ) U_{\text {BS},\lambda }^\dagger \) equals
Thus, the characteristic function of the output state of the attenuation channel is given by
where we replace the characteristic function of the thermal state from (9).
Setting \(\lambda =\lambda _t=e^{-2ct}\) with \(c>0\) in the above relation, we observe that attenuator channels \(\{\Phi ^{\text {att}}_{\lambda _t}:\, t\ge 0\}\) form a semigroup. Comparing the above equation with (51) also gives \(\gamma _t=\coth (\beta /2) (1-\lambda _t)\). Therefore, by plugging \(\lambda _0'=-2c\) and \(\gamma _0'=2c\coth (\beta /2) \) into (52), we obtain the corresponding generator
1.2 Amplifier Channels
Following a similar procedure as for attenuator channels, we can derive the generator of the semigroup of amplifier channels. An amplifier channel can be modeled by applying a two-mode squeezing unitary with squeezing parameter \(\lambda \ge 1\) on the input state and a thermal state \(\sigma _\beta \), and then tracing over the second mode:
Here, the action of the two-mode squeezing unitary can be described by
Using these relations, we have
Thus, the characteristic function of the output state is given by
where in the last line the characteristic function of the thermal state (9) is replaced.
Setting \(\lambda =\lambda _t=e^{2ct}\) with \(c>0\), we observe that amplifier channels \(\{\Phi ^{\text {amp}}_{\lambda _t}:\, t\ge 0\}\) form a semigroup. Here, \(\gamma _t=\coth (\beta /2) (\lambda _t-1)\). Thus, by plugging \(\lambda _0'=2c\) and \(\gamma _0'=2c\coth (\beta /2)\) in (52), we obtain the generator
Note that here we get the same coefficients as those of the generator of the attenuator semigroup, but swapped.
1.3 Additive-noise Channels
The classical additive-noise Gaussian channels can be modeled by applying a displacement operator whose parameter is chosen at random according to a Gaussian probability distribution:
The characteristic function of the output state of this channel is given by
where we use \(D^{\dagger }_\alpha D_\xi D_\alpha =\exp (\xi {\bar{\alpha }}-{\bar{\xi }}\alpha ) D_\xi \) in the second line.
By choosing \(\gamma =2ct\) with \(c>0\), the channels \(\{\Phi ^{\text {add}}_t:\, t\ge 0\}\) form a semigroup. Here, \(\lambda _t=1\). Thus, by inserting \(\lambda _0'=0\) and \(\gamma _0'=2c\) in (52) we obtain the generator
\(\Upsilon (\rho )\) is Well Defined if \({{\textrm{tr}}}(\rho {\textbf{a}}^\dagger {\textbf{a}})<+\infty \)
In this appendix, we show that \(\Upsilon (\rho )\) given by
is well defined for any state \(\rho \) with finite mean photon number. For such a state both \({{\textrm{tr}}}(\rho {\textbf{a}}^\dagger {\textbf{a}}), {{\textrm{tr}}}(\rho {\textbf{a}}{\textbf{a}}^\dagger )\) are finite. Moreover, if \(\rho = \sum _j \lambda _j |\psi _j\rangle \langle \psi _j|\) is the eigen-decomposition of \(\rho \), then we have
where in the second line we use Young’s inequality. Thus, \( {{\textrm{tr}}}\big ( \rho ^{1/p} {\textbf{a}}\rho ^{1/{\hat{p}}} {\textbf{a}}^\dagger \big )\) and similarly \({{\textrm{tr}}}\big ( \rho ^{1/p} {\textbf{a}}^\dagger \rho ^{1/{\hat{p}}} {\textbf{a}}\big )\) are finite. Also, using the non-negativity of Umegaki’s quantum relative entropy \(D(\rho \Vert \tau ) = {{\textrm{tr}}}(\rho \log \rho ) - {{\textrm{tr}}}(\rho \log \tau )\), for the thermal state \(\tau = (1-1/e) e^{-{\textbf{a}}^\dagger {\textbf{a}}} \) we have
Therefore, since \({{\textrm{tr}}}\big (\rho {\textbf{a}}^\dagger {\textbf{a}}\big )\) is finite, \(S(\rho )\) is also finite.
Verification of (36)
Recall that
Then, computing the derivative term-by-term yields
Therefore, we have
This gives (36).
Proof of \(\alpha _2=\min _{p\ge 1} \alpha _p\)
In this appendix, we show that for any \(p\ge 1\) the following inequality holds:
First, we may restrict to \(p\in [1, 2]\). Then, with the change of variable \(p=\frac{2}{1+t}\) with \(t\in [0,1]\) the above inequality is equivalent to
Starting with the left hand side, we compute
Here, the inequality holds since \(0\le t\le 1 \). We are done.
Proof of Proposition 7
It suffices to prove the proposition for \(m=1\). In this case, we drop subscript j and denote \({{\widehat{{\mathcal {L}}}}}\) and \({{\widehat{\sigma }}}\) simply by \({\mathcal {L}}\) and \(\sigma \), respectively.
Using the commutation relation \([{\textbf{a}}, {\textbf{a}}^\dagger ]=1\), we find that \([{\textbf{a}}, {\textbf{q}}_z ] = \frac{z}{\sqrt{2}}\), and
This means that
and similarly
Using these and \(|z|=1\), a straightforward computation shows that
Compared to (39), we find that
Finally, note that the operators \(h_k({\textbf{q}}_z)\), over all \(k\ge 0\) and \(|z|=1\), span the whole space of polynomials of \({\textbf{a}}\) and \({\textbf{a}}^\dagger \), which is dense in \(L_2(\sigma )\).
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Beigi, S., Rahimi-Keshari, S. A Meta Logarithmic-Sobolev Inequality for Phase-Covariant Gaussian Channels. Ann. Henri Poincaré (2024). https://doi.org/10.1007/s00023-024-01487-2
Received:
Accepted:
Published:
DOI: https://doi.org/10.1007/s00023-024-01487-2