Swiveled Rényi entropies

Dupuis, Frédéric; Wilde, Mark M.

doi:10.1007/s11128-015-1211-x

Swiveled Rényi entropies

Published: 15 February 2016

Volume 15, pages 1309–1345, (2016)
Cite this article

Download PDF

Access provided by Autonomous University of Puebla

Quantum Information Processing Aims and scope Submit manuscript

Swiveled Rényi entropies

Download PDF

274 Accesses
15 Citations
2 Altmetric
Explore all metrics

Abstract

This paper introduces “swiveled Rényi entropies” as an alternative to the Rényi entropic quantities put forward in Berta et al. (Phys Rev A 91(2):022333, 2015). What distinguishes the swiveled Rényi entropies from the prior proposal of Berta et al. is that there is an extra degree of freedom: an optimization over unitary rotations with respect to particular fixed bases (swivels). A consequence of this extra degree of freedom is that the swiveled Rényi entropies are ordered, which is an important property of the Rényi family of entropies. The swiveled Rényi entropies are, however, generally discontinuous at $\alpha =1$ and do not converge to the von Neumann entropy-based measures in the limit as $\alpha \rightarrow 1$, instead bounding them from above and below. Particular variants reduce to known Rényi entropies, such as the Rényi relative entropy or the sandwiched Rényi relative entropy, but also lead to ordered Rényi conditional mutual information and ordered Rényi generalizations of a relative entropy difference. Refinements of entropy inequalities such as monotonicity of quantum relative entropy and strong subadditivity follow as a consequence of the aforementioned properties of the swiveled Rényi entropies. Due to the lack of convergence at $\alpha =1$, it is unclear whether the swiveled Rényi entropies would be useful in one-shot information theory, so that the present contribution represents partial progress toward this goal.

Forward and Reverse Entropy Power Inequalities in Convex Geometry

Weighted p-Rényi Entropy Power Inequality: Information Theory to Quantum Shannon Theory

Article 25 November 2023

Entropy Measures and Views of Information

Discover the latest articles, news and stories from top researchers in related subjects.

Quantum Computing

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

In 1961, Alfred Rényi defined a parametrized family of entropies now bearing his name, by relaxing one of the axioms that singles out the Shannon entropy [34]. This led to both the $\alpha $-Rényi entropy and the $\alpha $-Rényi divergence, defined respectively for a parameter $\alpha \in \left( 0,1\right) \cup \left( 1,\infty \right) $ and probability distributions p and q as

$$\begin{aligned} H_{\alpha }(p)&\equiv \frac{1}{1-\alpha }\log \sum _{x}\left[ p( x) \right] ^{\alpha }, \end{aligned}$$

(1)

$$\begin{aligned} D_{\alpha }(p\Vert q)&\equiv \frac{1}{\alpha -1}\log \sum _{x}\left[ p( x) \right] ^{\alpha }\left[ q( x) \right] ^{1-\alpha }, \end{aligned}$$

(2)

where $\log $ denotes the natural logarithm here and throughout the paper. The Shannon entropy and relative entropy are recovered in the limit as $\alpha \rightarrow 1$:

$$\begin{aligned} \lim _{\alpha \rightarrow 1}H_{\alpha }(p)&=H(p)\equiv -\sum _{x}p( x) \log p( x),\end{aligned}$$

(3)

$$\begin{aligned} \lim _{\alpha \rightarrow 1}D_{\alpha }(p\Vert q)&=D(p\Vert q)\equiv \sum _{x}p( x) \log \frac{p( x) }{q( x)}. \end{aligned}$$

(4)

What began largely as a theoretical exploration ended up having many practical ramifications, especially in the contexts of information theory and statistics. For example, it is now well known that the Rényi entropies play a fundamental role in obtaining a sharpened understanding of the trade-off between communication rate, error probability, and number of resources in communication protocols, such as data compression and channel coding [10, 16]. “Smoothing” the Rényi entropies [33] has also led to the development of “ one-shot” information theory [32, 38], with applications to cryptography.

Part of what makes the Rényi entropies so useful in applications is their properties: convergence to the Shannon and relative entropies in the limit as $\alpha \rightarrow 1$, monotonicity in the parameter $~\alpha $, and additivity, in addition to others. The convergence to the Shannon and relative entropies ensures that, by taking this limit, one recovers asymptotic information-theoretic statements, such as the data compression theorem or the channel capacity theorem, from the more fine-grained statements. Monotonicity in the parameter $\alpha $ ensures that $H_{\alpha }( p) $ gives more weight to low surprisal events for $\alpha >1$ and vice versa for $\alpha <1$, helping to characterize the aforementioned trade-off in information-theoretic settings. The additivity property implies that the Rényi entropies can simplify immensely when evaluated for memoryless stochastic processes.

In light of the progress that the Rényi paradigm has brought to information theory, one is left to wonder if this could happen in more exotic settings, such as quantum information theory and/or for “multipartite” settings (here by multipartite, we mean three or more parties). This line of thought has led to the development of several non-commutative generalizations of the Rényi relative entropy in (2), which has in turn led to a sharpened understanding of several quantum information-theoretic tasks (see [9, 39] and references therein) and refinements of the uncertainty principle [8]. As far as we are aware, the development of the multipartite generalization of the Rényi entropy in (1) is less explored, with the exception of a recent proposal [6] for a multipartite quantum generalization.

With the intent of developing either a multipartite classical or quantum generalization of (1), one might suggest after a moment’s thought to replace a quantity which features a linear combination of entropies by one with the same linear combination of Rényi entropies. However, this approach is objectively unsatisfactory in at least two regards: Properties of the original information measure are not preserved by doing so, and one is not guaranteed to have the powerful monotonicity in $\alpha $ property mentioned above. For example, take the case of the conditional mutual information of a tripartite density operator $\rho _{{ ABC}}$ defined as

$$\begin{aligned} I(A;B|C)_{\rho }\equiv H({ AC})_{\rho }+H({ BC})_{\rho }-H(C)_{\rho }-H({ ABC})_{\rho }, \end{aligned}$$

(5)

where $H(F)_{\sigma }\equiv -{\text {Tr}}\{\sigma _{F}\log \sigma _{F}\}$ is the quantum entropy of a density operator $\sigma $ on system F. One of the most important properties of this quantity is that it is non-negative (known as strong subadditivity of quantum entropy [22, 23]), and as a consequence, it is monotone non-increasing with respect to any quantum channel applied to the system A [7] (by symmetry, the same is true for one applied to B). However, if we define a Rényi generalization of $I(A;B|C)_{\rho }$ as $H_{\alpha }({ AC})_{\rho }+H_{\alpha }({ BC})_{\rho }-H_{\alpha }(C)_{\rho }-H_{\alpha }({ ABC})_{\rho }$, where $H_{\alpha }(F)_{\sigma }\equiv \left[ \log {\text {Tr}}\{\sigma _{F}^{\alpha }\}\right] /\left( 1-\alpha \right) $, then explicit counterexamples reveal that this Rényi generalization can be negative, and monotonicity with respect to quantum channels need not hold, and neither does monotonicity in $\alpha $ [25].

To remedy these deficiencies, the authors of [6] put forward a general prescription for producing a Rényi generalization of a quantum information measure, with the aim of having the properties of the original measure retained while also satisfying the monotonicity in $\alpha $ property. The work in [6] was only partially successful in this regard. Continuing with our example of conditional mutual information, consider the following Rényi generalization [5]:

$$\begin{aligned} I_{\alpha }(A;B|C)_{\rho }\equiv \frac{1}{\alpha -1}\log {\text {Tr}}\left\{ \rho _{ABC}^{\alpha }\rho _{AC}^{\left( 1-\alpha \right) /2}\rho _{C}^{\left( \alpha -1\right) /2}\rho _{BC}^{1-\alpha }\rho _{C}^{\left( \alpha -1\right) /2}\rho _{AC}^{\left( 1-\alpha \right) /2}\right\} . \end{aligned}$$

(6)

For $\alpha \in [0,1)\cup (1,2]$, the quantity is non-negative, monotone non-increasing with respect to quantum channels acting on the B system, converges to $I(A;B|C)_{\rho }$ in the limit as $\alpha \rightarrow 1$, and is conjectured to obey the monotonicity in $\alpha $ property (with some numerical and analytical evidence in favor established) [5]. However, hitherto a proof of the monotonicity in $\alpha $ property for $I_{\alpha }(A;B|C)_{\rho }$ remains lacking. It is also an open question to determine whether $I_{\alpha }(A;B|C)_{\rho }$ is monotone non-increasing with respect to quantum channels acting on the A system—this partially has to do with the fact that $I_{\alpha }(A;B|C)_{\rho }$ is not symmetric with respect to exchange of the A and B systems, unlike the conditional mutual information in (5).

2 Summary of results

In this paper, we modify the recently proposed Rényi generalizations of quantum information measures from [6] by placing “ swivels” in a given chain of operators.^{Footnote 1} As an example of the idea, consider that we can rewrite the quantity in (6) in terms of the Schatten 2-norm as follows:

$$\begin{aligned} I_{\alpha }( A;B|C) _{\rho }\equiv \frac{2}{\alpha -1}\log \left\| \rho _{BC}^{\left( 1-\alpha \right) /2}\rho _{C}^{\left( \alpha -1\right) /2} \rho _{AC}^{\left( 1-\alpha \right) /2}\rho _{ABC}^{\alpha /2}\right\| _{2}. \end{aligned}$$

(7)

The new idea is to modify this quantity to include swivels as follows:

$$\begin{aligned}&I_{\alpha }^{\prime }( A;B|C) _{\rho } \nonumber \\&\quad \equiv \,\frac{2}{\alpha -1}\max _{V_{\rho _{AC} }\in \mathbb {V}_{\rho _{AC}},V_{\rho _{C}}\in \mathbb {V}_{\rho _{C}}}\log \left\| \rho _{BC}^{\left( 1-\alpha \right) /2}V_{\rho _{C}}\rho _{C}^{\left( \alpha -1\right) /2}\rho _{AC}^{\left( 1-\alpha \right) /2}V_{\rho _{AC}} \rho _{ABC}^{\alpha /2}\right\| _{2}, \quad \quad \end{aligned}$$

(8)

where $\mathbb {V}_{\omega }$ is the compact set of all unitaries $V_{\omega } $ commuting with the Hermitian operator $\omega $. Thus, the fixed eigenbases of $\rho _{C}$ and $\rho _{AC}$ act as swivels connecting adjacent operators in the operator chain above, such that the unitary rotations $V_{\rho _{C}}$ and $V_{\rho _{AC}}$ about these swivels are allowed. Of course, such swivels make no difference when the density operator $\rho _{ABC}$ and its marginals commute with each other (the classical case), or when the C system is trivial, in which case the above quantity reduces to a Rényi mutual information

$$\begin{aligned} I_{\alpha }^{\prime }( A;B) _{\rho }&\equiv \frac{2}{\alpha -1}\max _{V_{\rho _{A}}\in \mathbb {V}_{\rho _{A}}}\log \left\| \rho _{B}^{\left( 1-\alpha \right) /2}\rho _{A}^{\left( 1-\alpha \right) /2}V_{\rho _{A}} \rho _{AB}^{\alpha /2}\right\| _{2} \end{aligned}$$

(9)

$$\begin{aligned}&=\frac{2}{\alpha -1}\log \left\| \rho _{B}^{\left( 1-\alpha \right) /2}\rho _{A}^{\left( 1-\alpha \right) /2}\rho _{AB}^{\alpha /2}\right\| _{2}. \end{aligned}$$

(10)

We mention that we were led to the definition in (8) as a consequence of the developments in [45], in which similar swivels appeared in refinements of entropy inequalities such as monotonicity of quantum relative entropy and strong subadditivity.

The quantity in (8) satisfies some of the properties already established for $I_{\alpha }( A;B|C) _{\rho }$ in [5], which include nonnegativity for $\alpha \in [0,1)\cup (1,2]$ and monotonicity with respect to quantum channels acting on the B system. However, the extra degree of freedom in (8) allows us to prove that this swiveled Rényi conditional mutual information is monotone non-decreasing in $\alpha $ for $\alpha \in [ 0,1)\cup (1,2] $.

The swiveled Rényi entropies are in general discontinuous at $\alpha =1$ and do not converge to the von Neumann entropy-based measures in the limit as $\alpha \rightarrow 1$. Thus, the present paper represents a work in progress toward the general goal of find Rényi generalizations of quantum information measures that satisfy all of the desired properties that one would like to have. It thus remains an open question to find Rényi quantities that meet all desiderata.

The rest of the paper proceeds by developing this idea in detail. We review some background material in Sect. 3, which includes various quantum Rényi entropies and the Hadamard three-line theorem, the latter being the essential tool for establishing monotonicity in $\alpha $ for the swiveled Rényi entropies. We then focus in Sect. 4 on developing swiveled Rényi generalizations of the quantum relative entropy difference in (13), given that many different information measures can be written in terms of this relative entropy difference, including conditional mutual information (see, e.g., the discussions in [35, 44, 45]). Our main contributions are Theorems 2 and 3, which state that these quantities are monotone non-decreasing in $\alpha $ for particular values. We then briefly discuss how refinements of entropy inequalities follow as a consequence of the properties of the swiveled Rényi entropies. Section 5 discusses swiveled Rényi conditional mutual information and justifies that they possess the properties stated above. We extend the idea in Sect. 6 to establish swiveled Rényi generalizations of an arbitrary linear combination of von Neumann entropies with coefficients chosen from the set $\left\{ -1,0,1\right\} $. We finally show how our methods can be used to address an open question posed in [47]. Section 8 concludes with a summary and some open directions.

3 Preliminaries

3.1 Quantum states and channels

A quantum state is described mathematically by a density operator, which is a positive semi-definite operator with trace equal to one. A quantum channel is a linear, trace-preserving, completely positive map. For more background on quantum information theory, we refer to [27, 43]. Our results apply to finite-dimensional Hilbert spaces. For most developments, we take $\rho $, $\sigma $, and $\mathcal {N}$ to be as given in the following definition:

Definition 1

Let $\rho $ be a density operator acting on a finite-dimensional Hilbert space $\mathcal {H}$, $\sigma $ be a nonzero positive semi-definite operator acting on $\mathcal {H}$, and $\mathcal {N}$ be a quantum channel, taking operators acting on $\mathcal {H}$ to those acting on a finite-dimensional Hilbert space $\mathcal {K}$.

Sometimes we need more restrictions, in which case we take $\rho $, $\sigma $, and $\mathcal {N}$ as follows:

Definition 2

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1, with the additional restriction that $\rho $ and $\sigma $ are positive definite, and $\mathcal {N}$ is such that $\mathcal {N}(\rho )$ and $\mathcal {N}(\sigma )$ are also positive definite.

We employ the common convention that functions of Hermitian operators are evaluated on their support. In more detail, the support of a Hermitian operator A, written as ${\text {supp}}(A)$, is defined as the vector space spanned by its eigenvectors whose corresponding eigenvalues are nonzero. Let an eigendecomposition of A be given as $A = \sum _{i:a_{i} \ne 0} a_{i} \vert i\rangle \langle i \vert $ for eigenvectors $\{ \vert i \rangle \}$. Then ${\text {supp}}(A) = {\text {span}} \{ \vert i \rangle : a_{i} \ne 0\}$. Let $\varPi _{A}$ denote the projection onto the support of A. A function f of an operator A is then defined as $f(A) = \sum _{i:a_{i}\ne 0} f(a_{i}) \vert i\rangle \langle i \vert $.

3.2 Entropies and norms

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1. The quantum relative entropy [42] is defined as

$$\begin{aligned} D( \rho \Vert \sigma ) \equiv {\text {Tr}}\left\{ \rho \left[ \log \rho -\log \sigma \right] \right\} , \end{aligned}$$

(11)

whenever ${\text {supp}}(\rho ) \subseteq {\text {supp}}(\sigma )$, and otherwise, it is defined to be equal to $+\infty $. The quantum relative entropy is monotone non-increasing with respect to quantum channels [24, 41], in the sense that

$$\begin{aligned} D( \rho \Vert \sigma ) \ge D( \mathcal {N}( \rho ) \Vert \mathcal {N}( \sigma ) ) . \end{aligned}$$

(12)

Another relevant information measure is the quantum relative entropy difference, defined as

$$\begin{aligned} \Delta ( \rho ,\sigma ,\mathcal {N}) \equiv D( \rho \Vert \sigma ) -D( \mathcal {N}( \rho ) \Vert \mathcal {N}( \sigma ) ) . \end{aligned}$$

(13)

We can use the Schatten norms in order to establish Rényi generalizations of von Neumann entropies, which are more refined information measures for quantum states and channels that reduce to the von Neumann quantities in a limit. The Schatten p-norm of an operator A is defined as

$$\begin{aligned} \left\| A\right\| _{p}\equiv \left[ {\text {Tr}}\left\{ \left| A\right| ^{p}\right\} \right] ^{1/p}, \end{aligned}$$

(14)

where $p\ge 1$ and $\left| A\right| \equiv \sqrt{A^{\dag }A}$ (note that we sometimes use the notation $\left\| A\right\| _{p}$ even for values $p\in \left( 0,1\right) $ when the quantity on the right-hand side of (14) is not a norm). From the above definition, we can see that the following equalities hold for any operators A and B:

$$\begin{aligned} {\text {Tr}}\left\{ B^{\dag }A^{\dag }AB\right\}&=\left\| AB\right\| _{2}^{2}, \end{aligned}$$

(15)

$$\begin{aligned} \left\| B^{\dag }A^{\dag }AB\right\| _{p}^{p}&=\left\| AB\right\| _{2p}^{2p}. \end{aligned}$$

(16)

The quantum Rényi entropy of a state $\rho $ is defined for $\alpha \in \left( 0,1\right) \cup \left( 1,\infty \right) $ as

$$\begin{aligned} H_{\alpha }( \rho ) \equiv \frac{1}{1-\alpha }\log {\text {Tr}} \{ \rho ^{\alpha }\} = \frac{\alpha }{1-\alpha }\log \left\| \rho \right\| _{\alpha }, \end{aligned}$$

(17)

and reduces to the von Neumann entropy in the limit as $\alpha \rightarrow 1$:

$$\begin{aligned} \lim _{\alpha \rightarrow 1}H_{\alpha }( \rho ) =H( \rho ) . \end{aligned}$$

(18)

There are at least two ways to generalize the quantum relative entropy, which we refer to as the Rényi relative entropy $D_{\alpha }( \rho \Vert \sigma ) $ [28] and the sandwiched Rényi relative entropy $\widetilde{D}_{\alpha }( \rho \Vert \sigma ) $ [26, 46]. They are defined respectively as follows:

$$\begin{aligned} D_{\alpha }( \rho \Vert \sigma )&\equiv \frac{1}{\alpha -1}\log {\text {Tr}} \left\{ \rho ^{\alpha }\sigma ^{1-\alpha }\right\} \end{aligned}$$

(19)

$$\begin{aligned}&=\frac{2}{\alpha -1}\log \left\| \sigma ^{\left( 1-\alpha \right) /2} \rho ^{\alpha /2}\right\| _{2}, \end{aligned}$$

(20)

$$\begin{aligned} \widetilde{D}_{\alpha }( \rho \Vert \sigma )&\equiv \frac{1}{\alpha -1} \log {\text {Tr}}\left\{ \left( \sigma ^{\left( 1-\alpha \right) /2\alpha }\rho \sigma ^{\left( 1-\alpha \right) /2\alpha }\right) ^{\alpha }\right\} \end{aligned}$$

(21)

$$\begin{aligned}&=\frac{2\alpha }{\alpha -1}\log \left\| \sigma ^{\left( 1-\alpha \right) /2\alpha }\rho ^{1/2}\right\| _{2\alpha }, \end{aligned}$$

(22)

if $\alpha \in (0,1)$ or if $\alpha \in (1,\infty )$ and ${\text {supp}}(\rho ) \subseteq {\text {supp}}(\sigma )$. If $\alpha \in (1,\infty )$ and ${\text {supp}}(\rho ) \nsubseteq {\text {supp}}(\sigma )$, then they are defined to be equal to $+\infty $. The rewritings in (20) and (22) are helpful for our developments in this paper and follow from (15)–(16) and the following:

$$\begin{aligned} {\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{1-\alpha }\right\}&={\text {Tr}}\left\{ \rho ^{\alpha /2}\sigma ^{\left( 1-\alpha \right) /2}\sigma ^{\left( 1-\alpha \right) /2}\rho ^{\alpha /2}\right\} ,\end{aligned}$$

(23)

$$\begin{aligned} {\text {Tr}}\left\{ \left( \sigma ^{\left( 1-\alpha \right) /2\alpha }\rho \sigma _{\alpha }^{\left( 1-\alpha \right) /2\alpha }\right) ^{\alpha }\right\}&=\left\| \sigma ^{\left( 1-\alpha \right) /2\alpha } \rho \sigma ^{\left( 1-\alpha \right) /2\alpha }\right\| _{\alpha }^{\alpha }\end{aligned}$$

(24)

$$\begin{aligned}&=\left\| \rho ^{1/2}\sigma ^{\left( 1-\alpha \right) /\alpha }\rho ^{1/2}\right\| _{\alpha }^{\alpha }. \end{aligned}$$

(25)

Both Rényi generalizations reduce to the quantum relative entropy in the limit as $\alpha \rightarrow 1$ [26, 28, 46]:

$$\begin{aligned} \lim _{\alpha \rightarrow 1}D_{\alpha }( \rho \Vert \sigma ) =\lim _{\alpha \rightarrow 1}\widetilde{D}_{\alpha }( \rho \Vert \sigma ) =D( \rho \Vert \sigma ). \end{aligned}$$

(26)

The Rényi relative entropy is monotone non-increasing with respect to quantum channels when $\alpha \in [ 0,1)\cup (1,2] $ [28]:

$$\begin{aligned} D_{\alpha }( \rho \Vert \sigma ) \ge D_{\alpha }( \mathcal {N}( \rho ) \Vert \mathcal {N}( \sigma ) ) , \end{aligned}$$

(27)

and the sandwiched Rényi relative entropy possesses a similar monotonicity property when $\alpha \in [ 1/2,1)\cup (1,\infty ) $ [2, 18]:

$$\begin{aligned} \widetilde{D}_{\alpha }( \rho \Vert \sigma ) \ge \widetilde{D}_{\alpha }( \mathcal {N}( \rho ) \Vert \mathcal {N}( \sigma ) ). \end{aligned}$$

(28)

By picking particular values of the Rényi parameter $\alpha $, the quantities above take on special forms and have meaning in operational contexts, being known as the zero-relative entropy [11], the collision relative entropy [14], the min-relative entropy [15], and the max-relative entropy [11], respectively:

$$\begin{aligned} D_{0}( \rho \Vert \sigma )&=-\log {\text {Tr}}\left\{ \rho ^{0} \sigma \right\} ,\end{aligned}$$

(29)

$$\begin{aligned} D_{2}( \rho \Vert \sigma )&=\log \left\| \rho \sigma ^{-1/2}\right\| _{2},\end{aligned}$$

(30)

$$\begin{aligned} \widetilde{D}_{1/2}( \rho \Vert \sigma )&=-\log F( \rho ,\sigma ),\end{aligned}$$

(31)

$$\begin{aligned} D_{\max }( \rho \Vert \sigma )&=\lim _{\alpha \rightarrow \infty }\widetilde{D}_{\alpha }( \rho \Vert \sigma ) \nonumber =\log \left\| \sigma ^{-1/2}\rho \sigma ^{-1/2}\right\| _{\infty } \nonumber \\&=\log \left\| \sigma ^{-1/2}\rho ^{1/2}\right\| _{\infty }^{2}, \end{aligned}$$

(32)

where $F( \rho ,\sigma ) \equiv \left\| \sqrt{\rho }\sqrt{\sigma }\right\| _{1}^{2}$ is the quantum fidelity [40].

3.3 Hadamard three-line theorem

One of the most important technical tools for proving our main result is the operator version of the Hadamard three-line theorem given in [2], in particular, the very slight modification stated in [13]. We note that the theorem below is a variant of the Riesz–Thorin operator interpolation theorem (see, e.g., [3, 31]).

Theorem 1

Let $S\equiv \left\{ z\in \mathbb {C}:0\le {\text {Re}} \left\{ z\right\} \le 1\right\} $, and let $L( \mathcal {H}) $ be the space of bounded linear operators acting on a Hilbert space $\mathcal {H}$. Let $G:S\rightarrow L( \mathcal {H}) $ be a bounded map that is holomorphic on the interior of S and continuous on the boundary.^{Footnote 2} Let $\theta \in \left( 0,1\right) $ and define $p_{\theta }$ by

$$\begin{aligned} \frac{1}{p_{\theta }}=\frac{1-\theta }{p_{0}}+\frac{\theta }{p_{1}}, \end{aligned}$$

(33)

where $p_{0},p_{1}\in [1,\infty ]$. For $k=0,1$ define

$$\begin{aligned} M_{k}=\sup _{t\in \mathbb {R}}\left\| G\left( k+it\right) \right\| _{p_{k}}. \end{aligned}$$

(34)

Then

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{p_{\theta }}\le M_{0}^{1-\theta }M_{1}^{\theta }. \end{aligned}$$

(35)

3.4 Rényi generalizations of the quantum relative entropy difference

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1. In [35], two Rényi generalizations of the relative entropy difference in (13) were defined as follows:

$$\begin{aligned} \Delta _{\alpha }(\rho ,\sigma ,\mathcal {N})&\equiv \frac{1}{\alpha -1} \log {\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{\left( 1-\alpha \right) /2}\mathcal {N}^{\dag }\left( \left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\left[ \mathcal {N}(\rho )\right] ^{1-\alpha }\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\right) \sigma ^{\left( 1-\alpha \right) /2}\right\} ,\nonumber \\ \widetilde{\Delta }_{\alpha }(\rho ,\sigma ,\mathcal {N})&\equiv \frac{1}{\alpha ^{\prime }}\log \left\| \rho ^{1/2}\sigma ^{-\alpha ^{\prime } /2}\mathcal {N}^{\dag }\left( \left[ \mathcal {N}(\sigma )\right] ^{\alpha ^{\prime }/2}\left[ \mathcal {N}(\rho )\right] ^{-\alpha ^{\prime } }\left[ \mathcal {N}(\sigma )\right] ^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime }/2}\rho ^{1/2}\right\| _{\alpha }, \end{aligned}$$

(36)

where $\alpha ^{\prime }\equiv \left( \alpha -1\right) /\alpha $. Let U be an isometric extension of $\mathcal {N}$, so that

$$\begin{aligned} \mathcal {N}(\cdot )={\text {Tr}}_{E}\left\{ U\left( \cdot \right) U^{\dag }\right\} . \end{aligned}$$

(37)

We can write the adjoint $\mathcal {N}^{\dag }$ in terms of this isometric extension as follows:

$$\begin{aligned} \mathcal {N}^{\dag }(\cdot )=U^{\dag }\left( (\cdot )\otimes I_{E}\right) U. \end{aligned}$$

(38)

This then allows us to write the definitions above in a simpler form:

$$\begin{aligned} \Delta _{\alpha }(\rho ,\sigma ,\mathcal {N})&=\frac{2}{\alpha -1}\log \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}\rho ^{\alpha /2}\right\| _{2}, \end{aligned}$$

(39)

$$\begin{aligned} \widetilde{\Delta }_{\alpha }(\rho ,\sigma ,\mathcal {N})&=\frac{2}{\alpha ^{\prime }}\log \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{-\alpha ^{\prime }/2}\left[ \mathcal {N}(\sigma )\right] ^{\alpha ^{\prime } /2}\otimes I_{E}\right) U\sigma ^{-\alpha ^{\prime }/2}\rho ^{1/2}\right\| _{2\alpha }. \end{aligned}$$

(40)

It is known that the following limits hold for $\rho $, $\sigma $, and $\mathcal {N}$ taken as in Definition 2 [35]:

$$\begin{aligned} \lim _{\alpha \rightarrow 1}\Delta _{\alpha }(\rho ,\sigma ,\mathcal {N})=\lim _{\alpha \rightarrow 1}\widetilde{\Delta }_{\alpha }(\rho ,\sigma ,\mathcal {N} )=\Delta (\rho ,\sigma ,\mathcal {N}). \end{aligned}$$

(41)

The fact that these limits hold for $\rho $, $\sigma $, and $\mathcal {N}$ taken as in Definition 1 and subject to ${\text {supp}} (\rho )\subseteq {\text {supp}}(\sigma )$ follows from [45] and the development in “Appendix 1”. [12] proved that for $\alpha \in [0,1)\cup (1,2]$,

$$\begin{aligned} \Delta _{\alpha }(\rho ,\sigma ,\mathcal {N})\ge 0, \end{aligned}$$

(42)

and for $\alpha \in [1/2,1)\cup (1,\infty ]$:

$$\begin{aligned} \widetilde{\Delta }_{\alpha }(\rho ,\sigma ,\mathcal {N})\ge 0, \end{aligned}$$

(43)

when $\rho $, $\sigma $, and $\mathcal {N}$ are taken as in Definition 2. The latter inequality was refined recently in [45] for $\alpha \in (1/2,1]$ and for $\rho $, $\sigma $, and $\mathcal {N}$ taken as in Definition 1 and subject to ${\text {supp}} (\rho )\subseteq {\text {supp}}(\sigma )$. It remains an open question to determine whether these quantities are non-decreasing in $\alpha $ for any non-trivial range of $\alpha $ (note that [35] argued that they are non-decreasing in $\alpha $ in a neighborhood of $\alpha =1$).

4 Swiveled Rényi generalizations of the quantum relative entropy difference

In the spirit of the discussion in Sect. 2, we consider different definitions of $\Delta _{\alpha }( \rho ,\sigma ,\mathcal {N}) $ and $\widetilde{\Delta }_{\alpha }( \rho ,\sigma ,\mathcal {N}) $ in order to allow for unitary rotations about swivels, i.e., an optimization over unitaries of the form $V_{\mathcal {N}( \sigma ) }$ and $V_{\sigma }$:

Definition 3

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1. We define swiveled Rényi generalizations of the quantum relative entropy difference in (13) as follows:

$$\begin{aligned} \Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})&\equiv \frac{2}{\alpha -1}\max _{V_{\sigma },V_{\mathcal {N}(\sigma )}}\nonumber \\&\quad \log \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N} (\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma } \rho ^{\alpha /2}\right\| _{2}, \end{aligned}$$

(44)

$$\begin{aligned} \widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})&\equiv \frac{2}{\alpha ^{\prime }}\max _{V_{\sigma },V_{\mathcal {N}(\sigma )}}\nonumber \\&\quad \log \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{-\alpha ^{\prime } /2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\alpha ^{\prime }/2}\otimes I_{E}\right) U\sigma ^{-\alpha ^{\prime }/2}V_{\sigma } \rho ^{1/2}\right\| _{2\alpha }, \end{aligned}$$

(45)

where $\alpha ^{\prime }=\left( \alpha -1\right) /\alpha $ and the optimizations are over the compact sets of unitaries $V_{\sigma }$ and $V_{\mathcal {N} (\sigma )}$ commuting with $\sigma $ and $\mathcal {N}(\sigma )$, respectively.

This slight extra degree of freedom allows us to establish that $\Delta _{\alpha }^{\prime }$ and $\widetilde{\Delta }_{\alpha }^{\prime }$ are monotone non-decreasing in $\alpha $ for particular values (see Theorems 2 and 3).

4.1 Reduction to Rényi relative entropy

Observe that by choosing $\mathcal {N}={\text {Tr}}$, we find that $\Delta _{\alpha }^{\prime }$ reduces to the Rényi relative entropy whenever ${\text {supp}}(\rho )\subseteq {\text {supp}}(\sigma )$:

$$\begin{aligned} \Delta _{\alpha }^{\prime }(\rho ,\sigma ,{\text {Tr}})&=\frac{2}{\alpha -1}\log \left\| \sigma ^{\left( 1-\alpha \right) /2}\rho ^{\alpha /2}\right\| _{2}+\log {\text {Tr}}\left\{ \sigma \right\} \end{aligned}$$

(46)

$$\begin{aligned}&=D_{\alpha }(\rho \Vert \sigma )+\log {\text {Tr}}\left\{ \sigma \right\} , \end{aligned}$$

(47)

and $\widetilde{\Delta }_{\alpha }^{\prime }$ to the sandwiched Rényi relative entropy whenever ${\text {supp}}(\rho )\subseteq {\text {supp}}(\sigma )$:

$$\begin{aligned} \widetilde{\Delta }_{\alpha }^{\prime }\left( \rho ,\sigma ,{\text {Tr}} \right)&\equiv \frac{2}{\alpha ^{\prime }}\log \left\| \sigma ^{-\alpha ^{\prime }/2}\rho ^{1/2}\right\| _{2\alpha }+\log {\text {Tr}} \left\{ \sigma \right\} \end{aligned}$$

(48)

$$\begin{aligned}&=\widetilde{D}_{\alpha }(\rho \Vert \sigma )+\log {\text {Tr}}\left\{ \sigma \right\} , \end{aligned}$$

(49)

just as

$$\begin{aligned} \Delta (\rho ,\sigma ,{\text {Tr}})=D(\rho \Vert \sigma )+\log {\text {Tr}} \left\{ \sigma \right\} . \end{aligned}$$

(50)

4.2 Behavior around $\alpha =1$

Here we discuss the behavior of $\Delta _{\alpha }^{\prime }$ and $\widetilde{\Delta }_{\alpha }^{\prime }$ around $\alpha =1$, with the result being that these quantities are generally discontinuous at $\alpha =1$:

Proposition 1

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 2. Then

$$\begin{aligned} \lim _{\alpha \nearrow 1}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})&=\lim _{\alpha \nearrow 1}\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})=\min _{V_{\mathcal {N}(\sigma )},V_{\sigma }} f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }), \end{aligned}$$

(51)

$$\begin{aligned} \lim _{\alpha \searrow 1}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})&=\lim _{\alpha \searrow 1}\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})=\max _{V_{\mathcal {N}(\sigma )},V_{\sigma }} f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }), \end{aligned}$$

(52)

where

$$\begin{aligned}&f(1,V_{\mathcal {N}(\sigma )},V_{\sigma })\equiv {\text {Tr}}\left\{ \rho \left[ \log \rho -\log \sigma \right] \right\} \nonumber \\&\quad -{\text {Tr}}\left\{ \mathcal {N}\left( \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \right) \left[ \log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right] -\log \left[ \mathcal {N}(\sigma )\right] \right] \right\} . \end{aligned}$$

(53)

As a consequence, we have that

$$\begin{aligned} \min _{V_{\mathcal {N}(\sigma )},V_{\sigma }}f(1,V_{\mathcal {N}(\sigma )} ,V_{\sigma })\le f\left( 1,I,I\right) =\Delta (\rho ,\sigma ,\mathcal {N} )\le \max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }), \end{aligned}$$

(54)

and there is generally a discontinuity at $\alpha =1$.

Proof

Let $\mathcal {A}\subseteq [0,2]$, which we will choose shortly. Define the function $f:\mathcal {A}\times \mathbb {V}_{\mathcal {N}(\sigma )} \times \mathbb {V}_{\sigma }\rightarrow \mathbb {R}$ as

$$\begin{aligned}&f(\alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma })\equiv \frac{2}{\alpha -1}\nonumber \\&\quad \log \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma }\rho ^{\alpha /2}\right\| _{2}, \end{aligned}$$

(55)

whenever $\alpha \ne 1$, and $f(1,V_{\mathcal {N}(\sigma )},V_{\sigma })$ as in (53). One can check that

$$\begin{aligned} \lim _{\alpha \rightarrow 1}f(\alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma })=f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }), \end{aligned}$$

(56)

for example by performing Taylor expansions to calculate the limit (see “Appendix 3” for details of this calculation). The function f is then continuous in $\alpha $, $V_{\sigma }$, and $V_{\mathcal {N}(\sigma )}$. Furthermore, it fulfills the conditions of Lemma 1 in “Appendix 2” if we choose $\mathcal {A}=[1,M]$ for any $M\in (1,2]$ and $\mathcal {T}=\mathbb {V}_{\mathcal {N}(\sigma )}\times \mathbb {V}_{\sigma }$. Hence, we get that

$$\begin{aligned} \Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})=\max _{V_{\mathcal {N} (\sigma )},V_{\sigma }}f(\alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma }) \end{aligned}$$

(57)

is continuous on $\alpha \in [1,M]$ and thus

$$\begin{aligned} \lim _{\alpha \searrow 1}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N} )=\max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }). \end{aligned}$$

(58)

Repeating the same argument with $\mathcal {A}=[0,1]$ yields that

$$\begin{aligned} \Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})=\min _{V_{\mathcal {N} (\sigma )},V_{\sigma }}f(\alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma }) \end{aligned}$$

(59)

is continuous on [0, 1] and thus

$$\begin{aligned} \lim _{\alpha \nearrow 1}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N} )=\min _{V_{\mathcal {N}(\sigma )},V_{\sigma }}f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }). \end{aligned}$$

(60)

Given that $\Delta (\rho ,\sigma ,\mathcal {N})=f(1,I,I)$, we can conclude the following inequality:

$$\begin{aligned} \min _{V_{\mathcal {N}(\sigma )},V_{\sigma }}f(1,V_{\mathcal {N}(\sigma )} ,V_{\sigma })\le \Delta (\rho ,\sigma ,\mathcal {N})\le \max _{V_{\mathcal {N} (\sigma )},V_{\sigma }}f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }) \end{aligned}$$

(61)

The arguments for the quantity $\widetilde{\Delta }_{\alpha }^{\prime } (\rho ,\sigma ,\mathcal {N})$ are similar, so we just sketch them briefly. Define the function

$$\begin{aligned}&g(\alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma })\equiv \frac{2\alpha }{\alpha -1}\nonumber \\&\qquad \log \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2\alpha }V_{\mathcal {N}(\sigma )}\left[ \mathcal {N} (\sigma )\right] ^{\left( \alpha -1\right) /2\alpha }\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2\alpha }V_{\sigma }\rho ^{1/2}\right\| _{2\alpha },\nonumber \\ \end{aligned}$$

(62)

for $\alpha \ne 1$ and set $g(1,V_{\mathcal {N}(\sigma )},V_{\sigma })=f(1,V_{\mathcal {N}(\sigma )},V_{\sigma })$. One can then compute (again via Taylor expansions, e.g.) that

$$\begin{aligned} \lim _{\alpha \rightarrow 1}g(\alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma })=g(1,V_{\mathcal {N}(\sigma )},V_{\sigma }). \end{aligned}$$

(63)

The rest of the argument proceeds as above, which leads to the other equalities in (51)–(52).

4.3 Monotonicity in the Rényi parameter

This section contains our main result, that both $\Delta _{\alpha }^{\prime }$ and $\widetilde{\Delta }_{\alpha }^{\prime }$ are monotone non-decreasing with respect to $\alpha $ for particular values.

Theorem 2

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1. The swiveled Rényi quantity $\Delta _{\alpha }^{\prime }( \rho ,\sigma ,\mathcal {N}) $ is monotone non-decreasing with respect to $\alpha \in \left[ 0,1)\cup (1,2\right] $, in the sense that for $0\le \alpha \le \gamma \le 2$, $\alpha \ne 1$, and $\gamma \ne 1$

$$\begin{aligned} \Delta _{\alpha }^{\prime }( \rho ,\sigma ,\mathcal {N}) \le \Delta _{\gamma } ^{\prime }( \rho ,\sigma ,\mathcal {N}) . \end{aligned}$$

(64)

Proof

The main tool for our proof is Theorem 1. We break the proof of inequality in (64) into several cases. We first consider $1<\alpha <\gamma \le 2$. For some $W_{\mathcal {N}(\sigma )} \in \mathbb {V}_{\mathcal {N}(\sigma )}$ and $W_{\sigma }\in \mathbb {V}_{\sigma }$, pick

$$\begin{aligned} G\left( z\right)&=\left[ \mathcal {N}(\rho )\right] ^{-z\left( \gamma -1\right) /2}W_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{z\left( \gamma -1\right) /2}U\sigma ^{-z\left( \gamma -1\right) /2}W_{\sigma }\rho ^{\left( 1+z\left( \gamma -1\right) \right) /2} , \end{aligned}$$

(65)

$$\begin{aligned} p_{0}&=2,\end{aligned}$$

(66)

$$\begin{aligned} p_{1}&=2,\end{aligned}$$

(67)

$$\begin{aligned} \theta&=\frac{\alpha -1}{\gamma -1}\in \left( 0,1\right) , \end{aligned}$$

(68)

which fixes $p_{\theta }=2$. Then

$$\begin{aligned} M_{0}&=\sup _{t\in \mathbb {R}}\left\| G\left( it\right) \right\| _{2} \end{aligned}$$

(69)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-it\left( \gamma -1\right) /2}W_{\mathcal {N}(\sigma )}\left[ \mathcal {N} (\sigma )\right] ^{it\left( \gamma -1\right) /2}\right. \nonumber \\&\quad \left. U\sigma ^{-it\left( \gamma -1\right) /2}W_{\sigma }\rho ^{\left( 1+it\left( \gamma -1\right) \right) /2}\right\| _{2}\end{aligned}$$

(70)

$$\begin{aligned}&=\left\| \rho ^{1/2}\right\| _{2}=1,\end{aligned}$$

(71)

$$\begin{aligned} M_{1}&=\sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{2}\end{aligned}$$

(72)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\frac{\left( 1+it\right) }{2}\left( \gamma -1\right) }W_{\mathcal {N} (\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\frac{\left( 1+it\right) }{2}\left( \gamma -1\right) }\right. \nonumber \\&\left. U\sigma ^{-\frac{\left( 1+it\right) }{2}\left( \gamma -1\right) }W_{\sigma }\rho ^{\frac{\left( 1+\left( 1+it\right) \left( \gamma -1\right) \right) }{2}}\right\| _{2}\end{aligned}$$

(73)

$$\begin{aligned}&\le \max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}\left\| \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\gamma \right) /2}V_{\mathcal {N} (\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \gamma -1\right) /2}U\sigma ^{\left( 1-\gamma \right) /2}V_{\sigma }\rho ^{\gamma /2}\right\| _{2}\end{aligned}$$

(74)

$$\begin{aligned}&=\exp \left\{ \frac{\gamma -1}{2}\Delta _{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} ,\end{aligned}$$

(75)

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{2}&=\left\| \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2}W_{\mathcal {N} (\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}U\sigma ^{\left( 1-\alpha \right) /2}W_{\sigma }\rho ^{\alpha /2}\right\| _{2}. \end{aligned}$$

(76)

We then apply Theorem 1 to find that the following inequality holds for all $W_{\mathcal {N}(\sigma )}\in \mathbb {V}_{\mathcal {N} (\sigma )}$ and $W_{\sigma }\in \mathbb {V}_{\sigma }$:

$$\begin{aligned}&\left\| \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2}W_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}U\sigma ^{\left( 1-\alpha \right) /2}W_{\sigma } \rho ^{\alpha /2}\right\| _{2}\nonumber \\&\quad \le \left[ \exp \left\{ \frac{\gamma -1}{2}\Delta _{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \right] ^{\frac{\alpha -1}{\gamma -1}}. \end{aligned}$$

(77)

As a consequence, we can take the maximum over all $W_{\mathcal {N}(\sigma )} \in \mathbb {V}_{\mathcal {N}(\sigma )}$ and $W_{\sigma }\in \mathbb {V}_{\sigma }$ and apply the definition in (44) to establish that

$$\begin{aligned} \exp \left\{ \frac{\alpha -1}{2}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \le \left[ \exp \left\{ \frac{\gamma -1}{2} \Delta _{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \right] ^{\frac{\alpha -1}{\gamma -1}}. \end{aligned}$$

(78)

We finally apply a logarithm to arrive at the conclusion that (64) holds for all $1<\alpha <\gamma \le 2$.

To get the monotonicity for the range $0\le \alpha <\gamma <1$, we exchange $\alpha $ and $\gamma $ in (65)–(68) and apply the same reasoning as in (69)–(77) to arrive at the following inequality:

$$\begin{aligned} \exp \left\{ \frac{\gamma -1}{2}\Delta _{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \le \left[ \exp \left\{ \frac{\alpha -1}{2} \Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \right] ^{\frac{\gamma -1}{\alpha -1}}. \end{aligned}$$

(79)

Taking a negative logarithm and noting that $0\le \alpha <\gamma <1$ then gives (64) for this range.

We are now left with proving the case $\alpha \in [0,1)$ and $\gamma \in (1,2]$ the dual parameter of $\alpha $, such that $\alpha +\gamma =2$. Notice that $\alpha -1=-\left( \gamma -1\right) $. Let $f\left( z,\gamma \right) =\left( 1-2z\right) \left( \gamma -1\right) $. We pick

$$\begin{aligned} G\left( z\right)&=\left[ \mathcal {N}(\rho )\right] ^{-f\left( z,\gamma \right) /2}\left[ \mathcal {N}(\sigma )\right] ^{f\left( z,\gamma \right) /2}U\sigma ^{-f\left( z,\gamma \right) /2}\rho ^{\left( 1+f\left( z,\gamma \right) \right) /2},\end{aligned}$$

(80)

$$\begin{aligned} p_{0}&=2,\end{aligned}$$

(81)

$$\begin{aligned} p_{1}&=2,\end{aligned}$$

(82)

$$\begin{aligned} \theta&=1/2, \end{aligned}$$

(83)

so that $p_{\theta }=2$. Consider that $f\left( \theta ,\gamma \right) =0$, so that

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{2}&=\left\| \left[ \mathcal {N}(\rho )\right] ^{-f\left( \theta ,\gamma \right) /2}\left[ \mathcal {N}(\sigma )\right] ^{f\left( \theta ,\gamma \right) /2} U\sigma ^{-f\left( \theta ,\gamma \right) /2}\rho ^{\left( 1+f\left( \theta ,\gamma \right) \right) /2}\right\| _{2}\end{aligned}$$

(84)

$$\begin{aligned}&=\left\| U\rho ^{1/2}\right\| _{2}=\left\| \rho ^{1/2}\right\| _{2}=1. \end{aligned}$$

(85)

We then find that

$$\begin{aligned} M_{0}&=\sup _{t\in \mathbb {R}}\left\| G\left( it\right) \right\| _{2} \end{aligned}$$

(86)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\left( 1-2it\right) \left( \gamma -1\right) /2}\left[ \mathcal {N} (\sigma )\right] ^{\left( 1-2it\right) \left( \gamma -1\right) /2}\right. \nonumber \\&\left. U\sigma ^{-\left( 1-2it\right) \left( \gamma -1\right) /2}\rho ^{\left( 1+\left( 1-2it\right) \left( \gamma -1\right) \right) /2}\right\| _{2}\end{aligned}$$

(87)

$$\begin{aligned}&\le \max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}\left\| \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\gamma \right) /2}V_{\mathcal {N} (\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \gamma -1\right) /2}U\sigma ^{\left( 1-\gamma \right) /2}V_{\sigma }\rho ^{\gamma /2}\right\| _{2}\end{aligned}$$

(88)

$$\begin{aligned}&=\exp \left\{ \frac{\gamma -1}{2}\Delta _{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} . \end{aligned}$$

(89)

Consider that

$$\begin{aligned} f\left( 1+it,\gamma \right) =\left( 1-2\left( 1+it\right) \right) \left( \gamma -1\right) =-\left( 1+2it\right) \left( \gamma -1\right) =\left( 1+2it\right) \left( \alpha -1\right) . \end{aligned}$$

(90)

Thus, similarly, we have

$$\begin{aligned} M_{1}&=\sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{2}\end{aligned}$$

(91)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\left( 1+2it\right) \left( \alpha -1\right) /2}\left[ \mathcal {N} (\sigma )\right] ^{\left( 1+2it\right) \left( \alpha -1\right) /2}\right. \nonumber \\&\left. U\sigma ^{-\left( 1+2it\right) \left( \alpha -1\right) /2}\rho ^{\left( 1+\left( 1+2it\right) \left( \alpha -1\right) \right) /2}\right\| _{2}\end{aligned}$$

(92)

$$\begin{aligned}&\le \max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}\left\| \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N} (\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma }\rho ^{\alpha /2}\right\| _{2}\end{aligned}$$

(93)

$$\begin{aligned}&=\exp \left\{ \frac{\alpha -1}{2}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} . \end{aligned}$$

(94)

Applying Theorem 1 gives

$$\begin{aligned} 1&\le \exp \left\{ \frac{\gamma -1}{4}\Delta _{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \exp \left\{ \frac{\alpha -1}{4}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \end{aligned}$$

(95)

$$\begin{aligned}&=\exp \left\{ \frac{\gamma -1}{4}\Delta _{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \exp \left\{ \frac{-\left( \gamma -1\right) }{4}\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} , \end{aligned}$$

(96)

which implies (64) for $\alpha \in [0,1)$ and $\gamma =2-\alpha $. Putting the three cases together along with Proposition 1 gives the inequality in (64) for $0\le \alpha \le \gamma \le 2$, $\alpha \ne 1$, and $\gamma \ne 1$.

Theorem 3

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1. The swiveled Rényi quantity $\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})$ is monotone non-decreasing with respect to $\alpha \in [1/2,1)\cup (1,\infty ]$, in the sense that for $1/2\le \alpha \le \gamma \le \infty $, $\alpha \ne 1$, and $\gamma \ne 1$

$$\begin{aligned} \widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\le \widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N}). \end{aligned}$$

(97)

Proof

We handle the inequality in (97) in a similar way as in the previous proof. First, suppose that $1<\alpha <\gamma $. Let $\alpha ^{\prime }=\left( \alpha -1\right) /\alpha $ and $\gamma ^{\prime }=\left( \gamma -1\right) /\gamma $, and note that $\alpha ^{\prime } ,\gamma ^{\prime }>0$ for the choices given. For some $W_{\mathcal {N}(\sigma )}\in \mathbb {V}_{\mathcal {N}(\sigma )}$ and $W_{\sigma }\in \mathbb {V}_{\sigma }$, pick

$$\begin{aligned} G\left( z\right)&=\left[ \mathcal {N}(\rho )\right] ^{-z\gamma ^{\prime }/2}W_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{z\gamma ^{\prime }/2}U\sigma ^{-z\gamma ^{\prime }/2}W_{\sigma }\rho ^{1/2} , \end{aligned}$$

(98)

$$\begin{aligned} p_{0}&=2,\end{aligned}$$

(99)

$$\begin{aligned} p_{1}&=2\gamma ,\end{aligned}$$

(100)

$$\begin{aligned} \theta&=\frac{\alpha ^{\prime }}{\gamma ^{\prime }}\in \left( 0,1\right) , \end{aligned}$$

(101)

which fixes $p_{\theta }=2\alpha $. Then we find the following expression for $M_{0}$

$$\begin{aligned} M_{0}&=\sup _{t\in \mathbb {R}}\left\| G\left( it\right) \right\| _{2}\end{aligned}$$

(102)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-it\gamma ^{\prime }/2}W_{\mathcal {N}(\sigma )}\left[ \mathcal {N} (\sigma )\right] ^{it\gamma ^{\prime }/2}U\sigma ^{-it\gamma ^{\prime }/2} W_{\sigma }\rho ^{1/2}\right\| _{2}\end{aligned}$$

(103)

$$\begin{aligned}&=\left\| \rho ^{1/2}\right\| _{2}=1, \end{aligned}$$

(104)

and the following ones for $M_{1}$ and $\left\| G\left( \theta \right) \right\| _{2\alpha }$:

$$\begin{aligned} M_{1}&=\sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{2\gamma }\end{aligned}$$

(105)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\left( 1+it\right) \gamma ^{\prime }/2}W_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( 1+it\right) \gamma ^{\prime }/2}\right. \nonumber \\&\quad \left. U\sigma ^{-\left( 1+it\right) \gamma ^{\prime }/2}W_{\sigma }\rho ^{1/2} \right\| _{2\gamma }\end{aligned}$$

(106)

$$\begin{aligned}&\le \max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\gamma ^{\prime }/2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\gamma ^{\prime }/2}U\sigma ^{-\gamma ^{\prime } /2}V_{\sigma }\rho ^{1/2}\right\| _{2\gamma }\end{aligned}$$

(107)

$$\begin{aligned}&=\exp \left\{ \frac{\gamma ^{\prime }}{2}\widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} ,\end{aligned}$$

(108)

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{2\alpha }&=\left\| \left[ \mathcal {N}(\rho )\right] ^{-\alpha ^{\prime }/2}W_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\alpha ^{\prime }/2}U\sigma ^{-\alpha ^{\prime }/2}W_{\sigma }\rho ^{1/2}\right\| _{2\alpha }. \end{aligned}$$

(109)

Applying Theorem 1, we find that the following inequality holds for all $W_{\mathcal {N}(\sigma )}\in \mathbb {V}_{\mathcal {N}(\sigma )}$ and $W_{\sigma }\in \mathbb {V}_{\sigma }$:

$$\begin{aligned} \left\| \left[ \mathcal {N}(\rho )\right] ^{-\alpha ^{\prime }/2} W_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\alpha ^{\prime }/2}U\sigma ^{-\alpha ^{\prime }/2}W_{\sigma }\rho ^{1/2}\right\| _{2\alpha } \le \left[ \exp \left\{ \frac{\gamma ^{\prime }}{2}\widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \right] ^{\frac{\alpha ^{\prime } }{\gamma ^{\prime }}}. \end{aligned}$$

(110)

We can then take a maximum over all $W_{\mathcal {N}(\sigma )}\in \mathbb {V} _{\mathcal {N}(\sigma )}$ and $W_{\sigma }\in \mathbb {V}_{\sigma }$ and apply the definition in (45) to establish that

$$\begin{aligned} \exp \left\{ \frac{\alpha ^{\prime }}{2}\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \le \left[ \exp \left\{ \frac{\gamma ^{\prime }}{2}\widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \right] ^{\frac{\alpha ^{\prime }}{\gamma ^{\prime }}}. \end{aligned}$$

(111)

The inequality in (97) then follows for $1<\alpha <\gamma $ after taking a logarithm.

To get the monotonicity for the range $1/2\le \alpha <\gamma <1$, we exchange $\alpha $ and $\gamma $ in (98)–(101) and apply the same reasoning as in (102)–(110) to arrive at the following inequality:

$$\begin{aligned} \exp \left\{ \frac{\gamma ^{\prime }}{2}\widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \le \left[ \exp \left\{ \frac{\alpha ^{\prime }}{2}\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \right] ^{\frac{\gamma \prime }{\alpha ^{\prime }}}. \end{aligned}$$

(112)

Taking a negative logarithm and noting that $1/2\le \alpha <\gamma <1$, so that $\alpha ^{\prime },\gamma ^{\prime }\in [-1,0)$, then gives (97) for this range.

We are now left with proving the case $\alpha \in [1/2,1)$ and $\gamma \in (1,\infty ]$ the dual parameter of $\alpha $: such that $1/\alpha +1/\gamma =2$. Notice that $\alpha ^{\prime }=-\gamma ^{\prime }$ and we have that $\gamma ^{\prime }>0$. We pick

$$\begin{aligned} G\left( z\right)&=\left[ \mathcal {N}(\rho )\right] ^{-\left( 1-2z\right) \alpha ^{\prime }/2}\left[ \mathcal {N}(\sigma )\right] ^{\left( 1-2z\right) \alpha ^{\prime }/2}U\sigma ^{-\left( 1-2z\right) \alpha ^{\prime }/2}\rho ^{1/2},\end{aligned}$$

(113)

$$\begin{aligned} p_{0}&=2\alpha ,\end{aligned}$$

(114)

$$\begin{aligned} p_{1}&=2\gamma ,\end{aligned}$$

(115)

$$\begin{aligned} \theta&=1/2, \end{aligned}$$

(116)

so that $p_{\theta }=2$. Consider that

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{2}&=\left\| \left[ \mathcal {N}(\rho )\right] ^{-\left( 1-2\theta \right) \alpha ^{\prime } /2}\left[ \mathcal {N}(\sigma )\right] ^{\left( 1-2\theta \right) \alpha ^{\prime }/2}U\sigma ^{-\left( 1-2\theta \right) \alpha ^{\prime }/2} \rho ^{1/2}\right\| _{2}\end{aligned}$$

(117)

$$\begin{aligned}&=\left\| U\rho ^{1/2}\right\| _{2}=\left\| \rho ^{1/2}\right\| _{2}=1. \end{aligned}$$

(118)

We then find that

$$\begin{aligned} M_{0}&=\sup _{t\in \mathbb {R}}\left\| G\left( it\right) \right\| _{2\alpha }\end{aligned}$$

(119)

$$\begin{aligned} \mathbf{}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\left( 1-2it\right) \alpha ^{\prime }/2}\left[ \mathcal {N}(\sigma )\right] ^{\left( 1-2it\right) \alpha ^{\prime }/2}U\sigma ^{-\left( 1-2it\right) \alpha ^{\prime }/2}\rho ^{1/2}\right\| _{2\alpha }\end{aligned}$$

(120)

$$\begin{aligned}&\le \max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\alpha ^{\prime }/2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\alpha ^{\prime }/2}U\sigma ^{-\alpha ^{\prime } /2}V_{\sigma }\rho ^{1/2}\right\| _{2\alpha }\end{aligned}$$

(121)

$$\begin{aligned}&=\exp \left\{ \frac{\alpha ^{\prime }}{2}\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} . \end{aligned}$$

(122)

Consider that

$$\begin{aligned} \left( 1-2\left( 1+it\right) \right) \alpha ^{\prime }=-\left( 1+2it\right) \alpha ^{\prime }=\left( 1+2it\right) \gamma ^{\prime }. \end{aligned}$$

(123)

Thus, similarly, we have

$$\begin{aligned} M_{1}&=\sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{2\gamma }\end{aligned}$$

(124)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\left( 1+2it\right) \gamma ^{\prime }/2}\left[ \mathcal {N}(\sigma )\right] ^{\left( 1+2it\right) \gamma ^{\prime }/2}U\sigma ^{-\left( 1+2it\right) \gamma ^{\prime }/2}\rho ^{1/2}\right\| _{2\gamma }\end{aligned}$$

(125)

$$\begin{aligned}&\le \max _{V_{\mathcal {N}(\sigma )},V_{\sigma }}\left\| \left[ \mathcal {N}(\rho )\right] ^{-\gamma ^{\prime }/2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\gamma ^{\prime }/2}U\sigma ^{-\gamma ^{\prime } /2}V_{\sigma }\rho ^{1/2}\right\| _{2\gamma }\end{aligned}$$

(126)

$$\begin{aligned}&=\exp \left\{ \frac{\gamma ^{\prime }}{2}\widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} . \end{aligned}$$

(127)

Applying Theorem 1 gives

$$\begin{aligned} 1&\le \exp \left\{ \frac{\alpha ^{\prime }}{4}\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \exp \left\{ \frac{\gamma ^{\prime }}{4}\widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N} )\right\} \end{aligned}$$

(128)

$$\begin{aligned}&=\exp \left\{ -\frac{\gamma ^{\prime }}{4}\widetilde{\Delta }_{\alpha } ^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} \exp \left\{ \frac{\gamma ^{\prime }}{4}\widetilde{\Delta }_{\gamma }^{\prime }(\rho ,\sigma ,\mathcal {N})\right\} , \end{aligned}$$

(129)

which implies (97) for $\alpha \in [1/2,1)$ and $1/\gamma =2-1/\alpha $. Putting the three cases together along with Proposition 1 gives the inequality in (97) for $1/2\le \alpha \le \gamma \le \infty $, $\alpha \ne 1$, and $\gamma \ne 1$.

4.4 Bounds for the quantum relative entropy difference

A recent work [45] established refinements of the monotonicity of quantum relative entropy, strong subadditivity, and other entropy inequalities. In this section, we point out that these results follow as a consequence of the properties of the swiveled Rényi entropies and along the way establish two new refinements of these entropy inequalities.

We begin with a brief background. Let $\mathcal {P}_{\sigma ,\mathcal {N}}$ denote the Petz recovery map [29, 30] (see also [1]):

$$\begin{aligned} \mathcal {P}_{\sigma ,\mathcal {N}}(\cdot )\equiv \sigma ^{1/2}\mathcal {N}^{\dag }\left( \left[ \mathcal {N}(\sigma )\right] ^{-1/2}(\cdot )\left[ \mathcal {N}(\sigma )\right] ^{-1/2}\right) \sigma ^{1/2} , \end{aligned}$$

(130)

and let $\mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}$ denote the swiveled Petz recovery map

$$\begin{aligned} \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}(\cdot )\equiv \left( \mathcal {W} _{\sigma }\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {V}_{\mathcal {N} (\sigma )}\right) (\cdot ), \end{aligned}$$

(131)

where the partial isometric map $\mathcal {V}_{\mathcal {N}(\sigma )}$ is defined by

$$\begin{aligned} \mathcal {V}_{\mathcal {N}(\sigma )}(\cdot )=V_{\mathcal {N}(\sigma )} (\cdot )V_{\mathcal {N}(\sigma )}^{\dag }, \end{aligned}$$

(132)

and similarly for $\mathcal {W}_{\sigma }$, so that $\mathcal {V}_{\mathcal {N} (\sigma )}\left( \mathcal {N}(\sigma )\right) =\mathcal {N}(\sigma )$ and $\mathcal {W}_{\sigma }(\sigma )=\sigma $. Observe then that

$$\begin{aligned} \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}(\sigma )\right) =\sigma . \end{aligned}$$

(133)

Consider that particular values of $\alpha $ for $\Delta _{\alpha }^{\prime } (\rho ,\sigma ,\mathcal {N})$ and $\widetilde{\Delta }_{\alpha }^{\prime } (\rho ,\sigma ,\mathcal {N})$ lead to the following quantities, which can be interpreted as a (pseudo-) distance from the state $\rho $ to the state $\mathcal {N}(\rho )$ after a recovery channel $\mathcal {R}_{\sigma ,\mathcal {N} }^{V,W}$ is applied:

$$\begin{aligned} \Delta _{0}^{\prime }(\rho ,\sigma ,\mathcal {N})&=\min _{V_{\mathcal {N}(\sigma )},W_{\sigma }}D_{0}\left( \rho \left\| \mathcal {R}_{\sigma ,\mathcal {N} }^{V,W}\left( \mathcal {N}(\rho )\right) \right) \right. ,\end{aligned}$$

(134)

$$\begin{aligned} \widetilde{\Delta }_{1/2}^{\prime }(\rho ,\sigma ,\mathcal {N})&=-\log \max _{V_{\mathcal {N}(\sigma )},W_{\sigma }}F\left( \rho ,\mathcal {R} _{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}\left( \rho \right) \right) \right) . \end{aligned}$$

(135)

These observations combined with the monotonicity from Theorems 2 and 3 and the facts that $D_{0}(\rho \Vert \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N} (\rho )\right) )\ge 0$ and $-\log \max _{V_{\mathcal {N}(\sigma )},W_{\sigma } }F(\rho ,\mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}\left( \rho \right) \right) )\ge 0$ allow us to conclude the following:

Corollary 1

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1. The swiveled Rényi quantity $\Delta _{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})$ is non-negative for $\alpha \in [0,1)\cup (1,2]$ and $\widetilde{\Delta }_{\alpha }^{\prime }(\rho ,\sigma ,\mathcal {N})$ is non-negative for $\alpha \in [1/2,1)\cup (1,\infty ]$.

In order to establish the upper bounds in this section, we need to take $\rho $, $\sigma $, and $\mathcal {N}$ as given in the following definition:

Definition 4

Let $\rho _{SE^{\prime }}$ be a positive definite density operator and let $\sigma _{SE^{\prime }}$ be a positive definite operator, each acting on a finite-dimensional tensor-product Hilbert space $\mathcal {H} _{S}\otimes \mathcal {H}_{E^{\prime }}$. Let $\mathcal {N}$ be a quantum channel given as follows:

$$\begin{aligned} \mathcal {N}\left( \theta _{SE^{\prime }}\right) ={\text {Tr}}_{E}\left\{ U_{SE^{\prime }\rightarrow BE}\theta _{SE^{\prime }}U_{SE^{\prime }\rightarrow BE}^{\dag }\right\} , \end{aligned}$$

(136)

where $U_{SE^{\prime }\rightarrow BE}$ is a unitary operator taking $\mathcal {H}_{S}\otimes \mathcal {H}_{E^{\prime }}$ to an isomorphic finite-dimensional tensor-product Hilbert space $\mathcal {H}_{B} \otimes \mathcal {H}_{E}$, such that $\mathcal {N}( \rho ) $ and $\mathcal {N}( \sigma ) $ are each positive definite and act on $\mathcal {H}_{B}$.

If $\rho $, $\sigma $, and $\mathcal {N}$ are taken as in Definition 4, then the following relations hold

$$\begin{aligned} \Delta _{2}^{\prime }( \rho ,\sigma ,\mathcal {N})&=\max _{V_{\mathcal {N}( \sigma ) },W_{\sigma }}D_{2}\left( \rho \left\| \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}( \rho ) \right) \right) \right. , \end{aligned}$$

(137)

$$\begin{aligned} \widetilde{\Delta }_{\infty }^{\prime }( \rho ,\sigma ,\mathcal {N})&=\max _{V_{\mathcal {N}( \sigma ) },W_{\sigma }}D_{\max }\left( \rho \left\| \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}( \rho ) \right) \right) \right. . \end{aligned}$$

(138)

The main contribution of the recent work [45] was to show that the relative entropy difference $\Delta (\rho ,\sigma ,\mathcal {N})$ in (13) can be bounded from below by (135). In the case that $\rho $, $\sigma $, and $\mathcal {N}$ are taken as in Definition 4, then $\Delta (\rho ,\sigma ,\mathcal {N})$ can be bounded from above by (138). We find here that these results are an immediate corollary of Proposition 1 and Theorem 2, and we also obtain two new bounds on $\Delta (\rho ,\sigma ,\mathcal {N})$ in terms of $\Delta _{0}(\rho ,\sigma ,\mathcal {N})$ and $\Delta _{2}(\rho ,\sigma ,\mathcal {N})$:

Corollary 2

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1 and such that ${\text {supp}} (\rho )\subseteq {\text {supp}}(\sigma )$. Then the following inequalities hold

$$\begin{aligned} -\log \max _{V_{\mathcal {N}(\sigma )},W_{\sigma }}F\left( \rho ,\mathcal {R} _{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}\left( \rho \right) \right) \right)&\le D(\rho \Vert \sigma )-D\left( \mathcal {N}(\rho )\Vert \mathcal {N}(\sigma )\right) ,\end{aligned}$$

(139)

$$\begin{aligned} \min _{V_{\mathcal {N}(\sigma )},W_{\sigma }}D_{0}\left( \rho \left\| \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}(\rho )\right) \right) \right.&\le D(\rho \Vert \sigma )-D\left( \mathcal {N}(\rho )\Vert \mathcal {N}(\sigma )\right) . \end{aligned}$$

(140)

If $\rho $, $\sigma $, and $\mathcal {N}$ are as given in Definition 4, then the following inequalities hold

$$\begin{aligned} D(\rho \Vert \sigma )-D\left( \mathcal {N}(\rho )\Vert \mathcal {N}(\sigma )\right)&\le \max _{V_{\mathcal {N}(\sigma )},W_{\sigma }}D_{\max }\left( \rho \left\| \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N} (\rho )\right) \right) \right. ,\end{aligned}$$

(141)

$$\begin{aligned} D(\rho \Vert \sigma )-D\left( \mathcal {N}(\rho )\Vert \mathcal {N}(\sigma )\right)&\le \max _{V_{\mathcal {N}(\sigma )},W_{\sigma }}D_{2}\left( \rho \left\| \mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N} (\rho )\right) \right) \right. . \end{aligned}$$

(142)

As discussed in [45] (see also [4, 35]), Corollary 2 can be viewed as providing a physically meaningful refinement of the monotonicity of quantum relative entropy in (12). The bound

$$\begin{aligned} -\log \max _{V_{\mathcal {N}(\sigma )},W_{\sigma }}F\left( \rho ,\mathcal {R} _{\sigma ,\mathcal {N}}^{V,W}\left( \mathcal {N}\left( \rho \right) \right) \right) \le D(\rho \Vert \sigma )-D\left( \mathcal {N}(\rho )\Vert \mathcal {N}(\sigma )\right) \end{aligned}$$

(143)

shows that if the decrease in relative entropy is small after the channel $\mathcal {N}$ acts, then it is possible to perform the recovery map $\mathcal {R}_{\sigma ,\mathcal {N}}^{V,W}$ such that $\sigma $ is recovered perfectly from $\mathcal {N}(\sigma )$, while the recovery of $\rho $ from $\mathcal {N}(\rho )$ has a performance limited by the bound above. This result has far reaching implications in quantum information theory as discussed in [45] (see also [4, 35]).

We mention here that it is also possible to obtain bounds of the form from [45], with a single “ time” variable $t\in \mathbb {R}$. The method of proof is similar to that for Theorem 4 in [45], so we give it in “Appendix 2”. The formal statement is as follows:

Theorem 4

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1 and such that ${\text {supp}} (\rho )\subseteq {\text {supp}}(\sigma )$. Then the following inequalities hold

$$\begin{aligned} \inf _{t\in \mathbb {R}}D_{0}\left( \rho \left\| \mathcal {R}_{\sigma ,\mathcal {N}}^{t}\left( \mathcal {N(}\rho )\right) \right) \le D(\rho \Vert \sigma )-D\left( \mathcal {N(}\rho )\Vert \mathcal {N(}\sigma )\right) \right. , \end{aligned}$$

(144)

where $\mathcal {R}_{\sigma ,\mathcal {N}}^{t}$ is the following rotated Petz recovery map:

$$\begin{aligned} \mathcal {R}_{\sigma ,\mathcal {N}}^{t}(\cdot )\equiv \left( \mathcal {U} _{\sigma ,t}\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {U} _{\mathcal {N}(\sigma ),-t}\right) \left( \cdot \right) , \end{aligned}$$

(145)

$\mathcal {P}_{\sigma ,\mathcal {N}}$ is the Petz recovery map defined in (130), and $\mathcal {U}_{\sigma ,t}$ and $\mathcal {U}_{\mathcal {N}(\sigma ),-t}$ are partial isometric maps defined from

$$\begin{aligned} \mathcal {U}_{\omega ,t}(\cdot )\equiv \omega ^{it}\left( \cdot \right) \omega ^{-it}, \end{aligned}$$

(146)

with $\omega $ a positive semi-definite operator. If $\rho $, $\sigma $, and $\mathcal {N}$ are as given in Definition 4, then

$$\begin{aligned} D(\rho \Vert \sigma )-D\left( \mathcal {N(}\rho )\Vert \mathcal {N(}\sigma )\right) \le \sup _{t\in \mathbb {R}}D_{2}\left( \rho \left. \Vert \mathcal {R} _{\sigma ,\mathcal {N}}^{t}(\mathcal {N(}\rho ))\right) \right. . \end{aligned}$$

(147)

Remark 1

Note that it is possible to establish “universal” versions of the above inequalities, by employing Hirschman’s improvement [19] of the Hadamard three-line theorem, as done in [20].

5 Swiveled Rényi conditional mutual information

In this section, we show how swiveled Rényi conditional mutual information are special cases of the quantities defined in the previous section. Furthermore, they satisfy some of the properties that one would expect to hold for a Rényi generalization of the conditional mutual information. However, they generally do not converge to the conditional mutual information in the limit as $\alpha \rightarrow 1$.

Let $\rho _{ABC}$ be a density operator. Following from the observation [21] that

$$\begin{aligned} I( A;B|C) _{\rho }=\Delta ( \rho ,\sigma ,\mathcal {N}) , \end{aligned}$$

(148)

for the choices

$$\begin{aligned} \rho =\rho _{ABC},\quad \sigma =\rho _{AC}\otimes I_{B},\quad \mathcal {N} ={\text {Tr}}_{A}, \end{aligned}$$

(149)

we define the Rényi conditional mutual information to be a special case of $\Delta _{\alpha }^{\prime }( \rho ,\sigma ,\mathcal {N}) $ and $\widetilde{\Delta }_{\alpha }^{\prime }( \rho ,\sigma ,\mathcal {N}) $. That is, by setting

$$\begin{aligned} I_{\alpha }^{\prime }( A;B|C) _{\rho }&=\Delta _{\alpha }^{\prime }( \rho _{ABC},\rho _{AC}\otimes I_{B},{\text {Tr}}_{A}) , \end{aligned}$$

(150)

$$\begin{aligned} \widetilde{I}_{\alpha }^{\prime }( A;B|C) _{\rho }&=\widetilde{\Delta }_{\alpha }^{\prime }( \rho _{ABC},\rho _{AC}\otimes I_{B},{\text {Tr}}_{A}), \end{aligned}$$

(151)

we obtain the swiveled Rényi conditional mutual information stated in the following definition:

Definition 5

The swiveled Rényi conditional mutual information are defined for a density operator $\rho _{ABC}$ and $\alpha \in \left( 0,1\right) \cup \left( 1,\infty \right) $ as follows:

$$\begin{aligned} I_{\alpha }^{\prime }( A;B|C) _{\rho }&\equiv \frac{2}{\alpha -1}\max _{V_{\rho _{AC}},V_{\rho _{C}}}\log \left\| \rho _{BC}^{\left( 1-\alpha \right) /2}V_{\rho _{C}}\rho _{C}^{\left( \alpha -1\right) /2}\rho _{AC}^{\left( 1-\alpha \right) /2}V_{\rho _{AC}}\rho _{ABC}^{\alpha /2}\right\| _{2},\end{aligned}$$

(152)

$$\begin{aligned} \widetilde{I}_{\alpha }^{\prime }( A;B|C) _{\rho }&\equiv \frac{2}{\alpha ^{\prime }}\max _{V_{\rho _{AC}},V_{\rho _{C}}}\log \left\| \rho _{BC}^{-\alpha ^{\prime }/2}V_{\rho _{C}}\rho _{C}^{\alpha ^{\prime }/2}\rho _{AC}^{-\alpha ^{\prime }/2}V_{\rho _{AC}}\rho _{ABC}^{1/2}\right\| _{2\alpha }, \end{aligned}$$

(153)

where $\alpha ^{\prime }=\left( \alpha -1\right) /\alpha $.

We can now easily show that the Rényi conditional mutual information as defined above satisfy several natural properties, with the exception of convergence to the von Neumann conditional mutual information.

The following is a consequence of (150 )–(151) and Proposition 1:

Corollary 3

Let $\rho _{ABC}$ be a positive definite density operator. Then

$$\begin{aligned} \lim _{\alpha \nearrow 1}I_{\alpha }^{\prime }( A;B|C) _{\rho }&=\lim _{\alpha \nearrow 1}\widetilde{I}_{\alpha }^{\prime }( A;B|C) _{\rho }\end{aligned}$$

(154)

$$\begin{aligned}&\le I( A;B|C) _{\rho }\end{aligned}$$

(155)

$$\begin{aligned}&\le \lim _{\alpha \searrow 1}I_{\alpha }^{\prime }( A;B|C) _{\rho } \end{aligned}$$

(156)

$$\begin{aligned}&=\lim _{\alpha \searrow 1}\widetilde{I}_{\alpha }^{\prime }( A;B|C) _{\rho }. \end{aligned}$$

(157)

They are monotone non-decreasing with respect to the parameter $\alpha $, which follows from (150)–(151) and Theorems 2 and 3:

Corollary 4

Let $\rho _{ABC}$ be a density operator. The swiveled Rényi conditional mutual information $I_{\alpha }^{\prime }( A;B|C) _{\rho }$ and $\widetilde{I}_{\alpha }^{\prime }( A;B|C) _{\rho }$ are monotone non-decreasing with respect to the Rényi parameter for particular values. For $0\le \alpha \le \gamma \le 2$, $\alpha \ne 1$, and $\gamma \ne 1$, we have that

$$\begin{aligned} I_{\alpha }^{\prime }( A;B|C) _{\rho }\le I_{\gamma }^{\prime }( A;B|C) _{\rho }, \end{aligned}$$

(158)

and for $1/2\le \alpha \le \gamma \le \infty $, $\alpha \ne 1$, and $\gamma \ne 1$,

$$\begin{aligned} \widetilde{I}_{\alpha }^{\prime }( A;B|C) _{\rho }\le \widetilde{I}_{\gamma }^{\prime }( A;B|C) _{\rho }. \end{aligned}$$

(159)

They are monotone non-increasing with respect to a quantum channel acting on the B system, which follows by invoking [5, Lemmas 13 and 23]:

Corollary 5

Let $\rho _{ABC}$ be a positive definite density operator, and let $\mathcal {N}_{B\rightarrow B^{\prime }}$ be a quantum channel such that

$$\begin{aligned} \sigma _{AB^{\prime }C}\equiv \mathcal {N}_{B\rightarrow B^{\prime }}( \rho _{ABC}) \end{aligned}$$

(160)

is a positive definite density operator. Then for all $\alpha \in [0,1)\cup (1,2]$, the following inequality holds

$$\begin{aligned} I_{\alpha }^{\prime }( A;B|C) _{\rho }\ge I_{\alpha }^{\prime }( A;B^{\prime }|C) _{\sigma }, \end{aligned}$$

(161)

and for all $\alpha \in [1/2,1)\cup (1,\infty ]$, the following inequality holds

$$\begin{aligned} \widetilde{I}_{\alpha }^{\prime }( A;B|C) _{\rho }\ge \widetilde{I}_{\alpha }^{\prime }( A;B^{\prime }|C) _{\sigma }. \end{aligned}$$

(162)

Corollary 4, Proposition 1, and (150)–(151) then imply the following refinements of the strong subaddivity of quantum entropy, two of which were already determined in [45]:

Corollary 6

Let $\rho _{ABC}$ be a density operator. Then the following inequalities hold

$$\begin{aligned} -\log \left[ \max _{W_{\rho _{C}},V_{\rho _{AC}}}F\left( \rho _{ABC} ,\mathcal {R}_{C\rightarrow AC}^{V,W}\left( \rho _{BC}\right) \right) \right]&\le I(A;B|C)_{\rho },\end{aligned}$$

(163)

$$\begin{aligned} \min _{W_{\rho _{C}},V_{\rho _{AC}}}D_{0}\left( \rho _{ABC}\left\| \mathcal {R}_{C\rightarrow AC}^{V,W}\left( \rho _{BC}\right) \right) \right.&\le I(A;B|C)_{\rho }, \end{aligned}$$

(164)

where $\mathcal {R}_{C\rightarrow AC}^{V,W}$ is the following swiveled Petz recovery map:

$$\begin{aligned} \mathcal {R}_{C\rightarrow AC}^{V,W}(\cdot )\equiv \left( \mathcal {V}_{\rho _{AC}}\circ \mathcal {P}_{C\rightarrow AC}\circ \mathcal {W}_{\rho _{C}}\right) (\cdot ), \end{aligned}$$

(165)

the Petz recovery map $\mathcal {P}_{C\rightarrow AC}$ is defined as

$$\begin{aligned} \mathcal {P}_{C\rightarrow AC}(\cdot )\equiv \mathcal {P}_{\rho _{AC} ,{\text {Tr}}_{A}}(\cdot )=\rho _{AC}^{1/2}\rho _{C}^{-1/2}(\cdot )\rho _{C}^{-1/2}\rho _{AC}^{1/2}, \end{aligned}$$

(166)

and the partial isometric maps $\mathcal {V}_{\rho _{AC}}$ and $\mathcal {W} _{\rho _{C}}$ are defined as in (132). If $\rho _{ABC}$ is a positive definite density operator, then the following inequalities hold

$$\begin{aligned} I(A;B|C)_{\rho }&\le \max _{W_{\rho _{C}},V_{\rho _{AC}}}D_{\max }\left( \rho _{ABC}\left\| \mathcal {R}_{C\rightarrow AC}^{V,W}\left( \rho _{BC}\right) \right) \right. ,\end{aligned}$$

(167)

$$\begin{aligned} I(A;B|C)_{\rho }&\le \max _{W_{\rho _{C}},V_{\rho _{AC}}}D_{2}\left( \rho _{ABC}\left\| \mathcal {R}_{C\rightarrow AC}^{V,W}\left( \rho _{BC}\right) \right) \right. . \end{aligned}$$

(168)

Note that remainder terms for strong subadditivity were put forward in [17, 36] before the recent developments in [45].

6 Swiveled Rényi quantum information measures

We now discuss how to extend the approach given here and in [6] in order to construct swiveled Rényi generalizations of any quantum information measure which consists of a linear combination of von Neumann entropies with coefficients chosen from the set $\left\{ -1,0,1\right\} $. We repeat some of the discussions from [6] in order to illustrate the method.

Let $\rho _{A_{1}\ldots A_{l}}$ be a density operator on l systems and set $\mathcal {A}\equiv \left\{ A_{1},\ldots ,A_{l}\right\} $. Suppose that we would like to establish a Rényi generalization of the following linear combination of entropies:

$$\begin{aligned} L\left( \rho _{A_{1}\ldots A_{l}}\right) \equiv \sum _{S\in \mathcal {P}_{\ge 1}\left( \mathcal {A}\right) }a_{S}H(S)_{\rho }, \end{aligned}$$

(169)

where $\mathcal {P}_{\ge 1}\left( \mathcal {A}\right) $ is the power set of $\mathcal {A}$ (excluding the empty set), such that the sum runs over all subsets of the systems $A_{1},\ldots ,A_{l}$. Furthermore, each coefficient $a_{S}\in \left\{ -1,0,1\right\} $ and corresponds to a subset S. In the case that $a_{\mathcal {A}}$ is nonzero, without loss of generality, we can set $a_{\mathcal {A}}=-1$ (otherwise, factor out $-1$ to make this the case). Then, we can rewrite the quantity in (169) in terms of the relative entropy as follows:

$$\begin{aligned} D\left( \rho _{A_{1}\ldots A_{l}}\left\| \exp \left\{ \sum _{S\in \mathcal {P}^{\prime }}a_{S}\log \rho _{S}\right\} \right) \right. , \end{aligned}$$

(170)

where $\mathcal {P}^{\prime }=\mathcal {P}_{\ge 1}\left( \mathcal {A}\right) \backslash \left\{ A_{1},\ldots ,A_{l}\right\} $. On the other hand, if $a_{\mathcal {A}}=0$, i.e., if all the marginal entropies in the sum are on a number of systems that is strictly smaller than the number of systems over which the state $\rho $ is defined (as is the case with $H({ AB})+H({ BC})+H({ AC})$, for example), we can take a purification of the original state and call this purification the state $\rho _{A_{1}\ldots A_{l}}$. This state is now a pure state on a number of systems strictly larger than the number of systems involved in all the marginal entropies. We then add the entropy $H(A_{1}\ldots A_{l})_{\rho }=0$ to the sum of entropies and apply the above recipe (so we resolve the issue with this example by purifying to a system R, setting the sum formula to be $H({ ABCR})+H({ AB})+H({ BC})+H({ AC})$, and proceeding with the above recipe). In the case that the resulting density operator $\rho _{A_{1}\ldots A_{l}}$ is not positive definite, we can mix it with the maximally mixed state $\pi _{A_{1}\ldots A_{l}}$ as follows:

$$\begin{aligned} \left( 1-\varepsilon \right) \rho _{A_{1}\ldots A_{l}}+\varepsilon \pi _{A_{1}\ldots A_{l}}, \end{aligned}$$

(171)

where $\varepsilon \in (0,1)$. The resulting density operator is then $\varepsilon $-distinguishable from the original one by any quantum measurement performed on the systems $A_{1}\ldots A_{l}$.

We then define the following swiveled Rényi entropies, which generalize $L\left( \rho _{A_{1}\ldots A_{l}}\right) $ from (169):

$$\begin{aligned} L_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right)&\equiv \frac{2}{\alpha -1}\max _{\left\{ V_{\rho _{S}}\right\} _{S}}\log \left\| \left[ \prod \limits _{S\in \mathcal {P}^{\prime }}\rho _{S}^{-a_{S}\left( \alpha -1\right) /2}V_{\rho _{S}}\right] \rho _{A_{1}\ldots A_{l}}^{\alpha /2}\right\| _{2}, \end{aligned}$$

(172)

$$\begin{aligned} \widetilde{L}_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right)&\equiv \frac{2}{\alpha ^{\prime }}\max _{\left\{ V_{\rho _{S}}\right\} _{S}} \log \left\| \left[ \prod \limits _{S\in \mathcal {P}^{\prime }}\rho _{S} ^{-a_{S}\alpha ^{\prime }/2}V_{\rho _{S}}\right] \rho _{A_{1}\ldots A_{l}} ^{1/2}\right\| _{2\alpha }, \end{aligned}$$

(173)

where $\alpha ^{\prime }=\left( \alpha -1\right) /\alpha $. The ordering of the marginal density operators in the product is in general arbitrary, but could be important for some applications (consider, e.g., that the choices in Definition 5 lead to the inequalities in Corollary 6, which have a physical interpretation in terms of recovery).

By the same methods as given throughout this paper, we can establish several properties of these quantities. It follows from the generalized Lie–Trotter product formula [37] and the method given in the proof of Proposition 1 that

$$\begin{aligned} \lim _{\alpha \nearrow 1}L_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l} }\right)&=\lim _{\alpha \nearrow 1}\widetilde{L}_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) \end{aligned}$$

(174)

$$\begin{aligned}&\le L\left( \rho _{A_{1}\ldots A_{l} }\right) \end{aligned}$$

(175)

$$\begin{aligned}&\le \lim _{\alpha \searrow 1}L_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) \end{aligned}$$

(176)

$$\begin{aligned}&=\lim _{\alpha \searrow 1}\widetilde{L}_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) . \end{aligned}$$

(177)

From the same method as given in the proof of Theorems 2 and 3, for $0\le \alpha \le \gamma \le 2$, $\alpha \ne 1$, and $\gamma \ne 1$, we can conclude that

$$\begin{aligned} L_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) \le L_{\gamma }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) , \end{aligned}$$

(178)

and for $1/2\le \alpha \le \gamma \le \infty $, $\alpha \ne 1$, and $\gamma \ne 1$,

$$\begin{aligned} \widetilde{L}_{\alpha }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) \le \widetilde{L}_{\gamma }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) . \end{aligned}$$

(179)

The inequalities above then lead to the following bounds for $L\left( \rho _{A_{1}\ldots A_{l}}\right) $:

$$\begin{aligned} L_{0}^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right)&\le L\left( \rho _{A_{1}\ldots A_{l}}\right) \le L_{2}^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) ,\end{aligned}$$

(180)

$$\begin{aligned} \widetilde{L}_{1/2}^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right)&\le L\left( \rho _{A_{1}\ldots A_{l}}\right) \le \widetilde{L}_{\infty }^{\prime }\left( \rho _{A_{1}\ldots A_{l}}\right) , \end{aligned}$$

(181)

which in some cases might have physical interpretations in terms of swiveled Petz recovery channels (see [45, Sections 5.6 and 5.7] for some examples).

7 Monotonicity of trace quantities

Ref. [47] posed an open question regarding whether the following quantity

$$\begin{aligned} \text {Tr}\left\{ \left[ \rho _{AC}^{\left( 1-\alpha \right) /2}\rho _{C}^{\left( \alpha -1\right) /2}\rho _{BC}^{1-\alpha }\rho _{C}^{\left( \alpha -1\right) /2}\rho _{AC}^{\left( 1-\alpha \right) /2}\right] ^{1/\left( 1-\alpha \right) }\right\} \end{aligned}$$

(182)

is monotone in $\alpha $ and never exceeds one. The recent work [12] addressed this open question, first by generalizing it and then proving that

$$\begin{aligned} \text {Tr}\left\{ \left[ \sigma ^{\left( 1-\alpha \right) /2}\mathcal {N} ^{\dag }\left( \mathcal {N}(\sigma )^{\left( \alpha -1\right) /2} \mathcal {N}(\rho )^{1-\alpha }\mathcal {N}(\sigma )^{\left( \alpha -1\right) /2}\right) \sigma ^{\left( 1-\alpha \right) /2}\right] ^{1/\left( 1-\alpha \right) }\right\} \le 1, \end{aligned}$$

(183)

for $\alpha \in (0,1)$ and $\rho $, $\sigma $, and $\mathcal {N}$ as given in Definition 1. The same work established that this bound holds for $\alpha \in (1,2)$ if $\rho $, $\sigma $, and $\mathcal {N}$ are as given in Definition 1. One recovers the quantity in (182) by picking $\rho =\rho _{ABC}$, $\sigma =\rho _{AC}\otimes I_{B}$, and $\mathcal {N} = {\text {Tr}}_{A}$ in (183). It is not known whether the quantity in (183) is monotone with respect to $\alpha $.

Here, we address this latter question by again taking our approach of allowing for a unitary swivel. Consider that we can rewrite the left-hand side of (183) as follows for $\alpha \in (0,1)$:

$$\begin{aligned} \left\| \left[ \mathcal {N}(\rho )^{\left( 1-\alpha \right) /2} \mathcal {N}(\sigma )^{\left( \alpha -1\right) /2}\otimes I_{E}\right] U\sigma ^{\left( 1-\alpha \right) /2}\right\| _{2/\left( 1-\alpha \right) }^{2/\left( 1-\alpha \right) }, \end{aligned}$$

(184)

where U is an isometric extension of the channel $\mathcal {N}$. So we instead consider the following quantity, which has an optimization over a unitary swivel:

$$\begin{aligned} \max _{V_{\mathcal {N}(\sigma )}}\left\| \left[ \mathcal {N}(\rho )^{\left( 1-\alpha \right) /2}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{\left( \alpha -1\right) /2}\otimes I_{E}\right] U\sigma ^{\left( 1-\alpha \right) /2}\right\| _{2/\left( 1-\alpha \right) }^{2/\left( 1-\alpha \right) }. \end{aligned}$$

(185)

To simplify the notation, consider that the above quantity for $\alpha \in [0,1)$ is the same as the following one for $p\in [2,\infty )$:

$$\begin{aligned} \max _{V_{\mathcal {N}(\sigma )}}\left\| \left[ \mathcal {N}(\rho )^{1/p}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/p}\otimes I_{E}\right] U\sigma ^{1/p}\right\| _{p}^{p}. \end{aligned}$$

(186)

We can now state our contribution to the open question:

Proposition 2

The quantity in (186) is monotone non-increasing on the interval $p\in [2,\infty )$ and has a maximum value of one at $p=2$ if ${\text {supp}}(\rho )\subseteq {\text {supp}}(\sigma )$.

Proof

This ends up being another application of the Hadamard three-line theorem, using techniques similar to what we have used previously. For $q\in [2,\infty )$, $q<p$, and $V_{\mathcal {N}(\sigma )}$ a fixed unitary commuting with $\mathcal {N}(\sigma )$, pick

$$\begin{aligned} G\left( z\right)&=\left[ \mathcal {N}(\rho )^{z/q}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-z/q}\otimes I_{E}\right] U\sigma ^{z/q},\end{aligned}$$

(187)

$$\begin{aligned} p_{0}&=\infty ,\end{aligned}$$

(188)

$$\begin{aligned} p_{1}&=q,\end{aligned}$$

(189)

$$\begin{aligned} \theta&=q/p, \end{aligned}$$

(190)

which implies that $p_{\theta }=p$. Applying Theorem 1 gives

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{p}\le \left[ \sup _{t\in \mathbb {R}}\left\| G\left( it\right) \right\| _{\infty }\right] ^{1-\theta }\left[ \sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{q}\right] ^{\theta }. \end{aligned}$$

(191)

So we evaluate these terms to find

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{p}&=\left\| \left[ \mathcal {N}(\rho )^{\theta /q}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-\theta /q}\otimes I_{E}\right] U\sigma ^{\theta /q}\right\| _{p}\end{aligned}$$

(192)

$$\begin{aligned}&=\left\| \left[ \mathcal {N}(\rho )^{1/p}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/p}\otimes I_{E}\right] U\sigma ^{1/p}\right\| _{p},\end{aligned}$$

(193)

$$\begin{aligned} \sup _{t\in \mathbb {R}}\left\| G\left( it\right) \right\| _{\infty }&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )^{it/q} V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-it/q}\otimes I_{E}\right] U\sigma ^{it/q}\right\| _{\infty }\end{aligned}$$

(194)

$$\begin{aligned}&\le 1,\end{aligned}$$

(195)

$$\begin{aligned} \sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{q}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )^{\left( 1+it\right) /q}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-\left( 1+it\right) /q}\otimes I_{E}\right] U\sigma ^{\left( 1+it\right) /q}\right\| _{q}\end{aligned}$$

(196)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left[ \mathcal {N}(\rho )^{1/q} V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-it/q}\mathcal {N}(\sigma )^{-1/q}\otimes I_{E}\right] U\sigma ^{1/q}\right\| _{q}\end{aligned}$$

(197)

$$\begin{aligned}&\le \max _{W_{\mathcal {N}(\sigma )}}\left\| \left[ \mathcal {N} (\rho )^{1/q}W_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/q}\otimes I_{E}\right] U\sigma ^{1/q}\right\| _{q}. \end{aligned}$$

(198)

Putting everything together, we find that for $2\le q<p$, the following inequality holds

$$\begin{aligned}&\max _{V_{\mathcal {N}(\sigma )}}\left\| \left[ \mathcal {N}(\rho )^{1/p}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/p}\otimes I_{E}\right] U\sigma ^{1/p}\right\| _{p}^{p} \nonumber \\&\le \max _{W_{\mathcal {N}(\sigma )}}\left\| \left[ \mathcal {N}(\rho )^{1/q}W_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/q}\otimes I_{E}\right] U\sigma ^{1/q}\right\| _{q}^{q}, \end{aligned}$$

(199)

since the inequality from (191) holds for all $V_{\mathcal {N}(\sigma )}$. This establishes the first statement in the proposition.

For the second statement, consider evaluating (186) at $p=2$ for any choice of $V_{\mathcal {N} (\sigma )}$:

$$\begin{aligned}&\left\| \left[ \mathcal {N}(\rho )^{1/2}V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/2}\otimes I_{E}\right] U\sigma ^{1/2}\right\| _{2}^{2}\nonumber \\&={\text {Tr}}\left\{ \sigma ^{1/2}U^{\dag }\left[ \mathcal {N} (\sigma )^{-1/2}V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N} (\sigma )}\mathcal {N}(\sigma )^{-1/2}\otimes I_{E}\right] U\sigma ^{1/2}\right\} \end{aligned}$$

(200)

$$\begin{aligned}&={\text {Tr}}\left\{ \sigma ^{1/2}\mathcal {N}^{\dag }\left[ \mathcal {N}(\sigma )^{-1/2}V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N} (\rho )V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/2}\right] \sigma ^{1/2}\right\} \end{aligned}$$

(201)

$$\begin{aligned}&={\text {Tr}}\left\{ \mathcal {N}(\sigma )\mathcal {N}(\sigma )^{-1/2}V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\mathcal {N}(\sigma )^{-1/2}\right\} \end{aligned}$$

(202)

$$\begin{aligned}&={\text {Tr}}\left\{ \varPi _{\mathcal {N}(\sigma )}V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right\} \end{aligned}$$

(203)

$$\begin{aligned}&={\text {Tr}}\left\{ \varPi _{\mathcal {N}(\sigma )}\mathcal {N} (\rho )\right\} \end{aligned}$$

(204)

$$\begin{aligned}&=1. \end{aligned}$$

(205)

Corollary 7

Let $\rho _{ABC}$ be a density operator. Then the following quantity is monotone non-increasing for $\alpha \in [0,1)$ and takes a maximum value of one at $\alpha =0$:

$$\begin{aligned} \max _{V_{\rho _{C}}}{\text {Tr}}\left\{ \left( \rho _{AC}^{\left( 1-\alpha \right) /2}V_{\rho _{C}}\rho _{C}^{\left( \alpha -1\right) /2} \rho _{BC}^{1-\alpha }\rho _{C}^{\left( \alpha -1\right) /2}V_{\rho _{C}}^{\dag }\rho _{AC}^{\left( 1-\alpha \right) /2}\right) ^{1/\left( 1-\alpha \right) }\right\} . \end{aligned}$$

(206)

If $\rho _{ABC}$ is a positive definite, then the same quantity is monotone non-decreasing for $\alpha \in (1,2]$ and takes a maximum value of one at $\alpha =2$.

Proof

The first statement follows by applying Proposition 2 for the choices $\rho =\rho _{ABC}$, $\sigma =\rho _{AC}\otimes I_{B}$, and $\mathcal {N}={\text {Tr}}_{A}$. The second statement follows because

$$\begin{aligned}&\left( \rho _{AC}^{\left( 1-\alpha \right) /2}V_{\rho _{C}}\rho _{C}^{\left( \alpha -1\right) /2}\rho _{BC}^{1-\alpha }\rho _{C}^{\left( \alpha -1\right) /2}V_{\rho _{C}}^{\dag }\rho _{AC}^{\left( 1-\alpha \right) /2}\right) ^{1/\left( 1-\alpha \right) } \nonumber \\&\quad =\left( \rho _{AC}^{\left( 1-\beta \right) /2}V_{\rho _{C}}\rho _{C}^{\left( \beta -1\right) /2}\rho _{BC}^{1-\beta }\rho _{C}^{\left( \beta -1\right) /2}V_{\rho _{C}}^{\dag }\rho _{AC}^{\left( 1-\beta \right) /2}\right) ^{1/\left( 1-\beta \right) }, \end{aligned}$$

(207)

for $\beta =2-\alpha \in (0,1)$ and then we apply the first statement.

8 Conclusion

The main contribution of this paper is a general method, the “ swiveled Rényi entropic” approach, for constructing $\alpha $-Rényi generalizations of a quantum information measure that are monotone non-decreasing in the parameter $\alpha $. The swiveled Rényi entropies are discontinuous at $\alpha =1$ and do not converge to von Neumann entropy-based quantities in the limit as $\alpha \rightarrow 1$. We suspect that the swiveled Rényi entropies might be helpful in understanding refinements of quantum information-processing tasks, but this remains unclear due to the lack of convergence at $\alpha =1$. At the very least, the technique recovers the recent refinements [45] of fundamental entropy inequalities such as monotonicity of quantum relative entropy [24, 41] and strong subadditivity [22, 23], in addition to providing new refinements for these entropy inequalities.

The most important open question going forward from here is to determine Rényi entropies which satisfy all of the desiderata that one would have for Rényi generalizations of quantum information measures. We find it curious that with the proposal in [6], one can prove convergence to a von Neumann entropy-based quantity in the limit as $\alpha \rightarrow 1$, but we are still unable to establish monotonicity in $\alpha $. However, with the swiveled Rényi entropies proposed here, the situation is reversed.

One might also consider developing chain rules for the swiveled Rényi entropies, along the lines established in [13], but it is unclear how useful this would be in practice, given that the quantities do not generally converge at $\alpha =1$.

Notes

A “ swivel” is a coupling placed between two objects in a chain in order to allow for them to “ swivel” about a given axis.
A map $G:S\rightarrow L(\mathcal {H})$ is holomorphic (continuous, bounded) if the corresponding functions to matrix entries are holomorphic (continuous, bounded).

References

Barnum, H., Knill, E.: Reversing quantum dynamics with near-optimal quantum and classical fidelity. J. Math. Phys. 43(5), 2097–2106 (2002). arXiv:quant-ph/0004088
Beigi, S.: Sandwiched Rényi divergence satisfies data processing inequality. J. Math. Phys. 54(12), 122202 (2013). arXiv:1306.5920
Bergh, J., Löfström, J.: Interpolation Spaces. Springer, Berlin (1976)
Book MATH Google Scholar
Berta, M., Lemm, M., Wilde, M.M.: Monotonicity of quantum relative entropy and recoverability. Quantum Inf. Comput. 15(15 & 16), 1333–1354 (2015). arXiv:1412.4067
Berta, M., Seshadreesan, K., Wilde, M.M.: Rényi generalizations of the conditional quantum mutual information. J. Math. Phys. 56(2), 022205 (2015). arXiv:1403.6102
Berta, M., Seshadreesan, K.P., Wilde, M.M.: Rényi generalizations of quantum information measures. Phys. Rev. A 91(2), 022333 (2015)
Article ADS MathSciNet MATH Google Scholar
Christandl, M., Winter, A.: “Squashed entanglement”: an additive entanglement measure. J. Math. Phys. 45(3), 829–840 (2004). arXiv:quant-ph/0308088
Coles, P.J., Berta, M., Tomamichel, M., Wehner, S.: Entropic uncertainty relations and their applications (2015). arXiv:1511.04857
Cooney, T., Mosonyi, M., Wilde, M.M.: Strong converse exponents for a quantum channel discrimination problem and quantum-feedback-assisted communication (2014). arXiv:1408.3373
Csiszár, I.: Generalized cutoff rates and Rényi’s information measures. IEEE Trans. Inf. Theory 41(1), 26–34 (1995)
Article MATH Google Scholar
Datta, N.: Min- and max-relative entropies and a new entanglement monotone. IEEE Trans. Inf. Theory 55(6), 2816–2826 (2009). arXiv:0803.2770
Datta, N., Wilde, M.M.: Quantum Markov chains, sufficiency of quantum channels, and Rényi information measures. J. Phys. A: Math. Theor. 48(50), 505301 (2015). arXiv:1501.05636
Dupuis, F.: Chain rules for quantum Rényi entropies. J. Math. Phys. 56(2), 022203 (2015)
Article ADS MathSciNet MATH Google Scholar
Dupuis, F., Fawzi, O., Wehner, S.: Entanglement sampling and applications. IEEE Trans. Inf. Theory 61(2), 1093–1112 (2015). arXiv:1305.1316
Dupuis, F., Kramer, L., Faist, P., Renes, J.M., Renner, R.: Proceedings of the XVIIth International Congress on Mathematical Physics, chapter Generalized Entropies, pp. 134–153. World Scientific (2012). arXiv:1211.3141
van Erven, T., Harremoes, P.: Rényi divergence and Kullback-Leibler divergence. IEEE Trans. Inf. Theory 60(7), 3797–3820 (2014). arXiv:1206.2459
Fawzi, O., Renner, R.: Quantum conditional mutual information and approximate Markov chains. Commun. Math. Phys. 340(2), 575–611 (2015). arXiv:1410.0664
Frank, R.L., Lieb, E.H.: Monotonicity of a relative Rényi entropy. J. Math. Phys. 54(12), 122201 (2013). arXiv:1306.5358
Hirschman, I.I.: A convexity theorem for certain groups of transformations. J. Anal. Math. 2(2), 209–218 (1952)
Article MathSciNet MATH Google Scholar
Junge, M., Renner, R., Sutter, D., Wilde, M.M., Winter, A.: Universal recovery from a decrease of quantum relative entropy (2015). arXiv:1509.07127
Li, K., Winter, A.: Squashed entanglement, $k$-extendibility, quantum Markov chains, and recovery maps (2014). arXiv:1410.4184
Lieb, E.H., Ruskai, M.B.: A fundamental property of quantum-mechanical entropy. Phys. Rev. Lett. 30(10), 434–436 (1973)
Article ADS MathSciNet Google Scholar
Lieb, E.H., Ruskai, M.B.: Proof of the strong subadditivity of quantum-mechanical entropy. J. Math. Phys. 14(12), 1938–1941 (1973)
Article ADS MathSciNet Google Scholar
Lindblad, G.: Completely positive maps and entropy inequalities. Commun. Math. Phys. 40(2), 147–151 (1975)
Article ADS MathSciNet MATH Google Scholar
Linden, N., Mosonyi, M., Winter, A.: The structure of Rényi entropic inequalities. Proc. R. Soc. A: Math., Phys. Eng. Sci. 469(2158), 20120,737 (2013). arXiv:1212.0248
Müller-Lennert, M., Dupuis, F., Szehr, O., Fehr, S., Tomamichel, M.: On quantum Rényi entropies: a new definition and some properties. J. Math. Phys. 54(12), 122203 (2013). arXiv:1306.3142
Nielsen, M.A., Chuang, I.L.: Quantum Computation and Quantum Information, 10th Anniversary Edition. Cambridge University Press, Cambridge (2010)
Book MATH Google Scholar
Petz, D.: Quasi-entropies for finite quantum systems. Rep. Math. Phys. 23(1), 57–65 (1986)
Article ADS MathSciNet MATH Google Scholar
Petz, D.: Sufficient subalgebras and the relative entropy of states of a von Neumann algebra. Commun. Math. Phys. 105(1), 123–131 (1986)
Article ADS MathSciNet MATH Google Scholar
Petz, D.: Sufficiency of channels over von Neumann algebras. Q. J. Math. 39(1), 97–108 (1988)
Article MathSciNet MATH Google Scholar
Reed, M., Simon, B.: Methods of Modern Mathematical Physics II: Fourier Analysis, Self-Adjointness. Academic Press, New York (1975)
MATH Google Scholar
Renner, R.: Security of quantum key distribution. Ph.D. thesis, ETH Zürich (2005). arXiv:quant-ph/0512258
Renner, R., Wolf, S.: Smooth Rényi entropy and applications. In: Proceedings of the 2007 International Symposium on Information Theory, p. 232, (2004). http://www.ti.inf.ethz.ch/sw/publications/smooth.ps
Rényi, A.: On measures of entropy and information. Proceedings of the 4th Berkeley Symposium on Mathematics, Statistics and Probability 1, pp. 547–561 (1961). Held at the Statistical Laboratory, University of California, 1960, J. Neyman (ed.) (University of California Press, Berkeley)
Seshadreesan, K.P., Berta, M., Wilde, M.M.: Rényi squashed entanglement, discord, and relative entropy differences. J. Phys. A: Math. Theor. 48(39), 395303 (2015). arXiv:1410.1443
Sutter, D., Fawzi, O., Renner, R.: Universal recovery map for approximate Markov chains (2015). arXiv:1504.07251
Suzuki, M.: Transfer-matrix method and Monte Carlo simulation in quantum spin systems. Phys. Rev. B 31(5), 2957 (1985)
Article ADS Google Scholar
Tomamichel, M.: A framework for non-asymptotic quantum information theory. Ph.D. thesis, ETH Zurich (2012). arXiv:1203.2142
Tomamichel, M.: Quantum Information Processing with Finite Resources—Mathematical Foundations. SpringerBriefs in Mathematical Physics. Springer, (2015). arXiv:1504.00233
Uhlmann, A.: The “transition probability” in the state space of a *-algebra. Rep. Math. Phys. 9(2), 273–279 (1976)
Article ADS MathSciNet MATH Google Scholar
Uhlmann, A.: Relative entropy and the Wigner-Yanase-Dyson-Lieb concavity in an interpolation theory. Commun. Math. Phys. 54(1), 21–32 (1977). http://projecteuclid.org/euclid.cmp/1103900757
Umegaki, H.: Conditional expectations in an operator algebra IV (entropy and information). Kodai Math. Semin. Rep. 14(2), 59–85 (1962)
Article MathSciNet MATH Google Scholar
Wilde, M.M.: Quantum Inf. Theory. Cambridge University Press, Cambridge (2013). arXiv:1106.1445
Wilde, M.M.: Multipartite quantum correlations and local recoverability. Proc. R. Soc. A 471, 20140,941 (2014). arXiv:1412.0333
Wilde, M.M.: Recoverability in quantum information theory. Proc. R. Soc. A 471(2182), 20150,338 (2015). arXiv:1505.04661
Wilde, M.M., Winter, A., Yang, D.: Strong converse for the classical capacity of entanglement-breaking and Hadamard channels via a sandwiched Rényi relative entropy. Commun. Math. Phys. 331(2), 593–622 (2014). arXiv:1306.1586
Zhang, L.: A stronger monotonicity inequality of quantum relative entropy: a unifying approach via Rényi relative entropy (2014). arXiv:1403.5343v1

Download references

Acknowledgments

We are grateful to Salman Beigi for insightful discussions about the topic of this paper. FD acknowledges the support of the Czech Science Foundation GA CR Project P202/12/1142 and the support of the EU FP7 under Grant Agreement No 323970 (RAQUEL). MMW is grateful to Stephanie Wehner and her group for hospitality during a research visit to TU Delft (May 2015), to Renato Renner and his group for the same during a visit to ETH Zurich (June 2015), and acknowledges support from startup funds from the Department of Physics and Astronomy at LSU, the NSF under Award No. CCF-1350397, and the DARPA Quiness Program through US Army Research Office Award W31P4Q-12-1-0019.

Author information

Authors and Affiliations

Faculty of Informatics, Masaryk University, Brno, Czech Republic
Frédéric Dupuis
Department of Physics and Astronomy, Center for Computation and Technology, Hearne Institute for Theoretical Physics, Louisiana State University, Baton Rouge, LA, 70803, USA
Mark M. Wilde

Authors

Frédéric Dupuis
View author publications
You can also search for this author in PubMed Google Scholar
Mark M. Wilde
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Mark M. Wilde.

Appendices

Appendix 1: Limit as $\alpha \rightarrow 1$

Definition 6

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1. For $\alpha \in \left( 0,1\right) \cup \left( 1,\infty \right) $, let

$$\begin{aligned} \Delta _{\alpha }(\rho ,\sigma ,\mathcal {N})=\frac{1}{\alpha -1}\log Q_{\alpha }(\rho ,\sigma ,\mathcal {N}), \end{aligned}$$

(208)

where

$$\begin{aligned} Q_{\alpha }(\rho ,\sigma ,\mathcal {N})\equiv \left\| \left( \mathcal {N} (\rho )^{\left( 1-\alpha \right) /2}\mathcal {N}(\sigma )^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}\rho ^{\alpha /2}\right\| _{2}^{2}. \end{aligned}$$

(209)

Theorem 5

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1 and such that ${\text {supp}} (\rho )\subseteq {\text {supp}}(\sigma )$. The following limit holds

$$\begin{aligned} \lim _{\alpha \rightarrow 1}\Delta _{\alpha }(\rho ,\sigma ,\mathcal {N})=D(\rho \Vert \sigma )-D\left( \mathcal {N}(\rho )\Vert \mathcal {N}(\sigma )\right) . \end{aligned}$$

(210)

Proof

Let $\varPi _{\omega }$ denote the projection onto the support of $\omega $. From the condition ${\text {supp}}(\rho )\subseteq {\text {supp}}(\sigma )$, it follows that ${\text {supp}}\left( \mathcal {N}(\rho )\right) \subseteq {\text {supp}}\left( \mathcal {N}(\sigma )\right) $ [32, Appendix B.4]. We can then conclude that

$$\begin{aligned} \varPi _{\sigma }\varPi _{\rho }=\varPi _{\rho },\qquad \varPi _{\mathcal {N}(\rho )} \varPi _{\mathcal {N}(\sigma )}=\varPi _{\mathcal {N}(\rho )}. \end{aligned}$$

(211)

We also know that ${\text {supp}}\left( U\rho U^{\dag }\right) \subseteq {\text {supp}}\left( \mathcal {N}(\rho )\otimes I_{E}\right) $ [32, Appendix B.4], so that

$$\begin{aligned} \left( \varPi _{\mathcal {N}(\rho )}\otimes I_{E}\right) \varPi _{U\rho U^{\dag }} =\varPi _{U\rho U^{\dag }}. \end{aligned}$$

(212)

When $\alpha =1$, we find from the above facts that

$$\begin{aligned} Q_{1}(\rho ,\sigma ,\mathcal {N})&=\left\| \left( \varPi _{\mathcal {N}(\rho )}\varPi _{\mathcal {N}(\sigma )}\otimes I_{E}\right) U\varPi _{\sigma }\rho ^{1/2}\right\| _{2}^{2} \end{aligned}$$

(213)

$$\begin{aligned}&=\left\| \left( \varPi _{\mathcal {N}(\rho )}\otimes I_{E}\right) U\varPi _{\rho }\rho ^{1/2}\right\| _{2}^{2}\end{aligned}$$

(214)

$$\begin{aligned}&=\left\| \left( \varPi _{\mathcal {N}(\rho )}\otimes I_{E}\right) \varPi _{U\rho U^{\dag }}U\rho ^{1/2}\right\| _{2}^{2}\end{aligned}$$

(215)

$$\begin{aligned}&=\left\| \varPi _{U\rho U^{\dag }}U\rho ^{1/2}\right\| _{2}^{2}\end{aligned}$$

(216)

$$\begin{aligned}&=\left\| \rho ^{1/2}\right\| _{2}^{2}\end{aligned}$$

(217)

$$\begin{aligned}&=1. \end{aligned}$$

(218)

So from the definition of the derivative, this means that

$$\begin{aligned} \lim _{\alpha \rightarrow 1}\Delta _{\alpha }(\rho ,\sigma ,\mathcal {N})&=\lim _{\alpha \rightarrow 1}\frac{\log Q_{\alpha }(\rho ,\sigma ,\mathcal {N})-\log Q_{1}(\rho ,\sigma ,\mathcal {N})}{\alpha -1}\end{aligned}$$

(219)

$$\begin{aligned}&=\left. \frac{\hbox {d}}{\hbox {d}\alpha }\left[ \log Q_{\alpha }(\rho ,\sigma ,\mathcal {N})\right] \right| _{\alpha =1}\end{aligned}$$

(220)

$$\begin{aligned}&=\frac{1}{Q_{1}(\rho ,\sigma ,\mathcal {N})}\left. \frac{\hbox {d}}{\hbox {d}\alpha }\left[ Q_{\alpha }(\rho ,\sigma ,\mathcal {N})\right] \right| _{\alpha =1}\end{aligned}$$

(221)

$$\begin{aligned}&=\left. \frac{\hbox {d}}{\hbox {d}\alpha }\left[ Q_{\alpha }(\rho ,\sigma ,\mathcal {N} )\right] \right| _{\alpha =1}. \end{aligned}$$

(222)

Let $\alpha ^{\prime }\equiv \alpha -1$. Consider that

$$\begin{aligned} Q_{\alpha }(\rho ,\sigma ,\mathcal {N})={\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{-\alpha ^{\prime }/2}\mathcal {N}^{\dag }\left( \mathcal {N} (\sigma )^{\alpha ^{\prime }/2}\mathcal {N}(\rho )^{-\alpha ^{\prime }} \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime } /2}\right\} . \end{aligned}$$

(223)

Now we calculate $\frac{\hbox {d}}{\hbox {d}\alpha }Q_{\alpha }(\rho ,\sigma ,\mathcal {N})$:

$$\begin{aligned}&\frac{\hbox {d}}{\hbox {d}\alpha }{\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{-\alpha ^{\prime }/2}\mathcal {N}^{\dag }\left( \mathcal {N}(\sigma )^{\alpha ^{\prime } /2}\mathcal {N}(\rho )^{-\alpha ^{\prime }}\mathcal {N}(\sigma )^{\alpha ^{\prime } /2}\right) \sigma ^{-\alpha ^{\prime }/2}\right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \left[ \frac{\hbox {d}}{\hbox {d}\alpha }\rho ^{\alpha }\right] \sigma ^{-\alpha ^{\prime }/2}\mathcal {N}^{\dag }\left( \mathcal {N} (\sigma )^{\alpha ^{\prime }/2}\mathcal {N}(\rho )^{-\alpha ^{\prime }} \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime } /2}\right\} \nonumber \\&\quad \quad +\,{\text {Tr}}\left\{ \rho ^{\alpha }\left[ \frac{\hbox {d}}{\hbox {d}\alpha } \sigma ^{-\alpha ^{\prime }/2}\right] \mathcal {N}^{\dag }\left( \mathcal {N} (\sigma )^{\alpha ^{\prime }/2}\mathcal {N}(\rho )^{-\alpha ^{\prime }} \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime } /2}\right\} \nonumber \\&\quad \quad +\,{\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{-\alpha ^{\prime }/2} \mathcal {N}^{\dag }\left( \left[ \frac{\hbox {d}}{\hbox {d}\alpha }\mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right] \mathcal {N}(\rho )^{-\alpha ^{\prime }} \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime } /2}\right\} \nonumber \\&\quad \quad +\,{\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{-\alpha ^{\prime }/2} \mathcal {N}^{\dag }\left( \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\left[ \frac{\hbox {d}}{\hbox {d}\alpha }\mathcal {N}(\rho )^{-\alpha ^{\prime }}\right] \mathcal {N} (\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime }/2}\right\} \nonumber \\&\quad \quad +\,{\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{-\alpha ^{\prime }/2} \mathcal {N}^{\dag }\left( \mathcal {N}(\sigma )^{\alpha ^{\prime }/2} \mathcal {N}(\rho )^{-\alpha ^{\prime }}\left[ \frac{\hbox {d}}{\hbox {d}\alpha }\mathcal {N} (\sigma )^{\alpha ^{\prime }/2}\right] \right) \sigma ^{-\alpha ^{\prime } /2}\right\} \nonumber \\&\quad \quad +\,{\text {Tr}}\left\{ \rho ^{\alpha }\sigma ^{-\alpha ^{\prime }/2} \mathcal {N}^{\dag }\left( \mathcal {N}(\sigma )^{\alpha ^{\prime }/2} \mathcal {N}(\rho )^{-\alpha ^{\prime }}\mathcal {N}(\sigma )^{\alpha ^{\prime } /2}\right) \left[ \frac{\hbox {d}}{\hbox {d}\alpha }\sigma ^{-\alpha ^{\prime }/2}\right] \right\} \end{aligned}$$

(224)

$$\begin{aligned}&\quad =\Bigg [{\text {Tr}}\left\{ \rho ^{\alpha }\left[ \log \rho \right] \sigma ^{-\alpha ^{\prime }/2}\mathcal {N}^{\dag }\left( \mathcal {N} (\sigma )^{\alpha ^{\prime }/2}\mathcal {N}(\rho )^{-\alpha ^{\prime }} \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime } /2}\right\} \nonumber \\&\quad \quad -\,\frac{1}{2}{\text {Tr}}\left\{ \rho \left[ \log \sigma \right] \sigma ^{-\alpha ^{\prime }/2}\mathcal {N}^{\dag }\left( \mathcal {N} (\sigma )^{\alpha ^{\prime }/2}\mathcal {N}(\rho )^{-\alpha ^{\prime }} \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime } /2}\right\} \nonumber \\&\quad \quad +\,\frac{1}{2}{\text {Tr}}\left\{ \rho \sigma ^{-\alpha ^{\prime } /2}\mathcal {N}^{\dag }\left( \left[ \log \mathcal {N}(\sigma )\right] \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\mathcal {N}(\rho )^{-\alpha ^{\prime } }\mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime } /2}\right\} \nonumber \\&\quad \quad -\,{\text {Tr}}\left\{ \rho \sigma ^{-\alpha ^{\prime }/2}\mathcal {N}^{\dag }\left( \mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\left[ \log \mathcal {N} (\rho )\right] \mathcal {N}(\rho )^{-\alpha ^{\prime }}\mathcal {N}(\sigma )^{\alpha ^{\prime }/2}\right) \sigma ^{-\alpha ^{\prime }/2}\right\} \nonumber \\&\quad \quad +\,\frac{1}{2}{\text {Tr}}\left\{ \rho \sigma ^{-\alpha ^{\prime } /2}\mathcal {N}^{\dag }\left( \mathcal {N}(\sigma )^{\alpha ^{\prime } /2}\mathcal {N}(\rho )^{-\alpha ^{\prime }}\mathcal {N}(\sigma )^{\alpha ^{\prime } /2}\left[ \log \mathcal {N}(\sigma )\right] \right) \sigma ^{-\alpha ^{\prime }/2}\right\} \nonumber \\&\quad \quad -\,\frac{1}{2}{\text {Tr}}\left\{ \rho \sigma ^{-\alpha ^{\prime } /2}\mathcal {N}^{\dag }\left( \mathcal {N}(\sigma )^{\alpha ^{\prime } /2}\mathcal {N}(\rho )^{-\alpha ^{\prime }}\mathcal {N}(\sigma )^{\alpha ^{\prime } /2}\right) \sigma ^{-\alpha ^{\prime }/2}\left[ \log \sigma \right] \right\} \Bigg ].\nonumber \\ \end{aligned}$$

(225)

Taking the limit as $\alpha \rightarrow 1$ gives

$$\begin{aligned} \left. \frac{\hbox {d}}{\hbox {d}\alpha }Q_{\alpha }(\rho ,\sigma ,\mathcal {N})\right| _{\alpha =1}= & {} {\text {Tr}}\left\{ \rho \left[ \log \rho \right] \varPi _{\sigma }\mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\varPi _{\mathcal {N} (\rho )}\varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} \nonumber \\&-\,\frac{1}{2}{\text {Tr}}\left\{ \rho \left[ \log \sigma \right] \varPi _{\sigma }\mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\varPi _{\mathcal {N}(\rho )}\varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} \nonumber \\&+\,\frac{1}{2}{\text {Tr}}\left\{ \rho \varPi _{\sigma }\mathcal {N}^{\dag }\left( \left[ \log \mathcal {N}(\sigma )\right] \varPi _{\mathcal {N}(\sigma )} \varPi _{\mathcal {N}(\rho )}\varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} \nonumber \\&-\,{\text {Tr}}\left\{ \rho \varPi _{\sigma }\mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\left[ \log \mathcal {N}(\rho )\right] \varPi _{\mathcal {N}(\rho )}\varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} \nonumber \\&+\,\frac{1}{2}{\text {Tr}}\left\{ \rho \varPi _{\sigma }\mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\varPi _{\mathcal {N}(\rho )}\varPi _{\mathcal {N} (\sigma )}\left[ \log \mathcal {N}(\sigma )\right] \right) \varPi _{\sigma }\right\} \nonumber \\&-\,\frac{1}{2}{\text {Tr}}\left\{ \rho \varPi _{\sigma }\mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\varPi _{\mathcal {N}(\rho )}\varPi _{\mathcal {N} (\sigma )}\right) \left[ \log \sigma \right] \varPi _{\sigma }\right\} .\nonumber \\ \end{aligned}$$

(226)

We now simplify the first four terms and note that the last two are Hermitian conjugates of the second and third:

$$\begin{aligned}&{\text {Tr}}\left\{ \rho \left[ \log \rho \right] \varPi _{\sigma } \mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\varPi _{\mathcal {N}(\rho )} \varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} ={\text {Tr}} \left\{ \rho \left[ \log \rho \right] \mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\rho )}\right) \right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \mathcal {N}\left( \rho \left[ \log \rho \right] \right) \varPi _{\mathcal {N}(\rho )}\right\} ={\text {Tr}}\left\{ U\rho \left[ \log \rho \right] U^{\dag }\left( \varPi _{\mathcal {N}(\rho )}\otimes I_{E}\right) \right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \varPi _{U\rho U^{\dag }}U\rho \left[ \log \rho \right] U^{\dag }\left( \varPi _{\mathcal {N}(\rho )}\otimes I_{E}\right) \right\} ={\text {Tr}}\left\{ \varPi _{U\rho U^{\dag }}U\rho \left[ \log \rho \right] U^{\dag }\right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \rho \left[ \log \rho \right] \right\} , \end{aligned}$$

(227)

$$\begin{aligned}&{\text {Tr}}\left\{ \rho \left[ \log \sigma \right] \varPi _{\sigma }\mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\varPi _{\mathcal {N}(\rho )} \varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} ={\text {Tr}} \left\{ \rho \left[ \log \sigma \right] \mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\rho )}\right) \right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \mathcal {N}\left( \rho \left[ \log \sigma \right] \right) \left( \varPi _{\mathcal {N}(\rho )}\right) \right\} ={\text {Tr}} \left\{ U\rho \left[ \log \sigma \right] U^{\dag }\left( \varPi _{\mathcal {N} (\rho )}\otimes I_{E}\right) \right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \varPi _{U\rho U^{\dag }}U\rho U^{\dag }U\left[ \log \sigma \right] U^{\dag }\left( \varPi _{\mathcal {N}(\rho )}\otimes I_{E}\right) \right\} ={\text {Tr}}\left\{ U\rho U^{\dag }U\left[ \log \sigma \right] U^{\dag }\right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \rho \left[ \log \sigma \right] \right\} , \end{aligned}$$

(228)

$$\begin{aligned}&{Tr}\left\{ \rho \varPi _{\sigma }\mathcal {N}^{\dag }\left( \left[ \log \mathcal {N}(\sigma )\right] \varPi _{\mathcal {N}(\sigma )}\varPi _{\mathcal {N} (\rho )}\varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} \nonumber \\&\qquad ={\text {Tr}}\left\{ \rho \mathcal {N}^{\dag }\left( \left[ \log \mathcal {N}(\sigma )\right] \varPi _{\mathcal {N}(\rho )}\right) \right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \mathcal {N}(\rho )\left[ \log \mathcal {N} (\sigma )\right] \varPi _{\mathcal {N}(\rho )}\right\} ={\text {Tr}}\left\{ \mathcal {N}(\rho )\left[ \log \mathcal {N}(\sigma )\right] \right\} , \end{aligned}$$

(229)

$$\begin{aligned}&{Tr}\left\{ \rho \varPi _{\sigma }\mathcal {N}^{\dag }\left( \varPi _{\mathcal {N}(\sigma )}\left[ \log \mathcal {N}(\rho )\right] \varPi _{\mathcal {N}(\rho )}\varPi _{\mathcal {N}(\sigma )}\right) \varPi _{\sigma }\right\} \nonumber \\&\qquad ={\text {Tr}}\left\{ \rho \mathcal {N}^{\dag }\left( \left[ \log \mathcal {N}(\rho )\right] \varPi _{\mathcal {N}(\rho )}\right) \right\} \nonumber \\&\quad ={\text {Tr}}\left\{ \mathcal {N}(\rho )\left( \left[ \log \mathcal {N}(\rho )\right] \varPi _{\mathcal {N}(\rho )}\right) \right\} ={\text {Tr}}\left\{ \mathcal {N}(\rho )\left[ \log \mathcal {N} (\rho )\right] \right\} . \end{aligned}$$

(230)

This then implies that the following equality holds

$$\begin{aligned}&\left. \frac{\hbox {d}}{\hbox {d}\alpha }Q_{\alpha }(\rho ,\sigma ,\mathcal {N})\right| _{\alpha =1}={\text {Tr}}\left\{ \rho \left[ \log \rho \right] \right\} -{\text {Tr}}\left\{ \rho \left[ \log \sigma \right] \right\} \nonumber \\&\quad +{\text {Tr}}\left\{ \mathcal {N}(\rho )\left[ \log \mathcal {N} (\sigma )\right] \right\} -{\text {Tr}}\left\{ \mathcal {N}(\rho )\left[ \log \mathcal {N}(\rho )\right] \right\} . \end{aligned}$$

(231)

Putting together (222) and (231), we can then conclude the statement of the theorem.

Appendix 2: Auxiliary lemmas and proofs

Lemma 1

Let $\mathcal {A}$ and $\mathcal {T}$ be compact metric spaces, and let $f:\mathcal {A}\times \mathcal {T}\rightarrow \mathbb {R}$ be a continuous function. Then, $g,h:\mathcal {A}\rightarrow \mathbb {R}$, defined as $g(\alpha )=\max _{t\in \mathcal {T}}f(\alpha ,t)$ and $h(\alpha )=\min _{t\in \mathcal {T}}f(\alpha ,t)$ are continuous.

Proof

By the Heine–Cantor theorem, f is uniformly continuous. Hence, for every $\varepsilon >0$, there exists a $\delta >0$ such that $\left| f(\alpha ,t)-f(\alpha ^{\prime },t^{\prime })\right| <\varepsilon $ whenever $D_{\mathcal {A}}(\alpha ,\alpha ^{\prime })<\delta $ and $D_{\mathcal {T} }(t,t^{\prime })<\delta $, where $D_{\mathcal {A}}$ and $D_{\mathcal {T}}$ are the distance functions on $\mathcal {A}$ and $\mathcal {T}$ respectively. Now, given $\alpha \in \mathcal {A}$, let t be such that $g(\alpha )=f(\alpha ,t)$. Then, for any $\alpha ^{\prime }\in \mathcal {A}$ with $D_{\mathcal {A}}(\alpha ,\alpha ^{\prime })<\delta $ we have that

$$\begin{aligned} g(\alpha )=f(\alpha ,t)<f(\alpha ^{\prime },t)+\varepsilon \leqslant \max _{t^{\prime }\in \mathcal {T}}f(\alpha ^{\prime },t^{\prime })+\varepsilon =g(\alpha ^{\prime })+\varepsilon . \end{aligned}$$

By symmetry, we then have that $\left| g(\alpha )-g(\alpha ^{\prime })\right| <\varepsilon $, which proves the continuity of g. A similar argument establishes the continuity of h. $\square $

Proof of Theorem 4

Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 4. Let

$$\begin{aligned} G\left( z\right) =\left( \mathcal {N}(\rho )^{-z/2}\mathcal {N}(\sigma )^{z/2}\otimes I_{E}\right) U\sigma ^{-z/2}\rho ^{\left( 1+z\right) /2}. \end{aligned}$$

(232)

In the equation

$$\begin{aligned} \frac{1}{p_{\theta }}=\frac{\theta }{p_{0}}+\frac{1-\theta }{p_{1}}, \end{aligned}$$

(233)

choose $p_{0}=2$ and $p_{1}=2$, so that $p_{\theta }=2$. Recalling that

$$\begin{aligned} M_{k}=\sup _{t\in \mathbb {R}}\left\| G\left( k+it\right) \right\| _{p_{k}}, \end{aligned}$$

(234)

for $k=0,1$, we find that

$$\begin{aligned} \left\| G\left( \theta \right) \right\| _{p_{\theta }}\le M_{0}^{1-\theta }M_{1}^{\theta }. \end{aligned}$$

(235)

For our choices, we find that

$$\begin{aligned} M_{0}&=\sup _{t\in \mathbb {R}}\left\| G\left( it\right) \right\| _{2} \end{aligned}$$

(236)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left( \mathcal {N}(\rho )^{-it/2} \mathcal {N}(\sigma )^{it/2}\otimes I_{E}\right) U\sigma ^{-it/2}\rho ^{\left( 1+it\right) /2}\right\| _{2}\end{aligned}$$

(237)

$$\begin{aligned}&=\left\| \rho ^{1/2}\right\| _{2}=1, \end{aligned}$$

(238)

$$\begin{aligned} M_{1}&=\sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{2}\end{aligned}$$

(239)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left( \mathcal {N}(\rho )^{-\left( 1+it\right) /2}\mathcal {N}(\sigma )^{\left( 1+it\right) /2}\otimes I_{E}\right) U\sigma ^{-\left( 1+it\right) /2}\rho ^{\left( 1+\left( 1+it\right) \right) /2}\right\| _{2}\end{aligned}$$

(240)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left( \mathcal {N}(\rho )^{-1/2} \mathcal {N}(\sigma )^{it/2}\mathcal {N}(\sigma )^{1/2}\otimes I_{E}\right) U\sigma ^{-1/2}\sigma ^{-it/2}\rho \right\| _{2}\end{aligned}$$

(241)

$$\begin{aligned}&=\left[ \exp \sup _{t\in \mathbb {R}}D_{2}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N}(\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) \right] ^{1/2}. \end{aligned}$$

(242)

Applying the three-line theorem gives

$$\begin{aligned}&\left\| \left( \mathcal {N}(\rho )^{-\theta /2}\mathcal {N}(\sigma )^{\theta /2}\otimes I_{E}\right) U\sigma ^{-\theta /2}\rho ^{\left( 1+\theta \right) /2}\right\| _{2}\nonumber \\&\quad \le \left[ \exp \sup _{t\in \mathbb {R}}D_{2}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N}(\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) \right] ^{\theta /2}, \end{aligned}$$

(243)

and after a logarithm gives

$$\begin{aligned}&\frac{2}{\theta }\log \left\| \left( \mathcal {N}(\rho )^{-\theta /2}\mathcal {N}(\sigma )^{\theta /2}\otimes I_{E}\right) U\sigma ^{-\theta /2} \rho ^{\left( 1+\theta \right) /2}\right\| _{2}\nonumber \\&\quad \le \sup _{t\in \mathbb {R} }D_{2}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P} _{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N}(\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) . \end{aligned}$$

(244)

Take the limit as $\theta \searrow 0$ to get

$$\begin{aligned} D(\rho \Vert \sigma )-D\left( \mathcal {N(}\rho )\Vert \mathcal {N(}\sigma )\right) \le \sup _{t\in \mathbb {R}}D_{2}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N} (\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) . \end{aligned}$$

(245)

Now we prove the other inequality. Let $\rho $, $\sigma $, and $\mathcal {N}$ be as given in Definition 1 and such that ${\text {supp}} (\rho )\subseteq {\text {supp}}(\sigma )$. Take

$$\begin{aligned} G\left( z\right) =\left( \mathcal {N}(\rho )^{z/2}\mathcal {N}(\sigma )^{-z/2}\otimes I_{E}\right) U\sigma ^{z/2}\rho ^{\left( 1-z\right) /2}. \end{aligned}$$

(246)

Then $M_{0}=1$ again and

$$\begin{aligned} M_{1}&=\sup _{t\in \mathbb {R}}\left\| G\left( 1+it\right) \right\| _{2} \end{aligned}$$

(247)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left( \mathcal {N}(\rho )^{\left( 1+it\right) /2}\mathcal {N}(\sigma )^{-\left( 1+it\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1+it\right) /2}\rho ^{\left( 1-\left( 1+it\right) \right) /2}\right\| _{2}\end{aligned}$$

(248)

$$\begin{aligned}&=\sup _{t\in \mathbb {R}}\left\| \left( \mathcal {N}(\rho )^{1/2} \mathcal {N}(\sigma )^{-it/2}\mathcal {N}(\sigma )^{-1/2}\otimes I_{E}\right) U\sigma ^{1/2}\sigma ^{it/2}\rho ^{0}\right\| _{2}\end{aligned}$$

(249)

$$\begin{aligned}&=\exp \left\{ -\inf _{t\in \mathbb {R}}D_{0}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N}(\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) \right\} ^{1/2}. \end{aligned}$$

(250)

Applying the three-line theorem gives

$$\begin{aligned}&\left\| \left( \mathcal {N}(\rho )^{\theta /2}\mathcal {N}(\sigma )^{-\theta /2}\otimes I_{E}\right) U\sigma ^{\theta /2}\rho ^{\left( 1-\theta \right) /2}\right\| _{2}\nonumber \\&\quad \le \left[ \exp \left\{ -\inf _{t\in \mathbb {R}}D_{0}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N}(\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) \right\} \right] ^{\theta /2}, \end{aligned}$$

(251)

which after taking a logarithm gives

$$\begin{aligned}&\frac{2}{-\theta }\log \left\| \left( \mathcal {N}(\rho )^{\theta /2}\mathcal {N}(\sigma )^{-\theta /2}\otimes I_{E}\right) U\sigma ^{\theta /2} \rho ^{\left( 1-\theta \right) /2}\right\| _{2}\nonumber \\&\quad \ge \inf _{t\in \mathbb {R} }D_{0}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P} _{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N}(\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) . \end{aligned}$$

(252)

Take the limit as $\theta \searrow 0$ to get

$$\begin{aligned} D(\rho \Vert \sigma )-D\left( \mathcal {N(}\rho )\Vert \mathcal {N(}\sigma )\right) \ge \inf _{t\in \mathbb {R}}D_{0}\left( \rho \Vert \left( \mathcal {U}_{\sigma ,-t}\circ \mathcal {P}_{\sigma ,\mathcal {N}}\circ \mathcal {U}_{\mathcal {N} (\sigma ),t}\right) \left( \mathcal {N}(\rho )\right) \right) . \end{aligned}$$

(253)

$\square $

Appendix 3: Taylor expansions

Here we show the following limit:

$$\begin{aligned} \lim _{\alpha \rightarrow 1}f\left( \alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma }\right) =f\left( 1,V_{\mathcal {N}(\sigma )},V_{\sigma }\right) , \end{aligned}$$

(254)

where $f\left( \alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma }\right) $ is defined as

$$\begin{aligned}&f\left( \alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma }\right) =\frac{1}{\alpha -1}\nonumber \\&\quad \log \left\| \left( \left[ \mathcal {N}\left( \rho \right) \right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N} (\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma }\rho ^{\alpha /2}\right\| _{2}^{2}\qquad \end{aligned}$$

(255)

and $f\left( 1,V_{\mathcal {N}(\sigma )},V_{\sigma }\right) $ in (53). From the fact that

$$\begin{aligned} \left. \log \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma }\rho ^{\alpha /2}\right\| _{2}^{2}\right| _{\alpha =1}=0, \end{aligned}$$

(256)

we know (from the definition of derivative) that $\lim _{\alpha \rightarrow 1}f\left( \alpha ,V_{\mathcal {N}(\sigma )},V_{\sigma }\right) $ is equal to

$$\begin{aligned}&\left. \frac{\hbox {d}}{\hbox {d}\alpha }\log \left\| \left( \left[ \mathcal {N}\left( \rho \right) \right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma }\rho ^{\alpha /2}\right\| _{2}^{2}\right| _{\alpha =1}\nonumber \\&\quad \quad =\left. \frac{\hbox {d}}{\hbox {d}\alpha }\left\| \left( \left[ \mathcal {N}\left( \rho \right) \right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma }\rho ^{\alpha /2}\right\| _{2}^{2}\right| _{\alpha =1}.\nonumber \\ \end{aligned}$$

(257)

We evaluate the latter derivative by employing Taylor expansions. Substitute $\alpha =1+\gamma $, so that the quantity inside the derivative operation is equal to

$$\begin{aligned} \left\| \left( \left[ \mathcal {N}(\rho )\right] ^{-\gamma /2} V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\gamma /2}\otimes I_{E}\right) U\sigma ^{-\gamma /2}V_{\sigma }\rho ^{\left( 1+\gamma \right) /2}\right\| _{2}^{2}, \end{aligned}$$

(258)

which we can rewrite as

$$\begin{aligned} \left\| \left( \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N} (\rho )V_{\mathcal {N}(\sigma )}\right] ^{-\gamma /2}\left[ \mathcal {N} (\sigma )\right] ^{\gamma /2}\otimes I_{E}\right) U\sigma ^{-\gamma /2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{\left( 1+\gamma \right) /2}\right\| _{2}^{2}, \end{aligned}$$

(259)

due to the unitary invariance of the norm. Now we use that

$$\begin{aligned} \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{\left( 1+\gamma \right) /2}&=\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}+\frac{\gamma }{2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\log \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] +O\left( \gamma ^{2}\right) , \end{aligned}$$

(260)

$$\begin{aligned} \sigma ^{-\gamma /2}&=I-\frac{\gamma }{2}\log \sigma +O\left( \gamma ^{2}\right) ,\end{aligned}$$

(261)

$$\begin{aligned} \left[ \mathcal {N}(\sigma )\right] ^{\gamma /2}&=I+\frac{\gamma }{2} \log \left[ \mathcal {N}(\sigma )\right] +O\left( \gamma ^{2}\right) ,\end{aligned}$$

(262)

$$\begin{aligned} \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}\left( \rho \right) V_{\mathcal {N}(\sigma )}\right] ^{-\gamma /2}&=I-\frac{\gamma }{2} \log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N} (\sigma )}\right] +O\left( \gamma ^{2}\right) . \end{aligned}$$

(263)

The above implies that

$$\begin{aligned}&\left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}\left( \rho \right) V_{\mathcal {N}(\sigma )}\right] ^{-\gamma /2}\left[ \mathcal {N}(\sigma )\right] ^{\gamma /2}U\sigma ^{-\gamma /2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{\left( 1+\gamma \right) /2}\nonumber \\&\quad \quad =\left( I-\frac{\gamma }{2}\log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right] \right) \left( I+\frac{\gamma }{2}\log \left[ \mathcal {N}(\sigma )\right] \right) \nonumber \\&\quad \quad \times U\left( I-\frac{\gamma }{2}\log \sigma \right) \left( \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}+\frac{\gamma }{2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\log \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \right) +O\left( \gamma ^{2}\right) .\nonumber \\ \end{aligned}$$

(264)

By working out the right-hand side above and neglecting terms of second order in $\gamma $ and higher, we find that

$$\begin{aligned}&\left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}\left( \rho \right) V_{\mathcal {N}(\sigma )}\right] ^{-\gamma /2}\left[ \mathcal {N}(\sigma )\right] ^{\gamma /2}U\sigma ^{-\gamma /2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{\left( 1+\gamma \right) /2} \nonumber \\&\quad =U\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}-\frac{\gamma }{2} \log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}\left( \rho \right) V_{\mathcal {N}(\sigma )}\right] U\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\nonumber \\&\quad \quad +\frac{\gamma }{2}\log \left[ \mathcal {N}(\sigma )\right] U\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\nonumber \\&\quad \quad -\frac{\gamma }{2}U\left[ \log \sigma \right] \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}+\frac{\gamma }{2}U\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\log \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] +O\left( \gamma ^{2}\right) .\nonumber \\ \end{aligned}$$

(265)

The Hermitian conjugate is

$$\begin{aligned}&\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}U^{\dag }-\frac{\gamma }{2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}U^{\dag }\log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right] \nonumber \\&\quad +\frac{\gamma }{2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}U^{\dag }\log \left[ \mathcal {N}(\sigma )\right] \nonumber \\&\quad -\frac{\gamma }{2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\left[ \log \sigma \right] U^{\dag }+\frac{\gamma }{2}\left[ \log \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \right] \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}U^{\dag }+O\left( \gamma ^{2}\right) .\nonumber \\ \end{aligned}$$

(266)

Combining (265) with its Hermitian conjugate and neglecting higher order terms gives

$$\begin{aligned}&\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] -\gamma \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\mathcal {N}^{\dag }\left( \log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right] \right) \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\nonumber \\&\quad +\gamma \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\mathcal {N} ^{\dag }\left( \log \left[ \mathcal {N}(\sigma )\right] \right) \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}-\gamma \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\left[ \log \sigma \right] \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{1/2}\nonumber \\&\quad +\frac{\gamma }{2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \log \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] +\frac{\gamma }{2}\left( \log \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \right) \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] +O\left( \gamma ^{2}\right) . \end{aligned}$$

(267)

Taking a trace gives

$$\begin{aligned}&\text {Tr}\left\{ \rho \right\} -\gamma \text {Tr}\left\{ \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \mathcal {N}^{\dag }\left( \log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right] \right) \right\} \nonumber \\&\quad +\gamma \text {Tr}\left\{ \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \mathcal {N}^{\dag }\left( \log \left[ \mathcal {N}(\sigma )\right] \right) \right\} -\gamma \text {Tr}\left\{ \rho \left[ \log \sigma \right] \right\} \nonumber \\&\quad +\gamma \text {Tr}\left\{ \rho \log \rho \right\} +O\left( \gamma ^{2}\right) . \end{aligned}$$

(268)

We can now finally use the above development to conclude that

$$\begin{aligned}&\left. \frac{\hbox {d}}{\hbox {d}\alpha }\left\| \left( \left[ \mathcal {N}\left( \rho \right) \right] ^{\left( 1-\alpha \right) /2}V_{\mathcal {N}(\sigma )}\left[ \mathcal {N}(\sigma )\right] ^{\left( \alpha -1\right) /2}\otimes I_{E}\right) U\sigma ^{\left( 1-\alpha \right) /2}V_{\sigma }\rho ^{\alpha /2}\right\| _{2}^{2}\right| _{\alpha =1}\nonumber \\&\quad =\left. \frac{\hbox {d}}{\hbox {d}\gamma }\left\| \left( \left[ V_{\mathcal {N} (\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right] ^{-\gamma /2}\left[ \mathcal {N}(\sigma )\right] ^{\gamma /2}\otimes I_{E}\right) U\sigma ^{-\gamma /2}\left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] ^{\left( 1+\gamma \right) /2}\right\| _{2}^{2}\right| _{\gamma =0} \end{aligned}$$

(269)

$$\begin{aligned}&\quad ={\text {Tr}}\left\{ \rho \left[ \log \rho -\log \sigma \right] \right\} \nonumber \\&\quad -{\text {Tr}}\left\{ \mathcal {N}\left( \left[ V_{\sigma }\rho V_{\sigma }^{\dag }\right] \right) \left[ \log \left[ V_{\mathcal {N}(\sigma )}^{\dag }\mathcal {N}(\rho )V_{\mathcal {N}(\sigma )}\right] -\log \left[ \mathcal {N}(\sigma )\right] \right] \right\} \end{aligned}$$

(270)

$$\begin{aligned}&\quad =f(1,V_{\mathcal {N}(\sigma )},V_{\sigma }). \end{aligned}$$

(271)

A similar development with Taylor expansions leads to the conclusion that (63) holds. However, here one should employ the method outlined in the proof of [46, Proposition11].

Rights and permissions

Reprints and permissions

About this article

Cite this article

Dupuis, F., Wilde, M.M. Swiveled Rényi entropies. Quantum Inf Process 15, 1309–1345 (2016). https://doi.org/10.1007/s11128-015-1211-x

Download citation

Received: 27 October 2015
Accepted: 01 December 2015
Published: 15 February 2016
Issue Date: March 2016
DOI: https://doi.org/10.1007/s11128-015-1211-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Swiveled Rényi entropies

Abstract

Similar content being viewed by others

Forward and Reverse Entropy Power Inequalities in Convex Geometry

Weighted p-Rényi Entropy Power Inequality: Information Theory to Quantum Shannon Theory

Entropy Measures and Views of Information

Explore related subjects

1 Introduction

2 Summary of results

3 Preliminaries

3.1 Quantum states and channels

Definition 1

Definition 2

3.2 Entropies and norms

3.3 Hadamard three-line theorem

Theorem 1

3.4 Rényi generalizations of the quantum relative entropy difference

4 Swiveled Rényi generalizations of the quantum relative entropy difference

Definition 3

4.1 Reduction to Rényi relative entropy

4.2 Behavior around \(\alpha =1\)

Proposition 1

Proof

4.3 Monotonicity in the Rényi parameter

Theorem 2

Proof

Theorem 3

Proof

4.4 Bounds for the quantum relative entropy difference

Corollary 1

Definition 4

Corollary 2

Theorem 4

Remark 1

5 Swiveled Rényi conditional mutual information

Definition 5

Corollary 3

Corollary 4

Corollary 5

Corollary 6

6 Swiveled Rényi quantum information measures

7 Monotonicity of trace quantities

Proposition 2

Proof

Corollary 7

Proof

8 Conclusion

Notes

References

Acknowledgments

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix 1: Limit as \(\alpha \rightarrow 1\)

Definition 6

Theorem 5

Proof

Appendix 2: Auxiliary lemmas and proofs

Lemma 1

Proof

Proof of Theorem 4

Appendix 3: Taylor expansions

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation