Abstract
We consider both known and not previously studied trace functions with applications in quantum physics. By using perspectives we obtain convexity statements for different notions of residual entropy, including the entropy gain of a quantum channel studied by Holevo and others.
We give new proofs of Carlen-Lieb’s concavity/convexity theorems for certain trace functions, by making use of the theory of operator monotone functions. We then apply these methods in a study of new classes of trace functions.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction and First Results
Consider a quantum system in which an observable A can be written as a sum A=A 1+⋯+A k of a number of components A 1,…,A k . If the components correspond to isolated subsystems then the total quantum entropy of the system \(S(A)=-\operatorname{Tr} A\log A \) is equal to the sum of the entropies of each subsystem. In the general case we may define the residual entropy
as the difference between the total entropy of the system and the sum of the entropies of each subsystem; although it is a negative quantity.
Another type of residual entropy is the entropy gain over a quantum channel studied by Holevo and others [7, 8],
where Φ is a quantum channel represented by a completely positive trace preserving linear map.
Theorem 1.1
Consider n×n matrices A and n×m matrices K. The trace function
is convex in positive definite A for arbitrary K.
Proof
The function f(t)=tlogt defined for t>0 is operator convex. It is well-known but may be derived from [6, Theorem 2.4] since f(0)=0, and logt is operator monotone. The perspective function,
is therefore operator convex as a function of two variables [3, Theorem 2.2]. Consider the Hilbert space \(\mathcal{H}=M_{n\times m} \) equipped with inner product given by \((X,Y)=\operatorname{Tr} Y^{*}X \) for matrices X,Y∈M n×m and let L A and R B denote left and right multiplication with A∈M n and B∈M m respectively. If A and B are positive definite matrices then L A and R B are positive definite commuting operators on \(\mathcal{H}\). Operator convexity of the perspective function g(t,s) is equivalent to convexity of the map
for every K∈M n×m cf. [4, Theorem 1.1]. The statement of the theorem now follows by replacing B with K ∗ AK in the above expression. □
Corollary 1.2
The residual entropy
is a convex function in positive definite n×n matrices A 1,…,A k .
Proof
We apply Theorem 1.1 to block matrices of the form
and since the entry in the first row and the first column of the block matrix
is calculated to
the statement of the corollary follows. Notice that we used the same block matrix technique as in [2]. □
It is actually much easier to obtain the above result by expressing the residual entropy as a sum of relative entropies. We may however obtain other results by carefully choosing the arbitrary matrix K in Theorem 1.1.
Corollary 1.3
Consider the entropy gain
over a quantum channel Φ, where the channel is represented by a completely positive trace preserving linear map Φ. The entropy gain φ(A) is a convex function in A.
Proof
A completely positive trace preserving linear map Φ:M n →M m is of the form
where the so-called Kraus matrices a 1,…,a k ∈M n×m satisfy
We now apply Theorem 1.1 by substituting A by the matrix
The entry in the first row and the first column of the block matrix
is then calculated to −Φ(A)logΦ(A)+Φ(AlogA). Since Φ is trace preserving it follows that the entropic map
is convex. □
Corollary 1.4
The entropy gain
of k positive definite quantities observed through k quantum channels Φ 1,…,Φ k is a convex function in A 1,…,A k .
Proof
The statement is obtained as in the above corollary by considering suitable block matrices, where each block corresponds to a single quantum channel. We leave the details to the reader. □
2 Carlen-Lieb Trace Functions
We give new proofs of some of the statements in [2] without using variational methods.
Theorem 2.1
(Carlen-Lieb)
The trace function
is concave in positive definite matrices A and B.
Proof
The function
is operator monotone, cf. [1, Corollary 4.3]. Indeed, if z=re iθ with 0<θ<π then z p=r p e ipθ. Since we add a positive constant it is plain that the argument of z p+1 is less than pθ but still positive. The argument of f(z) is therefore between zero and pθ≤θ<π. We have shown that the analytic continuation of f to the complex upper half plane has positive imaginary part, thus f is operator monotone.
The perspective function
is therefore operator concave, cf. [3, Theorem 2.2] and so is the function,
that appears by composing with the operator monotone and operator concave function t→t p/r.
The left and right multiplication operators L A and R B are positive definite commuting operators on the Hilbert space \(\mathcal{H}=M_{n} \) equipped with the inner product \((A,B)=\operatorname{Tr} B^{*} A\). It follows that the (super) operator mapping
is concave according to the preceding remark. The trace function
is therefore concave by [4, Theorem 1.1]. The statement now follows by choosing K as the identity matrix. Indeed, under the trace we have
for each n, and we thus obtain
by simple algebraic calculations. □
Notice that the statement in (1) is stronger than what is obtained in the reference [2].
Theorem 2.2
The function
is operator convex for 1≤p≤2.
Proof
We have previously shown that f is operator monotone for 0<p≤1. Let us calculate the representing measure.
We set z=re iθ for r>0 and 0<θ<π and calculate the analytic continuation of f,
into the complex upper half plane. Let argz with 0≤argz<2π denote the angle between the positive x-axis and the complex number z=x+iy. With this non-standard convention argz is an analytic function in C∖[0,∞), and the angle A p (r,θ) between the positive x-axis and (r p e ipθ+1)1/p is given by
and it satisfies
The imaginary part of the analytic continuation of f is therefore given by
and the representing measure of f is obtained as the limit
It follows that
where β is a constant determined by setting t=0 in Eq. (2), and the non-negative function h p is given by
cf. [5] for the details. The key in the proof is the realisation that
and this is so because argz<arg(z+1)<2π when z is in the lower complex plane. It follows that both sides in Eq. (2) are real analytic functions in p in the whole interval (0,2).
The formula in (2) is consequently valid also for 1≤p≤2. However, for 1<p<2 the weight function h p is negative implying that f is operator convex. Notice that h p =0 for p=1. □
The same line of arguments as for 0<p≤1 applies, so we obtain:
Corollary 2.3
The trace function
is convex in positive definite matrices A and B.
2.1 Variational inequalities
Remark 2.4
Let x and y be positive numbers and take 0<p<1. It is easy to prove that
with equality for λ=x p(x p+y p)−1.
Theorem 2.5
Let 0<p<1 and take positive definite n×n matrices A,B. Then
for each n×n matrix X with 0<X<1. If A and B commute then there is equality for X=A p(A p+B p)−1.
Proof
We know that the trace function \(\varphi(X,Y)=\operatorname{Tr}(X^{p}+Y^{p})^{1/p} \) is concave in positive definite X and Y. It is also positively homogeneous since
It follows that the Fréchet differential
for positive definite X,Y,A,B, cf. for example [9, Lemma 5]. We notice that
by the chain rule for Fréchet differentials. By setting f(t)=t 1/p and g(t)=t p we obtain
and similarly
We thus derive that
Let now 0<X<1 and set Y=(1−X p)1/p. Then X p+Y p=1 and thus
We may replace X with X 1/p since any 0<X<1 can be obtained in this way, and we obtain
which is the statement of the theorem. □
3 New Types of Trace Functions
Theorem 3.1
Let 0<p≤1. The function of two variables,
defined for t,s>0, is operator concave.
Proof
We notice that g(t,s) is not a perspective function, so our approach will have to be more indirect. We first prove that for 0≤λ≤1 the function
is operator monotone. Indeed, if z=re iθ with 0<θ<π, then z p=r p e ipθ. Since we add a positive constant it is plain that the argument of λz p+1−λ is less that pθ but still positive. The argument of f λ (z) is therefore between zero and θ<π. We have shown that the analytic continuation of f λ to the complex upper half plane has positive imaginary part, thus f λ is operator monotone.
The perspective function
is operator concave and so is any function that appears as the composition of an operator monotone function of one variable with the perspective. It follows that
is operator concave. However, by an elementary calculation we may write
and the statement of the theorem follows. □
Take 0≤p≤1. Since the function \((t,s)\to\frac {t-s}{t^{p}-s^{p}} \) is operator concave, it follows that the trace function
is concave in positive definite n×n matrices for any n×n matrix K, where L A and R B denote left and right multiplication with A and B.
By choosing K as the unit matrix we obtain:
Theorem 3.2
Let 0<p≤1. The trace function
is concave in positive definite matrices.
4 The Fréchet Differential
Some of the techniques in this section are adapted from [9].
Theorem 4.1
Consider the function f(t)=t p for 0<p≤1. The map
defined in positive definite n×n matrices, is concave for each self-adjoint n×n matrix h.
Proof
Consider x>0 and a basis \((e_{i})_{i=1}^{n} \) in which x is diagonal with eigenvalues given by xe i =λ i e i for i=1,…,n. We may then calculate
Expressed in this basis df(x)h=h∘L f (λ 1,…,λ n ) is the Hadamard (entry-wise) product of h and the Löwner matrix
The inverse Fréchet differential df(x)−1 h is therefore well-defined and given by the Hadamard product
expressed in the same basis and thus
where L x and R x are left and right multiplication with x and
The operators L x and R x are positive definite commuting operators on the Hilbert space \(\mathcal{H}=M_{m} \) equipped with the inner product \((A,B)=\operatorname{Tr} B^{*} A\). The last expression \(\operatorname{Tr} h \mathit{df}(x)^{-1}h=\operatorname{Tr} h g(L_{x}, R_{x})h \) is independent of any particular basis, and since g is operator concave by Theorem 3.1, we obtain [4, Theorem 1.1] that the map \(x\to\operatorname{Tr} h \mathit{df}(x)^{-1}h \) is concave. □
Theorem 4.2
Consider the function f(t)=t p for 0<p≤1. The map of two variables,
is convex.
Proof
Keeping the notation as in the proof of Theorem 1.1 we define two quadratic forms α and β on \(\mathcal{H}\oplus\mathcal{H} \) by setting
where A 1,A 2 are positive definite matrices, and A=λA 1+(1−λ)A 2 for some λ∈[0,1]. The statement of the theorem is equivalent to the majorisation
for arbitrary self-adjoint X,Y∈M n . The quadratic form \(h\to \operatorname{Tr} h \mathit{d f}(x)h \) is positive definite since
where \((e_{i})_{i=1}^{n} \) is a basis in which x is diagonal and λ 1,…λ n are the corresponding eigenvalues counted with multiplicity. We also notice that the corresponding sesqui-linear form is given by
The two quadratic forms α and β are in particular positive definite. Therefore, there exists an operator Γ on \(\mathcal{H}\oplus\mathcal{H} \) which is positive definite in the Hilbert space structure given by β such that
where we retain the notation α and β also for the corresponding sesqui-linear forms. Suppose γ is an eigenvalue of Γ corresponding to an eigenvector X⊕Y. Then
or equivalently
for arbitrary X′,Y′∈M n . From this we may derive the identities
and thus by setting M=df(A)(λX+(1−λ)Y), we obtain
By multiplying from the left with M ∗ and taking the trace we obtain
where the last inequality is implied by the concavity result in Theorem 4.1. This shows that the positive definite operator Γ≥1 from which (3) and the statement of the theorem follow. □
Since the dependence of the function f in \(\operatorname{Tr} h \mathit{df}(x)h \) is linear we immediately obtain:
Corollary 4.3
Let f be a function written on the form
where μ is a positive measure on the unit interval. Then the map of two variables,
is convex.
If we in the corollary above choose μ as the Lebesgue measure, we realise that
is an example of a function such that \((x,h)\to\operatorname{Tr} h \mathit{d f}(x)h \) is convex. Moreover, the perspective g of f given by
is operator concave. Since
this observation directly shows that the function \(x\to\operatorname{Tr} h d{\log} (x)^{-1}h \) is concave, cf. [9, Eq. (3.4)].
5 More Trace Functions
Lemma 5.1
Let K be a contraction. Then
for −1≤q≤1.
Proof
By continuity we may assume K invertible. For 0≤q≤1 we use the inequality
or by inversion
which implies the inequality
For −1≤q≤0 we apply Jensen’s sub-homogeneous operator inequality
or by inversion
This inequality finally implies
and the proof is complete. □
Corollary 5.2
Let K be a contraction. The mapping
is decreasing for −1≤q≤1.
Proof
The Fréchet differential of φ(A) is given by
thus d φ(A)D≤0 for arbitrary D≥0 by the preceding lemma. □
Change history
07 June 2022
A Correction to this paper has been published: https://doi.org/10.1007/s10955-022-02929-z
References
Ando, T.: Concavity of certain maps of positive definite matrices and applications to Hadamard products. Linear Algebra Appl. 26, 203–241 (1979)
Carlen, E.A., Lieb, E.H.: A Minkowsky type trace inequality and strong subadditivity of quantum entropy II: Convexity and concavity. Lett. Math. Phys. 83, 107–126 (2008)
Effros, E.G.: A matrix convexity approach to some celebrated quantum inequalities. Proc. Natl. Acad. Sci. USA 106, 1006–1008 (2009)
Hansen, F.: Extensions of Lieb’s concavity theorem. J. Stat. Phys. 124, 87–101 (2006)
Hansen, F.: The fast track to Löwner’s theorem. Linear Algebra Appl. 438, 4557–4571 (2013)
Hansen, F., Pedersen, G.K.: Jensen’s inequality for operators and Löwner’s theorem. Math. Ann. 258, 229–241 (1982)
Holevo, A.S.: Entropy gain and the Choi-Jamiolkowski correspondence for infinite-dimensional quantum evolutions. Theor. Math. Phys. 166, 123–138 (2011)
Holevo, A.S., Giovannetti, V.: Quantum channels and their entropic characteristics. Rep. Prog. Phys. 75, 046001 (2012)
Lieb, E.: Convex trace functions and the Wigner-Yanase-Dyson conjecture. Adv. Math. 11, 267–288 (1973)
Acknowledgements
We thank Peter Harremoës for pointing out that the convexity of the residual entropy of a compound system may be easily inferred by considering it as a sum of relative entropies.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Hansen, F. Trace Functions with Applications in Quantum Physics. J Stat Phys 154, 807–818 (2014). https://doi.org/10.1007/s10955-013-0890-x
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10955-013-0890-x