Abstract
We provide a self-contained and fast-paced introduction to the theories of operator semigroups, Markov semigroups and quantum dynamical semigroups. The level is appropriate for well-motivated graduate students who have a background in analysis or probability theory, with the focus on the characterisation of infinitesimal generators for various classes of semigroups. The theorems of Hille–Yosida, Hille–Yosida–Ray, Lumer–Phillips and Gorini–Kossakowski–Sudarshan–Lindblad are all proved, with the necessary technical prerequisites explained in full. Exercises are provided throughout.
Access provided by Autonomous University of Puebla. Download chapter PDF
Similar content being viewed by others
1 Introduction
These notes are an extension of a series of lectures given at the Winter School on Dynamical Methods in Open Quantum Systems held at Georg-August-Universität Göttingen during November 2016. These lectures were aimed at graduate students with a background in analysis or probability theory. The aim has been to make the notes self-contained but brief, so that they are widely accessible. Exercises are provided throughout.
We begin with the basics of the theory of operator semigroups on Banach spaces, and develop this up to the Hille–Yosida and Lumer–Phillips theorems; these provide characterisations for the generators of strongly continuous semigroups and strongly continuous contraction semigroups, respectively. As those with a background in probability theory may not be comfortable with all of the necessary material from functional analysis, this is covered rapidly at the start. The reader can find much more on these topics in Davies’s book [9].
After these fundamentals, we recall some key ideas from probability theory. The correspondence between time-homogeneous Markov processes and Markov semigroups is explained, and we explore the concepts of Feller semigroups and Lévy processes. We conclude with the Hille–Yosida–Ray theorem, which characterises generators of Feller semigroups via the positive maximum principle. Applebaum [3, Chapter 3] provides another view of much of this material, as do Liggett [20, Chapter 3] and Rogers and Williams [26, Chapter III].
The final part of these notes addresses the theory of quantum Markov semigroups, and builds to the characterisation of the generators of uniformly continuous conservative semigroups, and the Gorini–Kossakowski–Sudarshan–Lindblad form. En route, we establish Stinespring dilation and Kraus decomposition for linear maps defined on unital C ∗ algebras and von Neumann algebras, respectively, which are important results in the theories of open quantum systems and quantum information. The lecture notes of Alicki and Lendi [2] provide a useful complement, and those of Fagnola [14] study quantum Markov semigroups from the fruitful perspective of quantum probability. There is much scope, and demand, for further developments in this subject.
1.1 Acknowledgements
The author is grateful to the organisers of the winter school, Prof. Dr. Dorothea Bahns (Göttingen), Prof. Dr. Anke Pohl (Jena) and Prof. Dr. Ingo Witt (Göttingen), for the opportunity to give these lectures, and for their hospitality during his time in Göttingen. He is also grateful to Mr. Jason Hancox, for his comments on a previous version of these notes.
1.2 Conventions
The notation “P := Q” means that the quantity P is defined to equal Q.
The sets of natural numbers, non-negative integers, non-negative real numbers, real numbers and complex numbers are denoted \(\mathbb {N} := \{ 1, 2, 3, \dots \}\), \(\mathbb {Z}_+ := \{ 0 \} \cup \mathbb {N}\), \(\mathbb {R}_+ := [ 0, \infty )\), \(\mathbb {R}\) and \(\mathbb {C}\), respectively; the square root of − 1 is denoted i. Note that we follow the Anglophone rather than Francophone convention, in that 0 is both non-negative and non-positive but is neither positive nor negative.
The indicator function of the set A is denoted 1A, with the domain determined by context. If f : A → B and C ⊆ A, then f|C : C → B, the restriction of f to C, takes the same value at any point in C as f does.
Inner products on complex vector spaces are taken to be linear on the right and conjugate linear on the left. Given our final destination, we work with complex vector spaces and complex-valued functions by default.
2 Operator Semigroups
2.1 Functional-Analytic Preliminaries
Throughout their development, there has been a fruitful interplay between abstract functional analysis and the theory of operator semigroups. Here we give a rapid introduction to some of the basic ideas of the former. We cover a little more material that will be used in the sequel, but the reader will find it useful for their further studies in semigroup theory.
Definition 2.1
In these notes, a normed vector space V is a vector space with complex scalar field, equipped with a norm \(\| \cdot \| : V \to \mathbb {R}_+\) which is
-
(i)
subadditive: \(\| u + v \| \leqslant \| u \| + \| v \|\) for all u, v ∈ V ;
-
(ii)
homogeneous: ∥λv∥ = |λ| ∥v∥ for all v ∈ V and \(\lambda \in \mathbb {C}\); and
-
(iii)
faithful: ∥v∥ = 0 if and only if v = 0, for all v ∈ V .
The normed vector space V is complete if, whenever \(( v_n )_{n \in \mathbb {N}} \subseteq V\) is a Cauchy sequence, there exists v ∞∈ V such that v n → v ∞ as n →∞. A complete normed vector space is called a Banach space . Thus Banach spaces are those normed vector spaces in which every Cauchy sequence is convergent.
[Recall that a sequence \(( v_n )_{n \in \mathbb {N}} \subseteq V\) is Cauchy if, for all ε > 0, there exists \(N \in \mathbb {N}\) such that ∥v m − v n∥ < ε for all m, \(n \geqslant N\).]
Exercise 2.2 (Banach’s Criterion)
Let ∥⋅∥ be a norm on the complex vector space V . Prove that V is complete for this norm if and only if every absolutely convergent series in V is convergent.
[Given \(( v_n )_{n \in \mathbb {N}} \subseteq V\), the series \(\sum _{n = 1}^\infty v_n\) is said to be convergent precisely when the sequence of partial sums \(( \sum _{j = 1}^n v_j )_{n \in \mathbb {N}}\) is convergent, and absolutely convergent when \(( \sum _{j = 1}^n \| v_j \| )_{n \in \mathbb {N}}\) is convergent.]
Example 2.3
If \(n \in \mathbb {N}\), then the finite-dimensional vector space \(\mathbb {C}^n\) is a Banach space for any of the ℓ p norms, where p ∈ [1, ∞] and
These norms are all equivalent : for all p, q ∈ [1, ∞] there exists C p,q > 1 such that
Example 2.4
For all p ∈ [1, ∞], let the sequence space
where
and the vector-space operations are defined coordinate-wise: if u, v ∈ ℓ p and \(\lambda \in \mathbb {C}\), then
These are Banach spaces, with ℓ p ⊆ ℓ q if p, q ∈ [1, ∞] are such that \(p \leqslant q\).
If p ∈ [1, ∞), then ℓ p ⊆ c 0 ⊆ ℓ ∞, where
is itself a Banach space for the norm ∥⋅∥∞.
Example 2.5
An inner product on the complex vector space V is a form
which is
-
(i)
linear in the second argument: the map \(V \to \mathbb {C}; v \mapsto \langle u, v \rangle \) is linear for all u ∈ V ;
-
(ii)
Hermitian: \(\overline {\langle u, v \rangle } = \langle v, u \rangle \) for all u, v ∈ V ; and
-
(iii)
positive definite: \(\langle v, v \rangle \geqslant 0\) for all v ∈ V , with equality if and only if v = 0.
Any inner product determines a norm on V , by setting ∥v∥ := 〈v, v〉1∕2 for all v ∈ V . Furthermore, the inner product can be recovered from the norm by polarisation : if \(q : V \times V \to \mathbb {C}\) is a sesquilinear form on V , so is conjugate linear in the first argument and linear in the second, then
A Banach space with norm which comes from an inner product is a Hilbert space . For example, the sequence space ℓ 2 is a sequence space, since setting
defines an inner product on ℓ 2 such that 〈v, v〉 = ∥v∥2 for all v ∈ ℓ 2. In any Hilbert space H, the Cauchy–Schwarz inequality holds:
It may be shown that a Banach space V is a Hilbert space if and only if the norm satisfies the parallelogram law :
Exercise 2.6
Let H be a Hilbert space. Given any set S ⊆ H, prove that its orthogonal complement
is a closed linear subspace of H. Prove further that L ⊆ H is a closed linear subspace of H if and only if L = (L ⊥)⊥.
Example 2.7
Let C(K) denote the complex vector space of complex-valued functions on the compact Hausdorff space K, with vector-space operations defined pointwise: if x ∈ K then
for all f, g ∈ C(K) and \(\lambda \in \mathbb {C}\). The supremum norm
makes C(K) a Banach space. [Completeness is the undergraduate-level fact that uniform convergence preserves continuity.]
Example 2.8
Let \(( \Omega , \mathcal {F}, \mu )\) be a σ-finite measure space, so that \(\mu : \mathcal {F} \to [ 0, \infty ]\) is a measure and there exists a countable cover of Ω with elements in \(\mathcal {F}\) of finite measure.
For all p ∈ [1, ∞], the L p space
is a Banach space when equipped with the L p norm
and where functions are identified if they differ on a null set. [Note that if \(f \in L^p ( \Omega , \mathcal {F}, \mu )\) then ∥f∥p = 0 if and only if f = 0 on a null set.]
The space \(L^2( \Omega , \mathcal {F}, \mu )\) is a Hilbert space, with inner product such that
If p, q, r ∈ [1, ∞] are such that p −1 + q −1 = r −1, where ∞ −1 := 0, then
this is Hölder’s inequality . The subadditivity of the L p norm, known as Minkowski’s inequality , may be deduced from Hölder’s inequality. When r = 1 and p = q = 2, Hölder’s inequality is known as the Cauchy–Bunyakovsky–Schwarz inequality .
Example 2.9
Let \(d \geqslant 1\). The space \(C^\infty _c( \mathbb {R}^d )\) of continuous functions on \(\mathbb {R}^d\) with compact support is a subspace of \(L^p( \mathbb {R}^d )\) for all p ∈ [1, ∞], and is dense for p ∈ [1, ∞), when \(\mathbb {R}^d\) is equipped with Lebesgue measure.
Given a multi-index \(\alpha = ( \alpha _1, \dots , \alpha _d ) \in \mathbb {Z}_+^d\), let |α| := α 1 + ⋯ + α d and
Note that \(D^\alpha f \in C^\infty _c( \mathbb {R}^d )\) for all \(f \in C^\infty _c( \mathbb {R}^d )\) and \(\alpha \in \mathbb {Z}_+^d\).
Let \(f \in L^p( \mathbb {R}^d )\), where p ∈ [1, ∞], and note that \(f g \in L^1( \mathbb {R}^d )\) for all \(g \in C^\infty _c( \mathbb {R}^d )\), by Hölder’s inequality. If there exists \(F \in L^p( \mathbb {R}^d )\) such that
then F is the weak derivative of f, and we write F = D α f. [It is a straightforward exercise to verify that the weak derivative is unique, and that this agrees with the previous definition if \(f \in C^\infty _c( \mathbb {R}^d )\).]
Given p ∈ [1, ∞) and \(k \in \mathbb {Z}_+\), the Sobolev space
is a Banach space when equipped with the norm
and contains \(C^\infty _c( \mathbb {R}^d )\) as a dense subspace.
The Sobolev space \(W^{k, 2}( \mathbb {R}^d )\) is usually abbreviated to \(H^k( \mathbb {R}^d )\) and is a Hilbert space, with inner product such that
Exercise 2.10
Prove that the normed vector space \(W^{k, p}( \mathbb {R}^d )\), as defined in Example 2.9, is complete.
Example 2.11
Let U and V be normed vector spaces. A linear operator T : U → V is bounded if
If T is bounded, then \(\| T u \| \leqslant \| T \| \, \| u \|\) for all u ∈ U, and ∥T∥ is the smallest constant with this property.
The set of all such linear operators is denoted by B(U;V ), or B(U) if U and V are equal.
This set is a normed vector space, with operator norm T↦∥T∥ and algebraic operations defined pointwise, so that
for all S, T ∈ B(U;V ), λ ∈ C and U ∈ U. Furthermore, the space B(U;V ) is a Banach space whenever V is.
Exercise 2.12
Prove the claims in Example 2.11.
Exercise 2.13
Let V be a normed vector space. Prove that the norm on B(V ) is submultiplicative : if S, T ∈ B(V ), then ST : v↦S(Tv) ∈ B(V ), with \(\| S T \| \leqslant \| S \| \, \| T \|\).
Exercise 2.14
Let U and V be normed vector spaces and let T : U → V be a linear operator. Prove that T is bounded if and only if it is continuous when U and V are equipped with their norm topologies.
Example 2.15
Given any normed space V , its topological dual or dual space is the Banach space \(V^* := {B(V;\mathbb {C})}\). An element of V ∗ is called a linear functional or simply a functional .
If p, q ∈ (1, ∞) are conjugate indices , so that such that p −1 + q −1 = 1, then (ℓ p)∗ is naturally isomorphic to ℓ q via the dual pairing
Hölder’s inequality shows that u↦[u, v] is an element of (ℓ p)∗ for any v ∈ ℓ q; proving that every functional arises this way is an exercise. Furthermore, the same pairing gives an isomorphism between (ℓ 1)∗ and ℓ ∞. [The dual of ℓ ∞ is much larger than ℓ 1; it is isomorphic to the space \(M( \beta \mathbb {N} )\) of regular complex Borel measures on the Stone–Čech compactification of the natural numbers.]
Similarly, for conjugate indices p, q ∈ (1, ∞), the dual of \(L^p\mbox{{$( \Omega , \mathcal {F}, \mu )$}}\) is identified with \(L^q\mbox{{$( \Omega , \mathcal {F}, \mu )$}}\), and the dual of \(L^1\mbox{{$( \Omega , \mathcal {F}, \mu )$}}\) with \(L^\infty \mbox{{$( \Omega , \mathcal {F}, \mu )$}}\), via the pairing
In particular, ℓ 2 and \(L^2\mbox{{$( \Omega , \mathcal {F}, \mu )$}}\) are conjugate-linearly isomorphic to their dual spaces. This is a general fact about Hilbert spaces, known as the Riesz–Fréchet theorem : if H is a Hilbert space, then
If K is a compact Hausdorff space, then the dual of C(K) is naturally isomorphic to the space M(K) of regular complex Borel measures on K, with dual pairing
The Hahn–Banach theorem [25, Corollary 2 to Theorem III.6] implies that the dual space separates points: if v ∈ V , then there exists ϕ ∈ V ∗ such that ∥ϕ∥ = 1 and ϕ(v) = ∥v∥.
Example 2.16
Duality makes an appearance at the level of operators. If U and V are normed spaces and T ∈ B(U;V ), then there exists a unique dual operator T′∈ B(V ∗;U ∗) such that
The map T↦T′ from B(U;V ) to B(V ∗;U ∗) is linear and reverses the order of products: if S ∈ B(U;V ) and T ∈ B(V ;W), then (TS)′ = S′T′.
If H and K are Hilbert spaces, and we identify each of these with its dual via the Riesz–Fréchet theorem, then the operator dual to T ∈ B(H;K) is identified with the adjoint operator T ∗∈ B(K;H), since
2.2 Semigroups on Banach Spaces
Definition 2.17
A family of operators \(T = ( T_t )_{t \in \mathbb {R}_+} \subseteq {B(V)}\) is a one-parameter semigroup on V , or a semigroup for short, if
The semigroup T is strongly continuous if
and is uniformly continuous if
Exercise 2.18
Prove that a uniformly continuous semigroup is strongly continuous. [The converse is false: see Exercise 2.29.]
Theorem 2.19
Let T be a strongly continuous semigroup on the Banach space V . There exist constants \(M \geqslant 1\) and \(a \in \mathbb {R}\) such that \(\| T_t \| \leqslant M e^{a t}\) for all \(t \in \mathbb {R}_+\).
Proof
See [9, Theorem 6.2.1]. □
Remark 2.20
The semigroup T of Theorem 2.19 is said to be of type (M, a). A semigroup of type (1, 0) is also called a contraction semigroup .
By replacing T t with e −at T t, one can often reduce to the case of semigroups with uniformly bounded norm. However, it is not always possible to go further and reduce to contraction semigroups; see [9, Example 6.2.3 and Theorem 6.3.8].
Exercise 2.21
Prove that a strongly continuous semigroup is strongly continuous at every point: if \(t \geqslant 0\), then limh→0∥T t+h x − T t x∥ = 0. Prove further that the same is true if “strongly” is replaced by “uniformly”.
Exercise 2.22
Given any A ∈ B(V ), let \(\displaystyle \exp ( A ) := \sum _{n = 0}^\infty \frac {1}{n!} A^n\).
-
(i)
Prove that this series is convergent, so that \(\exp ( A ) \in {B(V)}\). Prove further that \(\| \exp ( A ) \| \leqslant \exp \| A \|\).
-
(ii)
Prove that if B ∈ B(V ) commutes with A, so that that AB = BA, then \(\exp ( A )\) and \(\exp ( B )\) also commute, with \(\exp ( A ) \exp ( B ) = \exp ( A + B )\). [Hint: consider the derivatives of
$$\displaystyle \begin{aligned} t \mapsto \exp( t A ) \exp( -t A ) \quad \text{and} \quad t \mapsto \exp( t A ) \exp( t B ) \exp\bigl( -t ( A + B ) \bigr).] \end{aligned}$$ -
(iii)
Prove that setting \(T_t := \exp ( t A )\) for all \(t \in \mathbb {R}_+\) produces a uniformly continuous one-parameter semigroup T.
The converse of Exercise 2.22(iii) is true, and we state it as a theorem.
Theorem 2.23
If T is a uniformly continuous one-parameter semigroup, then there exists an operator A ∈ B(V ) such that \(T_t = \exp ( t A )\) for all \(t \in \mathbb {R}_+\).
Proof
By continuity at the origin, there exists t 0 > 0 such that
Then
Thus \(X := t_0^{-1} \int _0^{t_0} T_s \,\mathrm {d} s \in {B(V)}\) is invertible, because the Neumann series
is absolutely convergent, so convergent, by Banach’s criterion. Furthermore,
as h → 0+. Hence
Moreover, for any t ∈ [0, t 0],
as n →∞, since
This working shows that \(T_t = \exp ( t A )\) for any t ∈ [0, t 0], so for all \(t \in \mathbb {R}_+\), by the semigroup property: there exists \(n \in \mathbb {Z}_+\) and s ∈ [0, t 0) such that t = nt 0 + s, and
□
Remark 2.24
The integrals in the previous proof are Bochner integrals ; they are an extension of the Lebesgue integral to functions which take values in a Banach space. We will only be concerned with continuous functions, so do not need to concern ourselves with notions of measurability. All the standard theorems carry over from the Lebesgue to the Bochner setting, such as the inequality \(\| \int f( t ) \,\mathrm {d} t \| \leqslant \int \| f( t ) \| \,\mathrm {d} t\), and if T is a bounded operator then \(T \int f( t ) \,\mathrm {d} t = \int T f( t ) \,\mathrm {d} t\).
Definition 2.25
If T is a uniformly continuous semigroup, then the operator A ∈ B(V ) such that \(T_t = \exp ( t A )\) for all \(t \in \mathbb {R}_+\) is the generator of the semigroup.
Exercise 2.26
Prove that the generator of a uniformly continuous one-parameter semigroup T is unique. [Hint: consider the limit of t −1(T t − I) as t → 0+.]
Example 2.27
Given \(t \in \mathbb {R}_+\) and \(f \in V := L^p( \mathbb {R}_+ )\), where p ∈ [1, ∞), let
Then T t ∈ B(V ), with ∥T t∥ = 1, and \(T = ( T_t )_{t\in \mathbb {R}_+}\) is a one-parameter semigroup. If f is continuous and has compact support, then an application of the Dominated Convergence Theorem gives that T t f → f as t → 0+; since such functions are dense in V , it follows that T is strongly continuous.
Exercise 2.28
Prove the assertions in Example 2.27. Prove also that if \(f \in V = L^p( \mathbb {R}_+ )\) is absolutely continuous, with f′∈ V such that
then
where the limit exists in V . [Hint: show that
and then use the strong continuity of T at the origin.]
Exercise 2.29
Prove that the semigroup of Example 2.27 is not uniformly continuous. [Hint: let \(f_n = \lambda _n 1_{[ n^{-1}, 2 n^{-1} ]}\), where the positive constant λ n is chosen to make f n a unit vector in V , and consider ∥T t f n − f n∥ for n > t −1.]
2.3 Beyond Uniform Continuity
As shown above, uniformly continuous one-parameter semigroups are in one-to-one correspondence with bounded linear operators. To move beyond this situation, we need to introduce linear operators which are only partially defined on the ambient Banach space V .
Definition 2.30
An unbounded operator in V is a linear transformation A defined on a linear subspace V 0 ⊆ V , its domain ; we write \( \mathop {\mathrm {dom}} A = V_0\).
An extension of A is an unbounded operator B in V such that \( \mathop {\mathrm {dom}} A \subseteq \mathop {\mathrm {dom}} B\) and the restriction \(B|{ }_{ \mathop {\mathrm {dom}} A} = A\). In this case, we write A ⊆ B.
An unbounded operator A in V is densely defined if \( \mathop {\mathrm {dom}} A\) is dense in V for the norm topology.
Definition 2.31
Given operators A and B, let A + B and AB be defined by setting
and
Note that neither A + B nor AB need be densely defined, even if both A and B are.
Definition 2.32
Let T be a strongly continuous one-parameter semigroup on V . Its generator A is an unbounded operator with domain
and action
It is readily verified that A is an unbounded operator.
Exercise 2.33
Prove that if v ∈ V and \(t \in \mathbb {R}_+\) then
Deduce that \( \mathop {\mathrm {dom}} A\) is dense in V . [Hint: begin by imitating the proof of Theorem 2.23.]
Lemma 2.34
Let the strongly continuous semigroup T have generator A. If \(v \in \mathop {\mathrm {dom}} A\) and \(t \in \mathbb {R}_+\) , then \(T_t v \in \mathop {\mathrm {dom}} A\) and T t Av = AT t v; thus, \(T_t( \mathop {\mathrm {dom}} A ) \subseteq \mathop {\mathrm {dom}} A\) . Furthermore,
Proof
First, note that
by the boundedness of T t, so \(T_t v \in \mathop {\mathrm {dom}} A\) and AT t v = T t Av, as claimed. For the second part, let
Note that F is continuous and F(0) = 0; furthermore, if t > 0, then
as h → 0+, whence F ≡ 0. □
Definition 2.35
An operator A in V is closed if, whenever \(( v_n )_{n \in \mathbb {N}} \subseteq \mathop {\mathrm {dom}} A\) is such that v n → v ∈ V and Av n → u ∈ V , it follows that \(v \in \mathop {\mathrm {dom}} A\) and Av = u. Note that a bounded operator is automatically closed.
The operator A is closable if it has a closed extension, in which case the closure \(\overline {A}\) is the smallest closed extension of A, where the ordering of operators is given in Definition 2.30.
Exercise 2.36
Prove that the graph
of an unbounded operator A in V is a normed vector space for the product norm
Prove further that A is closed if and only if \(\mathcal {G}( A )\) is a Banach space, and that A is closable if and only if the closure of its graph in V ⊕ V is the graph of some operator. Finally, prove that if A is closable then \(\mathcal {G}\bigl ( \overline {A} \bigr )\) is the intersection of the graphs of all closed extensions of A.
Exercise 2.37
Let A be the generator of the strongly continuous one-parameter semigroup T. Use Lemma 2.34 and Theorem 2.19 to show that A is closed.
Proof
Suppose \(( v_n )_{n \in \mathbb {N}} \subseteq \mathop {\mathrm {dom}} A\) is such that v n → v and Av n → u. Let t > 0 and note that
Furthermore,
as n →∞, so
Dividing both sides by t and letting t → 0+ gives that \(v \in \mathop {\mathrm {dom}} A\) and Av = u, as required. □
Definition 2.38
Let H be Hilbert space. If A is a densely defined operator in H, then the adjoint A ∗ is defined by setting
and
When A is bounded, this agrees with the earlier definition. If A is not densely defined, then there may be no unique choice for v, so this definition cannot immediately be extended further.
It is readily verified that the adjoint A ∗ is always closed: if \(( u_n )_{n \in \mathbb {N}} \subseteq \mathop {\mathrm {dom}} A^*\) is such that u n → u ∈H and A ∗ u n → v ∈H then
so \(x \in \mathop {\mathrm {dom}} A^*\) and A ∗ u = v.
Exercise 2.39
Prove that a densely defined operator A is closable if and only if its adjoint A ∗ is densely defined, in which case \(\overline {A} = ( A^* )^*\) and \(\overline {A}^{\,*} = A^*\).
Definition 2.40
A densely defined operator A in a Hilbert space is self-adjoint if and only if A ∗ = A. This is stronger than the condition that
which is merely the condition that A ⊆ A ∗. An operator satisfying this inclusion is called symmetric .
Exercise 2.41
Let A be a densely defined operator in the Hilbert space H. Prove that A is self-adjoint if and only if A is symmetric and such that both A + iI and A −iI are surjective, so that
Proof
Suppose first that A is symmetric and the range conditions hold. Let u, v ∈H be such that
so that \(u \in \mathop {\mathrm {dom}} A^*\) and A ∗ u = v. We wish to prove that \(u \in \mathop {\mathrm {dom}} A\) and Au = v.
Let x, \(y \in \mathop {\mathrm {dom}} A\) be such that (A −iI)x = v −iu and (A + iI)y = u − x. Then
where the penultimate equality holds because A is symmetric and x, \(y \in \mathop {\mathrm {dom}} A\). It follows that ∥u − x∥2 = 0, so \(u = x \in \mathop {\mathrm {dom}} A\) and Au = Ax = v −iu + ix = v.
Now suppose that A is self-adjoint, and note that it suffices to prove that A + iI is surjective, since − A is self-adjoint whenever A is.
Note first that
which implies that \( \mathop {\mathrm {ran}} ( A + \mathrm {i} I )\) is closed: if the sequence \(( v_n )_{n \in \mathbb {N}} \subseteq \mathop {\mathrm {dom}} A\) is such that \(\bigl ( ( A + \mathrm {i} I ) v_n \bigr )_{n \in \mathbb {N}}\) is convergent, then both \(( v_n )_{n \in \mathbb {N}}\) and \(( A v_n )_{n \in \mathbb {N}}\) are Cauchy, so convergent, with v n → v ∈H and Av n → u ∈H. Since A is closed, it follows that \(v \in \mathop {\mathrm {dom}} A\) and Av = u, from which we see that (A + iI)v n → u + iv = (A + iI)v.
It is also follows from (2.2), with A replaced by − A, that A −iI is injective. As
so
□
Definition 2.42
Let A be an unbounded operator in V . Its spectrum is the set
and its resolvent is the map
In other words, \(\lambda \in \mathbb {C}\) is not in the spectrum of A if and only if there exists a bounded operator B ∈ B(V ) such that \(B ( \lambda I - A ) = I_{ \mathop {\mathrm {dom}} A}\) and (λI − A)B = I V; in particular, the operator λI − A is a bijection from \( \mathop {\mathrm {dom}} A\) onto V .
Remark 2.43
If the operator T : V → V is bounded, then its spectrum σ(T) is contained in the closed disc \(\{ \lambda \in \mathbb {C} : | \lambda | \leqslant \| T \| \}\) [22, Lemma 1.2.4].
Exercise 2.44
Let A be an unbounded operator in V and suppose \(\lambda \in \mathbb {C}\) is such that λI − A is a bijection from \( \mathop {\mathrm {dom}} A\) onto V . Prove that (λI − A)−1 is bounded if and only if A is closed. [Thus algebraic invertibility of λI − A is equivalent to its topological invertibility if and only if A is closed.]
The following theorem shows that the resolvent of a semigroup generator may be thought of as the Laplace transform of the semigroup.
Theorem 2.45
Let A be the generator of a one-parameter semigroup T of type (M, a) on V . Then \(\sigma ( A ) \subseteq \{ \lambda \in \mathbb {C} : \mathop {\mathrm {Re}} \lambda \leqslant a \}\) . Furthermore, if \( \mathop {\mathrm {Re}} \lambda > a\) , then
and \(\| ( \lambda I - A )^{-1} \| \leqslant M ( \mathop {\mathrm {Re}} \lambda - a )^{-1}\).
Proof
Fix \(\lambda \in \mathbb {C}\) with \( \mathop {\mathrm {Re}} \lambda > a\) and note first that
is a bounded linear operator, with \(\| R \| \leqslant M ( \mathop {\mathrm {Re}} \lambda - a )^{-1}\).
If v ∈ V and u = Rv, then
and therefore, if t > 0,
Thus \(u \in \mathop {\mathrm {dom}} A\) and (λI − A)u = v. It follows that \( \mathop {\mathrm {ran}} R \subseteq \mathop {\mathrm {dom}} A\) and (λI − A)R = I V.
However, since (T t − I)R = R(T t − I) and R is bounded, the same working shows that
Thus \(R ( \lambda I - A ) = I_{ \mathop {\mathrm {dom}} A}\) and R = (λI − A)−1, as claimed. □
The Laplace-transform formula of Theorem 2.45 allows one to recover a semigroup from its resolvent.
Theorem 2.46
Let A be the generator of a one-parameter semigroup T of type (M, a) on V , and let \(\lambda \in \mathbb {C}\) with \( \mathop {\mathrm {Re}} \lambda > a\) . Then
and
Proof
The first claim follows by induction, with Theorem 2.45 giving the case n = 1.
As noted by Hille and Phillips [16, Theorem 11.6.6], the second follows from the Post–Widder inversion formula for the Laplace transform. For all \(n \in \mathbb {N}\), let
and note that f n is strictly increasing on [0, 1] and strictly decreasing on [1, ∞), and its integral \(\int _0^\infty f_n( t ) \,\mathrm {d} t = 1\); this last fact may be proved by induction. If n is sufficiently large, then a short calculation shows that
The result follows by splitting the integral into three parts. Fix ε ∈ (0, 1) and note first that \(f_n( r ) \leqslant n e^n r^n e^{-n r}\) for all \(r \in \mathbb {R}_+\), with the latter function strictly increasing on [0, 1], so
Similarly, if b := ε∕(1 + ε), then \(f_n( r ) e^{b n r} \leqslant n e^n ( 1 + \varepsilon )^n e^{( b - 1 ) n ( 1 + \varepsilon )} \leqslant n ( 1 + \varepsilon )^n\) for all \(r \geqslant 1 + \varepsilon \), and so
since b(1 + ε) = ε and (1 + ε)e −ε < 1. A standard approximation argument now completes the proof. □
We have now obtained enough necessary conditions on the generator of a strongly continuous semigroup for them to be sufficient as well.
Theorem 2.47 (Feller–Miyadera–Phillips)
A closed, densely defined operator A in V is the generator of a strongly continuous semigroup of type (M, a) if and only if
and
Proof
Let A be the generator of a strongly continuous semigroup T of type (M, a). The spectral condition is a consequence of Theorem 2.45, and the norm inequality follows from Theorem 2.46.
For the converse, let the operator A be closed, densely defined, such that (2.4) holds and having spectrum not containing (a, ∞). Setting A λ := λA(λI − A)−1, note that {A λ : λ ∈ (a, ∞)} is a commuting family of bounded operators such that A λ v → Av as λ →∞, for all \(v \in \mathop {\mathrm {dom}} A\); see Exercise 2.48 for more details.
With \(T^\lambda _t := \exp ( t A_\lambda )\), the inequalities (2.4) imply \(\| T^\lambda _t \| \leqslant M \exp \bigl ( a \lambda t / ( \lambda - a ) \bigr )\) for all λ > a and \(t \in \mathbb {R}_+\), so \(\limsup _{\lambda \to \infty } \| T_t^\lambda \| \leqslant M e^{a t}\). Since
if λ, \(\mu > 2 a_+ = 2 \max \{ a, 0 \}\) and \(v \in \mathop {\mathrm {dom}} A\) then
locally uniformly in t. An approximation argument shows that \(T_t u = \lim _{\lambda \to \infty } T^\lambda _t u\) exists for all \(t \in \mathbb {R}_+\) and u ∈ V , and that \(T = ( T_t )_{t \in \mathbb {R}_+}\) is a strongly continuous one-parameter semigroup of type (M, a).
To see that the generator of T is A, note that the previous working and Lemma 2.34 imply that
dividing by t and letting t → 0 shows that the generator B of T is an extension of A. Note that (a, ∞) is not in the spectrum of B, by Theorem 2.45; it is a simple exercise to show that (λI − A)−1 = (λI − B)−1 for λ > a, and since the ranges of these operators are the domain of A and B, the result follows. □
Exercise 2.48
Let A be an unbounded operator in V , with spectrum not containing (a, ∞) and such that \(\| ( \lambda I - A )^{-1} \| \leqslant M ( \lambda - a )^{-1}\) for all λ > a, where M and a are constants. Prove that
commutes with A μ for all λ, μ > a. Prove also that
by showing this first for the case \(u \in \mathop {\mathrm {dom}} A\). Deduce that A λ v → Av when \(v \in \mathop {\mathrm {dom}} A\).
For contraction semigroups, we have the following refinement of Theorem 2.47.
Theorem 2.49 (Hille–Yosida)
Let A be a closed, densely defined linear operator in the Banach space V . The following are equivalent.
-
(i)
A is the generator of a strongly continuous contraction semigroup.
-
(ii)
\(\sigma ( A ) \subseteq \{ \lambda \in \mathbb {C} : \mathop {\mathrm {Re}} \lambda \leqslant 0 \}\) and
$$\displaystyle \begin{aligned} \| ( \lambda I - A )^{-1} \| \leqslant ( \mathop{\mathrm{Re}} \lambda )^{-1} \qquad \mathit{\text{whenever }} \mathop{\mathrm{Re}} \lambda > 0. \end{aligned}$$ -
(iii)
σ(A) ∩ (0, ∞) is empty and
$$\displaystyle \begin{aligned} \| ( \lambda I - A )^{-1} \| \leqslant \lambda^{-1} \qquad \mathit{\text{whenever }} \lambda > 0. \end{aligned}$$
Proof
Note that (i) implies (ii), by Theorem 2.45, and (ii) implies (iii) trivially. That (iii) implies (i) follows from the extension of Theorem 2.47 noted in its proof. □
In practice, verifying the norm conditions in Theorems 2.47 and 2.49 may prove to be challenging. The next section introduces the concept of operator dissipativity, which is often more tractable.
2.4 The Lumer–Phillips Theorem
Throughout this subsection, V denotes a Banach space and V ∗ its topological dual.
Definition 2.50
For all v ∈ V , let
be the set of normalised tangent functionals to v. The Hahn–Banach theorem [25, Theorem III.6] implies that TF(v) is non-empty for all v ∈ V .
Exercise 2.51
Prove that if H is a Hilbert space then TF(v) = {〈v|} for all v ∈ H, where the Dirac functional 〈v| is such that 〈v|u := 〈v, u〉 for all u ∈ H. [Recall the Riesz–Fréchet theorem from Example 2.15.]
Exercise 2.52
Prove that if f ∈ V = C(K) and x 0 ∈ K is such that |f(x 0)| = ∥f∥ then setting \(\phi ( g ) := \overline {f( x_0 )} g( x_0 )\) for all g ∈ V defines a normalised tangent functional for f. Deduce that TF(f) may contain more than one element.
Definition 2.53
An unbounded operator A in V is dissipative if and only if there exists ϕ ∈TF(v) such that \( \mathop {\mathrm {Re}} \phi ( A v ) \leqslant 0\), for all \(v \in \mathop {\mathrm {dom}} A\). [Note that it suffices to check this condition for unit vectors only.]
Exercise 2.54
Prove that an operator A in the Hilbert space H is dissipative if and only if \(\| ( I + A ) v \| \leqslant \| ( I - A ) v \|\) for all \(v \in \mathop {\mathrm {dom}} A\).
Exercise 2.55
Suppose T is a contraction semigroup with generator A. Prove that A is dissipative.
Proof
If \(v \in \mathop {\mathrm {dom}} A\) and ϕ ∈TF(v), then
so A is dissipative. □
We now seek to find a converse to the result of the preceding exercise.
Lemma 2.56
The unbounded operator A in V is dissipative if and only if
If A is dissipative and λI − A is surjective for some λ > 0, then λ∉σ(A) and \(\| ( \lambda I - A )^{-1} \| \leqslant \lambda ^{-1}\).
Proof
Suppose first that (2.5) holds, let \(v \in \mathop {\mathrm {dom}} A\) be a unit vector and, for all λ > 0, choose \(\phi _\lambda \in \mathit {TF}\bigl ( ( \lambda I - A ) v \bigr )\). Then ϕ λ ≠ 0, so ψ λ = ∥ϕ λ∥−1 ϕ λ is well defined, and
Since \( \mathop {\mathrm {Re}} \psi _\lambda ( v ) \leqslant 1\) and \({-} \mathop {\mathrm {Re}} \psi _\lambda ( A v ) \leqslant \| A v \|\), it follows that
The Banach–Alaoglu theorem [25, Theorem IV.21] implies that the unit ball of V ∗ is weak* compact, so the net (ψ λ)λ>0 has a weak*-convergent subnet with limit in the unit ball. Hence there exists ψ ∈ V ∗ such that
In particular,
so ψ ∈TF(v) and A is dissipative.
Conversely, if λ > 0, \(v \in \mathop {\mathrm {dom}} A\) and ϕ ∈TF(v) is such that \( \mathop {\mathrm {Re}} \phi ( A v ) \leqslant 0\) then
Thus (2.5) holds, and λI − A is injective.
If λI − A is also surjective, then (2.5) gives that \(\| u \| \geqslant \lambda \| ( \lambda I - A )^{-1} u \|\) for all u ∈ V , whence the final claim. □
Exercise 2.57
Let A be dissipative. Prove that λI − A is surjective for some λ > 0 if and only if λI − A is surjective for all λ > 0. [Hint: for a suitable choice of λ and λ 0, consider the series \(R_\lambda := \sum _{n = 0}^\infty ( \lambda - \lambda _0 )^n ( \lambda _0 I - A )^{-( n + 1 )}\).]
Proof
Suppose that λ 0 > 0 is such that λ 0 I − A is surjective. It follows from Lemma 2.56 that \(\| ( \lambda _0 I - A )^{-1} \| \leqslant \lambda _0^{-1}\). The series
is norm convergent for all λ ∈ (0, 2λ 0); if we can show that R λ = (λI − A)−1, then the result follows.
If C ∈ B(V ) is such that ∥C∥ < 1 then I − C is invertible, with \(( I - C )^{-1} = \sum _{n = 0}^\infty C^n\). Hence if C = (λ 0 − λ)(λ 0 I − A)−1, then
so \( \mathop {\mathrm {ran}} R_\lambda \subseteq \mathop {\mathrm {dom}} ( \lambda _0 I - A ) = \mathop {\mathrm {dom}} ( \lambda I - A )\),
and
□
Theorem 2.58 (Lumer–Phillips)
A closed, densely defined operator A generates a strongly continuous contraction semigroup if and only if A is dissipative and λI − A is surjective for some λ > 0.
Proof
One implication follows from Exercise 2.57, Lemma 2.56 and Theorem 2.49. The other implication follows from Theorem 2.49 and Exercise 2.55. □
Example 2.59
Let V = L 2[0, 1], and let Af := g, where
Thus \(f \in \mathop {\mathrm {dom}} A\) if and only if f(0) = 0 and f is absolutely continuous on [0, 1], with square-integrable derivative, and then Af = f′ almost everywhere. For such f, note that
so − A is a dissipative operator, but A is not.
Let g ∈ V and λ > 0; we wish to find \(f \in \mathop {\mathrm {dom}} A\) such that
We proceed by iterating this relation: given h ∈{f, g}, let h 0 := h and, for all \(n \in \mathbb {Z}_+\), let h n+1 ∈ V be such \(h_{n + 1}( t ) = \int _0^t h_n( s ) \,\mathrm {d} s\) for all t ∈ [0, 1]. Then
for all \(n \in \mathbb {N}\). The series \(\sum _{j = 0}^\infty ( -\lambda )^j g_{j + 1}\) is uniformly convergent on [0, 1], so defines a function \(F \in \mathop {\mathrm {dom}} A\), whereas (−λ)n f n → 0 as n →∞. Thus
so λI + A is surjective. By the Lumer–Phillips theorem, the operator − A generates a contraction semigroup.
Exercise 2.60
Fill in the details at the end of Example 2.59. [Hint: with the notation of the example, show that if h ∈{f, g} then \(| h_n( t ) |{ }^2 \leqslant t^n \| h \|{ }_2^2 / n!\) for all \(n \in \mathbb {N}\).]
Remark 2.61
We can explain informally why the operator A defined in Example 2.59 does not generate a semigroup, and why − A does. Recall that each element of a semigroup leaves the domain of the generator invariant, by Lemma 2.34, and A would generate a left-translation semigroup, which does not preserve the boundary condition f(0) = 0. Moreover, − A generates the right-translation semigroup, and this does preserve the boundary condition.
If we let A 0 be the restriction of A to the domain
so adding a further boundary condition, then both A 0 and − A 0 are dissipative, but neither generates a semigroup. We cannot solve the equation (λI ± A 0)f = g for all g when subject to the constraint that \(f \in \mathop {\mathrm {dom}} A_0\). [Take g ∈ L 2[0, 1] such that g(t) = t for all t ∈ [0, 1], construct F as in Example 2.59 and note that F(1) ≠ 0.]
Example 2.62
Recall the weak derivatives D α and Sobolev spaces \(H^k( \mathbb {R}^d )\) defined in Example 2.9, and let \(2 e_j \in \mathbb {Z}_+^d\) be the multi-index with 2 in the jth coordinate and 0 elsewhere. The Laplacian
is a densely defined operator in \(L^2( \mathbb {R}^d )\) with domain \( \mathop {\mathrm {dom}} \Delta := H^2( \mathbb {R}^d )\). It may be shown that
where
consequently, the Laplacian Δ is dissipative. One way to establish (2.6) is to use the Fourier transform. Fourier-theoretic results can also be used to prove that λI − Δ is surjective for all λ > 0, essentially because the map x↦1∕(λ + |x|2) is bounded on \(\mathbb {R}^d\). Thus the Laplacian generates a contraction semigroup.
Exercise 2.63
Let A be a densely defined operator on the Hilbert space H. Prove that if A is symmetric, so that
then iA is dissipative. Deduce that if H is self-adjoint then iH and −iH are the generators of contraction semigroups.
Prove, further, that if \(T = ( T_t )_{t \in \mathbb {R}_+}\) has generator iH, with H self-adjoint, then T t is unitary, so that \(T_t^* T_t = I = T_t T_t^*\), for all \(t \in \mathbb {R}_+\).
Proof
The first part is an immediate consequence of Theorem 2.58, the Lumer–Phillips theorem, together with Exercise 2.41.
For the next part, fix u, \(v \in \mathop {\mathrm {dom}} H\) and \(t \in \mathbb {R}_+\). If h > 0 then
as h → 0+, since T t u, \(T_t v \in \mathop {\mathrm {dom}} H\) and T is strongly continuous. A real-valued function on \(\mathbb {R}_+\) is constant if it is continuous and its right derivative is identically zero, so this working shows that \(T_t^* T_t = I\).
Now let \(S = ( S_t )_{t \in \mathbb {R}_+}\) be the strongly continuous semigroup with generator −iH. The previous working shows that \(S_t^* S_t = I\) for all \(t \in \mathbb {R}_+\), so it suffices to let t > 0 and prove that \(S_t = T_t^*\). To see this, let u, \(v \in \mathop {\mathrm {dom}} H\) and consider the function
Working as above, it is straightforward to show that F′≡ 0 on (0, t), so F(0) = F(t) and the result follows. □
Exercise 2.64
Suppose U is a strongly continuous one-parameter semigroup on the Hilbert space H, with U t unitary, so that \(U_t^* U_t = I = U_t U_t^*\), for all \(t \in \mathbb {R}_+\). Let A be the generator of U.
Prove that \(U^* = ( U_t^* )_{t \in \mathbb {R}_+}\) is also a strongly continuous one-parameter semigroup, with generator − A. Deduce that H := iA is self-adjoint.
Proof
The semigroup property for U ∗ is immediate, and strong continuity holds because
as t → 0+, for any v ∈H.
Next, denote the generator of U ∗ by B, and let \(v \in \mathop {\mathrm {dom}} A\). Then
so − A ⊆ B. Since (U ∗)∗ = U, applying this argument with U replaced by U ∗ gives the reverse inclusion. Thus U ∗ has generator B = −A, as claimed.
Finally, let H = iA and suppose first that u, \(v \in \mathop {\mathrm {dom}} H = \mathop {\mathrm {dom}} A\). Then
so H ⊆ H ∗. For the reverse inclusion, note that
by Lemma 2.34 applied to U and properties of the adjoint. Thus A ∗⊆−A, the generator of U ∗, and therefore H ∗ = −iA ∗⊆iA = H. □
Remark 2.65
Exercises 2.63 and 2.64 lead to Stone’s theorem, which gives a one-to-one correspondence between self-adjoint operators and strongly continuous one-parameter groups of unitary operators. This result has significant consequences for the mathematical foundations of quantum theory; see [25, Section VIII.4].
3 Classical Markov Semigroups
Throughout this section, the triple \(\mbox{{$( \Omega , \mathcal {F}, \mathbb {P} )$}}\) will denote a probability space, so that \(\mathbb {P} : \mathcal {F} \to [ 0, 1 ]\) is a probability measure on the σ-algebra \(\mathcal {F}\) of subsets of Ω, and E will denote a topological space, with \(\mathcal {E}\) its Borel σ-algebra, generated by the open subsets.
An E-valued random variable is a \(\mathcal {F}\)-\(\mathcal {E}\)-measurable mapping X : Ω → E. If X is an E-valued random variable, then σ(X) is the smallest sub-σ-algebra \(\mathcal {F}_0\) of \(\mathcal {F}\) such that X is \(\mathcal {F}_0\)-\(\mathcal {E}\) measurable. More generally, if (X i)i ∈ I is an indexed set of E-valued random variables, then σ(X i : i ∈ I) is the smallest sub-σ-algebra \(\mathcal {F}_0\) of \(\mathcal {F}\) such that X i is \(\mathcal {F}_0\)-\(\mathcal {E}\) measurable for all i ∈ I.
3.1 Markov Processes
Definition 3.1
Given a real-valued random variable X which is integrable, so that
and a sub-σ-algebra \(\mathcal {F}_0\) of \(\mathcal {F}\), the conditional expectation \(\mathbb {E}[ X | \mathcal {F}_0 ]\) is a real-valued random variable Y which is \(\mathcal {F}_0\)-\(\mathcal {E}\) measurable and such that
The choice of Y is determined almost surely : if Y and Z are both versions of the conditional expectation \(\mathbb {E}[ X | \mathcal {F}_0 ]\), then \(\mathbb {P}( Y \neq Z ) = 0\). The existence of Y is guaranteed by the Radon–Nikodým theorem.
The fact that \(\mathbb {E}[ X | \mathcal {F}_0 ]\) is determined almost surely can be recast as saying that \(\mathbb {E}[ \cdot | \mathcal {F}_0 ]\) is a linear operator from \(L^1\mbox{{$( \Omega , \mathcal {F}, \mathbb {P} )$}}\) to \(L^1( \Omega , \mathcal {F}_0, \mathbb {P}|{ }_{\mathcal {F}_0} )\). In fact, the map \(X \mapsto \mathbb {E}[ X | \mathcal {F}_0 ]\) is a contraction from \(L^p\mbox{{$( \Omega , \mathcal {F}, \mathbb {P} )$}}\) onto \(L^p( \Omega , \mathcal {F}_0, \mathbb {P}|{ }_{\mathcal {F}_0} )\), for all p ∈ [1, ∞].
Remark 3.2
Let \(X \in L^2\mbox{{$( \Omega , \mathcal {F}, \mathbb {P} )$}}\). Informally, we can think of \(Y := \mathbb {E}[ X | \mathcal {F}_0 ]\) as the best guess for X given the information in \(\mathcal {F}_0\). In other words, the conditional expectation Y of X with respect to \(\mathcal {F}_0\) is the essentially unique choice of \(\mathcal {F}_0\)-measurable random variable Z which minimises the least-squares distance ∥Z − X∥2.
Definition 3.3
Given a topological space E, let the Banach space
with vector-space operations defined pointwise and supremum norm
Exercise 3.4
Verify that B b(E) is a Banach space. Show further that the norm ∥⋅∥ is submultiplicative, where multiplication of functions is defined pointwise, so that B b(E) is a Banach algebra . Show also that the Banach algebra B b(E) is unital : the multiplicative unit 1E is such that ∥1E∥ = 1. Show finally that the C ∗ identity holds:
where the isometric involution \(f \mapsto \overline {f}\) is such that \(\overline {f}( x ) := \overline {f( x )}\) for all x ∈ E.
Definition 3.5 (Provisional)
A Markov process with state space E is a collection of E-valued random variables \(X = ( X_t )_{t \in \mathbb {R}_+}\) on a common probability space such that, given any f ∈ B b(E),
for all \(s, t \in \mathbb {R}_+\) such that \(s \leqslant t\).
A Markov process is time homogeneous if, given any f ∈ B b(E),
for all \(s, t \in \mathbb {R}_+\) such that \(s \leqslant t\) and x ∈ E.
Definition 3.5 is well motivated by Remark 3.2, but it is somewhat unsatisfactory; for example, what should be the proper meaning of (3.1)? To improve upon it, we introduce the following notion.
Definition 3.6
A transition kernel on \(( E, \mathcal {E} )\) is a map \(p : E \times \mathcal {E} \to [ 0, 1 ]\) such that
-
(i)
the map x↦p(x, A) is Borel measurable for all \(A \in \mathcal {E}\) and
-
(ii)
the map A↦p(x, A) is a probability measure for all x ∈ E.
We interpret p(x, A) as the probability that the transition ends in A, given that it started at x.
Exercise 3.7
If p and q are transition kernels on \(( E, \mathcal {E} )\), then the convolution p∗ q is defined by setting
Prove that p∗ q is a transition kernel. Prove also that convolution is associative: if p, q and r are transition kernels then (p∗ q)∗ r = p∗ (q∗ r).
Definition 3.8
A triangular collection \(\{ p_{s, t} : s, t \in \mathbb {R}_+, \ s \leqslant t \}\) of transition kernels is consistent if p s,t∗ p t,u = p s,u for all s, t, \(u \in \mathbb {R}_+\) with \(s \leqslant t \leqslant u\); that is,
Equation (3.2) is the Chapman–Kolmogorov equation . We interpret p s,t(x, A) as the probability of moving from x at time s to somewhere in A at time t.
Similarly, a one-parameter collection \(\{ p_t : t \in \mathbb {R}_+ \}\) of transition kernels is consistent if p s ⋆ p t = p s+t for all s, \(t \in \mathbb {R}_+\). In this case, the Chapman–Kolmogorov equation becomes
We interpret p t(x, A) as the probability of moving from x into A in t units of time.
Definition 3.9
A family of E-valued random variables \(X = ( X_t )_{t \in \mathbb {R}_+}\) on a common probability space is a Markov process if there exists a consistent triangular collection of transition kernels such that
for all \(A \in \mathcal {E}\) and s, \(t \in \mathbb {R}_+\) such that \(s \leqslant t\).
The family X is a time-homogeneous Markov process if there exists a consistent one-parameter collection of transition kernels such that
for all \(A \in \mathcal {E}\) and s, \(t \in \mathbb {R}_+\) such that \(s \leqslant t\).
The connection between time-homogeneous Markov processes and semigroups is provided by the following definition and theorem.
Definition 3.10
A Markov semigroup is a contraction semigroup T on B b(E) such that, for all \(t \in \mathbb {R}_+\), the bounded linear operator T t is positive : whenever f ∈ B b(E) is such that \(f \geqslant 0\), that is, \(f( x ) \in \mathbb {R}_+\) for all x ∈ E, then \(T_t f \geqslant 0\). [Note that we impose no condition with respect to continuity at the origin.]
If T t preserves the unit, that is, T t1E = 1E for all \(t \in \mathbb {R}_+\), then the Markov semigroup T is conservative .
Remark 3.11
Positive linear maps preserve order: if T is such a map and \(f \leqslant g\), in the sense that \(f( x ) \leqslant g( x )\) for all x ∈ E, then \(T f \leqslant T g\). The image of a real-valued function h under a positive linear map is real valued, since if h takes real values, then h = h + − h −, where \(h^+ : x \mapsto \max \{ h( x ), 0 \}\) and \(h^- := x \mapsto \max \{ -h( x ), 0 \}\). Consequently, positive linear maps also commute with the conjugation, in the sense that \(T \overline {f} = \overline {T f}\).
Exercise 3.12
Suppose the mapping T : B b(E) → B b(E) is linear and positive. Show that \(| T f |{ }^2 \leqslant T | f |{ }^2 \, T 1_E\) for all f ∈ B b(E), and deduce that T is bounded, with norm \(\| T \| \leqslant \| T 1_E \|\).
Proof
If f ∈ B b(E), x ∈ E and \(\lambda \in \mathbb {R}\), then
Inspecting the discriminant of this polynomial in λ gives the first claim, and the second follows because.
□
Theorem 3.13
Let \(p = \{ p_t : t \in \mathbb {R}_+ \}\) be a family of transition kernels. Setting
defines a bounded linear operator on B b(E) which is positive, contractive and unit preserving. Furthermore, the family \(T = ( T_t )_{t \in \mathbb {R}_+}\) is a Markov semigroup if and only if p is consistent.
Proof
If f ∈ B b(E), x ∈ E and s, \(t \in \mathbb {R}_+\), then the Chapman–Kolmogorov equation (3.3) implies that
Verifying the remaining claims is left as an exercise. □
If we have more structure on the semigroup T, then it is possible to provide a converse to Theorem 3.13. This will be sketched in the following section.
3.2 Feller Semigroups
Definition 3.14
Let the topological space E be locally compact. Then
is a Banach space when equipped with pointwise vector-space operations and the supremum norm. [A function \(f : E \to \mathbb {C}\) vanishes at infinity if, for all ε > 0, there exists a compact set K ⊆ E such that |f(x)| < ε for all x ∈ E ∖ K.]
Exercise 3.15
Prove that C 0(E) lies inside B b(E) and is indeed a Banach space. Prove that the multiplicative unit 1E is an element of C 0(E) if and only if E is compact.
Definition 3.16
A Markov semigroup T is Feller if the following conditions hold:
-
(i)
\(T_t\bigl ( C_0( E ) \bigr ) \subseteq C_0( E )\) for all \(t \in \mathbb {R}_+\) and
-
(ii)
limt→0+∥T t f − f∥ = 0 for all f ∈ C 0(E).
Remark 3.17
If a time-homogeneous Markov process X has Feller semigroup T, then
so the generator A describes the change in X over an infinitesimal time interval.
Definition 3.18
An \(\mathbb {R}^d\)-valued stochastic process \(X = ( X_t )_{t \in \mathbb {R}_+}\) is a Lévy process if and only if X
-
(i)
has independent increments, so that X t − X s is independent of the past σ-algebra \(\sigma ( X_r : 0 \leqslant r \leqslant s )\) for all s, \(t \in \mathbb {R}_+\) with \(s \leqslant t\),
-
(ii)
has stationary increments, so that X t − X s has the same distribution as X t−s − X 0, for all s, \(t \in \mathbb {R}_+\) with \(s \leqslant t\) and
-
(iii)
is continuous in probability at the origin, so \(\displaystyle \lim _{t \to {0+}} \mathbb {P}\bigl ( | X_t - X_0 | \geqslant \varepsilon \bigr ) = 0\) for all ε > 0.
Remark 3.19
Lévy processes are well behaved; they have cádlág modifications, and such a modification is a semimartingale, for example.
Exercise 3.20
Prove that if X is a stochastic process with independent and stationary increments, and with cádlág paths, then X is continuous at the origin in probability.
Theorem 3.21
Every Lévy process gives rise to a conservative Feller semigroup.
Proof (Sketch Proof)
For all \(t \in \mathbb {R}_+\), define a transition kernel p t by setting
If \(s \in \mathbb {R}_+\), then
where \(\mathcal {F}_s := \sigma ( X_r : 0 \leqslant r \leqslant s )\); the first equality holds by stationarity and the second by independence. In particular,
so X is a Markov process with transition kernels \(\{ p_t : t \in \mathbb {R}_+ \}\) if these are consistent. For consistency, we use Theorem 3.13; let T be defined as there and note that
From the previous working, it follows that
and replacing x with the \(\mathcal {F}_s\)-measurable random variable X s − X 0 + x gives that
as required. Equation (3.5) also shows that T is conservative.
If \(f \in C_0( \mathbb {R}^d )\), then \(x \mapsto f( X_t - X_0 + x ) \in C_0( \mathbb {R}^d )\) almost surely, and therefore the Dominated Convergence Theorem gives that \(T_t f \in C_0( \mathbb {R}^d )\).
For continuity, let ε > 0 and note that \(f \in C_0( \mathbb {R}^d )\) is uniformly continuous, so there exists δ > 0 such that |f(x) − f(y)| < ε whenever |x − y| < δ. Hence
□
Theorem 3.22
Let T be a conservative Feller semigroup. If the state space E is metrisable, then there exists a time-homogeneous Markov process which gives rise to T.
Proof (Sketch Proof)
For all t ∈ (0, ∞), let
Then p t is readily verified to be a transition kernel.
Let μ be a probability measure on E. If \(t_n \geqslant \dots \geqslant t_1 \geqslant 0\) and A 1, …\(A_n \in \mathcal {E}\), then
By the Chapman–Kolmogorov equation (3.3), these finite-dimensional distributions form a projective family. The Daniell–Kolmogorov extension theorem now yields a probability measure on the product space
such the coordinate projections X t : Ω → E; ω↦ω t form a time-homogeneous Markov process X with associated semigroup T. □
Example 3.23 (Uniform Motion)
If \(E = \mathbb {R}\) and X t = X 0 + t for all \(t \in \mathbb {R}_+\), then
where the transition kernel p t : (x, A)↦δ x+t(A). It follows that X gives rise to a Feller semigroup with generator A such that Af = f′ whenever \(f \in \mathop {\mathrm {dom}} A\).
Example 3.24 (Brownian Motion)
If \(E = \mathbb {R}\) and X is a standard Brownian motion, then Itô’s formula gives that
It follows that the Lévy process X has a Feller semigroup with the generator A such that \(A f = \frac {1}{2} f''\) for all \(f \in C^2( \mathbb {R} ) \cap \mathop {\mathrm {dom}} A\). [Informally,
as t → 0+.]
Example 3.25 (Poisson Process)
If \(E = \mathbb {R}\) and X is a homogeneous Poisson process with unit intensity and unit jumps, then
Hence the Lévy process X has a Feller semigroup with the bounded generator A such that (Af)(x) = f(x + 1) − f(x) for all \(x \in \mathbb {R}\) and \(f \in C_0( \mathbb {R} )\). [To see this, note that
uniformly for all \(x \in \mathbb {R}\).]
The following exercise and theorem show that it is possible to move from the non-conservative to the conservative setting, and from a locally compact state space to a compact one.
Exercise 3.26
Let \(\mathcal {T}\) be a locally compact topology on E and let ∞ denote a point not in E. Prove that \(\widehat {E} := E \cup \{ \infty \}\) is compact when equipped with the topology
and that \(\widehat {\mathcal {T}}\) is Hausdorff if and only if \(\mathcal {T}\) is. [This is the Alexandrov one-point compactification .] Prove further that C 0(E) has co-dimension one in \(C( \widehat {E} )\).
Theorem 3.27
Let T be a Feller semigroup with locally compact state space E. If
then \(\widehat {T} = \bigl ( \widehat {T}_t \bigr )_{t \in \mathbb {R}_+}\) is a conservative Feller semigroup with compact state space \(\widehat {E}\).
Proof
Fix \(t \in \mathbb {R}_+\). The hardest step is to prove that \(\widehat {T}_t\) is positive, that is, if \(\lambda \in \mathbb {R}_+\) and g ∈ B b(E) are such that \(\lambda + g( x ) \geqslant 0\) for all x ∈ E, then \(\lambda + ( T_t g )( x ) \geqslant 0\) for all x ∈ E. Note that g is real valued, and T t maps real-valued functions to real-valued functions, by positivity. Let the function \(g^- := x \mapsto \max \{ -g( x ), 0 \}\) and note that \(\lambda \geqslant g^-( x )\) for all x ∈ E. Hence
and \(( T_t g )( x ) \geqslant ( -T_t g^- )( x ) \geqslant {-\lambda }\), as required.
It is immediate that \(\widehat {T}_t\) preserves the unit, so \(\widehat {T}_t\) is contractive, by Exercise 3.12. The remaining claims are straightforward to verify. □
3.3 The Hille–Yosida–Ray Theorem
As noted above, it can be difficult to show that the hypotheses of the Hille–Yosida theorem, Theorem 2.49, hold. The Lumer–Phillips theorem gives an alternative for contraction semigroups, via the notion of dissipativity. Here, we will show that the additional structure available for Feller semigroups gives another possible approach.
Throughout this subsection, E denotes a locally compact Hausdorff space. Here, a Feller semigroup on C 0(E) means a strongly continuous contraction semigroup on C 0(E) composed of positive operators. This is the restriction to C 0(E) of the Feller semigroups considered above.
Let
denote the real subspace of C 0(E) containing those functions which take only real values.
Definition 3.28
A linear operator A in C 0(E) is real if and only if
-
(i)
\(\overline {f} \in \mathop {\mathrm {dom}} A\) whenever \(f \in \mathop {\mathrm {dom}} A\), so that the domain of A is closed under conjugation, and
-
(ii)
\(\overline {A f} = A \overline {f}\) for all \(f \in \mathop {\mathrm {dom}} A\), so that A commutes with the conjugation.
Exercise 3.29
Show that (i) and (ii) are equivalent to
-
(i)
\(f + \mathrm {i} g \in \mathop {\mathrm {dom}} A\) implies f, \(g \in \mathop {\mathrm {dom}} A\) whenever f, \(g \in C_0( E; \mathbb {R} )\), and
-
(ii)
\(A\bigl ( \mathop {\mathrm {dom}} A \cap C_0( E; \mathbb {R} ) \bigr ) \subseteq C_0( E; \mathbb {R} )\),
respectively.
Exercise 3.30
Prove that T is real whenever T is positive.
Prove further that if \(T = ( T_t )_{t \in \mathbb {R}_+}\) is a Feller semigroup on C 0(E) and T t is real for all \(t \in \mathbb {R}_+\) then the generator A of T is real.
Proof
The first claim is an immediate consequence of Remark 3.11.
For the second, suppose A is the generator of the Feller semigroup T on C 0(E), with each T t real, and let \(f \in \mathop {\mathrm {dom}} A\). Then, since conjugation is isometric, if t > 0, then
and so \(\overline {f} \in \mathop {\mathrm {dom}} A\), with \(A \overline {f} = \overline {A f}\). The result follows. □
Definition 3.31
A linear operator A in C 0(E) satisfies the positive maximum principle if, whenever \(f \in \mathop {\mathrm {dom}} A \cap C_0( E; \mathbb {R} )\) and x 0 ∈ E are such that f(x 0) = ∥f∥, it holds that \(( A f )( x_0 ) \leqslant 0\).
Theorem 3.32 (Hille–Yosida–Ray)
A closed, densely defined operator A in C 0(E) is the generator of a Feller semigroup on C 0(E) if and only if A is real and satisfies the positive maximum principle, and λI − A is surjective for some λ > 0
Proof
Suppose first that A generates a Feller semigroup on C 0(E). By the Lumer–Phillips theorem, Theorem 2.58, and Exercise 3.30, it suffices to prove that A satisfies the positive maximum principle. For this, let \(f \in \mathop {\mathrm {dom}} A \cap C_0( E; \mathbb {R} )\) and x 0 ∈ E be such that f(x 0) = ∥f∥. Setting \(f^+ := x \mapsto \max \{ f( x ), 0\}\), we see that
Thus
Conversely, suppose A is real and satisfies the positive maximum principle. Given any \(f \in \mathop {\mathrm {dom}} A\), there exist x 0 ∈ E and \(\theta \in \mathbb {R}\) such that e iθ f(x 0) = ∥f∥. The real-valued function \(g := \mathop {\mathrm {Re}} e^{\mathrm {i} \theta } f \in \mathop {\mathrm {dom}} A\), since A is real, and \(\| f \| = g( x_0 ) \leqslant \| g \| \leqslant \| f \|\), so \( \mathop {\mathrm {Re}} ( A e^{\mathrm {i} \theta } f )( x_0 ) = ( A g )( x_0 ) \leqslant 0\), by the positive maximum principle. If λ > 0, then
so A is dissipative, by Lemma 2.56, and λI − A is injective. In particular, T is a strongly continuous contraction semigroup, by the Lumer–Phillips theorem.
To prove that each T t is positive, let λ > 0 be such that λI − A is surjective, so invertible, let f ∈ C 0(E) be non-negative, and consider g = (λI − A)−1 f ∈ C 0(E). Either g does not attain its infimum, in which case \(g \geqslant 0\) because g vanishes at infinity, or there exists x 0 ∈ E such that \(g( x_0 ) = \inf \{ g( x ) : x \in E \}\). Then
so \(\lambda g( x_0 ) - f( x_0 ) = ( A g )( x_0 ) \geqslant 0\), by the positive maximum principle applied to − g. Thus if x ∈ E, then
which shows that λ(λI − A)−1 is positive and therefore so is (λI − A)−1. Finally, Theorem 2.46 gives that
so each T t is positive also. □
Exercise 3.33
Prove that if the operator A is real then its resolvent (λI − A)−1 is real for all \(\lambda \in \mathbb {R} \setminus \sigma ( A )\). Deduce with the help of Theorem 2.46 that the Feller semigroup T is real if its generator A is.
Proof
Suppose A is real and \(\lambda \in \mathbb {R} \setminus \sigma ( A )\). If f ∈ C 0(E), then f = (λI − A)g for some g ∈ C 0(E), and
Hence
as required. Since conjugation is isometric, the deduction is immediate. □
Example 3.34
Let the linear operator A be defined by setting
It is a familiar result from elementary calculus that A satisfies the positive maximum principle
Remark 3.35
Courrège has classified the linear operators in \(C_0( \mathbb {R}^d )\) with domains containing \(C^\infty _c( \mathbb {R}^d )\) which satisfy the positive maximum principle. See [3, §3.5.1] and references therein.
4 Quantum Feller Semigroups
To move beyond the classical, we need to replace the commutative domain C 0(E) with the correct non-commutative generalisation. This is what we introduce in the following section.
4.1 C ∗ Algebras
Definition 4.1
A Banach algebra is a complex Banach space and simultaneously a complex associative algebra: it has an associative multiplication compatible with the vector-space operators and the norm, which is submultiplicative. If the Banach algebra is unital , so that it has a multiplicative identity 1, called its unit , then we require the norm ∥1∥ to be 1.
An involution on a Banach algebra is an isometric conjugate-linear map which reverses products and is self-inverse.
A Banach algebra with involution A is a C ∗ algebra if and only if the C ∗ identity holds:
Remark 4.2
The C ∗ identity connects the algebraic and analytic structures in a very rigid way. For example, there exists at most one norm for which an associative algebra is a C ∗ algebra, and ∗-homomorphisms between C ∗ algebras are automatically contractive [30, Proposition I.5.2].
Theorem 4.3 (Gelfand)
Every commutative C ∗ algebra is isometrically isomorphic to C 0(E), where E is a locally compact Hausdorff space. The algebra is unital if and only if E is compact, in which case C 0(E) = C(E).
Theorem 4.4 (Gelfand–Naimark)
Any C ∗ algebra is isometrically ∗-isomorphic to a norm-closed ∗-subalgebra of B(H) for some Hilbert space H , a so-called concrete C ∗ algebra .
Remark 4.5
Let A be a C ∗ algebra. Given any \(n \in \mathbb {N}\), let M n(A) be the complex algebra of n × n matrices with entries in A, equipped with the usual algebraic operations. By the Gelfand–Naimark theorem, we may assume that A ⊆ B(H) for some Hilbert space H, and so M n(A) ⊆ B(H n), where matrices of operators act in the usual manner on column vectors with entries in H. We equip M n(A) with the restriction of the operator norm on B(H n), and then M n(A) becomes a C ∗ algebra.
Remark 4.5 is the root of the theory of operator spaces [10, 24].
Definition 4.6
A unital concrete C ∗ algebra A ⊆ B(H) is a von Neumann algebra if and only if any of the following equivalent conditions hold.
-
(i)
Closure in the strong operator topology: if the net (a i) ⊆A and a ∈ B(H) are such that a i v → av for all v ∈H, then a ∈A.
-
(ii)
Closure in the weak operator topology: if the net (a i) ⊆A and a ∈ B(H) are such that 〈v, a i v〉→〈v, av〉 for all v ∈H, then a ∈A.
-
(iii)
Equality with its bicommutant: letting
$$\displaystyle \begin{aligned} S' := \{ a \in \mathsf{A} : a b = b a \text{ for all } b \in S \} \end{aligned}$$denote the commutant of S ⊆A, then A″ := (A ′)′ = A [von Neumann].
-
(iv)
Existence of a predual: there exists a Banach space A ∗ with (A ∗)∗ = A [Sakai].
Sakai’s characterisation (iv) prompts consideration of the predual of B(H). The predual A ∗ is naturally a subspace of A ∗, and a bounded linear functional ϕ on B(H) is an element of B(H)∗ if and only it is σ-weakly continuous : there exist square-summable sequences \(( u_n )_{n = 1}^\infty \) and \(( v_n )_{n = 1}^\infty \subseteq \mathsf {H}\) such that
This yields a fifth characterisation of von Neumann algebras.
-
(v)
Closure in the σ-weak topology: if the net (a i) ⊆A and a ∈ B(H) are such that ϕ(a i) → ϕ(a) for all ϕ ∈ B(H)∗, then a ∈A.
The predual A ∗ consists of all those bounded linear functionals on A which are continuous in the σ-weak topology; equivalently, they are the restriction to A of elements of B(H)∗ as described in (4.1).
Example 4.7
Recall from Example 2.15 that \(L^\infty \mbox{{$( \Omega , \mathcal {F}, \mu )$}} \cong \bigl ( L^1\mbox{{$( \Omega , \mathcal {F}, \mu )$}} \bigr )^*\), and so every L ∞ space is a commutative von Neumann algebra. Furthermore, every commutative von Neumann algebra is isometrically ∗-isomorphic to \(L^\infty \mbox{{$( \Omega , \mathcal {F}, \mu )$}}\) for some locally compact Hausdorff space Ω and positive Radon measure μ; see [30, Theorem III.1.18].
4.2 Positivity
Definition 4.8
In a C ∗ algebra A we have the notion of positivity : we write \(a \geqslant 0\) if and only if there exists b ∈A such that a = b ∗ b. The set of positive elements in A is denoted by A +, is closed in the norm topology and is a cone : it is closed under addition and multiplication by non-negative scalars. Note that a positive element is self-adjoint.
This notion of positivity agrees with that encountered previously.
Lemma 4.9
Let T ∈ B(H) be such that \(\langle v, T v \rangle \geqslant 0\) for all v ∈H . There exists a unique operator S ∈ B(H) such that \(\langle v, S v \rangle \geqslant 0\) for all v ∈H , and S 2 = T. Furthermore, S is the limit of a sequence of polynomials in T with no constant term.
Proof
This may be established with the assistance of the Maclaurin series for the function z↦(1 − z)1∕2. See [25, Theorem VI.9] for the details. □
Corollary 4.10
If a ∈A + , then there exists a unique element a 1∕2 ∈A + , the square root of a, such that (a 1∕2)2 = a. The square root a 1∕2 lies in the closed linear subspace of A spanned by the set of monomials \(\{ a^n : n \in \mathbb {N} \}\).
Proof
This is a straightforward exercise. □
Exercise 4.11
Prove that f ∈ C 0(E)+ if and only if \(f( x ) \geqslant 0\) for all x ∈ E. Prove also that if the C ∗ algebra A ⊆ B(H), where H is a Hilbert space, then a ∈A + if and only if \(\langle v, a v \rangle \geqslant 0\) for all v ∈H. [The existence of square roots is crucial for both parts.]
Proposition 4.12
Let A by a C ∗ algebra. Then any element a ∈A may be written in the form (a 1 − a 2) + i(a 3 − a 4), where a 1 , …, a 4 ∈A +.
Proof
The self-adjoint elements \( \mathop {\mathrm {Re}} a := ( a + a^* ) / 2\) and \( \mathop {\mathrm {Im}} a := ( a - a^* ) / ( 2 \mathrm {i} )\) are such that \(a = \mathop {\mathrm {Re}} a + \mathrm {i} \mathop {\mathrm {Im}} a\). Thus it suffices to show that any self-adjoint element of A is the difference of two positive elements.
Let a ∈A be self-adjoint and let A 0 be the closed linear subspace of A spanned by the set of monomials \(\{ a^n : n \in \mathbb {N} \}\). As A 0 is a commutative C ∗ algebra, Theorem 4.3 gives an isometric ∗-isomorphism j : A 0 → C 0(E), where E is a locally compact Hausdorff space. Then f := j(a) is real valued, so
are well-defined elements of C 0(E)+ such that f = f + − f −. Hence a = a + − a −, where a + := j −1(f +) and a − := j −1(f −) are positive, as desired. □
Remark 4.13
The proof of Proposition 4.12 shows that if a ∈A is self-adjoint, then there exist a +, a −∈A + such that a = a + − a − and a + a − = 0.
Definition 4.14
The positive cone provides a partial order on the set of self-adjoint elements of A. Given elements a, b ∈A, we write \(a \leqslant b\) if and only if a = a ∗, b = b ∗ and b − a ∈A +.
This order respects the norm.
Proposition 4.15
Let a, b ∈A + be such that \(a \leqslant b\) . Then \(\| a \| \leqslant \| b \|\).
Proof
Suppose without loss of generality that A ⊆ B(H). Then \(a \leqslant b \leqslant \| b \| I\), by transitivity, Exercise 4.11 and the Cauchy–Schwarz inequality. If A 0 denotes the unital commutative C ∗ algebra generated by the set of monomials \(\{ a^n : n \in \mathbb {Z}_+ \}\), then Theorem 4.3 gives an isometric ∗-isomorphism j : A 0 → C(E), where E is a compact Hausdorff space. Hence
so \(0 \leqslant j( a )( x ) \leqslant \| b \|\) for all such x and \(\| a \| = \| j( a ) \|{ }_\infty \leqslant \| b \|\), as claimed. □
Exercise 4.16
Prove that if a ∈A + and \(n \in \mathbb {Z}_+\), then ∥a n∥ = ∥a∥n. [Hint: work as in the proof of Proposition 4.15.]
Definition 4.17
A linear map Φ : A →B between C ∗ algebras is positive if and only if Φ(A +) ⊆B +.
Note that any algebra ∗-homomorphism is positive; this fact has been utilised in the proof of Proposition 4.15.
Corollary 4.18
Let Φ : A →B be a positive linear map between C ∗ algebras. Then
-
(i)
the map Φ commutes with the involution, so that Φ(a ∗) = Φ(a)∗ for all a ∈A , and
-
(ii)
the map Φ is bounded.
Proof
Part (i) is an exercise.
For (ii), it suffices to prove that Φ is bounded on A +; suppose otherwise for contradiction. For all \(n \in \mathbb {N}\), let a n ∈A n be such that ∥a n∥ = 1 and ∥ Φ(a n)∥ > 3n. If \(a := \sum _{n \geqslant 1} 2^{-n} a_n \in \mathsf {A}_+\), then \(a \geqslant 2^{-n} a_n\) for all \(n \in \mathbb {N}\). Hence \(\Phi ( a ) \geqslant 2^{-n} \phi ( a_n )\) and \(\| \phi ( a ) \| \geqslant 2^{-n} \| \Phi ( a_n ) \| > ( 3 / 2 )^n\), by Proposition 4.15, which is a contradiction for sufficiently large n. □
We will now begin to investigate the generators of positive semigroups, following in the footsteps of Evans and Hanche-Olsen [12].
Theorem 4.19
Let \(T = ( T_t )_{t \in \mathbb {R}_+}\) be a uniformly continuous one-parameter semigroup on the C ∗ algebra A . If T t is positive for all \(t \in \mathbb {R}_+\) , then the semigroup generator \(\mathcal {L}\) is bounded and ∗-preserving.
Proof
The boundedness of \(\mathcal {L}\) follows immediately from Theorem 2.23, and if a ∈A, then
by continuity of the involution and the fact that positive maps are ∗-preserving. □
The following result is a variation on [12, Theorem 2]. The proof exploits an idea of Fagnola [14, Proof of Proposition 3.10].
Theorem 4.20
Let \(\mathcal {L}\) be a ∗-preserving bounded linear map on the C ∗ algebra A . The following are equivalent.
-
(i)
If a, b ∈A + are such that ab = 0, then \(a \mathcal {L}( b ) a \geqslant 0\).
-
(ii)
\(( \lambda I - \mathcal {L} )^{-1}\) is positive for all sufficiently large λ > 0.
-
(iii)
\(T_t = \exp ( t \mathcal {L} )\) is positive for all \(t \in \mathbb {R}_+\).
Proof
Suppose (i) holds; we will show that \(( \lambda I - \mathcal {L} )^{-1}\) is positive if \(\lambda > \| \mathcal {L} \|\). It suffices to take a ∈A such that \(( \lambda I - \mathcal {L} )( a )\) is positive, and prove that a ∈A +. Note that a is self- adjoint, so Remark 4.13 gives b and c ∈A + with a = b − c and bc = 0. Thus (ii) holds if c = 0.
The condition bc = 0 implies that b 1∕2 c = 0, so (i) gives that \(c \mathcal {L}( b ) c \geqslant 0\). Hence
and therefore \(0 \leqslant \lambda c^3 \leqslant c \mathcal {L}( c ) c\). It follows that \(\lambda \| c \|{ }^3 = \lambda \| c^3 \| \leqslant \| \mathcal {L} \| \, \| c \|{ }^3\), which holds only when c = 0, as required.
That (ii) and (iii) are equivalent is a consequence of Theorems 2.45 and 2.46. To see that (iii) implies (i), note that if a, b ∈A + are such that ab = 0, then
as t → 0 + . □
In the quantum world, we can go beyond positivity to find a stronger notion, complete positivity, which is of great importance to the theories of open quantum systems and quantum information.
4.3 Complete Positivity
Recall from Remark 4.5 that matrix algebras over C ∗ algebras are also C ∗ algebras.
Definition 4.21
Let \(n \in \mathbb {N}\). A linear map Φ : A →B between C ∗ algebras is n-positive if and only if the ampliation
is positive. If Φ is n-positive for all \(n \in \mathbb {N}\), then Φ is completely positive .
Remark 4.22
Choi [6] produced examples of maps which are n-positive but not n + 1-positive.
Exercise 4.23
Let \(n \in \mathbb {N}\) and let \(T = ( T_t )_{t \in \mathbb {R}_+}\) be a one-parameter semigroup on the C ∗ algebra A. Prove that \(T^{(n)} = ( T_t^{(n)} )_{t \in \mathbb {R}_+}\) is a one-parameter semigroup on M n(A), Prove further that if T is uniformly continuous, with generator \(\mathcal {L}\), then T (n) is also uniformly continuous, with generator \(\mathcal {L}^{(n)}\).
Proposition 4.24 (Paschke [23])
Let \(A = ( a_{i j} )_{i, j = 1}^n \in M_n( \mathsf {A} )\) , where A is a C ∗ algebra. The following are equivalent.
-
(i)
The matrix A ∈ M n(A)+.
-
(ii)
The matrix A may be written as the sum of at most n matrices of the form \(( b_i^* b_j )_{i, j = 1}^n\) , where b 1 , …, b n ∈A.
-
(iii)
The sum \(\sum _{i, j = 1}^n c_i^* a_{i j} c_j \in A_+\) for any c 1 , …, c n ∈A.
Proof
To see that (iii) implies (i), we use the fact that any C ∗ algebra has a faithful representation which is a direct sum of cyclic representations [30, Theorem III.2.4]. Thus we may assume without loss of generality that A ⊆ B(H) and there exists a unit vector u ∈H such that {au : a ∈H} is dense in H.
Given this and Exercise 4.11, let c 1, …, c n ∈A. Then (iii) implies that
where v = (c 1 u, …, c n u)T ∈H n. Vectors of this form are dense in H n as c 1, …, c n vary over A, so the result follows by another application of Exercise 4.11.
The other implications are straightforward to verify. □
Exercise 4.25
Let \(n \in \mathbb {N}\). Use Proposition 4.24 to prove that a linear map Φ : A →B between C ∗ algebras is n-positive if and only if
for all a 1, …, a n ∈A and b 1, …, b n ∈B. Deduce that any ∗-homomorphism between C ∗ algebras is completely positive, as is any map of the form
Theorem 4.26
A positive linear map Φ : A →B between C ∗ algebras is completely positive if A is commutative or B is commutative.
Proof
The first result is due to Stinespring [29] and the second to Arveson [4]. We will prove the latter.
We may suppose that B = C 0(E), where E is a locally compact Hausdorff space, by Theorem 4.3. If a 1, …, a n ∈A, b 1, …, b n ∈ B and x ∈ E, then
where \(c( x ) := \sum _{i = 1}^n b_i( x ) a_i \in A\). Exercises 4.11 and 4.25 give the result. □
Definition 4.27
A map Φ : A →B between unital algebras is unital if Φ(1A) = 1B, where 1A and 1B are the multiplicative units of A and B, respectively.
Theorem 4.28 (Kadison)
A 2-positive unital linear map Φ : A →B between unital C ∗ algebras is such that
Proof
Note first that if a ∈A then
so
Suppose without loss of generality that B ⊆ B(H) for some Hilbert space H, and note that, by Exercise 4.11, if u ∈H and
As u is arbitrary, the claim follows. □
Remark 4.29
The inequality (4.2) is known as the Kadison–Schwarz inequality .
Exercise 4.30
Show that the inequality (4.2) holds if Φ is required only to be positive as long as a is normal , so that a ∗ a = aa ∗. [Hint: use Theorem 4.26.]
4.4 Stinespring’s Dilation Theorem
Exercise 4.25 gives two classes of completely positive maps. The following result makes clear that these are, in a sense, exhaustive.
Theorem 4.31 (Stinespring [29])
Let Φ : A → B(H) be a linear map, where A is a unital C ∗ algebra and H is a Hilbert space. Then Φ is completely positive if and only if there exists a Hilbert space K , a unital ∗-homomorphism π : A → B(K) and a bounded operator T : H →K such that
Proof
One direction is immediate. For the other, let \(\mathsf {K}_0 := \mathsf {A} { \underline {\otimes }} \mathsf {H}\) be the algebraic tensor product of A with H, considered as complex vector spaces. Define a sesquilinear form on K 0 such that
It is an exercise to check that this form is positive semidefinite, using the assumption that Φ is completely positive, and that the kernel
is a vector subspace of K 0. Let K be the completion of \(\mathsf {K}_0 / \mathsf {K}_{00} = \bigl \{ [ x ] : x \in \mathsf {K}_0 \bigr \}\).
If
then π(a) extends by linearity and continuity to an element of B(K), denoted in the same manner. Furthermore, the map a↦π(a) is a unital ∗-homomorphism from A to B(K).
To conclude, let T ∈ B(H;K) be defined by setting Tv = [1 ⊗ v] for all v ∈H. It is a final exercise to verify that Φ(a) = T ∗ π(a)T, as required. □
The following result extends the Kadison–Schwarz inequality, Theorem 4.28.
Corollary 4.32
If Φ : A → B(H) is unital and completely positive then
for all \(n \in \mathbb {N}\) , a 1 , …, a n ∈A and v 1 , …, v n ∈H.
Proof
Let π and T be as in Theorem 4.31. Then ∥T∥2 = ∥T ∗ π(1A)T∥ = ∥ Φ(1A)∥ = 1 and
□
Definition 4.33
A triple (K, π, T) as in Theorem 4.31 is a Stinespring dilation of Φ. Such a dilation is minimal if
Proposition 4.34
Any unital completely positive map Φ : A → B(H) has a minimal Stinespring dilation.
Proof
One may take (K, π, T) as in Theorem 4.31 and restrict to the smallest closed subspace of K containing {π(a)Tv : a ∈A, v ∈H}. □
Exercise 4.35
Prove that the minimal Stinespring dilation is unique in an appropriate sense.
Definition 4.36
Let (a i) ⊆A be a net in the von Neumann algebra A ⊆ B(H). We write if \(a_i \geqslant a_j \geqslant 0\) whenever \(i \geqslant j\) and 〈v, a i v〉→ 0 for all v ∈H. [It follows from Vigier’s theorem [22, Theorem 4.1.1.] that the decreasing net (a i) converges in the strong operator topology to some element a ∈A +.]
A linear map Φ : A → B(K) is normal if implies that 〈v, Φ(a i)v〉→ 0 for all v ∈K.
Proposition 4.37
Let A be a von Neumann algebra. If the linear map Φ : A → B(H) is completely positive and normal, then the unital ∗-homomorphism π of Theorem 4.31 may be chosen to be normal also.
Proof
Let (K, π, T) be a minimal Stinespring dilation for Φ. If v ∈H, a ∈A and the net (a i) ⊆A + is such that , then
since . It now follows by polarisation and minimality that , as required. □
Proposition 4.38
A linear map Φ : A → B(H) is normal if and only if it is σ-weakly continuous.
Proof
It suffices to prove that if (b i) ⊆ B(K) is a norm-bounded net then b i → 0 in the σ-weak topology if and only if 〈v, b i v〉→ 0 for all v ∈K. Furthermore, by polarisation, we need only consider σ-weakly continuous functionals of the form
The result now follows by a standard truncation argument. □
4.5 Semigroup Generators
We will now introduce the class of quantum Feller semigroups, and proceed toward a classification of the semigroup generators for a uniformly continuous subclass. As above, we will first establish some necessary conditions that hold in greater generality.
Definition 4.39
A quantum Feller semigroup \(T = ( T_t )_{t \in \mathbb {R}_+}\) on a C ∗ algebra A is a strongly continuous contraction semigroup such that each T t is completely positive.
If A is unital, with unit 1, and T t1 = 1 for all \(t \in \mathbb {R}_+\) then T is conservative .
Exercise 4.40
Let T be a quantum Feller semigroup on a unital C ∗ algebra. Prove that T is conservative if and only if \(1 \in \mathop {\mathrm {dom}} \mathcal {L}\), with \(\mathcal {L}( 1 ) = 0\). [Hint: Theorem 2.46 may be useful.]
To begin the characterisation of the generators of these semigroups, we introduce a concept due to Evans [11].
Proposition 4.41
Let Φ : A → B(H) be a linear map on the unital concrete C ∗ algebra A ⊆ B(H). The following are equivalent.
-
(i)
If \(n \in \mathbb {N}\) and a ∈ M n(A), then
$$\displaystyle \begin{array}{lll} \Phi^{(n)}( a^* a ) + a^* \Phi^{(n)}( 1 ) a - \Phi^{(n)}( a^* ) a - a^* \Phi^{(n)}( a ) \in M_n\bigl( {B(\mathsf{H})} \bigr)_+. \end{array}$$ -
(ii)
If \(n \in \mathbb {N}\) and a 1 , …, a n ∈A , then
$$\displaystyle \begin{array}{lll} \bigl( \Phi( a_i^* a_j ) + a_i^* \Phi( 1 ) a_j - \Phi( a_i^* ) a_j - a_i^* \Phi( a_j ) \bigr)_{i, j = 1}^n \in M_n\bigl( {B(\mathsf{H})} \bigr)_+. \end{array}$$ -
(iii)
If \(n \in \mathbb {N}\) , a 1 , …, a n ∈A and v 1 , …, v n ∈H are such that \(\sum _{i = 1}^n a_i v_i = 0\) , then
$$\displaystyle \begin{aligned} \sum_{i, j = 1}^n \langle v_i, \Phi( a_i^* a_j ) v_j \rangle \geqslant 0. \end{aligned}$$ -
(iv)
If \(n \in \mathbb {N}\) , a 1 , …, a n ∈A and b 1 , …, b n ∈ B(H) are such that \(\sum _{i = 1}^n a_i b_i = 0\) , then
$$\displaystyle \begin{aligned} \sum_{i, j = 1}^n b_i^* \Phi( a_i^* a_j ) b_j \geqslant 0. \end{aligned}$$
Proof
Given a 1, …, a n ∈A, let a = (a ij) ∈ M n(A) be such that a 1j = a j and a ij = 0 otherwise. Then
for all i, j = 1, …, n, so (i) implies (ii).
Conversely, let a = (a ij) ∈ M n(A). Applying (ii) to a k1, …, a kn and then summing over k gives that
Thus (ii) implies (i).
The implication from (ii) to (iii) is clear, as is that from (iii) to (iv). For the final part, let a 1, …, a n ∈A and b 1, …, b n ∈ B(H), let a 0 = 1 and \(b_0 = -\sum _{i = 1}^n a_i b_i\), and note that \(\sum _{i = 0}^n a_i b_i = 0\). Hence (iv) gives that
Thus (ii) now follows from the first part of Exercise 4.25. □
Definition 4.42
A linear map Φ : A → B(H) on the unital C ∗ algebra A ⊆ B(H) is conditionally completely positive if and only if any of the equivalent conditions in Proposition 4.41 hold.
Exercise 4.43
Prove that the set of conditionally completely positive maps from A to B(H) is a cone, that is, closed under addition and multiplication by non-negative scalars. Prove also that this cone contains all completely positive maps and scalar multiples of the identity map. Finally, prove that the cone is closed under pointwise weak-operator convergence: the net Φi → Φ if and only if 〈v, Φi(a)v〉→〈v, Φ(a)v〉 for all a ∈A and v ∈H.
Exercise 4.44
Let A be as in Definition 4.42. A linear map δ : A → B(H) is a derivation if and only if
Prove that a derivation is conditionally completely positive. Prove also that the map
is conditionally completely positive and normal for all G ∈ B(H).
Theorem 4.45
Let T be a uniformly continuous quantum Feller semigroup on the unital C ∗ algebra A ⊆ B(H). The semigroup generator \(\mathcal {L}\) is bounded, ∗-preserving and conditionally completely positive.
Proof
The first two claims follow immediate from Theorem 4.19. For conditional complete positivity, let a 1, …, a n ∈A and v 1, …, v n ∈H. By Corollary 4.32, if t > 0, then
Letting t → 0+ gives that
and if \(\sum _{i = 1}^n a_i v_i = 0\) then the second and third terms vanish. □
Exercise 4.46
Use Exercise 4.43 to provide an alternative proof that \(\mathcal {L}\) in Theorem 4.45 is conditionally completely positive.
The following result is [11, Theorem 2.9] of Evans, who credits Lindblad [21].
Theorem 4.47 (Lindblad, Evans)
Let \(\mathcal {L}\) be a ∗-preserving bounded linear map on the unital C ∗ algebra A ⊆ B(H). The following are equivalent.
-
(i)
\(\mathcal {L}\) is conditionally completely positive.
-
(ii)
\(( \lambda I - \mathcal {L} )^{-1}\) is completely positive for all sufficiently large λ > 0.
-
(iii)
\(T_t = \exp ( t \mathcal {L} )\) is completely positive for all \(t \in \mathbb {R}_+\).
Proof
The equivalence of (ii) and (iii) is given by Theorems 2.45 and 2.46, together with Exercise 4.23. The solution to Exercise 4.46 gives that (iii) implies (i); to complete the proof, it suffices to show that (i) implies (iii).
Suppose first that \(\mathcal {L}( 1 ) \leqslant 0\). Then \(\mathcal {L}^{(n)}( 1 ) \leqslant 0\) for all \(n \in \mathbb {N}\), so if a ∈ M n(A) then
Thus if b, c ∈ M n(A)+ are such that bc = 0 then b 1∕2 c = 0 and
Theorem 4.20 now gives that \(T^{(n)}_t = \exp ( t \mathcal {L}^{(n)} )\) is positive for all \(t \in \mathbb {R}_+\), so (iii) holds.
Finally, if \(\mathcal {L}( 1 ) > 0\), then the conditionally completely positive map
is such that \(\mathcal {L}'( 1 ) \leqslant 0\), since \(0 \leqslant \mathcal {L}( 1 ) \leqslant \| \mathcal {L}( 1 ) \| I\). It follows that \(T^{\prime }_t = \exp ( t \mathcal {L}' )\) is completely positive for all \(t \in \mathbb {R}_+\), and therefore so is \(T_t = \exp \bigl ( \| \mathcal {L}( 1 ) \| t \bigr ) T^{\prime }_t\). □
Remark 4.48
Since completely positive unital linear maps between unital C ∗ algebras are automatically contractive, by Theorem 4.31 and the fact that ∗-homomorphisms between C ∗ algebras are contractive, the previous result characterises the generators of uniformly continuous conservative quantum Feller semigroups.
4.6 The Gorini–Kossakowski–Sudarshan–Lindblad Theorem
In order to provide a more explicit description of the generators of quantum Feller semigroups, we will establish some results of Lindblad and Christensen, and of Kraus. The Kraus decomposition is a key tool in quantum information theory.
Theorem 4.49 (Lindblad, Christensen)
Let \(\mathcal {L}\) be a ∗-preserving bounded linear map on the von Neumann algebra A . Then \(\mathcal {L}\) is conditionally completely positive and normal if and only if there exists a completely positive, normal map Ψ : A →A and an element g ∈A such that
Proof
The second part of Exercise 4.44 shows that \(\mathcal {L}\) is conditionally completely positive or normal if and only if Ψ has the same property.
Given this, it remains to prove that if \(\mathcal {L}\) is conditionally completely positive, then there exists g ∈A such that \(a \mapsto \mathcal {L}( a ) - g^* a - a g\) is completely positive. We will show this under the assumption that A = B(H); see [14, Proof of Theorem 3.14]. The general case [7] requires considerably more work.
Given u, v ∈H, let the Dirac dyad
Fix a unit vector u ∈H, and let G ∈ B(H) be such that
Given a 1, …a n ∈A and v 1, …, v n ∈H, let v 0 = u and \(a_0 = -\sum _{i = 1}^n |a_i v_i\rangle \langle u|\), so that \(\sum _{i = 0}^n a_i v_i = 0\). The conditional complete positivity of \(\mathcal {L}\) implies that
The result follows. □
Remark 4.50
If A is required only to be a C ∗ algebra, then Christensen and Evans [7] showed that Theorem 4.49 remains true if \(\mathcal {L}\) and Ψ no longer required to be normal, but then g and the range of Ψ must be taken to lie in the σ-weak closure of A.
Theorem 4.51 (Kraus [18])
Suppose A ⊆ B(H) is a von Neumann algebra. A linear map Ψ : A → B(K) is normal and completely positive if and only if there exists a family of operators \(( L_i )_{i \in \mathbb {I}} \subseteq {B(\mathsf {K};\mathsf {H})}\) such that
with convergence in the strong operator topology. The cardinality of the index set \(\mathbb {I}\) may be taken to be no larger than \(\dim \mathsf {K}\).
Proof
If Ψ has this form, then it is completely positive and normal. The first claim is readily verified; for the second, let , fix j 0 and note that \(\langle u, a_j u \rangle \leqslant \langle u, a_{j_0} u \rangle \) for all u ∈H and \(j \geqslant j_0\). Fix ε > 0 and v ∈K, choose a finite set \(\mathbb {I}_0 \subseteq \mathbb {I}\) such that the sum \(\sum _{i \in \mathbb {I}_0} \langle L_i v, a_{j_0} L_i v \rangle > \langle v, \Psi ( a_{j_0} ) v \rangle - \varepsilon \), and note that
This shows that Ψ is normal, as required.
For the converse, Theorem 4.31 shows it suffices to prove that if π : A → B(K) is a normal unital ∗-homomorphism, then π can be written as in the statement of the theorem.
Let \(( e_i )_{i \in \mathbb {I}}\) be an orthonormal basis for H, and consider the net \(( I_{\mathsf {H}} - \sum _{i \in \mathbb {I}_0} |e_i\rangle \langle e_i| )\), where the index \(\mathbb {I}_0\) runs over all finite subsets of \(\mathbb {I}\), ordered by inclusion. Since π is normal and unital, we have that \(I_{\mathsf {K}} = \sum _{i \in \mathbb {I}} \pi \bigl ( |e_i\rangle \langle e_i| \bigr )\) in the weak-operator sense; thus, there exists some \(i_0 \in \mathbb {I}\) such that \(P := \pi \bigl ( |e_{i_0}\rangle \langle e_{i_0}| \bigr )\) is a non-zero orthogonal projection.
Let u ∈K be a unit vector such that Pu = u, let a ∈A, and note that
Hence there exists a partial isometry L 0 : K →H with initial space K 0, the norm closure of {π(a)u : a ∈A}, and final space H 0, the norm closure of {ae 0 : a ∈A}, and such that L 0 π(a)u = ae 0 for all a ∈A. Note that K 0 is invariant under the action of π(a), for all a ∈A, so
Thus \(\pi ( a ) |{ }_{\mathsf {K}_0} = L_0^* a L_0 |{ }_{\mathsf {K}_0}\), and since \(L_0( \mathsf {K}_0^\perp ) = \{ 0 \}\), it follows that \(\pi ( a ) P_0 = L_0^* a L_0\) for all a ∈A, where \(P_0 := L_0^* L_0\) is the orthogonal projection onto the initial space K 0.
Repeating this argument, but on \(\mathsf {K}_0^\perp \), there exists a partial isometry L 1 : K →H with initial projection P 1 such that P 0 P 1 = 0 and \(\pi ( a ) P_1 = L_1^* a L_1\) for all a ∈A. An application of Zorn’s lemma now gives the result. □
Remark 4.52
With Ψ and \(( L_i )_{i \in \mathbb {I}}\) as in Theorem 4.51, we may write
where \(\mathsf {K}_{\mathbb {I}}\) is the Hilbert space with orthonormal basis \(( e_i )_{i \in \mathbb {I}}\) and \(L \in {B(\mathsf {K};\mathsf {H} \otimes \mathsf {K}_{\mathbb {I}})}\) is such that
Exercise 4.53
Use Theorem 4.51 and the second part of Theorem 4.26 to show that every positive normal linear functional on the von Neumann algebra A has the form
[Every bounded linear functional is the linear combination of four positive ones [22, Theorem 3.3.10], and Grothendieck [15] observed that each of these may be taken to be normal if the original is [17, Theorem 7.4.7]. Hence every normal linear functional is of the form used to define the σ-weak topology in Definition 4.6.]
Lemma 4.54
Let T be a uniformly continuous semigroup on a von Neumann algebra with generator \(\mathcal {L}\) . Then \(\mathcal {L}\) is normal if and only if T t is normal for all \(t \in \mathbb {R}_+\).
Proof
This holds because the limit of a norm-convergent sequence of normal maps is normal. To see this, let Φn, Φ : A → B(H) be such that ∥ Φn − Φ∥→ 0, let the net (a i) ⊆A + be such that , and let v ∈H. Fix i 0 and note that \(\| a_i \| \leqslant \| a_{i_0} \|\) whenever \(i \geqslant i_0\), so
The claim follows. □
Theorem 4.55 (Gorini–Kossakowski–Sudarshan, Lindblad)
Let A ⊆ B(H) be a von Neumann algebra. A bounded linear map \(\mathcal {L} \in {B(\mathsf {A})}\) is the generator of a uniformly continuous conservative quantum Feller semigroup composed of normal maps if and only if
where h = h ∗∈A and L ∈ B(H;H ⊗K) for some Hilbert space K.
Proof
If \(\mathcal {L}\) has this form, then it is straightforward to verify that the semigroup it generates is as claimed.
Conversely, suppose \(\mathcal {L}\) is the generator of a semigroup as in the statement of the theorem. Then Theorem 4.47 gives that \(\mathcal {L}\) is conditionally completely positive and \(\mathcal {L}( 1 ) = 0\). Moreover, \(\mathcal {L}\) is normal, by the preceding lemma, and so Theorem 4.49 gives that
where Ψ : A →A is completely positive and normal, and g ∈A. Taking a = 1 in this equation shows that g ∗ + g = − Ψ(1), so \(g = {-\frac {1}{2}} \Psi ( 1 ) + \mathrm {i} h\) for some self-adjoint element h ∈A. The result now follows by Theorem 4.51. □
The story of the previous theorem is very well told in [8]. Going beyond the case of bounded generators is the subject of much interest. See the survey [28] for some recent developments.
4.7 Quantum Markov Processes
We will conclude by giving a very brief indication of how a quantum process may be defined.
Remark 4.56
Let E be a compact Hausdorff space. If X is an E-valued random variable on the probability space \(\mbox{{$( \Omega , \mathcal {F}, \mathbb {P} )$}}\), then
is a unital ∗-homomorphism, where A = C(E) and \(\mathsf {B} = L^\infty \mbox{{$( \Omega , \mathcal {F}, \mathbb {P} )$}}\).
Definition 4.57
A non-commutative random variable is a unital ∗-homomorphism j between unital C ∗ algebras.
A family \(( j_t : \mathsf {A} \to \mathsf {B} )_{t \in \mathbb {R}_+}\) of non-commutative random variables is a dilation of the quantum Feller semigroup T on A if there exists a conditional expectation \(\mathbb {E}\) from B onto A such that \(T_t = \mathbb {E} \mathbin {\circ } j_t\) for all \(t \in \mathbb {R}_+\).
The problem of constructing such dilations has attracted the interest of many authors, including Evans and Lewis [13], Accardi et al. [1], Vincent-Smith [31], Kümmerer [19], Sauvageot [27] and Bhat and Parthasarathy [5].
Essentially, one attempts to mimic the functional-analytic proof of Theorem 3.22. Given the appropriate analogue of an initial measure, which is a state μ on the C ∗ algebra A, the sesquilinear form
must be shown to be positive semidefinite. The key to this is the complete positivity of the semigroup maps. There are many technical issues to be addressed; see [5] for more details.
References
L. Accardi, A. Frigerio, J.T. Lewis, Quantum stochastic processes. Publ. Res. Inst. Math. Sci. 18(1), 97–133 (1982)
R. Alicki, K. Lendi, Quantum Dynamical Semigroups and Applications. Lecture Notes in Physics, vol. 717, 2nd edn. (Springer, Berlin, 2007)
D. Applebaum, Lévy Processes and Stochastic Calculus, 2nd edn. (Cambridge University Press, Cambridge, 2009)
W.B. Arveson, Subalgebras of C ∗-algebras. Acta Math. 123, 141–224 (1969)
B.V.R. Bhat, K.R. Parthasarathy, Kolmogorov’s existence theorem for Markov processes in C ∗ algebras. Proc. Indian Acad. Sci. Math. Sci. 104(1), 253–262 (1994)
M.-D. Choi, Positive linear maps on C ∗-algebras. Can. J. Math. 24(3), 520–529 (1972)
E. Christensen, D.E. Evans, Cohomology of operator algebras and quantum dynamical semigroups. J. Lond. Math. Soc. 20(2), 358–368 (1979)
D. Chruściński, S. Pascazio, A brief history of the GKLS equation. Open Syst. Inf. Dyn. 24(3), 1740001, 20pp. (2017)
E.B. Davies, Linear Operators and Their Spectra (Cambridge University Press, Cambridge, 2007)
E.G. Effros, Z.-J. Ruan, Operator Spaces. London Mathematical Society Monographs, vol. 23 (Oxford University Press, Oxford, 2000)
D.E. Evans, Conditionally completely positive maps on operator algebras. Q. J. Math. 28(3), 271–283 (1977)
D.E. Evans, H. Hanche-Olsen, The generators of positive semigroups. J. Funct. Anal. 32, 207–212 (1979)
D.E. Evans, J.T. Lewis, Dilations of Irreversible Evolutions in Algebraic Quantum Theory. Communications of the Dublin Institute for Advanced Studies Series A, vol. 24 (Dublin Institute for Advanced Studies, Dublin, 1977), v+104 pp.
F. Fagnola, Quantum Markov semigroups and quantum flows. Proyecciones 18(3), 144 pp. (1999)
A. Grothendieck, Un résultat sur le dual d’une C ∗-algèbre. J. Math. Pures Appl. (9) 36(2), 97–108 (1957)
E. Hille, R.S. Phillips, Functional Analysis and Semi-Groups. AMS Colloquium Publications, vol. 31, third printing of the revised 1957 edition (American Mathematical Society, Rhode Island, 1974)
R.V. Kadison, J.R. Ringrose, Fundamentals of the Theory of Operator Algebras II. Advanced Theory. Graduate Studies in Mathematics, vol. 16. (American Mathematical Society, Providence, 1997)
K. Kraus, General state changes in quantum theory. Ann. Phys. 64(2), 311–335 (1971)
B. Kümmerer, Markov dilations on W ∗-algebras. J. Funct. Anal. 63(2), 139–177 (1985)
T.M. Liggett, Continuous Time Markov Processes (American Mathematical Society, Providence, 2010)
G. Lindblad, On the generators of quantum dynamical semigroups. Commun. Math. Phys. 48(2), 119–130 (1976)
G.J. Murphy, C ∗ -Algebras and Operator Theory (Academic, New York, 1990)
W.L. Paschke, Inner product modules over B ∗-algebras. Trans. Am. Math. Soc. 182, 443–468 (1973)
V. Paulsen, Completely Bounded Maps and Operator Algebras. Cambridge Studies in Advanced Mathematics, vol. 78 (Cambridge University Press, Cambridge, 2002)
M. Reed, B. Simon, Methods of Modern Mathematical Physics I. Functional Analysis, revised and enlarged edition (Academic, New York, 1980)
L.C.G. Rogers, D. Williams, Diffusions, Markov Processes and Martingales I. Foundations, 2nd edn. (Cambridge University Press, Cambridge, 2000)
J.-L. Sauvageot, Markov quantum semigroups admit covariant Markov C ∗-dilations. Commun. Math. Phys. 106(1), 91–103 (1986)
I. Siemon, A.S. Holevo, R.F. Werner, Unbounded generators of dynamical semigroups. Open Syst. Inf. Dyn. 24(4), 1740015, 24pp. (2017)
W.F. Stinespring, Positive functions on C ∗-algebras. Proc. Am. Math. Soc. 6(2), 211–216 (1955)
M. Takesaki, Theory of Operator Algebras I. Encyclopaedia of Mathematical Sciences, vol. 124 (Springer, Berlin, 2002)
G.F. Vincent-Smith, Dilation of a dissipative quantum dynamical system to a quantum Markov process. Proc. Lond. Math. Soc. 49(1), 58–72 (1984)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Belton, A.C.R. (2019). Introduction to Classical and Quantum Markov Semigroups. In: Bahns, D., Pohl, A., Witt, I. (eds) Open Quantum Systems . Tutorials, Schools, and Workshops in the Mathematical Sciences . Birkhäuser, Cham. https://doi.org/10.1007/978-3-030-13046-6_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-13046-6_1
Published:
Publisher Name: Birkhäuser, Cham
Print ISBN: 978-3-030-13045-9
Online ISBN: 978-3-030-13046-6
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)