Abstract
In this chapter we present the basic material on metric, Banach, and Hilbert spaces needed in the sequel. By design, every Hilbert space is a Banach space—with the converse being untrue. Banach and Hilbert spaces play a central role in many areas and subareas of mathematical analysis as most of the spaces encountered and utilized in practical problems turn out to be either Hilbert spaces or Banach spaces.
Access provided by CONRICYT-eBooks. Download chapter PDF
1.1 Introduction
In this chapter we present the basic material on metric, Banach, and Hilbert spaces needed in the sequel. By design, every Hilbert space is a Banach space—with the converse being untrue. Banach and Hilbert spaces play a central role in many areas and subareas of mathematical analysis as most of the spaces encountered and utilized in practical problems turn out to be either Hilbert spaces or Banach spaces.
Recall that Banach spaces were named after the Polish mathematician Stefan Banach who introduced them in the mathematical literature around 1920–1922. Basically, a Banach space is a normed vector space that is complete, that is, every Cauchy sequence in it must converge in it. For instance \(\mathbb Q\) the field of rational numbers equipped with the standard absolute value is not complete; its completion is in fact \(\mathbb R\), the field of real numbers. Standard examples of Banach spaces include, but are not limited to, \(\mathbb R^d\), \(\mathbb C^d\), \(\ell ^p(\mathbb N)\), L p (Lebesgue spaces), W k, p (Sobolev spaces), C k, α (Hölder spaces), \(B_{p,q}^s\) (Besov spaces), \(H^p(\mathbb S^1)\) (Hardy spaces), and BMO (functions of bounded mean oscillation)—when they are equipped with their respective standard norms.
In this chapter, we first study some of the basic properties of metric spaces and then use these to deduce those of Banach and Hilbert spaces.
One should stress upon the fact that the introductory material presented in this chapter can be found in any good book in (nonlinear) functional analysis. Consequently, some proofs will be omitted.
1.2 Metric Spaces
This section is essentially devoted to metric spaces and their basic properties. Among other things, the following notions will be introduced and studied in the context of metric spaces: convergence, completeness, continuity, compact metric spaces, and the Banach fixed-point principle.
1.2.1 Basic Definitions and Examples
Definition 1.1
A pair (M, d) consisting of a nonempty set M and a mapping (metric or distance) d : M × M↦[0, ∞) is called a metric space, if the mapping d fulfills the following properties,
-
i)
d(x, y) = 0 if and only if x = y;
-
ii)
d(x, y) = d(y, x); and
-
iii)
d(x, z) ≤ d(x, y) + d(y, z)
for all x, y, z ∈ M.
The inequality iii) appearing in Definition 1.1 is commonly known as the triangle inequality. Further, elements of the set M are called points and the quantity d(x, y) is referred to as the distance between the points x, y ∈ M.
If (M, d) is a metric space, then using the triangle inequality, it can be easily shown that the following property holds,
for all x, y, z ∈ M.
Example 1.2
-
(1)
Let d 0 and d 1 be two metrics upon a nonempty set M. Consider the mapping d defined by, d(x, y) = αd 0(x, y) + βd 1(x, y) for all x, y ∈ M, where α, β ≥ 0 and α + β = 1. It is not hard to see that the pair (M, d) is a metric space.
-
(2)
Let M be an arbitrary nonempty set which we endow with the so-called discrete metric d s defined by
$$\displaystyle \begin{aligned} d_s (x, y) := \begin{cases} 0 & \mbox{if } x = y \\ \\ 1 & \mbox{if } x \neq y\end{cases} \end{aligned}$$It can be easily shown that (M, d s) is a metric space.
-
(3)
Fix p ≥ 1. Then \((\mathbb R^n, d_p)\) is a metric space, where
$$\displaystyle \begin{aligned} d_p(\alpha, \beta) := \Bigg(\sum_{k=1}^n |\alpha_k-\beta_k|{}^p\Bigg)^{1/p} \end{aligned}$$for all \(\alpha = (\alpha_{1}, \alpha_{2}, ..., \alpha_{n}), \ \ \beta = (\beta _{1}, \beta_{2}, ..., \beta_{n}) \in \mathbb{R}^n\).
-
(4)
Consider the unit circle, \(\mathbb S^1 = \{z \in \mathbb C: |z| = 1\}\) and let \(\mathcal A(\mathbb S^1)\) stand for the collection of all functions \(f: \mathbb S^1 \mapsto \mathbb C\) for which
$$\displaystyle \begin{aligned} \int_{0}^{2\pi} |f(e^{i\theta})|{}^2 d\theta < \infty. \end{aligned}$$It can be easily seen that \((\mathcal A(\mathbb S^1), \rho )\) is a metric space, where the metric ρ is defined by,
$$\displaystyle \begin{aligned} \rho(f,g):= \Bigg(\int_{0}^{2\pi} |f(e^{i\theta}) - g(e^{i\theta})|{}^2 d\theta\Bigg)^{\frac{1}{2}} \end{aligned}$$for all \(f, g \in \mathcal A(\mathbb S^1)\).
As usual, if (M, d) is a metric space, then the metric d enables us to define the notions of balls and spheres in M. Indeed, the (open) ball centered at x ∈ M with radius r > 0 is defined by B(x, r) = {y ∈ M : d(x, y) < r}. Similarly, the (closed) ball centered at x ∈ M with radius r ≥ 0 is defined by \(\overline {B}(x,r) = \{y \in M: d(x,y) \leq r\}\). By the sphere S(x, r), centered at x ∈ M with radius r ≥ 0, we mean the set of all points defined by S(x, r) = {y ∈ M : d(x, y) = r}.
Definition 1.3
Let (M, d) be a metric space and let \({\mathcal O} \subset M\) be a subset. The set \({\mathcal O} \) is said to be an open set if for all x ∈ M, there exists r > 0 such that \(B(x, r) \subset {\mathcal O}.\)
Classical examples of open sets in a metric space (M, d) include M itself and the empty set, ∅. Recall that arbitrary unions of open sets of M are also open sets of M. Further, finite intersections of open sets of M are also open sets of M.
Definition 1.4
Let (M, d) be a metric space and let \({\mathcal O} \subset M\) be a subset. A point x ∈ M is said to be an interior point of \({\mathcal O} \) if and only if there exists r > 0 such that \(B(x,r) \subset {\mathcal O}.\)The collection of all interior points of \({\mathcal O}\) is denoted \(\mathrm {Int} ({\mathcal O} )\).
It can be shown that a subset \({\mathcal O} \) of M is open if and only if it contains all of its interior points, that is, \({\mathcal O} = \mathrm {Int} ({\mathcal O})\).
Definition 1.5
Let (M, d) be a metric space and let \({\mathcal O} \subset M\) be a subset. The set \({\mathcal O} \) is said to be a closed set if its complement \({\mathcal O} ^C = M \setminus {\mathcal O}\) is an open set.
Classical examples of closed sets of a metric space (M, d) include singletons {x}, M, and ∅.
Definition 1.6
Let (M, d) be a metric space and let \({\mathcal O} \subset M\) be a subset. A point x ∈ M is said to be an adherent point of \({\mathcal O} \) if and only if for all r > 0, the following holds, \(B(x,r) \cap {\mathcal O} \neq \{\emptyset \}.\) The collection of all adherent points of \({\mathcal O} \) is called the closure of \({\mathcal O} \) and is denoted \(\overline {{\mathcal O}}\).
It can be shown that a subset \({\mathcal O} \) of M is closed if and only if it contains all of its adherent points, that is, \({\mathcal O} = \overline {{\mathcal O} }\). In other words, \({\mathcal O}\) is closed if and only if for any sequence \((x_n)_{n \in \mathbb N} \subset {\mathcal O} \) such that d(x n, x) → 0 for some x ∈ M as n →∞, then one must have \(x \in {\mathcal O}\).
Definition 1.7
Let (M, d) be a metric space and let \({\mathcal O} \subset M\) be a subset. The subset \({\mathcal O}\) is said to be bounded it is included in some ball B(x, r). Otherwise, the set \({\mathcal O} \) is said to be unbounded.
It is easy to see that \({\mathcal O} \subset M\) is bounded if and only if its diameter, \(\displaystyle \text{diam}({\mathcal O} ) := \displaystyle \sup _{x, y \in {\mathcal O} } d(x,y),\) is finite, that is, \(\displaystyle \text{diam}({\mathcal O} ) < \infty \).
If (M, d) is a metric space, then the metric d enables us to define the notion of convergence in M.
Definition 1.8
Let (M, d) be a metric space. A sequence \((x_n)_{n \in \mathbb N} \subset M\) is said to converge to some x ∈ M with respect to the metric d, if d(x n, x) → 0 as n →∞. Equivalently, for every ε > 0, there exists \(N \in \mathbb N\) such that d(x n, x) < ε for all n ≥ N.
If a sequence \((x_n)_{n \in \mathbb N} \subset M\) converges to some x ∈ M with respect to the metric d, then we write limn→∞ x n = x.
Proposition 1.9
Let (M, d) be a metric space. If a sequence \((x_n)_{n \in \mathbb N} \subset M\) converges, then its limit is unique.
Proof
Suppose \((x_n)_{n \in \mathbb N} \subset M\) converges to two limits x, y ∈ M. Then, using the triangle inequality it follows that 0 ≤ d(x, y) ≤ d(x, x n) + d(x n, y). Letting n →∞ in the previous inequality, it follows that d(x, y) = 0 which yields x = y.
Definition 1.10
Let (M, d) be a metric space. A sequence \((x_n)_{n \in \mathbb N} \subset M\) is called a Cauchy sequence, if for every ε > 0, there exists \(N \in \mathbb N\) such that d(x n, x m) < ε for all n, m ≥ N.
Proposition 1.11
Let (M, d) be a metric space. Every convergent sequence is a Cauchy sequence. Further, every Cauchy sequence is bounded.
Proof
-
i)
Let \((x_n)_{n \in \mathbb N}\) be a convergent sequence in the metric space (M, d). This means that there exists x ∈ M such that d(x n, x) → 0 as n →∞. Equivalently, for all ε > 0, there exists \(N \in \mathbb N\) such that \(d(x_n, x) < \frac {\varepsilon }{2}\) for all n ≥ N. Now, using the triangle inequality it follows that \(d(x_n, x_m) \leq d(x_n, x) + d(x, x_m) < \frac {\varepsilon }{2} + \frac {\varepsilon }{2} = \varepsilon \) for all n, m ≥ N. Consequently, \((x_n)_{n \in \mathbb N}\) is a Cauchy sequence.
-
ii)
Suppose that \((x_n)_{n \in \mathbb N}\) is a Cauchy sequence. For ε = 1, there exists \(N \in \mathbb N\) such that d(x n, x m) < 1 for n, m ≥ N. In particular, d(x n, x N) < 1 for all n ≥ N. Setting \(r = 1+ \max \{1, d(x_1, x_N), d(x_2, x_N), ..., d(x_{N-1}, x_N)\}\), one can easily see that \((x_n)_{n \in \mathbb N} \subset B(x_N,r)\) which yields the sequence \((x_n)_{n \in \mathbb N}\) is bounded.
1.2.2 Complete Metric Spaces
Definition 1.12
A metric space (M, d) is said to be complete, if every Cauchy sequence in it converges in it.
Classical examples of complete metric spaces include \(\mathbb R^n\) equipped with its corresponding Euclidean metric defined by
for all \(x = (x_{1}, x_{2}, ..., x_{n}), \quad y = (y_{1}, y_{2} ..., y_{n}) \in \mathbb{R}^{n}\), and \(BC(\mathbb R, M)\) the collection of all bounded continuous functions which go from \(\mathbb R\) into a complete metric space (M, d), when it is equipped with the sup norm metric, \(d_\infty (f,g) = \sup _{t \in \mathbb R} d(f(t), g(t))\) for all \(f, g \in BC(\mathbb R, M)\), etc.
A classical example of a metric space which is not complete is \(\mathbb Q\); the field of rational numbers; when it is equipped with the standard absolute value defined by d 0(x, y) = |x − y| for all \(x,y \in \mathbb Q\). There are obviously various ways of constructing a Cauchy sequence in \(\mathbb Q\) which diverges. Let us exhibit one here. Indeed, consider the recurrent sequence \((x_n)_{n \in \mathbb N}\) given by,
It is not hard to show that not only \((x_n)_{n \in \mathbb N}\) is a sequence of rational numbers but also that \(\displaystyle \lim _{n \to \infty } x_n = \sqrt {3} \in \mathbb R \setminus \mathbb Q\). This shows that \((\mathbb Q,d_0)\) is not complete.
The larger question regarding the completion of a given arbitrary metric space (M, d) is the following: if (M, d) is not complete, are there ways of making it complete? The answer is a “yes.” Indeed, suppose that (M, d) is not complete and let
Define an equivalence relation upon elements of CS(M, d) as follows: two sequences \((x_n)_{n \in \mathbb N}\), \((y_n)_{n \in \mathbb N}\) in CS(M, d) are said to be equivalent which we denote \((x_n)_{n \in \mathbb N} \sim (y_n)_{n \in \mathbb N}\) if and only if, for ε > 0, there exists \(N \in \mathbb N\) such that
Consider
and define the mapping \(\widetilde {d}: \widetilde {CS(M, d)} \times \widetilde {CS(M, d)} \mapsto [0, \infty )\) as follows: if \(x, y \in \widetilde {CS(M, d)}\), that is, x = [(x n)n] and \(y =[(y_n)_{n \in \mathbb N}]\) (equivalence classes of \((x_n)_{n \in \mathbb N}\) and \((y_n)_{n \in \mathbb N}\)), then
It is easy to check that \(\widetilde {d}\) is a metric upon \(\widetilde {CS(M, d)}\). Moreover, the metric space \((\widetilde {CS(M, d)}, \widetilde {d})\), by construction, is complete.
1.2.3 Continuous Functions
In the sequel, the pairs (M, d), (M′, ρ), (M 1, d 1), (M 2, d 2), and (M 3, d 3) stand for metric spaces.
Definition 1.13
A function f : (M, d)↦(M′, ρ) is said to be continuous at x 0 ∈ M, if for all ε > 0 there exists δ > 0 such that for all x ∈ M, d(x 0, x) < δ yields ρ(f(x 0), f(x)) < ε. The function f is said to be continuous on M, if it is continuous at each point of M.
Recall that the continuity of a function f : (M, d)↦(M′, ρ) at x 0 ∈ M is equivalent to its sequential continuity at x 0 ∈ M, that is, for any arbitrary sequence \((x_n)_{n \in \mathbb N} \subset M\) that converges to some x 0, we have that f(x n) converges to f(x 0). In general, it is easier to prove the continuity of a function using the sequential continuity than the general definition of continuity given in Definition 1.13.
We have the following composition result for continuous functions on metric spaces whose proof is left to the reader as an exercise.
Proposition 1.14
Let f : M 1↦M 2 and g : M 2↦M 3 be given functions. Let x 0 ∈ M 1 be such that f is continuous at x 0 and that g is continuous at f(x 0). Then, gof, the composition of f with g, is also continuous at x 0.
Definition 1.15
A function f : (M, d)↦(M′, ρ) is said to be uniformly continuous if, for all ε > 0, there exists δ > 0 such that for all x, y ∈ M, d(x, y) < δ yields ρ(f(x), f(y)) < ε.
Obviously, every uniformly continuous function is continuous. However, the converse, except in the case when M is a compact metric space, is in general not true (see for instance Theorem 1.20).
1.2.4 Compact Metric Spaces
Definition 1.16
An open covering of (M, d) is a collection of open sets whose union is M.
Definition 1.17
A metric space (M, d) is said to be compact if every open covering of it has a finite subcovering.
Definition 1.18
A subset \({\mathcal O}\) of a metric space (M, d) is called relatively compact if its closure \(\overline {{\mathcal O}}\) is a compact subset of M.
Theorem 1.19
A metric space (M, d) is compact if and only if every sequence of elements of M has a subsequence which converges.
Proof
The proof is left to the reader as an exercise.
Theorem 1.20
If (M, d) is a compact metric space and if f : M↦M′ is a continuous function, then f is uniformly continuous.
Proof
Let ε > 0 and x ∈ M be given. Using the fact that f is continuous, we deduce that there exists δ x > 0 such that d(x, y) < δ x yields \(\rho (f(x), f(y)) < \frac {\varepsilon }{2}\). Since M is compact, it follows that there exists a finite number of points x 1, x 1, ..., x d such that the following holds,
Now let \(\delta = \frac {1}{2} \min \{\delta _{x_j}: j =1, 2, ..., d\}\). Clearly, δ > 0. Further, for all (x, y) ∈ M × M satisfying d(x, y) < δ, one can find j 0 ∈{1, 2, ..., d} such that x and y belong to \(B(x_{j_0}, \delta _{j_0})\), which yields \(\rho (f(x), f(x_{j_0})) < \frac {\varepsilon }{2}\) and \(\rho (f(y), f(x_{j_0})) < \frac {\varepsilon }{2}\). Using the triangle inequality it follows that,
which shows that the function f is uniformly continuous.
Let C(M, M′) denote the collection of all continuous functions from M into M′. If \(M' = \mathbb C\), then \(C(M, \mathbb C)\) will be denoted by C(M). If M is compact, then C(M) is a metric space when it is equipped with the metric, d ∞(u, v) =maxx ∈ M|u(x) − v(x)| for all u, v ∈ C(M).
Definition 1.21
A sub-collection Γ ⊂ C(M) is called uniformly bounded if there exists C > 0 such that |u(x)|≤ C for every x ∈ M and u ∈ Γ.
Definition 1.22
A sub-collection Γ ⊂ C(M, M′) is said to be equi-continuous if for all x 0 ∈ M and for all ε > 0 there exists δ = δ(x 0, ε) > 0 such that d(x, x 0) < δ yields ρ(f(x), f(x 0)) < ε for all f ∈ Γ.
Example 1.23
Let Γ ⊂ C(M) be defined as the family of all K-Lipschitz functions on M where K ≥ 0. That is, all functions \(u: M \mapsto \mathbb C\) such that |u(x) − u(y)|≤ Kd(x, y) for all x, y ∈ M. It can be easily shown that the family Γ is equi-continuous.
Theorem 1.24 (Arzelà-Ascoli Theorem)
A sub-collection Γ ⊂ C(M) is relatively compact if and only if,
-
(a)
Γ is equi-continuous; and
-
(b)
Γ is uniformly bounded.
Proof
The proof is left to the reader as an exercise.
Corollary 1.25
A sub-collection Γ ⊂ C(M) is compact if and only if it is closed, uniformly bounded, and equi-continuous.
Proof
The proof is left to the reader as an exercise.
1.2.5 Banach Fixed-Point Principle
Definition 1.26
A mapping T : (M, d)↦(M′, ρ) is said to be Lipschitz if there exists K ≥ 0 (Lipschitz constant) such that
for all x, y ∈ M. In the case when 0 ≤ K < 1, then the map T is said to be a strict contraction.
Example 1.27
Let \({\mathcal O} \subset M\) be a subset. If x ∈ M, ones defines the distance between the point x and the set \({\mathcal O}\) as follows,
Proposition 1.28
The map \(D: M \mapsto \mathbb R_+\) , \(x \mapsto D(x):= d(x, {\mathcal O})\) is Lipschitz with 1 as its Lipschitz constant. That is,
for all x, y ∈ M.
Proof
Using the triangle inequality and a property of the infimum it follows that D(x) ≤ d(x, e) ≤ d(x, y) + d(y, e) for all x, y ∈ M and \(e \in {\mathcal O}\), which yields D(x) − d(x, y) ≤ d(y, e) for all x, y ∈ M and \(e \in {\mathcal O}\). Using the fact that the infimum is the greatest element that is less than or equal to all elements, we deduce that D(x) − d(x, y) ≤ D(y) which in turn yields D(x) − D(y) ≤ d(x, y) for all x, y ∈ M. Since x and y are arbitrary elements of M, replacing x with y in the previous inequality, one gets, D(y) − D(x) ≤ d(y, x) = d(x, y) for all x, y ∈ M. Combing the last two inequalities, we deduce that |D(x) − D(y)|≤ d(x, y) for all x, y ∈ M.
If T : (M, d)↦(M, d) is a mapping, then one defines its fixed-points by F T = {x ∈ M : T(x) = x}.
Remark 1.29
Note that if T is a strict contraction, then F T cannot contain more than one element. Indeed, suppose x = Tx and y = Ty, then d(x, y) = d(T(x), T(y)) ≤ Kd(x, y) < d(x, y), this impossible. Consequently, F T cannot contain more than one element.
Theorem 1.30 (Banach Fixed-Point Theorem [47])
If (M, d) is a complete metric space and if T : M↦M is a strict contraction, then it has a unique fixed-point.
Proof
Let x 0 ∈ M. Define the sequence x n = T n x 0 which yields x n+1 = Tx n and x n = Tx n−1. Consequently, d(x n+1, x n) ≤ K n d(x 0, x 1) where 0 ≤ K < 1 is the Lipschitz constant. Similarly, for all \(n, m \in \mathbb N\) with n > m, we have
This yields the sequence \((x_n)_{n \in \mathbb N} \subset M\) which is a Cauchy sequence and since (M, d) is complete, it follows that there exists x ∈ M such that limn→∞ d(x n, x) = 0. Since T is continuous, it follows that Tx =limn→∞ Tx n =limn→∞ x n+1 = x which yields x ∈ F T. Using Remark 1.29, we deduce that F T = {x}.
1.2.6 Equilibrium Points for the Discrete Logistic Equation
This subsection is an application to the Banach fixed-point theorem and is based upon [72, Chapter 3]. Indeed, we make use of the Banach fixed-point theorem to determine equilibrium points of the (discrete) logistic equation. One of the most celebrated discrete dynamical systems arising in dynamic of population is that of the logistic equation, which is given by the following nonlinear difference equation,
where σ ∈ [0, 1] and x(t) ∈ [0, 1] for all \(t \in \mathbb N\).
In many concrete applications, x(t) stands for the size of the population being studied at the generation t of a reproducing population. Recall that the linear part of the logistic equation, that is, x(t + 1) = 4σx(t), describes, depending upon σ, the exponential growth (if \(\sigma > \frac {1}{4}\)) or decay (if \(\sigma < \frac {1}{4}\)) of a population subject to constant birth or death rate.
Clearly, the discrete logistic equation is of the form, x(t + 1) = Sx(t), where the function S (logistic map) is defined by
Obviously, equilibrium points of the logistic equation correspond to the fixed-points of the logistic map S. It can be easily shown that for \(\sigma \in [0, \frac {1}{4}]\), the point x 0 = 0 is the only fixed-point (equilibrium point of the discrete logistic equation) of S. For \(\sigma \in (\frac {1}{4}, 1]\), fixed-points of the logistic map S are given by the points of the form, x(σ) := 1 − (4σ)−1. Consequently, the logistic map S has infinity many fixed-points (or equilibrium points for the logistic equation) given by
1.3 Banach Spaces
In the rest of this book, unless otherwise stated, \(\mathbb F\) stands for the field of real numbers \(\mathbb R\) or the field of complex numbers \(\mathbb C\).
1.3.1 Basic Definitions
Let \(\mathbb X\) be a vector space over the field \(\mathbb F\). A norm on \(\mathbb X\) is a mapping \(\|\cdot \|: \mathbb X \mapsto \mathbb R_+\) satisfying the following properties,
-
i)
∥x∥ = 0 if and only if x = 0;
-
ii)
∥λx∥ = |λ| ∥x∥; and
-
iii)
∥x + y∥≤∥x∥ + ∥y∥
for all \(x, y\in \mathbb X\) and \(\lambda \in \mathbb F\).
The pair \((\mathbb X, \|\cdot \|)\) is then called a normed vector space. From a given normed vector space \((\mathbb X, \|\cdot \|)\), one can construct a metric space \((\mathbb X, d)\), where the metric d is defined by d(x, y) := ∥x − y∥ for all \(x, y \in \mathbb X\).
In what follows, we introduce two important notions which play a crucial role in many fields: separable and uniformly convex normed vector spaces. For that, we need to introduce a few notions.
Definition 1.31
Let \((\mathbb X, \|\cdot \|)\) be a normed vector space and let \({\mathcal D}\subset \mathbb X\) be a subset. Then, \({\mathcal D}\) is said to be dense, if \(\mathbb X = \overline {{\mathcal D}}\). Equivalently, for each \(x \in \mathbb X\), there exists a sequence \((x_n)_{n \in \mathbb N} \subset {\mathcal D}\) such that d(x n, x) = ∥x n − x∥→ 0 as n →∞.
Definition 1.32
A set \({\mathcal D}\) is called countable if it is finite or has the same cardinality as \(\mathbb N\) (i.e., there exists a bijection between \({\mathcal D}\) and \(\mathbb N\)). A set \({\mathcal D}\) is called uncountable if it is infinite and not countable.
Definition 1.33
A normed vector space \((\mathbb X, \|\cdot \|)\) is said to be separable if it contains a countable dense subset \({\mathcal D}\).
Classical examples of separable normed vector spaces include, but are not limited to, \((\mathbb R, |\cdot |)\), \((\mathbb C, |\cdot |)\), and \((\ell ^p(\mathbb N), \|\cdot \|{ }_p)\) for 1 ≤ p < ∞, where \(\ell ^p(\mathbb N)\) is vector space consisting of all sequences \(x = (x_n)_{n \in \mathbb N}\) with \(x_n \in \mathbb C\) for all \(n \in \mathbb N\) and
It is also well known that \(\ell ^\infty (\mathbb N)\), the vector space of all bounded sequences (the vector space consisting of all sequences \(x = (x_n)_{n \in \mathbb N}\) with \(x_n \in \mathbb C\) for all \(n \in \mathbb N\) such that |x n|≤ M for all \(n \in \mathbb N\) with M ≥ 0 being a constant), is not separable, when it is equipped with its natural sup-norm ∥⋅∥∞ defined, for each \(x = (x_n)_{n \in \mathbb N} \in \ell ^\infty (\mathbb N)\), by
Definition 1.34
A normed vector space \((\mathbb X, \|\cdot \|)\) is said to be uniformly convex if for each ε > 0 there exists δ > 0 such that
While the normed vector spaces \((\mathbb R, |\cdot |)\), \((\mathbb C, |\cdot |)\), and \((\ell ^p(\mathbb N), \|\cdot \|{ }_p)\) for 1 < p < ∞ are uniformly convex, \(\ell ^\infty (\mathbb N)\) is not.
Definition 1.35
A normed vector space \((\mathbb X, \|\cdot \|)\) is said to be a Banach space if the metric space \((\mathbb X, d)\), where d(x, y) := ∥x − y∥ for all \(x, y \in \mathbb X\), is complete.
Classical examples of Banach spaces include finite-dimensional normed vector spaces. Obviously, there are plenty of normed vector spaces which are not Banach spaces (not complete). Indeed, for a < b, consider C[a, b], the collection of all continuous functions \(f: [a, b] \mapsto \mathbb R\). It is clear that C[a, b] is a vector space over \(\mathbb R\) which we equip with the norm given, for p ∈ [1, ∞), by
It is not hard to see that the normed vector space (C[a, b], ∥⋅∥p) is not complete.
Obviously, Banach spaces are more interesting for applications than incomplete normed vector spaces. Consequently, in what follows, our main focus will be on Banach spaces and their basic properties.
The proof of the next theorem presents no difficulty and hence is left to the reader as an exercise.
Theorem 1.36
If \((\mathbb X, \|\cdot \|)\) is a Banach space and if \(\mathbb Y \subset \mathbb X\) is a subspace, then \((\mathbb Y, \|\cdot \|)\) is a Banach space if and only if \(\mathbb Y\) is closed.
1.3.2 The Quotient Space
Definition 1.37
Let \(\mathbb L\) be a subspace of the vector space \(\mathbb X\). The cosets of \(\mathbb L\) are defined by the sets, \([x] = x + \mathbb L = \{x+\ell : \ell \in \mathbb L\}.\)
Define the quotient \(\mathbb X \setminus \mathbb L\) as follows \(\mathbb X \setminus \mathbb L = \{[x]: x \in \mathbb X\}.\) The canonical projection of \(\mathbb X\) onto \(\mathbb X \setminus \mathbb L\) is defined by \(\pi : \mathbb X \mapsto \mathbb X \setminus \mathbb L\), x↦[x]. It is not hard to see that π is surjective and that \(Ker (\pi ) = \{x \in \mathbb X: \pi (x) = 0\} = \mathbb L\). Further, \(\mathbb X \setminus \mathbb L\) is a vector space over \(\mathbb F\) ([x + y] = [x] + [y] and [λx] = λ[x] for all \(x, y \in \mathbb X\) and \(\lambda \in \mathbb F\)) called the quotient space. Furthermore, the mapping \(\|\cdot \|: \mathbb X \setminus \mathbb L \mapsto [0, \infty )\) defined by
for each \([x] \in \mathbb X \setminus \mathbb L\), is a norm on the quotient vector space \(\mathbb X \setminus \mathbb L\). Indeed, for all \(x, y \in \mathbb X\) and \(\lambda \in \mathbb F\), using the fact that \(\mathbb L\) is closed, we have \(d(x, \mathbb L) = 0\) if and only if \(x \in \mathbb L\). Thus \(\|[x]\| = \|x + \mathbb L\| = 0\) if and only if \([x] = x + \mathbb L= 0 + \mathbb L\).
Suppose λ ≠ 0. Then, we have
Now, if λ = 0, it easily follows that
Now let \(x_1, y_1 \in \mathbb L\), then
which yields
That is,
Recall that if \(\mathbb L\) a closed subspace of the normed vector space \((\mathbb X, \|\cdot \|)\), then the canonical projection π is linear and continuous as ∥π(x)∥≤∥x∥ for all \(x \in \mathbb X\).
Theorem 1.38
Let \((\mathbb X, \|\cdot \|)\) be Banach space and let \(\mathbb L\subset \mathbb X\) is a closed subspace. Then the quotient normed vector space \((\mathbb X \setminus \mathbb L, \|\cdot \|)\) is a Banach space.
1.3.3 Lp(Ω, μ) Spaces
Definition 1.39
Let Ω be a set and let \({\mathcal F}\) be a σ-algebra of measurable sets, that is, \({\mathcal F} \subset {\mathcal P} ({\varOmega })\) is a subset and satisfies the following conditions:
-
i)
\(\emptyset \in {\mathcal F}\);
-
ii)
if \(\varGamma \subset {\mathcal F}\), then its complement Γ C belongs to \({\mathcal F}\); and
-
iii)
\(\displaystyle \bigcup _{j=1}^\infty \varGamma _j \in {\mathcal F}\) whenever \(\varGamma _j \in {\mathcal F}\) for all j.
Elements of \({\mathcal F}\) are then called measurable sets.
Definition 1.40
The mapping μ : Ω↦[0, ∞] is called a measure, if it satisfies the following conditions:
-
i)
μ(∅) = 0; and
-
ii)
\(\displaystyle \mu (\bigcup _{j=1}^\infty \varGamma _j) = \sum _{j=1}^\infty \mu (\varGamma _j)\) for any disjoint countable family \((\varGamma _j)_{j \in \mathbb N}\) of elements of \({\mathcal F}\).
The σ-algebra \({\mathcal F}\) is said to be σ-finite if there exists a disjoint countable family \((\varGamma _j)_{j\in \mathbb N}\) of elements of \({\mathcal F}\) such that \(\displaystyle \varOmega = \bigcup _{j=1}^\infty \varGamma _j\) with μ(Ω j) < ∞ for all j.
The space \((\varOmega , {\mathcal F}, \mu )\) is called a measure space if Ω is a set, \({\mathcal F}\) is a σ-algebra on Ω, and μ is a measure on Ω.
Let p ∈ [1, ∞) and let \((\varOmega , {\mathcal F}, \mu )\) be a measure space. The space L p(Ω, μ) (also denoted L p(Ω)) is defined as the collection of all measurable functions \(f: \varOmega \mapsto \mathbb C\) such that
Recall that in L p(Ω, μ), two functions f and g are equal, if they are equal almost everywhere on Ω. Obviously, (L p(Ω, μ), ∥⋅∥p) is a normed vector space.
Similarly, one defines L ∞(Ω, μ) (also denoted L ∞(Ω)) as the set of all measurable functions \(f: \varOmega \mapsto \mathbb C\) such that there exists a constant M ≥ 0 such that |f(x)|≤ M a.e. x ∈ Ω. Now define, for all f ∈ L ∞(Ω, μ),
Obviously, (L ∞(Ω, μ), ∥⋅∥∞) is a normed vector space.
In view of the above, (L p(Ω), ∥⋅∥p) is a normed vector space for all p ∈ [1, ∞].
Example 1.41
Take \(\varOmega = \mathbb R\) and dμ = dx (Lebesgue measure). Consider the function defined by f(x) = e −|x| for all \(x \in \mathbb R\). One can easily see that \(f \in L^p (\mathbb R, dx)\) for all p ∈ [1, ∞].
Proposition 1.42
Let (Ω, μ) be a measure space. If 1 ≤ p ≤ q < ∞ and if 0 < μ(Ω) < ∞, then
for any measurable function f, where r = p −1 − q −1.
Proof
The proof is left to the reader as an exercise.
Corollary 1.43
Let (Ω, μ) be a measure space. If 1 ≤ p ≤ q < ∞ and if 0 < μ(Ω) < ∞, then the injection
is continuous.
Proof
The proof is left to the reader as an exercise.
Theorem 1.44 (Riesz-Fisher)
The space (L p(Ω, μ), ∥⋅∥p) is a Banach space for any 1 ≤ p ≤∞.
Proof
The proof is left to the reader as an exercise.
Theorem 1.45
Let p ∈ [1, ∞]. If \(u \in L^1(\mathbb R)\) and let \(v \in L^p(\mathbb R^n)\) , then for almost every \(x \in \mathbb R^n\) , the function y↦u(x − y)v(y) is integrable on \(\mathbb R^n\) . And the convolution of u and v defined by
is well defined. Further, \(u \ast v \in L^p(\mathbb R^n)\) and
Proof
See the book by Brézis [31].
More generally,
Theorem 1.46 (Young)
Let p, q ∈ [1, ∞] such that r −1 = p −1 + q −1 − 1 ≥ 0. If \(u \in L^p(\mathbb R^n)\) and let \(v \in L^q(\mathbb R^n)\) , then \(u \ast v \in L^r(\mathbb R^n)\) and
Proof
See the book by Brézis [31].
Let \(\varOmega \subset \mathbb R^n\) be a subset and let dμ = dx (Lebesgue measure). Let \(L_{loc}^p(\varOmega )\) for 1 ≤ p < ∞ stand for the collection of all measurable functions \(f: \varOmega \mapsto \mathbb C\) such that
for any Ω′⊂ Ω bounded closed subset.
Clearly, \(L_{loc}^p(\varOmega )\) is a vector space. Further, L p(Ω) is a subspace of \(L_{loc}^p(\varOmega )\). The natural topology of \(L_{loc}^p(\varOmega )\) is given as follows: a sequence \((f_n)_{n \in \mathbb N} \in L_{loc}^p(\varOmega )\) is said to converge to some \(f \in L_{loc}^p(\varOmega )\) if ∥f n − f∥p → 0 as n →∞ in L p(Ω′) for any Ω′⊂ Ω bounded closed subset. Although \(L_{loc}^p(\varOmega )\) equipped with such a type of convergence is a topological vector space, it is not a Banach space.
1.3.4 Sobolev Spaces
If α = (α 1, α 2, ..., α n) with \(\alpha _i \in \mathbb Z_+\) for i = 1, ..., n, one defines the length |α| of α as follows: |α| = α 1 + α 2 + ... + α n. In this event, the differential operator D α is defined by
Definition 1.47
Let 1 ≤ p ≤∞ and let \(k \in \mathbb N\). Suppose \(\varOmega \subset {\mathbb R^n}\) is an open subset. The Sobolev spaces W k, p(Ω) is the collection of all functions \(u: \varOmega \mapsto \mathbb F\) belonging to the set
If p = 2, the Sobolev space W k, 2(Ω) is denoted by H k(Ω). The space W k, p(Ω) is equipped with the norm defined by
and
Definition 1.48
If \(f: \varOmega \mapsto \mathbb F\) is a function, then its support denoted supp(f) is defined by \(Supp(f) \;:= \overline {\{x\in \varOmega : f(x) \neq 0\}}.\)
Example 1.49
If \(E \subset \mathbb R\) is a subset, then the support of the characteristic function χ E of the set E defined by χ E(x) = 1 if x ∈ E and χ E(x) = 0 if x∉E, is \(\overline {E}\) (closure of E).
Definition 1.50
The notation \(C_0^{\infty }(\varOmega )\) stands for the collection of all functions \(u: \varOmega \mapsto \mathbb R\) (or \(\mathbb C\)) of class C ∞ with compact support in Ω.
Definition 1.51
The Sobolev space \(W_0^{k,p}(\varOmega )\) is defined to be the closure of \(C_0^{\infty }(\varOmega )\) in the space W k, p(Ω), that is,
If p = 2, then the Sobolev space \(W_0^{k,2}(\varOmega )\) is denoted by \(H_0^k(\varOmega )\). Further, if \(\varOmega = \mathbb R^n\), then \(W_0^{k,p}(\mathbb R^n) = W^{k,p}(\mathbb R^n)\).
Theorem 1.52 ([47])
The Sobolev space W k, p(Ω) is a Banach space.
Definition 1.53
Let 1 ≤ p < ∞ and let s = k + σ where \(k \in \mathbb N\) and σ ∈ (0, 1). The Sobolev space W s, p(Ω) is defined by
whose norm is given by
If p = 2, then W s, 2(Ω) is denoted H s(Ω). If s ≥ 0, then we define \(W_{0}^{s,p}(\varOmega )\) to be the closure of the space \(C_0^\infty (\varOmega )\) in the Sobolev space W s, p(Ω). In particular, \(W_0^{s,2}(\varOmega )\) is denoted by \(H_{0}^s(\varOmega )\).
1.3.5 Embedding Theorems for Sobolev Spaces
In this subsection, we collect some important and useful embedding theorems on Sobolev spaces including the Sobolev–Gagliardo–Nirenberg’s Theorem and the Poincaré’s Theorem. Although most of the proofs of these theorems are omitted, we will be referring the reader to the appropriate references.
Theorem 1.54 (Sobolev–Gagliardo–Nirenberg)
Let p ∈ [1, n). Then the following embedding holds,
where \(\displaystyle \frac {1}{q} = \frac {1}{p}- \frac {1}{n}\) . Moreover, there exists a constant C(p, n) such that
for all \(u \in W^{1,p}(\mathbb R^n)\) , where the gradient ∇u of u is defined by the vector
Proof
See Brézis [32].
Theorem 1.55
Let \(k \in \mathbb N\) and let p ∈ [1, ∞). Then the following continuous embeddings hold,
Proof
See Brézis [32].
Theorem 1.56
Let \(\varOmega \subset \mathbb R^n\) be an open subset of class C 1 with bounded boundary ∂Ω. If p ∈ [1, ∞], we have the following continuous embeddings,
Proof
See Brézis [32].
We also have
Theorem 1.57 (Rellich–Kondrachov)
Let \(\varOmega \subset \mathbb R^n\) be a bounded subset and of class C 1 . If p ∈ [1, ∞], we have the following compact embeddings,
Proof
See Brézis [32].
We also have the so-called Poincaré’s inequality.
Theorem 1.58 (Poincaré’s Inequality)
Let p ∈ [1, ∞) and let \(\varOmega \subset \mathbb R^n\) be an open bounded subset. Then there exists a constant C = C(Ω, p) > 0 such that
for all \(u \in W_{0}^{1,p} (\varOmega )\).
Proof
See Brézis [32].
Theorem 1.59
Let \(\varOmega \subset \mathbb R^n\) be an open bounded subset with a smooth boundary ∂Ω. Suppose 0 ≤ k ≤ m − 1. Then, we have the following embeddings,
While the second embedding is compact, the first one is compact only if
Proof
See Adams [10].
1.3.6 Bounded Continuous Functions
Let \(J \subset \mathbb R\) be an interval (possibly unbounded) and let \((\mathbb X, \|\cdot \|)\) be a Banach space.
Definition 1.60
Let \(BC(J; \mathbb X)\) denote the space of all bounded continuous functions \(f: J \mapsto \mathbb X\). The space \(BC(J; \mathbb X)\) will be equipped with the sup-norm defined by
for all \(f \in BC(J; \mathbb X)\).
Theorem 1.61
The normed vector space \((BC(J; \mathbb X), \|\cdot \|{ }_\infty )\) is a Banach space.
Proof
The proof is left to the reader as an exercise.
Proposition 1.62
If \((f_n)_{n \in \mathbb N} \subset BC(J; \mathbb X)\) such that f n converges to some f with respect to the sup-norm, then \(f \in BC(J; \mathbb X)\).
Proof
The proof is left to the reader as an exercise.
1.3.7 Hölder Spaces \(C^{k,\alpha }(\overline {\varOmega })\)
Fix once and for all α ∈ (0, 1). Let \(J \subset \mathbb R\) be an interval (possibly unbounded) and let \((\mathbb X, \|\cdot \|)\) be a Banach space.
The space \(C^m(J; \mathbb X)\) (\(m \in \mathbb N\)) stands for the collection of all m-times continuously differentiable functions from J into \(\mathbb X\) and let \(BC^m(J; \mathbb X)\) stand for the space,
equipped with the norm
Definition 1.63
Hölder spaces of continuous functions \(C^{0, \alpha } (J; \mathbb X)\) and \(C^{k, \alpha }(J; \mathbb X)\) for α ∈ (0, 1) and \(k \in \mathbb N\) are respectively defined by
equipped with the norm
equipped with the norm
Proposition 1.64
The Hölder spaces \(C^{0, \alpha } (J; \mathbb X)\) and \(C^{k, \alpha }(J; \mathbb X)\) for α ∈ (0, 1) and \(k \in \mathbb N\) equipped with their corresponding norms are respectively Banach spaces.
Proof
The proof is left to the reader as an exercise.
Definition 1.65
The Lipschitz space \(Lip(J; \mathbb X)\) is defined by
and is equipped with the norm defined by
Proposition 1.66
The Lipschitz space \((Lip(J; \mathbb X), \|\cdot \|{ }_{\tilde {Lip}(J; \mathbb X)} )\) is a Banach space.
Proof
The proof is left to the reader as an exercise.
Definition 1.67
Let \(\varOmega \subset \mathbb R^N\) be an open subset. The Hölder space \(C_b^{0, \alpha }(\overline {\varOmega })\) consists of all bounded continuous functions \(f: \overline {\varOmega } \mapsto \mathbb C\) such that
Theorem 1.68
The space \((C_b^{0, \alpha }(\overline {\varOmega }), \|\cdot \|{ }_{C_b^{0, \alpha }(\overline {\varOmega })})\) is a Banach space, where the norm \( \|\cdot \|{ }_{C_b^{0, \alpha }(\overline {\varOmega })}\) is defined by
for all \(f\in C_b^{0, \alpha }(\overline {\varOmega })\).
Proof
The proof is left to the reader as an exercise.
Definition 1.69
Let \(k \in \mathbb N\). The Hölder space \(C_b^{k,\alpha }(\overline {\varOmega })\) consists of all functions \(f: \overline {\varOmega } \mapsto \mathbb C\) which are k-times continuously differentiable functions with bounded partial derivatives such that \(D^\beta f\in C_b^{0, \alpha }(\overline {\varOmega })\) for any multi-index β with |β| = k.
Theorem 1.70
The space Hölder space \((C_b^{k,\alpha }(\overline {\varOmega }), \|\cdot \|{ }_{C_b^{k,\alpha }(\overline {\varOmega })})\) is a Banach space, where the norm \(\|\cdot \|{ }_{C_b^{k,\alpha }(\overline {\varOmega })}\) is defined by
Proof
The proof is left to the reader as an exercise.
Remark 1.71
Note that the subscript “b” in \(C_b^{k,\alpha }(\overline {\varOmega })\) should be dropped in the case when the domain Ω is bounded. In other words, if Ω is bounded, then \(C_b^{k,\alpha }(\overline {\varOmega })\) will be denoted \(C^{k,\alpha }(\overline {\varOmega })\).
1.3.8 Embedding Theorems for Hölder Spaces
Theorem 1.72
Let \(\varOmega \subset \mathbb R^n\) be an open bounded subset with a smooth boundary ∂Ω. Suppose 0 ≤ k ≤ m − 1. Then, we have the following compact embedding,
Proof
See Adams [10].
Theorem 1.73
Let \(\varOmega \subset \mathbb R^n\) be an open subset with a smooth boundary ∂Ω. Suppose \(m \in \mathbb Z_+\) and α, β are given such that 0 < α < β ≤ 1. Then, we have the following embeddings,
If Ω is bounded, then embeddings,
are compact.
Proof
See Adams and Fournier [11].
1.3.9 The Dual Space
Let \((\mathbb X, \|\cdot \|)\) be a normed vector space over the field \(\mathbb F\). A functional \(\xi : \mathbb X \mapsto \mathbb F\), ξ↦ξ(x) (ξ(x) is also denoted 〈ξ, x〉), is said to be linear if it satisfies the following identity, ξ(λx + μy) = λξ(x) + μξ(y) for all \(\lambda , \mu \in \mathbb F\) and for all \(x, y \in \mathbb X\).
A functional \(\xi : \mathbb X \mapsto \mathbb F\) is called continuous, if there exists a constant K ≥ 0 such that
for all \(x \in \mathbb X\).
The collection of all linear continuous functionals \(\xi : \mathbb X \mapsto \mathbb F\), which we denote by \(\mathbb X^\ast \), is called the (topological) dual of \(\mathbb X\). Clearly, \(\mathbb X^\ast \) is a vector space over \(\mathbb F\) as we can add elements of \(\mathbb X^\ast \) up and multiply them by scalars and still get continuous linear functionals. One can endow the dual \(\mathbb X^\ast \) of \(\mathbb X\) with a norm which we denote by ∥⋅∥∗ and which is defined as follows: the norm ∥ξ∥∗ of \(\xi \in \mathbb X^\ast \), is the smallest constant K satisfying Eq. (1.5). Consequently,
Using the definition of the norm ∥⋅∥∗, it easily follows that |〈ξ, x〉|≤∥ξ∥∗∥x∥ for all \(\xi \in \mathbb X^*\) and \(x \in \mathbb X\). Furthermore,
Theorem 1.74
The normed vector space \((\mathbb X^\ast , \|\cdot \|{ }_\ast )\) is a Banach space.
Proof
The proof is left to the reader as an exercise.
1.3.10 The Schauder Fixed-Point Theorem
Definition 1.75
A nonempty set S is said to be convex if for all x, y ∈ S and λ ∈ [0, 1], then λx + (1 − λ)y ∈ S.
Theorem 1.76 (The Brouwer Fixed-Point Theorem [76])
Let \(S \subset \mathbb F^n\) be a nonempty bounded closed convex subset. If the mapping T : S↦S is continuous, then T has at least one fixed-point, that is, F T ≠ ∅.
Theorem 1.77 (The Schauder Fixed-Point Theorem [76])
Let \(\mathbb X\) be a Banach space and let \(S \subset \mathbb X\) be a nonempty compact convex subset. If the mapping T : S↦S is continuous, then T has at least one fixed-point, that is, F T ≠ ∅.
The following concept which measures the “non-compactness” is due to Kuratowski [78].
Definition 1.78
If \(D\subset \mathbb X\) is a bounded subset, one defines the measure α(D) of non-compactness of D as follows:
Definition 1.79
Let \(D \subset \mathbb X\) be a subset. Suppose that the map \(P: D \mapsto \mathbb X\) is continuous. The map P is called condensing if for any bounded subset D′ of D, α(D′) > 0 yields
We have the following generalization of the Schauder’s fixed-point due to Sadovsky.
Theorem 1.80 (The Sadovsky Fixed-Point Theorem [76])
Let D be a nonempty convex, bounded, and closed subset of a Banach space \(\mathbb X\) and F : D → D be a condensing map. Then F has a fixed point in D.
Proof
The proof makes use of the Schauder’s fixed point theorem (Theorem 1.77). Indeed, fix x ∈ D and let Γ be the set of all closed convex subsets C of D such that x ∈ C and F maps C into itself.
Set
where Conv denotes the convex envelop and \(\overline {Conv}\) its closure.
Using the fact that x ∈ Ω and that F maps Ω into itself yields one must have K ⊆ Ω, which, in turn, yields F(K) ⊆ F(Ω) ⊆ Ω. Now from x ∈ K it follows that K ∈ Γ. Consequently, Ω ⊆ K which yields Ω = K, which, in turn, yields F(K) = F(Ω) ⊆ K and, therefore,
Using the fact that F is condensing it follows that α(K) = 0, which yields Ω is compact. Consequently, F is a continuous function which maps a convex compact set K into itself and so using Schauder theorem it follows that F has a fixed point.
1.3.11 Leray-Schauder Alternative
We will need the following fixed-point theorem in the sequel.
Theorem 1.81 (Leray-Schauder Alternative [61, Theorem 6.5.4])
Let D be a closed convex subset of a Banach space \(\mathbb X\) with 0 ∈ D. Let G : D → D be a completely continuous map. Then, either G has a fixed point in D or the set
is unbounded.
1.4 Hilbert Spaces
Hilbert spaces play an important role in many areas including mathematical analysis, physics, quantum mechanics, Fourier analysis, partial differential equations, etc. These spaces, which generalize in a natural fashion the Euclidean space, are named after the German mathematician David Hilbert who introduced them in the mathematical literature. Obviously, a Hilbert space is, by design, a Banach space. Some of their basic properties will be discussed in this section and throughout the entire book. For the uncovered material on Hilbert spaces, we refer the interested reader to some of the classical books in functional analysis, i.e., Brézis [31, 32], Conway [38], Eidelman et al. [53], Naylor and Sell [93], etc.
1.4.1 Basic Definitions
In this section, \(\mathcal H\) stands for a vector space over the field \(\mathbb F\) where \(\mathbb F = (\mathbb R, |\cdot |)\) or \((\mathbb C, |\cdot |)\).
Definition 1.82
A mapping \(a: \mathcal H \times \mathcal H \mapsto \mathbb F\) is said to be a sesquilinear form, if
-
i)
the mapping x↦a(x, y) is linear for all \(y \in \mathcal H\), that is,
$$\displaystyle \begin{aligned} a(\lambda x + \mu x', y) = \lambda a(x,y) + \mu a(x', y) \end{aligned}$$for all \(x, x', y \in \mathcal H\); and
-
ii)
the mapping y↦a(x, y) is anti-linear for all \(x\in \mathcal H\), that is,
$$\displaystyle \begin{aligned} a(x, \lambda y + \mu y') = \overline{\lambda} a(x,y) + \overline{\mu} a(x, y') \end{aligned}$$for all \(x, y, y' \in \mathcal H\).
Definition 1.83
An inner product or scalar product 〈⋅, ⋅〉 on \(\mathcal H\) is a sesquilinear form which goes from \(\mathcal H \times \mathcal H\) into \(\mathbb F\) and satisfies:
-
i)
〈x, x〉≥ 0 for all \(x \in \mathcal H\);
-
ii)
〈x, x〉 = 0 if and only if x = 0; and
-
iii)
\(\langle y, x\rangle = \overline {\langle x, y\rangle }\) for all \(x, y \in \mathcal H\).
Recall that if 〈⋅, ⋅〉 is an inner product on \(\mathcal H\), then the mapping \(\|\cdot \|: \mathcal H \mapsto \mathbb R_+\) defined by \(\|x\|:= [\langle x,x\rangle ]^{\frac {1}{2}}\) for all \(x \in \mathcal H\) is a norm on \(\mathcal H\); called the norm deduced from the inner product 〈⋅, ⋅〉. Recall also that the norm ∥⋅∥ satisfies various properties including the so-called Cauchy-Schwarz inequality and the parallelogram identity given respectively by,
and
for all \(x, y \in \mathcal H\).
While the proof of the Cauchy-Schwarz inequality requires more efforts, that of the parallelogram identity is easy and is based upon the following identities:
and
for all \(x, y \in \mathcal H\).
Let \(\mathcal H\) be a vector space over \(\mathbb F\) equipped with the inner product given by, 〈⋅, ⋅〉. Two vectors \(x, y \in \mathcal H\) are said to be orthogonal if 〈x, y〉 = 0. If 〈x, y〉 = 0, then the Pythagorean theorem holds in \(\mathcal H\), that is,
More generally, if \((x_n)_{n \in \mathbb N} \subset \mathcal H\) is a sequence such that 〈x n, x m〉 = 0 for all \(n, m \in \mathbb N\) with n ≠ m, then the series \(\sum _{n=1}^\infty x_n\) converges if and only if \(\sum _{n=1}^\infty \|x_n\|{ }^2\) does. In that event, we have the following generalized Pythagorean theorem,
If \(M \subset \mathcal H\) is a subspace, then its orthogonal M ⊥ is defined by
Proposition 1.84
If \(M \subset \mathcal H\) is a subspace, then its orthogonal M ⊥ is a closed subspace of \(\mathcal H\).
Proof
Let \((x_n)_{n \in \mathbb N} \subset M^{\perp }\) be a sequence such that ∥x n − x∥→ 0 as n →∞ for some \(x \in \mathcal H\). Now, using the fact that, 〈x, y〉 = 〈x − x n + x n, y〉 = 〈x − x n, y〉 + 〈x n, y〉 = 〈x − x n, y〉 for all y ∈ M and \(n \in \mathbb N\), it follows, using the Cauchy-Schwarz inequality, that |〈x, y〉|≤∥x − x n∥ . ∥y∥→ 0 as n →∞, which yields 〈x, y〉 = 0, and hence x ∈ M ⊥.
Note that in addition to Proposition 1.84, we also have \(\mathcal H^\perp = \{0\}\) and \(\{0\}^\perp = \mathcal H\). Further, if M ⊂ N where M, N are subspaces of \(\mathcal H\), then N ⊥⊂ M ⊥ and M ⊂ (M ⊥)⊥ with (M ⊥)⊥ = M if M is closed.
Definition 1.85
The space \(\mathcal H\) is said to be a Hilbert space if \((\mathcal H, \|\cdot \|)\) is complete where ∥⋅∥ is the norm deduced from the inner product 〈⋅, ⋅〉.
One of the most important properties of Hilbert spaces is that of the projection theorem. It plays an important role in many areas.
Theorem 1.86
Let \(\mathcal H\) be a Hilbert space and let \(\varSigma \subset \mathcal H\) be a nonempty closed convex subset. For each \(x \in \mathcal H\) , there exists a unique point P Σ(x) belonging to Σ and called the orthogonal projection of x onto Σ that satisfies the identity
An immediate consequence of Theorem 1.86 is that if \(\mathbb L \subset \mathcal H\) is a closed subspace, then \(\mathcal H\) can be written as the direct sum of \(\mathbb L\) and \(\mathbb L^\perp \) as follows: \(\mathcal H = \mathbb L \oplus \mathbb L^\perp .\) This means that each \(x \in \mathcal H\) can be uniquely written as \(x = (x-P_{\mathbb L}(x)) + P_{\mathbb L}(x)\) where \(P_{\mathbb L}(x) \in \mathbb L\) and \(x-P_{\mathbb L}(x) \in \mathbb L^\perp \). The mapping \(P_{\mathbb L}: \mathcal H \mapsto \mathcal H\), \(x \mapsto P_{\mathbb L}(x)\), is called the orthogonal projection of \(\mathcal H\) onto \(\mathbb L\).
1.4.2 Examples of Hilbert Spaces
Classical examples of Hilbert spaces include, but are not limited to, \(\mathbb R^n\), \(\mathbb C^n\), L 2(Ω), and \(\ell ^2(\mathbb N)\), equipped with their natural inner products. Other examples of Hilbert spaces include Sobolev spaces H k(Ω). Take k = 1, \(\varOmega = \mathbb R\), and consider H 1 defined by
Recall that f′ appearing in the definition of \(H^1(\mathbb R)\) is the derivative of f in the sense of distributions. The inner product on \(H^1(\mathbb R)\) and its corresponding norm are given respectively by
and
for all \(f, g \in H^1(\mathbb R)\).
1.5 Exercises
-
1.
Let \((\mathbb X, d)\) be a metric space. Show that
$$\displaystyle \begin{aligned} |d(x, y) - d(y, z)| \leq d(x, z) \end{aligned}$$for all \(x, y, z \in \mathbb X\).
-
2.
Let d 0 and d 1 be two metrics on a nonempty set \(\mathbb X\). Show that mapping d defined by, d = αd 0 + βd 1 (d(x, y) = αd 0(x, y) + βd 1(x, y) for all \(x, y \in \mathbb X\)) where α, β ≥ 0 and α + β = 1, is a metric on \(\mathbb X\).
-
3.
Prove Proposition 1.14.
-
4.
Prove Theorem 1.19.
-
5.
Let a, b > 0 and let f be the function given for all \(x \in \mathbb R^n\) by
$$\displaystyle \begin{aligned} f(x) = \frac{1}{(1 + \|x\|{}^a) (1 + (\ln\|x\|)^b)}, \end{aligned}$$where ∥⋅∥ denotes the Euclidean norm of \(\mathbb R^n\). Find conditions under which \(f \in L^p(\mathbb R^n)\).
-
6.
Show that the Lebesgue space L p(Ω) is vector space for all p ∈ [1, ∞].
-
7.
Prove Theorem 1.44.
-
8.
Let C[a, b] the collection of all continuous functions \(f: [a, b] \mapsto \mathbb R\).
-
a)
Show that C[a, b] is a vector space over \(\mathbb R\).
-
b)
Show that C[a, b] equipped with the norm given, for p ∈ [1, ∞), by
$$\displaystyle \begin{aligned} \|f\|{}_p := \Big(\int_{a}^b |f(t)|{}^p dt \Big)^{\frac{1}{p}} \ \ \mbox{for all} \ \ f \in C[a,b], \end{aligned}$$is not a Banach space.
Hint: Construct a Cauchy sequence in (C[a, b], ∥⋅∥p) which does not converge.
-
a)
-
9.
Consider the function \(f: \mathbb R^n \mapsto \mathbb R\) defined by f(x) = ∥x∥−α where ∥⋅∥ is the Euclidean norm of \(\mathbb R^n\) and \(\alpha \in \mathbb R\). Show that \(f \in L^1_{loc}(\mathbb R^n)\) if and only if α < n.
-
10.
Let 1 < p ≤∞ and let 1 ≤ q < ∞ be such that p −1 + q −1 = 1. Show that the (topological) dual of L p(Ω) is L q(Ω).
-
11.
Let s ≥ 0 and 1 ≤ p < ∞. Suppose q is such that p −1 + q −1 = 1. Show that the (topological) dual of \(W_0^{s,p}(\varOmega )\) is the Sobolev space W −s, q(Ω).
-
12.
Prove Theorem 1.64.
-
13.
Prove Theorem 1.66.
-
14.
Prove Theorem 1.68.
-
15.
Prove Theorem 1.70.
-
16.
Prove Theorem 1.74.
-
17.
Prove Theorem 1.86.
1.6 Comments
Some of the basic materials of Sects. 1.3.3, 1.3.4, 1.3.6, and 1.3.10 are taken from the following sources: Adams [10], Adams and Fournier [11], Brézis [31, 32] and Diagana [47]. The material covered in Sects. 1.3.7 and 1.3.9 are partially taken from Bezandry and Diagana [29] and Lunardi [87]. For additional readings upon L p spaces and Sobolev spaces W k, p we refer the reader to Adams [10], Adams and Fournier [11], and Brézis [32]. For additional readings upon basic functional analysis and real analysis, we refer to Conway [38], Diagana [45], Eidelman et al. [53], Kato [73], Rudin [102], Weidmann [111], and Yosida [112].
The proof of Theorem 1.80 follows Khamsi and Kirk [76, Proof of Theorem 7.12, pages 190–191].
For additional references on metric and normed vector spaces, we refer to the book by Oden and Demkowicz [97]. For additional readings on fixed-point theory, we refer to the book by Khamsi and Kirk [76].
References
R.A. Adams, Sobolev Spaces. Pure and Applied Mathematics, vol. 65 (Academic Press, New York, 1975)
R.A. Adams, J.J.F. Fournier, Sobolev Spaces. Pure and Applied Mathematics (Amsterdam), vol. 140, 2nd edn. (Elsevier/Academic Press, Amsterdam, 2003)
H. Bohr, Almost Periodic Functions (Chelsea Publishing Company, New York, 1947)
H. Brézis, Analyse fonctionnelle: théorie et applications, 2nd edn. (Masson, 1987)
H. Brézis, Functional Analysis, Sobolev Spaces and Partial Differential Equations (Springer, New York, 2011)
J.B. Conway, A Course in Functional Analysis (Springer, New York, 1985)
T. Diagana, Pseudo-Almost Periodic Functions in Banach Spaces (Nova Science Publishers, New York, 2007)
T. Diagana, Almost Automorphic Type and Almost Periodic Type Functions in Abstract Spaces, vol. XIV (Springer, New York, 2013), 303 p.
Y. Eidelman, V. Milman, A. Tsolomitis, Functional Analysis: An Introduction. Graduate Studies in Mathematics, vol. 66 (American Mathematical Society, Providence, 2004)
A. Granas, J. Dugundji, Fixed Point Theory (Springer, New York, 2003)
J.K. Hunter, B. Nachtergaele, Applied Analysis (World Scientific Publishing, River Edge, 2001)
T. Kato, Perturbation Theory for Linear Operators, 2nd edn. (Springer, Berlin, 1976)
M.A. Khamsi, W.A. Kirk, An Introduction to Metric Spaces and Fixed Point Theory. Pure and Applied Mathematics (New York) (Wiley-Interscience, New York, 2001)
C. Kuratowski, Sur les espaces complètes. Fund. Math. 15, 301–309 (1930)
A. Lunardi, Analytic Semigroups and Optimal Regularity in Parabolic Problems. Progress in Nonlinear Differential Equations and their Applications, vol. 16 (Birkhäuser, Basel, 1995)
A.W. Naylor, G.R. Sell, Linear Operator Theory in Engineering and Science. Applied Mathematical Sciences, vol. 40 (Springer, New York, 1982)
J.T. Oden, L.F. Demkowicz, Applied Functional Analysis, 2nd edn. (CRC Press, Boca Raton, 2010)
W. Rudin, Functional Analysis (Springer, New York, 1976)
J. Weidmann, Linear Operators in Hilbert Spaces (Springer, New York, 1980)
K. Yosida, Functional Analysis (Springer, New York, 1965)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer Nature Switzerland AG
About this chapter
Cite this chapter
Diagana, T. (2018). Banach and Hilbert Spaces. In: Semilinear Evolution Equations and Their Applications. Springer, Cham. https://doi.org/10.1007/978-3-030-00449-1_1
Download citation
DOI: https://doi.org/10.1007/978-3-030-00449-1_1
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-00448-4
Online ISBN: 978-3-030-00449-1
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)