Basic Concepts

Ben Amar, Afif; O’Regan, Donal

doi:10.1007/978-3-030-92204-7_1

Part of the book series: Developments in Mathematics ((DEVM,volume 71))

836 Accesses

Abstract

This chapter collects well known concepts and results that will play a major role in constructing approximate fixed point theory in the remaining chapters. We note that we will reference the appropriate source papers after Sect. 1.2.8 (before this subsection well known results are presented so that the book is self contained). A brief introduction on fixed point theory is given at the end of this chapter.

Access provided by Autonomous University of Puebla. Download chapter PDF

70 Years of asymptotic fixed point theory

Article 01 March 2015

A Generalization of Nadler’s Fixed Point Theorem

Article 16 May 2017

An answer to an open problem of Jachymski

Article 17 June 2017

This chapter collects well known concepts and results that will play a major role in constructing approximate fixed point theory in the remaining chapters. We note that we will reference the appropriate source papers after Sect. 1.2.8 (before this subsection well known results are presented so that the book is self contained). A brief introduction on fixed point theory is given at the end of this chapter.

1.1 Topological Spaces

1.1.1 The Notion of Topological Spaces

The topology on a set X is usually defined by specifying its open subsets of X.

Definition 1.1

A topology τ on a set X is a family of subsets of X which satisfies the following conditions:

1.
The empty set ∅ and the whole X are both in τ.
2.
τ is closed under finite intersections.
3.
τ is closed under arbitrary unions.

The pair (X, τ) is called a topological space .

The sets Y ∈ τ are called open sets of X and their complements Z = X ∖ Y are closed of X. A subset of X may be neither closed nor open, or both. A set that is both closed and open is called a clopen set.

Examples 1.1

(i)
Let X any set. Then τ = {∅, X} is a topology on X, called the trivial topology on X.
(ii)
At the other extreme of the topological spectrum, if X is any nonempty set, then τ = P(X) the power set of X, is a topology on X, called the discrete topology on X.
(iii)
Let X = {a, b}, and set τ = {∅, {a}, {b}, {a, b}}. Then τ is a topology on X.
(iv)
Let (X, d) be a metric space . Let
$$\displaystyle \begin{aligned} \tau&= \Big\{ Y \subseteq X \colon \text{ for all } x \in Y, \text{ there exists }\delta> 0 \text{ such that } B_{\delta} (x) \\ &\qquad = \{y \in X \colon d(x,y)< \delta \} \subseteq Y \Big\}. \end{aligned} $$

Then τ is a topology, called the metric topology on X induced by d. This is the usual topology one thinks of when dealing with metric spaces, but as we shall see, there can be many more.
(v)
Let X be any nonempty set. Then
$$\displaystyle \begin{aligned} \tau_{cf}= \{\emptyset\} \cup \{Y\subseteq X \colon X\setminus Y \text{ is finite }\} \end{aligned}$$

is a topology on X, called the co-finite topology on X.

Definition 1.2

Let (X, τ) be a topological space and Y ⊆ X. Then Y ∩ τ = {Y ∩ U : U ∈ τ} is called the induced topology on Y .

Definition 1.3

Let (X, τ) be a topological space and Y ⊆ X. We define

(i)
The interior of a subset Y ⊆ X is the largest open set contained in it. It will be denoted by int Y . Equivalently, intY is the union of all open subsets of X contained in Y .
(ii)
A point x ∈ X is a limit point (or accumulation point ) of Y if and only if for every open set U containing x, it is true that U ∩ Y contains some point distinct from x, i.e., Y ∩ (U ∖{x}) ≠ ∅. Note that x need not belong to Y .
(iii)
The point x ∈ Y is an isolated point of Y if there is some open set U such that U ∩ Y = {x}. (In other words, there is some open set containing x but no other points of Y .)
(iv)
The closure of a subset Y , written $\overline {Y}$, is the union of Y and its set of limit points,
$$\displaystyle \begin{aligned} \overline{Y}= Y\cup \{x \in X \colon x \mbox{ is a limit point of } Y\}. \end{aligned}$$

Remark 1.1

It follows from the definition that $x \in \overline {Y}$ if and only if Y ∩ U ≠ ∅ for any open set U containing x. Indeed, suppose that $x \in \overline {Y}$ and that U is some open set containing x. Then either x ∈ Y or x is a limit point of Y (or both), in which case Y ∩ U ≠ ∅. On the other hand, suppose that Y ∩ U ≠ ∅ for any open set U containing x. Then if x is not an element of Y it is certainly a limit point. Thus $x \in \overline {Y}$.

Proposition 1.1

Let (X, τ) be a topological space and Y ⊆ X. The closure of Y is the smallest closed set containing Y , that is,

$$\displaystyle \begin{aligned} \overline{Y}= \bigcap \{ Z \colon Z ~\mathit{\mbox{ is closed and }}\, Y\subseteq Z\}. \end{aligned}$$

Corollary 1.1

A subset Y of a topological space is closed if and only if $Y= \overline {Y}$ . Moreover, for any subset $Y, \overline {Y}= \overline {\overline {Y}}$.

Proof

If Y is closed, then Y is surely the smallest closed set containing Y . Thus $Y= \overline {Y}$. On the other hand, if $Y= \overline {Y}$ then Y is closed because $\overline {Y}$ is. Now let Y be arbitrary. Then $\overline {Y}$ is closed and so equal to its closure, as above. That is, $\overline {Y}= \overline {\overline {Y}}$. ■

Definition 1.4

Let (X, τ) be a topological space.

1.
A subfamily $\mathcal {B}$ of τ is called a base if every open set can be written as a union of sets in $\mathcal {B}$.
2.
A subfamily $\mathcal {X}$ is called a subbase if the finite intersections of its sets form a base, i.e. every open set can be written as a union of finite intersections of sets in $\mathcal {X}$.

Examples 1.2

1.
The collection $\mathcal {B}= \{(a, b) \colon a, b \in \mathbb {R}, a < b\}$ is a base for the usual topology on $\mathbb {R}$.
2.
Let $\mathcal {S}$ be the collection of all semi-infinite intervals of the real line of the forms (−∞, a), and (b, +∞), where $a \in \mathbb {R}$. $\mathcal {S}$ is not a base for any topology on $\mathbb {R}$. To show this, suppose it were. Then, for example, (−∞, 1) and (0, +∞) would be in the topology generated by $\mathcal {S}$, being unions of a single base element, and so their intersection (0, 1) would be by the axiom 2) of topology. But (0, 1) clearly cannot be written as a union of elements in $\mathcal {S}$.
3.
The collection $\mathcal {S}$ is a subbase for the usual topology on $\mathbb {R}$.

Proposition 1.2

Let X be a set and let $\mathcal {B}$ be a collection of subsets of X. $\mathcal {S}$ is a base for a topology τ on X iff the following hold:

1.
$\mathcal {B}$ covers X, i.e., $\forall x \in X, \exists \, B \in \mathcal {B}$ such that x ∈ B.
2.
If x ∈ B₁ ∩ B₂ for some $B_{1}, B_{2} \in \mathcal {B},$ then $B_{3} \in \mathcal {B}$ such that x ∈ B₃ ∈ B₁ ∩ B₂.

Definition 1.5

Let (X, τ) be a topological space and ∈ X. A subset U of X is called a neighborhood of x if it contains an open set containing the point x. The neighborhood system at x is $\mathcal {N}_{x}= \{ U \subseteq X \colon U \text{ is a neighborhood of } x\}$.

Theorem 1.1

Let (X, τ) be a topological space, and x ∈ X. Then:

(a)
If $U \in \mathcal {N}_{x}$ , then x ∈ U.
(b)
If $U, V \in \mathcal {N}_{x}$ , then $U \cap V \in \mathcal {N}_{x}$.
(c)
If $U \in \mathcal {N}_{x}$ , there exists $V \in \mathcal {N}_{x}$ such that $U \in \mathcal {N}_{y}$ for each y ∈ V .
(d)
If $U \in \mathcal {N}_{x}$ and U ⊆ V , then $V \subseteq \mathcal {N}_{x}$.
(e)
G ⊆ X is open if and only if G contains a neighborhood of each of its points.

Remark 1.2

Conversely, if in a set X a nonempty collection $\mathcal {N}_{x}$ of subsets of X is assigned to each x ∈ X so as to satisfy conditions (a) through (d) and if we use (e) to define the notion of an open set, the result is a topology on X in which the neighborhood system at x is precisely $ \mathcal {N}_{x}$.

Definition 1.6

Let (X, τ) be a topological space. A (local) neighborhood base $\mathcal {B}_{x}$ at a point x ∈ X (or a fundamental system of neighborhoods of x) is a collection $\mathcal {B}_{x} \subseteq \mathcal {N}_{x}$ so that $U \in \mathcal {N}_{x}$ implies that there exists $B \in \mathcal {B}_{x}$ so that B ⊆ U. We refer to the elements of $\mathcal {B}_{x}$ as basic neighborhoods of the point x.

Example 1.1

Consider (X, d) be a metric space equipped with the metric topology τ. For each x ∈ X, fix a sequence (r_n(x))_n≥1 of positive real numbers such that lim_n→∞ r_n(x) = 0 and consider $\mathcal {B}_{x}= \{B_{r_{n}(x)}(x) \colon n \geq 1\}$. Then $\mathcal {B}_{x}$ is a neighborhood base at x for each x ∈ X.

Remark 1.3

Let (X, τ) be a topological space, and for each x ∈ X, suppose that $\mathcal {B}_{x}$ is a neighborhood base at x. Then $\mathcal {B}= \bigcup _{x \in X}\mathcal {B}_{x}$ is a base for the topology τ on X.

Definition 1.7

If (X, τ) is a topological space and x ∈ X and $\mathcal {B}$ is a set of open sets, we say that $\mathcal {B}$ is a local base at x if each element of $\mathcal {B}$ includes x and for every open set U that includes x there is some $V\in \mathcal {B}$ such that V ⊆ U.

Remark 1.4

If for each x ∈ X the set $\mathcal {B}_{x}$ is a local base at x, then $\bigcup _{x \in X}\mathcal {B}_{x}$ is a base for the topology of X.

Definition 1.8

Let (X, τ) be a topological space.

1.
(X, τ) is said to be T₁ if for every x, y ∈ X such that x ≠ y, there are neighborhoods U_x of x and U_y of y with y∉U_x and x∉U_y.
2.
(X, τ) is said to be T₂ (or Hausdorff) if for every x, y ∈ X such that x ≠ y, there are neighborhoods U_x of x and U_y of y with U_x ∩ U_y = ∅.

We say that two subsets Y and Z can be separated by τ if there exist U, V ∈ τ with Y ⊆ U, Z ⊆ V and U ∩ V = ∅.
3.
(X, τ) is said to be regular if whenever Y ⊆ X is closed and x∉Y , Y and {x} can be separated.
4.
(X, τ) is said to be normal if whenever Y₁, Y₂ ⊆ X are closed and disjoint, then Y₁ and Y₂ can be separated.
5.
(X, τ) is said to be T₃ if it is T₁ and regular.
6.
(X, τ) is said to be T₄ if it is T₁ and normal.

Definition 1.9

Let (X, τ) be a topological space. An open cover of Y ⊆ X is a collection $\mathcal {G} \subseteq \tau $ such that $Y \subseteq \cup _{G \in \mathcal {G}}$

A subset Y of a topological space (X, τ) is said to be compact if every open cover of X admits a finite subcover .

Proposition 1.3

Suppose (X, τ) is a topological Hausdorff space.

1.
Any compact set Y ⊆ X is closed.
2.
If Y is a compact set, then a subset Z ⊆ Y is compact, if and only if Z is closed (in X).

Proposition 1.4

For a subset Y of a topological space (X, τ), the following statements are equivalent.

1.
Y is compact.
2.
If (Z_α)_{α ∈ I} is any family of closed sets such that Y ∩⋂_{α ∈ I} Z_α = ∅, then Y ∩⋂_{α ∈ J} Z_α = ∅ for some finite subset J ⊆ I.
3.
If (Z_α)_{α ∈ I} is any family of closed sets such that Y ∩⋂_{α ∈ J} Z_α ≠ ∅, for every finite subset J ⊆ I, then Y ∩⋂_{α ∈ I} Z_α ≠ ∅.

Proof

The statements 2. and 3. are contrapositives. We shall show that 1. and 2. are equivalent. The proof rests on the observation that if (U_α)_α is a collection of sets, then Y ⊆⋃_α U_α if and only if Y ∩⋂_α(X ∖ U_α) = ∅. We first show that 1. implies 2. Suppose that Y is compact and let (Z_α)_{α ∈ I} be a family of closed sets such that Y ∩⋂_{α ∈ I} Z_α = ∅. Put U_α = X ∖ Z_α. Then each U_α is open, and by the above observation, Y ⊆⋃_{α ∈ I} U_α. But then there is a finite set J such that Y ⊆⋃_{α ∈ J} U_α, and so Y ∩⋂_{α ∈ J} Z_α = ∅, which proves 2.

Now suppose that 2. holds, and let (U_α)_α be an open cover of Y . Then each X ∖ U_α is closed and Y ∩⋂_{α ∈ I}(X ∖ U_α) = ∅. By 2., there is a finite set J such that Y ∩⋂_{α ∈ J}(X ∖ U_α) = ∅. This is equivalent to the statement that Y ⊆⋃_{α ∈ J} U_α. Hence Y is compact. ■

Remark 1.5

A topological space (X, τ) is compact if and only if any family of closed sets (Z_α)_{α ∈ I} in X having the finite intersection property (i.e., ⋂_{α ∈ J} Z_α ≠ ∅ for each finite subset J in I) is such ⋂_{α ∈ I} Z_α ≠ ∅.

Proposition 1.5

A nonempty subset Y of a topological space (X, τ) is compact if and only Y is compact with respect to the induced topology, that is, if and only if (Y, τ_Y) is compact. If (X, τ) is Hausdorff then so (Y, τ_Y).

Proof

Suppose first that Y is compact in (X, τ), and let (G_α)_{α ∈ I} an open cover of Y in (Y, τ_Y). Then each G_α has the form G_α = Y ∩ U_α for some U_α ∈ τ. It follows that (U_α)_{α ∈ I} is an open cover of Y in (X, τ). By hypothesis, there is a finite subcover, U₁, ⋯ , U_n, say. But then G₁, ⋯ , G_n is an open cover of Y in (Y, τ_Y), that is, (Y, τ_Y) is compact.

Conversely, suppose that (Y, τ_Y) is compact. Let (U_α)_{α ∈ I} be an open cover of Y in (X, τ). Set G_α = Y ∩ U_α. Then (G_α)_{α ∈ I} is an open cover of (Y, τ_Y). By hypothesis, there is a finite subcover, say, G₁, ⋯ , G_m. Clearly, U₁, ⋯ , U_m, is an open cover for Y in (X, τ). That is, Y is compact in (X, τ).

Suppose that (X, τ) is Hausdorff, and let x, y be any two distinct points of Y . Then there is a pair of disjoint open sets U, V in X such that x ∈ U and y ∈ V . Evidently, G₁ = Y ∩ U and G₂ = Y ∩ V are open in (Y, τ_Y), are disjoint and x ∈ G₁ and y ∈ G₂. Hence (Y, τ_Y) is Hausdorff, as required. ■

Theorem 1.2

Let (X, d) be a metric space. Then X, equipped with the metric topology is T₄.

Theorem 1.3

Let (X, τ) be a compact, Hausdorff space. Then (X, τ) is T₄.

Proof

Let Y, Z ⊆ X be two closed sets with Y ∩ Z = ∅. We need to find two open sets U, V ⊆ X, with Y ⊆ U, Z ⊆ V , and U ∩ V = ∅. Assume first that Z is a singleton, Z = {z}.

For every y ∈ Y we find open sets U_y and V_y, such that U_y ∋ y, V_y ∋ z, and U_y ∩ V_y = ∅. Using Proposition 1.3 we know that Y is compact, and since we clearly have Y ⊆⋃_{y ∈ Y} U_y, there exist y₁, ⋯ , y_n ∈ Y such that $\bigcup _{i=1}^{n} U_{y_{i}}\supseteq Y$. Then we are done by taking $U=\bigcup _{i=1}^{n} U_{y_{i}}$ and $V= \bigcap _{i=1}^{n}V_{y_{i}}$.

Having proven the above particular case, we proceed now with the general case. For every z ∈ Z, we use the particular case to find two open sets U_z and V_z with U_z ⊇ Y, V_z ∋ z, and U_z ∩ V_z = ∅. Arguing as above, the set Z is compact, and we have Z ⊆⋃_{z ∈ Z} V_z, so there exists z₁, ⋯ , z_n ∈ Z, such that $\bigcap _{i=1}^{n}V_{z_{i}} \supseteq Z$. Then we are done by taking $U= \bigcap _{i=1}^{n} U_{z_{i}}$ and $V= \bigcup _{i=1}^{n}V_{z_{i}}$. ■

Definition 1.10

A topological space (X, τ) is said to be separable if it admits a countable dense subset.

Proposition 1.6

Let (X, d) be a compact metric space. Then (X, d) is separable.

Proof

For each n ≥ 1, the collection $\mathcal {G}_{n}= \{B_{\frac {1}{n}}(x) \colon x \in X\}$ is an open cover of X. Since X is compact, we can find a finite subcover $\{B_{\frac {1}{n}}(x_{(j,n)}) \colon 1\leq j\leq k_n\}$ of X. It is then clear that if x ∈ X, there exists 1 ≤ j ≤ k_n so that $d(x,x_{(j,n)})< \frac {1}{n}$. As such, the collection

$$\displaystyle \begin{aligned} \mathcal{D}= \{x_{(j,n)} \colon 1 \leq j \leq k_n, 1 \leq n \} \end{aligned}$$

is a countable, dense set in X, proving that (X, d) is separable. ■

1.1.2 Comparison of Topologies

Any set X may carry several different topologies.

Definition 1.11

Let τ, τ′ be two topologies on the same set X. We say that τ is coarser (or weaker ) than τ′, in symbols τ ⊆ τ′, if for every subset of X which is open for τ is also open for τ′, or equivalently, if for every neighborhood of a point in X with respect to τ is also a neighborhood of that same point in the topology τ′. In this case τ′ is said to be finer (or stronger ) than τ′.

Two topologies τ and τ′ on the same set X coincide when they give the same open sets or the same closed sets or the same neighborhoods of each point, equivalently, when τ is both coarser and finer than τ′.

Two basis of neighborhoods of a set are equivalent when they define the same topology.

Remark 1.6

Given two topologies on the same set, it may very well happen that no-one is finer than the other. If it is possible to establish which one is finer, then we say that the two topologies are comparable .

Example 1.2

The cofinite topology τ_c on $\mathbb {R},$ i.e., $\tau _{c}= \{U\subseteq \mathbb {R}\colon U= \emptyset ~ \mbox{or}~ \mathbb {R} \setminus U \mbox{is finite}\}$, and the topology τ_i having $\{(-\infty ,a) \colon a \in \mathbb {R}\}$ as a basis are incomparable. In fact, it is easy to see that $\tau _{i}= \{(-\infty ,a)\colon a \in \mathbb {R}\}\cup \{\emptyset , \mathbb {R}\}$ as these are the union of sets in the given basis. In particular, we have that $\mathbb {R}\setminus \{0\}$ is in τ_c but not τ_i. Moreover, we have that (−∞, 0) is in τ_i but not τ_c. Hence, τ_c and τ_i are incomparable.

Proposition 1.7

If τ₁, τ₂ are Hausdorff topologies on a set X such that τ₂ is finer than τ₁ and such that (X, τ₂) is compact, then τ₁ = τ₂.

Proof

Let Y a τ₂-closed set. Since (X, τ₂) is compact then Y is τ₂-compact. Since τ₁ ⊆ τ₂ it follows that Y is τ₁-compact (any τ₁-open cover of Y is also a τ₂-open cover of Y and has a finite subcover). Since τ₁ is Hausdorff and Y is τ₁-compact then it is also τ₁-closed, which completes the proof (we showed that every τ₂-closed set is a τ₁-closed set). ■

Definition 1.12

Let X be a set and let F be a family of mappings from X into topological spaces:

$$\displaystyle \begin{aligned} \textit{F}= \{f_{\alpha} \colon X \to (Y_{\alpha},\tau_{\alpha})~:~ \alpha \in I\}. \end{aligned}$$

Let τ be the topology generated by the subbase

$$\displaystyle \begin{aligned} \{f_{\alpha}^{-1}(V) ~: V \in \tau_{\alpha}, \alpha \in I\}. \end{aligned}$$

Then τ is the weakest topology on X for which all the f_α are continuous maps (it is the intersection of all topologies having this property). It is called the weak topology induced by F, or the F-topology of X.

Proposition 1.8

Let F be a family of mappings X → (Y_α, τ_α) where X is a set and each (Y_α, τ_α) is a Hausdorff topological space. Suppose F separates points in X i.e., for any x, y ∈ X with x ≠ y, there is some f_α ∈ F such that f_α(x) ≠ f_α(y). Then the F-topology on X is Hausdorff.

Proof

Suppose that x, y ∈ X, with x ≠ y. By hypothesis, there is some α ∈ I such that nf_α(x) ≠ f_α(y). Since (Y_α, τ_α) is Hausdorff, there exist elements U, V ∈ τ_α such that f_α(x) ∈ U, f_α(y) ∈ V and U ∩ V = ∅. But then $f_{\alpha }^{-1}(U)$ and $f_{\alpha }^{-1}(V)$ are open with respect to F-topology and $x \in f_{\alpha }^{-1}(U), y \in f_{\alpha }^{-1}(V)$ and $f_{\alpha }^{-1}(U)\cap f_{\alpha }^{-1}(V)= \emptyset $. ■

Definition 1.13

Let (X, τ) be a topological space. X is called metrizable if it is compatible with some metric d (i.e., τ is generated by the open balls B_r(x) = {y ∈ X, d(x, y) < r}).

Proposition 1.9

Let (X, τ) be a compact topological space. If there is a sequence $\{f_{n}, n \in \mathbb {N}\}$ of continuous real-valued functions that separates points in X then X is metrizable.

Proof

Since (X, τ) is compact and the f_n are continuous then they are bounded. Thus, we can normalize them such that ∥f_n∥_∞ =sup_{x ∈ X}|f_n(x)|≤ 1. Define:

$$\displaystyle \begin{aligned} d(x,y)= \sum_{n=1}^{\infty}\frac{\left\vert f_{n}(x)- f_{n}(y)\right\vert }{2^{n}}. \end{aligned}$$

This series converges. In fact, it converges uniformly on X × X hence the limit is continuous. Because the f_n separate points d(x, y) = 0 iff x = 0. d is also symmetric and satisfies the triangle inequality.

Thus d is a metric and we denote by τ_d the topology induced by this metric. We need to show that τ_d = τ. Consider the metric balls:

$$\displaystyle \begin{aligned} B_{r}(x)= \{y \in X, d(x,y)< r\}. \end{aligned}$$

Since d is τ-continuous on X × X, these balls are τ-open and

$$\displaystyle \begin{aligned} \tau_{d}\subseteq \tau. \end{aligned}$$

By Proposition 1.7, since τ is compact and τ_d is Hausdorff (like any metric space) then τ = τ_d. ■

1.1.3 Nets and Convergence in Topology

Nets generalize the notion of sequences so that certain familiar results relating to continuity and compactness of sequences in metric spaces can be proved in arbitrary topological spaces. We now expand our notion of “sequence” (x_n)_n to something for which the index n need not be a natural number, but can instead take values in a (possibly uncountable) partially ordered set.

Definition 1.14

A directed set (I, ≺) consists of a set I with a partial order ≺ such that for every pair α, β ∈ I, there exists an element γ ∈ I with γ ≻ α and γ ≻ β.

Examples 1.3

1.
The natural numbers $\mathbb {N}$ with the relation ≤ define a directed set $(I, \prec )= (\mathbb {N}, \le )$.
2.
If (X, τ) is a topological space and x ∈ X, one can define a directed set (I, ≺) where I is the set of all neighborhoods of x in X, and U ≺ V for U, V ∈ I means V ⊆ U. This is a directed set because given any pair of neighborhoods U, V ⊆ X of x, the intersection U ∩ V is also a neighborhood of x and thus defines an element of I with U ∩ V ⊆ U and U ∩ V ⊆ V . Note that neither of U and V need be contained in the other, so they might not satisfy either U ≺ V or V ≺ U, hence ≺ is only a partial order, not a total order. Moreover, for most of the topological spaces we are likely to consider, I is uncountably infinite.
3.
Let (X, τ) a topological space and let x ∈ X. Then the set I_x = {U ∈ τ, x ∈ U} is a directed set when equipped with the either the subset relation ⊆, or more usefully the superset relation ⊇.
4.
If (I₁, ≺₁) and (I₂, ≺₂) are directed sets, then (I₁ × I₂, ≺) is a directed set where ≺ is defined by
$$\displaystyle \begin{aligned} (a,b)\prec (x,y)~~\text{if and only if }~~a\prec_{1} x~ \text{and}~b\prec_{2} y. \end{aligned}$$
5.
Let I denote the set of all finite partitions of $\left [ 0, 1\right ]$, partially ordered by inclusion (i.e., refinement). Let f be a continuous function on $\left [0, 1\right ]$, then to P = {0 = t₀ < t₁ < ⋯ < t_n = 1}∈ I, we associate the quantity $L_{P}(f)= \sum _{i=1}^{n}f(t_{i-1})(t_{i}-t_{i-1})$. The map f↦L_P(f) is a net (I is a directed set), and from Calculus, $\lim _{P\in I}L_{P}(f)= \int _{0}^{1}f(x)dx$.

Definition 1.15

Let $\mathcal {P}$ be a property of elements of a directed set (I, ≺). We shall say that:

1.
$\mathcal {P}$ holds eventually if there exists α₀ ∈ I such that $\mathcal {P}$ holds for each α ≻ α₀,
2.
$\mathcal {P}$ holds frequently if for each α ∈ I there exists β ≻ α satisfying $\mathcal {P}$.

Thus “eventually” means “for all successors of some element”, and “frequently” means “for arbitrary large elements”.

Definition 1.16

Given a topological space (X, τ), a net (x_α)_{α ∈ I} is a function I → X : α↦x_α, where (I, ≺) is a directed set.

Definition 1.17

We say that a net (x_α)_{α ∈ I} in X converges to x ∈ X if for every neighborhood U ⊆ X of x, there exists α₀ ∈ I such that x_α ∈ U for every α ≻ α₀.

Example 1.3

A net (x_α)_{α ∈ I} with $(I, \prec )= (\mathbb {N}, \leq )$ is simply a sequence, and convergence of this net to x means the same thing as convergence of the sequence.

Definition 1.18

A net (x_α)_{α ∈ I} has a cluster point (also known as accumulation point) at x ∈ X if for every neighborhood U ⊆ X of x and for every α₀ ∈ I, there exists α ≻ α₀ with x_α ∈ U.

Definition 1.19

A net (y_β)_{β ∈ J} is a subnet of the net (x_α)_{α ∈ I} if y_β = x_ϕ(β) for some order preserving function ϕ: J → I such that for every α₀ ∈ I, there exists an element β₀ ∈ J for which β ≻ β₀ implies ϕ(β) ≻ α₀ (cofinal) .

Example 1.4

If (x_n)_n is a sequence, any subsequence $(x_{k_{n}})_{n}$ becomes a subnet (y_β)_{β ∈ J} of the net $(x_{n})_{n \in \mathbb {N}}$ by setting $J= \mathbb {N}$ and $\phi \colon \mathbb {N} \to \mathbb {N}\colon n\longmapsto k_{n}$. Note that this remains true if we slightly relax our notion of subsequences so that (k_n) need not be a monotone increasing sequence in $\mathbb {N}$ but satisfies k_n →∞ as n →∞. Conversely, any subnet (y_β)_{β ∈ J} of a sequence $(x_{n})_{n \in \mathbb {N}}$ with $(J, \prec )= (\mathbb {N}, \le )$ is also a subsequence in this slightly relaxed sense, and can then be reduced to a subsequence in the usual sense by skipping some terms (so that the function n↦k_n becomes strictly increasing). Note however that a subnet of a sequence need not be a subsequence in general, i.e., it is possible to define a subnet (y_β)_{β ∈ J} of a sequence $(x_{n})_{n \in \mathbb {N}}$ such that J is uncountable, and one can derive concrete examples of such objects.

Remark 1.7

If (x_α)_{α ∈ I} is a net converging to x, then every subnet (x_ϕ(β))_{β ∈ J} also converges to x.

Theorem 1.4

Let Y be a subset of a topological space (X, τ). Then $x\in \overline {Y}$ if and only if there is a net (x_α)_{α ∈ I} with x_α ∈ Y such that x_α→x.

Proof

We know that a point x ∈ X belongs to $\overline {Y}$ if and only if every neighborhood of x meets Y . Suppose then that (x_α)_{α ∈ I} is a net in Y such that x_α→x. By definition of convergence, (x_α)_{α ∈ I} is eventually in every neighborhood of x, so certainly $x \in \overline {Y}$.

Suppose, on the other hand, that $x \in \overline {Y}$. Let $\mathcal {N}_{x}$ be the collection of all neighborhoods of x ordered by reverse inclusion. Then $\mathcal {N}_{x}$ is a directed set. We know that for each $V\in \mathcal {N}_{x}$ the set V ∩ Y is nonempty so let x_V be any element of V ∩ Y . Then x_V→x. ■

Lemma 1.1

Let X be a set, and (x_α)_{α ∈ I} a net in X. Let $\mathcal {B}$ be a family of subsets of X, satisfying

1.
x_α is contained frequently in each element of $\mathcal {B}$ , and
2.
the intersection of any two elements of $\mathcal {B}$ contains an element of $\mathcal {B}$.

Then (x_α)_{α ∈ I} admits a subnet which is eventually contained in each element of $\mathcal {B}$.

Proof

Clearly, the family $\mathcal {B}$ is directed by the inverse inclusion. Consider the set

$$\displaystyle \begin{aligned} J= \{ (\alpha,B) \in I\times \mathcal{B}\colon x_{\alpha} \in B\} \end{aligned}$$

equipped with the coordinate-wise pre-ordering. It is easy to see that J is a directed set. The function ϕ: J → I, defined by ϕ(α, B) = α, is nondecreasing and onto, and hence tends to infinity. Consequently, (x_ϕ(α,B))_(α,B) is a subnet of (x_α)_{α ∈ I}. Moreover, given A ∈ B, fix α₀ ∈ I so that $x_{\alpha _{0}} \in A$, and observe that if (α, B) ≻ (α₀, A) then x_ϕ(β,B) = x_β ∈ B ⊆ A. This completes the proof. ■

In metric spaces, a standard theorem states that sequential continuity is equivalent to continuity. In arbitrary topological spaces this no longer true, but we have the following generalization.

Theorem 1.5

For any two topological spaces X and Y , a map T : X → Y is continuous if and only if for every net (x_α)_{α ∈ I} in X converging to a point x ∈ X, the net (T(x_α))_{α ∈ I} in Y converges to T(x).

Proposition 1.10

A point x of a topological space (X, τ) is a cluster point of a net (x_α)_{α ∈ I} in X if and only if there exists a subnet (x_ϕ(β))_{β ∈ J} that converges to x.

Proof

If (x_ϕ(β))_{β ∈ J} is a subnet of (x_α)_{α ∈ I} converging to x, then for every neighborhood U ⊆ X of x, there exists β₀ ∈ J such that x_ϕ(β) ∈ U for every β ≻ β₀. Then for any α₀ ∈ I, the definition of a subnet implies that we can find β₁ ∈ J with ϕ(β) ≻ α₀ for all β ≻ β₁, and since J is a directed set, there exists β₂ ∈ J with β₂ ≻ β₀ and β₂ ≻ β₁. It follows that for α = ϕ(β₂), α ≻ α₀ and $x_{\alpha }= x_{\phi (\beta _{2})}\in U$, thus x is a cluster point of (x_α)_{α ∈ I}.

Conversely, if x is a cluster point of (x_α)_{α ∈ I}, we can define a convergent subnet as follows. Define a new directed set

$$\displaystyle \begin{aligned} J= I \times \{\text{ neighborhoods of } x ~\text{in}~ X\}, \end{aligned}$$

with the partial order (α, U) ≺ (β, V ) defined to mean both α ≺ β and V ⊆ U. Then for each (β, U) ∈ J, the fact that x is a cluster point implies that we can choose ϕ(β, U) ∈ I to be any α ∈ I such that α ≻ β and x_α ∈ U. This defines a function ϕ: J → I such that for any α₀ ∈ I and any neighborhood U₀ ⊆ X of x, every (β, U) ∈ J with (β, U) ≻ (α₀, U₀) satisfies ϕ(β, U) ≻ β ≻ α₀, hence (x_ϕ(β,U))_{β ∈ J} is a subnet of (x_α)_{α ∈ I}. Moreover, for any neighborhood U ⊆ X of x, we can choose an arbitrary α₀ ∈ I and observe that

$$\displaystyle \begin{aligned} (\beta,V)\succ (\alpha_{0},U) \Longrightarrow x_{\phi(\beta,V)} \in V\subseteq U, \end{aligned}$$

thus (x_ϕ(β,U))_{(β,U) ∈ J} converges to x. ■

Theorem 1.6

A topological space (X, τ) is compact if and only if every net in X has a convergent subnet.

Proof

Suppose X is compact but there exists a net (x_α)_{α ∈ I} in X with no cluster point. The fact that every x ∈ X is not a cluster point of (x_α)_{α ∈ I} then means that we can find for each x ∈ X an open neighborhood U_x ⊆ X of x and an index α_x ∈ I such that $x_{\alpha _{x}} \notin U_{x}$ for all α ≻ α_x. But (U_x)_{x ∈ X} is then an open cover of X and therefore has a finite subcover, meaning there is a finite subset x₁, ⋯ , x_N ∈ X such that $X= \bigcup _{n=1}^{N}U_{x_{n}}$. Since (I, ≺) is a directed set, there also exists an element β ∈ I such that

$$\displaystyle \begin{aligned} \beta \succ \alpha_{x_{n}} ~ \text{ for each }~ n= 1,\cdots,N. \end{aligned}$$

Then $x_{\beta } \notin U_{x_{n}}$ for every n = 1, ⋯ , N, but since the sets $U_{x_{n}}$ cover X, this is a contradiction.

Conversely, suppose that every net in X has a cluster point, but that X has a collection O of open sets that cover X such that no finite subcollection in O covers X. Define a directed set where I is the set of all finite subcollections of O, with the ordering relation defined by inclusion, i.e., for A, B ∈ I, A ≺ B means A ⊆ B. Note that (I, ≺) is a directed set since for any two A, B ∈ I, we have A ∪ B ∈ I with A ∪ B ⊃ A and A ∪ B ⊃ B. By assumption, none of the unions ⋃_{U ∈ A} for A ∈ I cover X, so we can choose a point

$$\displaystyle \begin{aligned} x_{A} \in X \setminus \bigcup_{U \in A}U \end{aligned} $$

(1.1)

for each A ∈ I, thus defining a net (x_A)_{A ∈ I}. Then (x_A)_{A ∈ I} has a cluster point x ∈ X. Since the sets in O cover X, we have x ∈ V for some V ∈ O, and the collection {V } is an element of I, hence there exists A ≻{V } such that x_A ∈ V . But this means A is a finite subcollection of O that includes V , thus contradicting (1.1). ■

Theorem 1.7

Let X be a set and let τ ₁ and τ ₂ be topologies on X. Then the following are equivalent

1.
τ₁ = τ₂.
2.
Every (x_α)_{α ∈ I} in X, converges in τ₁ if and only if it converges in τ₂.

Proposition 1.11

A topological space (X, τ) is Hausdorff if and only if no net has two distinct limits.

Proof

Suppose (X, τ) is Hausdorff and consider a net (x_α)_{α ∈ I}. Suppose for contradiction that x and y are distinct limits of (x_α)_{α ∈ I}. Take disjoint neighborhoods U of x and V of y. By definition of convergence, there is a α_x such that x_α ∈ U for all α ≻ α_x and a α_y such that x_α ∈ V for all α ≻ α_y. In particular we have x_α ∈ U ∩ V for an upper bound α of α_x and α_y in the directed set I, contradicting the disjointness of U and V . Thus (x_α)_{α ∈ I} cannot have two distinct limits.

Conversely, suppose that (X, τ) is not Hausdorff, so there are two distinct points x and y such that any neighborhood of x intersects any neighborhood of y. So there is a net $(x_{(U,V)})_{\mathcal {N}(x)\times \mathcal {N}(y)}$ such that

$$\displaystyle \begin{aligned} x_{(U,V)} \in U\cap V \end{aligned}$$

for neighborhoods U of x and V of y. Take any neighborhood U₀ of x and any $(U,V)\in \mathcal {N}(x)\times \mathcal {N}(y)$ with (U, V ) ≻ (U₀, X). By definition we have U ⊆ U₀ and thus x_{(U,V )} ∈ U ∩ V ⊆ U₀. This proves that x_{(U,V )} → x and we can similarly show that x_{(U,V )} → y. So the net $(x_{(U,V)})_{\mathcal {N}(x)\times \mathcal {N}(y)}$ has two distinct limits, as required. ■

1.2 Topological Vector Spaces

1.2.1 Linear Topologies

Definition 1.20

Let X be a vector space. A linear topology on X is a topology τ such that the maps

$$\displaystyle \begin{aligned} X \times X \ni (x,y)\mapsto x+y \in X \end{aligned} $$

(1.2)

$$\displaystyle \begin{aligned} \mathbb{K} \times X \ni (\alpha,x) \mapsto \alpha x \in X \end{aligned} $$

(1.3)

are continuous. For the map (1.2) we use the product topology τ × τ. For the map (1.3) we use the product topology $\tau _{\mathbb {K}} \times \tau $, where $\tau _{\mathbb {K}}$ is the standard topology on $\mathbb {K}$.

A topological vector space is a pair (X, τ) consisting of a vector space X and a Hausdorff linear topology τ on X.

Remark 1.8

If (X, τ) is a topological vector space then it is clear from Definition 1.20 that $\sum _{k=1}^{N}\lambda _{k}^{(n)}x_{k}^{(n)} \to \sum _{k=1}^{N}\lambda _{k}x_{k}$ as n →∞ with respect to τ if for each k = 1, ⋯ , N as n →∞ we have $\lambda _{k}^{(n)} \to \lambda _{k}$ with respect to the euclidean topology on $\mathbb {K}$ and $x_{k}^{(n)} \to x_{k}$ with respect to τ.

Examples 1.4

1.
Every vector space X over $\mathbb {K}$ endowed with the trivial topology is a topological vector space.
2.
The field $\mathbb {K}$, viewed as a vector space over itself, becomes a topological vector space, when equipped with the standard (euclidean) topology $\tau _{\mathbb {K}}$.
3.
Every normed vector space endowed with the topology given by the metric induced by the norm is a topological vector space.

Proposition 1.12

Every vector space X over $\mathbb {K}$ endowed with the discrete topology is not a topological vector space unless X = {θ}.

Proof

Assume by a contradiction that it is a topological vector space and take θ ≠ x ∈ X. The sequence $\alpha _{n}= \frac {1}{n}$ in $\mathbb {K}$ converges to 0 in the euclidean topology. Therefore, since the scalar multiplication is continuous, α_n x → θ, i.e., for any neighborhood U of θ in X there exists $m \in \mathbb {N}$ such that α_n x ∈ U for all n ≥ m. In particular, we can take U = {θ} since it is itself open in the discrete topology. Hence, α_m x = θ, which implies that x = θ and so a contradiction. ■

Remark 1.9

In terms of net convergence, the continuity requirements for a linear topology on X read:

Whenever (x_α) and (y_α) are nets in X, such that x_α → x and y_α → y, it follows that x_α + y_α → x + y.
Whenever (λ_α) and (x_α) are nets in $\mathbb {K}$ and X, respectively, such that λ_α → λ (in $\mathbb {K}$) and x_α → x (in X), it follows that λ_α x_α → λx.

Example 1.5

Let I be an arbitrary nonempty set. The product space $\mathbb {K}^{I}$ (defined as the space of all functions $I \to \mathbb {K}$) is obviously a vector space (with pointwise addition and scalar multiplication). The product topology turns $\mathbb {K}^{I}$ into a topological vector space.

Remark 1.10

If X is a vector space, then the following maps are continuous with respect to any linear topology on X :

The translations T_y: X → X, y ∈ X, defined by T_y(x) = x + y.
The dilations $D_{\alpha }\colon X \to X, \alpha \in \mathbb {K},$ defined by D_α(x) = αx.

If τ is a linear topology on a vector space X, then τ is translation invariant . That is, a subset U ⊆ X is open if and only if the translation y + U is open for all y ∈ X. Indeed, the continuity of addition implies that for each y ∈ X, the translation x↦y + x is a linear homeomorphism . In particular, every neighborhood of y is of the form y + U, where U is a neighborhood of zero. In other words, the neighborhood system at zero determines the neighborhood system at every point of X by translation. Also note that the dilation x↦αx is a linear homeomorphism for any α ≠ 0. In particular, if U is a neighborhood of zero, then so is αU for all α ≠ 0.

Example 1.6

If a metric d on a vector space X is translation invariant , i.e., d(x + z, y + z) = d(x, y) for all x, y ∈ X (i.e., the metric induced by a norm), then the topology induced by the metric is translation invariant and the addition is always continuous. However, the multiplication by scalars does not need to be necessarily continuous (take d to be the discrete metric, then the topology generated by the metric is the discrete topology which is not compatible with the scalar multiplication).

Proposition 1.13

If Y is a linear subspace of a topological vector space (X, τ), then so its closure $\overline {Y}$ . In particular, any maximal proper subspace is either dense or closed.

Proof

We must show that if $x, y \in \overline {Y}$ and $\lambda \in \mathbb {K}$, then $\lambda x+ y \in \overline {Y}$. There are nets (x_α) and (y_α) in Y , such that x_α→x and y_α→y. By Remark 1.9, we deduce that tx_α→tx and tx_α + y_α→tx + y and we conclude that $tx + y\in \overline {Y}$, as required.

If Y is a maximal proper subspace, the inclusion $Y \subseteq \overline {Y}$ implies either $Y= \overline {Y}$, in which case Y is closed, or $\overline {Y}= X$, in which case Y is dense in X. ■

Notations

Given a vector space X, a subset Y ⊆ X, and a vector x ∈ X, we denote the translation T_x(Y ) simply by Y + x (x + Y ), that is,

$$\displaystyle \begin{aligned} Y+x= x+Y= \{ y+x\colon y \in Y\}. \end{aligned}$$

Likewise, for an $\alpha \in \mathbb {K}$ we denote the dilation D_α(Y ) simply by αY , that is,

$$\displaystyle \begin{aligned} \alpha Y= \{ \alpha y\colon y \in Y\}. \end{aligned}$$

Given another subset Z ⊆ X, we define

$$\displaystyle \begin{aligned} Y+Z= \{y+z\colon y\in Y, z \in Z\}=\bigcup_{y\in Y}(y+Z)= \bigcup_{z\in Z}(Y+z). \end{aligned}$$

Remark 1.11

In general we only have the inclusion 2Y ⊆ Y + Y.

Lemma 1.2

Let τ be a linear topology on the vector space X.

1.
The algebraic sum of an open set and an arbitrary set is open.
2.
Nonzero multiples of open sets are open.
3.
If Y is open, then for any set Z we have $\overline {Z}+Y= Z+Y.$
4.
The algebraic sum of a compact set and a closed set is closed. (However, the algebraic sum of two closed sets need not be closed.)
5.
The algebraic sum of two compact sets is compact.
6.
Scalar multiples of closed sets are closed.
7.
Scalar multiples of compact sets are compact.

Proof

We shall prove only items 3. and 4.

3.
Clearly $Y+Z \subseteq Y+ \overline {Z}$. For the reverse inclusion, let x = z + y where $z \in \overline {Z}$ and y ∈ Y . Then there is an open neighborhood U of θ such that y + U ⊆ Y . Since $z \in \overline {Z}$, there exists some t ∈ Z ∩ (z − U). Then x = z + y = t + z + (y − z) ∈ t + z + U ⊆ Z + Y .
4.
Let Y be compact and Z be closed, and let a net (y_α + z_α)_{α ∈ I} satisfy y_α + z_α→x. Since Y is compact, we can assume (by passing to a subnet) that y_α→y ∈ Y . The continuity of the algebraic operations yields
$$\displaystyle \begin{aligned} z_{\alpha}= (y_{\alpha}+z_{\alpha})- y_{\alpha} \longrightarrow x-y= z. \end{aligned}$$

Since Z is closed, z ∈ Z, so x = y + z ∈ Y + Z, proving that Y + Z is closed.

■

Proposition 1.14

Let τ be a linear topology on the vector space X.

1.
For every neighborhood V of θ, there exists a neighborhood W of θ, such that W + W ⊆ V.
2.
For every neighborhood V of θ, and any compact set $C\subseteq \mathbb {K}$ , there exists a neighborhood W of θ, such that αW ⊆ V, ∀ α ∈ C.

Proof

1. Let T : X × X → X denote the addition map (1.2). Since T is continuous at (θ, θ) ∈ X × X, the preimage T ⁻¹(V ) is a neighborhood of (θ, θ) in the product topology. In particular, there exists neighborhoods W₁, W₂ of θ, such that W₁ × W₂ ⊆ T ⁻¹(V ), so if we take W = W₁ ∩ W₂, then W is still a neighborhood of θ satisfying W × W ⊆ T ⁻¹(V ), which is precisely the desired inclusion W + W ⊆ V.

2. Let $G\colon \mathbb {K} \times X \to X$ denote the multiplication map (1.3). Since G is continuous at $(0,\theta ) \in \mathbb {K} \times X$, the preimage G ⁻¹(V ) is a neighborhood of (0, θ) in the product topology. In particular, there exists a neighborhood I of 0 in $\mathbb {K}$ and a neighborhood W₀ of θ in X such that I × W₀ ⊆ G ⁻¹(V ). Let then ρ > 0 such that I contains the closed disk $\overline {B_{\rho }}(0)= \{ \alpha \in \mathbb {K}\colon \left \vert \alpha \right \vert \le \rho \}$, so that we still have the inclusion $\overline {B_{\rho }}(0) \times W_{0}\subseteq G^{-1}(V)$ i.e.,

$$\displaystyle \begin{aligned} \alpha\in \mathbb{K}, \left\vert \alpha\right\vert \le \rho \Longrightarrow \alpha W_{0} \subseteq V. \end{aligned} $$

(1.4)

Since $C\subseteq \mathbb {K}$ is compact, there is some R > 0, such that

$$\displaystyle \begin{aligned} \left\vert \gamma\right\vert \le R,~~\forall~\gamma \in C. \end{aligned} $$

(1.5)

Let us then define $W= (\frac {\rho }{R})W_{0}.$ First of all, since W is a non-zero dilation of W₀, it is a neighborhood of θ. Secondly, if we start with some γ ∈ C and some w ∈ W, written as $w=(\frac {\rho }{R})w_{0}$ with w₀ ∈ W₀, then

$$\displaystyle \begin{aligned} \gamma w= (\frac{\rho \alpha}{R})w_{0}. \end{aligned}$$

By (1.5) we know that $\left \vert \frac {\rho \alpha }{R}\right \vert \le \rho ,$ so by (1.4) we get γw ∈ V. ■

1.2.2 Absorbing and Balancing Sets

Definition 1.21

A subset Y of a vector space X is convex if, whenever Y contains two points x and y, Y also contains the segment or the straight line joining them, i.e.,

$$\displaystyle \begin{aligned} \forall~ x, y \in Y, \forall~ \alpha, \beta \geq 0~ \mbox{ such that } \alpha+\beta= 1, \alpha x+ \beta y\in Y. \end{aligned}$$

Examples 1.5

1.
The convex subsets of $\mathbb {R}$ are simply the intervals of $\mathbb {R}$. Examples of convex subsets of $\mathbb {R}^{2}$ are solid regular polygons. The Platonic solids are convex subsets of $\mathbb {R}^{3}$. Hyperplanes and half spaces in $\mathbb {R}^{n}$ are convex.
2.
Balls in a normed space are convex.
3.
Consider a topological space X and the set C(X) of all real valued functions defined and continuous on X. C(X) with the pointwise addition and scalar multiplication of functions is a vector space. Fixed g ∈ C(X), the subset Y := {f ∈ C(X): f(x) ≥ g(x), ∀ x ∈ X} is convex.
4.
Consider the vector space $\mathbb {R}[x]$ of all polynomials in one variable with real coefficients. Fixed $n \in \mathbb {N}$ and $c \in \mathbb {R}$, the subset of all polynomials in $\mathbb {R}[x]$ such that the coefficient of the term of degree n is equal to c is convex.

Proposition 1.15

Let X be a vector space. The following properties hold.

(a)
∅ and X are convex.
(b)
Arbitrary intersections of convex sets are convex sets.
(c)
Unions of convex sets are generally not convex.
(d)
The sum of two convex sets is convex.
(e)
A set Y is convex if and only if αY + βY = (α + β)Y for all nonnegative scalars α and β.
(f)
The image and the preimage of a convex set under a linear map is convex.

Definition 1.22

Let Y be any subset of a vector space X. We define the convex hull of X, denoted by conv(Y ), to be the set of all finite convex linear combinations of elements of Y , i.e.,

$$\displaystyle \begin{aligned} \mathrm{conv}(Y)= \left\{ \sum_{i=1}^{n} \alpha_{i}x_{i} \colon x_{i} \in Y, \alpha_{i} \in [0,1], \sum_{i=1}^{n} \alpha_{i}= 1, n \in \mathbb{N} \right\}. \end{aligned}$$

Proposition 1.16

Let Y, Z be arbitrary sets of a vector space X. The following hold.

(a)
conv(Y ) is convex.
(b)
Y ⊆conv(Y ).
(c)
A set is convex if and only if it is equal to its own convex hull.
(d)
If Y ⊆ Z then conv(Y ) ⊆conv(Z).
(e)
conv(conv(Y )) = conv(Y ).
(f)
conv(Y + Z) = conv(Y ) + conv(Z).
(g)
The convex hull of Y is the smallest convex set containing Y , i.e., conv(Y ) is the intersection of all convex sets containing Y .

Definition 1.23

Let X be a vector space.

A subset Y ⊆ X is said to be absorbing (or radial ), if for every x ∈ X, there exists some scalar α > 0, such that αx ∈ Y . Roughly speaking, we may say that a subset is absorbing if it can be made by dilation to swallow every point of the whole space.
A subset Y ⊆ X is said to be balancing (or circled ), if for every $\alpha \in \mathbb {K}$ with $\left \vert \alpha \right \vert \le 1$, one has the inclusion αY ⊆ Y . Note that the line segment joining any point x of a balanced set Y to − x lies in Y .
A subset Y ⊆ X is said to be symmetric , if for every x ∈ Y , one has (−x) ∈ Y , namely (−Y ) = Y .
A subset Y ⊆ X is said to be absolutely convex , if it is convex and balanced.
A subset Y ⊆ X is said to be starshaped about zero if it included the line segment joining each of its points with zero. That is, if for any x ∈ Y and any 0 ≤ α ≤ 1 we have αx ∈ Y .

Remark 1.12

Note that an absorbing set must contain θ, and any set including an absorbing set is itself absorbing. For any absorbing set Y , the set Y ∩ (−Y ) is nonempty, absorbing, and symmetric. Every circled set is symmetric. Every circled set is star-shaped about θ, as is every convex set containing θ.

Remark 1.13

Given τ a linear topology of a vector space X, all neighborhoods of θ are absorbing. Indeed, if we start with some x ∈ X, the sequence $x_{n}= \frac {1}{n}x$ clearly converges to θ, so every neighborhood of θ will contain (many) terms x_n.

Examples 1.6

1.
In a normed space the unit balls centered at the origin are absorbing and balanced.
2.
The unit ball B centered at $(\frac {1}{2},0) \in \mathbb {R}^{2}$ is absorbing but not balanced in the real vector space $\mathbb {R}^{2}$ endowed with the euclidean norm. Indeed, B is a neighborhood of the origin. However, B is not balanced because for example if we take x = (1, 0) ∈ B and α = −1 then αx∉B.
3.
The polynomials $\mathbb {R[X]}$ are a balanced but not absorbing subset of the real space $C([0,1],\mathbb {R})$ of continuous real valued functions on [0, 1]. Indeed, any multiple of a polynomial is still a polynomial but not every continuous function can be written as multiple of a polynomial.
4.
The subset $Y= \{(z_{1},z_{2}) \in \mathbb {C}^{2}\colon \left \vert z_{1}\right \vert \leq \left \vert z_{2}\right \vert \}$ of the complex space $\mathbb {C}^{2}$ with the euclidean topology is balanced but intY is not balanced.

Definition 1.24

Given τ a linear topology of a vector space X, a subset Y ⊆ X is said to be a barrel if it has the following properties:

1.
Y is absorbing,
2.
Y is absolutely convex,
3.
Y is closed.

Proposition 1.17

Let X be a vector space and let τ be a linear topology on X.

A.
If $\mathcal {B}$ is a neighborhood base at θ, then:
1. 1.
  For every $V \in \mathcal {B}$ , there exists $W \in \mathcal {B}$ , such that W + W ⊆ V .
2. 2.
  For every $V \in \mathcal {B}$ and every compact set $C\subseteq \mathbb {K}$ , there exists $W \in \mathcal {B}$ , such that γW ⊆ V, ∀ γ ∈ C.
3. 3.
  For every x ∈ X, the collection ${\mathcal {B}}_{x}= \{ V+x\colon V\in \mathcal {B}\}$ is a neighborhood base at x.
4. 4.
  The topology τ is Hausdorff, if and only if $\bigcap _{V \in \mathcal {B}}V= \{\theta \}$.
B.
There exists a neighborhood base at θ, consisting of open balanced sets.

Proof

A.
Statements 1. and 2. follow immediately from Proposition 1.14. Statement 3. is clear, since translations are homeomorphisms.

4. Denote for simplicity the intersection $\bigcap _{V \in \mathcal {B}}V$ by J, so clearly θ ∈ J. Assume first τ is Hausdorff. In particular, for each x ∈ X ∖{θ}, the set X ∖{x} is an open neighborhood of θ, so there exists some $V^{x} \in \mathcal {B}$ with V ^x ⊆ X ∖{x}. We then clearly have the inclusion
$$\displaystyle \begin{aligned} J\subseteq \bigcap_{x\neq \theta}V^{x}\subseteq \bigcap_{x\neq\theta}(X\setminus \{x\})= \{\theta\}, \end{aligned}$$

so J = {θ}. Conversely, assume J = {θ}, and let us show that τ is Hausdorff. Start with two points x, y ∈ X with x ≠ y, so that x − y ≠ θ, and let us indicate how to construct two disjoint neighborhoods, one for x and one for y. Using translations, we can assume y = θ. Since $\theta \neq x \notin \bigcap _{V \in \mathcal {B}}V$, there exists some $V \in \mathcal {B}$, such that x∉V . Using 1., there is some $W \in \mathcal {B}$, such that W + W ⊆ V , so we still have x ∈ W + W. This clearly forces
$$\displaystyle \begin{aligned} x+((-1)V)\cap V= \emptyset. \end{aligned} $$
(1.6)

Since V is a neighborhood of θ, so is (−1)V (non-zero dilation), then by 3. the left-hand side of (1.5) is a neighborhood of x.
B.
Let us take the $\mathcal {D}$ to be the collection of all open balanced sets that contain θ. All we have to prove is the following statement: for every neighborhood V of θ, there exists $W \in \mathcal {D}$, such that W ⊆ V . Using 2. there exists some open set O ∋ θ, such that
$$\displaystyle \begin{aligned} \gamma O \subseteq V,~~\forall \gamma \in \mathbb{K}, \left\vert \gamma\right\vert \le 1. \end{aligned} $$
(1.7)

In particular, $\bigcup _{\alpha \in \mathbb {K}, 0< \left \vert \alpha \right \vert \le 1}\alpha O$ is an open set contained in V . So $\bigcup _{\alpha \in \mathbb {K}, 0< \left \vert \alpha \right \vert \le 1}\alpha O \in V. $

■

Definition 1.25

Assume τ is a linear topology on a vector space X. A subset Y ⊆ X is said to be τ-bounded , if it satisfies the following condition:

$$\displaystyle \begin{aligned} \mathit{for} ~\mathit{every} ~\mathit{neighborhood}~ V~ \mathit{of}~ \theta, \mathit{there} ~\mathit{exists}~ \rho> 0, \mathit{such} ~\mathit{that} ~ Y\subseteq \rho V. \end{aligned}$$

Example 1.7

Suppose τ is a linear topology on a vector space X. If x ∈ X ∈, then {x} is bounded. Indeed, let V any neighborhood of θ. Then V is absorbing and so x ∈ ρV for all sufficiently large ρ > 0, that is, {x} is bounded.

Proposition 1.18

Let X be a vector space X endowed with a linear topology τ. Then

1.
If Y ⊆ X is τ-bounded, then its closure $\overline {Y}$ is also τ-bounded.
2.
If Y, Z ⊆ X are τ-bounded, then so is Y + Z.
3.
If Y ⊆ X is τ-bounded and $C\subseteq \mathbb {K}$ is bounded, then so ⋃_{α ∈ C} αY .
4.
All compact subsets in X are τ-bounded.

Remark 1.14

It follows by induction, that any finite set in a vector space X endowed with a linear topology τ is bounded. Also, taking Y = {x} (in the above proposition) we see that any translate of a bounded set is bounded.

Proposition 1.19

Any convergent sequence in topological vector space is bounded.

Proof

Suppose that (x_n)_n is a sequence in a topological vector space (X, τ) such that x_n→x. For each $n \in \mathbb {N}$, set y_n = x_n − x, so that y_n→θ. Let V any neighborhood of θ. Let U be any balanced neighborhood of θ such that U ⊆ V . Then U ⊆ ρU for all ρ with |ρ|≥ 1. Since y_n→θ, there is $N \in \mathbb {N}$ such that y_n ∈ U whenever n > N. Hence y_n ∈ U ⊆ tU ⊆ tV whenever n > N and t ≥ 1. Set Y = {y₁, ⋯ , y_n} and Z = {y_n: n > N}. Then Y is a finite set so is bounded and therefore Y ⊆ tV for all sufficiently large t. But then it follows that Y ∪ Z ⊆ tV for sufficiently large t, that is, $\{y_{n} \colon n \in \mathbb {N}\}$ is τ-bounded and so is $\{x_{n} \colon n \in \mathbb {N}\}= x + (Y\cup Z)$. ■

Remark 1.15

A convergent net in a topological vector space need not be bounded . For example, let I be $\mathbb {R}$ equipped with its usual order and let $x_{\alpha } \in \mathbb {R}$ be given by x_α = e ^−α. Then (x_α)_{α ∈ I} is an unbounded but convergent net (with limit 0) in the real normed space $\mathbb {R}$.

Proposition 1.20 (“Zero. Bounded” Rule)

Suppose τ is a linear topology in a vector space X. If the net $(\alpha _{\lambda })_{\lambda \in \Lambda } \subseteq \mathbb {K}$ converges to 0, and the net (x_λ)_{λ ∈ Λ}⊆ X is τ-bounded, then (α_λ x_λ)_{λ ∈ Λ} is convergent to θ.

Proof

Start with some neighborhood V of θ. We wish to construct an index λ_V ∈ Λ such that

$$\displaystyle \begin{aligned} \alpha_{\lambda}x_{\lambda} \in V, ~\forall~ \lambda \succ \lambda_{V}. \end{aligned} $$

(1.8)

Using Proposition 1.17 B., we can assume that V is balanced (otherwise we replace it with a balanced open set V ′⊆ V ). Using the boundedness condition we find ρ > 0, such that

$$\displaystyle \begin{aligned} x_{\lambda} \in \rho V, ~ \forall~\lambda\in \Lambda. \end{aligned} $$

(1.9)

Using the condition α_λ → 0, we then choose λ_V ∈ Λ, so that

$$\displaystyle \begin{aligned} \left\vert \alpha_{\lambda} \right\vert \leq \frac{1}{\rho}, \forall~ \lambda \succ \lambda_{V}. \end{aligned}$$

To check (1.8), start with some λ ≻ λ_V and apply (1.9) to write x_λ = ρv, for some v ∈ V . Now we have

$$\displaystyle \begin{aligned} \alpha_{\lambda}x_{\lambda}= (\alpha_{\lambda}\rho)v \in (\alpha_{\lambda}\rho)V, \end{aligned}$$

with $\left \vert \alpha _{\lambda }\rho \right \vert \leq 1$, so using the fact that V is bounded, it follows that α_λ x_λ ∈ V . ■

Definition 1.26

Let (X, τ) be a topological vector space.

1.
X is locally bounded if θ has a bounded neighborhood.
2.
X is locally compact if θ has a neighborhood whose closure is compact.
3.
X is metrizable if it is compatible with some metric d (i.e., τ is generated by the open balls B_r(x) = {y ∈ X, d(x, y) < r}).
4.
X is normable if it can be endowed with a norm whose induced metric is compatible with τ.
5.
X has the Heine-Borel property if every closed and bounded set is compact.

Proposition 1.21

Let (X, τ) be a topological vector space. For every x ≠ θ the set $Y= \{nx,~n \in \mathbb {N}\}$ is not bounded.

Proof

By separation, there exists an open neighborhood V of θ that does not contain x, hence nx∉nV, i.e., for every n,

$$\displaystyle \begin{aligned} Y\nsubseteq nV. \end{aligned}$$

■

Lemma 1.3

1.
Let d be a translation invariant metric on a vector space X, then for all $n \in \mathbb {N} $ and x ∈ X,
$$\displaystyle \begin{aligned} d(nx,\theta)\leq nd(x,\theta). \end{aligned}$$
2.
If x_n → θ in a metrizable topological vector space (X, τ), then there exist positive scalars α_n →∞ such that α_n x_n → θ.

Proof

The first part is obvious by successive applications of the triangle inequality,

$$\displaystyle \begin{aligned} d(nx,\theta)\leq \sum_{k=1}^{n}d(kx,(k-1)x)\leq nd(x,\theta). \end{aligned}$$

For the second, we note that since d(x_n, θ) → 0, there exists a diverging sequence of positive integers n_k, such that

$$\displaystyle \begin{aligned} d(x_{k},\theta)\leq \frac{1}{n_{k}^{2}}, \end{aligned}$$

from which we get that

$$\displaystyle \begin{aligned} d(n_{k}x_{k},\theta)\leq n_{k} d(x_{k},\theta) \leq \frac{1}{n_{k}} \rightarrow 0. \end{aligned}$$

■

Corollary 1.2

The only bounded subspace of a topological vector space is {θ}.

Proposition 1.22

Let (X, τ) be a topological vector space and let Y ⊆ X. Then, Y is bounded if and only if for every sequence (x_n)_n ⊆ Y and every sequence of scalars α_n → 0, α_n x_n → θ.

Proof

Suppose that Y is bounded, it suffices to apply Proposition 1.20.

Suppose that for every sequence (x_n)_n ⊆ Y and every sequence of scalars α_n → θ, α_n x_n → θ. If Y is not bounded, then there exists an open neighborhood of θ and a sequence β_n →∞, such that no β_n V contains Y . Take then a sequence (x_n)_n ⊆ Y such that x_n∉β_n V . Thus,

$$\displaystyle \begin{aligned} \beta_{n}^{-1}x_{n} \notin V, \end{aligned}$$

which implies that $\beta _{n}^{-1}x_{n}\nrightarrow \theta $, which is a contradiction. ■

Theorem 1.8

Let (X, τ) be a topological vector space. Let Y, Z ⊆ X satisfy:

$$\displaystyle \begin{aligned} Y \mathit{\mbox{ is compact}}, Z \mathit{\mbox{ is closed}}\,\, \mathit{\mbox{and}}\,\, Y\cap Z= \emptyset. \end{aligned}$$

Then there exists an open neighborhood V of θ such that

$$\displaystyle \begin{aligned} (Y+V)\cap (Z+V)= \emptyset. \end{aligned}$$

In other words, there exist disjoint open sets that contain Y and Z.

Proof

Let x ∈ Y . Since X ∖ Z is an open neighborhood of x, it follows that there exists a symmetric open neighborhood V_x of θ such that

$$\displaystyle \begin{aligned} x+V_{x}+V_{x}+V_{x} \subseteq X \setminus Z, \end{aligned}$$

i.e.,

$$\displaystyle \begin{aligned} (x+V_{x}+V_{x}+V_{x}) \cap Z= \emptyset. \end{aligned}$$

Since V_x is symmetric,

$$\displaystyle \begin{aligned} (x+V_{x}+V_{x}) \cap (Z+V_{x})=\emptyset. \end{aligned}$$

For every x ∈ Y corresponds such a V_x. Since Y is compact, there exists a finite collection (x_i, V_i)_1≤i≤n such that

$$\displaystyle \begin{aligned} K\subseteq \bigcup_{i=1}^{n}(x_{i}+V_{i}). \end{aligned}$$

Define

$$\displaystyle \begin{aligned} V= \bigcap_{i=1}^{n}V_{x_{i}}. \end{aligned}$$

Then, for every i,

$$\displaystyle \begin{aligned} (x+V_{x_{i}}+V_{x_{i}}) ~~\mbox{ does not intersect } (Z+V_{x_{i}}), \end{aligned}$$

so

$$\displaystyle \begin{aligned} (x+V_{x_{i}}+V) ~~\mbox{ does not intersect } ~ (Z+V). \end{aligned}$$

Taking the union over i:

$$\displaystyle \begin{aligned} Y+V\subseteq \bigcup_{i=1}^{n}(x_{i}+V_{x_{i}}+V) ~~\mbox{ does not intersect } ~ (Z+V). \end{aligned}$$

■

Remark 1.16

A topological vector space is regular.

Proposition 1.23

Suppose τ is a linear topology in a vector space X.

1.
For Y ⊆ X,
$$\displaystyle \begin{aligned} \overline{Y}= \bigcap_{V, \mathit{\mbox{open neighborhood of }}\,\theta}(Y+V). \end{aligned}$$

That is, the closure of a set is the intersection of all the open neighborhoods of that set.
2.
For $Y, Z\subseteq X, \overline {Y}+\overline {Z}\subseteq \overline {Y+Z}$.
3.
If Y ⊆ X is a linear subspace, then so is $\overline {Y}$.
4.
For every B ⊆ X : If B is balanced so is $\overline {B}$.
5.
For every B ⊆ X : If B is balanced and θ ∈int B then int B is balanced.
6.
If Y ⊆ X is bounded so is $\overline {Y}$.

Proof

1.
Let $x \in \overline {Y}$. By definition, for every open neighborhood V of θ, x + V intersects Y , of x ∈ Y − V . Thus,
$$\displaystyle \begin{aligned} x \in \bigcap_{V, \mbox{open neighborhood of }\,\theta}(Y-V)= \bigcap_{V, \mbox{open neighborhood of }\,\theta}(Y+V). \end{aligned}$$

Conversely, suppose that $x \notin \overline {Y}$. Then, there exists an open neighborhood V of θ such that x + V does not intersect Y, i.e., x∉Y − V , hence
$$\displaystyle \begin{aligned} x \notin \bigcap_{V, \mbox{open neighborhood of }\,\theta}(Y+V). \end{aligned}$$
2.
Let $x\in \overline {Y}$ and $y \in \overline {Z}$. By the continuity of vector addition, for every open neighborhood U of x + y there exists an open neighborhood V of x and an open neighborhood W of y such that
$$\displaystyle \begin{aligned} V+W\subseteq U. \end{aligned}$$

By the definition of $\overline {Y}$ every neighborhood of x intersects Y and by the definition of $\overline {Z}$ every neighborhood of y intersects W : that is, there exist z ∈ V ∩ Y and t ∈ W ∩ Z. Then,
$$\displaystyle \begin{aligned} z\in Y~~\mbox{and}~~t\in Z~~\mbox{implies}~~z+t\in Y+Z, \end{aligned}$$

and
$$\displaystyle \begin{aligned} z\in V~~\mbox{and}~~t\in W~~\mbox{implies}~~z+t\in V+W\subseteq U. \end{aligned}$$

In other words, every neighborhood of $x+y \in \overline {Y}+\overline {Z}$ intersects Y + Z, which implies that $x+y \in \overline {Y+Z}$, and therefore
$$\displaystyle \begin{aligned} \overline{Y}+\overline{Z}\subseteq \overline{Y+Z}. \end{aligned}$$
3.
Let Y be a linear subspace of X, which means that,
$$\displaystyle \begin{aligned} Y+Y\subseteq Y ~~\mbox{and}~~ \forall~\alpha \in \mathbb{K}, \alpha Y\subseteq Y. \end{aligned}$$

By the previous item,
$$\displaystyle \begin{aligned} \overline{Y}+\overline{Y}\subseteq \overline{Y+Y}\subseteq \overline{Y}. \end{aligned}$$

Since scalar multiplication is a homeomorphism it maps the closure of a set into the closure of its image, namely, for every $\alpha \in \mathbb {K},$
$$\displaystyle \begin{aligned} \alpha\overline{Y}\subseteq \overline{Y}. \end{aligned}$$
4.
Since multiplication by a (non-zero) is a homeomorphism,
$$\displaystyle \begin{aligned} \alpha\overline{B}= \overline{\alpha B}. \end{aligned}$$

If B is balanced, then for $\left \vert \alpha \right \vert \leq 1,$
$$\displaystyle \begin{aligned} \alpha\overline{B}=\overline{\alpha B}\subseteq \overline{B}, \end{aligned}$$

hence $\overline {B}$ is balanced.
5.
Again, for every $0< \left \vert \alpha \right \vert \leq 1,$
$$\displaystyle \begin{aligned} \alpha(\mathrm{int} B)= \mathrm{int} (\alpha B)\subseteq \mathrm{int} B. \end{aligned}$$

Since for α = 0, α(intB) = {θ}, we must require that θ ∈intB for the latter to be balanced.
6.
Let V be an open neighborhood of θ. Then there exists an open neighborhood W of θ such that $\overline {W}\subseteq V$. Since Y is bounded, $Y\subseteq \alpha W\subseteq \alpha \overline {W}\subseteq \alpha V$ for sufficiently large α. It follows that for large enough α,
$$\displaystyle \begin{aligned} \overline{Y}\subseteq \alpha\overline{W}\subseteq \alpha V, \end{aligned}$$

which proves that $\overline {Y}$ is bounded.

■

Lemma 1.4

Suppose τ is a linear topology in a vector space X.

1.
If Y is convex so is $\overline {Y}$.
2.
If Y is convex so is int Y .

Proof

1.
The convexity of Y implies that for all α ∈ [0, 1]:
$$\displaystyle \begin{aligned} \alpha Y+ (1-\alpha)Y \subseteq Y. \end{aligned}$$

Let α ∈ [0, 1], then
$$\displaystyle \begin{aligned} \alpha\overline{Y}= \overline{\alpha Y}~~\mbox{and}~~(1-\alpha)\overline{Y}=\overline{(1-\alpha)Y}. \end{aligned}$$

By the second item:
$$\displaystyle \begin{aligned} \alpha\overline{Y}+(1-\alpha)\overline{Y}= \overline{\alpha Y}+\overline{(1-\alpha)Y}\subseteq \overline{\alpha Y+(1-\alpha) Y}\subseteq \overline{Y}, \end{aligned}$$

which proves that $\overline {Y}$ is convex.
2.
Suppose once again that Y is convex. Let x, y ∈intY . This means that there exist open neighborhoods U, V of θ such that
$$\displaystyle \begin{aligned} x+U \subseteq Y~~\mbox{and}~~y+V\subseteq Y. \end{aligned}$$

Since Y is convex:
$$\displaystyle \begin{aligned} \alpha(x+U)+(1-\alpha)(y+V)= (\alpha x+(1-\alpha)y)+\alpha U+(1-\alpha)V\subseteq Y, \end{aligned}$$

which proves that αx + (1 − α)y ∈intY, namely intY is convex.

■

Lemma 1.5

Suppose τ is a linear topology in a vector space X. If Y is a convex subset of X, then:

$$\displaystyle \begin{aligned} 0< \alpha \leq1 ~~ \Longrightarrow \alpha (\mathrm{int}Y)+(1-\alpha)\overline{Y} \subseteq \mathrm{int}Y. \end{aligned} $$

(1.10)

In particular, if intY ≠ ∅, then:

(a)
The interior of Y is dense in $\overline {Y}$ , that is, $\overline {\mathrm {int}Y}= \overline {Y}$.
(b)
The interior of $\overline {Y}$ coincides with the interior of Y , that is, $\mathrm {int}\overline {Y}= \mathrm {int}Y$.

Proof

The case α = 1 in (1.10) is immediate. So let $x\in \mathrm {int}Y, y \in \overline {Y}$, and let 0 < α < 1. Choose an open neighborhood U of θ such that x + U ⊆ Y . Since $y-\frac {\alpha }{1-\alpha }U$ is a neighborhood of y, there is some $z \in Y \cap (y-\frac {\alpha }{1-\alpha }U)$, so that (1 − α)(y − z) belongs to αU. Since Y is convex, the (nonempty) open set V = α(x + U) + (1 − α)z = αx + αU + (1 − α)z lies entirely in Y . Moreover, from

$$\displaystyle \begin{aligned} \alpha x + (1-\alpha)y= \alpha x + (1-\alpha)(y-z)+\alpha x + (1-\alpha)z \in \alpha x + \alpha U + (1-\alpha)z= V\subseteq Y, \end{aligned}$$

we see that αx + (1 − α)y ∈intY . This proves (1.10), and letting α→0 proves (a).

For (b), fix x₀ ∈intY and $x \in \mathrm {int}\overline {Y}$. Pick a neighborhood of θ satisfying $x+ W \subseteq \overline {Y}$. Since W is absorbing, there is some 0 < λ < 1 such that λ(x − x₀) ∈ W, so $x +\lambda (x-x_{0}) \in \overline {Y}$. By (1.10), we have x − λ(x − x₀) = λx₀ + (1 − λ)x ∈intY . But then, using (1.10) once more, we obtain $x= \frac {1}{2}\left [x- \lambda (x-x_{0}) \right ] + \frac {1}{2}\left [x+ \lambda (x-x_{0})\right ] \in \mathrm {int}Y$. Therefore, $\mathrm {int}\overline {Y}\subseteq \mathrm {int}Y \subseteq \mathrm {int}\overline {Y}$ so that $\mathrm {int}\overline {Y}= \mathrm {int}Y$. ■

Definition 1.27

Let τ be a linear topology in a vector space X and Y ⊆ X.

1.
The closed convex hull of a set Y , denoted $\overline {\mathrm {conv}}(Y)$, is the smallest closed convex set including Y . By Lemma 1.4 1. it is the closure of conv(Y ), that is, $\overline {\mathrm {conv}}(Y)= \overline {\mathrm {conv}(Y)}$.
2.
The convex circled hull of Y is the smallest convex and circled set that includes Y . It is the intersection of all convex and circled sets that include Y .
3.
The closed convex circled hull of Y is the smallest closed convex circled set including Y . It is the closure of the convex circled hull of Y .

Definition 1.28

Let X be a vector space and let τ be a linear topology on X. Then (X, τ) is said to be locally convex if there is a base of neighborhoods of the origin in X consisting of convex sets.

Proposition 1.24

A locally convex space (X, τ) always has a base of neighborhoods of the origin consisting of open absorbing absolutely convex subsets.

Proof

Let V be a neighborhood of θ in X. Since (X, τ) is locally convex, there exists W convex neighborhood of θ such that W ⊆ V . Moreover, by Remark 1.13, there exists U balanced neighborhood of θ such that U ⊆ W. The balancedness of U implies that $U= \bigcup _{\alpha \in \mathbb {K}, \left \vert \alpha \right \vert \leq 1}\alpha U$. Thus, using that W is a convex set containing U, we get

$$\displaystyle \begin{aligned} N:= \mathrm{conv}\left(\bigcup_{\alpha \in \mathbb{K}, \left\vert \alpha\right\vert \leq 1}\alpha U\right)= \mathrm{conv}(U) \subseteq W \subseteq V \end{aligned}$$

and so intN ⊆ V . Hence, the conclusion holds because intN is clearly open and convex and it is also balanced since θ ∈intN and N is balanced. ■

1.2.3 Compactness and Completeness

Definition 1.29

Let (X, τ) be a topological vector space.

1.
A net (x_α)_{α ∈ I} in X is said to be a Cauchy net if for each neighborhood V of θ there exists α₀ ∈ I such that x_α − x_β ∈ V whenever α, β ≻ α₀.
2.
A set Y ⊆ X is complete if each Cauchy net in X converges to a point of Y .
3.
A set Y ⊆ X is sequentially complete if each Cauchy sequence in X converges to a point of Y

Example 1.8

Every convergent net is Cauchy.

Proposition 1.25

A Cauchy sequence (and in particular a converging sequence) in a topological vector space (X, τ) is bounded.

Proof

Let (x_n)_n be a Cauchy sequence. Let W, V be two balanced open neighborhoods of θ satisfying

$$\displaystyle \begin{aligned} V+V \subseteq W. \end{aligned}$$

By the definition of a Cauchy sequence, there exists an N such that for all m, n ≥ N,

$$\displaystyle \begin{aligned} x_{n}- x_{m} \in V, \end{aligned}$$

and in particular

$$\displaystyle \begin{aligned} \forall~ n> N~~~ x_{n}\in x_{N}+V. \end{aligned}$$

Set s > 1 such that x_N ∈ sV (we know that such an s exists), then for all n > N,

$$\displaystyle \begin{aligned} x_{n} \in sV+V \subseteq sV+sV \subseteq W. \end{aligned}$$

Since for balanced sets sW ⊆ tW for s < t, and since every open neighborhood of θ contains an open balanced neighborhood, this proves that the sequence is indeed bounded. ■

Proposition 1.26

Let {(X_i, τ_i)}_{i ∈ I} be a family of topological vector spaces, and let X =∏_{i ∈ I} X_i endowed with the product topology τ =∏_{i ∈ I} τ_i . Then (X, τ) is complete if and only if each factor (X_i, τ_i) is complete.

Proposition 1.27

Let (X, τ) be a topological vector space with a countable base of neighborhoods of θ. A set Y ⊆ X is complete if and only if Y is sequentially complete.

Proof

Let $\mathcal {B}= \{V_{n}\colon n \in \mathbb {N}\}$ be a countable base of neighborhoods of θ. We can assume that V₁ ⊇ V₂ ⊇⋯, indeed, otherwise we can substitute $\mathcal {B}$ with the base

$$\displaystyle \begin{aligned} \{V_{1}, V_{1}\cap V_{2}, V_{1}\cap V_{2}\cap V_{3},\cdots\}. \end{aligned}$$

Let Y be complete, and (x_n)_n a Cauchy sequence in Y . There exists a subnet (x_ϕ(α))_{α ∈ I} converging to a point x ∈ Y . Let us construct inductively a sequence (α_k) in I. Choose α₁ so that x_ϕ(α) ∈ x + V₁ for each α ≻ α₁. If we already have α₁, ⋯ , α_k, choose α_k+1 ≻ α_k so that ϕ(α_k+1) ≻ ϕ(α_k) + 1 and x_ϕ(α) ∈ x + V_k+1 for each α ≻ α_k+1. It is easy to verify that $(x_{\phi (\alpha _{k})})_{k \in \mathbb {N}}$ is a subsequence of (x_n)_n that converges to x.

Conversely, Let Y be sequentially complete, and (x_α)_{α ∈ I} a Cauchy net in Y . Let us construct inductively a sequence (α_k)_k in I. Choose α₁ so that $x_{\alpha }-x_{\alpha _{1}} \in V_{1}$ for each α ≻ α₁. If we already have α₁, ⋯ , α_k, choose α_k+1 ≻ α_k so that $x_{\alpha }-x_{\alpha _{k+1}} \in V_{k+1}$ for each α ≻ α_k+1. Then $(x_{\alpha _{n}})$ is a Cauchy sequence since $x_{\alpha _{m}}-x_{\alpha _{n}} \in V_{n}$ whenever m ≥ n. Consequently, $(x_{\alpha _{n}})$ converges to a point x ∈ Y . Now, it is easy to show that (x_α)_{α ∈ I} converges to x, too. ■

Definition 1.30

A set Y in a topological vector space (X, τ) is totally bounded (or precompact ) if for each neighborhood V of θ there is a finite set F ⊆ X such that Y ⊆ F + V .

It is easy to see that in normed spaces (or in topological metric spaces) this definition coincides with the usual metric one: for each ε > 0 there is a finite set F ⊆ X such that dist(x, F) < ε for each x ∈ Y .

Theorem 1.9

Let Y be a set in a topological vector space (X, τ). Then Y is totally bounded if and only if each net in Y admits a Cauchy subnet.

Proof

Let (x_α)_{α ∈ I} be a net in a totally bounded set Y . The family $\mathcal {Z}= \{ Z\subseteq Y\} \colon \mathcal {B}$ be a maximal subfamily of $\mathcal {Z}$ that contains Y and is closed under making finite intersections (existence of such $\mathcal {B}$ follows by Zorn’s lemma). Let us show several properties of $\mathcal {B}$.

(a)
if $\mathcal {F}$ is a finite subfamily of $\mathcal {Z}$ such that $\bigcup \mathcal {F} \in \mathcal {B}$, then $\mathcal {F}\cap \mathcal {B}\neq \emptyset $. Let $\mathcal {F}= \{Z_{1},\cdots ,Z_{n}\}$. We claim that, for some index k, $Z_{k}\cap B \in \mathcal {Z}$ for each $B \in \mathcal {B}$. Indeed, if this not the case, for each i ∈{1, ⋯ , n} there exists $B_{i} \in \mathcal {B}$ such that $Z_{i}\cap B_{i} \notin \mathcal {Z}$, but then $\mathcal {B}\ni (\bigcup _{i=1}^{n}Z_{i})\cap \bigcap _{i}^{n}B_{i}\subseteq \bigcup _{i=1}^{n}(Z_{i}\cap B_{i}) \notin \mathcal {Z}$, a contradiction. Our claim implies that the family of all finite intersections of elements of $\mathcal {B}\bigcup \{Z_{k}\}$ is closed under finite intersections and is contained in $\mathcal {Z}$. By maximality of $\mathcal {B}$, we must have $Z_{k}\in \mathcal {B}$.
(b)
For each set Z ⊆ Y , the family $\mathcal {B}$ contains either Z or Y ∖ Z. If $Z \notin \mathcal {Z}$, then eventually x_α ∈ Y ∖ Z. Since the intersection of Y ∖ Z with any element of $\mathcal {B}$ belongs to $\mathcal {Z}$, the family of finite intersections of $\mathcal {B}\cup \{Y \setminus Z\}$ is contained in $\mathcal {Z}$. Thus $Y \setminus Z \in \mathcal {B}$ by the maximality of $\mathcal {B}$. In the same way we get that $Y \setminus Z \notin \mathcal {Z}$ then $Z \in \mathcal {B}$. Finally, if both Z and Y ∖ Z belong to $\mathcal {Z}$ the one of them belongs to $\mathcal {B}$ by (a) (since $Y \in \mathcal {B}$).
(c)
$\mathcal {B}$ contains arbitrarily small elements, in the sense that for each neighborhood V of θ there exists $B \in \mathcal {B}$ such that B − B ⊆ V . Given a neighborhood V of θ, there exists a neighborhood W of θ with W − W ⊆ V . By total boundedness, there exists a finite set F = {y₁, ⋯ , y_n}⊆ Y such that Y ⊆ F + W. Denoting Y_i = (y_i + W) ∩ Y (i = 1, ⋯ , n), we have $Y= \cup _{i=1}^{n}Y_{i}$. Consider the set $P= \{i \in \{1,\cdots ,n\}\colon Y_{i} \in \mathcal {Z}\}$ and its complement {1, ⋯ , n}∖ P. Since $C= \bigcup _{i \in \{1,\cdots ,n\}\setminus P} Y_{i} \notin \mathcal {Z}$, we must have P ≠ ∅. Let Z =⋃_{i ∈ P} Y_i. Then $Y \setminus Z \notin \mathcal {Z} $ (since Y ∖ Z ⊆ C). By (b), we must have $Z \in \mathcal {B}$. By (a), there exists k ∈ P with $Y_{k}\in \mathcal {B}$. Notice that Y_k − Y_k ⊆ W − W ⊆ V .

To conclude the proof of this implication, notice that the family $\mathcal {B}$ satisfies the assumptions of Lemma 1.1. Hence there exists a subnet of (x_α) that is eventually contained in each element of $\mathcal {B}$. By (c), this subnet is Cauchy.

Conversely, assume that Y is not totally bounded. There exists a neighborhood V of θ such that Y ∖ (F + V ) ≠ ∅ for each finite set F ⊆ V . An easy inductive construction gives a sequence (x_n)_n such that x_n+1∉{x₁, ⋯ , x_n} + V for each n. Since for two indexes m > n we have x_m − x_n∉V , our sequence has no Cauchy subnets. The proof is complete. ■

Theorem 1.10

A set Y in a topological vector space is compact if and only if Y is totally bounded and complete.

Proof

Let Y be compact. Given an open neighborhood V of θ, the open cover {y + V : y ∈ Y } of Y admits a finite sub cover. This proves that Y is totally bounded. Let (x_α)_{α ∈ I} be a Cauchy net in Y . By Theorem 1.6 (x_α)_{α ∈ I} admits a subnet converging to a point of Y . It easily follows that the net (x_α)_{α ∈ I} converges to the same limit.

Conversely, assume Y is totally bounded and complete. Given a net (x_α)_{α ∈ I} in Y , it admits a Cauchy subnet by Theorem 1.9. Since Y is complete, this subnet converges to a point of Y . Again, it follows that (x_α)_{α ∈ I} converges to the same point. By Theorem 1.6, Y is compact. ■

1.2.4 Seminorms and Local Convexity

Definition 1.31

A seminorm on a vector space X is map $p\colon X\to \mathbb {R}$ such that

$$\displaystyle \begin{aligned} p(x+y)\leq p(x)+p(y), \end{aligned}$$

and

$$\displaystyle \begin{aligned} p(\alpha x)= \left\vert \alpha\right\vert p(x). \end{aligned}$$

Definition 1.32

Let $\mathcal {P}:= (p_{i})_{i \in I}$ be a family of seminorms. It is called separating if to each x ≠ θ corresponds a $p_{i}\in \mathcal {P},$ such that p_i(x) ≠ 0. Note that the separation condition is equivalent to

$$\displaystyle \begin{aligned} p_{i}(x)= 0, \forall~i \in I \Rightarrow x= \theta. \end{aligned}$$

Examples 1.7

1.
Suppose $X= \mathbb {R}^{n}$ and let Y be a vector subspace of X. Set for any x ∈ X
$$\displaystyle \begin{aligned} p_{Y}(x):= \inf_{y \in Y}\Vert {x-y} \Vert \end{aligned}$$

where ∥.∥ is the Euclidean norm, i.e., p_Y(x) is the distance from the point x to Y in the usual sense. If $\dim (Y) \geq 1$ then p_Y is a seminorm and not a norm (Y is exactly the kernel of p_Y). When Y = {θ}, p_Y(.) = ∥.∥.
2.
Let X be a vector space on which is defined a nonnegative sesquilinear Hermitian form $\varphi \colon X \times X \to \mathbb {K}$. Then the function
$$\displaystyle \begin{aligned} p_{\varphi}(x):= \varphi(x,x)^{\frac{1}{2}} \end{aligned}$$

is a seminorm. p_φ is a norm if and only if φ is positive definite (i.e., φ(x, x) > 0, ∀ x ≠ θ).
3.
Let $C(\mathbb {R})$ be the vector space of all real valued continuous functions on the real line. For any bounded interval [a, b] with $a, b \in \mathbb {R}$ and a < b, we define for any $f \in C(\mathbb {R})\colon $
$$\displaystyle \begin{aligned} p_{[a,b]}(f):= \sup_{a\leq t \leq b}\left\vert f(t)\right\vert . \end{aligned}$$

p_[a,b] is a seminorm but is never a norm because it might be that f(t) = 0 for all t ∈ [a, b] (and so that p_[a,b](f) = 0) but f≢0. Other seminorms are the following ones:
$$\displaystyle \begin{aligned} q(f):= \left\vert f(0)\right\vert ~~ \mbox{and}~~ q_{p}(f):= \left(\int_{a}^{b}\left\vert f(t)\right\vert ^{p}\right)^{\frac{1}{p}}~~ \mbox{for}~~ 1\leq p< \infty. \end{aligned}$$

Proposition 1.28

Let p be a seminorm on a vector space X.

1.
p is symmetric.
2.
p(θ) = 0.
3.
$\left \vert p(x)-p(y)\right \vert \leq p(x-y). $
4.
p(x) ≥ 0.
5.
$\ker p$ is a linear subspace.

Proof

By the properties of the seminorm:

1.
$p(x-y)= p(-(y-x))= \left \vert -1\right \vert p(y-x)= p(y-x). $
2.
p(θ) = p(0.x) = 0.p(x) = 0.
3.
This follows from the inequalities
$$\displaystyle \begin{aligned} p(x)\leq p(y)+p(x-y)~~ \mathrm{and}~~p(y)\leq p(x)+p(y-x)= p(x)+p(x-y). \end{aligned}$$
4.
By the previous item, for every x:
$$\displaystyle \begin{aligned} 0\leq \left\vert p(x)-p(\theta)\right\vert \leq p(x). \end{aligned}$$
5.
If $x, y \in \ker p \colon $
$$\displaystyle \begin{aligned} p(\alpha x+\beta y)\leq p(\alpha x)+p(\beta y)= \left\vert \alpha\right\vert p(x)+\left\vert \beta\right\vert p(y)= 0. \end{aligned}$$

■

Notation

Let X be a vector space and p a seminorm on X. The sets

$$\displaystyle \begin{aligned} B_{1}^{p}= \{x \in X \colon p(x)< 1\} ~~ \mbox{ and } ~~ \overline{B}_{1}^{p}= \{x \in X \colon p(x) \leq 1\}, \end{aligned}$$

are said to be, respectively, the open and the closed unit semiball of p.

Proposition 1.29

Let τ be a linear topology on the vector space X. Then the following conditions are equivalent:

1.
the open unit semiball $B_{1}^{p}$ of p is an open set.
2.
p is continuous at the origin.
3.
the closed unit semiball $\overline {B}_{1}^{p}$ of p is a barrel neighborhood of the origin.
4.
p is continuous at every point.

Proof

1. ⇒ 2. Suppose that $B_{1}^{p}$ is open in the topology τ on X. Then for any ε > 0 we have that $p^{-1}([0,\varepsilon [)= \{x \in X \colon p(x)\leq \varepsilon \}= \varepsilon B_{1}^{p}$ is an open neighborhood of the origin in X. This is enough to conclude that $p\colon X \to \mathbb {R}^{+}$ is continuous at the origin.

2. ⇒ 3. Suppose that p is continuous at the origin, then $\overline {B}_{1}^{p}= p^{-1}([0,1])$ is a closed neighborhood of the origin. Since $B_{1}^{p}$ is also absorbing and absolutely convex, $\overline {B}_{1}^{p}$ is a barrel.

3. ⇒ 4. Assume that 3. holds and fix θ ≠ x ∈ X. We have for any $\varepsilon > 0 \colon p^{-1}([-\varepsilon +p(x),\varepsilon +p(x)])= \{y \in X \colon \left \vert p(y)-p(x)\right \vert \leq \varepsilon \} \supseteq \{y \in X \colon p(y-x)\leq \varepsilon \} = x+\varepsilon \overline {B}_{1}^{p}$, which is a closed neighborhood of x since τ is a linear topology on X and by the assumption 3. Hence, p is continuous.

4. ⇒ 1. If p is continuous on X then 1. holds because the preimage of an open set under a continuous function is open and $B_{1}^{p}= p^{-1}([0,1[)$. ■

Definition 1.33

Let X be a vector space. For K ⊆ X convex and radial at θ (equivalently, K is absorbing), we define the Minkowski functional of K as

$$\displaystyle \begin{aligned} p_{K}(x)= \inf\{t > 0 \colon \frac{x}{t} \in K\}. \end{aligned}$$

Intuitively, p_K(x) is the factor by which x must be shrunk in order to reach the boundary of K.

Definition 1.34 (Topology Induced from Seminorms)

Let (p_i)_{i ∈ I} a family of seminorms on a vector space X. Then the ith open strip of radius r centered at x ∈ X is

$$\displaystyle \begin{aligned} B_{r}^{i}(x)= \{y\in X \colon p_{i}(x-y)< r\}. \end{aligned}$$

Let Λ be the collection of all open strips in X :

$$\displaystyle \begin{aligned} \Lambda= \{B_{r}^{i}(x)\colon i \in I, r > 0, x \in X\}. \end{aligned}$$

The topology τ( Λ) generated by Λ is called the topology induced by (p_i)_{i ∈ I}.

The fact that p_i is a seminorm ensures that each open strip $B_{r}^{i}(x)$ is convex. Hence all finite intersections of open strips will also be convex.

Theorem 1.11

Let (p_i)_{i ∈ I} be a family of seminorms on a vector space X. Then

$$\displaystyle \begin{aligned} \mathcal{B}= \Big\{ \bigcap_{j=1}^{n}B_{r}^{i_{j}}(x) \colon n \in \mathbb{N} , i_{j}\in I, r> 0, x \in X \Bigg\} \end{aligned}$$

forms a base for the topology induced from these seminorms. In fact, if U is open and x ∈ U, then there exists an r > 0 and i₁, ⋯ , i_n ∈ I such that

$$\displaystyle \begin{aligned} \bigcap_{j=1}^{n}B_{r}^{i_{j}}(x) \subseteq U. \end{aligned}$$

Further, every element of $\mathcal {B}$ is convex.

Proof

Suppose U ⊆ X and x ∈ U. In order to show that $\mathcal {B}$ is a base for the topology, we have to show that there exists some set $B \in \mathcal {B}$ such that x ∈ B ⊆ U. By the characterization of the generated topology, U is a union of finite intersections of elements of Λ. Hence we have

$$\displaystyle \begin{aligned} x \in \bigcap_{j=1}^{n}B_{r_{j}}^{i_{j}}(x_{j}) \end{aligned}$$

for some n > 0, i_j ∈ I, r_j > 0, and x_j ∈ X. Then $x \in B_{r_{j}}^{i_{j}}(x_{j})$, so, by definition $p_{i_{j}}(x-x_{j})< r_{j}$ for each j. Therefore, if we set

$$\displaystyle \begin{aligned} r=\min\{r_{j}-p_{i_{j}}(x-x_{j})\colon j= 1,\cdots,n\}, \end{aligned}$$

then we have $B_{r}^{i_{j}}(x) \subseteq B_{r_{j}}^{i_{j}}(x_{j})$ for each j = 1, ⋯ , n. Hence

$$\displaystyle \begin{aligned} B= \bigcap_{j=1}^{n}B_{r}^{i_{j}}(x) \in \mathcal{B}, \end{aligned}$$

and we have x ∈ B ⊆ U. ■

Proposition 1.30

Let (p_i)_{i ∈ I} be a family of seminorms on a vector space X. Then the induced topology on X is Hausdorff if and only if the family (p_i)_{i ∈ I} is separating.

Remark 1.17

If any one of the seminorms in our family is a norm, then the corresponding topology is automatically Hausdorff (for example, this is the case for $C_{b}^{\infty }(\mathbb {R})$). On the other hand, the topology can be Hausdorff even if no individual seminorms in a norm (consider $L_{loc}^{1}(\mathbb {R})$).

Examples 1.8

1.
Given an open subset Ω of $\mathbb {R}^{m}$ with the euclidean topology, the space C( Ω) of real valued continuous functions on Ω with the so-called topology of uniform convergence on compact sets is a locally convex topological vector space. This topology is defined by the family $\mathcal {P}$ of all the seminorms on C( Ω) given by
$$\displaystyle \begin{aligned} p_{K}(f):= \max_{x\in K}\left\vert f(x)\right\vert ,~\forall ~K \subseteq \Omega ~ \mbox{compact}. \end{aligned}$$

Moreover, the linear topology $\tau _{\mathcal {P}}$ induced from the family $\mathcal {P}$ is Hausdorff, because the family $\mathcal {P}$ is clearly separating. In fact, if p_K(f) = 0, ∀ K compact subsets of Ω then in particular $p_{\{x\}}(f):= \left \vert f(x)\right \vert = 0~\forall ~x \in \Omega ,$ which implies f ≡ 0 on Ω.

More generally, for any X locally compact we have that C(X) with the topology of uniform convergence on compact subsets of X is a locally convex topological vector space.
2.
Let $\mathbb {N}_{0}$ be the set of all non-negative integers. For any $x= (x_{1},\cdots ,x_{m}) \in \mathbb {R}^{m}$ and $\alpha = (\alpha _{1},\cdots ,\alpha _{m}) \in \mathbb {N}_{0}^{m}$ one defines $x^{\alpha }:= x_{1}^{\alpha _{1}}\cdots x_{m}^{\alpha _{m}}$. For any $\beta \in \mathbb {N}_{0}^{m}$, the symbol D ^β denotes the partial derivative of order $\left \vert \beta \right \vert $ where $\left \vert \beta \right \vert := \sum _{i=1}^{m}\beta _{i},$ i.e.,
$$\displaystyle \begin{aligned} D^{\beta}:= \frac{\partial^{\left\vert \beta\right\vert }}{\partial x_{1}^{\beta_{1}}\cdots \partial x_{m}^{\beta_{m}}}= \frac{\partial^{\beta_{1}}}{\partial x_{1}^{\beta_{1}}}\cdots \frac{\partial^{\beta_{m}}}{\partial x_{m}^{\beta_{m}}}. \end{aligned}$$
1. (a)
  Let $\Omega \subseteq \mathbb {R}^{m}$ open in the euclidean topology. For any $k \in \mathbb {N}_{0},$ let C ^k( Ω) be the set of all real valued k −times continuously differentiable functions on Ω, i.e., all the derivatives of f of order ≤ k exist (at every point of Ω) and are continuous functions in Ω. Clearly, when k = 0 we get the set C( Ω) for all real valued continuous functions on Ω and when k = ∞ we get the so-called set of all infinitely differentiable functions or smooth functions on Ω. For any $k \in \mathbb {N}_{0}, C^{k}(\Omega )$ (with pointwise addition and scalar multiplication) is a vector space over $\mathbb {R}$. The topology given by the following family of seminorms on C ^k( Ω):
  $$\displaystyle \begin{aligned} p_{d,K}(f):= \sup_{\substack{ \beta \in \mathbb{N}_{0}^{m}\\ \left\vert \beta\right\vert \leq d }} \sup_{x \in K} \left\vert (D^{\beta} f)(x)\right\vert ,~\forall ~ K \subseteq \Omega ~ \mbox{compact}~ \forall \, d \in \{0,1,\cdots,k\}, \end{aligned}$$
  
  makes C ^k( Ω) into a locally convex topological vector space. (Note that when k = ∞ we have $m\in \mathbb {N}_{0}$. )
2. (b)
  The Schwartz space or space of rapidly decreasing functions on $\mathbb {R}^{m}$ is defined as the set $\mathcal {S}(\mathbb {R}^{m})$ of all real-valued functions which are defined and infinitely differentiable on $\mathbb {R}^{m}$ and which have the additional property (regulating their growth at infinity) that all their derivatives tend to zero at infinity faster than any inverse power of x, i.e.,
  $$\displaystyle \begin{aligned} \mathcal{S}(\mathbb{R}^{m})= \Bigg\{f \in C^{\infty}(\mathbb{R}^{m}) \colon \sup_{x \in \mathbb{R}^{m}} \left\vert x^{\alpha}D^{\beta} f(x)\right\vert < \infty,~ \forall \, \alpha, \beta \in \mathbb{N}_{0}^{m}\Bigg\}. \end{aligned}$$
  
  If f is a smooth function with compact support in $\mathbb {R}^{m}$ then $f \in \mathcal {S}(\mathbb {R}^{m})$, since any derivative of f is continuous and supported on a compact subset of $\mathbb {R}^{m}$, so x ^α(D ^β f(x)) has a maximum in $\mathbb {R}^{m}$ by the extreme value theorem.
  
  The Schwartz space $\mathcal {S}(\mathbb {R}^{m})$ is a vector space over $\mathbb {R}$ and the topology given by the family $\mathcal {P}$ of seminorms on $\mathcal {S}(\mathbb {R}^{m}) \colon $
  $$\displaystyle \begin{aligned} p_{\alpha,\beta}:= \sup_{x \in \mathbb{R}^{m}} \left\vert x^{\alpha}D^{\beta} f(x)\right\vert ,~ \forall \, \alpha, \beta \in \mathbb{N}_{0}^{m} \end{aligned}$$
  
  makes $\mathcal {S}(\mathbb {R}^{m})$ into a locally convex topological vector space. Indeed, the family is clearly separating, because if $p_{\alpha ,\beta }(f)= 0, ~ \forall \, \alpha , \beta \in \mathbb {N}_{0}^{m}$ then in particular $p_{0,0}(f)= \sup _{x \in \mathbb {R}^{m}} \left \vert f(x)\right \vert = 0~ \forall \, x \in \mathbb {R}^{m},$ which implies f ≡ 0 on $\mathbb {R}^{m}$.
  
  Note that $\mathcal {S}(\mathbb {R}^{m})$ is a linear subspace of $C^{\infty }(\mathbb {R}^{m})$, but its topology $\tau _{\mathcal {P}}$ on $\mathcal {S}(\mathbb {R}^{m})$ is finer than the subspace topology induced on it by $C^{\infty }(\mathbb {R}^{m})$.

Theorem 1.12

Let X be a vector space whose topology is induced from a family of seminorms (p_i)_{i ∈ I} . Then given any net (x_α)_{α ∈ J} and any x ∈ X, we have

$$\displaystyle \begin{aligned} x_{\alpha} \to x \Leftrightarrow \forall ~ i \in I, p_{i}(x-x_{\alpha}) \to 0. \end{aligned}$$

Proof

⇒. Suppose that x_α → x, and fix any i ∈ I and ε > 0. Then $B_{\varepsilon }^{i}(x)$ is an open neighborhood of x, so by definition of convergence with respect to a net, there exists an α₀ ∈ J such that

$$\displaystyle \begin{aligned} \alpha \succ \alpha_{0} \Rightarrow x_{\alpha} \in B_{\varepsilon}^{i}(x). \end{aligned}$$

Therefore for all α ≻ α₀ we have p_i(x − x_α) < ε, so p_i(x − x_α) → 0.

⇐. Suppose that p_i(x − x_α) → 0 for every i ∈ I, and let U be any open neighborhood of x. Then by Theorem 1.11, we can find an r > 0 and finitely many i₁, ⋯ , i_n ∈ I such that

$$\displaystyle \begin{aligned} x \in \bigcap_{j=1}^{n}B_{r}^{i_{j}}(x) \subseteq U. \end{aligned}$$

Now, given any j = 1, ⋯ , n we have $p_{i_{j}}(x-x_{\alpha }) \to 0$. Hence, for each j we can find α_j ∈ J such that

$$\displaystyle \begin{aligned} \alpha \succ \alpha_{j}~ \Rightarrow p_{i_{j}}(x-x_{\alpha})< r. \end{aligned}$$

Since J is a directed set, there exists some α₀ ∈ J such that α₀ ≻ α_j for j = 1, ⋯ , n. Thus, for all α ≻ α₀ we have $p_{i_{j}}(x-x_{\alpha })< r$ for each j = 1, ⋯ , n, so

$$\displaystyle \begin{aligned} x_{\alpha} \in \bigcap_{j=1}^{n}B_{r}^{i_{j}}(x) \subseteq U, ~~~ \alpha \succ \alpha_{0}. \end{aligned}$$

Hence x_α → x. ■

Corollary 1.3

Let X be a vector space whose topology is induced from a family of seminorms (p_i)_{i ∈ I} , let Y be any topological space, and fix x ∈ X. Then the following two statements are equivalent.

1.
T : X → Y is continuous at x.
2.
For any net (x_α)_{α ∈ J},
$$\displaystyle \begin{aligned} p_{i}(x-x_{\alpha}) \to 0 ~~~\mathit{\mbox{ for each }} i \in I~~ \Rightarrow T(x_{\alpha}) \to T(x) ~\mathit{\mbox{in}}~ Y. \end{aligned}$$

Proposition 1.31

Let X be a vector space whose topology is induced from a family of seminorms (p_i)_{i ∈ I} . Then,

1.
for all i ∈ I, p_i is continuous.
2.
A set Y ⊆ X is bounded if and only if p_i is bounded on Y for all i ∈ I.

Proof

1.
Let i ∈ I. Because of the reverse triangle inequality, p_i(x − x_α) → 0 implies p_i(x_α) → p_i(x). Hence each seminorm p_i is continuous with respect to the induced topology.
2.
Suppose Y ⊆ X is bounded. Take i ∈ I. Then $B_{1}^{p_{i}}$ is a neighborhood of θ. Hence,
$$\displaystyle \begin{aligned} Y \subseteq \rho B_{1}^{p_{i}} \end{aligned}$$

for some ρ > 0 (by definition of boundedness). Hence, for all x ∈ Y,
$$\displaystyle \begin{aligned} x \in \{\rho y \in X \colon p(y) < 1\}= \{\rho y \in X \colon p(\rho y) < \rho\}= \{ z \in X \colon p(z)< \rho\}, \end{aligned}$$

i.e., p(x) < ρ.

Conversely, if p_i(Y ) is bounded for every i ∈ I. Then there are numbers r_i such that
$$\displaystyle \begin{aligned} \sup_{x \in Y} p_{i}(x)< r_{i}. \end{aligned}$$

Let U be a neighborhood of θ. Again
$$\displaystyle \begin{aligned} \bigcap_{j=1}^{n}B_{r}^{i_{j}}(\theta)\subseteq U. \end{aligned}$$

Choose $m> \frac {M_{i_{j}}}{r_{i_{j}}} (1\leq j \leq n )$. If x ∈ Y then $p_{i_{j}}(\frac {x}{m})< \frac {M_{i_{j}}}{m} <r_{i_{j}} \Rightarrow \frac {x}{m} \in U \Rightarrow x\in mU.$

■

Theorem 1.13

If X is a vector space whose topology τ is induced from a separating family of seminorms (p_i)_{i ∈ I} , then (X, τ) is a locally convex topological vector space.

Proof

We have already seen that there is a base for the topology τ that consists of convex open sets, so we just have to show that vector addition and scalar multiplication are continuous with respect to this topology.

Suppose that ((λ_α, x_α))_{α ∈ J} is any net in $\mathbb {K} \times X$, and that (λ_α, x_α) → (λ, x) with respect to the product topology on $\mathbb {K} \times X$. This is equivalent to assuming that λ_α → λ in $\mathbb {K}$ and x_α → x in X. Fix any i ∈ I and any ε > 0. Suppose that p_i(x) ≠ 0. Since p_i(x − x_α) → 0, there exist α₁, α₂ ∈ J such that

$$\displaystyle \begin{aligned} \alpha \succ \alpha_{1}~~ \Rightarrow ~~\left\vert \lambda -\lambda_{\alpha}\right\vert < \min\Bigg\{\frac{\varepsilon}{2p_{i}(x)}, 1\Bigg\}, \end{aligned}$$

and

$$\displaystyle \begin{aligned} \alpha \succ \alpha_{2}~~ \Rightarrow~~ p_{i}(x-x_{\alpha})< \frac{\varepsilon}{2(\left\vert \lambda\right\vert +1)}. \end{aligned}$$

By definition of directed set, there exists α₀ ≻ α₁, α₂, so both of these inequalities hold for α ≻ α₀. In particular, $(\lambda _{\alpha })_{\alpha \succ \alpha _{0}}$ is a bounded net, with $\left \vert \lambda _{\alpha }\right \vert < \left \vert \lambda \right \vert + 1$ for all α ≻ α₀. Hence, for α ≻ α₀ we have

$$\displaystyle \begin{aligned} p_{i}(\lambda x-\lambda_{\alpha} x_{\alpha}) & \leq p_{i}(\lambda x-\lambda_{\alpha} x)+ p_{i}(\lambda_{\alpha} x-\lambda_{\alpha} x_{\alpha}) \\ & = \left\vert \lambda-\lambda_{\alpha}\right\vert p_{i}(x) + \left\vert \lambda_{\alpha}\right\vert p_{i}(x-x_{\alpha})\\ & <\frac{\varepsilon}{2} + \frac{\varepsilon}{2}= \varepsilon. \end{aligned}$$

If p_i(x) = 0 then we similarly obtain $p_{i}(\lambda x-\lambda _{\alpha } x_{\alpha })< \frac {\epsilon }{2}$ for α ≻ α₀. Thus we have p_i(λx − λ_α x_α) → 0. Since this is true for every i, Theorem 1.12 implies that λ_α x_α → λx. ■

Theorem 1.14

The topology of a locally convex topological vector space X is given by the collection of seminorms obtained as Minkowski functionals p _U associated to a local basis at θ consisting of convex balanced open.

Proof

The proof is straightforward. With or without local convexity, every neighborhood of θ contains a balanced neighborhood of θ. Thus, a locally convex topological vector space has a local basis $\mathcal {B}$ at θ of balanced convex open sets.

Every open $U \in \mathcal {B}$ can be recovered from the corresponding seminorm by

$$\displaystyle \begin{aligned} U= \mathrm{int}U= \{x \in X \colon p_{U}(x)< 1\}. \end{aligned}$$

Oppositely, every seminorm local basis open

$$\displaystyle \begin{aligned} \{x \in X \colon p_{U}(x)< r\} \end{aligned}$$

is simply rU. Thus, the original topology is at least as fine as the seminorm topology. ■

1.2.5 Metrizable Topological Vector Spaces

What does it take for a topological vector space (X, τ) to be metrizable? Suppose there is a metric d compatible with the topology τ. Thus, all open sets are unions of open balls, and in particular, the countable collection of balls $B_{\frac {1}{n}}(\theta )$ forms a local base at the origin.

Theorem 1.15

A Hausdorff topological vector space is metrizable if and only if zero has a countable neighborhood base. In this case, the topology is generated by a translation invariant metric.

Proof

Let (X, τ) be a topological vector space. If τ is metrizable, then τ has clearly a neighborhood base at θ. For the converse, assume that τ has a countable neighborhood base at θ. Choose a countable base {V_n} of circled neighborhoods of θ such that V_n+1 + V_n+1 + V_n+1 ⊆ V_n holds for each n. Now define the function ρ: X → [0, ∞) by

$$\displaystyle \begin{aligned} \rho(x)= \left \lbrace \begin{array}{ccc} 1, & \text{if} & x \notin V_{1}, \\ 2^{-k},& \text{if} & x \in V_{k}\setminus V_{k+1}, \\ 0, & \text{if} & x= \theta. \end{array} \right. \end{aligned}$$

Then it is easy to check that for each x ∈ X we have the following:

1.
ρ(x) ≥ 0 if and only if x = θ.
2.
x ∈ V_k for some k if and only if ρ(x) ≤ 2^−k
3.
ρ(x) = ρ(−x) and ρ(αx) ≤ ρ(x) for all $\left \vert \alpha \right \vert \leq 1$.
4.
lim_α→0 ρ(αx) = 0.

We also note the following property if and only ρ(x_n)→0.

Now by means of the function ρ we define the function Π: X → [0, ∞) via the formula

$$\displaystyle \begin{aligned} \Pi(x)= \inf \Bigg\{\sum_{i=1}^{n}\rho(x_{i}) \colon x_{1},\cdots,x_{n} \in X. \mbox{ and }\;\sum_{i=1}^{n}x_{i}= x \Bigg\}. \end{aligned}$$

The function Π satisfies the following properties.

(a)
Π(x) ≥ 0 for each x ∈ X.
(b)
Π(x + y) ≤ Π(x) + Π(y) for all x, y ∈ X.
(c)
$\frac {1}{2}\rho (x) \leq \Pi (x) \leq \rho (x)$ for each x ∈ X (so Π(x) = 0 if and only if x = θ).

Property (a) follows immediately from the definition of Π. Property (b) is straightforward. The proof of (c) will be based upon the following property:

$$\displaystyle \begin{aligned} \mbox{If}~\sum_{i=1}^{n}\rho(x_{i}) < \frac{1}{2^{m}}, then \sum_{i=1}^{n}x_{i} \in V_{m}. \end{aligned} $$

(1.11)

To verify (1.11), we use induction on n. For n = 1 we have $\rho (x_{1})< \frac {1}{2^{m}}$, and consequently x₁ ∈ V_m+1 ⊆ V_m is trivially true. For the induction step, assume that if {x_i: i ∈ I} is any collection of at most n vectors satisfying $\sum _{i\in I}\rho (x_{i}) < \frac {1}{2^{m}}$ for some $m \in \mathbb {N}$, then ∑_{i ∈ I} x_i ∈ V_m. Suppose that $\sum _{i=1}^{n+1}\rho (x_{i}) < \frac {1}{2^{m}}$ for some $m \in \mathbb {N}$. Clearly, we have $\rho (x_{i})< \frac {1}{2^{m+1}}$, so x_i ∈ V_m+1 for each 1 ≤ n + 1. We now distinguish two cases.

Case $1\colon \sum _{i=1}^{n+1}\rho (x_{i}) < \frac {1}{2^{m+1}}$

Clearly $\sum _{i=1}^{n}\rho (x_{i}) < \frac {1}{2^{m+1}}$, so by the induction hypothesis $\sum _{i=1}^{n}x_{i} \in V_{m+1}.$ Thus

$$\displaystyle \begin{aligned} \sum_{i=1}^{n+1}x_{i} = \sum_{i=1}^{n}x_{i} + x_{n+1} \in V_{m+1} + V_{m+1} \subseteq V_{m}. \end{aligned}$$

Case $2\colon \sum _{i=1}^{n+1}\rho (x_{i}) \geq \frac {1}{2^{m+1}}$

Let 1 ≤ k ≤ n + 1 be the largest k such that $\sum _{i=k}^{n+1}\rho (x_{i}) \geq \frac {1}{2^{m+1}}$. If k = n + 1, then $\rho (x_{n+1})= \frac {1}{2^{m+1}}$, so from $\sum _{i=1}^{n+1}\rho (x_{i})< \frac {1}{2^{m}}$ we have $\sum _{i=1}^{n}\rho (x_{i})< \frac {1}{2^{m+1}}$. But then, as in Case 1, we get $\sum _{i=1}^{n+1}x_{i} \in V_{m}$. Thus, we can assume that k < n + 1. Assume first that k > 1. From the inequalities $\sum _{i=1}^{n+1}\rho (x_{i})< \frac {1}{2^{m}}$ and $\sum _{i=k}^{n+1}\rho (x_{i}) \geq \frac {1}{2^{m+1}}$, we obtain $\sum _{i=1}^{k-1}\rho (x_{i})< \frac {1}{2^{m+1}}$. So our induction hypothesis yields $\sum _{i=1}^{k-1}x_{i} \in V_{m+1}$. Also by the choice of k we have $\sum _{i=k+1}^{n+1}\rho (x_{i})< \frac {1}{2^{m+1}}$, and thus by our induction hypothesis also we have $\sum _{i=k+1}^{n+1}x_{i} \in V_{m+1}$. Therefore, in this case we obtain

$$\displaystyle \begin{aligned} \sum_{i=1}^{n+1}x_{i}=\sum_{i=1}^{k-1}x_{i}+x_{k} +\sum_{i=k+1}^{n+1}x_{i} \in V_{m+1}+V_{m+1} + V_{m+1} \subseteq V_{m}. \end{aligned}$$

If k = 1, then we have $\sum _{i=2}^{n+1}\rho (x_{i})< \frac {1}{2^{m+1}}$, so $\sum _{i=2}^{n+1}x_{i} \in V_{m+1}$. This implies $\sum _{i=1}^{n+1}x_{i}=x_{1} + \sum _{i=2}^{n+1}x_{i} \in V_{m+1}+V_{m+1} \subseteq V_{m}$. This completes the induction and the proof of (1.11).

Next, we verify (c). To this end, let x ∈ X satisfy $\rho (x)= \frac {1}{2^{m}}$ for some m ≥ 0. Also, assume by way of contradiction that the vectors x₁, ⋯ , x_k satisfy $\sum _{i=1}^{k}x_{i}= x$ and $\sum _{i=1}^{k}\rho (x_{i})< \frac {1}{2}\rho (x)= \frac {1}{2^{m+1}}$. But then, from (1.11) we get $x= \sum _{i=1}^{k}x_{i} \in V_{m+1}$, so $\rho (x) \leq \frac {1}{2^{m+1}} < \frac {1}{2^{m}}= \rho (x)$, which is impossible. This contradiction, establishes the validity of (c).

Finally, for each x, y ∈ X define d(x, y) = Π(x − y) and note that d is a translation invariant metric that generates τ. ■

Definition 1.35

Let (X, τ) be a topological vector space.

1.
X is an F-space (completely metrizable topological vector space) if its topology is induced by a complete translationally invariant metric. In other words, a completely metrizable topological vector space is a complete topological vector space having a countable neighborhood base at θ. Every Banach space is an F-space. An F-space is a Banach space if in addition $d(\alpha x,\theta )= \left \vert \alpha \right \vert d(x,\theta )$.
2.
X is a Fréchet space if it is a locally convex F-space.

Definition 1.36

A complete topological vector space (Y, Γ) is called a topological completion or simply a completion of another topological vector space (X, τ) if there is a linear homeomorphism T : X → Y such that T(X) is dense in Y , identifying X with T(X), we can think of X as a subspace of Y .

Theorem 1.16

Every topological vector space has a unique (up to linear homeomorphism) topological completion.

It turns out that the existence of a countable local base is also sufficient for metrizability. (It suffices that τ is induced from a separating countable family of seminorms (p_n)_n). Indeed, there exists a translation-invariant metric compatible with τ. One can show that the following is a compatible metric:

$$\displaystyle \begin{aligned} d(x,y)= \max_{n}\frac{\alpha_{n}p_{n}(x-y)}{1+p_{n}(x-y)}, \end{aligned}$$

where (α_n)_n is any sequence of positive numbers that decays to 0 (it is easy to see that the maximum is indeed attained). Clearly, d(x, x) = 0. Also, since the p_n’s are separating d(x, y) > 0 for x ≠ y. Symmetry, as well as translational invariance are obvious. Finally, the triangle inequality follows from the fact that every p_n is subadditive, and that a ≤ b + c implies that

$$\displaystyle \begin{aligned} \frac{a}{1+a} \leq \frac{b}{1+b}+\frac{c}{1+c}. \end{aligned}$$

It remains to show that this metric is compatible with the topology τ. One can also define the following translation-invariant metric compatible with τ

$$\displaystyle \begin{aligned} d(x,y)= \sum_{n=1}^{+\infty}2^{-n} \frac{p_{n}(x-y)}{1+p_{n}(x-y)}. \end{aligned}$$

Example 1.9

Let $s= \{ (x_{n})_{n \geq 1} \colon x \in \mathbb {K} \mbox{ for all } n \geq 1 \}$, the space of all scalar sequences. The topology of pointwise convergence is described by the seminorms $p_{k}, (k \geq 1), p_{k}((x_{n})_{n \geq 1})= \left \vert x_{k}\right \vert $ and the metric is

$$\displaystyle \begin{aligned} d(x,y)= \sum_{n=1}^{+\infty}2^{-n} \frac{\left\vert x_{n}-y_{n}\right\vert }{1+\left\vert x_{n}-y_{n}\right\vert }, ~~~ x= (x_{n})_{n \geq 1}, ~~ y= (y_{n})_{n \geq 1}. \end{aligned}$$

The ball $\overline {B}_{\frac {1}{4}}(\theta )= \{x \colon d(x,\theta ) \leq \frac {1}{4}\}$ is not convex, since $(1,0,0,\cdots ), (0,1,0,\cdots ) \in \overline {B}_{\frac {1}{4}}(\theta )$, but $\frac {3}{4}(1,0,0,\cdots ) + \frac {1}{4} (0,1,0,\cdots )= (\frac {3}{4},\frac {1}{4},0,0,\cdots ) \notin \overline {B}_{\frac {1}{4}}(\theta )$.

Theorem 1.17

Let (X, τ) be topological vector space that has a countable local base. Then there is a metric d on X such that:

1.
d is compatible with τ (every τ-open set is a union of d-open balls).
2.
The open balls B_r(θ) are balanced.
3.
d is invariant: d(x + z, y + z) = d(x, y).
4.
If, in addition, X is locally convex, then d can be chosen such that all open balls are convex.

Theorem 1.18

A topological vector space (X, τ) is normable if and only if there exists a convex bounded open neighborhood.

Proof

If (X, τ) is normable then B₁ = {x: ∥x∥ < 1} is convex and bounded. Suppose that there exists an open convex and bounded neighborhood V of θ. Set

$$\displaystyle \begin{aligned} U= \bigcap_{\left\vert \alpha\right\vert =1}\alpha V. \end{aligned}$$

Since U is the intersection of convex sets it is convex. It is balanced because for every $\left \vert \beta \right \vert \leq 1,$

$$\displaystyle \begin{aligned} \beta U=\bigcap_{\left\vert \alpha\right\vert = 1}\beta \alpha V= \bigcap_{\left\vert \alpha\right\vert =1}\left\vert \beta\right\vert \alpha V= \left\vert \beta\right\vert U, \end{aligned}$$

and by convexity,

$$\displaystyle \begin{aligned} \left\vert \beta\right\vert U= \left\vert \beta\right\vert U+(1-\left\vert \beta\right\vert )\{\theta\} \subseteq U. \end{aligned}$$

Since U contains θ, intU is balanced, it is also convex. Then there exists a convex and balanced (and certainly bounded) open neighborhood W = intU ⊆ V. Set

$$\displaystyle \begin{aligned} \Vert {x} \Vert = p_{W}(x), \end{aligned}$$

where p_W is the Minkowski functional of W. We will show that this indeed a norm. Clearly, ∥x∥ = 0 if and only if x = θ. Since W is balanced then $p_{W}(\alpha x)= \left \vert \alpha \right \vert p_{W}(x)$. The triangle inequality follows from the properties of p_W. It remains to show this norm is compatible with the topology τ. This follows from the fact that

$$\displaystyle \begin{aligned} B_{r}(\theta)= \{x \colon \Vert {x} \Vert < r\}= \{x \colon p_{W}(x)< r\}= \{x \colon p_{W}(\frac{x}{r})< 1\} \subseteq r W, \end{aligned}$$

which means that B_r(θ) is bounded, hence

$$\displaystyle \begin{aligned} \big\{B_{r}(\theta) \colon r> 0 \big\} \end{aligned}$$

is a local base. ■

Example 1.10

Let Ω be an open set in $\mathbb {R}^{m}$. We consider the space C( Ω) of all continuous functions. Note that the $\sup $-norm does not work here. There exist unbounded continuous functions on open sets.

Every open set Ω in $\mathbb {R}^{m}$ can be written as

$$\displaystyle \begin{aligned} \Omega= \bigcup_{n=1}^{\infty} K_{n}, \end{aligned}$$

where $K_{n} \Subset K_{n+1}$, where the K_n are compact, and $\Subset $ stands for compactly embedded, i.e., K_n is a compact set in the interior of K_n+1. We topologize C( Ω) with the separating family of seminorms,

$$\displaystyle \begin{aligned} p_{n}(f)= \max\{\left\vert f(x)\right\vert \colon x \in K_{n}\}= \Vert {f} \Vert _{K_{n}}. \end{aligned}$$

(These are clearly seminorms, and they are separating because for every f ≠ 0 there exists an n such that $f_{| K_{n}} \neq 0$).

Since the p_n’s are monodically increasing,

$$\displaystyle \begin{aligned} \bigcap_{d=1}^{D}\bigcap_{k=1}^{n}B^{k}_{\frac{1}{d}}(\theta)= \bigcap_{d=1}^{D}\bigcap_{k=1}^{n}\{f \colon p_{k}(f)< \frac{1}{d}\}= B^{n}_{\frac{1}{D}}(\theta), \end{aligned}$$

which means that the $B^{n}_{\frac {1}{D}}(\theta )$ form a convex local base for C( Ω). In fact, $B^{n}_{\frac {1}{D}}(\theta )$ contains a neighborhood obtained by taking n, D to be the greatest of the two, from which follows that

$$\displaystyle \begin{aligned} B^{n}_{\frac{1}{n}}(\theta)= \{f \colon p_{n}(f)< \frac{1}{n}\} \end{aligned}$$

is a convex local base for C( Ω), and the p_n’s are continuous in this topology. We can thus endow this topological space with a compatible metric, for example,

$$\displaystyle \begin{aligned} d(f,g)= \max_{n}\frac{2^{-n}p_{n}(f-g)}{1+p_{n}(f-g)}. \end{aligned}$$

We will now show that this space is complete. Recall that if a topological vector space has a compatible metric with respect to which is complete, then it is called an F-space. If, moreover, the space is locally convex, then it is called a Fréchet space. Thus, C( Ω) is a Fréchet space. Let (f_n)_n be a Cauchy sequence. This means that for every ε > 0 there exists an N, such that for every d, n > N,

$$\displaystyle \begin{aligned} \max_{k}\frac{2^{-k}p_{k}(f_{n}-f_{d})}{1+p_{k}(f_{n}-f_{d})}< \varepsilon, \end{aligned}$$

and so,

$$\displaystyle \begin{aligned} (\forall~k \geq 1)~~~\frac{2^{-k}p_{k}(f_{n}-f_{d})}{1+p_{k}(f_{n}-f_{d})}< \varepsilon, \end{aligned}$$

which means that (f_n)_n is a Cauchy sequence in each K_k (endowed with the $\sup $-norm), and hence converges uniformly to a function f. Given ε and let M such that 2^−M < ε, then

$$\displaystyle \begin{aligned} \max _{k > M}\frac{2^{-M}p_{k}(f_{n}-f)}{1+p_{k}(f_{n}-f)}< \varepsilon, \end{aligned}$$

and there exists an N, such that for every n > N,

$$\displaystyle \begin{aligned} \max_{k \leq M}\frac{2^{-M}p_{k}(f_{n}-f)}{1+p_{k}(f_{n}-f)}< \varepsilon, \end{aligned}$$

which implies that f_n→f, hence the space is indeed complete.

The question remains whether C( Ω) with this topology is normable. For this, the origin must have a convex bounded neighborhood. Recall that a set Y is bounded if and only if {p_n(f): f ∈ Y } is bounded for every n, i.e., if

$$\displaystyle \begin{aligned} \{\sup \{ \left\vert f(x)\right\vert \colon x \in K_{n}\} \colon f \in Y\} \end{aligned}$$

is a bounded set for every n, or if

$$\displaystyle \begin{aligned} \forall ~ n \geq 1~~~ \sup\{\left\vert f(x)\right\vert \colon x \in K_{n}, f \in Y\} < \infty. \end{aligned}$$

Because the $B^{n}_{\frac {1}{n}}(\theta )$ form a base, every neighborhood of θ contains a set

$$\displaystyle \begin{aligned} B^{k}_{\frac{1}{k}}(\theta), \end{aligned}$$

hence,

$$\displaystyle \begin{aligned} \sup\{\left\vert f(x)\right\vert \colon x \in K_{n}, f \in Y\} \geq \sup\{\Vert {f} \Vert _{K_{n}}\colon \Vert {f} \Vert _{K_{k}}< \frac{1}{k}\}. \end{aligned}$$

The right hand side can be made as large as we please for n > k, i.e., no set is bounded, and hence the space is not normable.

1.2.6 Finite Dimensional Topological Vector Spaces

Lemma 1.6

Let (X, τ) be a topological vector space. Any linear map $T\colon \mathbb {K}^{n} \to X$ is continuous.

Proof

Denote by (e_i)_1≤i≤n the standard basis in $\mathbb {K}^{n}$ and set

$$\displaystyle \begin{aligned} u_{j}=T(e_{j})~~~j=1,\cdots,n. \end{aligned}$$

By linearity, for any $x= (x_{1},\cdots ,x_{n})= \sum _{j=1}^{n} x_{j}e_{j}$

$$\displaystyle \begin{aligned} T(x)= \sum_{j=1}^{n} x_{j}u_{j}. \end{aligned}$$

The map x↦x_j (which is linear map $\mathbb {K}^{n} \to \mathbb {K}$) is continuous and so are addition and scalar multiplication in X. ■

Proposition 1.32

Let (X, τ) be a topological vector space. Then:

1.
Every finite dimensional subspace Y of X is a closed subset of X.
2.
If Y is an n-dimensional subspace of X and (u_i)_1≤i≤n is a basis for Y , then the map $T\colon \mathbb {K}^{n} \to Y$ defined by $T(x_{1},\cdots ,x_{n})= \sum _{j=1}^{n} x_{j}u_{j}$ is a topological isomorphism of $\mathbb {K}^{n}$ equipped with its Euclidean topology, onto X. That is, specifically, a net $(x^{\alpha })_{\alpha }= \left (\sum _{j=1}^{n} x_{j}^{\alpha }u_{j}\right )_{\alpha }$ converges to an element $x= \sum _{j=1}^{n} x_{j}u_{j} \in Y$ if and only if each net $(x_{j}^{\alpha })_{\alpha }$ converges to x_j, 1 ≤ j ≤ n.

Proof

1.
We prove part 1 by induction on the dimension of the subspace Y . First, if Y has dimension 1, let y ≠ θ ∈ Y be a basis for Y . If (λ_α y)_α is a net in Y that converges to an element x ∈ X, then the net (λ_α)_α must be eventually bounded in $\mathbb {K}$, in the sense that there must exist an index α₀ and a constant M such that $\left \vert \lambda _{\alpha }\right \vert \leq M$ for all α ≻ α₀. Indeed, if the net (λ_α)_α were not eventually bounded, let $(\lambda _{\alpha _{\beta }})_{\beta }$ be a subnet for which $\lim _{\beta }\left \vert \lambda _{\alpha _{\beta }}\right \vert = \infty $. Then
$$\displaystyle \begin{aligned} y&= \lim_{\beta}\frac{1}{\lambda_{\alpha_{\beta}}}\lambda_{\alpha_{\beta}}y \\ &= \lim_{\beta}\frac{1}{\lambda_{\alpha_{\beta}}}\lim_{\beta}\lambda_{\alpha_{\beta}}y \\ &= 0 \times x \\ &= \theta. \end{aligned}$$

which is a contradiction. So, the net (λ_α)_α is bounded. Let $(\lambda _{\alpha _{\beta }})_{\beta }$ be a convergent subnet of (λ_α)_α with limit λ. Then
$$\displaystyle \begin{aligned} x= \lim_{\alpha}\lambda_{\alpha}y= \lim_{\beta}\lambda_{\alpha_{\beta}}= \lambda y. \end{aligned}$$

whence x ∈ Y , and Y is closed.

Assume now that any n-1-dimensional subspace is closed, and let Y have dimension n > 1. Let {y₁, ⋯ , y_n} be a basis for Y , and write Y ′ for the linear span of y₁, ⋯ , y_n−1. Then elements y of Y can be written uniquely in the form y = y′ + λy_n, for y′∈ Y ′ and $\lambda \in \mathbb {K}$. Suppose that x is an element of the closure of Y , i.e., $x= \lim _{\alpha }(y^{\prime }_{\alpha }+ \lambda _{\alpha }y_{n})$. As before, we have that the net (λ_α)_α must be bounded. Indeed, if the net (λ_α)_α were not bounded, then let $(\lambda _{\alpha _{\beta }})_{\beta }$ be a subnet for which $\lim _{\beta }\left \vert \lambda _{\alpha _{\beta }}\right \vert =\infty $. Then
$$\displaystyle \begin{aligned} \theta= \lim_{\beta}\frac{1}{\lambda_{\alpha_{\beta}}} x= \lim_{\beta}\frac{y^{\prime}_{\alpha_{\beta}}}{\lambda_{\alpha_{\beta}}} + y_{n}, \end{aligned}$$

or
$$\displaystyle \begin{aligned} y_{n}= -\lim_{\beta}\frac{y^{\prime}_{\alpha_{\beta}}}{\lambda_{\alpha_{\beta}}}, \end{aligned}$$

implying that y_n belongs to the closure of the closed subspace Y ′, this is impossible, showing that the net (λ_α)_α is bounded. Hence, letting $(\lambda _{\alpha _{\beta }})_{\beta }$ be a convergent subnet of (λ_α)_α, say $\lambda = \lim _{\beta }\lambda _{\alpha _{\beta }}$, we have
$$\displaystyle \begin{aligned} x= \lim_{\beta}(y^{\prime}_{\alpha_{\beta}} + \lambda_{\alpha_{\beta}}y_{n}), \end{aligned}$$

showing that
$$\displaystyle \begin{aligned} x-\lambda y_{n}= \lim_{\beta}y^{\prime}_{\alpha_{\beta}}, \end{aligned}$$

whence, since Y ′ is closed, there exists a y′∈ Y ′ such that x − λy_n = y′. Therefore, x = y′ + λy_n ∈ Y , and Y is closed, proving part 1.
2.
We prove part 2 for real vector spaces. The map $T\colon \mathbb {R}^{n} \to Y$ of part 2 is obviously linear, one to one and onto. Also, it is continuous by previous lemma. Let us show that T ⁻¹ is continuous. Thus, let $(x^{\alpha })_{\alpha }=\left (\sum _{j=1}^{n} x_{j}^{\alpha }u_{j}\right )_{\alpha }$ converge to θ in Y . Suppose, by way of contradiction, that there exists an j for which the net $(x_{j}^{\alpha })_{\alpha }$ does not converge to 0. Then let $(x_{j}^{\alpha ^{\beta }})_{\beta }$ be a subnet for which $\lim _{\beta }x_{j}^{\alpha ^{\beta }}= x_{j}$, where x_j either is ±∞ or is a nonzero real number. Write $x^{\alpha }= x_{j}^{\alpha }u_{j} + x^{\prime \alpha }$. Then
$$\displaystyle \begin{aligned} \frac{1}{x_{j}^{\alpha^{\beta}}}x^{\alpha^{\beta}}=u_{j}+ \frac{1}{x_{j}^{\alpha^{\beta}}}x^{\prime \alpha^{\beta}}, \end{aligned}$$

whence
$$\displaystyle \begin{aligned} u_{j}= - \lim_{\beta}\frac{1}{x_{j}^{\alpha^{\beta}}}x^{\prime\alpha^{\beta}}, \end{aligned}$$

implying that u_j belongs to the (closed) subspace spanned by the vectors
$$\displaystyle \begin{aligned} u_{1},\cdots, u_{j+1},\cdots, u_{n}. \end{aligned}$$

and this is a contradiction, since the u_j’s form a basis of Y . Therefore, each of the nets $(x_{j}^{\alpha })_{\alpha }$ converges to 0, and T ⁻¹ is continuous.

■

Corollary 1.4

There exists a unique topology on $\mathbb {K}^{n}$ (viewed as a topological vector space), and all n-dimensional topological vector spaces are topologically isomorphic .

There are no infinite dimensional locally compact topological vector spaces. This is essentially due to F. Riesz.

Theorem 1.19

A topological space is locally compact if and only if is finite dimensional.

Proof

Let (X, τ) be a topological vector space. If X is finite dimensional, then τ coincides with the Euclidean topology and since the closed balls are compact sets, it follows that (X, τ) is locally compact.

For the converse assume that (X, τ) is locally compact and let V be a compact neighborhood of θ. From $V\subseteq \bigcup _{x \in V}(x+\frac {1}{2}V)$, we see that there exists a finite subset {x₁, ⋯ , x_k} of V such that

$$\displaystyle \begin{aligned} V\subseteq \bigcup_{i=1}^{k}(x_{i}+\frac{1}{2}V)= \{x_{1},\cdots,x_{k}\}+ \frac{1}{2}V. \end{aligned} $$

(1.12)

Let Y be a linear span of x₁, ⋯ , x_k. From (1.12), we get $V\subseteq Y+\frac {1}{2}V$. This implies $\frac {1}{2}V\subseteq \frac {1}{2}(Y+ \frac {1}{2}V)= Y+ \frac {1}{2^{2}}V$, so $V\subseteq Y+ (Y+ \frac {1}{2^{2}}V)= Y+ \frac {1}{2^{2}}V$. By induction we see that

$$\displaystyle \begin{aligned} V\subseteq Y+\frac{1}{2^{n}}V \end{aligned} $$

(1.13)

for each n. Next, fix x ∈ V . From (1.13), it follows that for each n there exist y_n ∈ Y and v_n ∈ V such that $x= y_{n} + \frac {1}{2^{n}}v_{n}$. Since V is compact, there exists a subnet $(v_{n_{\alpha }})$ of the sequence (v_n) such that $v_{n_{\alpha }} \stackrel \tau \longrightarrow v \in X$ (and clearly $\frac {1}{2^{n_{\alpha }}} \longrightarrow 0$ in $\mathbb {R})$. So

$$\displaystyle \begin{aligned} y_{n_{\alpha}} = x- \frac{1}{2^{n_{\alpha}}}v_{n_{\alpha}} \stackrel \tau \longrightarrow x-0v= x. \end{aligned}$$

Since (Proposition 1.32 1.) Y is a closed subspace, x ∈ Y . That is, V ⊆ Y . Since V is also an absorbing set, it follows that X = Y , so that X is finite dimensional. ■

Theorem 1.20

1.
Let Y₁, ⋯ , Y_n be compact convex sets in a vector space (X endowed with a linear topology τ). Then
$$\displaystyle \begin{aligned} \mathrm{conv}(Y_{1} \cup \cdots \cup Y_{n}) \end{aligned}$$

is compact.
2.
Let (X, τ) be a locally convex topological vector space. If Y ⊆ X is totally bounded then conv(Y ) is totally bounded as well.
3.
If (X, τ) is a Fréchet space and K ⊆ X is compact then $\overline {\mathrm {conv}}(K)$ is compact.
4.
If $K \subseteq \mathbb {R}^{n}$ is compact then conv(K) is compact.

Proof

1.
Let $S \subseteq \mathbb {R}^{n}$ be the simplex
$$\displaystyle \begin{aligned} S= \{(s_{1},\cdots,s_{n})~ :~ s_{i} \geq 0, ~ \sum_{i=1}^{n} s_{i}= 1\}. \end{aligned}$$

Set Y = Y₁ ×⋯ × Y_n and define the function φ: S × Y → X :
$$\displaystyle \begin{aligned} \varphi(s,y)= \sum_{i=1}^{n} s_{i}y_{i}. \end{aligned}$$

Consider the set K = φ(S × Y ). It is the continuous image of a compact set and it is therefore compact. Moreover,
$$\displaystyle \begin{aligned} K \supseteq \mathrm{conv}(Y_{1} \cup \cdots \cup Y_{n}). \end{aligned}$$

It is easy to show that K is convex, and since it includes all the Y_i’s it must in fact be equal to conv(Y₁ ∪⋯ ∪ Y_n).
2.
Let U be an open neighborhood of θ. Because X is locally convex there exists a convex open neighborhood V of θ such that
$$\displaystyle \begin{aligned} V+ V \subseteq U. \end{aligned}$$

Since Y is totally bounded there exists a finite set F such that
$$\displaystyle \begin{aligned} Y \subseteq F + V \subseteq \mathrm{conv}(F) + V. \end{aligned}$$

Since the right hand side is convex
$$\displaystyle \begin{aligned} \mathrm{conv}(Y) \subseteq \mathrm{conv}(F) + V. \end{aligned}$$

By the first item conv(F) is compact, therefore there exists a finite set F′ such that
$$\displaystyle \begin{aligned} \mathrm{conv}(F)= F' + V, \end{aligned}$$

i.e.,
$$\displaystyle \begin{aligned} \mathrm{conv}(Y) \subseteq F' + V + V \subseteq F' + U, \end{aligned}$$

which proves that conv(Y ) is totally bounded.
3.
In every metric space the closure of a totally bounded set is totally bounded, and if the space is complete it is compact. Since K is compact, then it is totally bounded. By the previous item conv(K) is totally bounded and hence its closure is compact.
4.
$S \subseteq \mathbb {R}^{n}$ be the convex simplex. One can show that conv(K) is the image of the continuous map S × K :
$$\displaystyle \begin{aligned} (s,x_{1},\cdots,x_{n}) \mapsto \sum_{i=1}^{n} s_{i}x_{i}, \end{aligned}$$

whose domain is compact.

■

Corollary 1.5

Let X be a vector space endowed with a linear topology τ. The convex hull of a finite set (polytope) is compact.

Example 1.11 (Noncompact Convex Hull)

Consider l₂, the space of all square summable sequences. For each n let . Observe that $\Vert {u_{n}} \Vert _{2}= \frac {1}{n}$, so . Consequently,

$$\displaystyle \begin{aligned} Y= \{u_{1},u_{2},u_{3},\cdots\}\cup\{\theta\} \end{aligned}$$

is norm compact subset of l₂. Since θ ∈ Y , it is easy to see that

$$\displaystyle \begin{aligned} \mathrm{conv}(Y)= \Bigg\{ \sum_{i=1}^{k} \alpha_{i}u_{i} \colon \alpha_{i} \geq 0 \mbox{ for each } i \,\mbox{and} \, \sum_{i=1}^{k} \alpha_{i} \leq 1\, \Bigg\}. \end{aligned}$$

In particular, each vector of conv(Y ) has only finitely many nonzero components. We claim that conv(Y ) is not norm compact. To see this, set

$$\displaystyle \begin{aligned} x_{n}= (\frac{1}{2},\frac{1}{2}.\frac{1}{2^{2}},\frac{1}{3}.\frac{1}{2^{3}},\cdots,\frac{1}{n}.\frac{1}{2^{n}},0,0,\cdots) = \sum_{i=1}^{n}\frac{1}{2^{i}} u_{i}, \end{aligned}$$

so x_n ∈conv(Y ). Now in l₂. But x∉conv(Y ), so conv(Y ) is not even closed, let alone compact.

Remark 1.18

In the above example, the convex hull of a compact set failed to be closed. The question remains whether the closure of the convex hull is compact. In general, the answer is no. To see this, let X the space of sequences that are eventually zero, equipped with the l₂-norm. Let Y as above, and note that $\overline {\mathrm {conv}}(Y)$ (where the closure is taken in X, not l₂) is not compact either. To see this, observe that the sequence (x_n)_n defined above has no convergent subsequence (in X).

Proposition 1.33

Let Y and Z are two nonempty convex subsets of a topological vector space (X, τ) such that Y is compact and Z is closed and bounded, then conv(Y ∪ Z) is closed.

Proof

Let x_i = (1 − α_i)y_i + α_i z_i→x, where 0 ≤ α_i ≤ 1, y_i ∈ Y and z_i ∈ Z for each i. By passing to a subnet, we can assume that y_i→y ∈ Y and α_i→α ∈ [0, 1]. If α > 0, then $z_{i} \longrightarrow \frac {x-(1-\alpha )y}{\alpha }= z \in Z$, and consequently x = (1 − α)y + αz ∈conv(Y ∪ Z).

Now consider the case α = 0. The boundedness of Z and Proposition 1.20 imply α_i z_i→θ, so x_i = (1 − α_i)y_i + α_i z_i→y. Since the space is Hausdorff, x = y ∈conv(Y ∪ Z). ■

1.2.7 The Weak Topology of Topological Vector Spaces and the Weak^∗ Topology of Their Duals

If X is a topological vector space then the weak topology on it is coarser than the origin topology: any set that is open in the original topology is open in the weak topology. From this, it follows that it is easier for a sequence to converge in the weak topology than in the original topology.

We will consider topological vector spaces (X, τ) over the field $\mathbb {K}, \mathbb {K}= \mathbb {R}$ or $\mathbb {K}= \mathbb {C}$. For definiteness we assume $\mathbb {K}= \mathbb {C}$.

Remark 1.19

Given a vector space X and a linear functional $\phi \colon X \to \mathbb {K}$, the map $p_{\phi }= \left \vert \phi \right \vert \colon X\ni x \mapsto \left \vert \phi (x)\right \vert \in [0,\infty [$ defines a seminorm on X.

Definition 1.37

Let (X, τ) be a topological vector space. The topological dual space X′ is the set of all continuous linear maps $(X, \tau ) \to \mathbb {K}$.

Next, we will discuss the geometric form of the Hahn-Banach theorems. The first geometric version is

Lemma 1.7

Let (X, τ) be a real topological vector space, and let V ⊆ X be a convex open set which contains θ. If x₀ ∈ X ∖ V , there exists ψ ∈ X′, such that ψ(x₀) = 1 and ψ(x) < 1, for all x ∈ V .

It turns out that Lemma 1.7 is a particular case of a more general result:

Theorem 1.21 (Hahn-Banach Separation Theorem-Real Case)

Let (X, τ) be a real topological vector space, let Z, W ⊆ X be nonempty convex sets with Z open, and Z ∩ W = ∅. Then there exists ψ ∈ X′, and a real number α, such that

$$\displaystyle \begin{aligned} \psi(z)< \alpha\leq \psi(w), \mathit{\mbox{ for all }} z\in Z, w\in W. \end{aligned}$$

Proof

Fix some points z₀ ∈ Z, w₀ ∈ W, and define the set

$$\displaystyle \begin{aligned} V= Z- W+ w_{0}- z_{0}= \{z- w+w_{0}- z_{0}\colon z\in Z, w\in W\}. \end{aligned}$$

It is straightforward that V is convex and contains θ. The equality

$$\displaystyle \begin{aligned} V= \bigcup_{w\in W}(Z- w+w_{0}- z_{0}) \end{aligned}$$

shows that V is also open. Define the vector x₀ = w₀ − z₀. Since Z ∩ W = ∅, it is clear that x₀∉V . Use Lemma 1.7 to produce ψ ∈ X′ such that

(i)
ψ(x₀) = 1,
(ii)
ψ(x) < 1, for all x ∈ V .

By the definition of x₀ and V , we have ψ(w₀) = ψ(z₀) + 1, and

$$\displaystyle \begin{aligned} \psi(z)< \psi(w)+ \psi(z_{0})- \psi(w_{0})+ 1, \mbox{ for all } z\in Z, w\in W, \end{aligned}$$

which gives

$$\displaystyle \begin{aligned} \psi(z)< \psi(w), \mbox{ for all } z\in Z, w\in W. \end{aligned} $$

(1.14)

Put

$$\displaystyle \begin{aligned} \alpha= \inf_{w\in W}\psi(w). \end{aligned}$$

The inequality (1.14) gives

$$\displaystyle \begin{aligned} \psi(z)\leq \alpha\leq \psi(w), \mbox{ for all } z\in Z, w\in W. \end{aligned} $$

(1.15)

The proof will be complete once we prove the following:

$$\displaystyle \begin{aligned} \psi(z)< \alpha \mbox{ for all } z\in Z. \end{aligned}$$

Suppose the contrary, i.e., there exists some z₁ ∈ Z with ψ(z₁) = α. Using the continuity of the map

$$\displaystyle \begin{aligned} \mathbb{R}\ni \beta \mapsto z_{1}+ \beta x_{0}\in X, \end{aligned}$$

there exists some ε > 0 such that

$$\displaystyle \begin{aligned} z_{1}+ \beta x_{0}\in Z, \mbox{ for all } \beta\in [-\varepsilon,\varepsilon]. \end{aligned}$$

In particular, by (1.15) one has

$$\displaystyle \begin{aligned} \psi(z_{1}+ \varepsilon x_{0})\leq \alpha, \end{aligned}$$

which means that

$$\displaystyle \begin{aligned} \alpha+ \varepsilon\leq \alpha, \end{aligned}$$

which is clearly impossible. ■

Theorem 1.22 (Hahn-Banach Separation Theorem-Complex Case)

Let (X, τ) be a complex topological vector space, let Z, W ⊆ X be nonempty convex sets with Z open, and Z ∩ W = ∅. Then there exists ψ ∈ X′, and a real number α, such that

$$\displaystyle \begin{aligned} \mathrm{Re}\,\psi(z)< \alpha\leq \mathrm{Re}\,\psi(w), \mathit{\mbox{ for all }} z\in Z, w\in W. \end{aligned}$$

Proof

Regard X as a real topological vector space, and apply the real version to produce an $\mathbb {R}$-linear continuous functional $\psi _{1}\colon X\to \mathbb {R}$, and a real number α, such that

$$\displaystyle \begin{aligned} \psi_{1}(z)< \alpha\leq \psi_{1}(w), \,x\in X \end{aligned}$$

Then the functional $\psi \colon X\to \mathbb {C}$ defined by

$$\displaystyle \begin{aligned} \psi(x)= \psi_{1}(x)- i\psi_{1}(ix), \,x\in X \end{aligned}$$

will clearly satisfy the desired properties. ■

Remark 1.20

Geometrically we can say that the hyperplane {Re ψ(x) = α} separates the sets Z, W in broad sense.

There is another version of the Hahn-Banach separation theorem, which holds for locally convex topological vector spaces.

Theorem 1.23

Let (X, τ) be a locally convex topological vector space. Suppose C, D ⊆ X are convex sets, with C compact, D closed, and C ∩ D = ∅. Then there exists ψ ∈ X′ and two numbers $\alpha , \beta \in \mathbb {R}$ , such that

$$\displaystyle \begin{aligned} \mathrm{Re}\,\psi(x)\leq \alpha< \beta\leq \mathrm{Re}\,\psi(y), \mathit{\mbox{ for all }} x\in C, y\in D. \end{aligned}$$

Proof

Let W = D − C. By Lemma 1.2, 4. W is closed. Since C ∩ D = ∅, we have θ∉W. Since W is closed, its complement X ∖ W will then be a neighborhood of θ. Since X is locally convex, there exists a convex open set Z, with θ ∈ Z ⊆ X ∖ W. In particular we have Z ∩ W = ∅. Applying the suitable version of the Hahn-Banach separation theorem (real or complex case), we find a linear continuous map $\psi \colon X\to \mathbb {K}$ and a real number γ, such that

$$\displaystyle \begin{aligned} \mathrm{Re}\,\psi(z)< \gamma\leq \mathrm{Re}\,\psi(w), \mbox{ for all } z\in Z, w\in W. \end{aligned}$$

Notice that θ ∈ Z, we get γ > 0. Then the inequality

$$\displaystyle \begin{aligned} \gamma\leq \mathrm{Re}\,\psi(w), \mbox{ for all } w\in W, \end{aligned}$$

gives

$$\displaystyle \begin{aligned} \mathrm{Re}\,\psi(y)- \mathrm{Re}\,\psi(x)\geq \gamma> 0, \mbox{ for all } x\in C, y\in D. \end{aligned}$$

Then if we define

$$\displaystyle \begin{aligned} \beta= \inf_{y\in D}\mathrm{Re}\,\psi(y) \,\mathrm{and}\, \alpha= \sup_{x\in C}\mathrm{Re}\,\psi(x), \end{aligned}$$

we get β ≥ α + γ, and we are done. ■

Remark 1.21

Geometrically we can say that the hyperplane {Re ψ(x) = β} separates the compact sets C and the closed set D in the strict sense.

One important feature of topological duals in the locally convex Hausdorff case is described by the following result.

Proposition 1.34

If (X, τ) is a locally convex topological vector space, then X′ separates the points of X, in the following sense: for any x, y ∈ X, such that x ≠ y, there exists ϕ ∈ X′, such that ϕ(x) ≠ ϕ(y).

Proof

Since X is locally convex and Hausdorff, there exists some open convex set V ∋ y such that x∉V . The existence of ϕ then follows from the Hahn-Banach separation theorem. ■

Definition 1.38

Let (X, τ) be a topological vector space. The weak topology on X, which we denote by σ(X, X′) , is the initial topology for X′. That is, σ(X, X′) is the coarsest topology on X such that each element of X′ is continuous $(X, \sigma (X,X')) \to ~\mathbb {C}$.

Equivalently, the weak topology on X is the seminorm topology given by the seminorms $\left \vert \phi \right \vert , \phi \in X'$.

Remark 1.22

The topologies τ and σ(X, X′) are comparable, and τ is at least as fine as σ(X, X′). That is, σ(X, X′) ⊆ τ. A vague rule is that the smaller X′ is compared to the set of all linear maps $(X, \sigma (X,X')) \to \mathbb {C}$, the smaller σ(X, X′) will be compared to τ.
If X′ separates X then (X, σ(X, X′)) is a locally convex topological vector space. It is Hausdorff because σ(X, X′) is induced by the separating family of seminorms $p_{\phi }= \left \vert \phi \right \vert , \phi \in X'$. In particular if (X, τ) is a locally convex topological vector space then (X, σ(X, X′)) is a locally convex topological vector space.

Definition 1.39

Let (X, τ) be a topological vector space and (x_α)_{α ∈ I} a net in X. We say that

1.
The net (x_α)_{α ∈ I} converges strongly to x and we write
$$\displaystyle \begin{aligned} x_{\alpha} \to x~~ \textit{if} ~(x_{\alpha})_{\alpha \in I} ~\mathrm{converges}\ \mathrm{to} ~ x ~\mathit{in} \,\mathit{the} \,\mathit{original} \,\mathit{topology} ~\tau. \end{aligned}$$
2.
The net (x_α)_{α ∈ I} converges weakly to x and we write

This condition is equivalent to the condition that p_ϕ(x_α − x) → 0, ∀ ϕ ∈ X′, which in turn is equivalent to
$$\displaystyle \begin{aligned} \phi(x_{\alpha})\to \phi(x),~~\forall\, \phi \in X'. \end{aligned}$$

A simple consequence of the fact that σ(X, X′) ⊆ τ is that

i.e., every strongly convergent net is weakly convergent.

Similarly, we will speak about the strong neighborhood, strongly closed, strongly bounded ⋯, and weak neighborhood, weakly closed, weakly bounded ⋯

Definition 1.40

We say that Y ⊆ X is weakly bounded if Y is a bounded subset of (X, σ(X, X′)): for every neighborhood N of θ in (X, σ(X, X′)) there is some c ≥ 0 such that Y ⊆{cx: x ∈ N} = cN (equivalently, ϕ(Y ) is bounded in $\mathbb {C})$.

Remark 1.23

If (X, τ) is an infinite dimensional locally convex topological vector space, the weak topology σ(X, X′) has a peculiar property: every weak neighborhood of θ contains a closed infinite dimensional linear subspace. Indeed, if we start with some neighborhood V , then there exist ϕ₁, ⋯ , ϕ_n ∈ X′ and ε₁, ⋯ , ε_n > 0, such that $\varepsilon _{1}B_{P_{\phi _{1}}}(\theta )\cap \cdots \cap \varepsilon _{n}B_{P_{\phi _{n}}}(\theta )$, where for i = 1, ⋯ , n, $B_{P_{\phi _{i}}}(\theta )= \{x \in X, \left \vert \phi _{i}(x)\right \vert < 1\}$. So V will clearly contain the closed subspace $(\ker \,\phi _{1})\cap \cdots \cap (\ker \,\phi _{n})$. It follows that

$$\displaystyle \begin{aligned} \dim X \leq n +\dim (\ker\,\phi_{1})\cap\cdots \cap(\ker\,\phi_{n}), \end{aligned}$$

i.e., $\dim (\ker \,\phi _{1})\cap \cdots \cap (\ker \,\phi _{n})= \infty $. Hence σ(X, X′) is not locally bounded.

Proposition 1.35

In any finite-dimensional normed space, the weak topology coincides with the topology generated by any norm.

Proof

Let X be a finite-dimensional vector space, let (e₁, ⋯ , e_d) be a basis in X, and let ϕ₁, ⋯ , ϕ_d be its dual basis, defined by ϕ_i(e_j) = δ_i,j. Then, $\Vert {x} \Vert _{\infty }= \max _{1\leq i \leq d} \left \vert \phi _{i}(x)\right \vert $ is a norm on X, and since X is finite-dimensional, all linear functionals on X are also continuous.

We know that on finite dimensional vector space two norms are equivalent, so it is enough to compare the weak topology to the topology τ induced by ∥∥_∞. It is clear that τ ⊇ σ(X, X′). On the other hand,

$$\displaystyle \begin{aligned} \left\vert x\right\vert {}_{\phi_{1}, \cdots, \phi_{d}}= \sup_{1\leq i \leq d}= \Vert {x} \Vert _{\infty}, x \in X, \end{aligned}$$

and hence the open ∥.∥_∞-balls around any point and with any radius are open in the weak topology. Hence, τ ⊆ σ(X, X′). ■

Theorem 1.24

Let X be an infinite-dimensional normed space and S_X = {x ∈ X : ∥x∥ = 1} be the unit sphere of X. The closure of the unit sphere in the weak topology is the whole closed unit ball, i.e.,

$$\displaystyle \begin{aligned} \overline{S_{X}}^{\sigma(X,X')}= \{x \in X \colon \Vert {x} \Vert \le1 \}. \end{aligned}$$

Similarly, one can show that B₁(θ) = {x ∈ X : ∥x∥ < 1} has empty interior for σ(X, X′). In particular it is not open. Despite these facts, there are sets whose weak closure is equivalent to its strong closure.

Remark 1.24

If (X, τ) is a locally convex topological vector space, then for any Y ⊆ X, then $\overline {\mathrm {conv}(Y)}^{\tau }= \overline {\mathrm {conv}(Y)}^{\sigma (X,X')}.$

Theorem 1.25

If Y ⊆ X is convex and (X, τ) is a locally convex topological vector space, then

1.
Y is σ(X, X′)-closed (weakly closed) if and only if Y is τ-closed (strongly closed) .
2.
Y is σ(X, X′)-dense if and only if Y is τ-dense.

Proof

1.
Since σ(X, X′) ⊆ τ, then if Y is σ(X, X′)-closed it is τ-closed. Conversely, if Y is τ-closed and convex, let x₀ ∈ X ∖ Y . Then by the Hahn-Banach separation theorem (for complex vector spaces) there is some ϕ ∈ X′ such that
$$\displaystyle \begin{aligned} \sup_{x \in Y}\mathrm{Re} (\phi(x))\le \gamma_{1}< \gamma_{2}\leq \mathrm{Re} (\phi(x_{0})) . \end{aligned}$$

Hence the neighborhood of x₀
$$\displaystyle \begin{aligned} x_{0}+V= x_{0}+\bigg\{x\colon \left\vert \gamma(x)\right\vert \leq \mathrm{Re} (\phi(x_{0}))-\gamma_{2}\bigg\} \end{aligned}$$

has empty intersection with Y .
2.
Obvious.

■

In particular, in a topological vector space, the closure of convex sets is convex.

If a sequence converges weakly, it need not converge in the original topology, and Mazur’s theorem shows that if a sequence in a metrizable locally convex space converges weakly then there is a sequence in the convex hull of the original sequence that converges to the same limit as the weak limit of the original sequence.

Theorem 1.26

(Mazur) Let X be a metrizable locally convex space. If , then there is a sequence (y_m)_m ⊆ X such that each y_m is a convex combination of finitely many x_n and such that y_m → x.

Proof

The convex hull of a subset Y of X is the set of all convex combinations of finitely many elements of Y . The convex hull of a set is convex and contains the set. Let Z be the convex hull of the sequence (x_n)_n and let W the weak closure of Z. Since and x_n ∈ Z, Theorem 1.25 tells us that $W= \overline {Z}$, so $x \in \overline {Z}$. But X is metrizable, so x being in the closure of Z implies that there is a sequence (y_m)_m ⊆ Z such that y_m → x. This sequence (y_m)_m satisfies the claim. ■

Let (X, τ) be a topological vector space. The dual space X′ does not come with an a priori topology.

Let x ∈ X, and define $f_{x}\colon X' \to \mathbb {C}$ by f_x(ϕ) = ϕ(x). Now f_x is linear. If ϕ₁, ϕ₂ ∈ X′ are distinct, then ϕ₁ − ϕ₂ ≠ 0 so there is some x ∈ X such that (ϕ₁ − ϕ₂)(x) ≠ 0, which tells us that f_x(ϕ₁) ≠ f_x(ϕ₂). Therefore the set {f_x: x ∈ X} is a separating family of seminorms on X′, hence generating a topology which makes X′ a locally convex topological vector space. We denote this topology by σ(X′, X) or w ^∗ and it is called the weak^∗ topology on X′. The open sets in the weak^∗ topology are generated by the subbase

$$\displaystyle \begin{aligned} B_{r}^{x}= \{\phi \in X' \colon \left\vert \phi(x)\right\vert < r\}. \end{aligned}$$

Lemma 1.8

(a)
The weak topology σ(X′, X) is the weakest topology on X′ such that each map f_x is continuous.
(b)
A sequence (ϕ_n)_n converges to ϕ in σ(X′, X) if and only if for all x ∈ X
$$\displaystyle \begin{aligned} \lim_{n \longrightarrow \infty}\phi_{n}(x)= \phi(x). \end{aligned}$$
(c)
A set Y ⊆ X′ is bounded w.r.t. σ(X′, X) if and only if for all x ∈ X
$$\displaystyle \begin{aligned} \{ \phi(x), \phi \in Y\} \end{aligned}$$

is bounded in $\mathbb {C}$.

Example 1.12

Recall that $c^{\prime }_{0}= l_{1}$ and $l_{1}^{\prime }= l_{\infty }$. Weak convergence of a sequence (x_n)_k ⊆ l₁ to zero (with l₁ viewed as a topological vector space) means that

$$\displaystyle \begin{aligned} \forall \, y= (y_{k})_{k} \subseteq l_{\infty}~~~\lim_{k \to \infty}\sum_{k=1}^{+\infty}(x_{n})_{k}y_{k}= 0. \end{aligned}$$

Weak^∗ convergence of a sequence (x_n)_k ⊆ l₁ to zero (with l₁ viewed as the dual of the topological vector space c₀) means that

$$\displaystyle \begin{aligned} \forall \, y= h(y_{k})_{k} \subseteq c_{0}~~~\lim_{k \to \infty}\sum_{k=1}^{+\infty}(x_{n})_{k}y_{k}= 0. \end{aligned}$$

Clearly, weak convergence implies weak^∗ convergence (but not the opposite).

A priori, one can look at the second dual Y of the locally convex vector space (X, σ(X′, X)), i.e.,

$$\displaystyle \begin{aligned} Y=\{\lambda\colon X' \to \mathbb{C}, \text{w.r.t}, \sigma(X',X)\}. \end{aligned}$$

By construction, it follows that X ⊆ Y,

i.e., X can be embedded into Y . It turns out that X = Y , i.e., the dual of (X, σ(X′, X)) can be identified with X.

Theorem 1.27

If $\lambda \colon X' \to \mathbb {C}$ is linear and continuous w.r.t, σ(X′, X), then there exists x ∈ X such that

$$\displaystyle \begin{aligned} \lambda(\phi)= \phi(x)~~\forall \phi \in X'. \end{aligned}$$

Proof

By definition of continuity w.r.t, σ(X′, X), for all 𝜖 > 0 there are δ > 0 and x₁, ⋯ , x_n such that

$$\displaystyle \begin{aligned} \lambda \{ \phi \colon \left\vert \phi(x_{i})\right\vert \le \delta, i=1,\cdots,n \} \subseteq (-\epsilon,\epsilon). \end{aligned}$$

In particular, if ϕ is such that ϕ(x_i) = 0 for all i, then λ(ϕ) = 0. This show that

$$\displaystyle \begin{aligned} N_{\phi}\supseteq\bigcap_{i=1}^{n}N_{x_{i}}. \end{aligned}$$

Consider the linear mapping $T\colon X' \to \mathbb {C}^{n+1}$ defined by

$$\displaystyle \begin{aligned} T(\phi)= (\lambda(\phi),\cdots,\phi(x_{1}),\cdots,\phi(x_{n})). \end{aligned}$$

By the assumption, T(X′) is a subspace of $\mathbb {C}^{n+1}$ and the point (1, 0, ⋯ , 0) is not in T(X′). Then there are $\alpha = (\alpha _{1},\cdots ,\alpha _{n+1}) \in \mathbb {C}^{n+1}$ such

$$\displaystyle \begin{aligned} \alpha T(X')= \{\alpha_{1}\lambda(\phi)+\sum_{i=2} ^{n+1} \alpha_{i} \phi(x_{i-1}), \phi \in X'\}= 0< \Re \alpha_{1}. \end{aligned}$$

It follows that α₁ ≠ 0 and

$$\displaystyle \begin{aligned} \lambda \phi= \sum_{i=1} ^{n}\frac{\alpha_{i+1}}{\alpha_{1}} \phi(x_{i}). \end{aligned}$$

■

If X is in particular a normed space, then we know that $(X',\Vert {.} \Vert _{X'})$ is a Banach space. Hence, if τ is the vector topology of X′ generated by the norm $,\Vert {.} \Vert _{X'}, \sigma (X',X)\subseteq \tau $.

Definition 1.41

We say that

The sequence (ϕ_n)_n converges strongly to ϕ and we write
$$\displaystyle \begin{aligned} \phi_{n}\longrightarrow \phi~~\text{if} ~\Vert {\phi_{n}-\phi} \Vert _{X'} \longrightarrow 0. \end{aligned}$$
The sequence (ϕ_n)_n converges weakly to ϕ and we write if (ϕ_n)_n converges to ϕ in the topology σ(X′, X).

The Banach-Alaoglu theorem shows that certain subsets of X′ are weak^∗ compact, i.e., they are compact subsets of σ(X′, X).

Definition 1.42

Let X be a topological vector space and V be a neighborhood of θ. Define the polar of V as

$$\displaystyle \begin{aligned} K= \Big\{ \phi \in X' \colon \left\vert \phi(x)\right\vert \leq 1~~\forall~x \in V\Big\}. \end{aligned}$$

Theorem 1.28 (Banach-Alaoglu)

Let X be a topological vector space and V be a neighborhood of θ. Then the polar K of V is compact in the weak ^∗ topology σ(X′, X).

Proof

Since each V local neighborhood absorbing, then there is a γ(x) ∈C such that

$$\displaystyle \begin{aligned} x \in \gamma(x)V. \end{aligned}$$

Hence it follows that

$$\displaystyle \begin{aligned} \left\vert \phi(x)\right\vert \le \gamma(x) ~~x \in X, ~ \phi \in K. \end{aligned}$$

Consider the topological space

$$\displaystyle \begin{aligned} P= \prod_{x \in X}\{ \alpha \in \mathbf{C}\colon \left\vert \alpha\right\vert \leq \gamma(x)\}, \end{aligned}$$

with the product topology σ. By Tychonoff’s theorem (P, σ) is compact.

By the construction, the elements of P are functions $f\colon X \to \mathbb {C}$ (not necessarily linear) such that

$$\displaystyle \begin{aligned} \left\vert f(x)\right\vert \leq \gamma(x). \end{aligned}$$

In particular, the set K is the subset of P made of the linear functions.

We first show that K is the subset of P w.r.t the topology σ. This follows from the fact that if f₀ is in the σ closure of $\overline {K}$, then the scalars α, β and point x, y ∈ X one has that

$$\displaystyle \begin{aligned} &\Bigg\{\left\vert f(\alpha x+\beta y)- f_{0}(\alpha x+\beta y)\right\vert < \varepsilon, \left\vert f(x)-f_{0}(x)\right\vert < \varepsilon, \left\vert f(y)-f_{0}(y)\right\vert <\varepsilon \Bigg\} \\ & \qquad \bigcap K \neq \emptyset. \end{aligned} $$

Take thus ϕ in the intersection, so that

$$\displaystyle \begin{aligned} \left\vert f_{0}(\alpha x+\beta y)- \alpha f_{0}( x) -\beta f_{0}(y)\right\vert &= \Bigg\vert (f_{0}(\alpha x+\beta y)- f(\alpha x+\beta y)) \\ & \quad +\alpha(f(x)-f_{0}(x))+(f(y)-f_{0}(y))\Bigg\vert\\ &< (1+\left\vert \alpha)\right\vert +\left\vert \beta\right\vert )\varepsilon. \end{aligned} $$

Since ε is arbitrary, f₀ is linear. Moreover, since $\left \vert f_{0}(x)\right \vert \le \gamma (x)$, then for x ∈ V

$$\displaystyle \begin{aligned} \left\vert f_{0}(x)\right\vert \le 1. \end{aligned}$$

It follows that we have two topologies on K :

the weak^∗ topology σ(X′, X) inherited by X′,
the product topology σ inherited by P. Since K is closed in (P, σ), then (K, σ) is compact.

To conclude, we need only to show that the two topologies coincide. This follows because the bases of the two topologies are generated by the sets

$$\displaystyle \begin{aligned} V_{\sigma(X',X)}= \Big\{ \left\vert \phi(x_{i})-\phi_{0}(x_{i})\right\vert < \varepsilon, i= 1\cdots,n \Big\}, \end{aligned}$$

$$\displaystyle \begin{aligned} V_{\sigma}= \Big\{ \left\vert f(x_{i})-f_{0}(x_{i})\right\vert < \varepsilon, i= 1\cdots,n \Big\}. \end{aligned}$$

There is thus a one to one correspondence among local bases, hence the two topologies coincide. ■

Theorem 1.29

Let (X, τ) be a separable topological vector space. Let K ⊆ X′ be weakly ^∗ compact. Then K is metrizable in the weak ^∗ topology.

Proof

Let $\{x_{n}, n \in \mathbb {N}\}$ be a dense subset of X and $f_{x_{n}}(\phi )= \phi (x_{n})$ for ϕ ∈ X′. By the definition of the weak^∗ topology on X′, the functionals $f_{x_{n}}$ are weak^∗ continuous. Also, for every n,

$$\displaystyle \begin{aligned} f_{x_{n}} (\phi_{1})= f_{x_{n}} (\phi_{2}), \end{aligned}$$

i.e.,

$$\displaystyle \begin{aligned} \phi_{1}(x_{n})= \phi_{2}(x_{n}), \end{aligned}$$

then ϕ₁ = ϕ₂ (continuous functionals that coincide on a dense set).

Thus, $\{f_{x_{n}}, n \in \mathbb {N}\}$ is a countably family of continuous functionals that separates points in X′. It follows by Proposition 1.9 that K is metrizable. ■

Remark 1.25

1.
The claim is not that X′ endowed with the weak^∗ topology is metrizable. For example, this is not true in infinite-dimensional Banach spaces.
2.
The topological space (X′, σ(X′, X)) is never metrizable, unless X has a countable vector base.

Theorem 1.30

Let X be a separable topological vector space. If V is a neighborhood of θ and if the sequence (ϕ_n)_n ⊆ X′ satisfies

$$\displaystyle \begin{aligned} \left\vert \phi_{n}(x)\right\vert \leq 1, ~~~~~n\ge 1, x \in V, \end{aligned}$$

then there is a subsequence (ϕ_α(n))_n and some ϕ ∈ X′ such that for all x ∈ X,

$$\displaystyle \begin{aligned} \lim_{n \to \infty}\phi_{\alpha(n)}(x)= \phi(x). \end{aligned}$$

Proof

The Banach-Alaoglu theorem implies that the polar

$$\displaystyle \begin{aligned} K= \Big\{ \phi \in X'\colon \left\vert \phi(x)\right\vert \le 1~~\forall~x \in V\Big\}, \end{aligned}$$

is weak^∗ compact. K with the subspace topology inherited from σ(X′, X) is compact, hence by Theorem 1.29 it is metrizable. Since the sequence (ϕ_n)_n is contained in K, it has a subsequence (ϕ_α(n))_n that converges weakly to some ϕ ∈ K. For each x ∈ X, the functional $f_{x}\colon (X',\sigma (X',X))\to \mathbb {C}$ defined by f_x(ϕ) = ϕ(x) is continuous, hence for all x ∈ X we have f_x(ϕ_α(n))) → f_x(ϕ), which is the claim. ■

Theorem 1.31

If (X, τ) is locally convex and Y ⊆ X, then Y is bounded in (X, τ) if and only if Y is bounded in (X, σ(X, X′)).

Dual of Banach Spaces and Reflexive Spaces

A particular case is when X is normed: in this case X′ is a Banach space with norm $\Vert {\phi } \Vert _{X'}= \sup _{\Vert {x} \Vert = 1}\left \vert \phi (x)\right \vert .$ One can introduce the second dual of X, i.e., denoted by X″. Clearly, there is a canonical immersion J of X into X″, by

$$\displaystyle \begin{aligned} J\colon X\to X'', ~~~J(x)(\phi)=\phi(x), \Vert {J(x)} \Vert _{X''}= \Vert {x} \Vert _{X}. \end{aligned}$$

Since J : X → X″ is continuous, it follows that J(X) is a closed subspace of X″. In particular, either J(X) = X″ or it is not dense.

Lemma 1.9 (Helly)

Let X be a Banach space, ϕ ∈ X′, i = 1⋯ , n, n linear functionals in X′ and $\alpha _{i} \in \mathbb {C}, i= 1\cdots , n, n$ scalars. Then the following properties are equivalent:

1.
for all ε > 0 there is x_ε, ∥x_ε∥ < 1 such that
$$\displaystyle \begin{aligned} \left\vert \phi(x_{\varepsilon})-\alpha_{i}\right\vert \le \epsilon~~~i= 1\cdots, n, \end{aligned}$$
2.
for all $\beta _{1},\cdots ,\beta _{n}\in \mathbb {C}$
$$\displaystyle \begin{aligned} \left\vert \sum_{i}^{n}\beta_{i}\alpha_{i}\right\vert \le \Vert {\sum_{i}^{n}\beta_{i}\phi_{i} } \Vert _{X'}. \end{aligned}$$

Proof

The first implication follows by

$$\displaystyle \begin{aligned} \left\vert \sum_{i}^{n}\beta_{i}\alpha_{i}\right\vert =\left\vert \sum_{i}^{n}\beta_{i}(\alpha_{i}-\phi_{i}(x_{\varepsilon}))\right\vert +\left\vert \sum_{i}^{n}\beta_{i}\phi_{i}(x_{\varepsilon})\right\vert \end{aligned}$$

$$\displaystyle \begin{aligned} \le \varepsilon \sum_{i}^{n}\left\vert \beta_{i}\right\vert +\Vert {\sum_{i}^{n}\beta_{i}\phi_{i}} \Vert _{X'}, \end{aligned}$$

since ∥x_ε∥≤ 1. Conversely if 1. does not hold, then this means that the closure of the set

$$\displaystyle \begin{aligned} (\phi_{1},\cdots,\phi_{n})\Big\{x\colon \Vert {x} \Vert \le 1\Big\}\subseteq\mathbb{C}^{n} \end{aligned}$$

does not contains (α₁, ⋯ , α_n). Thus there is $(\beta _{1},\cdots ,\beta _{n}) \in \mathbb {C}^{n}$ such that

$$\displaystyle \begin{aligned} \max \mathrm{Re}\,\Bigg\{ \sum_{i}^{n}\beta_{i}\phi_{i}(x), \Vert {x} \Vert \le1 \Bigg\}< \mathrm{Re}\,\Bigg\{ \sum_{i}^{n}\beta_{i}\alpha_{i} \Bigg\}\le \left\vert \sum_{i}^{n}\beta_{i}\alpha_{i}\right\vert . \end{aligned}$$

Since {x: ∥x∥≤ 1} is balanced, it follows that 2. is false. ■

Proposition 1.36 (Goldstine)

If X is a Banach space, then J(B_X) is dense in $B_{X''}$ for the weak ^∗ topology.

Proof

If ξ ∈ X″, take a neighborhood of the form

$$\displaystyle \begin{aligned} V= \Bigg\{\eta \in X'\colon \left\vert \eta(\phi_{i})-\xi(\phi_{i})\right\vert < \varepsilon, \phi_{i} \in X', i= 1\cdots n \Bigg\}. \end{aligned}$$

We need only to find x ∈ X such that

$$\displaystyle \begin{aligned} \left\vert \phi_{i}(x)-\xi(\phi_{i})\right\vert < \varepsilon. \end{aligned}$$

Since $\Vert {\xi } \Vert _{X''}\le 1$, then

$$\displaystyle \begin{aligned} \left\vert \sum_{i}^{n}\beta_{i}\xi(\phi_{i})\right\vert \le \Vert {\sum_{i}^{n}\beta_{i}\phi_{i} } \Vert _{X'}, \end{aligned}$$

so that for Lemma 1.9 it follows that there is an x_ε ∈ X which belongs to V . ■

Definition 1.43

A Banach space is reflexive if J(X) = X″.

It is important to observe that in the previous definition the canonical immersion J is used: even for particular non-reflexive spaces, one can find a continuous linear surjection from X to X″.

Theorem 1.32 (Kakutani)

The Banach space X is reflexive if and only if B_X is compact for the weak topology σ(X, X′).

Proof

If X is reflexive, then J : X → X″ is continuous, injective and surjective. Hence J ⁻¹ is linear and continuous w.r.t. the strong topologies of X and X″. Actually both J and J ⁻¹ are isometries.

It is clear that

$$\displaystyle \begin{aligned} J \bigg\{x\colon \left\vert \phi(x)\right\vert < \varepsilon \bigg\}= \bigg\{\eta\colon \left\vert \eta\phi\right\vert < \varepsilon \bigg\}, \end{aligned}$$

so that the topology J ⁻¹(σ(X″, X′)) coincides with the topology σ(X, X′). Since $B_{X''}$ is weak^∗ compact, so B_X.

Conversely, if B_X is compact, then J(B_X) is closed, and by Proposition 1.36 it coincide with the whole $B_{X''}$. ■

Theorem 1.33

If X is a Banach space and X′ is separable, then X is separable.

Proof

Let (ϕ_n)_n be a dense countable set in X′. Let x_n ∈ X, ∥x_n∥_X ≤ 1, be a point where

$$\displaystyle \begin{aligned} \left\vert \phi_{n}(x_{n})\right\vert \ge \frac{1}{2}\Vert {\phi_{n}} \Vert _{X'}, \end{aligned}$$

and consider the countable set

$$\displaystyle \begin{aligned} Q= \bigg\{ \sum_{\textit{finite}}\alpha_{i} x_{i} \colon \alpha_{i} ~\textit{belongs to a countable dense subset of} ~\mathbb{C} \bigg\}. \end{aligned}$$

Clearly Q is countable and dense in the vector space L generated by {x_n}_n, so that it remains to prove that L is dense in X.

If L is not dense, then there is a non null continuous functional ϕ such that

$$\displaystyle \begin{aligned} \phi\neq 0_{X'}~~\phi(x_{n})= 0~\forall ~n. \end{aligned}$$

Since (ϕ_n)_n is dense, there is n_ϕ such that $\Vert {\phi -\phi _{n_{\phi }}} \Vert _{X'}< \varepsilon ,$ so that

$$\displaystyle \begin{aligned} \Vert {\phi_{n_{\phi}}} \Vert _{X'} \le \left\vert \phi_{n_{\phi}}(x_{n_{\phi}})\right\vert \le \left\vert (\phi-\phi_{n_{\phi}})(x_{n_{\phi}})\right\vert +\left\vert \phi(x_{n_{\phi}})\right\vert \le \varepsilon. \end{aligned}$$

Thus $\Vert {\phi _{n_{\phi }}} \Vert _{X'} \le 2\varepsilon ,$ which implies that $\phi = 0_{X'}. $ ■

Proposition 1.37

If Y ⊆ X is a closed subspace of a reflexive space, then Y is reflexive.

Proof

The proof follows by proving that the topology σ(Y, Y ′) coincide with the topology Y ∩ σ(X, X′) and B_Y is closed for σ(X, X′) (closed for strong topology and convex). ■

Corollary 1.6

Let X be a normed space. Then, X is separable and reflexive if and only if X′ is separable and reflexive.

Proof

Clearly if X is reflexive, the unit ball $B_{X'}$ is compact for the topology σ(X′, X″) because of the Banach-Alaoglu theorem and the fact σ(X′, X″) = σ(X′, X). Moreover if X is reflexive and separable, then X″ is separable, hence by Theorem 1.33 is separable.

Conversely, if X′ is reflexive, then X″ is reflexive, so that M(X) is reflexive by Proposition 1.37, hence X is reflexive. Moreover, we know from Theorem 1.33 that X is separable, if X′ is separable. ■

Definition 1.44

We say that X Banach space is uniformly convex if for all ε > 0 there exists δ > 0 such that

$$\displaystyle \begin{aligned} \Vert {x} \Vert _{X}, \Vert {y} \Vert _{X}\le 1, \Vert {\frac{x+y}{2}} \Vert \geq 1-\delta \Longrightarrow \Vert {x-y} \Vert _{X}< \varepsilon. \end{aligned}$$

Theorem 1.34 (Milman)

If X is a uniformly convex Banach space, then X is reflexive.

Proof

Let $\xi \in X'', \Vert {\xi } \Vert _{X''}= 1. $ We want to prove that for all ε > 0 there is x ∈ X, ∥x∥_X ≤ 1 such that

$$\displaystyle \begin{aligned} \Vert {\xi - J(x)} \Vert _{X''}< \varepsilon. \end{aligned}$$

Since J(X) is strongly closed (J is an isometry), then J is surjective.

Let ϕ ∈ X′ be such that

$$\displaystyle \begin{aligned} \Vert {\phi} \Vert _{X'}= 1,~~~\xi\phi> 1- \delta, \end{aligned}$$

where δ is the constant chosen by the uniform convexity estimate corresponding to ε, and consider the neighborhood of ξ of the form

$$\displaystyle \begin{aligned} V= \Big\{ \eta \in X'' \colon \left\vert (\xi-\eta)(\phi)< \frac{\delta}{2}\right\vert \Big\}. \end{aligned}$$

By Proposition 1.36, it follows that there is some x ∈ B_X such that J(x) ∈ V.

Assume that $\xi \notin J(x)+\varepsilon B_{X''}$. Then we obtain a new neighborhood of ξ for the weak^∗ topology which does not contains x. With the same procedure, we can find a new $\overline {x}$ in this new neighborhood. Thus we have

$$\displaystyle \begin{aligned} \left\vert \phi(x)-\xi(\phi)\right\vert \le \frac{\delta}{2}, ~~\left\vert \phi(\overline{x})-\xi(\phi)\right\vert \le \frac{\delta}{2}. \end{aligned}$$

Adding we obtain

$$\displaystyle \begin{aligned} 2\left\vert \xi(\phi)\right\vert \le \left\vert \phi(x+\overline{x})\right\vert + \delta \le \Vert {x+\overline{x}} \Vert +\delta. \end{aligned}$$

Then $\Vert {\frac {x+\overline {x}}{2}} \Vert \ge (1-\delta ),$ so that $\Vert {x+\overline {x}} \Vert < \varepsilon $, which is a contradiction. ■

1.2.8 l₁-Sequences

Definition 1.45

Let (x_n)_n be a bounded sequence in a Banach space X, and ε > 0. We say that (x_n)_n admits ε-l₁-blocks if for every infinite $M\subseteq \mathbb {N}$ there are $a_{1},\cdots ,\cdots a_{r} \in \mathbb {K}$ with $\sum |a_{\rho }|= 1$ and i₁ < ⋯ < i_r in M such that $\|\sum a_{\rho }x_{i_{\rho }}\| \leq \varepsilon $.

Clearly there will be no subsequence of (x_n)_n equivalent to the l₁-basis iff (x_n)_n admits ε-l₁-blocks for arbitrary small ε > 0.

Theorem 1.35

Let X be a real (for simplicity) Banach space and (x_n)_n a bounded sequence. Suppose that, for some ε > 0, (x_n)_n admits small ε-l₁ -blocks. Then there is a subsequence $(x_{n_{k}})_{k}$ of (x_n)_n such that $(x_{n_{k}})_{k}$ is “close to being a weak Cauchy sequence” in the following sense:

$$\displaystyle \begin{aligned} \limsup_{k}\phi(x_{n_{k}}) - \liminf_{k}\phi(x_{n_{k}})\leq 2\varepsilon \end{aligned}$$

for every ϕ ∈ X′ with $\|\phi \|{ }_{X'}= 1$.

Proof

Suppose the theorem were not true. We claim that without loss of generality we may assume that there is a δ > 0 such that

$$\displaystyle \begin{aligned} \varphi((x_{n_{k}})_{k}):=\sup_{\|\phi\|{}_{X'}= 1}\left( \limsup_{k}\phi(x_{n_{k}})- \liminf_{k}\phi(x_{n_{k}})> 2\varepsilon+ \delta \right) \end{aligned} $$

(1.16)

for all subsequences $(x_{n_{k}})_{k}$. In fact, if every subsequence contained another subsequence with a φ-value arbitrarily close to 2ε, the diagonal process would even provide one where $\varphi ((x_{n_{k}})_{k})\leq 2\varepsilon $ in contrast to our assumption.

Fix a τ > 0 which will be specified later. After passing to a subsequence we may assume that (x_n)_n satisfies the following conditions:

(i)
If C and D are finite disjoint subsets of $\mathbb {N}$ there are a $\lambda _{0}\in \mathbb {R}$ and an ϕ ∈ X′ with $\|\phi \|{ }_{X'}= 1$ such that ϕ(x_n) < λ₀ for n ∈ C and ϕ(x_n) > λ₀ + 2ε + δ for n ∈ D.
(ii)
There are i₁ < ⋯ < i_r in $\mathbb {N}$, $a_{1},\cdots a_{r} \in \mathbb {R}$ with
$$\displaystyle \begin{aligned} \sum |a_{\rho}|= 1,\, |\sum a_{\rho}|< \tau, \, \|\sum a_{\rho}x_{i_{\rho}}\| \leq \varepsilon \end{aligned}$$

For (i), define, for $r \in \mathbb {N}$, T_r to be the collection of all (i₁, ⋯ , i_r) (with i₁ < ⋯ < i_r) such that there are a $\lambda _{0}\in \mathbb {R}$ and a normalized ϕ such that $\phi (x_{i_{\rho }})< \lambda _{0}$ if ρ is even and > λ₀ + 2ε + δ otherwise. (1.16) implies that there is an M₀ for which all (i₁, ⋯ , i_r) are in T_r for i₁ < ⋯ < i_r in M₀. Let us assume that $M_{0}= \mathbb {N}$. Let C and D be finite disjoint subsets of $2\mathbb {N}= \{2,4,\cdots \}$. We may select i₁ < ⋯ < i_r in $\mathbb {N}$ such that C ⊆{i_ρ | ρ even } and D ⊆{i_ρ | ρ odd }. Because of (i₁, ⋯ , i_r) ∈ T_r we have settled (i) provided C and D are in $2\mathbb {N}$, and all what’s left to do is to consider (x_2n)_n instead of (x_n)_n.

For (ii), By assumption we find i₁ < ⋯ < i_r, $a_{1},\cdots a_{r} \in \mathbb {R}$ such that $\sum |a_{\rho }|= 1$ and $\|\sum a_{\rho }x_{i_{\rho }}\| \leq \varepsilon $ with arbitrarily large i₁. Therefore we obtain $i^{1}_{1}< \cdots < i^{1}_{r_{1}}< i^{2}_{1}< \cdots < i^{2}_{r_{2}}< i^{3}_{1}< \cdots < i^{3}_{r_{3}}< \cdots $ and associated $a_{\rho }^{i}$. The numbers $\eta _{j}:= \sum _{\rho =1}^{r_{j}}a_{\rho }^{j}$ all lie in [−1, 1] so that we find j < k with |η_j − η_k|≤ 2τ. Let i₁ < ⋯ < i_r be the family $i^{j}_{1}< \cdots < i^{j}_{r_{j}}< i^{k}_{1}< \cdots < i^{k}_{r_{k}}$, and define the a₁, ⋯a_r by $\frac {1}{2}a^{j}_{1},\cdots \frac {1}{2}a^{j}_{r_{j}}, -\frac {1}{2}a^{k}_{1},\cdots -\frac {1}{2}a^{k}_{r_{k}}$.

We are now ready to derive a contradiction. On the one hand, by (ii), we find i₁ < ⋯ < i_r, $a_{1},\cdots ,\cdots a_{r} \in \mathbb {R}$ such that $\sum |a_{\rho }|= 1, |\sum a_{\rho }|\leq \tau $ with $\|\sum a_{\rho }x_{i_{\rho }}\| \leq \varepsilon $. On the other hand we may apply (i) with C := {i_ρ | a_ρ < 0} and D := {i_ρ | a_ρ > 0}. We put α := −∑_{ρ ∈ C} a_ρ, β :=∑_{ρ ∈ D} a_ρ, and we note that |α − β|≤ τ, α + β = 1 so that $|\beta -\frac {1}{2}|\leq \tau $, hence

$$\displaystyle \begin{aligned} \varepsilon\geq \|\sum a_{\rho}x_{i_{\rho}}\|\leq \sum a_{\rho}\phi(x_{i_{\rho}})\geq -\lambda_{0}\alpha+(\lambda_{0}+2\varepsilon+\delta)\beta\geq -|\lambda_{0}|\tau+\varepsilon+\frac{\delta}{2}-\tau\delta. \end{aligned}$$

This expression can be made larger than ε if τ has been chosen sufficiently small (note that the numbers |λ₀| are bounded by sup_n∥x_n∥), a contradiction which proves the theorem. ■

Remark 1.26

Since the unit vector basis (x_n)_n of real l₁ the assumption of the theorem holds with ε = 1 and since for every subsequence $(x_{n_{k}})_{k}$ one may find $\|\phi \|{ }_{X'}= 1$ with

$$\displaystyle \begin{aligned} \limsup_{k}\phi(x_{n_{k}}) - \liminf_{k}\phi(x_{n_{k}})= 2 \end{aligned}$$

there can be no better constant than that given in our theorem.

Theorem 1.36 (Rosenthal’s Theorem)

Let X be a Banach space and (x_n)_n a bounded sequence in X. If there exists no subsequence which is a weak Cauchy sequence then one can find a subsequence $(x_{n_{k}})_{k}$ which is equivalent with the unit vector basis of l₁ (i.e., $(\lambda _{k})_{k} \mapsto \sum \lambda _{k}x_{n_{k}}$ , from l₁ to X, is an isomorphism).

In particular one has: If X does not contain an isomorphic copy of l₁ , then every bounded sequence admits a subsequence which is a weak Cauchy sequence.

Proof

Rosenthal’s theorem is the assertion that (x_n)_n has a weak Cauchy subsequence provided it admits ε-l₁-blocks for all ε. So, it is simple to derive the theorem from Theorem 1.35. If (x_n)_n and thus every subsequence has ε-l₁-blocks for all ε, apply Theorem 1.35 successively with ε running through a sequence tending to zero. The diagonal sequence which is obtained from this construction will be a Cauchy sequence. ■

Remark 1.27

1.
Since weakly convergent sequences are weakly Cauchy it follows immediately that Rosenthal’s theorem holds in reflexive spaces.
2.
Rosenthal’s theorem holds, whenever X is such that X′ is separable. Let (x_n)_n be bounded and ϕ be a fixed functional. If we apply the Bolzano-Weierstrass theorem to the scalar sequence (ϕ(x_n))_n we get a subsequence $(x_{n_{k}})_{k}$ such that $(\phi (x_{n_{k}}))_{k}$ converges. Applying the same idea to $(x_{n_{k}})_{k}$ with a second functional, say ψ, we get a subsequence of this subsequence such that the application of ψ produces something which is convergent. ϕ, applied to this new subsequence, also gives rise to convergence. Thus we have a subsequence of (x_n)_n where ϕ and ψ converge, and similarly one can achieve this for any prescribed finite number of functionals. Even countably many functionals are manageable, by the diagonal process. Since we are dealing with bounded sequences (y_n)_n (typically subsequences of the original sequence) the collection of ϕ where (ϕ(y_n))_n converges is a norm closed subspace of X′.

There is a generalization of Rosenthal’s theorem to Fréchet spaces which, it seems, has been firstly by Díaz [44]. Thus the starting point for proving promised generalizations is to understand what it means for a sequence in a locally convex space be equivalent to the unit basis of l₁.

We denote by $l_{1}^{0}$ the subspace of l₁ formed by elements with only finitely many nonzero coordinates.

Barroso, Kalenda and Lin introduced the following notion of l₁-sequences in topological vector spaces [14].

Definition 1.46

Let (X, τ) be a topological vector space and (x_n)_n a sequence in X. We say that (x_n)_n is an l₁-sequence if the mapping $T_{0}\colon l_{1}^{0}\to X$ defined by

$$\displaystyle \begin{aligned} T_{0}((a_{i})_{i\geq 1})= \sum_{i=1}^{\infty}a_{i}x_{i} \end{aligned} $$

(1.17)

is an isomorphism of $l_{1}^{0}$ onto $T_{0}(l_{1}^{0})$.

The following characterization of l₁-sequences is given in [14].

Proposition 1.38

Let (X, τ) be a locally convex space and (x_n)_n a bounded sequence in X. The following are equivalent:

(i)
There is a continuous seminorm p on X such that
$$\displaystyle \begin{aligned} p\left(\sum_{i=1}^{n}a_{i}x_{i}\right)\geq \sum_{i=1}^{n}|a_{i}|, ~~~n\in\mathbb{N}, a_{1},\cdots,a_{n} \in \mathbb{R}. \end{aligned}$$
(ii)
(x_n)_n is an l₁ -sequence .

If X is sequentially complete, then these conditions are equivalent to the following:
(iii)
The mapping T : l₁ → X defined by $T((a_{i})_{i\geq 1})= \sum _{i=1}^{\infty }a_{i}x_{i}$ is a well defined isomorphism of l₁ onto its image in X

Proof

Let $T_{0}\colon l_{1}^{0}\to X$ be defined by (1.17). As (x_n)_n is bounded and X is locally convex, it is easy to check that T₀ is continuous.

Further, if (i) holds, then T₀ is clearly one-to-one and $T_{0}^{-1}$ is continuous. This proves (i) ⇒ (ii).

Conversely, suppose that (ii) holds. Set

$$\displaystyle \begin{aligned} U= T_{0}(\{x\in l_{1}^{0}\colon \|x\|{}_{l^{1}}< 1\}). \end{aligned}$$

As T₀ is an isomorphism, U is an absolutely convex open subset of $T_{0}(l_{1}^{0})$. We can find V , an absolutely convex neighborhood of θ in X such that $V\cap T_{0}(l_{1}^{0})\subset U$. Let p the Minkowski functional of V . Then p is a continuous seminorm witnessing that (i) holds. This proves (ii) ⇒ (i).

Now suppose that X is sequentially complete. As T₀ is continuous and linear, it is uniformly continuous and hence it maps Cauchy sequences to Cauchy sequences. In particular the mapping T₀ can be uniquely extended to a continuous linear mapping T : l₁ → X. This is obviously the mapping described in (iii). As $l_{1}^{0}$ is dense in l₁, we get (ii) ⇔ (iii). ■

The following theorem is a variant of Rosenthal’s theorem [14]. Its proof is a slight refinement of the proof of Lemma 3 in [44].

Theorem 1.37

Let (X, τ) be a metrizable locally convex space. Then each bounded sequence in X contains either a weakly Cauchy subsequence or a subsequence which is an l₁ -sequence .

Proof

Let (∥.∥_n) be a sequence of seminorms generating the topology of X. Without loss of generality we may assume that ∥x∥_n ≤∥x∥_n+1 for all n and x ∈ X. Let U_n = {x: ∥x∥_n < 1} and let $B_{n}= U_{n}^{0}$ be the polar of U_n. Assume that (x_m)_m is a bounded sequence in X such that no its subsequence is an l₁-sequence. For n = 0, 1, 2, ⋯ we construct a sequence $(x_{m}^{n})_{m}$ inductively as follows. Set $x_{m}^{0}= x_{m}$ for all $m\in \mathbb {N}$. Assume that for a given $n \in \mathbb {N}$ the sequence $(x_{m}^{n-1})_{m}$ has been defined. By Rosenthal’s theorem one of the following possibilities takes place (elements of X are viewed as functions on B_n):

(i)
$(x_{m}^{n-1})_{m}$ has a subsequence which is equivalent to the l₁-basis on B_n.
(ii)
$(x_{m}^{n-1})_{m}$ has a subsequence which point wise converges on B_n.

Let us show that the case (i) cannot occur. Indeed, suppose that (i) holds. Let (y_m)_m be the respective subsequence. The equivalence to the l₁ basis on B_n means that there is some C > 0 such that

$$\displaystyle \begin{aligned} \Vert {\sum_{i=1}^{m}a_{i}y_{i}} \Vert _{n}\geq C\sum_{i=1}^{m}\left\vert a_{i}\right\vert \end{aligned}$$

for each $m \in \mathbb {N}$ and each choice $a_{1},\cdots , a_{m} \in \mathbb {R}$. By Proposition 1.38 (y_m)_m is an l₁-sequence in X, which is a contradiction.

Thus the possibility (ii) takes place. Denote by $(x_{m}^{n})_{m}$ the respective subsequence. This completes the inductive construction.

Take the diagonal sequence $(x_{m}^{m})$. It is a subsequence of (x_m)_m which pointwise converges on B_n for each $n\in \mathbb {N}$. Moreover, if ϕ ∈ X′ is arbitrary, then there is n and c > 0 such that cϕ ∈ B_n. In particular, the linear span of the union of all $B^{\prime }_{n}s$ is the whole dual X′. It follows that the sequence $(x_{m}^{m})$ is weakly Cauchy. The proof is complete. ■

Remark 1.28

Let X = l₁ endowed with its weak topology. Let (e_n)_n denote the canonical basic sequence. Then, the sequence (e_n)_n contains neither a weakly Cauchy subsequence nor a subsequence which is an l₁-sequence . Indeed, suppose that (x_n)_n is an l₁-sequence in X. Denote by Y its linear span. By the definition of an l₁-sequence we get that Y is isomorphic to $(l_{1}^{0},\Vert {.} \Vert _{1})$, hence it is metrizable. On the other hand, by the definition of X we get that Y is equipped with its weak topology which is not metrizable as Y has infinite dimension.

Further, the sequence (e_n)_n contains no weakly Cauchy subsequence in (l₁, ∥.∥₁) and in (l₁, σ(l₁, (l₁)′)) coincide, we get that (e_n)_n contains no weakly Cauchy subsequence in X. Thus the proof is completed.

The following is given in [14] and is about the coincidence of norm and weak topologies.

Proposition 1.39

Let Γ be an arbitrary set. Then the norm and weak topologies coincide on the positive cone of l₁( Γ).

Proof

Denote by C the positive cone of l₁( Γ). Since the weak topology is weaker than the norm one, it is enough to prove that the identity of C endowed with the weak topology onto (C, ∥.∥) is continuous. Let x ∈ C and ε > 0 be arbitrary. Fix a nonempty finite set F ⊆ Γ such that

$$\displaystyle \begin{aligned} \sum_{\gamma \in F}x(\gamma)> \Vert {x} \Vert -\frac{\varepsilon}{4}. \end{aligned}$$

Set

$$\displaystyle \begin{aligned} ~~~~U= \Big\{y \in C\colon \left\vert y(\gamma)-x(\gamma)\right\vert < \frac{\epsilon}{4\left\vert F\right\vert } ~\mbox{for}~ \gamma \in F \Big\}, \end{aligned}$$

$$\displaystyle \begin{aligned} V= \Big\{y \in C\colon \sum_{\gamma \in \Gamma \setminus F}y(\gamma)-\sum_{\gamma \in \Gamma \setminus F}x(\gamma) <\frac{\epsilon}{4} \Big\}. \end{aligned}$$

Then both U and V are weak neighborhoods of x in C (recall that the dual of l₁( Γ) is represented by l_∞( Γ)), hence so U ∩ V . Moreover, if y ∈ U ∩ V , then

$$\displaystyle \begin{aligned} \Vert {y-x} \Vert &= \sum_{\gamma \in F}\left\vert y(\gamma)-x(\gamma)\right\vert +\sum_{\gamma \in \Gamma \setminus F}\left\vert y(\gamma)-x(\gamma)\right\vert < \frac{\varepsilon}{4} +\sum_{\gamma \in \Gamma \setminus F}(y(\gamma)+x(\gamma))\\ &= \frac{\varepsilon}{4}+\sum_{\gamma \in \Gamma \setminus F}(y(\gamma)-x(\gamma))+2\sum_{\gamma \in \Gamma \setminus F}x(\gamma)< \frac{\varepsilon}{4}+\frac{\varepsilon}{4}+2 .\frac{\varepsilon}{4}. \end{aligned}$$

This shows that the identity is weak-to-norm continuous at x. The proof is complete. ■

1.2.9 The Fréchet-Urysohn Property

Definition 1.47

Let Y be a subset of a topological (Hausdorff) space X.

(1)
Y is countably compact , if every sequence in Y has a cluster-point in Y .
(2)
Y is sequentially compact , if every sequence in Y has a convergent subsequence with limit in Y .
(3)
Y is relatively countably compact , if every sequence in Y has a cluster-point in X.
(4)
Y is relatively sequentially compact , if every sequence in Y has a convergent subsequence with limit in X.

It is easy to see that

(1)
Every (relatively) compact set is (relatively) countably compact.
(2)
Every (relatively) sequentially compact set is (relatively) countably compact.

Definition 1.48

A topological space (X, τ) is called Fréchet-Urysohn if the closures of subsets of X are described using sequences, i.e., if whenever Y ⊆ X and x ∈ X such that $x \in \overline {Y}$, there is a sequence (x_n)_n in Y with x_n → x.

Example 1.13

Metrizable spaces and one point compactifications of discrete spaces are Fréchet-Urysohn.

Definition 1.49

A completely regular Hausdorff topological space X is called a g-space , if its relatively countably compact subsets are relatively compact.

Definition 1.50

A Hausdorff topological space X is said to be angelic if for every relatively countably compact set Y ⊆ X, the following hold:

(i)
Y is relatively compact,
(ii)
for each $x\in \overline {Y},$ there exists a sequence (x_n)_n ⊆ Y such that x_n→x.

If K is a compact topological space then K is a Fréchet-Urysohn space if and only if it is angelic. It can be said that a Hausdorff topological space X is angelic if and only if X is a g-space for which any compact subspace is a Fréchet-Urysohn space.

The following are some characterizations of Fréchet-Urysohn spaces.

Theorem 1.38

For a topological vector space (X, τ) the following assertions are equivalent:

1.
X is Fréchet-Urysohn.
2.
For every subset Y of X such that $\theta \in \overline {Y}$ there exists a bounded subset Z of Y such that $\theta \in \overline {Z}$.
3.
For any sequence (Y_n)_n of subsets of X, each with $\theta \in \overline {Y_{n}}$ , there exists a sequence $Z_{n}\subseteq Y_{n}, n \in \mathbb {N}$ , such that ⋃_n Z_n is bounded and $\theta \in \overline {\bigcup _{n\leq k}Z_{k}}$ for each $n \in \mathbb {N}$.

Proof

Clearly 1. implies 2. Now assume 2. It is obvious that 3. holds if θ ∈ Y_n for infinitely many n. Therefore, we assume that $\theta \in \overline {Y_{n}}\setminus Y_{n}$, for each $n \in \mathbb {N}$. Consequently, there exists a null sequence (x_n)_n in $X\setminus \overline {\{\theta \}}$. For each $n \in \mathbb {N}$ there exists a closed neighbourhood U_n of zero such that θ∉U_n + x_n. Let each W_n = U_n ∩ Y_n. Clearly θ is in each $ \overline {W_{n}}\setminus W_{n}$ and not in the set

$$\displaystyle \begin{aligned} Y= \bigcup_{n}(W_{n}+x_{n}). \end{aligned}$$

However, $\theta \in \overline {Y}\colon $ For U, an open neighborhood of θ, there exist $k \in \mathbb {N}$ with x_k ∈ U and, V , a neighbourhood of θ with V + x_k ⊆ U. As there is y ∈ V ∩ W_k we also have y + x_k ∈ U ∩ Y . Thus $\theta \in \overline {Y} \setminus Y $. By hypothesis, there is Z ⊆ Y with Z bounded and $\theta \in \overline {Z}$. There exists subsets Z_n ⊆ W_n = U_n ∩ Y_n such that

$$\displaystyle \begin{aligned} Z= \bigcup_{n}(Z_{n}+x_{n}). \end{aligned}$$

By construction, θ does not belong to the closed sets

$$\displaystyle \begin{aligned} \bigcup_{k< n}(U_{k}+x_{k}). \end{aligned}$$

Therefore θ is not in any $\overline {\bigcup _{k< n}(Z_{k}+x_{k})}$. This and $\theta \in \overline {Z}$ imply that

$$\displaystyle \begin{aligned} \theta \in \overline{\bigcup_{n\leq k}(Z_{k}+x_{k})}, \end{aligned}$$

for each $n \in \mathbb {N}$. Let V ′ and V be any balanced neighborhoods of θ with V − V ⊆ V ′. Fix $n\in \mathbb {N}$. There exists m ≥ n, in $\mathbb {N}$, such that x_k ∈ V for all k ≥ m. From

$$\displaystyle \begin{aligned} \theta \in \overline{\bigcup_{m \geq k}(Z_{k}+x_{k})}, \end{aligned}$$

it follows that there exist k ≥ m and y ∈ B_k with y + x_k ∈ V . From y ∈ V − x_k ⊆ V − V ⊆ V ′, we see, for each $n \in \mathbb {N}$, the set V ′ meets ⋃_n≤k Z_k. As any neighborhood of θ contains V ′ and V as above, θ is in the closure of each ⋃_n≤k Z_k. Note also that ⋃_n Z_n is bounded. Indeed, as

$$\displaystyle \begin{aligned} Z= \bigcup_{n}(Z_{n}+x_{n}) \end{aligned}$$

and $W= \{x_{m} \colon m \in \mathbb {N}\}$ are bounded and since

$$\displaystyle \begin{aligned} \bigcup_{n}Z_{n}\subseteq \bigcup_{n}(Z_{n}+x_{n})- \{x_{m} \colon m \in \mathbb{N}\}= Z-W, \end{aligned}$$

then ⋃_n Z_n is also bounded too. We have proved that 2. implies 3.

3. implies 1.: Assume that $\theta \in \overline {Y}$, and set Y_n = nY, for each $n\in \mathbb {N}$. Since θ is in each $\overline {Y_{n}}$, there exist Z_n ⊆ Y_n, as in 3.. So each ⋃_n≤k Z_k is nonempty, and, consequently, there exists a strictly increasing sequence (n_k)_k in $\overline {N}$ with $Z_{n_{k}}$ nonempty. For each k, let $z_{k} \in Z_{n_{k}}$. There exists a sequence (y_k)_k in Y such that z_k = n_k y_k for each $k \in \mathbb {N}$. Since (n_k)_k is strictly increasing and (z_k)_k = (n_k y_k)_k is bounded, the sequence (y_k)_k in Y converges to zero in X. The proof is complete. ■

There are many nonmetrizable Fréchet-Urysohn spaces. To provide some examples, we have the following deep result of J. Bourgain, D. H. Fremlin and M. Talagrand [24]:

Theorem 1.39

Let X be a Polish space (i.e., a separable completely metrizable space). Denote by B₁(X) the space of all real-valued functions on X which are of the first Baire class and equip this space with the topology of pointwise convergence. Suppose that Y ⊆ B₁(X) is relatively countably compact in B₁(X) (i.e., each sequence in Y has a cluster point in B₁(X). Then the closure $\overline {Y}$ of Y in B₁(X) is compact and Fréchet-Urysohn.

A slightly weaker version is given in [101].

Corollary 1.7

Let X be a Polish space and Y be a set of real-valued continuous functions on X. Suppose that each sequence in Y has a pointwise convergent subsequence. Then the closure of Y in $\mathbb {R}^{p}$ is a Fréchet-Urysohn compact space contained in B₁(X).

Proof

Y is obviously contained in B₁(X). Moreover, let (f_n)_n be any sequence in Y . By the assumption there is a subsequence $(f_{n_{k}})_{k}$ pointwise converging to some function f. As the functions $f_{n_{k}}$ are continuous, the limit function f is of the first Baire class . Hence, it is a cluster point of (f_n)_n in B₁(X). So, Y is relatively countably compact in B₁(X). The assertion now follows from Theorem 1.39. ■

We continue by the following example [14].

Proposition 1.40

Let (X, τ) be a metrizable locally convex space and Y be a bounded subset of X. If Y is τ-separable and contains no l₁ -sequence , then the set

$$\displaystyle \begin{aligned} \overline{Y-Y}^{\sigma(X,X')}= \overline{\{x-y\colon x,y\in Y\}}^{\sigma(X,X')} \end{aligned}$$

is Fréchet-Urysohn when equipped with the weak topology.

Proof

As the closed linear span of Y is separable, we can without loss of generality suppose that X is separable. Let (∥.∥_n), U_n and $B_{n} \, (n\in \mathbb {N})$ be as in the proof of Theorem 1.37. Notice that B_n is a metrizable weak^∗compact subset of X′. Moreover, the linear span of the union of all $B_{n}^{\prime }s$ is the whole dual X′ (see the end of the proof of Theorem 1.37). Let now P be the topological sum of the spaces $(B_{n},\sigma (X',X)), n\in \mathbb {N}$. Then P is a Polish space. Denote by G: P → X′ the canonical mapping of P onto the union of all $B_{n}^{\prime }s$. Then G is continuous from P to (X′, σ(X′, X)). Define a mapping $H\colon P\to \mathbb {R}^{P}$ by the formula H(x)(p) = G(p)(x). Then H is a homeomorphism of (X, σ(X, X′)) onto H(X) equipped with the pointwise convergence topology. Moreover, the functions from H(X) are continuous on P.

Let Z = H(Y − Y ). We claim that each sequence from Z has a pointwise convergent subsequence. To show that it is enough to observe that each sequence in Y − Y has weakly Cauchy subsequence. Indeed, let (z_n)_n be a sequence in Y − Y . Then z_n = x_n − y_n for some x_n, y_n ∈ Y . As Y contains no l₁-sequence , by Theorem 1.37, we get a weakly subsequence $(x_{n_{k}})_{k}$ of (x_n)_n. Applying Theorem 1.37 once more we get a weakly Cauchy subsequence $(y_{n_{k}})_{k}$ of (y_n)_n. Then $(z_{n_{k}})_{k}$ is a weakly Cauchy subsequence of (z_n)_n. Thus Z is relatively countably compact in B₁(P), which is the space of all Baire-one functions on P equipped with the topology of pointwise convergence. By Theorem 1.39, the closure of Z in $ \mathbb {R}^{P}$ is a Fréchet-Urysohn compact subset of B₁(P). In particular, the weak closure of Y − Y is Fréchet-Urysohn when equipped with the weak topology. The proof is complete. ■

Note that the result of the above proposition generalizes the following in the context of Banach spaces [101].

Proposition 1.41

Let X be a Banach space and Y be a bounded subset of X. If X is norm-separable and contains no l ₁ -sequence , then the set

$$\displaystyle \begin{aligned} \overline{Y-Y}^{\sigma(X',X)}= \overline{\{J(x-y)\colon x,y\in Y\}}^{\sigma(X',X)} \end{aligned}$$

is Fréchet-Urysohn when equipped with the weak ^∗ topology, where J denotes the canonical embedding of X into X″. In particular,

$$\displaystyle \begin{aligned} \overline{Y-Y}^{\sigma(X,X')}= \overline{\{x-y\colon x,y\in Y\}}^{\sigma(X,X')} \end{aligned}$$

is Fréchet-Urysohn when equipped with the weak topology.

We have the following characterization of the Fréchet-Urysohn property in locally convex spaces [14].

Proposition 1.42

Let (X, τ) be a Hausdorff locally convex space such that there is a metrizable locally convex topology on X compatible with the duality. The following assertion are equivalent.

(i)
Any bounded subset of X is Fréchet-Urysohn in the weak topology.
(ii)
Any bounded sequence in X has a weakly Cauchy subsequence.

If, moreover, τ itself is metrizable, then these assertions are equivalent to the following one:
(iii)
X contains no l₁ -sequence .

Proof

Let ρ be a metrizable locally convex topology compatible with the duality . By Theorem 1.37 (X, ρ) contains no l₁-sequence if and only if (X, ρ) satisfies the condition (ii). Further, the validity of (ii) for (X, ρ) is equivalent to its validity for (X, τ). It follows that (ii) holds if and only if (X, ρ) contains no l₁-sequence . In particular, if ρ = τ, we get (i) ⇔ (ii).

(ii) ⇒ (i) Suppose that (ii) holds. Let Y be a bounded subset of (X, τ) and let x ∈ X ∈ belong to the weak closure of Y . We need to find a sequence in Y converging to x. We first prove it under the additional assumption that Y is separable. Then Y is bounded and separable in (X, ρ) as well. As (X, ρ) contains no l₁-sequence , by Proposition 1.40 we get that the weak closure of Y -Y is Fréchet-Urysohn in the weak topology. Hence, in particular, there is a sequence in Y weakly converging to x.

To prove the general case it is enough to show that there is a countable set Z ⊆ Y such that x belongs to the weak closure of Z. In other words, it is enough to show that the weak topology on X has countable tightness. To prove that observe that (X, σ(X, X′)) is canonically homeomorphic to a subspace of C_p(X′, σ(X′, X)), which is the space of all continuous functions on the space (X′, σ(X′, X)) equipped with the topology of pointwise convergence. Further notice that (X′, σ(X′, X)) is σ-compact, this follows by the metrizability of ρ as $X'= \bigcup _{m, n \in \mathbb {N}}mB_{n}$ using the notation from the proof of Theorem 1.37. Finally, as any finite power of a σ-compact and hence Lindelöf, we can conclude by the Arkhangel’skii-Pytkeev theorem [7].

(i) ⇒ (ii) Suppose that (ii) does not hold. Then there is a sequence (x_n)_n in X which is an l₁-sequence in (X, ρ). Let $T_{0}\colon l_{1}^{0}\to X$ be defined as in (1.17). Let S denote the unit sphere in $l_{1}^{0}$. Then θ is in the weak closure of S (as $l_{1}^{0}$ is an infinite dimensional normed space) but it is not the weak limit of any sequence from S (by Schur’s theorem [75]). Thus, θ is in the weak closure of T₀(S) without being the weak limit of any sequence from T₀(S). Thus T₀(S) ∪{θ} is a bounded set which is not Fréchet-Urysohn in the weak topology. ■

The following characterization of Banach spaces not containing l₁ is given in [101].

Theorem 1.40

Let X be a Banach space. Then the following assertions are equivalent.

1.
X contains no isomorphic copy of l₁.
2.
Each bounded separable subset of X is Fréchet-Urysohn in the weak topology.
3.
For each separable subset Y ⊆ X there are relatively weakly closed subsets $Y_{n}, n\in \mathbb {N}$ such that $Y= \bigcup _{n \in \mathbb {N}}Y_{n}$ and each Y_n is Fréchet-Urysohn in the weak topology.

Proof

The implication 1. ⇒ 2. follows from Proposition 1.40.

The implication 2. ⇒ 1. follows from the fact that the unit ball of l₁ is not Fréchet-Urysohn (as θ is in the weak closure of the sphere and the sphere is weakly sequentially closed by the Schur theorem [75]).

The implication 2. ⇒ 3. is trivial if we use the fact that a closed ball is weakly closed.

Let us prove 3. ⇒ 2. To show 2. it is enough to prove that the unit ball of any closed separable subspace of X is Fréchet-Urysohn in the weak topology. Let Z be such a subspace. Let $Y_{n}, n\in \mathbb {N}$ be the cover of Z provided by 3. As each Y_n is weakly closed, it is also norm-closed. By the Baire category theorem some Y_n has a nonempty interior in Y , so it contains a ball. We get that some ball in Y is Fréchet-Urysohn, so the unit ball has this property as well. ■

Remark 1.29

Note that the assertion 3. is a topological property of the space (X, σ(X, X′)) (as norm separability coincides with weak separability).

As a consequence of Proposition 1.42 we get the following improvement of Theorem 1.40.

Corollary 1.8

Let X be a Banach space. The following assertions are equivalent.

1.
X contains no isomorphic copy of l₁.
2.
The closed unit ball of X is Fréchet-Urysohn in the weak topology.
3.
There is a sequence (Y_n)_n≥1 of weakly closed sets which are Fréchet-Urysohn in the weak topology such that $X= \bigcup _{n=1}^{\infty }Y_{n}$.

Proof

The equivalence 1. ⇔ 2. follows from Proposition 1.42. The implication 2. ⇒ 3. is trivial. The implication 3. ⇒ 1. follows from Theorem 1.40 (or, alternatively, 3. ⇒ 2.) follows from the Baire category theorem as in Theorem 1.40. ■

Definition 1.51

A Banach space (X, ∥.∥) is Asplund if and only if Y ′ is separable for each separable subspace Y ⊆ X.

Remark 1.30

A Banach space X is an Asplund space if each convex continuous function $T\colon X\to \mathbb {R}$ is Fréchet differentiable on a dense G_δ set in X. Also it is known that a Banach space X is Asplund if and only if X′ has the RNP [25].

It is worthwhile to remark that there are separable Banach spaces having no copy of l₁ for which X′ is nonseparable [93, 127]. On the other hand, the well-known James’s space J is an example of a nonreflexive Banach space without an unconditional basis which does not contain any copy of l₁ and yet has separable dual.

Remark 1.31

Let us remark that the implication (ii) ⇒ (i) of Proposition 1.42 does not hold for general locally convex spaces. Indeed, there are Banach spaces X such that the closed unit ball of X′ is weak^∗ sequentially compact, but it is not Fréchet-Urysohn in the weak^∗ topology. In particular, the dual closed unit ball is weak^∗ sequentially compact whenever X is Asplund [55], in particular if X = C(K) with K scattered [55]. On the other hand, K is canonically homeomorphic to a subset of the closed unit ball of C(K)′ equipped with the weak^∗ topology, so it is enough to observe that there are scattered compact spaces which are not Fréchet-Urysohn. As a concrete example we can take K = [0, w₁], the ordinal interval equipped with the order topology (w₁ is the first uncountable ordinal).

It is worth to compare Theorem 1.40 with a similar characterization of Asplund spaces [101].

Theorem 1.41

Let X be a Banach space. Then the following assertions are equivalent.

1.
X is Asplund .
2.
Each bounded separable subset of X is metrizable in the weak topology.
3.
For each separable subset Y ⊆ X there are relatively weakly closed subsets $Y_{n}, n\in \mathbb {N}$ , of Y such that $Y= \bigcup _{n \in \mathbb {N}}Y_{n}$ and each Y_n is metrizable in the weak topology.

Proof

The equivalence of 1. and 2. follows from the well-known fact that the unit ball of Y is metrizable in the weak topology if and only if Y ′ is separable. The equivalence of 2. and 3. can be proved similarly as corresponding equivalence in the previous theorem. ■

Remark 1.32

There is no analogue of Theorem 1.40 for convex sets. Indeed, let X = l₁ and let C be the closed convex hull of the standard basis. Then C contains an l₁-sequence but is Fréchet-Urysohn in the weak topology. In fact, it is even metrizable as it is easy to see that on the positive cone of l₁ the weak and norm topologies coincide.

1.3 Ultrametric Spaces

The origin of ultrametric spaces lies in valuation theory and dates back to Krasner and Monna who developed this theory for ultrametric distances with real values (non-Archimedean analysis). A systematic study of (general) ultrametric spaces was provided [16, 81, 84, 113, 120, 152, 154, 155, 157, 160, 169] and others. This study is concerned with ultrametric whose distance functions take their values in an arbitrary partially ordered set (with a smallest element 0) not just in the real numbers.

Definition 1.52

Let ( Γ, ≤) be an ordered set with smallest element 0. Let X be a nonempty set. A mapping d: X × X→ Γ is called an ultrametric distance and (X, d, Γ) an ultrametric space if d has the following properties for all x, y, z ∈ X and γ ∈ Γ:

(d1):: d(x, y) = 0 if and only if x = y,
(d2):: d(x, y) = d(y, x),
(d3):: if d(x, y) ≤ γ and d(y, z) ≤ γ, then d(x, z) ≤ γ.

If there is no ambiguity, we simply write X instead of (X, d, Γ).

If Γ is totally ordered , (d3) becomes

(d3′) $d(x,z)\leq \max \{d(x,y),d(y,z)\}$ for all x, y, z ∈ X.

Remark 1.33

The ultrametric space (X, d, Γ) is trivial, if there exists γ ∈ Γ such that for all x, y ∈ X, x ≠ y, d(x, y) = γ.

Definition 1.53

Let (Y, d_∣Y, Γ_Y) and (X, d, Γ) be ultrametric spaces such that Y ⊂ X and Γ_Y ⊂ Γ. Assume that Γ_Y has the induced order of Γ and the same 0 as Γ and that furthermore, d_∣Y(Y × Y ) ⊂ Γ_Y and d_∣Y(y, y′) = d(y, y′) for all y, y′∈ Y . Then (Y, d_∣Y, Γ_Y) is said to be a subspace of (X, d, Γ) and X is called an extension of Y . Often we simply write d instead of d_∣Y.

Definition 1.54

Let (X, d, Γ) be an ultrametric space. The space X is said to be solid if for every γ ∈ Γ and x ∈ X there exists y ∈ X such that d(x, y) = γ. If X is solid, then d(X × X) = Γ.

Definition 1.55

Let (X, d, Γ) be an ultrametric space. Let γ ∈ Γ^• = Γ ∖{0} and a ∈ X. The set B_γ(a) = {x ∈ X ∣ d(a, x) ≤ γ} is called a ball. The element a is said to be a center of B_γ(a) and the element γ to be a radius of B_γ(a). If x, y ∈ X, x ≠ y, then B(x, y) = B_d(x,y)(x) is called a principal ball .

Remark 1.34

Let (X, d, Γ) be an ultrametric space. If X is solid, every ball is principal. If Γ is totally ordered, also the converse conclusion holds.

Definition 1.56

Let (X, d, Γ) be an ultrametric space. A nonempty Y of X is said to be convex in X when for all y₁, y₂ ∈ Y with y₁ ≠ y₂ the principal ball B(y₁, y₁) ⊆ Y .

Remark 1.35

Every principal ball is convex in X and furthermore, if ⋂_{i ∈ I} B(x_i, y_i) ≠ ∅ then ⋂_{i ∈ I} B(x_i, y_i) is convex in X.

In the following lemma, we list some properties of balls which can easily be verified [161].

Lemma 1.10

Let (X, d, Γ) be an ultrametric space and let γ, δ ∈ Γ ^•.

1.
Let x, y ∈ X.
1. (a)
  If γ ≤ δ and B_γ(x) ∩ B_γ(y) ≠ ∅, then B_γ(x) ⊆ B_δ(y),
2. (b)
  if B_δ(y) ⊂ B_γ(x), then $\gamma \nleq \delta $.
2.
Concerning principal balls, if x, y, z, u ∈ X, x ≠ z and y ≠ u, then
1. (a)
  B(x, z) ⊆ B_δ(y) if and only if d(x, z) ≤ δ and x ∈ B_δ(y),
2. (b)
  if B(x, z) ⊂ B_δ(y), then d(x, z) < δ,
3. (c)
  if B(x, z) = B(y, u), then d(x, z) = d(y, u).
3.
Let X be solid and x, y ∈ X.
1. (a)
  B_γ(x) ⊆ B_δ(y) if and only if γ ≤ δ and x ∈ B_δ(y),
2. (b)
  if B_γ(x) ⊂ B_δ(y), then γ < δ.
3. (c)
  if B_γ(x) = B_δ(y), then γ = δ
4.
If Γ is totally ordered and B_γ(x) ⊂ B_δ(y), then δ < γ.

Definition 1.57

Let (X, d, Γ) be an ultrametric space. A set of balls which is totally ordered by inclusion is said to be a chain .

Lemma 1.11

Let (X, d, Γ) be an ultrametric space. Let $\mathcal {C}$ be a chain of balls of X which does not have a smallest ball. Then there exists a limit ordinal λ and a strictly decreasing family of balls (B_i)_i<λ such that each $B_{i} \in \mathcal {C}$ and for every ball $C \in \mathcal {C}$ there exists B_i such that B_i ⊇ C and hence $\bigcap \mathcal {C}= \bigcap _{i< \lambda }B_i$.

Definition 1.58

Let (X, d, Γ) be an ultrametric space. X is called spherically complete (resp., principally complete ) if every chain of balls of X (resp., principal balls of X) has a nonempty intersection.

Remark 1.36

Every spherically complete ultrametric space (X, d, Γ) is principally complete. The converse is true when Γ is totally ordered or the space is solid.

Definition 1.59

An ultrametric space (X, d, Γ) is said to be complete if every chain of balls $\{B_{\gamma _{i}}\,\mid i \in I\},$ with $\inf \{\gamma _{i}\,\mid i \in I\}= 0$, has a nonempty intersection.

Remark 1.37

A spherically complete ultrametric space (X, d, Γ) is complete. If Γ is totally ordered and if Γ^• does not have a smallest element, the ultrametric distance induces on X a uniformity, hence also a topology. In this case, the concept of completeness coincides with that given by the uniformity.

Several examples of different types of ultrametric spaces are discussed in [160]. Some where Γ is totally ordered and others where Γ is not totally ordered.

Examples 1.9

1.
Let Δ be a totally ordered Abelian additive group, let ∞ be a symbol such that ∞∉ Δ, and δ + ∞ = ∞ + δ = ∞, ∞ + ∞ = ∞, δ < ∞ for all δ ∈ Δ. We denote by 0 the neutral element of Δ, that is 0 + δ = δ for every δ ∈ Δ. Let K be a commutative field, let v: K→ Δ ∪{∞} be a valuation of K, so we have
1. (v1)
  v(x) = ∞ if and only if x = 0,
2. (v2)
  v(xy) = v(x) + v(y),
3. (v3)
  $v(x+y)\geq \min \{v(x),v(y)\}$.
Let Γ^• be a totally ordered Abelian multiplicative group with neutral element 1, let 0 be a symbol such that 0∉ Γ^•, 0γ = γ0 = 0, 0.0 = 0, 0 < γ for every γ ∈ Γ^•. Let θ: Δ ∪{∞}→ Γ = Γ^• ∪{0} be an order reversing bijection such that θ(∞) = 0, θ(δ + δ′) = θ(δ).θ(δ′), so θ(0) = 1.

Let d: K × K→ Γ be defined by d(x, y) = θ(v(x − y)), then (K, d, Γ) is an ultrametric space which is said to be associated to the valued field (K, v, Δ ∪{∞}).
2.
Let Γ be a totally ordered set with smallest element 0, let Γ^• = Γ∖{0}. Let R be a nonempty set with a distinguished element 0. For each f : Γ^•→R, let supp(f) = {γ ∈ Γ^• ∣ f(γ) ≠ 0} be the support of f. Let R[[ Γ]] be the set of all f : Γ^•→R with support which is empty or anti-well ordered. Let d: R[[ Γ]] × R[[ Γ]]→ Γ be defined by d(f, f) = 0 and if f ≠ g, d(f, g) is the largest element of the set {γ ∈ Γ^• ∣ f(γ) ≠ g(γ)}. Then (R[[ Γ]], d, Γ) is an ultrametric space which is solid and spherically complete.
3.
Let I be a set with at least two elements, let (X_i)_{i ∈ I} be a family of sets X_i, each one having at least two elements. Let X =∏_{i ∈ I} X_i. Let $\mathcal {P}(I)$ be the set of all subsets of I, ordered by inclusion. And let $d\colon X\times X\longrightarrow \mathcal {P}(I)$ be defined by d(f, g) = {i ∈ I ∣ f_i ≠ g_i}, where f = (f_i)_{i ∈ I} and g = (g_i)_{i ∈ I}. Then $(X,d,\mathcal {P}(I))$ is a solid and spherically complete ultrametric space. If each X_i = {0, 1}, we obtain the ultrametric space $(\mathcal {P}(I),d,\mathcal {P}(I))$ with d(A, B) = (A ∪ B)∖(A ∩ B) for all A, B ⊆ I.
4.
Let X be a topological space, let Y be a discrete topological space, let $\mathcal {C}(X,Y)$ denote the set of continuous functions from X to Y and let $\mathcal {C}l(X)$ the set of clopen (i.e., closed and open) subsets of X. The mapping $d\colon \mathcal {C}(X,Y)\times \mathcal {C}(X,Y)\longrightarrow \mathcal {C}l(X)$ is defined by d(f, g) = {x ∈ X ∣ f(x) ≠ g(x)}. Then $(\mathcal {C}(X,Y),d,\mathcal {C}l(X))$ is a solid ultrametric space, and it is spherically complete if $\mathcal {C}l(X)$ is a complete sub-Boolean-algebra of $\mathcal {P}(X)$.

Definition 1.60

Let (X, d, Γ) be an ultrametric space and assume that Γ is totally ordered. Let (Y, d_∣Y, Γ_Y) be a subspace of (X, d, Γ) and assume that d(Y × Y ) = d(X × X) = Γ. If for every x ∈ X and for every y ∈ Y , with x ≠ y, there exists y′∈ Y such that d(y′, x) < d(y, x), the extension Y ≺ X is called immediate and we write Y im ≺ X. The extension Y ≺ X is said to be dense (denoted by Y de ≺ X), if for every x ∈ X and for every 0 < γ ∈ Γ there exists y in Y such that d(y, x) < γ. Thus if Y de ≺ X then also Y im ≺ X.

Remark 1.38

If Γ^• does not have a smallest element, Definition 1.60 coincides with that given by the topology of X. We remark that both notions, “immediate” and “dense” can be defined more generally for ultrametric spaces, where Γ is only ordered [155].

The following is given in [161].

Theorem 1.42

1.
Every ultrametric space (X, d, Γ), with Γ totally ordered, has an immediate extension which is spherically complete. (We call such an extension a spherical completion of X.)
2.
Every ultrametric space (X, d, Γ), with Γ totally ordered, has an extension (X′, d, Γ) such that X′ is dense in X′. (We call such an extension a completion of X.)
3.
Let (Y, d_∣Y, Γ_Y) be a subspace of Let (X, d, Γ). Assume that Γ is totally ordered and that $\Gamma ^{\bullet }_{\mid Y}$ is coinitial in Γ ^• and that furthermore d(Y × Y ) = Γ_Y, d(X × X) = Γ. If X is complete, then there exists one and only one completion $\widehat {Y}$ of Y which is a subspace of X.

Proof

The proofs of 1. and 2. are given in [155, 176].

3. Let $\mathcal {S}$ be the set of all ultrametric subspaces $\mathcal {S}$ such that Y is dense in $\mathcal {S}$. Since Y is dense in itself, . The set $\mathcal {S}$ is ordered by inclusion. Let {S_i ∣ i ∈ I} be a totally ordered subset of $\mathcal {S}$. Then S =⋃_{i ∈ I} S_i is a subspace of X and Y is dense in S. Thus $S \in \mathcal {S}$ is an upper bound for all S_i, i ∈ I. By Zorn’s lemma, there exists a maximal element in $\mathcal {S}$ which we denote again by S. We show that S is complete. Since $\Gamma ^{\bullet }_{\mid Y}$ is coinitial in Γ^• and $\Gamma ^{\bullet }_{\mid Y}=\Gamma ^{\bullet }_{\mid S}= d(S\times S)\setminus \{0\}$ has in $\Gamma ^{\bullet }_{\mid S}$ the infimum 0 if and only if the infimum of Δ in Γ^• is 0, thus we may just write $\inf \Delta = 0$. We assume that S is not complete. Then there exists a chain $\{B_{\gamma _{i}}^{S}(a_i)\,\mid i \in I\}$ of balls in S with

$$\displaystyle \begin{aligned} \inf\{\gamma_{i}\,\mid i \in I\}= 0 \,\mbox{ and }\,\bigcap B_{\gamma_{i}}^{S}(a_i)= \emptyset. \end{aligned}$$

Since X is complete and for each $i \in I, B_{\gamma _{i}}^{S}(a_i)= S\cap B_{\gamma _{i}}^{S}(a_i)$, where $B_{\gamma _{i}}(a_i)$ denotes the ball with center a_i and radius γ_i in X, there exists z ∈ X such that $\{z\}= \bigcap B_{\gamma _{i}}(a_i)$. Let S′ = S ∪{z}. Then S′ is a subspace of X which properly contains S, so also Y . To prove that Y is dense in S′, it suffices to show that if 0 < γ ∈ Γ, there exists y ∈ Y such that d(y, z) < γ. Since $\inf \{\gamma _{i}\,\mid i \in I\}= 0$ there exists γ_i with 0 < γ_i < γ. Since Y is dense in S and a_i ∈ S, it follows that there exists y ∈ Y such that d(y, a_i) < γ_i. Since, moreover, $z \in B_{\gamma _{i}}(a_i)$, then $d(z,y)\leq \max \{d(z,a_i),d(y,a_i)\}\leq \gamma _{i}< \gamma $. Thus Y is dense in S′. So $S' \in \mathcal {S}$, which contradicts the maximality of S in $\mathcal {S}$. We have proved that S is complete, hence a completion of Y in X. It remains to show that Y has at most one completion in X. Assume that $\widehat {Y_1}, \widehat {Y_2}$ are completions of Y in X. Let $\widehat {y_1} \in \widehat {Y_1}$. For each γ ∈ Γ^• there exists y_γ ∈ Y such that $d(\widehat {y_1},y_{\gamma })< \gamma $. If Γ^• has a smallest element, say γ ^∗ then

$$\displaystyle \begin{aligned} \widehat{y_1}= y_{\gamma^{*}} \in Y \subset \widehat{Y_2}. \end{aligned}$$

If Γ^• does not have a smallest element, then $\inf \{\gamma \,\mid \gamma \in \Gamma ^{\bullet }\}= 0$, thus there exists $\widehat {y_2} \in \widehat {Y_2}$ with

$$\displaystyle \begin{aligned} \{\widehat{y_2}\}= \bigcap_{\gamma \in \Gamma^{\bullet}}B_{\gamma}(y_{\gamma}) \end{aligned}$$

because $\widehat {Y_2}$ is complete. Hence $\widehat {y_1}= \widehat {y_2} \in \widehat {Y_2}$. This shows that $\widehat {Y_1}\subseteq \widehat {Y_2}$. By the same argumentation, we conclude that $\widehat {Y_2}\subseteq \widehat {Y_1}$, thus $\widehat {Y_1}= \widehat {Y_2}$. ■

Definition 1.61

Let (X, d, Γ) be an ultrametric space and assume that Γ is totally ordered. Let λ be a limit ordinal and let ξ = (x_i)_i<λ be a family of elements of X. We say that ξ is a Cauchy family if for every γ ∈ Γ^• there exists i₀ = i₀(γ, ξ) < λ such that if i₀ ≤ i < κ < λ, then d(x_i, x_k) < γ. The family ξ = (x_i)_i<λ is said to be pseudo-convergent if there exists i₀ = i₀(ξ) < λ such that if i₀ ≤ i < κ < μ < λ, then d(x_κ, x_μ) < d(x_i, x_κ).

Remark 1.39

We note that if ξ = (x_i)_i<λ is pseudo-convergent , the elements x_i, for i₀(ξ) ≤ i < λ are all distincts and if i₀(ξ) ≤ i < κ < μ < λ, then d(x_i, x_κ) = d(x_κ, x_μ), this element is denoted by ξ_i. Hence if i₀ ≤ i < κ < λ, then ξ_i > ξ_κ.

Definition 1.62

Let (X, d, Γ) be an ultrametric space and assume that Γ is totally ordered. Let λ be a limit ordinal and let ξ = (x_i)_i<λ be a Cauchy family of elements of X. The element y ∈ X is a limit of the family ξ if for every γ ∈ Γ^• there exists i₁ = i₁(γ) < λ such that if i₁ ≤ i < λ, then d(y, x_i) < γ. The ultrametric space X is complete if and only if every Cauchy family has a limit in X.

Remark 1.40

A Cauchy family ξ = (x_i)_i<λ has at most one limit. Indeed, if y, z are limits, then d(y, z) < γ for all γ ∈ Γ^•, so y = z.

Definition 1.63

Let (X, d, Γ) be an ultrametric space and assume that Γ is totally ordered. Let λ be a limit ordinal and let ξ = (x_i)_i<λ be a pseudo-convergent family of elements of X. The element y ∈ X is a pseudo-limit of the family ξ = (x_i)_i<λ if there exists i₁ = i₁(ξ, y), i₀(ξ) ≤ i₁ < λ, such that if i₁ ≤ i < λ then d(y, x_i) ≤ ξ_i. If y is a pseudo-limit of ξ, then z ∈ X is a pseudo-limit of ξ if and only if d(y, z) < ξ_i for all i such that i₁ ≤ i < λ.

The following is a characterization of spherical completeness [151].

Proposition 1.43

Let (X, d, Γ) be an ultrametric space and assume that Γ is totally ordered. Then X is spherically complete if and only if every pseudo-convergent family of X has a pseudo-limit in X.

1.4 Admissible Functions

Throughout this subsection, we denote by (X, τ) a topological vector space, and by Y a nonempty subset of X.

Below the definition of functions providing the possibility of working with extended real seminorms in topological vector spaces.

Definition 1.64

An admissible function for Y on X is an extended real-valued function ρ: X→[0, ∞] such that

1.
The mapping (x, y)↦ρ(x, y) is continuous on Y × Y ,
2.
ρ(x + y) ≤ ρ(x) + ρ(y) for all x, y ∈ X,
3.
$\rho (\lambda x)= \left \vert \lambda \right \vert \rho (x)$, for all $\lambda \in \mathbb {R}$ and x ∈ X,
4.
If x, y ∈ Y and ρ(x − y) = 0, then x = y.

Remark 1.41

Notice that if ρ is an admissible function for Y on X, then it defines a metric on Y whose induced topology is coarser than τ.

Remark 1.42

It is instructive to compare the notion of continuity in the sense of 1. with the usual one. It is easy to see that if ρ is continuous on X, then (x, y)↦ρ(x, y) is continuous on Y × Y . Furthermore, if 1. − 3. hold then ρ is continuous on Y .

It is not true, in general, that if ρ is continuous on Y , then it satisfies 1. For example, if $X= \mathbb {R}$ and Y = [0, ∞), then the mapping $\rho \colon \mathbb {R} \longrightarrow [0,\infty ]$ defined by

$$\displaystyle \begin{aligned} \rho(x)= \left \lbrace \begin{array}{ccc} \frac{1}{x}, & \text{if} & x> 0, \\ \infty,& \text{if} & x= 0, \\ 0, & \text{if} & x< 0, \end{array} \right. \end{aligned}$$

is continuous on Y . However, the mapping T : Y × Y →[0, ∞] given by T(x, y) = ρ(x − y) is not continuous at the point (1, 1). Indeed, it suffices to see that $(1-\frac {1}{k}, 1)$ converges to (1, 1) in Y × Y , while that $T(1-\frac {1}{k}, 1)= 0$ and T(1, 1) = ∞.

Barroso [12] proved that the class of admissible functions is sufficiently good to imply that the Schauder-projection operator is continuous.

Proposition 1.44

Let ρ be an admissible function for Y on X. Then for any ε > 0 and p ∈ Y, the function g: Y →[0, ∞) given by

$$\displaystyle \begin{aligned} g(x)= \max\{\varepsilon-\rho(x-p),0\} \end{aligned}$$

is continuous on Y .

Proof

Firstly, let us recall that the effective domain of ρ is the set

$$\displaystyle \begin{aligned} \mathtt{D}(\rho)= \{x \in X \colon \rho(x)< \infty\}. \end{aligned}$$

Let x₀ be a point in Y and δ > 0 be arbitrary. By assumption, there exists a neighborhood U × V of (x₀, p) in Y × Y such that

$$\displaystyle \begin{aligned} \rho(x_{0}-p)-\delta \leq \rho(x-z)\leq \rho(x_{0}-p)+\delta, \end{aligned}$$

for all (x, z) ∈ U × V . If x₀ − p∉D(ρ) then ρ(x₀ − p) = ∞ and, hence, ρ(x − p) = ∞ for all x ∈ U. In consequence, g(x) = g(x₀) = 0 for all x ∈ U. In case x₀ − p ∈D(ρ), we can conclude that x − p ∈D(ρ) for all x ∈ U. In this case, it is easy to see that g(x₀) + δ ≥ g(x), for all x ∈ U. On the other hand, if g(x₀) = 0, then clearly g(x) ≥ g(x₀) − δ holds for every x ∈ U. Assuming now that g(x₀) = ε − ρ(x₀ − p), we have g(x₀) − δ ≤ ε − ρ(x − p) ≤ g(x), for all x ∈ U. In any case, we have proven that g is continuous at x₀, and hence continuous in Y . The proof is complete. ■

The following is an example of an admissible function [12].

Proposition 1.45

Let Y be a compact convex subset of a topological vector space (X, τ) and $\mathcal {F}= \{\rho _{n}\colon n \in \mathbb {N}\}$ a countable family of seminorms on X which separate points of Y − Y and such that the topology Γ generated by $\mathcal {F}$ is coarser than τ in Y . Then the function ρ: X → [0, ∞] defined as

$$\displaystyle \begin{aligned} \rho(x)= \sum_{n=1}^{\infty}\rho_{n}(x), \,\, x \in X \end{aligned}$$

is admissible.

Proof

Since Y is compact and Γ is coarser than τ, each ρ_n restricted to Y is τ-continuous. Thus we have $\max \{\rho _{n}(x)\colon x \in Y < \infty \}$ for all $n \in \mathbb {N}$. By replacing the seminorms ρ_n by suitable positive multiples, if necessary, we may assume that

$$\displaystyle \begin{aligned} \max\{\rho_{n}(x)\colon x \in Y_{n} \} \leq 2^{-n-1}, \end{aligned} $$

(1.18)

for all $n \in \mathbb {N}$. Notice that ρ(x − y) < ∞ for all x, y ∈ Y . Moreover, one readily checks 2. − 4.. Using now (1.18), we see that the sequence of functions $\rho ^{n}(x-y)= \sum _{i=1}^{n}\rho _{i}(x-y)$ is Cauchy w.r.t. the topology of uniform convergence on Y × Y . Thus ρ ⁿ(x − y) converges uniformly on Y × Y to ρ(x − y). Furthermore, to verify that 1 − holds, we have only to ensure this for each ρ_n. Let (x_α, y_α) be a net in Y × Y converging to (x, y). Since τ is finer than Γ on Y , both ρ_n(x_α − x) and ρ_n(y_α − y) converge to 0. We may then apply the triangle inequality to conclude $\left \vert \rho _{n}(x_{\alpha },y_{\alpha })-\rho _{n}(x-y)\right \vert \to 0$. ■

1.5 Some Fixed Point Theorems

Banach’s Contraction Mapping Principle is remarkable in its simplicity, yet it is perhaps the most widely applied fixed point theorem in all of analysis. The principle first appeared in explicit form in Banach’s thesis [11].

Theorem 1.43 (Banach’s Contraction Mapping Principle)

Let (X, d) be a complete metric space and let T : X→X. If there exists an 0 < k < 1 such that d(T(x), T(y)) ≤ kd(x, y) for all x, y ∈ X, then T has a unique fixed point.

Proof

First we consider the case when:

$$\displaystyle \begin{aligned} \mathrm{diam}(X):= \sup\{d(x,y)\colon x,y\in X\}< \infty. \end{aligned}$$

For each $n\in \mathbb {N}$, let Y_n = T ⁿ(X). Then

$$\displaystyle \begin{aligned} Y_{n+1}= T^{n+1}(X)= T^{n}(T(X))\subseteq T^{n}(X)= Y_{n} \end{aligned}$$

for all $n\in \mathbb {N}$. Therefore, $\{Y_{n} \colon n\in \mathbb {N}\}$ is a decreasing sequence of nonempty subsets of X. Next, notice that

$$\displaystyle \begin{aligned} 0 \leq \mathrm{diam}(Y_{n+ 1})\leq k\mathrm{diam}(Y_{n})~~~ \mbox{ for all } n\in \mathbb{N} \end{aligned}$$

and so, by induction,

$$\displaystyle \begin{aligned} 0\leq \mathrm{diam}(Y_{n+ 1})\leq k^{n}\mathrm{diam}(Y_{n})~~~ \mbox{ for all } n\in \mathbb{N}. \end{aligned}$$

Therefore, $\lim _{n\to \infty }\mathrm {diam}(\overline {Y_{n}})= \lim _{n\to \infty }\mathrm {diam}(Y_{n})= 0$. It then follows from Cantor’s intersection property that

$$\displaystyle \begin{aligned} \bigcap_{n\in \mathbb{N}}\overline{Y_{n}}= \{x\}~~~ \mbox{ for some } x\in X. \end{aligned}$$

Moreover, since $x\in \overline {Y_{n}},$

$$\displaystyle \begin{aligned} T(x)\in T(\overline{Y_{n}})\subseteq \overline{T(Y_{n})}= \overline{Y_{n+1}}\subseteq \overline{Y_{n}}, \end{aligned}$$

$T(x)\in \bigcap _{n\in \mathbb {N}}\overline {Y_{n}}= \{x\}$. That is, T(x) = x.

In the case when diam(X) = ∞ some extra work is required. In this case we choose any x₀ ∈ X and let

$$\displaystyle \begin{aligned} Z:= \overline{\{T^{n}(x_{0}) \colon n\in \mathbb{N}\}}. \end{aligned}$$

Then T(Z) ⊆ Z and

$$\displaystyle \begin{aligned} \mathrm{diam}(Z)\leq \frac{d(T(x_{0}),x_{0})}{1-k}< \infty. \end{aligned}$$

Hence from the previous argument there exists a point x ∈ Z ⊆ X such that T(x) = x. ■

The Caristi fixed point theorem [35] is known as one of the very interesting and useful generalizations of the Banach’s Contraction Mapping Principle for self-mappings on a complete metric space. Neither continuity nor a Lipschitz condition is required.

Theorem 1.44 (Caristi’s Fixed Point Theorem)

Let (X, d) be a complete metric space and let T : X→X be a mapping such that

$$\displaystyle \begin{aligned} d(x,T(x))\leq \varphi(x)- \varphi(T(x)) \end{aligned}$$

for all x ∈ X, where φ: X→[0, +∞) is a lower semicontinuous mapping. Then T has at least a fixed point.

The setting of generalized ultrametric spaces offers a highly flexible framework in which to study the fixed point theory is necessary for logic programming semantics [59, 85, 104, 113, 151, 153, 156, 157] and [177].

Definition 1.65

Let (X, d, Γ) be an ultrametric space. A mapping φ: X→X is said to be strictly contracting if for all x, x′∈ X, with x ≠ x′, d(φ(x), φ(x′)) < d(x, x′). An element z ∈ X with φ(z) = z is called a fixed point of φ.

For strictly contracting maps on ultrametric spaces we have the following fixed point theorem [151, 153, 160].

Theorem 1.45

Assume that (X, d, Γ) is a spherically complete ultrametric space and that φ: X→X is strictly contracting. Then φ has exactly one fixed point z ∈ X.

Proof

Assume, π_x = d(x, φ(x)) ≠ 0 for every x ∈ X. Let $B_{x}= B_{\pi _{x}}$. The set $\mathcal {B}= \{B_{x}\,\mid x \in X\}$ is ordered by inclusion. Let $\mathfrak {C}$ be a maximal chain in $\mathcal {B}$. Since X is spherically complete, there exists an element $z \in \bigcap \{B_{x}\,\mid B_{x} \in \mathfrak {C}\}$. Then B_z ⊆ B_x for every $B_{x}\in \mathfrak {C}$. Indeed, this is obvious, if z = x. If z ≠ x then d(φ(z), φ(x)) ≤ d(z, x) ≤ π_x = d(x, φ(x)), π_z = d(φ(z), z) ≤ π_x. Hence B_z ⊆ B_x. Since $\mathfrak {C}$ is a maximal chain in $\mathcal {B}$, then B_z is the smallest element of $\mathfrak {C}$. But π_φ(z) = d(φ(z), φ(φ(z))) < d(z, φ(z)) = π_z and therefore $B_{\varphi (z)} \subsetneq B_{z}$, contradicting the maximality of $\mathfrak {C}$. Hence there exists an element x ∈ X with φ(x) = x. If also φ(y) = y for x ≠ y, then d(x, y) = d(φ(x), φ(y)) < d(x, y), which is absurd. Thus there exists exactly one fixed point for φ. ■

Remark 1.43

Analysing the proof of Theorem 1.45, we see that to prove the existence of a fixed point for the mapping φ: X→X, it suffices to assume the following property. For any x, y ∈ X, d(φ(x), φ(y)) ≤ d(x, y) and for x ≠ φ(x), d(φ(x), φ(φ(x))) < d(x, φ(x)).

In the special case when Γ is totally ordered, we obtain the following characterization of principal completeness [153].

Theorem 1.46

Let (X, d, Γ) be an ultrametric space and assume that Γ is totally ordered. The following conditions are equivalent:

1.
X is principally complete
2.
Every strictly contracting mapping φ: X→X has a fixed point.

Proof

1.⇒2.: this was proved in Theorem 1.45.

2.⇒1.: We assume that X is not principally complete, so there exists a chain $\mathcal {C}$ of principal balls such that $\bigcap \mathcal {C}= \emptyset $. Hence $\mathcal {C}$ dos not have a smallest ball and therefore the coinitial type λ of $\mathcal {C}$ is a limit ordinal. Then there exists a strictly decreasing family (B_i)_i<λ of balls $B_{i} \in \mathcal {C}$ such that $\bigcap _{i< \lambda }B_{i}= \bigcap \mathcal {C}=\emptyset $. We write $B_{i}= B_{\gamma _{i}}(a_i)$ and we define φ: X→X. If x ∈ X there exists the smallest κ = κ(x) < λ such that x∉B_κ, we define φ(x) = a_κ. We show that φ is strictly contracting. Let x, y ∈ X, x ≠ y. If κ(x) = κ(y) then 0 = d(φ(x), φ(y)) < d(x, y). If κ(x) ≠ κ(y), say κ(x) < κ(y), from B_κ(x) ⊃ B_κ(y) and x∉B_κ(x), y ∈ B_κ(x) we get d(x, y) > γ_κ(x) ≥ d(φ(x), φ(y)). So φ is strictly contracting. From the definition of φ, it is obvious that φ does not have a fixed point. ■

Brouwer’s fixed point theorem, in mathematics, a theorem of algebraic topology that was stated and proved by Brouwer [27, 28]. Inspired by the earlier work of the French mathematician Poincaré, Brouwer investigated the behavior of continuous functions mapping the closed ball of unit radius in n-dimensional Euclidean space into itself.

Theorem 1.47 (Brouwer’s Fixed Point Theorem)

Let X be an n-dimensional Euclidean space. Then, any continuous map of {x ∈ X : ∥x∥≤ 1} into itself has a fixed point.

As a consequence, we get

Theorem 1.48

Any continuous map T of a compact convex K set in n-dimensional Euclidean space X into itself has a fixed point.

Proof

Assume first that K ⊆ B_X = {x ∈ X : ∥x∥≤ 1}. Define G: B_X → K by taking G(x) to be the unique point y ∈ K such that ∥x − y∥≤∥x − z∥ for all z ∈ K. Such a vector y exists and unique. Note that G(x) = x = y if x ∈ K. Consider T ∘ G: B_X → K as a map from B_X into itself. The map H : B_X → B_X defined by H(x) = T(G(x)) is continuous because G is continuous. Let x_n → x. We have ∥x_n − G(x_n)∥≤∥x_n − z∥ for all z ∈ K. Hence, if y is any limit point of {G(x_n)} then ∥x − y∥≤∥x − z∥ for all z ∈ K. This proves that G(x) is the only limit of {G(x_n)} which lies in the compact set K. Hence G(x_n) → G(x). By Theorem 1.47 there exists x ∈ B_X such that T(G(x)) = x. Since the range of T is contained in K we get x ∈ K. But then G(x) = x so T(x) = x. This proves the theorem when K ⊆ B_X. For the general case choose R such that K ⊆{x ∈ X : ∥x∥≤ R}. Let K₁ = {R ⁻¹ x: x ∈ K}. Then K₁ is a compact convex set and the function T₁: K₁ → K₁ defined by T₁(x) = R ⁻¹ T(Rx) is continuous. By the first case there exists x₁ ∈ K₁ such that R ⁻¹ T(Rx₁) = x₁. If x = Rx₁ then T(x) = x. ■

Remark 1.44 (Kakutani’s Example)

Theorem 1.47 does not hold in an infinite dimensional Hilbert space:

if $T(x)= (\sqrt {1-\|x\|{ }^{2}},x_{1},x_{2},\cdots )$ then T maps {x ∈ l₂: ∥x∥≤ 1} into itself and is continuous. It has no fixed point.

Definition 1.66

A map T : Y → X where X is a normed space and Y ⊆ X is called compact if T(Z) is relatively compact whenever Z ⊆ Y is bounded.

Brouwer’s Theorem was extended to infinite dimensional spaces by Schauder in the following way [174].

Theorem 1.49 (Schauder’s Fixed Point Theorem)

Let Y be a closed bounded convex set in a normed space (X, ∥.∥) and T a continuous map of Y into itself. If T is compact then it has a fixed point.

Proof

Let Z ⊆ X be compact. Let ε > 0 and B_𝜖(x₁), B_𝜖(x₂), ⋯ , B_𝜖(x_N) cover Z where {x₁, x₂, ⋯ , x_N}⊆ Z. Let $m_{i}(x)= \max (\varepsilon - \|x-x_{i}\|,0)$ and $\varphi (x)= \frac {\sum _{i=1}^{N}m_{i}(x)x_{i}}{\sum _{j=1}^{N}m_{j}(x)}$ for x ∈ Z. It is obvious that each m_i is continuous and $\sum _{j=1}^{N}m_{j}(x)> 0$ for all x ∈ Z. Hence φ is continuous. If x ∈ Z then m_i(x) ≠ 0 implies ∥x − x_i∥ < ε and hence $\left \| \sum _{i=1}^{N}m_{i}(x)(x_{i}-x)\right \| < \varepsilon \sum _{i=1}^{N}m_{i}(x)$ which proves that ∥φ(x) − x∥ < ε (m_i(x) ≠ 0 for at least one i). Further φ(Z) ⊆conv(Z).

Let $W= \overline {T(Y)}$. Then W is a compact subset of Y . For each n let φ_n: W →conv(W) ⊆ Y be a continuous map such that $\|\varphi _{n}(x)-x\| < \frac {1}{n}$ for all x ∈ W for all n. This is possible by the reasoning above. Let T_n = φ_n ∘ T so that T_n is a continuous map : W → Y . So there is a finite set $\{x_{1}^{n},x_{2}^{n},\cdots ,x_{N_{n}}^{n}\}\subseteq W$ such that $\varphi _{n}(W)\subseteq W_{n}:= \mathrm {span}(\{x_{1}^{n},x_{2}^{n},\cdots ,x_{N_{n}}^{n}\})$. Let Y_n = Y ∩ W_n. Then Y_n is a compact convex set in the finite dimensional space W_n. We claim that T_n maps Y_n into itself. First note that T(Y_n) ⊆ T(Y ) ⊆ W so T_n = φ_n ∘ T is defined on Y_n. Also φ_n takes values in $\mathrm {conv}(\{x_{1}^{n},x_{2}^{n},\cdots ,x_{N_{n}}^{n}\})\subseteq W_{n}$ as well as in Y so it takes values in Y_n. By Theorem 1.48 there exists y_n ∈ Y_n such that T_n(y_n) = y_n. Since y_n ∈ Y and T(y_n) ∈ W we have $\|\varphi _{n}(T(y_{n}))- T(y_{n})\|< \frac {1}{n}$ for all n. In other words $\|y_{n}- T(y_{n})\|< \frac {1}{n}$ for all n. Since (T(y_n))_n ⊆ W and W is compact there is a subsequence $(T(y_{n_{j}}))_{n_{j}}$ converging to some y. Now

$$\displaystyle \begin{aligned} \|y_{n_{j}}- y\|\leq \|T(y_{n_{j}})- y\|+ \|y_{n_{j}}- T(y_{n_{j}})\|< \|T(y_{n_{j}})- y\|+ \frac{1}{n_{j}} \rightarrow 0. \end{aligned}$$

This implies T(y) = y. ■

Lemma 1.12

Let $Y_{0}= \bigg \{x= (x_{n})_{n\geq 1}\in l_{2}\colon |x_{n}|\leq \frac {1}{n}~ \mathit{\mbox{for all}}\, \,n\geq 1\bigg \}$ . Then any continuous map T : Y₀ → Y₀ has a fixed point.

Proof

We first prove that the parallelepiped Y₀ is compact in l₂. We have $Y_{0}= \bigcap _{n\geq 1} Z_{n}, Z_{n}= \bigg \{x= (x_{m})_{m\geq 1}\in l_{2}\colon |x_{n}|\leq \frac {1}{n}\bigg \}$. Since the canonical projection $p_{n}\colon l_{2}\to \mathbb {K}$ is continuous, it follows that $Z_{n}= p_{n}^{-1}(\overline {B_{\frac {1}{n}}(0)})$ is closed for all n ≥ 1, and therefore Y₀ is a closed set. Since the series $\sum _{n= 1}^{\infty }\frac {1}{n^{2}}$ converges, it follows that for any ε > 0 there exists n_ε ≥ 1 such that $\sum _{k= n_{\varepsilon }}^{\infty }\frac {1}{k^{2}}\leq \varepsilon $. Since $|p_{n}(x)|\leq \frac {1}{n}$ for all x ∈ Y₀ and n ≥ 1, it follows that $\sum _{k= n_{\varepsilon }}^{\infty }|p_{k}(x)|{ }^{2}\leq \varepsilon $ for all x ∈ Y₀, i.e., Y₀ is relatively compact in l₂. Hence Y₀ is compact.

Let Y_n = {(x₁, x₂, ⋯ , x_n, 0, 0, ⋯ ): x ∈ Y } and define T_n: Y_n → Y_n by T_n(x) = (y₁, y₂, ⋯ , y_n, 0, 0, ⋯ ) where y = T(x₁, x₂, ⋯ , x_n, 0, 0, ⋯ ). Y_n can be identified with compact convex set in $\mathbb {K}^{n}$ and T_n is continuous, hence it has a fixed point x ⁽ⁿ⁾. Since (x_n)_n≥1 ⊆ Y₀ and Y₀ is compact in (l₂, ∥.∥₂) there is a subsequence $(x_{n_{j}})_{j}$ converging to some x ∈ Y . Let $y^{n}= T(x_{1}^{(n)},x_{2}^{(n)},\cdots ,x_{n}^{(n)},0,0,\cdots )$ so that $x^{(n)}= T_{n}(x^{(n)})= (y_{1}^{(n)},y_{2}^{(n)},\cdots ,y_{n}^{(n)},0,0,\cdots )$. It is clear that $\lim _{n\to \infty }(x_{1}^{(n)},x_{2}^{(n)},\cdots ,x_{n}^{(n)},0,0,\cdots )= x$ so lim_n→∞ y ⁽ⁿ⁾ = T(x). Hence $x= \lim _{j\to \infty }x^{(n_{j})}= \lim _{j\to \infty }(y_{1}^{(n_{j})},y_{2}^{(n_{j})},\cdots ,y_{n}^{(n_{j})},0,0,\cdots )= \lim _{j\to \infty }y^{(n_{j})}= T(x)$. ■

Lemma 1.13

If Z is a closed convex of Y ₀ then every continuous map of Z into itself has a fixed point.

Proof

For each x ∈ Y₀ there is a unique point P(x) ∈ Z closet to x and the map P : Y₀ → Z is continuous. If T : Z → Z is continuous then G: Y₀ → Y₀ defined by G = T ∘ P is continuous. Hence by Lemma 1.12 there exists x ∈ Y₀ such that T(P(x)) = x. Since the range of T is contained in Z we see that x = T(P(x)) ∈ Z. But then P(x) = x so x = T(x). ■

Proposition 1.46

Let Y be a compact convex set in a locally convex topological vector space (X, τ). If Y has at least two points and T : Y → Y is continuous then there is a proper subset Y₁ of Y such that T(Y₁) ⊆ Y₁ and Y₁ is also compact and convex.

Proof

We reduce the proof to the case when the topology τ of X is replaced by the weak topology. We introduce an ordering for subsets of X′ as follows: Z ≤ W if for any ψ ∈ Z and ε > 0 there exists a finite subset φ₁, φ₂, ⋯ , φ_k of W and δ > 0 such that x, y ∈ Y and |φ_i(x) − φ_i(y)| < δ, 1 ≤ i ≤ k imply |ψ(T(x)) − ψ(T(y))| < ε. We observe that if Z ≤ W and φ(x) = φ(y) for all φ ∈ Z then ψ(T(x)) = ψ(T(y)). We claim that for any ψ ∈ Z there exists a countable family W = {φ₁, φ₂, ⋯ } such that {ψ}≤ W. For this let ε > 0. First note that T is weak-weak continuous and Y is compact convex in weak topology. By uniform continuity of ψ ∘ T on Y with its weak topology, |ψ(T(x)) − ψ(T(y))| < ε if x − y belongs to a suitable weak neighbourhood of θ. Hence there exists φ₁, φ₂, ⋯ , φ_k and δ > 0 such that |φ_i(x) − φ_i(y)| < δ, 1 ≤ i ≤ k implies |ψ(T(x)) − ψ(T(y))| < ε. Now vary ε over $\{\frac {1}{n}, n\geq 1\}$ to get a countable set W ⊆ X′. For any ε > 0 choose n such that $\frac {1}{n}< \varepsilon $. There exists φ₁, φ₂, ⋯ , φ_k and δ > 0 such that |φ_i(x) − φ_i(y)| < δ, 1 ≤ i ≤ k implies $|\psi (T(x))- \psi (T(y))|< \frac {1}{n} <\varepsilon $. It follows that if |φ(x) − φ(y)| < δ for all φ ∈ W then |ψ(T(x)) − ψ(T(y))| < ε. Hence {ψ}≤ W. If we now repeat the argument for each element of W to get another countable set W₁, then repeat the argument for each element of W₁ and so on we end up with countable family W₀ such that with ψ it self, we get a countable subset P of X′ which contains ψ with P ≤ P.

If Y₁ is weakly compact, convex and contained in Y then it is a weakly closed convex set, hence strongly closed. Hence it is a closed convex subset of Y in the strong (i.e., original) topology, hence strongly compact also. Thus, we may and do assume that the topology τ of X is the weak topology. Now suppose x, y ∈ Y, x ≠ y. Choose ψ such that ψ(x) ≠ ψ(y). Let P = {ψ₁ = ψ, ψ₂, ⋯ } be a countable subset of X′ containing ψ such that P ≤ P. Now ψ_n(Y ) is compact for each n ≥ 1. Because if Q = {α₁ ψ, α₂ ψ₂, ⋯ } with each α_n > 0 then Q ≤ Q, we may suppose $|\psi _{n}(z)|\leq \frac {1}{n}$ for all n ≥ 1, for all z ∈ Y . Define G: Y → l₂ by G(z) = (ψ_n(z))_n≥1. G is continuous and its range S is contained in $Y_{0}= \bigg \{x= (x_{n})_{n\geq 1}\in l_{2}\colon |x_{n}|\leq \frac {1}{n}~ \mbox{for all}\, \,n\geq 1\bigg \}$. S has at least two points because ψ(x) ≠ ψ(y). Let T₀: S → S be the map G ∘ T ∘ G ⁻¹. In other words, if s ∈ S we pick z ∈ Y such that s = G(z) and define T₀(s) = G(T(z)). To see that this is well defined note that s = G(z₁) = G(z₂) implies ψ_n(z₁) = ψ_n(z₂) for all n which implies ψ_n(T(z₁)) = ψ_n(T(z₂)) for all n (because P ≤ P) so G(T(z₁)) = G(T(z₂)) so T₀ is well defined. The fact that P ≤ P also implies that if ψ_n(z_m)→ψ_n(z) as m→∞ for each n then ψ_n(T(z_m))→ψ_n(T(z)) for each n. This means T₀ is continuous. Lemma 1.13 shows that T₀ has a fixed point s₁ ∈ S. Let Y₁ = G ⁻¹({s₁}). Let z ∈ Y₁ so G(z) = s₁. Then s₁ = T₀(s₁) = G(T(z)). Hence T(z) ∈ Y₁. Thus T(Y₁) ⊆ Y₁. Clearly Y₁ is convex. It is a closed subset of S and hence it is compact. ■

Tychonoff extended Brouwer’s result to a compact convex subset of a locally convex topological vector space [190].

Theorem 1.50 (Schauder-Tychonoff’s Fixed Point Theorem)

Any continuous map T from a compact convex subset Y of a locally convex topological vector space (X, τ) into Y has a fixed point.

Proof

By Proposition 1.46 there is a minimal nonempty compact convex set Y₀ such that T(Y₀) ⊆ Y₀ and Y₀ must be a singleton. ■

The following result [98], called Markov-Kakutani fixed point theorem, is powerful in that it determines a single fixed point for a whole family of mappings, while theorems such as the Schauder-Tychonoff fixed point theorem determine conditions on the space such that the restriction on the mapping is minimal, namely that we only require the mapping T to be continuous.

Theorem 1.51 (Markov-Kakutani’s Fixed Point Theorem)

Let Y be a compact convex subset of a locally convex topological vector space (X, τ). Let T_α: Y → Y (α ∈ I) be a family of continuous mappings that are affine (which means they satisfy the condition $T_{\alpha }(\sum _{i=1}^{n}\lambda _{i}x_{i})= \sum _{i=1}^{n}\lambda _{i}T_{\alpha }(x_{i})$ whenever $n\in \mathbb {N}, \lambda _{i}\geq 0$ for all i and $\sum _{i=1}^{n}\lambda _{i}= 1$ ). If T_α ∘ T_β = T_β ∘ T_α for all α, β ∈ I then there exists x ∈ Y such that T_α(x) = x for all α ∈ I.

Proof

For each α ∈ I, let Z_α = {x ∈ Y : T_α(x) = x}. From the Schauder-Tychonoff fixed point theorem we know that Z_α ≠ ∅. Since T_α is a continuous affine map, it follows that Z_α is compact and convex. So to restate the conclusion of the theorem we must show that ⋂_{α ∈ I} Z_α ≠ ∅. Since Y is compact, we have, by Proposition 1.4 that we need only show that ⋂_{α ∈ J} Z_α ≠ ∅ for each nonempty finite subset J of I. To this end, let J = {α₁, α₂, ⋯ , α_n} be a nonempty finite subset of I. We shall proceed by induction.

Let x be any element of $Z_{\alpha _{1}}$ then

$$\displaystyle \begin{aligned} T_{\alpha_{1}}(T_{\alpha_{2}}(x))= T_{\alpha_{2}}(T_{\alpha_{1}}(x))= T_{\alpha_{2}}(x). \end{aligned}$$

That is, $T_{\alpha _{2}}(x)$ is a fixed point of $T_{\alpha _{1}}$ and so $T_{\alpha _{2}}(x)\in Z_{\alpha _{1}}$. Thus, $T_{\alpha _{2}}(Z_{\alpha _{1}})\subseteq Z_{\alpha _{1}}$. Hence, from the Schauder-Tychonoff fixed point theorem, $T_{\alpha _{2}}$ has a fixed point in $Z_{\alpha _{1}}$. Therefore, $Z_{\alpha _{1}}\cap Z_{\alpha _{2}}\neq \emptyset $. Now, suppose that

$$\displaystyle \begin{aligned} Z_{\alpha_{1}}\cap Z_{\alpha_{2}}\neq \emptyset\cap\cdots \cap Z_{\alpha_{j}}~~ \mbox{where},~1\leq j\leq n. \end{aligned}$$

Let $Z= Z_{\alpha _{1}}\cap Z_{\alpha _{2}}\neq \emptyset \cap \cdots \cap Z_{\alpha _{j}}$. Then Z is nonempty, compact and convex. Let x be any element of Z and let 1 ≤ i ≤ j then

$$\displaystyle \begin{aligned} T_{\alpha_{i}}(T_{\alpha_{j+1}}(x))= T_{\alpha_{j+1}}(T_{\alpha_{i}}(x))= T_{\alpha_{j+1}}(x). \end{aligned}$$

That is, $T_{\alpha _{j+1}}(x)$ is a fixed point of $T_{\alpha _{i}}$ and so $T_{\alpha _{j+1}}(x)\in Z_{\alpha _{i}}$. Since 1 ≤ i ≤ j was arbitrary,

$$\displaystyle \begin{aligned} T_{\alpha_{j+1}}(x)\in Z_{\alpha_{1}}\cap Z_{\alpha_{2}}\neq \emptyset\cap\cdots \cap Z_{\alpha_{j}}=Z. \end{aligned}$$

Hence, from the Schauder-Tychonoff fixed point theorem, $T_{\alpha _{j+1}}$ has a fixed point in Z. Therefore,

$$\displaystyle \begin{aligned} Z_{\alpha_{1}}\cap Z_{\alpha_{2}}\neq \emptyset\cap\cdots \cap Z_{\alpha_{j}}\cap Z_{\alpha_{j+1}}\neq \emptyset. \end{aligned}$$

By induction, we see that ⋂_{α ∈ J} Z_α ≠ ∅. This completes the proof.■

We shall need some facts about the Kuratowski measure of noncompactness μ introduced by Kuratowski [122]. This measure of noncompactness is used by Darbo [40], Furi and Vignoli [61], Nussbaum [136], Petryshyn [150], and others.

The concept of Kuratowski’s measure of noncompactness is defined below.

Definition 1.67

Let (X, d) a metric space. If Y is a bounded subset of X (i.e., $\mathrm {diam} (Y)= \sup \{d(x,y)\colon x,y\in Y\}< \infty $), the Kuratowski measure of noncompactness of Y is defined by

$$\displaystyle \begin{aligned} \mu(Y)= \inf\Big\{ \delta> 0\colon Y= \bigcup_{i=1}^{n}Y_{i} \mbox{ for some } Y_{i} \mbox{ with } \mathrm{diam} (Y_{i})\leq \delta, 1\leq i\leq n< \infty \Big\}. \end{aligned}$$

We give the following properties of μ. For the proofs see [136].

Proposition 1.47

Let (X, d) be a metric space. If Y is a bounded subset of X, then $\mu (Y)= \mu (\overline {Y})$.

Proposition 1.48

Let (X, d) be a complete metric space. Then

1.
for every bounded subset Y of X, μ(Y ) = 0 if and only if $\overline {Y}$ is compact.
2.
If (Y_n)_n≥1 is a decreasing sequence of closed, bounded nonempty subsets of X and if lim_n→∞ μ(Y_n) = 0, then Y =⋂_n≥1 Y_n is compact and nonempty.

If (X, ∥.∥) is a normed space, the norm ∥.∥ gives a metric on X and one can take the Kuratowski measure of noncompactness μ on X with respect to this metric.

Proposition 1.49

Let (X, ∥.∥) be a normed space, Y, Z two bounded subsets of X, x₀ ∈ X and $\lambda \in \mathbb {K}$ . Then

1.
μ(λY ) = |λ|μ(Y ).
2.
μ(conv(Y )) = μ(Y ).
3.
μ(Y + Z) ≤ μ(Y ) + μ(Z).
4.
μ(Y ∪{x₀}) = μ(Y ).

Closely associated with the measure of noncompactness is the concept of k-set contraction.

Definition 1.68

If Y₁ is a subset of a metric space (X₁, d₁), and (X₂, d₂) is a second metric space and T : Y₁ → X₂ is a continuous map, we shall say that T is a k-set-contraction if μ₂(T(Z)) ≤ μ₁(Z), for all bounded sets Z ⊆ Y₁, where μ_i denotes the Kuratowski measure of noncompactness on (X_i, d_i).

Theorem 1.52 (Darbo’s Fixed Point Theorem)

Let Y be a closed bounded convex set in a Banach space (X, ∥.∥) and let T : Y → Y be a k-set-contraction with k < 1. Then T has a fixed point in Y [ 40 ].

There is a more useful generalization of Darbo’s fixed point theorem.

Theorem 1.53

Let Y be a closed bounded convex set in a Banach space (X, ∥.∥) and T : Y → Y a continuous map. Define $Y_{1}= \overline {\mathrm {conv}}(T(Y))$ and $Y_{n}= \overline {\mathrm {conv}}(T(Y_{n-1}))$ for n > 1 and assume that if lim_n→∞ μ(Y_n) = 0 where μ denotes the Kuratowski measure of noncompactness on X. Then T has a fixed point in Y .

If T in Theorem 1.53 is a k-set contraction with k < 1, then if lim_n→∞ μ(Y_n) = 0, but the conditions of Theorem 1.53 may be satisfied in cases of interest for which T is not a k-set contraction with k < 1.

The following result is an extension of Darbo’s fixed point theorem [61, 136, 172].

Theorem 1.54 (Sadovskii’s Fixed Point Theorem)

Let Y be a closed bounded convex set in a Banach space (X, ∥.∥) and let T : Y → Y be a continuous μ-condensing map (i.e., μ(T(Z)) < μ(Z), for all bounded sets Z ⊆ Y for which μ(Z) > 0). Then T has a fixed point in Y .

1.6 Nonexpansive Mappings

Definition 1.69

A mapping T is nonexpansive if ∥T(x) − T(y)∥≤∥x − y∥ for all x, y in its domain.

Definition 1.70

Let X be a Banach space and Y be a nonempty bounded closed convex subset of X. We say that Y has the fixed point property for nonexpansive mapping if for every nonexpansive mapping T : Y →Y , Y contains a fixed point x ^∗ (i.e., T(x ^∗) = x ^∗), X has the fixed point property (FPP for short) if any nonempty bounded closed convex subset of X has the fixed point property for nonexpansive mapping, X has the weak fixed point property (WFPP for short) if any weakly compact convex subset of X has the fixed point property for nonexpansive mapping.

Remark 1.45

For a reflexive Banach space, FPP and WFPP are obviously the same.

Definition 1.71

Let Y be a nonempty set. A nonempty subset Y₀ of Y is called invariant under T or T-invariant for a mapping T : Y →Y if T(Y₀) ⊂ Y₀ . Let $\mathcal {Y}$ be a class of subsets of Y . We say that an element $Y_0 \in \mathcal {Y}$ is $\mathcal {Y}$-minimal for T if there exists no proper T-invariant subset of Y₀ in the class $\mathcal {Y}$.

We are interested mainly in the case that Y is a subset of a Banach space X and $\mathcal {Y}$ is the class of weakly compact subsets of X or the class of closed convex subsets of X.

Remark 1.46

If Y is a closed convex subset of a Banach space X and T : Y →Y , then a decreasing sequence of nonempty, closed, convex, T-invariant sets may be obtained by setting

$$\displaystyle \begin{aligned} Y_{0}= Y\,\,\, \text{and}\,\,\, Y_{n+1}= \overline{\mathrm{conv}}(T(Y_n))\,\,\, \forall\, n\geq 1. \end{aligned}$$

We set

$$\displaystyle \begin{aligned} \widehat{Y}= \bigcap_{n=1}^{\infty}Y_{n}. \end{aligned}$$

The set $\widehat {Y}$ is closed, convex and T-invariant . But it may be empty. Of course this situation cannot occur if Y is weakly compact.

Proposition 1.50

If X is a Banach space, Y ⊆ X is a nonempty, weakly compact, convex set and T : Y →Y , then there exists a nonempty, closed, convex set $\widehat {Y} \subseteq Y$ which is minimal invariant for T.

Proof

Let Γ be the family of all nonempty, closed, convex subsets of Y which are T-invariant. We order Γ by reverse inclusion, namely if Y₁, Y₂ ∈ Γ, then

$$\displaystyle \begin{aligned} Y_{1}\leq Y_{2} \Longleftrightarrow Y_{2}\subset Y_{1}. \end{aligned}$$

By the finite intersection property for the weak topology, every chain in Γ has an upper bound (namely the intersection of the elements in the chain). So by the Zorn lemma , Γ has a maximal element $\widehat {Y} \in \Gamma $. Evidently $\widehat {Y}$ is T-invariant. ■

Remark 1.47

Note that if $\widehat {Y} \subseteq Y$ is a nonempty, closed, convex and minimal T-invariant set, then

$$\displaystyle \begin{aligned} \widehat{Y}= \overline{\mathrm{conv}}(T(\widehat{Y})). \end{aligned}$$

If $\widehat {Y} \in \Gamma $ in Proposition 1.50 is a singleton, i.e., $\widehat {Y}= \{y\}$, then

$$\displaystyle \begin{aligned} T(y)= y, \end{aligned}$$

i.e., it is a fixed point of T.

The famous question whether a Banach space has the fixed point property had remained open for a long time. It has been answered in the negative by Sadovski [172] and Alspach [4] who constructed the following examples, respectively.

Examples 1.10

1.
Let X = c₀ and Y = {x ∈ c₀, ∥x∥_∞≤ 1}. Define T : Y →Y by
$$\displaystyle \begin{aligned} T(x)= (1,x_{1},x_{2},x_{3}, \ldots), \,\text{for all }\, x= (x_{1},x_{2},x_{3}, \ldots) \in Y. \end{aligned}$$
2.
Let X = L ¹(0, 1) and
$$\displaystyle \begin{aligned} Y= \left\{ x \in X, \, 0\leq x(t)\leq 1 \,\, \text{and} \int_{0}^{1}x(t)dt= \frac{1}{2}\right\}. \end{aligned}$$

Define T : Y →Y by
$$\displaystyle \begin{aligned} T(x)(t)= \left\{\begin{array}{ll}\min\{1,2x(2t)\}, \ \mathrm{if} \ 0\leq t\leq \frac{1}{2}, \\[0.3cm] \max\{0, 2x(2t-1)-1\}, \ \mathrm{if}\ \frac{1}{2}< t\leq 1. \end{array}\right. \end{aligned}$$

Then Y is bounded, closed, and convex, and T is an isometry ∥T(x) − T(y)∥₁ = ∥x − y∥₁, for all x, y ∈ Y and is fixed point free.

Namely, c₀ and L ¹(0, 1) do not have the fixed point property. The above two examples suggest that to obtain positive results for the existence of fixed points for nonexpansive mappings, it is necessary to impose some restrictions either on T or on the Banach space X.

The following well-known result is due to Kirk [107].

Theorem 1.55

Let X be a reflexive Banach space and Y a closed bounded convex subset of X. Let Y have normal structure. If T : Y → Y is nonexpansive, then T has a fixed point.

Remark 1.48

Theorem 1.55 remains true if X is any Banach space and Y is a convex weakly compact subset having normal structure.

An immediate consequence of Theorem 1.55 is the following well-known result, which was proved independently by Browder [29], Göhde [69] and Kirk [107].

Theorem 1.56

Let X be a uniformly convex Banach space and Y a nonempty closed bounded convex subset of X. If T : Y → Y is nonexpansive, then T has a fixed point.

Remark 1.49

For nonexpansive maps, no characterization of FPP or WFPP seems to be known [21].

References

D.E. Alspach, A fixed point free nonexpansive mapping. Proc. Amer. Math. Soc. 82, 423–424 (1981)
Article MathSciNet MATH Google Scholar
A.V. Archangelskii, Topological Spaces of Functions. Moscow State Univ., 1989 (in Russian) (Kluwer Acad. Publ., Dordrecht, 1992). (in English)
Google Scholar
S. Banach, Sur les opérations dans les ensembles abstraits et leurs applications. Fund. Math. 3, 133–181 (1922)
Article MathSciNet MATH Google Scholar
C.S. Barroso, The approximate fixed point property in Hausdorff topological vector spaces and applications. Discrete Contin. Dyn. Syst. 25, 469–479 (2009)
Article MathSciNet Google Scholar
C.S. Barroso, O.F.K. Kalenda, P.-K. Lin, On the approximate fixed point property in Hausdorff topological vector spaces and applications. Math. Z. 271(3–4), 1271–1285 (2012)
Article MathSciNet MATH Google Scholar
J.M. Bayod, J. Martinez-Maurica, Ultrametrically injective spaces. Proc. Amer. Math. Soc. 101(3), 571–576 (1987)
Article MathSciNet MATH Google Scholar
T.D. Benavides, A renorming of some nonseparable Banach spaces with the fixed point property. J. Math. Anal. Appl. 350, 525–530 (2009)
Article MathSciNet MATH Google Scholar
J. Bourgain, D.H. Fremlin, M. Talagrand, Pointwise compact sets of Baire measurable functions. Amer. J. Math. 100, 845–886 (1978)
Article MathSciNet MATH Google Scholar
R.D. Bourgin, Geometric Aspects of Convex Sets with the Radon-Nikodým Property. Lecture Notes in Math., vol. 993 (Springer-Verlag, 1983)
Google Scholar
L.E.J. Brouwer, Über Abbildungen von Mannigfaligkeiten. Math. Ann. 71, 97–115 (1910)
Google Scholar
L.E.J. Brouwer, An intuitionist correction of the fixed point theorem on the sphere. Proc. R. Soc. Lond. A Math. Phys. Sci. 213, 1–2 (1952)
MathSciNet MATH Google Scholar
F.E. Browder, Fixed point theorems for nonlinear semicontractive mappings in Banach spaces. Arch. Rat. Mech. Anal. 21, 259–269 (1966)
Article MathSciNet MATH Google Scholar
J. Caristi, Fixed point theorems for mappings satisfying the inwardness condition. Trans. Amer. Math. Soc. 30 (part 3), 1527–1534 (1997)
Google Scholar
G. Darbo, Punti uniti in trasformazioni a codimino non compatto. Rend. Sem. Mat. Univ. Padova. 24, 84–92 (1955)
MathSciNet MATH Google Scholar
G. Díaz, Monte subspaces in the countable projective limits of L_p(μ)-spaces. Canad. Math. Mat. Bull. 32, 169–176 (1989)
Article Google Scholar
M. Fabian, Gâteaux Differentiability of Convex Functions and Topology: Weak Asplund Spaces (Wiley-Intersscience, New York, 1997)
MATH Google Scholar
M. Fitting, Metric methods, three examples and a theorem. J. Logic Program. 21, 113–127 (1994)
Article MathSciNet MATH Google Scholar
M. Furi, A. Vignoli, Fixed points for densifying mappings. Rendi. Accademie. Naz. Lincei. 47, 465–467 (1969)
MathSciNet MATH Google Scholar
D. Göhde, Zum prinzip der kontraktiven abbildung. Math. Nachr. 30, 251–258 (1965)
Article MathSciNet MATH Google Scholar
P. Habala, P. Hájek, V. Zizler, Introduction to Banach Spaces I, II (Matfyzpress, Prague, 1996)
MATH Google Scholar
U. Heckmanns, Aspects of Ultrametric Spaces, volume 109 of Queen’s Papers in Pure and Applied Mathematics (Queen’s University, Kingston, 1998)
MATH Google Scholar
P. Hitzler, A.K. Seda, Generalized ultrametrics, domains and an application to computational logic. Irish Math. Soc. Bull. 41, 31–43 (1998)
Article MathSciNet MATH Google Scholar
P. Hitzler, A.K. Seda, The fixed point theorems of Priesse-Crampe and Ribenboim in logic programming. Fields Inst. Commun. 32, 219–235 (2002)
MATH Google Scholar
R.C. James, A separable somewhat reflexive Banach space with non separable dual. Bull. Amer. Math. Soc. (N.S.) 80, 738–743 (1974)
Google Scholar
S. Kakutani, Two Fixed-point theorems concerning bicompact convex sets. Proc. Imp. Acad. 14(7), 242–245 (1938)
MathSciNet MATH Google Scholar
O.F.K. Kalenda, Spaces not containing l₁ have weak approximate fixed point property. J. Math. Anal. Appl. 373, 134–137 (2011)
Article MathSciNet MATH Google Scholar
M.A. Khamsi, V. Kreinovich, D. Misane, A new method of proving the existence of answer sets for disjunctive logic programs, in Proceeding of the Workshop on Logic Programming with Incomplete Information, Vancouver, B. C., October (1993), pp. 227–244
Google Scholar
W.A. Kirk, A fixed point theorem for mappings which do not increase distance. Am. Math. Mon. 72, 1004–1006 (1965)
Article MathSciNet MATH Google Scholar
W.A. Kirk, N. Shahzad, Fixed Point Theory in Distance Spaces (Springer International Publishing, Cham, 2014)
Book MATH Google Scholar
M. Krötzsch, Generalized ultrametric spaces in quantitative domain theory. Theoret. Comput. Sci. 368(1–2), 30–49 (2006)
Article MathSciNet MATH Google Scholar
C. Kuratowski, Sur les espaces complets. Fund. Math. 15, 301–309 (1930)
Article MATH Google Scholar
J. Lindenstrauss, C. Stegall, Examples of separable spaces which do not contain l₁ and whose duals are non-separable. Stud. Math. 54, 81–105 (1975)
Article MATH Google Scholar
R.D. Nussbaum, The fixed point index and fixed point theorems for k-set contractions. Doctoral dissertation, University of Chicago, 1969
Book MATH Google Scholar
W.V. Petryshyn, Structures of the fixed point sets of the k-set contractions. Arch. Rat. Mech. Anal. 40, 312–328 (1971)
Article MathSciNet MATH Google Scholar
S. Priess-Crampe, Der Banachsche Fixpunktsatz für ultrametrishe Räume. Results Math. 18, 178–186 (1990)
Article MathSciNet MATH Google Scholar
S. Priess-Crampe, Remarks on some theorems of functional analysis, in Ultrametric Functional Analysis (Clermont-Ferrand, 2004), Contemp. Math., vol. 384 (American Mathematical Society, Providence, RI, 2005), pp. 235–246
Google Scholar
S. Priess-Crampe, P. Ribenboim, Fixed point, combs and generalized power series. Abh. Math. Semin. Univ. Hambg. 63, 227–244 (1993)
Article MathSciNet MATH Google Scholar
S. Priess-Crampe, P. Ribenboim, Generalized ultrametric spaces. I. Abh. Math. Semin. Univ. Hambg. 66, 55–73 (1996)
Article MathSciNet MATH Google Scholar
S. Priess-Crampe, P. Ribenboim, Generalized ultrametric spaces. II. Abh. Math. Semin. Univ. Hambg. 67, 19–31 (1997)
Article MathSciNet MATH Google Scholar
S. Priess-Crampe, P. Ribenboim, The common fixed point theorem for ultrametric spaces. Geom. Dedicate 72, 105–110 (1998)
Article MATH Google Scholar
S. Priess-Crampe, P. Ribenboim, Fixed point and attractor theorems for ultrametric spaces. Forum Math. 12, 53–64 (2000)
MathSciNet MATH Google Scholar
S. Priess-Crampe, P. Ribenboim, Ultrametric Dynamics. Illinois J. Math. 55, 287–303 (2011)
Google Scholar
S. Priess-Crampe, P. Ribenboim, The approximation to a fixed point. J. Fixed Point Theory Appl. 14, 41–53 (2013)
Article MathSciNet MATH Google Scholar
P. Ribenboim, The new theory of ultrametric spaces. Periodica Math. Hung. 32, 103–111 (1996)
Article MathSciNet MATH Google Scholar
B.N. Sadovskiĭ, On a fixed point principle. Funktsional. Anal. i Prilozhen 1(2), 74–76 (1967). (in Russian)
Google Scholar
J. Schauder, Der fixpunktsatz funktional räumen. Stud. Math. 2, 361–370 (1930)
Article Google Scholar
E. Schören, On immediate extensions of ultra metric spaces. Results Math. 29, 287–303 (1996)
Google Scholar
A.K. Seda, Quasi-metrics and the semantics of logic programs. Fundam. Inform. 29, 97–117 (1997)
Article MathSciNet MATH Google Scholar
A. Tychonoff, Ein fixpunktsatz. Math. Ann. 111, 767–676 (1935)
Article MathSciNet MATH Google Scholar

Download references

Author information

Authors and Affiliations

Faculty of Sciences, Mathematics University of Sfax, Sfax, Tunisia
Afif Ben Amar
School of Mathematical and Statistical Sciences, National University of Ireland, Galway, Ireland
Donal O’Regan

Authors

Afif Ben Amar
View author publications
You can also search for this author in PubMed Google Scholar
Donal O’Regan
View author publications
You can also search for this author in PubMed Google Scholar

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Ben Amar, A., O’Regan, D. (2022). Basic Concepts. In: Topology and Approximate Fixed Points. Developments in Mathematics, vol 71. Springer, Cham. https://doi.org/10.1007/978-3-030-92204-7_1

Download citation

DOI: https://doi.org/10.1007/978-3-030-92204-7_1
Published: 08 December 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-92203-0
Online ISBN: 978-3-030-92204-7
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)

Publish with us

Policies and ethics

Basic Concepts

Abstract

Similar content being viewed by others

70 Years of asymptotic fixed point theory

A Generalization of Nadler’s Fixed Point Theorem

An answer to an open problem of Jachymski

1.1 Topological Spaces

1.1.1 The Notion of Topological Spaces

Definition 1.1

Examples 1.1

Definition 1.2

Definition 1.3

Remark 1.1

Proposition 1.1

Corollary 1.1

Proof

Definition 1.4

Examples 1.2

Proposition 1.2

Definition 1.5

Theorem 1.1

Remark 1.2

Definition 1.6

Example 1.1

Remark 1.3

Definition 1.7

Remark 1.4

Definition 1.8

Definition 1.9

Proposition 1.3

Proposition 1.4

Proof

Remark 1.5

Proposition 1.5

Proof

Theorem 1.2

Theorem 1.3

Proof

Definition 1.10

Proposition 1.6

Proof

1.1.2 Comparison of Topologies

Definition 1.11

Remark 1.6

Example 1.2

Proposition 1.7

Proof

Definition 1.12

Proposition 1.8

Proof

Definition 1.13

Proposition 1.9

Proof

1.1.3 Nets and Convergence in Topology

Definition 1.14

Examples 1.3

Definition 1.15

Definition 1.16

Definition 1.17

Example 1.3

Definition 1.18

Definition 1.19

Example 1.4

Remark 1.7

Theorem 1.4

Proof

Lemma 1.1

Proof

Theorem 1.5

Proposition 1.10

Proof

Theorem 1.6

Proof

Theorem 1.7

Proposition 1.11

Proof

1.2 Topological Vector Spaces

1.2.1 Linear Topologies

Definition 1.20

Remark 1.8