Old and New: Convergence and compactness II: Weak compactness

This post is a continuation of the previous post on convergence and compactness. Here we study the important notions of weak and weak* convergence in details, and to provide ourselves a fundamental framework for studying compactness methods in PDEs. Major reference is Terry's notes, and also several discussions found in Mathstackexchange.

1.1. Compactness in the norm topology

We recall first the Heine-Borel theorem for metric spaces.

Theorem 1 (Heine-Borel) Let $ {(X,d)}$ be a metric space. Let $ {K\subset X}$. The following are equivalent:

$ {K}$ is sequential compact;

$ {K}$ is compact, i.e. every open cover has a finite subcover;

$ {K}$ is complete, and totally bounded, i.e. for each $ {\epsilon>0}$ there is a finite number of metric balls of radius $ {\epsilon}$ covers $ {K}$.

Several simplifications can be made if the metric space $ {X}$ is a finite dimensional normed vector space over $ {\mathbb{R}}$. First,

Proposition 2 $ {X}$ as a finite dimensional normed vector space will be automatically complete.

In fact, more is true: any two norms on a finite dimensional vector space are equivalent, and we can identify this space with $ {\mathbb{R}^{n}}$ equipped with the $ {\ell^{1}}$-norm. Given this, we can show

Proposition 3 $ {K\subset X}$ is bounded if and only if totally bounded.

It suffices to show for the closed cube $ {Q=[-1,1]^{n}}$ is totally bounded. If not, let $ {\{U_{\alpha}\}_{\alpha\in A}}$ be a collection of infinite open balls of the same radius that cover $ {Q}$ but without a finite subcover. Because $ {n}$ is finite, we can subdivide $ {Q}$ into $ {2^{n}}$ closed cubes of equal sides. By pigeon hole principle, at least one of the cubes requires infinite cover. Call this cube $ {Q_{1}}$. Continuing, we have a nested sequence

$ \displaystyle Q_{1}\supset Q_{2}\supset\cdots $

which by completeness satisfies $ {\bigcap_{n=1}^{\infty}Q_{n}=\{x\}}$ for some $ {x\in Q}$. But then the sequence will eventually lies in one covering ball. This is enough to deduce a contradiction. Finally, since complete subsets are also closed, we conclude the usual Heine-Borel theorem that $ {K\subset X}$ if and only if $ {K}$ is closed and bounded.
In the case of infinite dimensional Banach space (since compact sets are neccesarily complete, a Banach space in fact gives us more compact sets), the norm topology is so strong that it forces the compact sets to be "almost finite dimensional''.

Proposition 4 Let $ {V}$ be a Banach space and $ {K\subset V}$. Then $ {K}$ is compact if and only if $ {K}$ is closed and bounded, and for each $ {\epsilon>0}$, $ {K}$ lies in the $ {\epsilon}$-neighborhood of some finite dimensional subspace $ {W\subset V}$.

This follows by a direct application of Theorem 1.

Example 1 By Proposition 4, we see that in $ {\ell^{p}(\mathbb{N})}$, $ {1\leq p<\infty}$, a subset $ {K}$ is compact if and only if $ {K}$ is closed and bounded, and also uniformaly integrable at the spatial infinity, in the sense that for each $ {\epsilon>0}$, there is $ {n>0}$ such that

$ \displaystyle \left(\sum_{m>n}|f(m)|^{p}\right)^{1/p}\leq\epsilon $
for all $ {f\in K}$. Hence we see that the "moving bump'' $ {(e_{n})_{n=1}^{\infty}}$ is not compact, despite being closed and bounded in $ {\ell^{p}(\mathbb{N}).}$

We conclude this section with a construction of F. Riesz, which leads to the remarkable converse to the statement that closed unit ball is compact if the space is finite dimensional. Thus in particular, compact sets in an infinite dimensional space must have empty interior.
Consider an infinite dimensional normed vector space $ {X}$, and $ {W\subset X}$ a finite dimensional subspace. Take $ {x\in X}$. Then there is $ {y\in W}$ such that it is a distance minimizer:

$ \displaystyle \|x-y\|\leq\|x-w\| $

for all $ {w\in W}$. This follows from the completeness of $ {W}$, note however $ {y}$ need not be unique unless the space is uniformly convex. Now take $ {p\notin W}$. Then there is $ {w_{p}\in W}$ such that

$ \displaystyle \|p-w_{p}\|=\inf_{w\in W}\|p-w\|=R>0. $

Put

$ \displaystyle x=\frac{p-w_{p}}{R} $

so that $ {\|x\|=1}$. And we find that

$ \displaystyle \begin{array}{rcl} \inf_{w\in W}\|x-w\| & = & \inf_{w\in W}\|\frac{p-w_{p}}{R}-w\|\\ & = & \inf_{w\in W}\|\frac{p-w_{p}-R\cdot w}{R}\|=1. \end{array} $

Hence the point $ {x}$ is such that $ {\|x\|=1}$ and $ {\|x-w\|\geq1}$ for all $ {w\in W}$. This says that for any finite dimensional subspace, there exists a point on the unit sphere that is away form that subspace. This entails the following property of the unit sphere $ {S}$ in $ {X}$: pick any $ {x_{0}\in S}$, and let $ {x_{n}}$ be a point such that $ {\|x-w\|\geq1}$ for all $ {w\in\text{span}\{x_{0},\dots x_{n-1}\}}$. Then the sequence $ {(x_{n})_{n=0}^{\infty}}$ cannot have any convergent subsequence. The noncompactness of $ {S}$ thus follows.

1.2. The weak and weak* topologies

Recall that a topological vector space $ {V}$ is a vector space with a topology such that vector space operations are continuous with respect to that topology. We shall refer to this given topology as the strong topology on $ {V}$. This topology gives rise to the notion of dual space $ {V^{*}}$ of $ {V}$, defined to be the space of all continuous linear functionals on $ {V}$.
If the space is finite dimensional, its dual space can be identified with the space itself by a choice of basis $ {\{e_{1},\dots,e_{n}\}}$ and the map

$ \displaystyle \Phi:e_{i}\mapsto\Phi(e_{i})=\delta_{i}, $

where $ {\delta_{i}}$ is the Kronecker delta. The choice of basis had identified $ {V}$ with $ {\mathbb{R}^{n}}$, on which the product topology can be shown to be equivalent to any norm topology. Yet it is also the weakest topology that makes any $ {\lambda\in V^{*}}$ continuous, since a linear combination of $ {\delta_{i}}$'s is essentially the same with the corresponding linear combination of coordinate projection $ {\pi_{i}}$'s. The situation is quite different for infinite dimensional spaces. We make the following definition.

Definition 5 The weak topology on a topological vector space $ {V}$ is generated by all the semi-norms (i.e. a norm except that $ {\|v\|=0}$ does not neccessarily imply $ {v=0}$) of the following form

$ \displaystyle \|\cdot\|_{\lambda}:\|v\|_{\lambda}=|\lambda(v)|, $
for $ {\lambda\in V^{*}}$. The basic open sets generated by $ {\|\cdot\|_{\lambda}}$ are of the form

$ \displaystyle \{v\in V:|\lambda(v)|<c\}, $
for $ {c>0}$. A sequence $ {(v_{n})_{n=1}^{\infty}}$ converging to $ {v\in V}$ in the weak topology satisfies

$ \displaystyle \lambda(v_{n})\rightarrow\lambda(v) $
for all $ {\lambda\in V^{*}}$. We call the sequence $ {(v_{n})_{n=1}^{\infty}}$ converges weakly to $ {v}$, and denote it by

$ \displaystyle v_{n}\rightharpoonup v. $

By this definition, all $ {\lambda\in V^{*}}$ is continuous in the weak topology, since $ {\lambda}$ is continuous in the topology generated by its own semi-norm $ {\|\cdot\|_{\lambda}}$. Also, the weak topology is easily seen to be the weakest of this kind (i.e. contains fewest open sets). Thus it is weaker than the strong topology.
In a similar spirit,

Definition 6 The weak* topology on the dual space $ {V^{*}}$ of a topological vector space $ {V}$ is generated by all the semi-norms

$ \displaystyle \|\cdot\|_{v}:\|\lambda\|_{v}=|\lambda(v)|, $
for $ {v\in V.}$ Similarly, a sequence $ {(\lambda_{n})_{n=1}^{\infty}}$ converging to $ {\lambda\in V^{*}}$ in the weak* topology satisfies

$ \displaystyle \lambda_{n}(v)\rightarrow\lambda(v) $
for all $ {v\in V}$. We call the sequence $ {(\lambda_{n})_{n=1}^{\infty}}$ converges weak-starly to $ {\lambda}$, and denote it by

$ \displaystyle \lambda_{n}\rightharpoonup^{*}\lambda. $

Note that weak* topology on $ {V^{*}}$ depends on which predual $ {V}$ we use, so one cannot talk about the weak* topology unless the predual is specified or there is no confusion in the context.
By making use of the bidual $ {\left(V^{*}\right)^{*}}$, we can also talk about the weak topology on $ {V^{*}}$, and so on. The relation between the definitions of these topologies can be schematically shown as follows:

Because of the embedding $ {V\hookrightarrow\left(V^{*}\right)^{*}}$(which uses Hahn-Banach theorem), the weak* topology on $ {V^{*}}$ is weaker than the weak topology on $ {V^{*}}$.
From now on we shall be mainly interested in case $ {V}$ is a normed vector space.

Remark 1 In probability literature people usually refer to the weak-star convergence of measures as weak convergence of measures or vague convergence of measures, perhaps because the dual spaces of measure spaces are seldom considered.

The first question regards to the notion of weak and weak-star convergence is that if their limits, if exist, is unique. This is non-trivial since the topology generated by each individual semi-norm is non-Hausdorff (since each one has a non-trivial kernel). That altogether they generate a Hausdorff topology is due to the Hahn-Banach theorem.

Proposition 7 Let $ {V}$ be a normed vector space. Then the weak and weak* topologies on $ {V}$ and $ {V^{*}}$are Hausdorff.

Proof: By Hahn Banach, if $ {\lambda(v)=0}$ for all $ {\lambda\in V^{*}}$, then $ {v=0}$. $ \Box$

Example 2 Let $ {V=c_{0}(\mathbb{N})}$ (equipped with sup-norm), $ {V^{*}=\ell^{1}(\mathbb{N})}$, $ {\left(V^{*}\right)^{*}=\ell^{\infty}(\mathbb{N})}$, and let $ {e_{1},e_{2},\dots}$ be the standard basis of either of the three spaces that will be specified in the context.

$ {(e_{n})_{n=1}^{\infty}}$ converges weakly to $ {0}$ in $ {V}$, but not in the sup-norm;

$ {(e_{n})_{n=1}^{\infty}}$ converges weak-starly to $ {0}$ in $ {V^{*}}$, but not weakly in $ {V^{*}}$;

$ {\left(\sum_{m=n}^{\infty}e_{m}\right)_{n=1}^{\infty}}$ converges weak-starly to $ {0}$ in $ {\left(V^{*}\right)^{*}}$, but not weakly. The latter can be explained as follows. Recall that the dual space of $ {\ell^{\infty}(\mathbb{N})}$ is strictly larger than $ {\ell^{1}(\mathbb{N})}$. One can use Hahn-Banach theorem to extend the limit functional (which is continuous) defined on the space of bounded convergent sequence $ {c(\mathbb{N})\subset\ell^{\infty}(\mathbb{N})}$ to a generalised limit functional $ {\lim^{*}\in\left(\ell^{\infty}(\mathbb{N})\right)}$ . Then since $ {\sum_{m=n}^{\infty}e_{m}\in c(\mathbb{N})\subset\ell^{\infty}(\mathbb{N})}$ for each $ {n}$, $ {\lim^{*}(\sum_{n=m}^{\infty}e_{n})=1\neq0}$ for each $ {n}$, thus the seqeunce $ {\left(\lim^{*}(\sum_{n=m}^{\infty}e_{n})\right)_{n=1}^{\infty}}$does not converge to zero.

But it is perhaps quite surprising that in $ {V^{*}=\ell^{1}(\mathbb{N})}$, a sequence converges weakly will also converge strongly. This is known as the Schur property of $ {\ell^{1}(\mathbb{N})}$, where in this case depends quite strongly on the discrete nature of $ {\mathbb{N}}$. Nevertheless, the unit ball $ {B=\{x\in\ell^{1}(\mathbb{N}):\|x\|_{1}<1\}}$ is open in the strong topology, but not in the weak topology, so these two topologies are not the same, abide having the same converging sequences. The reason is that a non-empty weakly open set needs to be unbounded, since any weakly open set is the union of finite intersetions of the sets of the form

$ \displaystyle U=\{x:|\lambda(x)|<c\} $
for $ {c>0}$. Hence in particular, the kernel of $ {\lambda}$ is contained in $ {U}$. So once a open set is non-empty, it contains at least a one dimensional subspace spanned by a vector in the kernel of some $ {\lambda}$. And since $ {\ker\lambda}$ has at most codimension one, finite intersections of this kind of sets contain at least a one dimensional subsepace. In fact, this also implies the weak topology of a normed vector space is not normable.

Although the weak topology is not normable, and so a priori there is no notion of boundedness of sets, it turns out sets that are weakly bounded (i.e. $ {E\subset V}$ such that $ {\lambda(E)}$ is bounded for each $ {\lambda\in V^{*}}$\}) are also strongly bounded (i.e. bounded in norm), due to the uniform boundedness principle.

Proposition 8 Let $ {V}$ be a normed vector space, and $ {E\subset V}$. Then $ {E}$ is strongly bounded if and only if weakly bounded.

Proof: We show the "if'' part. By the isometric embedding $ {V\hookrightarrow(V^{*})^{*}}$, each $ {x\in E}$ can be thought of as an element in $ {(V^{*})^{*}}$. Then $ {\sup_{x\in E}|\lambda(v)|<+\infty}$ for each $ {\lambda}$. By the uniform boundedness principle, $ {\sup_{x\in E}\|x\|_{V}<+\infty}$. $ \Box$

Making use of the basic inequality $ {|\lambda(v)|\leq\|\lambda\|_{V^{*}}\|v\|_{V}}$, we obtain

Corollary 9 Weakly and weak-starly convergent sequences are bounded. In fact, if $ {x_{n}\rightharpoonup x}$ in $ {V}$, then

$ \displaystyle \|x\|_{V}\leq\liminf_{n\rightarrow\infty}\|x_{n}\|_{V}. $
Similarly, if $ {\lambda_{n}\rightharpoonup^{*}\lambda}$ in $ {V^{*}}$, then

$ \displaystyle \|\lambda\|_{V^{*}}\leq\liminf_{n\rightarrow\infty}\|\lambda_{n}\|_{V^{*}}. $
Moreover, strict inequalities can hold. See Example 2.

We shall develop the converse to the above corollary except relaxed to subsequences in reflexive spaces, known as the Banach-Eberlein-Smulian theorem. Thus weak and weak* topologies enjoy much better compactness properties, in contrast to the strong (i.e. norm) topologies. For later purpose we need the following observation, which follows directly from the above corollary.

Lemma 10 Let $ {V}$ be a Banach space. Then the closed unit ball in $ {V}$ is closed in the weak topology; also, the closed unit ball in $ {V^{*}}$ is closed in the weak* topology.

Note that the statement for unit sphere is certainly false (more or less a paraphrase of that strict inequality can hold in Corollary 9).

1.3. Weak compactness: from the viewpoint of product topological spaces, again

Example 3 Let $ {V=c_{0}(\mathbb{N})}$ (equipped with sup-norm), $ {V^{*}=\ell^{1}(\mathbb{N})}$, $ {\left(V^{*}\right)^{*}=\ell^{\infty}(\mathbb{N})}$ .

Suppose $ {x_{n}\rightharpoonup x}$ in $ {V}$. We know the sequence is bounded in sup-norm. By definition of weak convergence, we have
$ \displaystyle \sum_{i=1}^{\infty}a^{(i)}x_{n}^{(i)}\rightarrow\sum_{i=1}^{\infty}a^{(i)}x^{(i)}<+\infty $
for all $ {a=(a^{(i)})_{i=1}^{\infty}\in\ell^{1}(\mathbb{N})}$. Then take
$ \displaystyle a=\sum_{i=1}^{\infty}\frac{1}{2^{i}}e_{i}. $
We see that
$ \displaystyle \sum_{i=1}^{\infty}\frac{1}{2^{i}}\left|x_{n}^{(i)}-x^{(i)}\right|\rightarrow0. $
This implies $ {x_{n}\rightarrow x}$ pointwisely in $ {V}$. By a similar argument we can see that the converse is also true: a bounded sequence that converges pointwisely will also converge weakly in $ {V}$.

Similarly, a sequence $ {\lambda_{n}\rightharpoonup^{*}\lambda}$ in $ {V^{*}}$ if and only if the sequence $ {(\lambda_{n})_{n=1}^{\infty}}$ is bounded in $ {\ell^{1}(\mathbb{N})}$ and converge pointwise to $ {\lambda}$. For the "only if'' part, we take $ {x=e_{i}}$ the standard basis in $ {c_{0}(\mathbb{N})}$ and verify the statement. For the "if'' part, note by definition that each $ {x\in c_{0}(\mathbb{N})}$ satisfies for any $ {\epsilon>0}$, there is $ {m>0}$ such that
$ \displaystyle \sup_{n>m}|x^{(n)}|<\epsilon. $
The pointwise convergence of $ {\lambda_{n}\in\ell^{1}(\mathbb{N})}$ thus implies for all $ {x\in V}$
$ \displaystyle \begin{array}{rcl} (\lambda_{n}-\lambda)(x) & = & \sum_{i=1}^{\infty}(\lambda_{n}^{(i)}-\lambda^{(i)})x^{(i)}\\ & \leq & \epsilon\left(\sup_{0<i<m}|x^{(i)}|+\|\lambda_{n}\|+\|\lambda\|\right) \end{array} $
for $ {n,m}$ large enough. This establishes the claimed equivalence.

We can regard the situation (1) as a weak-starly convergent sequence in $ {\left(V^{*}\right)^{*}}$ via the isometric embedding $ {V\hookrightarrow\left(V^{*}\right)^{*}}$.

As is perhaps anticipated, the space $ {\ell^{1}(\mathbb{N})}$ (or $ {\ell^{\infty}(\mathbb{N})}$) at the level of sets can be embedded into the product space $ {\mathbb{R}^{\mathbb{N}}=\prod_{i=1}^{\infty}\mathbb{R}^{(i)}}$. Furhtermore, if we consider the closed unit ball $ {B^{*}}$ (or more generally any closed bounded subset) in$ {\ell^{1}(\mathbb{N})}$ (or $ {\ell^{\infty}(\mathbb{N})}$) to be embedded into the product space $ {[-1,1]^{\mathbb{N}}}$ (note that $ {\|a\|_{1}\leq1}$ implies $ {|a^{(i)}|\leq1}$ for all $ {i\in\mathbb{N}}$; similar statement holds for $ {\|\cdot\|_{\infty}}$), the argument given in the above example in effect identified the weak{*} topology with the product topology on $ {[-1,1]^{\mathbb{N}}}$ restricted to $ {B^{*}}$. Moreover, $ {B^{*}}$ is closed in $ {[-1,1]^{\mathbb{N}}}$ by Lemma 10. Now the Tychonoff theorem implies that $ {[-1,1]^{\mathbb{N}}}$ is compact with product topology, so $ {B^{*}}$ is compact in the weak* topolgy. The sequential compactness $ {B}$ follows similarly.
The above argument can be generalized, leading to the Banach-Alaoglu theorem.

Theorem 11 (Banach-Alaoglu) Let $ {V}$ be a normed vector space. Then the closed unit ball of $ {V^{*}}$ is compact in the weak* topology.

Some additional care is needed for its sequential counterpart.

Theorem 12 (Sequential Banach-Alaoglu) Let $ {V}$ be a separable normed vector space. Then the closed unit ball of $ {V^{*}}$ is sequentially compact in the weak* topology.

Proof: Let $ {B}$, $ {B^{*}}$ be the closed unit ball in $ {V}$ and $ {V^{*}}$. Any element $ {\lambda\in B^{*}}$ maps $ {B}$ to $ {[-1,1]}$, as shown by the inequality

$ \displaystyle |\lambda(x)|\leq\|\lambda\|\|x\|\leq\|x\|\leq1 \ \ \ \ \ (1)$

for $ {x\in B}$. Since $ {V}$ is separable, there is a countable dense (in the sense of norm) subset $ {Q\subset B}$. Restrict $ {B^{*}}$ to $ {Q}$, we can identify $ {B^{*}\downharpoonright_{Q}}$ with a closed subset of $ {[-1,1]^{Q}}$, which by the sequential Tychonoff theorem is sequentially compact in the product topology. Therefore, any sequence in $ {B^{*}}$ contains a subsequence converging pointwisely on $ {Q}$. But by the estimate (1) (which in effect says $ {B^{*}}$ is uniformly equicontinuous on $ {B}$), we conclude that the subsequence converges pointwisely on $ {B}$. The sequential compactness of $ {B^{*}}$ thus follows. $ \Box$

Remark 2 One can also prove the theorem by observing that the weak* topology on the closed unit ball $ {B^{*}}$is metrisable: let $ {\{x_{i}\}_{i=1}^{\infty}}$ be an enumeration of the countable dense subset in $ {B}$, define

$ \displaystyle d(\lambda_{1},\lambda_{2})=\sum_{n=1}^{\infty}\frac{1}{2^{n}}\left|\lambda_{1}(x_{n})-\lambda_{2}(x_{n})\right| $
which is a metric induces the weak* topology on $ {B^{*}}$. And then invoke the Heine-Borel theorem 1 and the Banach-Alaoglu theorem 11.

Remark 3 It is essential to have the separability assumption. A counterexample: the closed unit ball in $ {\left(\ell^{\infty}(\mathbb{N})\right)^{*}}$ is not sequentially compact.

If the space $ {V}$ is reflexive, i.e. $ {V\equiv\left(V^{*}\right)^{*}}$, then by identifying the weak* topology on $ {\left(V^{*}\right)^{*}}$ and the weak topology on $ {V}$, and combining with Proposition (8), we obtain:

Corollary 13 (Banach-Eberlein-Smulian) If $ {V}$ is a reflexive Banach space, then any bounded sequence in contains a weakly convergent subseqeunce.

Remark 4 In fact, the converse to the above corollary is also true. See here for more discussion.

Remark 5 One should compare the Banach-Eberlein-Smulian theorem to the Heine-Borel theorem for metric spaces. The value of the former theorem lies in the fact that weak topology on an infinite dimensional normed vector space is not metrisable, so that the latter theorem doesn't apply.
It is especially curious in the case of a separable Hilbert space, in which case the weak and weak* topologies are identified. While the weak topology is not metrisable, when restricted to the closed unit ball, it becomes metrisable, as the Remark 2 shows.

Old and New

Pages

Convergence and compactness II: Weak compactness

No comments:

Post a Comment