Definition 1.1. A partially ordered set (M, ) is a set M equipped with a binary relation, called a partial ordering, satisfying

Similar documents
A NICE PROOF OF FARKAS LEMMA

Functional Analysis. AMAT 617 University of Calgary Winter semester Lecturer: Gilad Gour Notes taken by Mark Girard.

2. Dual space is essential for the concept of gradient which, in turn, leads to the variational analysis of Lagrange multipliers.

2) Let X be a compact space. Prove that the space C(X) of continuous real-valued functions is a complete metric space.

ME(EE) 550 Foundations of Engineering Systems Analysis Chapter 05: Linear Transformations and Functionals

Chapter 1. Preliminaries. The purpose of this chapter is to provide some basic background information. Linear Space. Hilbert Space.

Functional Analysis F3/F4/NVP (2005) Homework assignment 2

4.4. Orthogonality. Note. This section is awesome! It is very geometric and shows that much of the geometry of R n holds in Hilbert spaces.

Dual Spaces. René van Hassel

Chapter 1. Introduction

Chapter 1 Preliminaries

USING FUNCTIONAL ANALYSIS AND SOBOLEV SPACES TO SOLVE POISSON S EQUATION

David Hilbert was old and partly deaf in the nineteen thirties. Yet being a diligent

Some Results on b-orthogonality in 2-Normed Linear Spaces

Aliprantis, Border: Infinite-dimensional Analysis A Hitchhiker s Guide

NOTES ON VECTOR-VALUED INTEGRATION MATH 581, SPRING 2017

Math 5052 Measure Theory and Functional Analysis II Homework Assignment 7

Normed Vector Spaces and Double Duals

Linear Algebra Massoud Malek

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets

Optimality Conditions for Nonsmooth Convex Optimization

Functional Analysis HW #1

ALGEBRAIC GEOMETRY COURSE NOTES, LECTURE 2: HILBERT S NULLSTELLENSATZ.

MTH 503: Functional Analysis

5 Compact linear operators

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Functional Analysis I

Hahn-Banach extension theorem for interval-valued functions and existence of quasilinear functionals

Exercise Solutions to Functional Analysis

Math 5520 Homework 2 Solutions

Lecture Notes in Functional Analysis

FUNCTIONAL ANALYSIS HAHN-BANACH THEOREM. F (m 2 ) + α m 2 + x 0

Locally convex spaces, the hyperplane separation theorem, and the Krein-Milman theorem

CHAPTER II HILBERT SPACES

Functional Analysis (H) Midterm USTC-2015F haha Your name: Solutions

08a. Operators on Hilbert spaces. 1. Boundedness, continuity, operator norms

Real Analysis Notes. Thomas Goller

Hilbert spaces. 1. Cauchy-Schwarz-Bunyakowsky inequality

Spectral Theory, with an Introduction to Operator Means. William L. Green

THEOREMS, ETC., FOR MATH 516

Convexity in R n. The following lemma will be needed in a while. Lemma 1 Let x E, u R n. If τ I(x, u), τ 0, define. f(x + τu) f(x). τ.

Tree sets. Reinhard Diestel

CHAPTER 7. Connectedness

Report 1 The Axiom of Choice

TOPOLOGICAL VECTOR SPACES

Linear Algebra Lecture Notes-I

There are two things that are particularly nice about the first basis

Introduction to Real Analysis Alternative Chapter 1

CHAPTER VIII HILBERT SPACES

2 Metric Spaces Definitions Exotic Examples... 3

Theorem (4.11). Let M be a closed subspace of a Hilbert space H. For any x, y E, the parallelogram law applied to x/2 and y/2 gives.

B. Appendix B. Topological vector spaces

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

Separation in General Normed Vector Spaces 1

Topology. Xiaolong Han. Department of Mathematics, California State University, Northridge, CA 91330, USA address:

Linear Algebra (part 1) : Vector Spaces (by Evan Dummit, 2017, v. 1.07) 1.1 The Formal Denition of a Vector Space

SOLUTIONS TO SOME PROBLEMS

Elementary linear algebra

Functional Analysis

A SYNOPSIS OF HILBERT SPACE THEORY

2. Prime and Maximal Ideals

August 23, 2017 Let us measure everything that is measurable, and make measurable everything that is not yet so. Galileo Galilei. 1.

Elements of Convex Optimization Theory

CONTENTS. 4 Hausdorff Measure Introduction The Cantor Set Rectifiable Curves Cantor Set-Like Objects...

Chapter 1. Optimality Conditions: Unconstrained Optimization. 1.1 Differentiable Problems

Further Mathematical Methods (Linear Algebra) 2002

Vector Spaces and Linear Transformations

MATH 51H Section 4. October 16, Recall what it means for a function between metric spaces to be continuous:

Optimization and Optimal Control in Banach Spaces

The Hahn-Banach Theorem for Real Vector Spaces

An introduction to some aspects of functional analysis

Introduction to Convex Analysis Microeconomics II - Tutoring Class

CHAPTER 5. Birkhoff Orthogonality and a Revisit to the Characterization of Best Approximations. 5.1 Introduction

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

Course 311: Michaelmas Term 2005 Part III: Topics in Commutative Algebra

Zweier I-Convergent Sequence Spaces

FUNCTIONAL ANALYSIS: NOTES AND PROBLEMS

Hahn-Banach theorems. 1. Continuous Linear Functionals

Functional Analysis. Martin Brokate. 1 Normed Spaces 2. 2 Hilbert Spaces The Principle of Uniform Boundedness 32

Functional Analysis (2006) Homework assignment 2

Chapter 2 Metric Spaces

REAL AND COMPLEX ANALYSIS

MAA6617 COURSE NOTES SPRING 2014

Analysis II Lecture notes

Homework #2 Solutions Due: September 5, for all n N n 3 = n2 (n + 1) 2 4

ON THE CHEBYSHEV POLYNOMIALS. Contents. 2. A Result on Linear Functionals on P n 4 Acknowledgments 7 References 7

MATH 650. THE RADON-NIKODYM THEOREM

NAME: Mathematics 205A, Fall 2008, Final Examination. Answer Key

MATH 326: RINGS AND MODULES STEFAN GILLE

The extreme points of symmetric norms on R^2

A metric space is a set S with a given distance (or metric) function d(x, y) which satisfies the conditions

Convex Analysis and Optimization Chapter 4 Solutions

5 Banach Algebras. 5.1 Invertibility and the Spectrum. Robert Oeckl FA NOTES 5 19/05/2010 1

6.4 BASIS AND DIMENSION (Review) DEF 1 Vectors v 1, v 2,, v k in a vector space V are said to form a basis for V if. (a) v 1,, v k span V and

A NOTE ON LINEAR FUNCTIONAL NORMS

Math 564 Homework 1. Solutions.

l(y j ) = 0 for all y j (1)

2. The Concept of Convergence: Ultrafilters and Nets

Notes on Ordered Sets

Places of Number Fields and Function Fields MATH 681, Spring 2018

Transcription:

1 Zorn s Lemma Definition 1.1. A partially ordered set (M, ) is a set M equipped with a binary relation, called a partial ordering, satisfying (a) x x for every x M. (b) If x y and y x, then x = y. (c) If x y and y z, then x = z. (Reflexivity) (Antisymmetry) (Transitivity) The word partially is chosen because M may contain elements x, y such that neither x y nor y { x holds. Such pair } of elements is said to be incomparable. One example is M = {1}, {2}, {1, 2}, with set inclusion as a partial ordering. It is clear that {1} and {2} are not comparable. Analogously, x, y M are comparable if x y or y x or both. Definition 1.2. (a) A chain C is a partially ordered set where every pair of elements is comparable. (b) An upper bound (if it exists) of a subset W of a partially ordered set M is an element u M such that x u for all x W. (1) (c) A maximal element (if it exists) of a partially ordered set M is an element m M such that if m x for some x M, then x = m. (2) In other words, there is no x M such that m x but x m. If a maximal element exists for a totally ordered set/chain, then it must be unique; this follows immediately since every pair of elements is comparable in a chain. Now, it seems intuitive that a maximal element is also an upper bound. Unfortunately, this is not always the case, as illustrated in the following example. { } Example 1.3. Consider the set W =, {1}, {2}, {3}, {1, 2} with set inclusion as a partial ordering. The maximal elements are {1, 2} and {3}. On the other hand, if we view W as a subset of the power set of {1, 2, 3}, then the upper bound of W is the element {1, 2, 3}. Here is what went wrong: (1) requires an upper bound u to be comparable with all elements in W, while (2) does not. Another simple explanation is that an upper bound might not be element of W.

We are now ready to formulate Zorn s lemma and provide two applications of it. Theorem 1.4 (Zorn s lemma). Let (M, ) be a non-empty partially ordered set. If every chain C M has an upper bound, then M has at least one maximal element. Note the upper bound need not be an element of C, but it must be an element of M. Theorem 1.5 (Hamel basis). Every non-empty vector space X has a Hamel basis. Proof. Let M be the set of all linearly independent subsets of X ordered by set inclusion. M is non-empty since X {0} = there exists a nonzero x X = x M. Every chain C M has an upper bound, given by the union of all elements of C; by Zorn s lemma, M contains a maximal element B. We claim that B is a Hamel basis for X. Suppose not, then there exists an z X \ B. The set Y = B {z} form a linearly independent subset of X, containing B as a proper subset. This contradicts the maximality of B. Theorem 1.6 (Total orthonormal set). Every non-empty Hilbert space H has a total orthonormal set. Proof. Let M be the set of all orthonormal subsets of H ordered by set inclusion. M is non-empty since H {0} = there exists a nonzero x X = x x M. Every chain C M has an upper bound, given by the union of all elements of C; by Zorn s lemma, M contains a maximal element F. We claim that F is total in H. Suppose not, then there exists a nonzero z H such that z F. The set Y = F {z/ z } form an orthonormal subset of H, containing F as a proper subset. This contradicts the maximality of F. Note that completeness is required to appeal to the sufficient condition for totality. Remark 1.7. Observe that the proofs of Theorem 1.5, 1.6 shares the same steps. We first construct a partially ordered set (M, ) that possess certain properties depending on the context, such that Zorn s lemma is applicable. The maximal element, say, B, will be a candidate of something we want to show; this is shown by constructing an element in M that contradicts the maximality of B. The last step is usually the most difficult part. We will see in the next section that the proof of the Hahn-Banach theorem follows exactly this same approach. Page 2

2 Hahn-Banach Theorem. The Hahn-Banach theorem is one of the most fundamental result in linear functional analysis. A simple but powerful consequence of the theorem is there are sufficiently many bounded linear functionals in a given normed space X. Essentially, one construct functionals with certain properties on a lower-dimensional subspace of X, then extend it to the entire space X. By now, you should have seen several applications of the Hahn- Banach theorem. Definition 2.1. Given a vector space X, a sublinear functional is a real-valued functional p: X R satisfying (a) p(x + y) p(x) + p(y) for all x, y X. (b) p(αx) = αp(x) for all α R 0 and x X. (Subadditive). (Positive-homogeneous). Theorem 2.2 (Hahn-Banach Theorem: Extension of linear functionals). Let X be a real vector space and p a sublinear functional on X. Let f be a linear functional defined on a subspace Y X, satisfying f(y) p(y) for all y Y. Then f has a linear extension f from Y to X satisfying (a) f is a linear functional on X, (b) f Y = f, i.e. the restriction of f to Y agrees with f, (c) f(x) p(x) for all x X. Proof. The proof is different from the textbook, in the sense that in step (A) we define the partially ordered set M as an ordered pair consists of a subspace of X and a linear extension, whereas in step (C) we show how to choose δ by a backward argument, which is more intuitive instead of starting on some random equations and claim the choice of δ will get the job done. Following Remark 1.7, we split the proof into three parts: (A) Let M be the partially ordered set of pairs (Z, f Z ), where i. Z is a subspace of X containing Y, ii. f Z : Z R is a linear functional extending f, satisfying f Z (z) p(z) for all z Z. with the partial ordering (Z 1, f Z1 ) (Z 2, f Z2 ) if Z 1 Z 2 and (f Z2 ) Z1 = f Z1. It is { clear that } M is non-empty since (Y, f) M. Choose an arbitrary chain C = (Z α, f Zα ) in M, Λ some indexing set. We will show that C has an upper α Λ bound in M. Let W = α Λ Z α and construct a functional f W : W R defined as follows: If w W, then w Z α for some α Λ and we set f W (w) = f Zα (w) for that particular α. Page 3

This definition is well-defined. Indeed, suppose w Z α and w Z β. If Z α Z β, say, then f Zβ Zα = f Zα, since we are in a chain. W clearly contains Y, and we show that W is a subspace of X and f W is a linear functional on W. Choose any w 1, w 2 W, then w 1 Z α1, w 2 Z α2 for some α 1, α 2 Λ. If Z α1 Z α2, say, then for any scalars β, γ R we have w 1, w 2 Z α2 = βw 1 + γw 2 Z α2 W. Also, with f W (u) = f Zα1 (u) and f W (v) = f Zα2 (v), f W (βu + γv) = f Zα2 (βu + γv) = βf Zα2 (u) + γf Zα2 (v) = βf Zα1 (u) + γf Zα2 (v) = βf W (u) + γf W (v). linearity of f Zα2 since we are in a chain The case Z α2 Z α1 follows from a symmetric argument. Choose any w W, then w Z α for some α Λ and f W (w) = f Zα (w) p(w) since (w, Z α ) M. Hence, (W, f W ) is an element of M and an upper bound of C since (Z α, f Zα ) (W, f W ) for all α Λ. Since C was an arbitrary chain in M, by Zorn s lemma, M has a maximal element (Z, f Z ) M, and f Z is (by definition) a linear extension of f satisfying f Z (z) p(z) for all z Z. (B) The proof is complete if we can show that Z = X. Suppose not, then there exists an θ X \ Z; note θ 0 since Z is a subspace of X. Consider the subspace Z θ = span{z, {θ}}. Any x Z θ has a unique representation x = z + αθ, z Z, α R. Indeed, if x = z 1 + α 1 θ = z 2 + α 2 θ, z 1, z 2 Z, α 1, α 2 R, then z 1 z 2 = (α 2 α 1 )θ Z since Z is a subspace of X. Since θ / Z, we must have α 2 α 1 = 0 and z 1 z 2 = 0. Next, we construct a functional f Zθ : Z θ R defined by f Zθ (x) = f Zθ (z + αθ) = f Z (z) + αδ, (3) where δ is any real number. It can be shown that f Zθ is linear and f Zθ is a proper linear extension of f Z ; indeed, we have, for α = 0, f Zθ (x) = f Zθ (z) = f Z (x). Consequently, if we can show that f Zθ (x) p(x) for all x Z θ, (4) then (Z θ, f Zθ ) M satisfying (Z, f Z ) (Z θ, f Zθ ), thus contradicting the maximality of (Z, f Z ). Page 4

(C) From (3), observe that (4) is trivial if α = 0, so suppose α 0. We do have a single degree of freedom, which is the parameter δ in (3), thus the problem reduces to showing the existence of a suitable δ R such that (4) holds. Consider any x = z + αθ Z θ, z Z, α R. Assuming α > 0, (4) is equivalent to f Z (z) + αδ p(z + αθ) = αp(z/α + θ) f Z (z/α) + δ p(z/α + θ) δ p(z/α + θ) f Z (z/α). Since the above must holds for all z Z, α R, we need to choose δ such that δ inf p(z 1 + θ) f Z (z 1 ) = m 1. (5) z 1 Z Assuming α < 0, (4) is equivalent to f Z (z) + αδ p(z + αθ) = αp( z/α θ) f Z (z/α) δ p( z/α θ) δ p( z/α θ) f Z (z/α). Since the above must holds for all z Z, α R, we need to choose δ such that δ sup z 2 Z p( z 2 θ) f Z (z 2 ) = m 0. (6) We are left with showing condition (5), (6) are compatible, i.e. p( z 2 θ) f Z (z 2 ) p(z 1 + θ) f Z (z 1 ) for all z 1, z 2 Z. The inequality above is trivial if z 1 = z 2, so suppose not. We have that p(z 1 + θ) f Z (z 1 ) + p( z 2 θ) + f Z (z 2 ) = p(z 1 + θ) + p( z 2 θ) + f Z (z 2 z 1 ) f Z (z 2 z 1 ) + p(z 1 + θ z 2 θ) = f Z (z 2 z 1 ) + p(z 1 z 2 ) = f Z (z 1 z 2 ) + p(z 1 z 2 ) 0. where linearity of f Z and subadditivity of p are used. Hence, the required condition on δ is m 0 δ m 1. Page 5

3 Complex Hahn-Banach Theorem. We begin by proving a lemma that gives us the relation between real and imaginary parts of a complex linear functional. Lemma 3.1. Let X be a complex vector space and f : X C a linear functional. There exists a real linear functional f 1 : X R such that f(x) = f 1 (x) if(ix) for all x X. Proof. Write f(x) = f 1 (x) + if 2 (x), where f 1, f 2 are real-valued functionals. f 1, f 2 are real-linear since for any x, y X and scalars α, β R we have f 1 (αx + βy) + if 2 (αx + βy) = f(αx + βy) Moreover, we also have = αf(x) + βf(y) = αf 1 (x) + βf 1 (y) f 1 (ix) + if 2 (ix) = f(ix) = if(x) = if 1 (x) f 2 (x) = f 1 (x) = f 2 (ix) and f 2 (x) = f 1 (ix). = f(x) = f 1 (x) if 1 (ix). + i αf 2 (x) + βf 2 (y). Theorem 3.2 (Complex Hahn-Banach Theorem). Let X be a real or complex vector space and p a real-valued functional on X satisfying (a) p(x + y) p(x) + p(y) for all x, y X, (b) p(αx) = α p(x) for all α C. Let f be a linear functional defined on a subspace Y X, satisfying f(y) p(y) for all y Y. Then f has a linear extension f from Y to X satisfying (a) f is a linear functional on X, (b) f Y = f, i.e. the restriction of f to Y agrees with f, (c) f(x) p(x) for all x X Proof. Observe that the result follows from Theorem 2.2 if X is real, so suppose X is complex; this means Y is complex. Lemma 3.1 states that f(x) = f 1 (x) if(ix), with f 1 real-linear functional on Y. By the real Hahn-Banach Theorem 2.2, there exists a linear extension f 1 from Y to X. Setting f(x) = f 1 (x) i f 1 (ix), one can show that f is indeed a complex linear extension of f from Y to X. We are left to show that f satisfies (c). Page 6

Observe that the inequality is trivial if f(x) = 0, since p(x) 0. Indeed, 0 = p(0) = p(x x) p(x) + p( x) = p(x) + 1 p(x) = 2p(x). Suppose f(x) 0. Writing f(x) = f(x) e iθ, we have f(x) = e iθ f(x) = f(e iθ x) = f 1 (e iθ x) p(e iθ x) = e iθ p(x) = p(x). Remark 3.3. One could make a crude estimate on f(x) without appealing to polar form, but it fails to deliver what we want since f(x) 2 = f 1 (x) 2 + f 1 (ix) 2 p(x) 2 + p(ix) 2 = 2p(x) 2. Page 7