Introduction to Empirical Processes and Semiparametric Inference Lecture 22: Preliminaries for Semiparametric Inference

Similar documents
Analysis Preliminary Exam Workshop: Hilbert Spaces

INF-SUP CONDITION FOR OPERATOR EQUATIONS

Introduction to Empirical Processes and Semiparametric Inference Lecture 08: Stochastic Convergence

Notes on Integrable Functions and the Riesz Representation Theorem Math 8445, Winter 06, Professor J. Segert. f(x) = f + (x) + f (x).

4 Hilbert spaces. The proof of the Hilbert basis theorem is not mathematics, it is theology. Camille Jordan

I teach myself... Hilbert spaces

A Brief Introduction to Functional Analysis

Introduction to Empirical Processes and Semiparametric Inference Lecture 13: Entropy Calculations

CHAPTER II HILBERT SPACES

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation

1 The Projection Theorem

Hilbert spaces. 1. Cauchy-Schwarz-Bunyakowsky inequality

Introduction to Empirical Processes and Semiparametric Inference Lecture 12: Glivenko-Cantelli and Donsker Results

Spectral theory for compact operators on Banach spaces

4 Linear operators and linear functionals

PROBLEMS. (b) (Polarization Identity) Show that in any inner product space

Spectral Theory, with an Introduction to Operator Means. William L. Green

MA677 Assignment #3 Morgan Schreffler Due 09/19/12 Exercise 1 Using Hölder s inequality, prove Minkowski s inequality for f, g L p (R d ), p 1:

ADJOINTS, ABSOLUTE VALUES AND POLAR DECOMPOSITIONS

FUNCTIONAL ANALYSIS LECTURE NOTES: ADJOINTS IN HILBERT SPACES

LECTURE 7. k=1 (, v k)u k. Moreover r

2. Dual space is essential for the concept of gradient which, in turn, leads to the variational analysis of Lagrange multipliers.

COMMON COMPLEMENTS OF TWO SUBSPACES OF A HILBERT SPACE

SPECTRAL THEOREM FOR COMPACT SELF-ADJOINT OPERATORS

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

ABSTRACT CONDITIONAL EXPECTATION IN L 2

The weak topology of locally convex spaces and the weak-* topology of their duals

LECTURE OCTOBER, 2016

Chapter 1. Preliminaries. The purpose of this chapter is to provide some basic background information. Linear Space. Hilbert Space.

FUNCTIONAL ANALYSIS LECTURE NOTES: COMPACT SETS AND FINITE-DIMENSIONAL SPACES. 1. Compact Sets

Functional Analysis HW #5

Hilbert space methods for quantum mechanics. S. Richard

Convex Analysis and Economic Theory Winter 2018

Functional Analysis. Franck Sueur Metric spaces Definitions Completeness Compactness Separability...

Hilbert Spaces. Hilbert space is a vector space with some extra structure. We start with formal (axiomatic) definition of a vector space.

Prof. M. Saha Professor of Mathematics The University of Burdwan West Bengal, India

Projection Theorem 1

Functional Analysis F3/F4/NVP (2005) Homework assignment 3

Chapter 1. Introduction

October 25, 2013 INNER PRODUCT SPACES

CHAPTER VIII HILBERT SPACES

Math 209B Homework 2

Introduction to Empirical Processes and Semiparametric Inference Lecture 09: Stochastic Convergence, Continued

Functional Analysis, Math 7321 Lecture Notes from April 04, 2017 taken by Chandi Bhandari

Vector Spaces. Vector space, ν, over the field of complex numbers, C, is a set of elements a, b,..., satisfying the following axioms.

Introduction to Empirical Processes and Semiparametric Inference Lecture 25: Semiparametric Models

Problem Set 6: Solutions Math 201A: Fall a n x n,

Operators with Compatible Ranges

Real Analysis, 2nd Edition, G.B.Folland Elements of Functional Analysis

SPECTRAL PROPERTIES OF THE LAPLACIAN ON BOUNDED DOMAINS

1 Compact and Precompact Subsets of H

In English, this means that if we travel on a straight line between any two points in C, then we never leave C.

Fourier and Wavelet Signal Processing

Kernel Method: Data Analysis with Positive Definite Kernels

Trace Class Operators and Lidskii s Theorem

Elements of Positive Definite Kernel and Reproducing Kernel Hilbert Space

Part III. 10 Topological Space Basics. Topological Spaces

1 Math 241A-B Homework Problem List for F2015 and W2016

David Hilbert was old and partly deaf in the nineteen thirties. Yet being a diligent

Normed Vector Spaces and Double Duals

The Gram matrix in inner product modules over C -algebras

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

285K Homework #1. Sangchul Lee. April 28, 2017

1 Functional Analysis

Course 212: Academic Year Section 1: Metric Spaces

1. Subspaces A subset M of Hilbert space H is a subspace of it is closed under the operation of forming linear combinations;i.e.,

A SYNOPSIS OF HILBERT SPACE THEORY

Functional Analysis F3/F4/NVP (2005) Homework assignment 2

A Spectral Characterization of Closed Range Operators 1

Rudiments of Hilbert Space Theory

Chapter 7: Bounded Operators in Hilbert Spaces

Errata Applied Analysis

Measure, Integration & Real Analysis

Homework Assignment #5 Due Wednesday, March 3rd.

von Neumann algebras, II 1 factors, and their subfactors V.S. Sunder (IMSc, Chennai)

1 Orthonormal sets in Hilbert space

Introduction to Real Analysis Alternative Chapter 1

FUNCTIONAL ANALYSIS LECTURE NOTES: WEAK AND WEAK* CONVERGENCE

Boundary Behavior of Excess Demand Functions without the Strong Monotonicity Assumption

Theorem (4.11). Let M be a closed subspace of a Hilbert space H. For any x, y E, the parallelogram law applied to x/2 and y/2 gives.

An introduction to some aspects of functional analysis

CONTENTS. 4 Hausdorff Measure Introduction The Cantor Set Rectifiable Curves Cantor Set-Like Objects...

Exercise Solutions to Functional Analysis

Typical Problem: Compute.

Your first day at work MATH 806 (Fall 2015)

Functional Analysis. AMAT 617 University of Calgary Winter semester Lecturer: Gilad Gour Notes taken by Mark Girard.

V. SUBSPACES AND ORTHOGONAL PROJECTION

Orthogonal Projection and Least Squares Prof. Philip Pennance 1 -Version: December 12, 2016

Math Linear Algebra II. 1. Inner Products and Norms

Elements of Convex Optimization Theory

Optimization Theory. Linear Operators and Adjoints

Orthonormal Bases Fall Consider an inner product space V with inner product f, g and norm

Hilbert Spaces. Contents

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

1 Inner Product Space

Lecture notes: Applied linear algebra Part 1. Version 2

DS-GA 1002 Lecture notes 0 Fall Linear Algebra. These notes provide a review of basic concepts in linear algebra.

In Chapter 14 there have been introduced the important concepts such as. 3) Compactness, convergence of a sequence of elements and Cauchy sequences,

Functional Analysis I

Fredholm Theory. April 25, 2018

Transcription:

Introduction to Empirical Processes and Semiparametric Inference Lecture 22: Preliminaries for Semiparametric Inference Michael R. Kosorok, Ph.D. Professor and Chair of Biostatistics Professor of Statistics and Operations Research University of North Carolina-Chapel Hill 1

Projections Geometrically, the projection of an object T onto a space S is the element Ŝ S that is closest to T, provided such an element exists. In the semiparametric inference context, the object is usually a random variable and the spaces of interest for projection are usually sets of square-integrable random variables. 2

The following theorem gives a simple method for identifying the projection in this setting: Theorem 17.1 Let S be a linear space of real random variables with finite second moments. Then Ŝ is the projection of T onto S if and only if (i) Ŝ S and (ii) E(T Ŝ)S = 0 for all S S. 3

If S 1 and S 2 are both projections, then S 1 = S 2 almost surely. If S contains the constants, then ET = EŜ and cov(t Ŝ, S) = 0 for all S S. 4

Proof. First assume (i) and (ii) hold. Then, for any S S, we have E(T S) 2 = E(T Ŝ)2 + 2E(T Ŝ)(Ŝ S) + E(Ŝ S)2. (1) But Conditions (i) and (ii) force the middle term to be zero, and thus E(T S) 2 E(T Ŝ)2 with strict inequality whenever E(Ŝ S)2 > 0. Thus Ŝ is the almost surely unique projection of T onto S. 5

Conversely, assume that and any S S, Ŝ is a projection and note that for any α R E(T Ŝ αs)2 E(T Ŝ)2 = 2αE(T Ŝ)S + α2 ES 2. Since Ŝ is a projection, the left side is strictly nonnegative for every α. But the parabola α α 2 ES 2 2αE(T Ŝ)S is nonnegative for all α and S only if E(T Ŝ)S = 0 for all S. 6

Thus (ii) holds, and (i) is part of the definition of a projection and so holds automatically. The uniqueness follows from application of (1) to both S 1 and S 2, forcing E(S 1 S 2 ) 2 = 0. If the constants are in S, then Condition (ii) implies that E(T Ŝ)c = 0 for c = 1, and the theorem follows. 7

Note that the theorem does not imply that a projection always exists. In fact, if the set S is open in the L 2 (P ) norm, then the infimum of E(T S) 2 over S S is not achieved. A sufficient condition for existence, then, is that S be closed in the L 2 (P ) norm, but often existence can be established directly: we will discuss this more later. 8

A very useful example of a projection is a conditional expectation. For X and Y real random variables on a probability space, g 0 (y) E(X Y = y) is the conditional expectation of X given Y = y. If we let G be the space of all measurable functions g of Y such that Eg 2 (Y ) <, then it is easy to verify that E(X g 0 (Y ))g(y ) = 0, for every g G. 9

Thus, provided Eg0 2 (Y ) <, E(X Y = y) is the projection of X onto the linear space G. By Theorem 17.1, the conditional expectation is almost surely unique. We will utilize conditional expectations frequently for calculating projections needed in semiparametric inference settings. 10

Hilbert Spaces A Hilbert space is essentially an abstract generalization of a finite-dimensional Euclidean space. This abstraction is a special case of a Banach space and, like a Banach space, is often infinite-dimensional. To be precise, a Hilbert space is a Banach space with an inner product. 11

An inner product on a Banach space D with norm is a function, : D D R such that, for all α, β R and x, y, z D, the following hold: (i) x, x = x 2, (ii) x, y = y, x, and (iii) αx + βy, z = α x, z + β y, z. A semi-inner product arises when is a semi-norm. 12

It is also possible to generate a norm beginning with an inner product. Let D be a linear space with semi-inner product, (not necessarily with a norm), i.e.,, satisfies Conditions (ii) (iii) above and also satisfies x, x 0. Then x x, x 1/2, for all x D, defines a semi-norm. 13

This is verified in the following theorem: Theorem 17.2 Let, be a semi-inner product on D, with x x, x 1/2 for all x D. Then, for all α R and all x, y D, (a) x, y x y, (b) x + y x + y, and (c) αx = α x. Moreover, if, is also an inner product, then x = 0 if and only if x = 0. 14

Part (a) in Theorem 17.2 is also known as the Cauchy-Schwartz inequality. Two elements x, y in a Hilbert space H with inner product, are orthogonal if x, y = 0, denoted x y. 15

For any set C H and any x H, x is orthogonal to C if x y for every y C, denoted x C. The two subsets C 1, C 2 H are orthogonal if x y for every x C 1 and y C 2, denoted C 1 C 2. For any C 1 H, the orthocomplement of C 1, denoted C 1 {x H : x C 1 }., is the set 16

Let the subspace H H be linear and closed. By Theorem 17.1, we have for any x H that there exists an element y H that satisfies x y x z for any z H, and such that x y, z = 0 for all z H. Let Π denote an operator that performs this projection, i.e., let Πx y, where y is the projection of x onto H. 17

The projection operator Π : H H has several important properties: Theorem 17.3 Let H be a closed, linear subspace of H, and let Π : H H be the projection operator onto H. Then (i) Π is continuous and linear, (ii) Πx x for all x H, (iii) Π 2 Π Π = Π, and (iv) N(Π) = H and R(Π) = H. 18

Proof. Let x, y H and α, β R. If z H, then [αx + βy] [απx + βπy], z = α x Πx, z + β y Πy, z = 0. Now by the uniqueness of projections (Theorem 17.1), we now have that Π(αx + βy) = απx + βπy. This yields linearity of Π. 19

If we can establish (ii), (i) will follow. Since x Πx, Πx = 0, for any x H, we have x 2 = x Πx 2 + Πx 2 Πx 2, and (ii) (and hence also (i)) follows. For any y H, Πy = y. Thus for any x H, Π(Πx) = Πx, and (iii) follows. 20

Now let x N(Π). Then x = x Πx H. Conversely, for any x H, Πx = 0 by definition, and thus x N(Π). Hence N(Π) = H. 21

Now it is trivial to verify that R(Π) H by the definitions. Moreover, for any x H, Πx = x, and thus H R(Π). This implies (iv). 22

One other point we note is that for any projection Π onto a closed linear subspace H H, I Π, where I is the identity, is also a projection onto the closed linear subspace H. A key example of a Hilbert space H is H = L 2 (P ) with inner product f, g = fgdp. 23

A closed, linear subspace of interest to us is L 0 2 (P ) L 2(P ) which consists of all mean zero functions in L 2 (P ). The projection operator Π : L 2 (P ) L 0 2 (P ) is Πx = x P x. 24

To see this, note that Πx L 0 2 (P ) and x Πx, y = P x, y = P x P y = 0 for all y L 0 2 (P ). Thus by Theorem 17.1, Πx is the unique projection onto L 0 2 (P ). It is also not hard to verify that I Π is the projection onto the constants (see Exercise 17.4.2). 25

We close this section with a brief discussion of linear functionals on Hilbert spaces. Recall from Chapter 6 the definition of a linear operator and the fact that the norm for a linear operator T : D E is T sup T x. x D: x 1 In the special case where E = R, a linear operator is called a linear functional. 26

A linear functional T, like a linear operator, is bounded when T <. By Proposition 6.15, boundedness is equivalent to continuity in this setting. We now present a very important result for bounded linear functionals in Hilbert spaces. 27

Theorem 17.5 (Riesz representation theorem) If L : H R is a bounded linear functional on a Hilbert space, then there exists a unique element h 0 H such that L(h) = h, h 0 for all h H, and, moreover, L = h 0. 28

More on Banach Spaces As with Hilbert spaces, a linear functional on a Banach space is just a linear operator with real range. The dual space B of a Banach space B is the set of all continuous, linear functionals on B. 29

By applying Proposition 6.15, it is clear that every b B satisfies b b b b for every b B, where b sup b b <. b B: b 1 30

For the special case of a Hilbert space H, H can identified with H by the Reisz representation theorem. This implies that there exists an isometry (a one-to-one map that preserves norms) between H and H. 31

To see this, choose h H and let h H be the unique element that satisfies h, h = h h for all h H. Then h = by the Cauchy-Schwartz inequality. sup h, h h h H: h 1 The desired conclusion follows since h was arbitrary. 32

We now return to the generality of Banach spaces. For each continuous, linear operator between Banach spaces A : B 1 B 2, there exists an adjoint map (or just adjoint) A : B 2 B 1 defined by for all b 1 B 1 and b 2 B 2. (A b 2)b 1 = b 2Ab 1 It is straightforward to verify that the resulting A is linear. 33

The following proposition tells us that A is also continuous (by being bounded): Proposition 17.6 Let A : B 1 B 2 be a bounded linear operator between Banach spaces. Then A = A. 34

Proof. Since also, for any b 2 B 2, A b 2 = sup A b 2b 1 b 1 B 1 : b 1 1 = sup b 1 B 1 : b 1 1 we have A A. b 2 A, { b 2 ( ) Ab1 Ab 1 } Ab 1 Thus A is a continuous, linear operator. 35

Now let A : B 1 B 2 be the adjoint of A with respect to the double duals (duals of the duals) of B 1 and B 2. Note that for j = 1, 2, B j B j, since for any b j B j, the map b j : B j R defined by b j b j b j, is a bounded linear functional. 36

By the definitions involved, we now have for any b 1 B 1 and b 2 B 2 that (A b 1 )b 2 = (A b 2)b 1 = b 2Ab 1, and thus A A and the restriction of A to B 1, denoted hereafter A 1, equals A. Hence A = A 1 A, and the desired result follows. 37

We can readily see that the adjoint of an operator A : H 1 H 2 between two Hilbert spaces, with respective inner products, 1 and, 2, is a map A : H 2 H 1 satisfying Ah 1, h 2 2 = h 1, A h 2 1 for every h 1 H 1 and h 2 H 2. Here we are using the isometry for Hilbert spaces described above. 38

Now consider the adjoint of a restriction of a continuous linear operator A : H 1 H 2, A 0 : H 0,1 H 1 H 2, where H 0,1 is a closed, linear subspace of H 1. If Π : H 1 H 0,1 is the projection onto the subspace, it is not hard to verify that the adjoint of A 0 is A 0 Π A (see Exercise 17.4.3). 39

Recall from Chapter 6 the notation B(D, E) denoting the collection of all linear operators between normed spaces D and E. From Lemma 6.16, we know that for a given T B(B 1, B 2 ), for Banach spaces B 1 and B 2, R(T ) is not closed unless T is continuously invertible. 40

We now give an illustrative counter-example. Let B 1 = B 2 = L 2 (0, 1), and define T : L 2 (0, 1) L 2 (0, 1) by T x(u) ux(u). Then T 1, and thus T B(L 2 (0, 1), L 2 (0, 1)). 41

However, it is clear that R(T ) = { y L 2 (0, 1) : 1 0 } u 2 y 2 (u)du <. Although R(T ) is dense in L 2 (0, 1) (see Exercise 17.4.3), the functions y 1 (u) 1 and y 2 (u) u are clearly not in R(T ). Thus R(T ) is not closed. 42

This lack of closure of R(T ) arises from the simple fact that the inverse operator T 1 y(u) = u 1 y(u) is not bounded over y L 2 (0, 1) (consider y = 1, for example). On the other hand, it is easy to verify that for any normed spaces D and E and any T B(D, E), N(T ) is always closed as a direct consequence of the continuity of T. 43

Observe also that for any T B(B 1, B 2 ), N(T ) = {b 2 B 2 : (T b 2)b 1 = 0 for all b 1 B 1 } (2) = {b 2 B 2 : b 2(T b 1 ) = 0 for all b 1 B 1 } = R(T ), where R(T ) is an abuse of notation denoteing the linear functionals in B 2 that yield zero on R(T ). For Hilbert spaces, the notation is valid because of the isometry between a Hilbert space H and its dual H. 44

The identity (2) has an interesting extension: Theorem 17.7 For two Banach spaces B 1 and B 2 and for any T B(B 1, B 2 ), R(T ) = B 2 if and only if N(T ) = {0} and R(T ) is closed. If we specialize (2) to Hilbert spaces H 1 and H 2, we obtain trivially for any A B(H 1, H 2 ) that R(A) = N(A ). 45

The following result for Hilbert spaces is also useful: Theorem 17.9 For two Hilbert spaces H 1 and H 2 and any A B(H 1, H 2 ), R(A) is closed if and only if R(A ) is closed if and only if R(A A) is closed. Moreover, if R(A) is closed, then R(A ) = R(A A) and A(A A) 1 A : H 2 H 2 is the projection onto R(A). 46