Calculus in Gauss Space

Similar documents
Harmonic Analysis. 1. Hermite Polynomials in Dimension One. Recall that if L 2 ([0 2π] ), then we can write as

Finite-dimensional spaces. C n is the space of n-tuples x = (x 1,..., x n ) of complex numbers. It is a Hilbert space with the inner product

Real Analysis Notes. Thomas Goller

The Dirichlet s P rinciple. In this lecture we discuss an alternative formulation of the Dirichlet problem for the Laplace equation:

be the set of complex valued 2π-periodic functions f on R such that

The Canonical Gaussian Measure on R

Normed Vector Spaces and Double Duals

An introduction to some aspects of functional analysis

Notions such as convergent sequence and Cauchy sequence make sense for any metric space. Convergent Sequences are Cauchy

Outline of Fourier Series: Math 201B

4 Hilbert spaces. The proof of the Hilbert basis theorem is not mathematics, it is theology. Camille Jordan

L p Functions. Given a measure space (X, µ) and a real number p [1, ), recall that the L p -norm of a measurable function f : X R is defined by

ON THE REGULARITY OF SAMPLE PATHS OF SUB-ELLIPTIC DIFFUSIONS ON MANIFOLDS

On m-accretive Schrödinger operators in L p -spaces on manifolds of bounded geometry

SHARP BOUNDARY TRACE INEQUALITIES. 1. Introduction

1.3.1 Definition and Basic Properties of Convolution

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

I teach myself... Hilbert spaces

l(y j ) = 0 for all y j (1)

Some SDEs with distributional drift Part I : General calculus. Flandoli, Franco; Russo, Francesco; Wolf, Jochen

Statistics 612: L p spaces, metrics on spaces of probabilites, and connections to estimation

Math 328 Course Notes

3 Integration and Expectation

Lecture 4 Lebesgue spaces and inequalities

Probability and Measure

Integration by Parts and Its Applications

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

Measurable functions are approximately nice, even if look terrible.

Eigenvalues and Eigenfunctions of the Laplacian

g(x) = P (y) Proof. This is true for n = 0. Assume by the inductive hypothesis that g (n) (0) = 0 for some n. Compute g (n) (h) g (n) (0)

Trace Class Operators and Lidskii s Theorem

Malliavin Calculus: Analysis on Gaussian spaces

Overview of normed linear spaces

1.5 Approximate Identities

1 Continuity Classes C m (Ω)

Chapter IV Integration Theory

Lipschitz p-convex and q-concave maps

3. (a) What is a simple function? What is an integrable function? How is f dµ defined? Define it first

L p Spaces and Convexity

F (x) = P [X x[. DF1 F is nondecreasing. DF2 F is right-continuous

Multivariate Differentiation 1

Some Background Material

Sobolev spaces. May 18

Basic Properties of Metric and Normed Spaces

Analysis Finite and Infinite Sets The Real Numbers The Cantor Set

It follows from the above inequalities that for c C 1

Sobolev Spaces. Chapter Hölder spaces

A linear algebra proof of the fundamental theorem of algebra

Part III. 10 Topological Space Basics. Topological Spaces

LECTURE 3 Functional spaces on manifolds

6. Duals of L p spaces

Appendix A Functional Analysis

Gaussian Processes. 1. Basic Notions

Exercises to Applied Functional Analysis

A variational weak weighted derivative: Sobolev spaces and degenerate elliptic equations

08a. Operators on Hilbert spaces. 1. Boundedness, continuity, operator norms

4 Expectation & the Lebesgue Theorems

An introduction to Mathematical Theory of Control

Lecture Notes 1: Vector spaces

Boundary measures, generalized Gauss Green formulas, and mean value property in metric measure spaces

BOUNDARY VALUE PROBLEMS ON A HALF SIERPINSKI GASKET

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

1 Inner Product Space

i. v = 0 if and only if v 0. iii. v + w v + w. (This is the Triangle Inequality.)

Math 350 Fall 2011 Notes about inner product spaces. In this notes we state and prove some important properties of inner product spaces.

THE L 2 -HODGE THEORY AND REPRESENTATION ON R n

Introduction to Infinite Dimensional Stochastic Analysis

The Schwartz Space: Tools for Quantum Mechanics and Infinite Dimensional Analysis

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

Contents. 2 Sequences and Series Approximation by Rational Numbers Sequences Basics on Sequences...

THE RADON NIKODYM PROPERTY AND LIPSCHITZ EMBEDDINGS

Vector Spaces. Vector space, ν, over the field of complex numbers, C, is a set of elements a, b,..., satisfying the following axioms.

Kernel Method: Data Analysis with Positive Definite Kernels

at time t, in dimension d. The index i varies in a countable set I. We call configuration the family, denoted generically by Φ: U (x i (t) x j (t))

LECTURE 1: SOURCES OF ERRORS MATHEMATICAL TOOLS A PRIORI ERROR ESTIMATES. Sergey Korotov,

A linear algebra proof of the fundamental theorem of algebra

The Lebesgue Integral

Scalar multiplication and addition of sequences 9

4 Uniform convergence

9 Brownian Motion: Construction

Math212a1413 The Lebesgue integral.

THEOREMS, ETC., FOR MATH 515

It follows from the above inequalities that for c C 1

Hilbert Spaces. Hilbert space is a vector space with some extra structure. We start with formal (axiomatic) definition of a vector space.

Optimization and Optimal Control in Banach Spaces

W if p = 0; ; W ) if p 1. p times

A VERY BRIEF REVIEW OF MEASURE THEORY

CONVERGENCE THEORY. G. ALLAIRE CMAP, Ecole Polytechnique. 1. Maximum principle. 2. Oscillating test function. 3. Two-scale convergence

1 Background and Review Material

Math 210B. Artin Rees and completions

We denote the space of distributions on Ω by D ( Ω) 2.

1.4 The Jacobian of a map

Here we used the multiindex notation:

ON MATRIX VALUED SQUARE INTEGRABLE POSITIVE DEFINITE FUNCTIONS

Fredholm Theory. April 25, 2018

C.7. Numerical series. Pag. 147 Proof of the converging criteria for series. Theorem 5.29 (Comparison test) Let a k and b k be positive-term series

Integration on Measure Spaces

2) Let X be a compact space. Prove that the space C(X) of continuous real-valued functions is a complete metric space.

PARTIAL REGULARITY OF BRENIER SOLUTIONS OF THE MONGE-AMPÈRE EQUATION

Heat Kernel and Analysis on Manifolds Excerpt with Exercises. Alexander Grigor yan

Transcription:

Calculus in Gauss Space 1. The Gradient Operator The -dimensional Lebesgue space is the measurable space (E (E )) where E =[0 1) or E = R endowed with the Lebesgue measure, and the calculus of functions on Lebesgue space is just real and harmonic analysis. The -dimensional Gauss space is the same measure space (R (R )) as in the previous paragraph, but now we endow that space with the Gauss measure P in place of the Lebesgue measure. Since the Gauss space (R (R ) P ) is a probability space, we can and frequently will think of any measurable function : R R as a random variable. Therefore, P{ A} = P { A} = P { R : () A} E() =E () = dp = dp Cov( )= L 2 (P) = dp etc. Note, also, that = (Z) for all random variables, where Z is the standard normal random vector Z() := for all R, as before. In particular, E() =E () =E[(Z)] Var() =Var[(Z)] Cov( ) =Cov[(Z) (Z)] and so on, notation being typically obvious from context. 19

20 2. Calculus in Gauss Space Let := / for all 1 6 6 and let := ( 1 ) denote the gradient operator, acting on continuously-differentiable functions : R R. From now on we will use the following. Definition 1.1. Let C 0 (P ) denote the collection of all infinitely-differentiable functions : R R such that and all of its mixed derivatives of order 6 grow more slowly than [γ ()] ε for every ε>0. Equivalently, C 0 (P ) if and only if lim e ε2 () = lim e ε2 ( 1 )() =0 for all 1 6 1 6 and 1 6 6. We also define C0 (P ) := C0 (P ) We will frequently use the following without mention. Lemma 1.2. If C 0 (P ), then E ( ) < and E ( 1 ) < for all 1 6 <, 1 6 1 6, and 1 6 6. I omit the proof since it is elementary. For every C0 1(P ), define 2 12 := () 2 P (d)+ ( )() 2 P (d) = E 2 + E 2 Notice that 12 is a bona fide Hilbertian norm on C0 2(P ) with Hilbertian inner product 12 := dp + ( ) ( ) dp = E[]+E[ ] We will soon see that C 2 0 (P ) is not a Hilbert space with the preceding norm and inner product because it is not complete. This observation prompts the following definition. Definition 1.3. The Gaussian Sobolev space D 12 (P ) is the completion of C 1 0 (P ) in the norm 12. In order to understand what the elements of D 12 (P ) look like consider a function D 12 (P ). By definition we can find a sequence

1. The Gradient Operator 21 ex:smoothing:1 1 2 C 1 0 (P ) such that 12 0 as. Since L 2 (P ) is complete, we can deduce also that D := lim exists in L 2 (P ) for every 1 6 6. It follows, by virtue of construction, that D = for all C 1 0 (P ). Therefore, D is an extension of the gradient operator from C 1 0 (P ) to D 12 (P ). From now on, I will almost always write D in favor of when C 1 0 (P ). This is because D can make sense even when is not in C 1 0 (P ), as we will see in the next few examples. In general, we can think of elements of D 12 (P ) as functions in L 2 (P ) that have one weak derivative in L 2 (P ). We may refer to the linear operator D as the Malliavin derivative, and the random variable D as the [generalized] gradient of. We will formalize this notation further at the end of this section. For now, let us note instead that the standard Sobolev space W 12 (R ) is obtained in exactly the same way as D 12 (P ) was, but the Lebesgue measure is used in place of P everywhere. Since γ () =dp ()/d <1, 1 it follows that the Hilbert space D 12 (P ) is richer than the Hilbert space W 12 (R ), whence the Malliavin derivative is an extension of Sobolev s [generalized] gradient. 2 It is a natural time to produce examples to show that the space D 12 (P ) is strictly larger than the space C 1 0 (P ) endowed with the norm 12. Example 1.4 ( ). Consider the case and let () := (1 ) + for all R Then we claim that D 12 (P 1 ) \ C 1 0 (P 1) and in fact we have the P 1 -a.s. identity, 3 (D)() = sign()1 [ 11] () whose intuitive meaning ought to be clear. In order to prove these assertions let ψ 1 be a symmetric probability density function on R such that ψ 1 C (R), ψ 1 a positive constant on [ 1 1], and ψ 1 0 off of [ 2 2]. For every real number >0, define ψ () := ψ 1 () and () := ( ψ )(). Then sup N () () 0 as 1 In other words, E( 2 ) < R () 2 d for all L 2 (R ) that are strictly positive on a set of positive Lebesgue measure. 2 The extension is strict. For instance, () := exp() [ R] defines a function in D 12 (P 1 )\W 12 (R). 3 It might help to recall that D is defined as an element of the Hilbert space L 2 (P 1 ) in this case. Therefore, it does not make sense to try to compute (D)() for all R. This issue arises when one constructs any random variable on any probability space, of course. Also, note that P 1 -a.s. equality is the same thing as Lebesgue-a.e. equality.

22 2. Calculus in Gauss Space ex:smoothing:2 ex:lipschitz:d12 N because is uniformly continuous. In particular, N L 2 (P ) 0 as N. To complete the proof it remains to verify that N ()+sign()1 [ 11]() 2 P (d) =0 (2.1) goal:n lim N By the dominated convergence theorem and integration by parts, 1 N () = ()ψn ( ) d = ψ N ( ) d + 0 := A N ()+B N () 0 1 ψ N ( ) d I will prove that A N 1 [0 ) as N in L 2 (P 1 ); a small adaptation of this argument will also prove that B N 1 ( 0] in L 2 (P 1 ), from which (2.1) ensues. We can apply a change of variables, together with the symmetry of ψ 1, in order to see that A N () = N(1 ) N ψ 1 () d Therefore, A N () sign()1 [0 ) () as N for all = 0. Since A N ()+sign()1 [0 ) () is bounded uniformly by 2, the dominated convergence theorem implies that A N () sign()1 [ 11] () as N in L 2 (P 1 ). This concludes our example. Example 1.5 ( > 2). Let us consider the case that > 2. In order to produce a function F D 12 (P ) \ C 1 0 (P ) we use the construction of the previous example and set F() := ( ) and Ψ N () := ψ N ( ) for all R and N > 1 Then the calculations of Example 1.4 also imply that F N := F Ψ N F as N in the norm 12 of D 12 (P ), F N C0 1(P ), and F C0 1(P ). Thus, it follows that F D 12 (P ) \ C0 1(P ). Furthermore, (D F)() = sign( )1 [ 11] ( ) ( ) for every 1 6 6 and P -almost every R. 166 = Example 1.6. The previous two examples are particular cases of a more general family of examples. Recall that a function : R R is Lipschitz continuous if there exists a finite constant K such that () () 6 K for all R. The smallest such constant K is called the Lipschitz constant of and is denoted by Lip(). Let : R R be a Lipschitz function. According to Rademacher s theorem XXX, is almost everywhere [equivalently, P -a.s.] differentiable and ( )() 6 Lip() a.s. Also note that () 6 (0) + Lip() for all R.

1. The Gradient Operator 23 In particular, E( ) < for all > 1. A density argument, similar to the one that appeared in the preceding examples, shows that D 12 (P ) and D 6 Lip() a.s. We will appeal to this fact several times in this course. The generalized gradient D follows more or less the same general set of rules as does the more usual gradient operator. And it frequently behaves as one expect it should even when it is understood as the Gaussian extension of ; see Examples (1.4) and (1.5) to wit. The following ought to reinforce this point of view. lem:chainrule Lemma 1.7 (Chain Rule). For all ψ D 12 (P 1 ) and D 12 (P ), D(ψ ) =[(Dψ) ] D() a.s. Proof. If and ψ are continuously differentiable then the chain rule of calculus ensures that [ (ψ )]() =ψ (())( )() for all R and 1 6 6. That is, D( ) = (ψ ) =(ψ )( ) =(Dψ)()D() where Dψ refers to the one-dimensional Malliavin derivative of ψ and D() := D refers to the -dimensional Malliavin derivative of. In general we appeal to a density argument. Here is a final example that is worthy of mention. ex:dm Example 1.8. Let M := max 166 Z and note that M() = max 166 = 1 Q() () for P -almost all R where Q() denotes the cone of all points R such that > max =. We can approximate the indicator function of Q() by a smooth function to see that M D 12 (P ) and D M = 1 Q() a.s. for all 1 6 6. Let J() := arg max() Clearly, J is defined uniquely for P -almost every R. computation of D M is equivalent to And our (DM)() = J() for P -almost all R where 1 denote the standard basis of R.

24 2. Calculus in Gauss Space Let us end this section by introducing a little more notation. The preceding discussion constructs, for every function D 12 (P ), the Malliavin derivative D as an R -valued function with coordinates in L 2 (P ). We will use the following natural notations exchangeably: (D)() := [(D)()] =(D )() for every D 12 (P ), R, and 1 6 6 In this way we may also think of D as a scalar-valued element of the real Hilbert space L 2 (P χ ), where Definition 1.9. χ always denotes the counting measure on {1 }. We see also that the inner product on D 12 (P ) is 12 = L 2 (P ) + D D L 2 (P χ ) = E()+E (D D) for all D 12 (P ) Definition 1.10. The random variable D L 2 (P χ ) is called the Malliavin derivative of the random variable D 12 (P ). 2. Higher-Order Derivatives One can define higher-order weak derivatives just as easily as we obtained the directional weak derivatives. Choose and fix C 2 (R ) and two integers 1 6 6. The mixed derivative of in direction () is the function ( 2 )(), where 2 := = The Hessian operator 2 is defined as 2 11 2 1 2 :=..... 2 1 2

2. Higher-Order Derivatives 25 With this in mind, we can define a Hilbertian inner product 22 via 22 := dp + ( ) ( ) P (d)+ tr ( 2 )( 2 ) dp = ()() P (d)+ ( )()( )() P (d) + ( 2 )()( 2 )() P (d) = 12 + ( 2 ) ( 2 ) dp = E()+E [ ]+E 2 2 [ C 2 0 (P )] where K M denotes the matrix or Hilbert Schmidt inner product, K M := K M = tr(k M) for all matrices K and M. We also obtain the corresponding Hilbertian norm 22 where: 2 22 = 2 L 2 (P ) + 2 L 2 (P ) + 2 2 = 2 12 + 2 = E 2 + E 2 L 2 (P χ 2 ) 2 + E 2 2 L 2 (P ) [ C 2 0 (P )]; χ 2 := χ χ denotes the counting measure on {1 } 2 ; and K := K K = K 2 = tr(k K) denotes the Hilbert Schmidt norm of any matrix K. Definition 2.1. The Gaussian Sobolev space D 22 (P ) is the completion of C 2 0 (P ) in the norm 22. For every D 22 (P ) we can find functions 1 2 C 2 0 (P ) such that 22 0 as Then D and D 2 := lim 2 exist in L 2 (P ) for every 1 6 6. Equivalently, D = lim exists in L 2 (P χ ) and D 2 = lim 2 exists in L 2 (P χ 2 ).

26 2. Calculus in Gauss Space Choose and fix an integer > 2. If =( 1 ) is a vector of integers in {1 }, then write ( )() := ( 1 )() [ C (R ) R ] Let denote the formal -tensor whose -th coordinate is. We define a Hilbertian inner product 2 [inductively] via 2 = 12 + ( ) ( ) dp for all C0 (P ), where denotes the Hilbert Schmidtt inner product for -tensors: K M := K M {1 } for all -tensors K and M. 2 := 1/2 22. The corresponding norm is defined via Definition 2.2. The Gaussian Sobolev space D 2 (P ) is the completion of C 0 (P ) in the norm 2. We also define D 2 (P ) := >1 D 2 (P ). If D 2 (P ) then we can find a sequence of functions 1 2 C 0 (P ) such that 2 0 as. It then follows that D := lim exists in L 2 (P χ ) for every 1 6 6, where χ := χ χ [ 1 times] denotes the counting measure on {1 }. The operator D is called the th Malliavin derivative. It is easy to see that the Gaussian Sobolev spaces are nested; that is, D 2 (P ) D 12 (P ) for all 2 6 6 Also, whenever C 0 (P ), the th Malliavin derivative of is just the classically-defined derivative, which is a -dimensional tensor. Because every polynomial in variables is in C 0 (P ), 4 it follows immediately that D 2 (R ) contains all -variable polynomials; and that all Malliavin derivatives acts as one might expect them to. More generally, we have the following. 4 A function : R R is a polynomial in variables if it can be written as () = 1 ( 1 ) ( ), for all =( 1 ) R, where each is a polynomial on R. The degree of the polynomial is the maximum of the degrees of 1. Thus, for example () = 1 3 2 2 5 is a polynomial of degree 3 in 5 variables.

3. The Adjoint Operator 27 def:d:k,p Definition 2.3. We define Gaussian Sobolev spaces D (P ) by completing the space C0 (P ) in the norm D (P ) := L (P ) + 1/ D L (P χ) Each D (P ) is a Banach space in the preceding norm. 3. The Adjoint Operator Recall the canonical Gaussian probability density function γ := dp /d from (1.1). Since (D γ )() = γ (), we can apply the chain rule to see that for every C0 1(P ), (D )()() P (d) = ()D [()γ ()] d R R = ()(D )() P (d)+ ()() P (d) R R for 1 6 6. Let := ( 1 ) and sum the preceding over all 1 6 6 to find the following adjoint relation, E D () = D L 2 (P ) = A L 2 (P ) = E A () (2.2) IbP where A is the formal adjoint of D; that is, (A)() := (D)() + () (2.3) A:g Eq. (2.3) is defined pointwise whenever C 1 0 (P ). But it also makes sense as an identity in L 2 (P χ ) if, for example, D 12 (P ) and () is in L 2 (P χ ). Let us pause to emphasize that (2.2) can be stated equivalently as as -vectors. 5 E[D()] = E[A()] (2.4) D:delta If D 12 (P ), then we can always find 1 2 C0 1(P ) such that 12 0 as. Note that D dp D dp 6 L 2 (P χ )D D L 2 (P χ ) (2.5) DfDf 6 L 2 (P χ ) 21 0 5 If W =(W1 W ) is a random -vector then E(W) is the -vector whose th coordinate is E(W ).

28 2. Calculus in Gauss Space as. Also, A dp A dp 6 A L 2 (P ) L 2 (P ) 6 A L 2 (P ) 12 0 (2.6) fdgfdg whenever C 1 0 (P ). We can therefore combine (2.4), (2.5), and (2.6) in order to see that (2.4) in fact holds for all D 12 (P ) and C 1 0 (P ). Finally define Dom[A] := D 12 (P ): A L 2 (P χ ) (2.7) Dom:A Since C 1 0 (P ) is dense in L 2 (P ), we may infer from (2.4) and another density argument the following. pr:adjoint Proposition 3.1. The adjoint relation (2.4) is valid for all D 12 (P ) and Dom[A]. Definition 3.2. The linear operator A is the adjoint operator, and Dom[A] is called the domain of the definition or just domain of A. The linear space Dom[A] has a number of nicely-behaved subspaces. The following records an example of such a subspace. pr:subspace Proposition 3.3. For every 2 <6, D 12 (P ) L (P ) Dom[A] Proof. We apply Hölder s inequality to see that E Z 2 [(Z)] 2 = 2 [()] 2 P (d) 6 L (P ) where = E Z 2/( 1) ( 1)/(2) = 2/( 1) P (d) ( 1)/(2) < Therefore, Z(Z) L 2 (P χ ), and we may apply (2.3) to find that A L 2 (P χ ) 6 D L 2 (P χ ) + L (P ) 6 12 + L 2 (P ) < This proves that Dom[A] Very often, people prefer to use the divergence operator δ associated to A in place of A itself. That is, if G =( 1 ) with every in the domain of A, then (δg)() := (A G)() := (A )()

3. The Adjoint Operator 29 If every is in C 1 0 (P ), then (2.3) shows that (δg)() = (D )()+ () = (div )()+ () where div denotes the usual divergence operator in Lebesgue space. We will be working directly with the adjoint in these notes, and will keep the discussion limited to A, rather than δ. Still, it is worth mentioning the scalar identity, E [G (D)] = E [δ(g)] for all functions G : R R for which δg can be defined in L 2 (P ) and all D 12 (P ). The preceding formula is aptly known as the integration by parts formula of Malliavin calculus, and is equivalent to the statement that A and D are L 2 (P )-adjoints of one another, though one has to pay attention to the domains of the definition of A, D, and δ carefully in order to make precise this equivalence.