Chapter 1: Probability Theory Lecture 1: Measure space, measurable function, and integration

Similar documents
Chapter 1: Probability Theory Lecture 1: Measure space and measurable function

ST5215: Advanced Statistical Theory

RS Chapter 1 Random Variables 6/5/2017. Chapter 1. Probability Theory: Introduction

Lecture 4: Conditional expectation and independence

Spring 2014 Advanced Probability Overview. Lecture Notes Set 1: Course Overview, σ-fields, and Measures

36-752: Lecture 1. We will use measures to say how large sets are. First, we have to decide which sets we will measure.

1 Measurable Functions

Measure and integration

Notes on Measure, Probability and Stochastic Processes. João Lopes Dias

Probability and Random Processes

Lecture 6 Feb 5, The Lebesgue integral continued

1.1. MEASURES AND INTEGRALS

Annalee Gomm Math 714: Assignment #2

Analysis of Probabilistic Systems

Lecture 3: Random variables, distributions, and transformations

Chapter 2: Fundamentals of Statistics Lecture 15: Models and statistics

3 Measurable Functions

Measures. 1 Introduction. These preliminary lecture notes are partly based on textbooks by Athreya and Lahiri, Capinski and Kopp, and Folland.

Final. due May 8, 2012

STAT 7032 Probability Spring Wlodek Bryc

Section Signed Measures: The Hahn and Jordan Decompositions

18.175: Lecture 2 Extension theorems, random variables, distributions

I. ANALYSIS; PROBABILITY

Signed Measures and Complex Measures

Chapter 4. Measure Theory. 1. Measure Spaces

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

An Introduction to Stochastic Calculus

MA359 Measure Theory

Probability: Handout

Chapter 6. Hypothesis Tests Lecture 20: UMP tests and Neyman-Pearson lemma

Random experiments may consist of stages that are performed. Example: Roll a die two times. Consider the events E 1 = 1 or 2 on first roll

ST5215: Advanced Statistical Theory

Measure-theoretic probability

18.175: Lecture 3 Integration

II - REAL ANALYSIS. This property gives us a way to extend the notion of content to finite unions of rectangles: we define

Measures. Chapter Some prerequisites. 1.2 Introduction

REAL ANALYSIS I Spring 2016 Product Measures

The Caratheodory Construction of Measures

FUNDAMENTALS OF REAL ANALYSIS by. IV.1. Differentiation of Monotonic Functions

Integration on Measure Spaces

MATHS 730 FC Lecture Notes March 5, Introduction

Measure Theoretic Probability. P.J.C. Spreij

Real Analysis Chapter 3 Solutions Jonathan Conder. ν(f n ) = lim

The Lebesgue Integral

Chapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries

2 Measure Theory. 2.1 Measures

Lebesgue measure on R is just one of many important measures in mathematics. In these notes we introduce the general framework for measures.

Problem Set. Problem Set #1. Math 5322, Fall March 4, 2002 ANSWERS

Lecture 11: Random Variables

Economics 574 Appendix to 13 Ways

Product measures, Tonelli s and Fubini s theorems For use in MAT4410, autumn 2017 Nadia S. Larsen. 17 November 2017.

Measure Theoretic Probability. P.J.C. Spreij

(U) =, if 0 U, 1 U, (U) = X, if 0 U, and 1 U. (U) = E, if 0 U, but 1 U. (U) = X \ E if 0 U, but 1 U. n=1 A n, then A M.

Lectures 22-23: Conditional Expectations

Section 2: Classes of Sets

Dynkin (λ-) and π-systems; monotone classes of sets, and of functions with some examples of application (mainly of a probabilistic flavor)

MATH MEASURE THEORY AND FOURIER ANALYSIS. Contents

Construction of a general measure structure

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable

Measure and Integration: Concepts, Examples and Exercises. INDER K. RANA Indian Institute of Technology Bombay India

Lebesgue-Radon-Nikodym Theorem

6.2 Fubini s Theorem. (µ ν)(c) = f C (x) dµ(x). (6.2) Proof. Note that (X Y, A B, µ ν) must be σ-finite as well, so that.

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable

Lecture 2: Random Variables and Expectation

Lecture 3: Probability Measures - 2

Lecture 12: Multiple Random Variables and Independence

n [ F (b j ) F (a j ) ], n j=1(a j, b j ] E (4.1)

Information Theory and Statistics Lecture 3: Stationary ergodic processes

Notes on Measure Theory. Let A 2 M. A function µ : A [0, ] is finitely additive if, A j ) =

Why study probability? Set theory. ECE 6010 Lecture 1 Introduction; Review of Random Variables

Building Infinite Processes from Finite-Dimensional Distributions

1 Inner Product Space

2 Construction & Extension of Measures

Part II: Markov Processes. Prakash Panangaden McGill University

Northwestern University Department of Electrical Engineering and Computer Science

MTH 404: Measure and Integration

Stat 643 Review of Probability Results (Cressie)

Abstract Measure Theory

ABSTRACT EXPECTATION

Probability and Measure. November 27, 2017

Chapter 5 Random vectors, Joint distributions. Lectures 18-23

Math 4121 Spring 2012 Weaver. Measure Theory. 1. σ-algebras

(2) E M = E C = X\E M

CHAPTER 6. Differentiation

2 Probability, random elements, random sets

CHAPTER 1 SETS AND EVENTS

MATH & MATH FUNCTIONS OF A REAL VARIABLE EXERCISES FALL 2015 & SPRING Scientia Imperii Decus et Tutamen 1

Homework 11. Solutions

Math 564 Homework 1. Solutions.

Measures and Measure Spaces

5 Measure theory II. (or. lim. Prove the proposition. 5. For fixed F A and φ M define the restriction of φ on F by writing.

Lebesgue Integration: A non-rigorous introduction. What is wrong with Riemann integration?

MATH 418: Lectures on Conditional Expectation

Recall that if X is a compact metric space, C(X), the space of continuous (real-valued) functions on X, is a Banach space with the norm

AN INTRODUCTION TO GEOMETRIC MEASURE THEORY AND AN APPLICATION TO MINIMAL SURFACES ( DRAFT DOCUMENT) Academic Year 2016/17 Francesco Serra Cassano

1.4 Outer measures 10 CHAPTER 1. MEASURE

Lecture 5: Expectation

Probability and Random Processes

Independent random variables

Real Analysis Problems

Transcription:

Chapter 1: Probability Theory Lecture 1: Measure space, measurable function, and integration Random experiment: uncertainty in outcomes Ω: sample space: a set containing all possible outcomes Definition 1.1 A collection of subsets of Ω, F, is a σ-field (or σ-algebra) if (i) The empty set /0 F ; (ii) If A F, then the complement A c F ; (iii) If A i F, i = 1,2,..., then their union A i F. (Ω,F ) is a measurable space if F is a σ-field on Ω Two trivial examples: F = {/0,Ω} and F = all subsets of Ω (power set) A nontrivial example: F = {/0,A,A c,ω}, where A Ω C = a collection of subsets of interest (may not be a σ-field) σ(c ): the smallest σ-field containing C (the σ-field generated by C ) σ(c ) = C if C itself is a σ-field σ({a}) = σ({a,a c }) = σ({a,ω}) = σ({a, /0}) = {/0,A,A c,ω} UW-Madison (Statistics) Stat 709 Lecture 1 2018 1 / 16

Borel σ-field R k : the k-dimensional Euclidean space (R 1 = R is the real line) O = all open sets, C = all closed sets B k = σ(o) = σ(c ): the Borel σ-field on R k C B k, B C = {C B : B B k } is the Borel σ-field on C Definition 1.2. Let (Ω,F ) be a measurable space. A set function ν defined on F is a measure if (i) 0 ν(a) for any A F ; (ii) ν(/0) = 0; (iii) If A i F, i = 1,2,..., and A i s are disjoint, i.e., A i A j = /0 for any i j, then ( ) ν A i = ν(a i ). i=1 (Ω,F,ν) is a measure space if ν is a measure on F in (Ω,F ). If ν(ω) = 1, then ν is a probability measure. We usually use P instead of ν; i.e., (Ω,F,P) is a probability space. i=1 UW-Madison (Statistics) Stat 709 Lecture 1 2018 2 / 16

Conventions For any x R, + x =, x = if x > 0, x = if x < 0 0 = 0, + =, a = for any a > 0; or / is not defined Important examples of measures Let x Ω be a fixed point and { c x A δ x (A) = 0 x A. This is called a point mass at x Let F = all subsets of Ω and ν(a) = the number of elements in A F (ν(a) = if A contains infinitely many elements). Then ν is a measure on F and is called the counting measure. There is a unique measure m on (R,B) that satisfies m([a,b]) = b a for every finite interval [a,b], < a b <. This is called the Lebesgue measure. If we restrict m to the measurable space ([0,1],B [0,1] ), then m is a probability measure (uniform distribution). UW-Madison (Statistics) Stat 709 Lecture 1 2018 3 / 16

Proposition 1.1 (Properties of measures) Let (Ω,F,ν) be a measure space. 1 (Monotonicity). If A B, then ν(a) ν(b). 2 (Subadditivity). For any sequence A 1,A 2,..., ( ) ν A i ν(a i ). i=1 3 (Continuity). If A 1 A 2 A 3 (or A 1 A 2 A 3 and ν(a 1 ) < ), then ( ) ν lim A n = lim ν (A n ), n n where ( lim A n = A i or = A i ). n i=1 Let P be a probability measure on (R,B). The cumulative distribution function (c.d.f.) of P is defined to be F(x) = P ((,x]), i=1 i=1 x R UW-Madison (Statistics) Stat 709 Lecture 1 2018 4 / 16

Proposition 1.2 (Properties of c.d.f. s) (i) Let F be a c.d.f. on R. (a) F( ) = lim x F(x) = 0; (b) F( ) = lim x F(x) = 1; (c) F is nondecreasing, i.e., F(x) F(y) if x y; (d) F is right continuous, i.e., lim y x,y>x F (y) = F(x). (ii) Suppose a real-valued function F on R satisfies (a)-(d) in part (i). Then F is the c.d.f. of a unique probability measure on (R,B). Product space I = {1,...,k}, k is finite or Γ i, i I, are some sets i I Γ i = Γ 1 Γ k = {(a 1,...,a k ) : a i Γ i,i I } R R = R 2, R R R = R 3 Let (Ω i,f i ), i I, be measurable spaces i I F i is not necessarily a σ-field σ ( i I F i ) is called the product σ-field on the product space i I Ω i ( i I Ω i,σ ( i I F i )) is denoted by i I (Ω i,f i ) Example: i=1,...,k (R,B) = (R k,b k ) UW-Madison (Statistics) Stat 709 Lecture 1 2018 5 / 16

Product measure Consider a rectangle [a 1,b 1 ] [a 2,b 2 ] R 2. The usual area of [a 1,b 1 ] [a 2,b 2 ] is (b 1 a 1 )(b 2 a 2 ) = m([a 1,b 1 ])m([a 2,b 2 ]) Is m([a 1,b 1 ])m([a 2,b 2 ]) the same as the value of a measure defined on the product σ-field? σ-finite A measure ν on (Ω,F ) is said to be σ-finite iff there exists a sequence {A 1,A 2,...} such that A i = Ω and ν(a i ) < for all i Any finite measure (such as a probability measure) is clearly σ-finite The Lebesgue measure on R is σ-finite, since R = A n with A n = ( n,n), n = 1,2,... The counting measure in is σ-finite if and only if Ω is countable The measure ν(a) = unless A = /0 is not σ-finite UW-Madison (Statistics) Stat 709 Lecture 1 2018 6 / 16

Proposition 1.3 (Product measure theorem) Let (Ω i,f i,ν i ), i = 1,...,k, be measure spaces with σ-finite measures. There exists a unique σ-finite measure on σ-field σ(f 1 F k ), called the product measure and denoted by ν 1 ν k, such that for all A i F i, i = 1,...,k. ν 1 ν k (A 1 A k ) = ν 1 (A 1 ) ν k (A k ) Jiont and marginal c.d.f. s The joint c.d.f. of a probability measure on (R k,b k ) is defined by F(x 1,...,x k ) = P((,x 1 ] (,x k ]), x i R and the ith marginal c.d.f. is defined by F i (x) = lim F(x 1,...,x i 1,x,x i+1,...,x k ) x j,j=1,...,i 1,i+1,...,k There is a 1-1 correspondence between probability and c.d.f. on R k. The product measure corresponds to F(x 1,...,x k ) = F 1 (x 1 ) F k (x k ), (x 1,...,x k ) R k, where F i is the c.d.f. of the ith probability measure. UW-Madison (Statistics) Stat 709 Lecture 1 2018 7 / 16

Measurable function f : a function from Ω to Λ (often Λ = R k ) Inverse image of B Λ under f : f 1 (B) = {f B} = {ω Ω : f (ω) B}. The inverse function f 1 need not exist for f 1 (B) to be defined. f 1 (B c ) = (f 1 (B)) c for any B Λ f 1 ( B i ) = f 1 (B i ) for any B i Λ, i = 1,2,... Let C be a collection of subsets of Λ. Define f 1 (C ) = {f 1 (C) : C C } Definition 1.3 Let (Ω,F ) and (Λ,G ) be measurable spaces. Let f be a function from Ω to Λ. f is called a measurable function from (Ω,F ) to (Λ,G ) iff f 1 (G ) F. UW-Madison (Statistics) Stat 709 Lecture 1 2018 8 / 16

Remarks f is measurable from (Ω,F ) to (Λ,G ) iff for any B G, f 1 (B) = {ω : f (ω) B} F ; we don t care about whether {f (ω) : ω A} is in G or not, A F. If F = all subsets of Ω, then any function f is measurable. If f is measurable from (Ω,F ) to (Λ,G ), then f 1 (G ) is a sub-σ-field of F and is called the σ-field generated by f and denoted by σ(f ). σ(f ) may be much simpler than F A measurable f from (Ω,F ) to (R,B) is called a Borel function. f is Borel if and only if f 1 (a, ) F for all a R. There are a lots of Borel functions (Proposition 1.4) In probability and statistics, a Borel function is also called a random variable. (A random variable = a variable that is random?) A random vector (X 1,...,X n ) is a function measurable from (Ω,F ) to (R n,b n ). (X 1,...,X n ) is a random vector iff each X i is a random variable. UW-Madison (Statistics) Stat 709 Lecture 1 2018 9 / 16

Indicator and simple functions The indicator function for A Ω is: { 1 ω A I A (ω) = 0 ω A. For any B R, /0 0 B,1 B A (B) = A 0 B,1 B A c 0 B,1 B Ω 0 B,1 B. I 1 Then, σ(i A ) = {/0,A,A c,ω} and I A is Borel iff A F Note that σ(i A ) is much simpler than F. Let A 1,...,A k be measurable sets on Ω and a 1,...,a k be real numbers. A simple function is ϕ(ω) = k i=1 a i I Ai (ω) A simple function is nonnegative iff a i 0 for all i. Any nonnegative Borel function can be the limit of a sequence of nonnegative simple functions. UW-Madison (Statistics) Stat 709 Lecture 1 2018 10 / 16

Let A 1,...,A k be a partition of Ω, i.e., A i s are disjoint and A 1 A k = Ω. Then the simple function ϕ with distinct a i s exactly characterizes this partition and σ(ϕ) = σ({a 1,...,A k }). Distribution (law) Let (Ω,F,ν) be a measure space and f be a measurable function from (Ω,F ) to (Λ,G ). The induced measure by f, denoted by ν f 1, is a measure on G defined as ( ) ν f 1 (B) = ν(f B) = ν f 1 (B), B G If ν = P is a probability measure and X is a random variable or a random vector, then P X 1 is called the law or the distribution of X and is denoted by P X. The c.d.f. of P X is also called the c.d.f. or joint c.d.f. of X and is denoted by F X. UW-Madison (Statistics) Stat 709 Lecture 1 2018 11 / 16

Integration Integration is a type of average". Definition 1.4 (a) The integral of a nonnegative simple function ϕ w.r.t. ν is defined as k ϕdν = a i ν(a i ). (b) Let f be a nonnegative Borel function and let S f be the collection of all nonnegative simple functions satisfying ϕ(ω) f (ω) for any ω Ω. The integral of f w.r.t. ν is defined as { } fdν = sup ϕdν : ϕ S f. i=1 (Hence, for any Borel function f 0, there exists a sequence of simple functions ϕ 1,ϕ 2,... such that 0 ϕ i f for all i and lim n ϕn dν = fdν.) UW-Madison (Statistics) Stat 709 Lecture 1 2018 12 / 16

(c) Let f be a Borel function, be the positive part of f, and f + (ω) = max{f (ω),0} f (ω) = max{ f (ω),0} be the negative part of f. (Note that f + and f are nonnegative Borel functions, f (ω) = f + (ω) f (ω), and f (ω) = f + (ω) + f (ω).) We say that fdν exists if and only if at least one of f + dν and f dν is finite, in which case fdν = f + dν f dν. (d) When both f + dν and f dν are finite, we say that f is integrable. Let A be a measurable set and I A be its indicator function. The integral of f over A is defined as fdν = I A fdν. A Note: A Borel function f is integrable if and only if f is integrable. UW-Madison (Statistics) Stat 709 Lecture 1 2018 13 / 16

Notation for integrals fdν = Ω fdν = f (ω)dν = f (ω)dν(ω) = f (ω)ν(dω). In probability and statistics, XdP = EX = E(X) and is called the expectation or expected value of X. If F is the c.d.f. of P on (R k,b k ), f (x)dp = f (x)df (x) = fdf. Extended set For convenience, we define the integral of a measurable f from (Ω,F,ν) to ( R, B), where R = R {, }, B = σ(b {{ },{ }}). Let A + = {f = } and A = {f = }. If ν(a + ) = 0, we define f + dν to be I A c + f + dν; otherwise f + dν =. f dν is similarly defined. If at least one of f + dν and f dν is finite, then fdν = f+ dν f dν is well defined. Example 1.5 For a countable Ω, F = all subsets of Ω, ν = the counting measure, and a Borel f, fdν = f (ω). ω Ω UW-Madison (Statistics) Stat 709 Lecture 1 2018 14 / 16

Example 1.6. If Ω = R and ν is the Lebesgue measure, then the Lebesgue integral of f over an interval [a,b] is written as b f (x)dx = f (x)dx, [a,b] which agrees with the Riemann integral in calculus when the latter is well defined. However, there are functions for which the Lebesgue integrals are defined but not the Riemann integrals. Proposition 1.5 Let (Ω,F,ν) be a measure space and f and g be Borel functions. (i) If fdν exists and a R, then (af )dν exists and is equal to a fdν. (ii) If both fdν and gdν exist and fdν + gdν is well defined, then (f + g)dν exists and is equal to fdν + gdν. It is often a good idea to break the proof into several steps: simple functions, nonnegative functions, and then general functions. a UW-Madison (Statistics) Stat 709 Lecture 1 2018 15 / 16

Proof of Proposition 1.5(i) (the proof of (ii) is an exercise) If a = 0, then (af )dν = 0dν = 0 = a fdν. Suppose that a > 0. If f is simple and 0, then af is also simple and 0 and a fdν = (af )dν follows from the definition of integration. For a > 0 and a general f 0, (af )dν = sup{ ϕdν : ϕ S af = sup { a φdν : φ S f } = asup } { } = sup aφdν : φ = ϕ/a S f { } φdν : φ S f = a fdν For a > 0 and general f, since fdν exists, ) a fdν = a( f + dν f dν = a f + dν a f dν = af + dν af dν = (af ) + dν (af ) dν = (af )dν For a < 0, af = a ( f ) UW-Madison (Statistics) Stat 709 Lecture 1 2018 16 / 16