STAT 7032 Probability Spring Wlodek Bryc

Similar documents
STAT 7032 Probability. Wlodek Bryc

Theorem 2.1 (Caratheodory). A (countably additive) probability measure on a field has an extension. n=1

The main results about probability measures are the following two facts:

Product measure and Fubini s theorem

MATHS 730 FC Lecture Notes March 5, Introduction

1.1. MEASURES AND INTEGRALS

Problem set 1, Real Analysis I, Spring, 2015.

Section 2: Classes of Sets

Lebesgue Integration on R n

3 (Due ). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

18.175: Lecture 2 Extension theorems, random variables, distributions

2 (Bonus). Let A X consist of points (x, y) such that either x or y is a rational number. Is A measurable? What is its Lebesgue measure?

Spring 2014 Advanced Probability Overview. Lecture Notes Set 1: Course Overview, σ-fields, and Measures

Some Background Material

Chapter 4. Measure Theory. 1. Measure Spaces

18.175: Lecture 3 Integration

RS Chapter 1 Random Variables 6/5/2017. Chapter 1. Probability Theory: Introduction

Notes on Measure, Probability and Stochastic Processes. João Lopes Dias

Measure and integration

Measure and Integration: Concepts, Examples and Exercises. INDER K. RANA Indian Institute of Technology Bombay India

Part V. 17 Introduction: What are measures and why measurable sets. Lebesgue Integration Theory

Estimates for probabilities of independent events and infinite series

3 Integration and Expectation

MATH31011/MATH41011/MATH61011: FOURIER ANALYSIS AND LEBESGUE INTEGRATION. Chapter 2: Countability and Cantor Sets

II - REAL ANALYSIS. This property gives us a way to extend the notion of content to finite unions of rectangles: we define

THEOREMS, ETC., FOR MATH 515

Chapter 1. Measure Spaces. 1.1 Algebras and σ algebras of sets Notation and preliminaries

Part II Probability and Measure

Notation. General. Notation Description See. Sets, Functions, and Spaces. a b & a b The minimum and the maximum of a and b

Introduction and Preliminaries

MTH 404: Measure and Integration

Measures. Chapter Some prerequisites. 1.2 Introduction

Theorems. Theorem 1.11: Greatest-Lower-Bound Property. Theorem 1.20: The Archimedean property of. Theorem 1.21: -th Root of Real Numbers

(1) Consider the space S consisting of all continuous real-valued functions on the closed interval [0, 1]. For f, g S, define

1 Measurable Functions

MAT 570 REAL ANALYSIS LECTURE NOTES. Contents. 1. Sets Functions Countability Axiom of choice Equivalence relations 9

2 Measure Theory. 2.1 Measures

1/12/05: sec 3.1 and my article: How good is the Lebesgue measure?, Math. Intelligencer 11(2) (1989),

Probability: Handout

MATH5011 Real Analysis I. Exercise 1 Suggested Solution

Statistical Inference

Lecture 3: Probability Measures - 2

Random Process Lecture 1. Fundamentals of Probability

MATH41011/MATH61011: FOURIER SERIES AND LEBESGUE INTEGRATION. Extra Reading Material for Level 4 and Level 6

Construction of a general measure structure

Summary of Real Analysis by Royden

van Rooij, Schikhof: A Second Course on Real Functions

Integration on Measure Spaces

HILBERT SPACES AND THE RADON-NIKODYM THEOREM. where the bar in the first equation denotes complex conjugation. In either case, for any x V define

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

Real Analysis Notes. Thomas Goller

Economics 574 Appendix to 13 Ways

Measures and Measure Spaces

Measures. 1 Introduction. These preliminary lecture notes are partly based on textbooks by Athreya and Lahiri, Capinski and Kopp, and Folland.

Chapter 1: Probability Theory Lecture 1: Measure space, measurable function, and integration

Three hours THE UNIVERSITY OF MANCHESTER. 24th January

Probability. Lecture Notes. Adolfo J. Rumbos

The Lebesgue Integral

Elementary Probability. Exam Number 38119

G1CMIN Measure and Integration

Essential Background for Real Analysis I (MATH 5210)

Lebesgue Measure on R n

Contents. 2 Sequences and Series Approximation by Rational Numbers Sequences Basics on Sequences...

Notes 1 : Measure-theoretic foundations I

STA 711: Probability & Measure Theory Robert L. Wolpert

REVIEW OF ESSENTIAL MATH 346 TOPICS

Problem Set 2: Solutions Math 201A: Fall 2016

Logical Connectives and Quantifiers

Probability and Measure

Lebesgue measure on R is just one of many important measures in mathematics. In these notes we introduce the general framework for measures.

Introduction. Chapter 1. Contents. EECS 600 Function Space Methods in System Theory Lecture Notes J. Fessler 1.1

1.4 Outer measures 10 CHAPTER 1. MEASURE

INTRODUCTION TO MEASURE THEORY AND LEBESGUE INTEGRATION

Chapter 1: Probability Theory Lecture 1: Measure space and measurable function

1. Probability Measure and Integration Theory in a Nutshell

I. ANALYSIS; PROBABILITY

FUNDAMENTALS OF REAL ANALYSIS by. II.1. Prelude. Recall that the Riemann integral of a real-valued function f on an interval [a, b] is defined as

Lebesgue Integration: A non-rigorous introduction. What is wrong with Riemann integration?

Real Analysis Math 131AH Rudin, Chapter #1. Dominique Abdi

Chapter 2 Metric Spaces

Introduction to Real Analysis Alternative Chapter 1

Stat 451: Solutions to Assignment #1

Chapter 1. Sets and probability. 1.3 Probability space

Introduction to Real Analysis

5 Set Operations, Functions, and Counting

L p Spaces and Convexity

Copyright c 2007 Jason Underdown Some rights reserved. statement. sentential connectives. negation. conjunction. disjunction

THEOREMS, ETC., FOR MATH 516

Product measures, Tonelli s and Fubini s theorems For use in MAT4410, autumn 2017 Nadia S. Larsen. 17 November 2017.

arxiv: v1 [math.fa] 14 Jul 2018

l(y j ) = 0 for all y j (1)

Compendium and Solutions to exercises TMA4225 Foundation of analysis

A LITTLE REAL ANALYSIS AND TOPOLOGY

Probability Theory I: Syllabus and Exercise

n [ F (b j ) F (a j ) ], n j=1(a j, b j ] E (4.1)

DRAFT MAA6616 COURSE NOTES FALL 2015

INTRODUCTION TO REAL ANALYSIS II MATH 4332 BLECHER NOTES

(Ω, F, P) Probability Theory Kalle Kyt ol a 1

Lebesgue Measure on R n

Foundations of Analysis. Joseph L. Taylor. University of Utah

Transcription:

STAT 7032 Probability Spring 2018 Wlodek Bryc Created: Friday, Jan 2, 2014 Revised for Spring 2018 Printed: January 9, 2018 File: Grad-Prob-2018.TEX Department of Mathematical Sciences, University of Cincinnati, Cincinnati, OH 45221 E-mail address: brycwz@ucmail.uc.edu

Contents Chapter 0. Math prerequisites 7 1. Convergence 7 2. Set theory 8 3. Compact set 10 4. Riemann integral 10 Additional Exercises 10 Chapter 1. Events and Probabilities 11 1. Elementary and semi-elementary probability theory 11 2. Sigma-fields 14 Required Exercises 16 Additional Exercises 17 Chapter 2. Probability measures 19 1. Existence 19 2. Uniqueness 21 3. Probability measures on R 23 4. Probability measures on R k 25 Required Exercises 27 Additional Exercises 27 Chapter 3. Independence 29 1. Independent events and sigma-fields 29 2. Zero-one law 30 3. Borel-Cantelli Lemmas 31 Required Exercises 31 Additional Exercises 32 Chapter 4. Random variables 35 1. Measurable mappings 35 2. Random variables with prescribed distributions 36 3

4 Contents 3. Convergence of random variables 38 Required Exercises 40 Additional Exercises 40 Chapter 5. Expected values 43 1. Simple random variables 43 2. Inequalities 45 3. The law of large numbers 47 Required Exercises 47 Additional Exercises 47 Chapter 6. Integration 49 1. Approximation by simple random variables 49 2. Expected values 50 3. Inequalities 54 4. Independent random variables 55 Required Exercises 55 Additional Exercises 55 Chapter 7. Product measure and Fubini s theorem 57 1. Product spaces 57 2. Product measure 57 3. Fubini s Theorem 58 Required Exercises 60 Additional Exercises 60 Chapter 8. Sums of independent random variables 61 1. The strong law of large numbers 61 2. Kolmogorov s zero-one law 63 3. Kolmogorov s Maximal inequality and its applications 63 4. Etemadi s inequality and its application 66 Required Exercises 67 Chapter 9. Weak convergence 69 1. Convergence in distribution 69 2. Fundamental results 71 Required Exercises 75 Chapter 10. Characteristic functions 77 1. Complex numbers, Taylor polynomials, etc 77 2. Characteristic functions 79 3. Uniqueness 81 4. The continuity theorem 83 Required Exercises 84 Chapter 11. The Central Limit Theorem 87 1. Lindeberg and Lyapunow theorems 88 2. Lyapunov s theorem 90

Contents 5 3. Strategies for proving CLT without Lindeberg condition 91 Required Exercises 91 Chapter 12. Limit Theorems in R k 93 1. The basic theorems 93 2. Multivariate characteristic function 94 3. Multivariate normal distribution 95 4. The CLT 96 Required Exercises 99 Appendix A. Addenda 101 1. Modeling an infinite number of tosses of a coin 101 Appendix. Bibliography 103 Appendix. Index 105

Chapter 0 Math prerequisites 1. Convergence 1.1. Convergence of numbers. Recall that for a sequence of numbers, lim n a n = L means that... n=1 a n = L means that... Theorem 0.1. If a sequence of real numbers {a n } is bounded and increasing, then lim n a n = sup n N a n. For unbounded increasing sequences we write lim n a n =. Recall that for a sequence of numbers a n, lim sup a n = lim n sup n k n a k and lim inf n a n = lim n inf k n a k. Remark 0.1. It is clear that lim inf n a n lim sup n a n. lim n a n exists as an extended number in [, ]. The equality holds iff the limit Similarly, for a sequence of functions f n : Ω R, we define functions f, f : Ω R = R {, } by f = lim inf n f n and f = lim sup n f n pointwise. We say that the sequence of functions {f n } converges pointwise, if f n (ω) converges for all ω Ω. We say that the sequence of functions {f n } converges uniformly over Ω to f, if sup ω Ω f n (ω) f(ω) 0. 2. Set theory (1) For a set Ω, by 2 Ω we denote the so called power set, i.e., the set of all subsets of Ω. We use upper case letters like A, B, C,... for the subsets - some (but not all) will be interpreted as events. (2) The empty set is - in handwriting this needs to be carefully distinguished from the Greek letters ϕ or Φ. 7

8 0. Math prerequisites (3) We use A B, for the union, A B for the intersection, A c or A for the complement. We do not use A + B and AB in this course!!! (4) We use A B for what some other books denote by A B. Sometimes it will be convenient to write this as B A. Collections of sets will be dented by scripted letters, like A or F. We will need to consider large collections of sets, as well as collections like A = {A 1, A 2,... }. (5) For a family A = {A t : t T } of subsets of Ω indexed by a set T, the union of all sets in A is the set of ω with the property that there exists a set A t A such that ω A t. In symbols, A t = {ω Ω : ω A t for some t T } = {ω Ω : t T ω A t } t T More concisely, A = {ω Ω : ω A for some A A} = {ω Ω : A A ω A} A A Similarly, we define the intersection A t = {ω Ω : t T ω A t } t T In particular, for a countable collection of sets, A n = A n = {ω : ω A n for some n N} n N A n = n N n=1 A n = {ω : ω A n for all n N} n=1 (6) The notation for intervals is (a, b) = {x R : a < x < b}, [a, b) = {x R : a x < b} and similarly (a, b] and [a, b]. Theorem 0.2 (DeMorgan s law). (0.1) (0.2) ( t T A t)c = t T Since (A c ) c = A, formula (0.1) is equivalent to ( t T A t)c = t T 2.1. Indicator functions and limits of sets. This has application to the so called indicator functions: { 1 if ω A (0.3) I A (ω) = 0 otherwise A c t A c t Since I An (ω) = { 0 1,

4. Riemann integral 9 it is clear that lim sup I An (ω) = n { 0 This means that lim sup n I An (ω) = I A (ω) for some set A Ω. For the same reasons, lim inf n I An = I A for some set A Ω. 1. Proposition 0.3. A = A k and A = n N k n n N k n A k Proof. This is Exercise 0.1. The second set has probabilistic interpretation: A = {A n occur infinitely often } = {A n i. o. } It is clear that A A. We say that lim n A n exists if A = A. Exercises 0.3 and 0.4 give examples of such limits. 2.2. Cardinality. Sets A, B have the same cardinality if there exists a one-to-one and onto function f : A B. We shall say that a set A is countable if either A is finite, or it has the same cardinality as the set N of natural numbers. It is known that the set of all rational numbers Q is countable while the interval [0, 1] R is not countable. 3. Compact set Recall that if K is compact if every sequence x n K has a convergent subsequence (with respect to some metric d). Equivalently, from every open cover of K one can select a finite sub-cover. If K is compact and sets F n K are closed with non-empty intersections n k=1 F k for all n, then the infinite intersection k=1 F k is also non-empty. Theorem 0.4. Closed bounded subsets of R k are compact. 4. Riemann integral Function f : [a, b] R is Riemann-integrable, with integral S = b a f(x)dx, if for every ε > 0 there exists δ > 0 such that S f(x j ) I j < ε i for every partition of [a, b] into sub-intervals I j of length I j < δ and every choice of x j I j. Every Riemann-integrable function is Lebesgue-integrable over [a, b]. It is known that continuous functions are Riemann-integrable. In calculus, the improper integral t 0 f(x)dx is defined as the limit lim t 0 f(x)dx. This is not the same as the Lebesgue integral over [0, ).

10 0. Math prerequisites Additional Exercises Exercise 0.1. Prove Proposition 0.3. Exercise 0.2. Suppose B, C are subsets of Ω and { B if n is even A n = C if n is odd Identify the sets A = n N k n A k and A = n N k n A k. Exercise 0.3. Suppose A 1 A 2 A n.... Show that lim n A n exists (and describe the limit). Exercise 0.4. Suppose A 1 A 2 A n.... Show that lim n A n exists (and describe the limit).

Chapter 1 Events and Probabilities Abstract. Fields. Probability measures and σ-fields. Sigma-field generated by a collection of sets. Lebesgue measure on the unit interval and unit square. Borel sigma-field. 1. Elementary and semi-elementary probability theory The standard model of probability theory is the triplet (Ω, F, P ), where Ω is a set, F 2 Ω, and P is a function F [0, 1]. The set Ω is called sometimes a sample space or a probability space. Sets A F are called events, and the number P (A) is called probability of event A. We say that event A occurred, if ω A, and we interpret P (A) as the likelihood that event A occurred. 1.1. Field of events. It is natural to expect that the events form a field. Definion 1.1. A class F of subsets of Ω is a field if: (i) Ω F (ii) if A F then A c F (iii) if A, B F then A B F By induction, if A 1, A 2,..., A n F then A 1 A n = 1 j n A j F. By DeMorgan s law (Theorem 0.2), a field is also closed under intersections, 1 j n A j F. In particular, we can replace axiom (iii) by (iii ) if A, B F then A B F Example 1.1. The class B 0 of finite unions of disjoint left-open right-closed subintervals of (0, 1], is a field. Proof. (0, 1] B 0. If A = K j=1 (a j, b j ] with a 1 < b 1 a 2 < b 2 a K < b K then A c = (0, a 1 ] (b 1, a 2 ] (b K 1, a K ] (b K, 1], where some of the intervals might be empty. If A = j I j and B = k J k then A B = j,k I j J k and intersections I j J k are disjoint, possibly empty, intervals of the form (a, b]. Similarly, for Ω := (0, 1] (0, 1], the set of finite unions of rectangles (a, b] (c, d] is a field. 11

12 1. Events and Probabilities 1.2. Finitely additive probabilities. We want to assign the number P (A), as a measure of the likelihood that the event A occurred. Definion 1.2. Let F be a field. A function P : F R is a finitely-additive probability measure if it satisfies the following conditions (i) 0 P (A) 1 for A F (ii) P ( ) = 0, P (Ω) = 1. (iii) If A, B F are disjoint, then P (A B) = P (A) + P (B). A function P : F R is a probability measure on F if it is finitely-additive and satisfies the following continuity condition (iv) If A 1 A 2... are sets in F and k A k =, then lim n P (A n ) = 0 Remark 1.1 (Probability measures are countably additive). Axioms (iii) and (iv) are often combined together into countable additivity, (iii+) If A 1, A 2,..., F are pairwise disjoint and k A k F, then P ( k A k) = k=1 P (A k). Other equivalent versions of continuity or countable additivity are: (1) If A 1 A 2... are sets in F and k A k F, then P ( k A k) = lim n P (A n ) (2) If A 1 A 2... are sets in F and k A k F, then P ( k A k) = lim n P (A n ) Proposition 1.1 (Elementary properties). Suppose P is a finitely additive probability measure on the field F of subsets of Ω. For A, B F we have (1) B A implies P (A) P (B) (2) P (A c ) = 1 P (A) (3) P (A B) = P (A) + P (B) P (A B) Proof. (1) follows from A = B (A \ B) by finite additivity. (2) follows from Ω = A A c (3) is a special case of Exercise 1.1. Example 1.2. Let Ω = N and F consist of all subsets A N such that the limit lim n #(A {1,..., n})/n exists. Then P : F [0, 1] defined by P (A) = lim n #(A {1,..., n})/n is a finitely additive probability measure. However, Exercise 1.7 says that P is not continuous. Constructions of finitely additive continuous measures are somewhat more involved. 1.2.1. Example: Lebesgue measure on the unit interval. In this example we consider Ω = (0, 1] and the field B 0 from Example 1.1. For A = n k=1 I k B 0 with disjoint I k, define λ(a) = n k=1 I k. Theorem 1.2. λ is a well defined (continuous) probability measure on the field B 0. Proof. Since the representation A = n k=1 I k B 0 is not unique, we need to make sure that λ is well defined. Write A = k I k = j J j as the finite sums of disjoint intervals. Then I k = I k A = j I k I j so by finite additivity k I k = k This shows that j J j = k I k, so λ(a) is indeed well defined. j I k I j and similarly j J j = j k I k I j.

1. Elementary and semi-elementary probability theory 13 To prove continuity, we proceed by contrapositive. Suppose that A n A n+1 are such that λ(a n ) > δ > 0. Choose B n K n A n such that λ(a n ) λ(b n ) < δ/2 n and K n is compact. Then P (A n ) P (B 1 B n ) = P ( n k=1 A n \ B k ) P ( n k=1 A k \ B k ) k=1 δ/2k = δ/2. So P (B 1 B n ) δ/2 > 0, and K 1 K n. Thus, n=1 A n n=1 K n. Construction of Lebesgue measure has the following generalization. Suppose B 0 2 R is a field consisting of finite unions of intervals (, b], (a, b], (b, ). Theorem 1.3 (Lebesgue). If P is a (continuous) probability measure on B 0 then the function F (x) := P ((, x]) has the following properties: (i) F is non-decreasing (ii) lim x F (x) = 0 (iii) lim x F (x) = 1 (iv) F is right-continuous, i.e., lim y x F (y) = F (x) Conversely, if F is a function with properties (i)-(iv) then there exists a (unique) continuous probability measure P on B 0 such that F (x) := P ((, x] for all x R. Sketch of the proof. The proof of (iv) may require some care: For rational r x, we have (0, x] = r>x (0, r]. Any real y lies between two rational numbers. The proof of converse has two parts. For A B 0 given by A = n j=1 (a j, b j ], the definition P (A) = n j=1 (F (b j) F (a j ) does not depend on the representation. (Here, we set F ( ) = 0 and F ( ) = 1.) Uniqueness is an obvious consequence of finite additivity. Then we need to verify continuity. This proof can proceed similarly to the proof of Theorem 1.2. (See hints for Exercise 1.12.) A generalization of the above construction is based on the concept of a semi-algebra. Definion 1.3. A collection S of subsets of Ω is called semi-algebra, or a semi-ring, if (i) S (ii) S is closed under intersections, i.e. if A, B S then A B S (iii) If A, B S then B \ A is a finite union of sets in S. The main (motivating) example of a semi-algebra is the family of rectangles in R 2, and more generally, in R d. The following is a version of [Durrett, Theorem 1.1.4] adapted to probability measures. Theorem 1.4. Let S be a semi-algebra. Suppose that P : S [0, 1] is additive, countably subadditive, i.e., if A = n=1 A n is in S for pairwise disjoint sets A n S then P (A) n=1 P (A n). If P ( ) = 0, then P has a unique extension onto the field F generated by S, and this extension is continuous. Proof. (Omitted in 2018)

14 1. Events and Probabilities Question 1.1. Why not to require uncountable continuity of probability measures? Compare Exercise 1.3 and Exercise 1.3. 2. Sigma-fields Given an infinite sequence {A n } of events, it is convenient to allow also more complicated events such as A = n N k n A k that events A k occur infinitely often. This motivates the following. Definion 1.4. A class F of subsets of Ω is a σ-field if it is field and if it is also closed under the formation of countable unions: (iii+) If A 1, A 2 F then n N A n F. Note that (iii+) implies (iii) because we can take A 1 = A and A n = B for other n. By an application of DeMorgan s law (Theorem 0.2), (iii+) can be replaced by If A 1, A 2, F then n N A n F. Clearly, the power set 2 Ω is the largest possible σ-field. We will often consider smallest σ-fields that contain some collections of sets of our interest. Proposition 1.5. Suppose A is a collection of subsets of Ω. There exist a unique σ-field F with the following properties: (1) A A implies A F. That is, A F. (2) If G is a σ-field such that A F then F G. We write F = σ(a), and call F the sigma-field generated by A. Proof. Uniqueness is a consequence of (2) To show that F exists, consider a set M of all sigma-fields G with the property that A G. Since 2 Ω M, this is a nonempty family of sets. Define F = G. G M Then F is a sigma-field with the required properties, because the intersection of sigma-fields is a sigma-field (can you verify this?). Definion 1.5. The Borel sigma-field is the sigma field generated by all open sets. Proposition 1.6. For Ω = R, the Borel sigma-field B 1 is generated by the intervals {(a, b] : a < b}. For Ω = R d, the Borel sigma-field B d is generated by the rectangles d k=1 (a k, b k ]. Note that Borel-field B 0 2 R generated by all intervals (a, b] consist of finite unions of intervals (, b], (a, b], (b, ).

2. Sigma-fields 15 2.1. Probability measures. Definion 1.6. If F is a σ-field of subsets of Ω and P is a probability measure on F, then the triple (Ω, F, P ) is called a probability space. The sets A F are called events. Example 1.3. Let F = 2 Ω and fix ω 0 Ω. Then P (A) = I A (ω 0 ) is a probability measure, sometimes called the point mass, denoted by δ ω0. Since a convex combination of probability measures is a probability measure, another example of a probability measure is P = 1 2 δ ω 0 + 1 2 δ ω 1. Example 1.4 (Discrete probability space). Let F be the σ-field of all subsets of a countable Ω = {ω 1, ω 2,... } Suppose p 1, p 2,... is a sequence of nonnegative numbers such that k=1 p k = 1. Define P (A) = Then P is a probability measure. k:ω k A Proof. This is not entirely trivial. If A = j A j with disjoint sets then P (A) = ω i A j p i does not depend on the order of summation, and equals to the iterated series j=1 i: ω i A j p i. Example 1.5 (Discrete probability measure). Let F be the σ-field of all subsets of an infinite set Ω. Suppose ω 1, ω 2, Ω are fixed, and p 1, p 2,... is a sequence of nonnegative numbers such that k=1 p k = 1. Define P (A) = Then P is a probability measure. k: ω k A The numbers p k are sometimes called the probability mass function, or the probability density function (as this is the density with respect to the counting measure on points {x 1, x 2,... }). More generally, if P 1, P 2,..., is a sequence of probability measures on the common field or σ-field F and p 1, p 2,... is a sequence of nonnegative numbers such that k=1 p k = 1, then Q(A) = k=1 p kp k (A) is also a probability measure on F. The probabilistic interpretation is that we have a sequence of experiments described by probability measures P k and we want to model a new experiment with additional randomization where the k-th experiment is performed with probability p k. 2.1.1. Examples of discrete distributions on R. Example 1.6 (Binomial distribution). For fixed integer n and 0 < p < 1, take x k = k and The binomial formula shows that n k=0 p k = 1. p k p k p k = ( n k )pk (1 p) n k, k = 0,... n. Example 1.7 (Poisson distribution). For λ > 0 take x k = k and λ λk p k = e k!, k = 0, 1.... Example 1.8 (Polya s distribution). For r > 0 and 0 < p < 1 take x k = k and p k = Γ(r + k) k!γ(r) (1 p)r p k, k = 0, 1,...

16 1. Events and Probabilities Required Exercises Exercise 1.1. Prove the inclusion-exclusion formula 1 n (1.1) P ( A j ) j=1 n = P (A j ) P (A j1 A j2 )+ P (A j1 A j2 A j3 )+ +( 1) n P (A 1 A 2 A n ) j=1 1 j 1 <j 2 n 1 j 1 <j 2 <j 3 n Does the proof use countable additivity? Exercise 1.2. For a probability space (Ω, F, P ), if B 1, B 2,... is a sequence of events such that n k=1 P (B k) > n 1, show that P ( n k=1 B k) > 0. Exercise 1.3. For a probability space (Ω, F, P ), suppose {B n : n N} are events with P (B n ) = 1. Show that. ( ) P B n = 1 n=1 Exercise 1.4. Suppose that Ω = N and for n N let F n be the σ-field generated by the collection of one-point sets A n = {{1}, {2},..., {n}}. Show that F n F n+1 and that F := n F n is a field but not a σ-field. Exercise 1.5. Show that measure P in Example 1.2 is additive but not continuous. (For the second statement, find A n F such that A n A n+1 and n=1 A n =, but P (A n ) = 1.) Exercise 1.6. Without using Proposition 1.6, show that open intervals (a, b) and closed intervals [a, b] are in the sigma-field generated by the intervals (a, b] in R. (Compare Example 1.1.) 1 This can also be written as ( n ) n Pr A k = ( 1) M 1 Pr(A j1 A j2 A jm ) k=1 M=1 1 j 1 <j 2 < <j M n

Additional Exercises 17 Additional Exercises Exercise 1.7. Without using Proposition 1.6, show that the open triangle T = {(x, y) : x > 0, y > 0, x + y < 1]} is in the sigma-field generated by the rectangles (a, b] (c, d) in R 2. Exercise 1.8. Suppose that F n are fields satisfying F n F n+1. Show that n F n is a field. Exercise 1.9. Suppose P is a finitely additive measure on a field F. Show that if A 1,..., A n,... are disjoint then the series n=1 P (A n) converges. Exercise 1.10. Prove that continuous finitely-additive probability measure on a field is countably additive. That is, show that property (iii+) of Remark 1.1 follows from the axioms (i)-(iv) of Definition 1.2. Exercise 1.11. if P 1, P 2,..., is a sequence of continuous probability measures on the field F and p 1, p 2,... is a sequence of nonnegative numbers such that k=1 p k = 1, show that Q(A) = k=1 p kp k (A) is also continuous. Exercise 1.12. Suppose Ω is a metric space and F is a field of subsets of Ω. Suppose that P is a finitely additive probability measure on F. Lets say that P is a tight probability measure if for every A F with P (A) > 0 and ε > 0 there exist B F and a compact set K such that B K A and P (A) < P (B) + ε. (1) In the setting of Theorem 1.2, show that the Lebesgue measure on B 0 is tight. (2) Show that a tight finitely additive probability measure is countably-additive. Hint: Proceed by contrapositive! 2 Exercise 1.13 (Compare Exercise 1.4). Let Ω be an infinite set. Consider the following classes of subsets of Ω: F n = {A Ω : A has at most n elements or A c has at most n elements} Then we have the following facts: F n F n+1 F 0 is a σ-field For n 1, class F n is not a field n F n is a field but it is not a σ-field. n F n is not a σ-field. 2 Plan of proof: Suppose A 1 A 2... are sets in F such that there exists δ > 0 with P (A n) > δ for all n. Using tightness, we can find compact sets K 1, K 2,... and sets B j F such that B j K j and B 1 B 2 B n has positive probability. In fact, we can find such B j with P (B 1 B 2 B n) of at least δ(1 n j=1 1/2j ) > 0 Since every finite intersection K 1 K 2 K n contains B 1 B 2 B n, we see that n Kn is nonempty. So n An cannot be empty.

18 1. Events and Probabilities Exercise 1.14 (Compare Exercise 1.3). Suppose {B t : t T } are events with P (B t ) = 1. Give an example where t T B t = so P ( t T B t) = 0. Hint: Lebesgue measure on Borel (0, 1] Exercise 1.15. Let Ω be a nonempty set and C be the class of one-element sets. Show that if A σ(c) then either A is countable or A c is countable. Exercise 1.16. Suppose A and B are σ-fields of subsets of Ω. Let F = A B be the smallest σ-field containing both A and B. Show that F is generated by sets of the form A B where A A and B B. Exercise 1.17. The field F(A) generated by a class A of subsets of Ω is defined as the intersection of all fields in Ω containing all of the sets in A. Show that F(A) is indeed a field, that A F(A) and that F(A) is minimal in the sense that if G is a field and A G then F(A) G. Show that if A is nonempty then F(A) is the class of sets of the form m j=1 B j where sets B j are disjoint and are of the form B = n i=1 A i where either A i A or A c i A. Exercise 1.18. For Ω = (0, 1] and any A Ω define { } P = inf B k : B k B 0, B k A where B is the sum of lengths of intervals forming B. (1) Show that 0 P (A) 1 (2) Show that P (A B) P (A) + P (B) (3) Show that P B0 = λ, the Lebesgue measure from Theorem 1.2. (4) Show that P ({x}) = 0. k k=1

Bibliography [Billingsley] P. Billingsley, Probability and Measure IIIrd edition [Durrett] R. Durrett, Probability: Theory and Examples, Edition 4.1 (online) [Gut] A. Gut, Probability: a graduate course [Resnik] S. Resnik, A Probability Path, Birkhause 1998 [Proschan-Shaw] S M. Proschan and P. Shaw, Essential of Probability Theory for Statistitcians, CRC Press 2016 [Varadhan] S.R.S. Varadhan, Probability Theory, (online pdf from 2000) 103

Index L p-norm, 46 λ-system, 22 π-system, 21 σ-field, 14 σ-field generated by a random element X, 35 distribution of a random variable, 36 Binomial distribution, 15, 52 Borel σ-field, 35 Borel sigma-field, 14 Cantelli s inequality, 48 cardinality, 9 Cauchy distribution, 83 Cauchy-Schwarz inequality, 45 centered, 47 Central Limit Theorem, 87 characteristic function, 79 characteristic function continuity theorem, 83 Characteristic functions uniqueness, 82 Characteristic functions inversion formula, 82 Chebyshev s inequality, 45 complex numbers, 78 conjugate exponents, 46 continuity condition, 12 convergence in distribution, 69 converges in distribution, 93 converges in probability, 38 converges pointwise, 7 converges uniformly, 7 converges with probability 1, 38 convex function, 45 countable additivity, 12 covariance matrix, 96 cumulative distribution function, 23, 36 cylindrical sets, 26, 27 DeMorgan s law, 8 density function, 24 diadic interval, 101 discrete random variables, 37 events, 11, 15 expected value, 43 Exponential distribution, 53 exponential distribution, 24 Fatou s lemma, 51 field, 11 finite dimensional distributions, 26 finitely-additive probability measure, 12 Fubini s Theorem, 58 Geometric distribution, 52 Hölder s inequality, 46, 54 inclusion-exclusion, 16 independent σ-fields, 29 independent events, 29 indicator functions, 8 induced measure, 36 infinite number of tosses of a coin, 101 integrable, 50 intersection, 8 Jensen s inequality, 45 joint cumulative distribution function, 25 joint distribution of random variables, 36 Kolmogorov s maximal inequality, 63 Kolmogorov s one series theorem, 64 Kolmogorov s three series theorem, 64 Kolmogorov s two series theorem, 64 Kolmogorov s zero-one law, 63 Kronecker s Lemma, 65 Lévy distance, 75 Lebesgue s dominated convergence theorem, 51 Lebesgue s dominated convergence theorem used, 52, 62, 71, 84 Levy s theorem, 66 Lindeberg condition, 89 Lyapunov s condition, 90 Lyapunov s inequality, 45 marginal cumulative distribution functions, 25

106 Index Markov s inequality, 45 maximal inequality, Etemadi s, 66 maximal inequality,kolmogorov s, 63 measurable function, 35 measurable rectangle, 57 Minkowki s inequality, 46 Minkowski s inequality, 54 moment generating function, 56 Monotone Convergence Theorem, 51 multivariate normal distribution, 95 negative binomial distribution, 15 normal distribution, 24 Poisson distribution, 15, 53 Polya s distribution, 15 Portmanteau Theorem, 71 power set, 7 probability, 11 probability measure, 12 probability space, 11, 15 product measure, 58 quantile function, 37, 71 random element, 35 random variable, 35 random vector, 35 sample space, 11 Scheffe s theorem, 69 section, 57 semi-algebra, 13 semi-ring, 13 sigma-field generated by A, 14 simple random variable, 43 Skorohod s theorem, 71 Slutsky s Theorem, 70 Standard normal density, 53 stochastic process with continuous trajectories, 36 stochastically bounded, 40 symmetric distribution, 67 tail σ-field, 30 Tail integration formula, 59 tight, 40 tight probability measure, 17 Tonelli s theorem, 58 uncorrelated, 47 uniform continuous, 24 Uniform density, 53 uniform discrete, 24 uniform singular, 24 uniformly integrable, 52, 73 union, 8 variance, 44 zero-one law, 30, 63