B Supplemental Notes 2 Hypergeometric, Binomial, Poisson and Multinomial Random Variables and Borel Sets

Similar documents
Axioms of Measure Theory

If a subset E of R contains no open interval, is it of zero measure? For instance, is the set of irrationals in [0, 1] is of measure zero?

It is always the case that unions, intersections, complements, and set differences are preserved by the inverse image of a function.

The Borel hierarchy classifies subsets of the reals by their topological complexity. Another approach is to classify them by size.

Introduction to Probability. Ariel Yadin. Lecture 7

Product measures, Tonelli s and Fubini s theorems For use in MAT3400/4400, autumn 2014 Nadia S. Larsen. Version of 13 October 2014.

Lecture Notes for Analysis Class

The Boolean Ring of Intervals

Integrable Functions. { f n } is called a determining sequence for f. If f is integrable with respect to, then f d does exist as a finite real number

Advanced Stochastic Processes.

Solution. 1 Solutions of Homework 1. Sangchul Lee. October 27, Problem 1.1

Chapter 3. Strong convergence. 3.1 Definition of almost sure convergence

Last Lecture. Wald Test

Definition 4.2. (a) A sequence {x n } in a Banach space X is a basis for X if. unique scalars a n (x) such that x = n. a n (x) x n. (4.

Singular Continuous Measures by Michael Pejic 5/14/10

MATH301 Real Analysis (2008 Fall) Tutorial Note #7. k=1 f k (x) converges pointwise to S(x) on E if and

Convergence of random variables. (telegram style notes) P.J.C. Spreij

Probability and Statistics

(A sequence also can be thought of as the list of function values attained for a function f :ℵ X, where f (n) = x n for n 1.) x 1 x N +k x N +4 x 3

Introductory Ergodic Theory and the Birkhoff Ergodic Theorem

FUNDAMENTALS OF REAL ANALYSIS by

Distribution of Random Samples & Limit theorems

Sequences and Series of Functions

Lecture 19: Convergence

Math 299 Supplement: Real Analysis Nov 2013

7.1 Convergence of sequences of random variables

} is said to be a Cauchy sequence provided the following condition is true.

Measure and Measurable Functions

STAT Homework 1 - Solutions

Probability and statistics: basic terms

62. Power series Definition 16. (Power series) Given a sequence {c n }, the series. c n x n = c 0 + c 1 x + c 2 x 2 + c 3 x 3 +

Here are some examples of algebras: F α = A(G). Also, if A, B A(G) then A, B F α. F α = A(G). In other words, A(G)

HOMEWORK I: PREREQUISITES FROM MATH 727

An Introduction to Randomized Algorithms

2.1. The Algebraic and Order Properties of R Definition. A binary operation on a set F is a function B : F F! F.

Lecture 3 : Random variables and their distributions

Chapter 1. Probability

Lecture 3 The Lebesgue Integral

EE 4TM4: Digital Communications II Probability Theory

Chapter 0. Review of set theory. 0.1 Sets

Lecture 7: Properties of Random Samples

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

Topic 9: Sampling Distributions of Estimators

Math 61CM - Solutions to homework 3

Lecture 12: November 13, 2018

lim za n n = z lim a n n.

MATH 413 FINAL EXAM. f(x) f(y) M x y. x + 1 n

ON POINTWISE BINOMIAL APPROXIMATION

Mathematics 170B Selected HW Solutions.

Homework 1 Solutions. The exercises are from Foundations of Mathematical Analysis by Richard Johnsonbaugh and W.E. Pfaffenberger.

Arkansas Tech University MATH 2924: Calculus II Dr. Marcel B. Finan

Simulation. Two Rule For Inverting A Distribution Function

18.440, March 9, Stirling s formula

Math 155 (Lecture 3)

We are mainly going to be concerned with power series in x, such as. (x)} converges - that is, lims N n

Math 113 Exam 3 Practice

Chapter 6 Infinite Series

Fall 2013 MTH431/531 Real analysis Section Notes

University of Colorado Denver Dept. Math. & Stat. Sciences Applied Analysis Preliminary Exam 13 January 2012, 10:00 am 2:00 pm. Good luck!

2 Banach spaces and Hilbert spaces

REAL ANALYSIS II: PROBLEM SET 1 - SOLUTIONS

6. Uniform distribution mod 1

6. Sufficient, Complete, and Ancillary Statistics

Limit Theorems. Convergence in Probability. Let X be the number of heads observed in n tosses. Then, E[X] = np and Var[X] = np(1-p).


Let us give one more example of MLE. Example 3. The uniform distribution U[0, θ] on the interval [0, θ] has p.d.f.

Infinite Sequences and Series

CS284A: Representations and Algorithms in Molecular Biology

Assignment 5: Solutions

Discrete probability distributions

The Random Walk For Dummies

Lecture 6: Coupon Collector s problem

Discrete Mathematics and Probability Theory Summer 2014 James Cook Note 15

Summary. Recap ... Last Lecture. Summary. Theorem

IE 230 Seat # Name < KEY > Please read these directions. Closed book and notes. 60 minutes.

7.1 Convergence of sequences of random variables

Relations Among Algebras

3 Gauss map and continued fractions

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.265/15.070J Fall 2013 Lecture 21 11/27/2013

Math 341 Lecture #31 6.5: Power Series

Approximations and more PMFs and PDFs

1 The Haar functions and the Brownian motion

MATH4822E FOURIER ANALYSIS AND ITS APPLICATIONS

Midterm Exam #2. Please staple this cover and honor pledge atop your solutions.

from definition we note that for sequences which are zero for n < 0, X[z] involves only negative powers of z.

HOMEWORK #4 - MA 504

MAT1026 Calculus II Basic Convergence Tests for Series

Real and Complex Analysis, 3rd Edition, W.Rudin

Introduction to Probability. Ariel Yadin. Lecture 2

Introduction to Optimization Techniques

Council for Innovative Research

MA131 - Analysis 1. Workbook 10 Series IV

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 19 11/17/2008 LAWS OF LARGE NUMBERS II THE STRONG LAW OF LARGE NUMBERS

Mathematical Methods for Physics and Engineering

Some discrete distribution

Ma 530 Infinite Series I

Element sampling: Part 2

4. Partial Sums and the Central Limit Theorem

Week 5-6: The Binomial Coefficients

Transcription:

B671-672 Supplemetal otes 2 Hypergeometric, Biomial, Poisso ad Multiomial Radom Variables ad Borel Sets 1 Biomial Approximatio to the Hypergeometric Recall that the Hypergeometric istributio is fx = x x x = x x 0 elsewhere which we may write as x = max 0, +,...,mi, fx = x 1 i x 1 i 1 i Sice there are terms i the umerator ad the deomiator we may divide both by to obtai fx = Sice x 1 x 1 1 x 1 i x 1 1 i i i for i = 0,1,...,x 1 i for i = 0,1,..., x 1 i 1 for i = 0,1,..., 1 Fall2004 1 supp2.tex

we have that x 1 x 1 ad it follows that x x 1 x x x 1 x x 1 1 i 1 x 1 x i i 1 x fx x x x 1 If we assume that lim = p ad that ad x are fixed the ad lim x It follows that x 1 lim lim x x x x 1 x 1 x = x x = x = p x p x x p x p x x p x p x x x so that the hypergeometric distributio ca be approximated by the biomial with p = Fall2004 2 supp2.tex

2 Poisso Approximatio to the Biomial The probability of x successes i Beroulli trials with trials ad probability of success p o each trial is give by the biomial distributio i.e. P X = x = p x x p x for x = 0,1,2,..., Suppose ow that such that lim p = lim λ = λ > 0 The biomial distributio ca the be writte as Hece P X = x = x 1 x λ x x! = = x 1 p x x p x 1 x + 1 x! i λ x λ x! λ P x λx x! x λ x λ λ x λ x Fall2004 3 supp2.tex

ad ow for fixed x L x = U x = λx x! x 1 λ x λ x x! λ λ λx x! e λ x λx x! e λ so that lim L x = lim U x = λx x! e λ Hece lim P x = λx x! e λ i.e. the biomial distributio approaches the Poisso as p λ ad hece ca be used to approximate the biomial uder this coditio. Fall2004 4 supp2.tex

3 Multivariate Hypergeometric ad Multiomial istributios Cosider a populatio of idividuals each classified ito oe of k mutually exclusive categories C 1,C 2,...,C k. Suppose that there are i idividuals i category C i for i = 1,2,...,k. ote that k i=1 i =. If we draw a radom sample of size without replacemet from this populatio the probability of x i i the sample from category C i, i = 1,2,...,k is Px 1,x 2,...,x k =! 1 x1 2 x2 k xk x 1!x 2! x k! which is called the multivariate hypergeometric distributio with parameters 1, 2,..., k. It is ot widely used sice the multiomial distributio provides a excellet approximatio. Rewrite the distributio as Px 1,x 2,...,x k =! ki=1 x i! ki=1 x i 1 1 l=0 j=0 i j l ividig the umerator ad deomiator by as i the developmet of the biomial approximatio to the hypergeometric yields Sice i x i 1 Px 1,x 2,...,x k =! ki=1 x i! i j 1 i ki=1 x i 1 j=0 1 l=0 i j l for j = 0,1,...,x i 1 i = 1,2,...,k l 1 for l = 0,1,..., 1 Fall2004 5 supp2.tex

we have that k i=1 ad i x i 1 xi k i=1 1 x i 1 j=0 1 i j k i=1 l 1 xi i Thus the multivariate hypergeometric distributio is bouded below by ad is bouded above by! x 1!x 2! x k! k i=1! x 1!x 2! x k! i x x i 1 i 1 xi ki=1 i Thus if is fixed ad lim i = p i for each i the lim Px 1,x 2,...,x k =! x 1!x 2! x k! It follows that the multivariate hypergeometric distributio ca be approximated by the multiomial distributio with p i = i for i = 1,2,...,k. k i=1 p x i i Fall2004 6 supp2.tex

4 Borel Sets ad Measurable Fuctios 4.1 ecessity for Borel Sets Let Ω = [0,1 ad let PE be the legth of E i.e. the uiform probability measure. It is ot possible for P to be defied for all subsets of Ω i such a way that P satisfies 0 PE 1, PΩ = 1, P is coutably additive, ad PE is equal to the legth of E. To show this a o-measurable set is costructed by the followig procedure: 1 efie a equivalece relatio o Ω = [0, 1 such that Ω = [0,1 = t T E t where the E t are disjoit. The equivalece relatio is defied by x y mod 1 x + r = y where r is a ratioal umber where mod 1 x 1 + x 2 = x 1 + x 2 if x 1 + x 2 1 x 1 + x 2 1 if x 1 + x 2 > 1 for x 1,x 2 [0,1 2 Use the Axiom of Choice to select a set F cotaiig exactly oe poit from each of the E t. 3 Order the ratioal umbers as ad defie the sets F i by r 0,r 1,r 2,... where r 0 = 0 F i = mod 1 F + r i Fall2004 7 supp2.tex

You ca show that the F i are mutually exclusive ad that F i = Ω = [0,1 4 If F is measurable so are each of the F i ad It the follows that PF = PF i 1 = PΩ = P[0,1 = P F i = = PF i PF Thus F is ot measureable i.e. o value PF ca be assiged to F. Referece: Couterexamples i Probability ad Statistics, 1986 Romao ad Siegel, Wadsworth ad Brooks/Cole Fall2004 8 supp2.tex

4.2 Measurable Fuctios ad Radom Variables I the study of probablity ad its applicatios to statistics we eed to have a collectio of radom variables measurable fuctios large eough to esure that probabilities are well defied. Recall that most of classical aalysis calculus, etc. deals with cotiuous fuctios ad limits of sequeces of cotiuous fuctios. Sice limits of sequeces of cotiuous fuctios are ot ecessarily cotiuous ad also sice a sequece of cotiuous fuctios may ot ted to a fiite limit it is coveiet to exted slightly the defiitio of the fuctios uder cosideratio by allowig fuctios to take the values ± ad to exted the otio of a Borel σ-field to be the smallest σ-field geerated by the sets { }, {+ } ad B where B is the class of all Borel sets. With this extesio the collectio of measurable fuctios from Ω, W to R, B is closed uder the followig usual operatios: arithmetic operatios sums, products ifima ad suprema poitwise limits of sequeces of fuctios Fall2004 9 supp2.tex

There are ow two defiitios of measurable fuctios: Costructive: A measurable fuctio is the limit of a coverget sequece of simple fuctios where a simple fuctio is a fuctio of the form X = x j I Ej i=1 where the E j are measurable sets i.e. E j W. escriptive: A measurable fuctio is a fuctio such that the iverse image of ay Borel set i B is a measurable set i W. Theorem 4.1 The costructive ad descriptive defiitios are equivalet. Moreover the set of all measurable fuctios is closed uder the usual operatios of aalysis Proof: See Loeve page 107-108. Fall2004 10 supp2.tex

Theorem 4.2 A cotiuous fuctio of a measurable fuctio is measurable. The class of fuctios cotaiig all cotiuous fuctios ad closed uder limits are called Baire fuctios. It follows that Baire fuctios of measurable fuctios are also measurable. Proof: See Loeve page 108-109. Referece: Loeve, M. 1960 Probability Theory Secod Editio Va ostrad Theorem 4.3 A fuctio X = X 1,X 2,...,X from Ω to R is measurable if ad oly if each of the coordiates X 1,X 2,...,X is measurable. Proof: See Loeve page 108-109. Referece: Loeve, M. 1960 Probability Theory Secod Editio Va ostrad It follows that the class of measurable fuctios or radom variables is rich eough to iclude all radom variables of potetial iterest. Much like the fact that the class of Borel sets is rich eough to cotai all evets of iterest ad still be compatible with the eed for probabilities to be coutably additive. Fall2004 11 supp2.tex

example 4.1 efie the sigum fuctio by sgmx = +1 if x > 0 0 if x + 0 1 if x < 0 ad ote that the sigum fuctio is ot cotiuous at x = 0. efie the fuctio f x by f x = mi1,x x 0 max 1,x x < 0 ote that f 0 = 0 ad that f x is cotiuous for every x. However lim f x = sgmx which is ot cotiuous at x = 0 i.e. the limit of a sequece of cotiuous fuctios is ot ecessarily cotiuous. Referece: Couterexamples i Aalysis,1964 Gelbaum ad Olsted, Holde- ay Ic. page 77 Fall2004 12 supp2.tex

example 4.2 efie the fuctio f x by f x = mi, 1 x if 0 < x 1 0 if x = 0 The f x is bouded o the iterval [0,1]. However the fuctio fx = lim f x = 1 x if 0 < x 1 0 if x = 0 is ubouded. Thus the limit of a sequece of bouded fuctios is ot ecessarily bouded. Referece: Couterexamples i Aalysis,1964 Gelbaum ad Olsted, Holde- ay Ic. page 77 Fall2004 13 supp2.tex