PROBABILITY. Contents Preface 1 1. Introduction 2 2. Combinatorial analysis 5 3. Stirling s formula 8. Preface

Similar documents
PROBABILITY VITTORIA SILVESTRI

Probability and Statistics. Vittoria Silvestri

PROBABILITY VITTORIA SILVESTRI

Lecture Notes 1 Basic Probability. Elements of Probability. Conditional probability. Sequential Calculation of Probability

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

MATH MW Elementary Probability Course Notes Part I: Models and Counting

UCSD CSE 21, Spring 2014 [Section B00] Mathematics for Algorithm and System Analysis

Statistical Inference

Discrete Mathematics & Mathematical Reasoning Chapter 6: Counting

2. AXIOMATIC PROBABILITY

CMPSCI 240: Reasoning about Uncertainty

Lecture 8: Conditional probability I: definition, independence, the tree method, sampling, chain rule for independent events

Lecture 3. Probability and elements of combinatorics

the time it takes until a radioactive substance undergoes a decay

EE 178 Lecture Notes 0 Course Introduction. About EE178. About Probability. Course Goals. Course Topics. Lecture Notes EE 178

Lecture 4: Counting, Pigeonhole Principle, Permutations, Combinations Lecturer: Lale Özkahya

Discrete Probability

ELEG 3143 Probability & Stochastic Process Ch. 1 Probability

Solution Set for Homework #1

6 CARDINALITY OF SETS

Lecture 1: An introduction to probability theory

= 2 5 Note how we need to be somewhat careful with how we define the total number of outcomes in b) and d). We will return to this later.

Notes. Combinatorics. Combinatorics II. Notes. Notes. Slides by Christopher M. Bourke Instructor: Berthe Y. Choueiry. Spring 2006

Chapter 2 Class Notes

Properties of Probability

Probability, Random Processes and Inference

Econ 325: Introduction to Empirical Economics

1 Basic Combinatorics

Mathematical Probability

1. Discrete Distributions

Notes on statistical tests

Undergraduate Probability I. Class Notes for Engineering Students

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 14

Conditional Probability

P (A B) P ((B C) A) P (B A) = P (B A) + P (C A) P (A) = P (B A) + P (C A) = Q(A) + Q(B).

Lecture 1. Chapter 1. (Part I) Material Covered in This Lecture: Chapter 1, Chapter 2 ( ). 1. What is Statistics?

Lecture 1. ABC of Probability

Chapter 2 PROBABILITY SAMPLE SPACE

Axiomatic Foundations of Probability. Definition: Probability Function

Introduction and basic definitions

Independence. P(A) = P(B) = 3 6 = 1 2, and P(C) = 4 6 = 2 3.

Lectures on Elementary Probability. William G. Faris

Module 1. Probability

STAT 516 Answers Homework 2 January 23, 2008 Solutions by Mark Daniel Ward PROBLEMS. = {(a 1, a 2,...) : a i < 6 for all i}

Probabilistic models

3/15/2010 ENGR 200. Counting

STAT Chapter 3: Probability

HW2 Solutions, for MATH441, STAT461, STAT561, due September 9th

Chapter 3 : Conditional Probability and Independence

The Inclusion Exclusion Principle

Given a experiment with outcomes in sample space: Ω Probability measure applied to subsets of Ω: P[A] 0 P[A B] = P[A] + P[B] P[AB] = P(AB)

Lecture 2 31 Jan Logistics: see piazza site for bootcamps, ps0, bashprob

Part IA Probability. Definitions. Based on lectures by R. Weber Notes taken by Dexter Chua. Lent 2015

Probability 1 (MATH 11300) lecture slides

Probability Theory. 1 Review Questions in Combinatorics. Exercises

MATH 556: PROBABILITY PRIMER

1 Combinatorial Analysis

Problem # Number of points 1 /20 2 /20 3 /20 4 /20 5 /20 6 /20 7 /20 8 /20 Total /150

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability

HW MATH425/525 Lecture Notes 1

1. How many labeled trees are there on n vertices such that all odd numbered vertices are leaves?

Probability Theory and Random Variables

MAT2377. Ali Karimnezhad. Version September 9, Ali Karimnezhad

Discrete Probability. Mark Huiskes, LIACS Probability and Statistics, Mark Huiskes, LIACS, Lecture 2

tossing a coin selecting a card from a deck measuring the commuting time on a particular morning

Probability theory basics

5. Conditional Distributions

Economics 204 Fall 2011 Problem Set 1 Suggested Solutions

CIS 2033 Lecture 5, Fall

MATH 220 (all sections) Homework #12 not to be turned in posted Friday, November 24, 2017

MAT 271E Probability and Statistics

Number Theory and Counting Method. Divisors -Least common divisor -Greatest common multiple

Probability, For the Enthusiastic Beginner (Exercises, Version 1, September 2016) David Morin,

Lecture Notes 1 Probability and Random Variables. Conditional Probability and Independence. Functions of a Random Variable

Probabilistic models

Statistical Theory 1

Chapter. Probability

2. Counting and Probability

Sets. A set is a collection of objects without repeats. The size or cardinality of a set S is denoted S and is the number of elements in the set.

Elementary Discrete Probability

Probability. Part 1 - Basic Counting Principles. 1. References. (1) R. Durrett, The Essentials of Probability, Duxbury.

Binomial Coefficient Identities/Complements

Mathematical Structures Combinations and Permutations

STAT 430/510 Probability

Monty Hall Puzzle. Draw a tree diagram of possible choices (a possibility tree ) One for each strategy switch or no-switch

Probability Theory Review

Conditional Probability, Independence and Bayes Theorem Class 3, Jeremy Orloff and Jonathan Bloom

If S = {O 1, O 2,, O n }, where O i is the i th elementary outcome, and p i is the probability of the i th elementary outcome, then

Discrete Finite Probability Probability 1

Lecture Lecture 5

Notes 1 : Measure-theoretic foundations I

Why study probability? Set theory. ECE 6010 Lecture 1 Introduction; Review of Random Variables

Course: ESO-209 Home Work: 1 Instructor: Debasis Kundu

Definition: Let S and T be sets. A binary relation on SxT is any subset of SxT. A binary relation on S is any subset of SxS.

MASSACHUSETTS INSTITUTE OF TECHNOLOGY 6.436J/15.085J Fall 2008 Lecture 3 9/10/2008 CONDITIONING AND INDEPENDENCE

Senior Math Circles March 3, 2010 Counting Techniques and Probability II

Math 5010 Introduction to Probability. Davar Khoshnevisan University of Utah Firas Rassoul-Agha University of Utah

Sample Spaces, Random Variables

Lecture 8: Probability

Transcription:

PROBABILITY VITTORIA SILVESTRI Contents Preface. Introduction. Combinatorial analysis 5 3. Stirling s formula 8 Preface These lecture notes are for the course Probability IA, given in Lent 09 at the University of Cambridge. The contents are closely based on the following lecture notes, all available online: James Norris: http://www.statslab.cam.ac.uk/ james/lectures/p.pdf Douglas Kennedy: http://trin-hosts.trin.cam.ac.uk/fellows/dpk0/ia/iaprob.html Richard Weber: http://www.statslab.cam.ac.uk/ rrw/prob/prob-weber.pdf Appropriate books for this course are: W. Feller An Introduction to Probability Theory and its Applications, Vol. I. Wiley 968 G. Grimmett and D. Welsh Probability: An Introduction. Oxford University Press nd Edition 04 S. Ross A First Course in Probability. Prentice Hall 009 D.R. Stirzaker Elementary Probability. Cambridge University Press 994/003 Please notify vs358@cam.ac.uk for comments and corrections.

VITTORIA SILVESTRI. Introduction This course concerns the study of experiments with random outcomes, such as rolling a die, tossing a coin or drawing a card from a standard deck. Say that the set of possible outcomes is Ω = {ω, ω, ω 3,...}. We call Ω sample space, while its elements are called outcomes. A subset A Ω is called an event. Example. (Rolling a die. Toss a normal six-faced die: the sample space is Ω = {,, 3, 4, 5, 6}. Examples of events are: {5} (the outcome is 5 {, 3, 5} (the outcome is odd {3, 6} (the outcome is divisible by 3 Example. (Drawing a card. Draw a card from a standard deck: Ω is the set of all possible cards, so that Ω = 5. Examples of events are: A = {the card is a Jack}, A = 4 A = {the card is Diamonds}, A = 3 A 3 = {the card is not the Queen of Spades}, A 3 = 5. Example.3 (Picking a natural number. Pick any natural number: the sample space is Ω = N. Examples of events are: {the number is at most 5} = {0,,, 3, 4, 5} {the number is even} = {, 4, 6, 8...} {the number is not 7} = N \ {7}. Example.4 (Picking a real number. Pick any number in the closed interval [0, ]: the sample space is Ω = [0, ]. Examples of events are: {x : x < /3} = [0, /3 {x : x 0.7} = [0, ] \ {0.7} = [0, 0.7 (0.7, ] {x : x = n for some n N} = {, /, /4, /8...}. Note that the sample space Ω is finite in the first two examples, infinite but countable in the third example, and uncountable in the last example. Remark.5. For the first part of the course we will restrict to countable sample spaces, thus excluding Example.4 above.

PROBABILITY 3 We can now give a general definition. Definition.6 (Probability space. Let Ω be any set, and F be a set of subsets of Ω. We say that F is a σ-algebra if - Ω F, - if A F, then A c F, - for every sequence (A n n in F, it holds n= A n F. Assume that F is a σ-algebra. A function P : F [0, ] is called a probability measure if - P(Ω =, - for any sequence of disjoint events (A n n it holds ( P A n = P(A n. n= The triple (Ω, F, P is called a probability space. Remark.7. In the case of countable state space we take F to be the set of all subsets of Ω, unless otherwise stated. We think of F as the collection of observable events. If A F, then P(A is the probability of the event A. In some probability models, such as the one in Example.4, the probability of each individual outcome is 0. This is one reason why we need to specify probabilities of events rather than outcomes... Equally likely outcomes. The simplest case is that of a finite (non-empty sample space n= Ω = {ω, ω... ω Ω }, Ω < and equally likely outcomes: P(A = A Ω Note that taking A = {w i } we find k= P({ω i } = Ω A F. i Ω, thus all outcomes are equally likely. To check that P is a probability measure, note that P(Ω = Ω / Ω =, and for disjoint events (A k n k= it holds ( n P A k = A A... A n n A k n = = P(A k, Ω Ω as wanted. Moreover, we have the following properties: k= k=

4 VITTORIA SILVESTRI P( = 0, if A B then P(A P(B, P(A B = P(A + P(B P(A B, P(A c = P(A. Example.8. When rolling a fair die there are 6 possible outcomes, all equally likely. Then Ω = {,, 3, 4, 5, 6}, P({i} = /6 for i =... 6. So P(even outcome = P({, 4, 6} = /, while P(outcome 5 = P({,, 3, 4, 5} = 5/6. Example.9 (Largest digit. Consider a string of random digits 0,... 9 of length n. For 0 k 9, what is the probability that the largest digit is at most k? And exactly k? We model the set of all possible strings by Ω = {0,... 9} n, so that Ω = 0 n, and all elements of Ω are equally likely. Let A k denote the event that none of the digit exceeds k. Then A k = (k + n, so P(A k = A k (k + n = Ω 0 n. To answer the second question, let B k denote the event that the largest digit equals k, and note that B k = A k \ A k. Since A k A k, it follows that P(B k = P(A k P(A k = (k + n k n 0 n. Example.0 (The birthday problem. Suppose there are n people in the room. What is the probability that at least two of them share the same birthday? To answer the question, we assume that no-one was born on the 9th of February, the other dates all being equally likely. Then Ω = {,... 365} n and Ω = 365 n. Let A n denote the event that at least two people share the same birthday. If n > 365 then P(A n =, so we restrict to n 365. Then P(A n = P(A c 365 364 (365 n + n = 365 n. You can check that P(A n / as soon as n 3.

PROBABILITY 5. Combinatorial analysis We have seen that it is often necessary to be able to count the number of subsets of Ω with a given property. We now take a systematic look at some counting methods... Multiplication rule. Take N finite sets Ω, Ω... Ω N (some of which might coincide, with cardinalities Ω k = n k. We imagine to pick one element from each set: how many possible ways do we have to do so? Clearly, we have n choices for the first element. Now, for each choice of the first element, we have n choices for the second, so that Ω Ω = n n. Once the first two element, we have n 3 choices for the third, and so on, giving We refer to this as the multiplication rule. Ω Ω... Ω N = n n n N. Example. (The number of subsets. Suppose a set Ω = {ω, ω... ω n } has n elements. How many subsets does Ω have? We proceed as follows. To each subset A of Ω we can associate a sequence of 0 s and s of length n so that the i th number is if ω i is in A, and 0 otherwise. Thus if, say, Ω = {ω, ω, ω 3, ω 4 } then A = {ω }, 0, 0, 0 A = {ω, ω 3, ω 4 }, 0,, A 3 = 0, 0, 0, 0. This defines a bijection between the subsets of Ω and the strings of 0 s and s of length n. Thus we have to count the number of such strings. Since for each element we have choices (either 0 or, there are n strings. This shows that a set of n elements has n subsets. Note that this also counts the number of functions from a set of n elements to {0, }... Permutations. How many possible orderings of n elements are there? Label the elements {,... n}. A permutation is a bijection from {,... n} to itself, i.e. an ordering of the elements. We may obtain all permutations by subsequently choosing the image of element, then the image of element and so on. We have n choices for the image of, then n choices for the image of, n choices for the image of 3 until we have only one choice for the image of n. Thus the total number of choices is, by the multiplication rule, n! = n(n (n. Thus there are n! different orderings, or permutations, of n elements. Equivalently, there are n! different bijections from any two sets of n elements. Example.. There are 5! possible orderings of a standard deck of cards.

6 VITTORIA SILVESTRI.3. Subsets. How many ways are there to choose k elements from a set of n elements?.3.. With ordering. We have n choices for the first element, n choices for the second element and so on, ending with n k + choices for the k th element. Thus there are (. n(n (n k + = n! (n k! ways to choose k ordered elements from n. An alternative way to obtain the above formula is the following: to pick k ordered elements from n, first pick a permutation of the n elements (n! choices, then forget all elements but the first k. Since for each choice of the first k elements there are (n k! permutations starting with those k elements, we again obtain (...3.. Without ordering. To choose k unordered elements from n, we could first choose k ordered elements, and then forget about the order. Recall that there are n!/(n k! possible ways to choose k ordered elements from n. Moreover, any given k elements can be ordered in k! possible ways. Thus there are ( n n! = k k!(n k! possible ways to choose k unordered elements from n. More generally, suppose we have integers n, n... n k with n + n + + n k = n. Then there are ( n n! = n... n k n!... n k! possible ways to partition n elements in k subsets of cardinalities n,... n k..4. Subsets with repetitions. How many ways are there to choose k elements from a set of n elements, allowing repetitions?.4.. With ordering. We have n choices for the first element, n choices for the second element and so on. Thus there are n k = n n n possible ways to choose k ordered elements from n, allowing repetitions..4.. Without ordering. Suppose we want to choose k elements from n, allowing repetitions but discarding the order. How many ways do we have to do so? Note that naïvely dividing n k by k! doesn t give the right answer, since there may be repetitions. Instead, we count as follows. Label the n elements {,... n}, and for each element draw a each time it is picked. 3... n ** *... ***

PROBABILITY 7 Note that there are k s and n vertical lines. Now delete the numbers: (.... The above diagram uniquely identifies an unordered set of (possibly repeated k elements. Thus we simply have to count how many such diagrams there are. The only restriction is that there must be n vertical lines and k s. Since there are n + k locations, we can fix such a diagram by assigning the positions of the s, which can be done in ( n + k k ways. This therefore counts the number of unordered subsets of k elements from n, without ordering. Example.3 (Increasing and non decreasing functions. An increasing function from {,... k} to {,... n} is uniquely determined by its range, which is a subset of {,... n} of size k. Vice versa, each such subset determines a unique increasing function. This bijection tells us that there are ( n k increasing functions from {,... k} to {,... n}, since this is the number of subsets of size k of {,,... n}. How about non decreasing functions? There is a bijection from the set of non decreasing functions f : {,... k} {,... n} to the set of increasing functions g : {,... k} {,... n + k }, given by g(i = f(i + i for i k. Hence the number of such decreasing functions is ( n+k k. Example.4 (Ordered partitions. An ordered partition of k of size n is a sequence (k, k... k n of non-negative integers such that k + + k n = k. How many ordered partitions of k of size n are there? We give a graphic representation of each such partition as follows: draw k s followed by a vertical line, then k s followed by another vertical line and so on, closing with k n s, e.g. (, 0, 3, Note the analogy with (.. Now, since this determines a bijection, it again suffices to count the number of diagrams made of k s and n vertical line, which is ( n+k k.

8 VITTORIA SILVESTRI 3. Stirling s formula We have seen how factorials are ubiquitous in combinatorial analysis. It is therefore important to be able to provide asymptotics for n! as n becomes large, which is often the case of interest. Recall that we write a n b n to mean that a n /b n as n. Theorem 3. (Stirling s formula. As n we have n! πn n+/ e n. Note that this implies that log(n! log( πn n+/ e n as n. We now prove this weaker statement. Set l n = log(n! = log + log + + log n. Write x for the integer part of x. Then Integrate over the interval [, n] to get log x log x log x +. from which Integrating by parts we find from which we deduce that n n l n n log xdx l n log xdx l n n+ log xdx. log xdx = n log n n +, n log n n + l n (n + log(n + n. Since n log n n + n log n, (n + log(n + n n log n, dividing through by n log n and taking the limit as n we find that l n n log n. Since log( πn n+/ e n n log n, this concludes the proof.

Proof of Stirling s formula non-examinable. Note the identity b a f(xdx = PROBABILITY 9 f(a + f(b (b a b a (x a(b xf (xdx, which may be checked by integrating by parts twice the right hand side. Take f(x = log x to find k+ k log xdx = = log k + log(k + log k + log(k + for all k. Sum over k n to get from which n + + k+ k log xdx = log((n! + log(n! + 0 (x k(k + x x dx x( x (x + k dx n k= 0 x( x (x + k dx, n log n n + = log(n! n log n + x( x k= 0 (x + k dx }{{} a k Rearranging for log(n! we find Note that since = log(n! n log n + a k. log(n! = a k k k= ( n + n log n n + a k. we have k= a k <. We therefore define 0 A = exp k= x( xdx = k, ( a k k= and exponentiate both sides to find ( n! = An n+/ e n exp a k. Since as n, this gives a k 0 k=n k=n n! An n+/ e n.

0 VITTORIA SILVESTRI It only remains to show that A = π. Using the above asymptotic, we find ( n n n A n, so it suffices to show that To see this, set n ( n n I n = π/ 0 πn. cos n θdθ. Then I 0 = π/, I = and integrating by parts we obtain for all n. Thus I n = n n I n I n = n n... 3 ( n π 4 I 0 = n n, ( ( n n n I n+ = n n +... 4 5 3 I = But (I n n is decreasing in n, and I n I n = n n as wanted. ( n ( n n, so also I n +. n I n+ (n + π nπ,, and we conclude that Example 3.. Suppose we have 4n balls, of which n are red and n are black. We put them randomly into two urns, so that each urn contains n balls. What is the probability that each urn contains exactly n red balls and n black balls? Call this probability p n. Note that there are ( ( 4n n ways to distribute the balls into the two urns, n ( n n n of which will result in each urn containing exactly n balls of each color. Thus ( ( n n p n = ( n n (n! ( = 4n n! n 4 (4n! ( πe n (n n+/ πe n n n+/ 4 πe 4n (4n 4n+/ = πn. Statslab, Centre for Mathematical Sciences, University of Cambridge, Wilberforce Road, Cambridge CB3 0WB, UK