Example. If 4 tickets are drawn with replacement from ,

Similar documents
P (second card 7 first card King) = P (second card 7) = %.

The Central Limit Theorem

Conditional Probability

Senior Math Circles November 19, 2008 Probability II

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2

MAT Mathematics in Today's World

Problem # Number of points 1 /20 2 /20 3 /20 4 /20 5 /20 6 /20 7 /20 8 /20 Total /150

LECTURE 1. 1 Introduction. 1.1 Sample spaces and events

Probability Year 9. Terminology

Intermediate Math Circles November 8, 2017 Probability II

2.6 Tools for Counting sample points

Lecture 8: Conditional probability I: definition, independence, the tree method, sampling, chain rule for independent events

Probability Year 10. Terminology

Q1 Own your learning with flash cards.

1 The Basic Counting Principles

Chapter 7: Section 7-1 Probability Theory and Counting Principles

Combinatorial Analysis

STA Module 4 Probability Concepts. Rev.F08 1

Example. What is the sample space for flipping a fair coin? Rolling a 6-sided die? Find the event E where E = {x x has exactly one head}

Probability and Independence Terri Bittner, Ph.D.

A Event has occurred

Discussion 01. b) What is the probability that the letter selected is a vowel?

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

Discrete and continuous

Probability and Statistics Notes

Introductory Probability

Week 2: Probability: Counting, Sets, and Bayes

Year 10 Mathematics Probability Practice Test 1

CS 361: Probability & Statistics

Chapter 6: Probability The Study of Randomness

Math 180B Problem Set 3

Chapter 8: An Introduction to Probability and Statistics

1 Probability Theory. 1.1 Introduction

Mutually Exclusive Events

Notes 1 Autumn Sample space, events. S is the number of elements in the set S.)

Mathematical Foundations of Computer Science Lecture Outline October 18, 2018

Political Science Math Camp: Problem Set 2

University of California, Berkeley, Statistics 134: Concepts of Probability. Michael Lugo, Spring Exam 1

Problems from Probability and Statistical Inference (9th ed.) by Hogg, Tanis and Zimmerman.

Notes Week 2 Chapter 3 Probability WEEK 2 page 1

Topic -2. Probability. Larson & Farber, Elementary Statistics: Picturing the World, 3e 1

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

Properties of Probability

Econ 325: Introduction to Empirical Economics

Probabilistic models

Axioms of Probability

Examples of frequentist probability include games of chance, sample surveys, and randomized experiments. We will focus on frequentist probability sinc

REPEATED TRIALS. p(e 1 ) p(e 2 )... p(e k )

Introduction to Probability. Ariel Yadin. Lecture 1. We begin with an example [this is known as Bertrand s paradox]. *** Nov.

Part 3: Parametric Models

Elementary Discrete Probability

Chapter 2 Class Notes

Name: Firas Rassoul-Agha

Lecture 8 Sampling Theory

Review of Basic Probability

Dynamic Programming Lecture #4

Lesson B1 - Probability Distributions.notebook

I. Introduction to probability, 1


Essential Statistics Chapter 4

Lecture 2 Binomial and Poisson Probability Distributions

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

*Karle Laska s Sections: There is no class tomorrow and Friday! Have a good weekend! Scores will be posted in Compass early Friday morning

2011 Pearson Education, Inc

Independence. P(A) = P(B) = 3 6 = 1 2, and P(C) = 4 6 = 2 3.

STAT 430/510 Probability

ELEG 3143 Probability & Stochastic Process Ch. 1 Probability

Discrete Probability

Chapter 1: Revie of Calculus and Probability

The Central Limit Theorem

Probability, For the Enthusiastic Beginner (Exercises, Version 1, September 2016) David Morin,

MA 250 Probability and Statistics. Nazar Khan PUCIT Lecture 15

3.2 Probability Rules

STAT200 Elementary Statistics for applications

Chapter 5 : Probability. Exercise Sheet. SHilal. 1 P a g e

If S = {O 1, O 2,, O n }, where O i is the i th elementary outcome, and p i is the probability of the i th elementary outcome, then

STT 315 Problem Set #3

MA : Introductory Probability

Chapter 2: Probability Part 1

Some Review Problems for Exam 3: Solutions

1 Preliminaries Sample Space and Events Interpretation of Probability... 13

I - Probability. What is Probability? the chance of an event occuring. 1classical probability. 2empirical probability. 3subjective probability

Chapter 7: Hypothesis Testing

Probability Theory. Introduction to Probability Theory. Principles of Counting Examples. Principles of Counting. Probability spaces.

Statistics for Engineers Lecture 2

Probability and random variables

Counting principles, including permutations and combinations.

10.1. Randomness and Probability. Investigation: Flip a Coin EXAMPLE A CONDENSED LESSON

CISC 1100/1400 Structures of Comp. Sci./Discrete Structures Chapter 7 Probability. Outline. Terminology and background. Arthur G.

Probabilistic models

Introduction to Probability 2017/18 Supplementary Problems

Probability and random variables. Sept 2018

1. When applied to an affected person, the test comes up positive in 90% of cases, and negative in 10% (these are called false negatives ).

STAT 111 Recitation 1

MATH STUDENT BOOK. 12th Grade Unit 9

Axioms of Probability. Set Theory. M. Bremer. Math Spring 2018

Probability and Probability Distributions. Dr. Mohammed Alahmed

UNIT NUMBER PROBABILITY 6 (Statistics for the binomial distribution) A.J.Hobson

Probability. Chapter 1 Probability. A Simple Example. Sample Space and Probability. Sample Space and Event. Sample Space (Two Dice) Probability

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

Transcription:

Example. If 4 tickets are drawn with replacement from 1 2 2 4 6, what are the chances that we observe exactly two 2 s? Exactly two 2 s in a sequence of four draws can occur in many ways. For example, ( 2 - - - 2 ), ( 2-2 - - ), ( 2 - - 2 - ), and so on. (where means not 2 ). Two key observations: (i) All these different sequences are mutually exclusive of each other... If we observe the sequence ( 2 - - 2 - ), then we do not observe the sequence ( 2 - - - 2 ). (ii) The probability of observing each of these individual sequences is the same for all of them, because multiplication is commutative, i.e., 2 5 3 5 3 5 2 5 = 2 5 2 5 3 5 3 5 = 2 5 3 5 2 5 3 5 = = 5.76% 1

This means that P (exactly two 2 s in four draws) = Now we have to figure out what N is... Observations. N = number of sequences with two 2 s {}}{ 5.76% + 5.76% + 5.76% + + 5.76% = N 5.76% (i) We don t care which tickets go in the not 2 spots. (ii) Since we are listing all of the possible 2-2 methodical. sequences, we can be (iii) When listing different 2-2 sequences, all we have to decide is where in each sequence to put the 2 s = the not 2 s will go in the other spots. The number of different sequences with two 2 s is equal to the number of ways to choose two positions in a sequence of four. 2

There are 4 positions in which we can place the first 2, and for each choice of first position, there are 3 ways to choose the second position... So it seems that there are 4 3 = 12 ways to place two 2 s in a sequence of four draws... But we are overcounting, because each pair of positions has been counted twice! For example, the choices first 2 in the third position and second 2 in the first position and first 2 in the first position and second 2 in the third position result in the same pair of positions first and third. Conclusion: The number of sequences with exactly two 2 s is N = 4 3 2 = 6 So P (exactly two 2 s in four draws) = 6 5.76% = 34.56%. 3

More general question: If n tickets are drawn at random with replacement from the box 1 2 2 4 6, what are the chances that exactly k of them will be 2 s? The reasoning that we used when n = 4 and k = 2 can be used to answer this question too. (*) The results of different draws are independent. (*) The probability of a 2 on any one draw is 2/5. ( ) (*) The probability of a not 2 on any one draw is 3/5. 4

Observation 1. The probability of any particular sequence of n draws which results in k 2 s and (n k) s is equal to k 2 s and (n k) s {}}{ 2 2 2 k (2/5) s and (n k) (3/5) s {}}{ 3 5 3 5 2 5 3 5 2 5 3 5 2 5 3 5 = ( ) 2 k 5 regardless of the order in which the tickets appear! ( ) n k 3 5 5

Observation 2. Different sequences of k 2 s and (n k) s (i.e., sequences that differ in at least one position (actually, at least two)) are mutually exclusive. This means that we can use the addition rule to conclude that P (exactly k 2 s in n draws) # of different sequences with exactly k 2 s and (n k) s {}}{ ( ) ( ) n k ( ) ( ) n k ( ) ( ) n k 2 3 2 3 2 3 = k + k + + k 5 5 5 5 5 5 ( ) ( ) n k 2 3 = N k. 5 5 N = number of different sequences with exactly k 2 s and (n k). 6

Next question: What is N? I.e., how many sequences of draws are there with k 2 s and (n k) s? (*) We only need to count the number of ways of choosing k positions for the 2 s among the n available positions. n positions and k 2 s { }} { 2 2 2 There are n (n 1) (n 2) (n k + 1) different ways that we can place the 2 s if the order matters: first 2, second 2, etc. But we don t care about the order in which the positions were chosen, so the number above is too big we are counting each of the possible sequences too many times. Every unordered set of k positions of the 2 s appears k! = k (k 1) 2 1 different times in the collection of ordered sets we counted above. 7

Observation 3. The number of sequences of n draws that result in k 2 s and (n k) s is ( ) n (n 1) (n 2) (n k + 1) n! n N = = k! (n k)! k! =. k ( Comment: n ) k is one of the standard ways of denoting this number. Another standard notation for this is n C k. Conclusion. If n tickets are drawn at random with replacement from the box 1 2 2 4 6, the probability of observing exactly k 2 s is P (exactly k 2 s in n draws) = ( ) n k ( ) 2 k 5 ( ) n k 3. 5 8

Comments: ( n k) is pronounced n choose k, and is also called a binomial coefficient. It is the number of different (unordered) subsets of size k that can be chosen from a set of n objects. ( n 0) = 1 by definition. ( ) ( n k = n n k). The binomial coefficients grow large quickly. For example, ( ) ( ) ( ) ( ) 10 10 20 20 = 120, = 252, = 1140, = 15504 3 5 3 5 and ( ) 100 30 = 29372339821610944823963760 The numbers ( n k) are called binomial coefficients because they appear in the binomial formula (a + b) n = ( n 0 ) a n + ( n 1 ) a n 1 b + + 9 ( ) n a n k b k + + k ( ) n b n. n

The general case. Suppose a box contains N tickets: 1 s and 0 s. And suppose that the probability of (randomly) drawing a 1 from the box is P ( 1 ) = p. The number of 1 s in the box is p N. The probability of drawing a 0 is 1 p. If n tickets are drawn at random with replacement from such a box, then the probability of observing exactly k 1 s (and (n k) 0 s) is ( ) n P (exactly k 1 s in n draws) = p k (1 p) n k. k Observation. For this question, the number N of tickets in the box is not important. What matters is the proportion p of 1 s in the box. 10

Coin tosses. If we have a box with one 1 and one 0, then the number of 1 s in n random draws with replacement from this box can be used to model the number of heads in n tosses of a fair coin. (*) The probability of observing k heads in n tosses of a fair coin is P (k heads in n tosses) = ( ) n k ( ) k 1 2 ( ) n k 1 = 2 ( ) n k ( ) n 1. 2 (*) Given a particular n, there are n + 1 possible values for k (i.e., 0, 1, 2,..., n) and the probabilities for each of these values can be displayed in a probability histogram. The values of k are arranged on the horizontal axis and we use the density scale on the vertical axis: the area of the bar above each value k gives the probability of observing exactly k heads in n tosses. 11

30% 25% 24.61% 20% 15% 10% Probability 20.51% 20.51% 11.72% 11.72% 5% 4.39% 4.39% 0.98% 0.98% 0.098% 0.098% 0 1 2 3 4 5 6 7 8 9 10 Number of H in ten tosses Probability histogram for the number of heads in 10 tosses of a fair coin. 12

We can read this histogram the same way that we do a histogram for data... (*) What is the probability of observing more than 7 heads in 10 tosses? More than 7 heads in 10 tosses means 8 heads, 9 heads or 10 heads, and these are all mutually exclusive events. So... P (more than 7 heads in 10 tosses) = P (8 heads) + P (9 heads) + P (10 heads) = area under histogram from 7.5 to 10.5 4.39% + 0.98% + 0.098% 5.47% (*) What is the probability of observing between 4 and 6 heads in 10 tosses? P (between 4 and 6 heads in 10 tosses) = P (4 heads) + P (5 heads) + P (6 heads) = area under histogram from 3.5 to 6.5 20.51% + 24.61% + 20.51% = 65.63% 13

A hint of things to come... 30% 25% 24.61% 20% 15% 10% Probability 20.51% 20.51% 11.72% 11.72% 5% 4.39% 4.39% 0.98% 0.98% 0.098% 0.098% 0 1 2 3 4 5 6 7 8 9 10... But first, more examples. Number of H in ten tosses 14

Example. 8 tickets are drawn at random with replacement from a box containing 8 tickets 6 1 s and 2 0 s. What is the probability that we will observe between 5 and 7 1 s? We can calculate the probability using (the addition rule and) the formula for binomial probabilities: ( ) ( ) ( ) ( ) P between 5 and 7 1 s = P 5 1 s + P 6 1 s + P 7 1 s ( ) ( ) ( ) 3 ( ) ( ) ( ) 2 8 3 1 8 3 1 = 5 + 6 5 4 4 6 4 4 ( ) ( ) ( ) 1 8 3 1 + 7 7 4 4 = 56 35 36 + 28 48 4 8 + 8 37 4 8 = 51516 0.786 = 78.6% 65536 15

(*) We can use a probability histogram in this case too: P ( ) between 5 and 7 1 s = area of histogram between 4.5 and 7.5 0.21 + 0.31 + 0.27 = 0.79 16

Question: What changes when the number of draws increases? (*) 80 tickets are drawn at random with replacement from the same box. What is the probability that we will observe between 55 and 65 1 s? We can give a precise answer using the binomial formula: P ( ) between 55 and 65 1 s in 80 draws ( ) ( ) ( ) ( ) 25 80 3 1 80 = 55 + 55 4 4 56 + ( 80 65 ) ( ) 3 65 4 ( ) 3 56 4 ( ) 24 1 + 4 ( ) 15 1 =...? 4 These days, evaluating expressions like this directly is easy with computers. For example, we can use on-line calculators like the one found here: http://stattrek.com/online-calculator/binomial.aspx Answer: 84.6%. 17

What about using a probability histogram for the number of 1 s observed in 80 random draws (with replacement) from the same box..? (*) Kind of hard to read. (*) Also - we have to calculate all the probabilities to draw the histogram! 18

Zooming in to the part of the histogram where 40 number of 1 s 80: (*) This part of the histogram accounts for more than 99.9999% of the total. (*) I.e., P (fewer than 40 1 s in 80 draws) < 0.0001%. 19

Another hint at things to come... 20