UNIT NUMBER PROBABILITY 6 (Statistics for the binomial distribution) A.J.Hobson

Similar documents
Probability Rules. MATH 130, Elements of Statistics I. J. Robert Buchanan. Fall Department of Mathematics

Intermediate Math Circles November 8, 2017 Probability II

The Central Limit Theorem

DEFINITION: IF AN OUTCOME OF A RANDOM EXPERIMENT IS CONVERTED TO A SINGLE (RANDOM) NUMBER (E.G. THE TOTAL

Chapter 18 Sampling Distribution Models

Random Variable. Discrete Random Variable. Continuous Random Variable. Discrete Random Variable. Discrete Probability Distribution

Chapter 2: Discrete Distributions. 2.1 Random Variables of the Discrete Type

MAT Mathematics in Today's World

VTU Edusat Programme 16


Lecture 2 Binomial and Poisson Probability Distributions

Math 180B Problem Set 3

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

1. Sample spaces, events and conditional probabilities. A sample space is a finite or countable set S together with a function. P (x) = 1.

4 Lecture 4 Notes: Introduction to Probability. Probability Rules. Independence and Conditional Probability. Bayes Theorem. Risk and Odds Ratio

success and failure independent from one trial to the next?

Probability Year 9. Terminology

Week 2. Section Texas A& M University. Department of Mathematics Texas A& M University, College Station 22 January-24 January 2019

Mathematics. ( : Focus on free Education) (Chapter 16) (Probability) (Class XI) Exercise 16.2

Chapter 8: An Introduction to Probability and Statistics

PROBABILITY DISTRIBUTIONS: DISCRETE AND CONTINUOUS

Binomial Distribution. Collin Phillips

Probability Year 10. Terminology

7.1 What is it and why should we care?

Homework 4 Solution, due July 23

Lecture Lecture 5

Probability, For the Enthusiastic Beginner (Exercises, Version 1, September 2016) David Morin,

9/6/2016. Section 5.1 Probability. Equally Likely Model. The Division Rule: P(A)=#(A)/#(S) Some Popular Randomizers.

Discrete Probability. Chemistry & Physics. Medicine

MATH MW Elementary Probability Course Notes Part I: Models and Counting

Problems from Probability and Statistical Inference (9th ed.) by Hogg, Tanis and Zimmerman.

Exam III #1 Solutions

Example. If 4 tickets are drawn with replacement from ,

Chapter 4: An Introduction to Probability and Statistics

10.1. Randomness and Probability. Investigation: Flip a Coin EXAMPLE A CONDENSED LESSON

Expectation MATH Expectation. Benjamin V.C. Collins, James A. Swenson MATH 2730

Name: Firas Rassoul-Agha

Math 493 Final Exam December 01

MULTINOMIAL PROBABILITY DISTRIBUTION

(c) P(BC c ) = One point was lost for multiplying. If, however, you made this same mistake in (b) and (c) you lost the point only once.

2.3 Estimating PDFs and PDF Parameters

27 Binary Arithmetic: An Application to Programming

18440: Probability and Random variables Quiz 1 Friday, October 17th, 2014

Statistics 100A Homework 1 Solutions

Chapter 6 Continuous Probability Distributions

2.6 Tools for Counting sample points

STA 247 Solutions to Assignment #1

UNIT 4 MATHEMATICAL METHODS SAMPLE REFERENCE MATERIALS

Statistics for Managers Using Microsoft Excel/SPSS Chapter 4 Basic Probability And Discrete Probability Distributions

Presentation on Theo e ry r y o f P r P o r bab a il i i l t i y

5 CORRELATION. Expectation of the Binomial Distribution I The Binomial distribution can be defined as: P(X = r) = p r q n r where p + q = 1 and 0 r n

Sections 5.1 and 5.2

Probability Density Functions and the Normal Distribution. Quantitative Understanding in Biology, 1.2

RVs and their probability distributions

SOL Warm-Up Graphing Calculator Active

Lecture 2. Binomial and Poisson Probability Distributions

IEOR 3106: Introduction to Operations Research: Stochastic Models. Professor Whitt. SOLUTIONS to Homework Assignment 1

Distribusi Binomial, Poisson, dan Hipergeometrik

Example. χ 2 = Continued on the next page. All cells


Chapter 2: Probability Part 1

8 Probability. An event occurs if, when the experiment is performed, the outcome is in that event.

Counting principles, including permutations and combinations.

Probability Distributions - Lecture 5

Entropy. Probability and Computing. Presentation 22. Probability and Computing Presentation 22 Entropy 1/39

Probability and Probability Distributions. Dr. Mohammed Alahmed

Example 1. The sample space of an experiment where we flip a pair of coins is denoted by:

CS 361: Probability & Statistics

1 INFO Sep 05

Probability Dr. Manjula Gunarathna 1

1 Random Experiments from Random Experiments

Lecture 13 (Part 2): Deviation from mean: Markov s inequality, variance and its properties, Chebyshev s inequality

Recap. The study of randomness and uncertainty Chances, odds, likelihood, expected, probably, on average,... PROBABILITY INFERENTIAL STATISTICS

Binomial and Poisson Probability Distributions

Combinations. April 12, 2006

Probabilistic Systems Analysis Spring 2018 Lecture 6. Random Variables: Probability Mass Function and Expectation

Statistics for Economists. Lectures 3 & 4

Math 1313 Experiments, Events and Sample Spaces

Random variables (discrete)

Preliminary Statistics Lecture 2: Probability Theory (Outline) prelimsoas.webs.com

Example 1. Assume that X follows the normal distribution N(2, 2 2 ). Estimate the probabilities: (a) P (X 3); (b) P (X 1); (c) P (1 X 3).

Expected Value 7/7/2006

Scientific Measurement

Probability Theory and Applications

Discrete probability and the laws of chance

Senior Math Circles November 19, 2008 Probability II

Tutorial 3: Random Processes

5.3 Conditional Probability and Independence

Twin primes are pairs of natural numbers (p, p + 2) such that both p and p + 2 are primes.

Lecture 4: Probability and Discrete Random Variables

1 Review of Probability

Mutually Exclusive Events

Examples of frequentist probability include games of chance, sample surveys, and randomized experiments. We will focus on frequentist probability sinc

Chapter 18. Sampling Distribution Models /51

Chapter 7: Section 7-1 Probability Theory and Counting Principles

Probability (10A) Young Won Lim 6/12/17

Topics in Probability Theory and Stochastic Processes Steven R. Dunbar. Binomial Distribution

X = X X n, + X 2

CME 106: Review Probability theory

Lecture 3. Discrete Random Variables

Transcription:

JUST THE MATHS UNIT NUMBER 19.6 PROBABILITY 6 (Statistics for the binomial distribution) by A.J.Hobson 19.6.1 Construction of histograms 19.6.2 Mean and standard deviation of a binomial distribution 19.6.3 Exercises 19.6.4 Answers to exercises

UNIT 19.6 - PROBABILITY 6 STATISTICS FOR THE BINOMIAL DISTRIBUTION 19.6.1 CONSTRUCTION OF HISTOGRAMS Elementary discussion on the presentation of data, in the form of frequency tables, histograms etc., usually involves experiments which are actually carried out. But we illustrate now how the binomial distribution may be used to estimate the results of a certain kind of experiment before it is performed. EXAMPLE For four coins, tossed 32 times, construct a histogram showing the expected number of occurrences of 0,1,2,3,4... heads. Solution Firstly, in a single toss of the four coins, the probability of head (or tail) for each coin is 1 2. The terms in the expansion of ( 1 + ) 4 1 2 2 give the probabilities of exactly 0,1,2,3 and 4 heads, respectively. The expansion is ( 1 2 + 1 ) 4 ( ) 1 4 ( ) 1 3 ( ( ) 1 1 2 ( ) ( ) 1 1 2 ( ) 1 4 + 4 + 6 + 4 +. 2 2 2 2) 2 2 2 2 That is, ( 1 2 + 1 ) 4 ( ) 1 4 (1 + 4 + 6 + 4 + 1), 2 2 showing that the probabilities of 0,1,2,3 and 4 heads in a single toss of four coins are 1, 1, 6, 1, and 1, respectively. 16 4 16 4 16 Therefore, in 32 tosses of four coins, we may expect 0 heads, twice; 1 head, 8 times; 2 heads, 12 times; 3 heads, 8 times and 4 heads, twice. The following histogram uses class-intervals for which each member is, in fact, situated at the mid-point: 1

12 8 Freq. 2 0 1 2 3 4 No. of heads in 32 tosses of 4 coins Notes: (i) The only reason that the above histogram is symmetrical in shape is that the probability of success and failure are equal to each other, so that the terms of the binomial expansion are, themselves, symmetrical. (ii) Since the widths of the class-intervals in the above histogram are 1, the areas of the rectangles are equal to their heights. Thus, for example, the total area of the first three rectangles represents the expected number of times of obtaining at most 2 heads in 32 tosses of 4 coins. 19.6.2 MEAN AND STANDARD DEVIATION OF A BINOMIAL DISTRIBUTION THEOREM If p is the probability of success of an event in a single trial and q is the probability of its failure, then the binomial distribution, giving the expected frequencies of 0,1,2,3,...n successes in n trials, has a mean of np and a standard deviation of npq, irrespective of the number of times the experiment is to be carried out. 2

Proof: (a) The Mean From the binomial expansion formula, (q + p) n = q n + nq n 1 p + n(n 1) q n 2 p 2 + n(n 1)(n 2) q n 3 p 3 +... + nqp n 1 + p n. 3! Hence, if the n trials are made N times, the average number of successes is equal to the following expression, multiplied by N, then divided by N: 0 q n + 1 nq n 1 p + 2 n(n 1) q n 2 p 2 + 3 That is, the mean is np ( q n 1 + (n 1)q n 2 p + n(n 1)(n 2) q n 3 p 3 +..(n 1) nqp n 1 + np n. 3! ) (n 1)(n 2) q n 3 p 2 +... + (n 1)qp n 2 + p n 1 2 = np(q + p) n 1 = np since q + p = 1. (b) The Standard Deviation For the standard deviation, we observe that, if f r is the frequency of r successes when the n trials are conducted N times, then We use this, first, to establish a result for n! f r = N (n r)!r! qn r p r. n r 2 f r. 3

For example, 0 2 f 0 = 0.Nq n = 0 = 0.f 0. 1 2 f 1 = 1.Nnq n 1 p = 1.f 1. 2 2 f 2 = 2Nn(n 1)q n 2 p 2 = Nn(n 1)q n 2 p 2 + Nn(n 1)p 2 q n 2 3 2 f 3 = 3N n(n 1)(n 2) q n 3 p 3 = N = 2f 2 + Nn(n 1)p 2 q n 2 ; n(n 1)(n 2) q n 3 p 3 + Nn(n 1)p 2 (n 2)q n 3 p 4 2 f 4 = 4N n(n 1)(n 2)(n 3) q n 4 p 4 = N 3! and, in general, when r 2, r 2 f r = N = 3f 3 + Nn(n 1)p 2 (n 2)q n 3 p; n(n 1)(n 2)(n 3) q n 4 p 4 +Nn(n 1)p 3! 2 (n 2)(n 3) = 4f 4 + Nn(n 1)p q n 4 p 2 ; 2 (n 2)(n 3) q n 4 p 2 n(n 1)(n 2)...(n r + 1) +Nn(n 1)p 2 q n r p r = rf r +Nn(n 1)p 2 (n 2)! (r 1)! (n r)!(r 2)! qn r p r 2. This result, together with those for 0 2.f 0 and 1 2 f 1, shows that n r 2 f r = n rf r + Nn(n 1)p 2 n r=2 (n 2)! (n r)!(r 2)! qn r p r 2. 4

That is, n r 2 f r = Nnp + Nn(n 1)p 2 (q + p) n 2 = Nnp + Nn(n 1)p 2, since q + p = 1. It was also established, in Unit 18.3, that the standard deviation of a set, x 1, x 2, x 3,... x m, of m observations, with a mean value of x, is given by the formula σ 2 = 1 m m i=1 x 2 i x 2, which, in the present case, may be written σ 2 = 1 N n r 2 f r 1 ( n N 2 rf r ) 2. Hence, σ 2 = 1 N ( Nnp + Nn(n 1)p 2 ) 1 N 2 (Nnp)2, which gives σ 2 = np + n 2 p 2 np 2 n 2 p 2 = np(1 p) = npq; and so, σ = npq. 5

ILLUSTRATION For direct calculation of the mean and the standard deviation for the data in the previous coin-tossing problem, we may use the following table, in which x i denotes numbers of heads and f i denotes the corresponding expected frequencies: x i f i f i x i f i x 2 i 0 2 0 0 1 8 8 8 2 12 24 48 3 8 24 72 4 2 8 32 Totals 32 64 160 The mean is given by which agrees with np = 4 1 2. The standard deviation is given by x = 64 32 = 2 (obviously), [160 ] σ = 32 22 = 1, which agrees with npq = 4 1 2 1 2. Note: If the experiment were carried out N times instead of 32 times, all values in the last three columns of the above table would be multiplied by a factor of N, which would then cancel 32 out in the remaining calculations. EXAMPLE Three dice are rolled 216 times. Construct a binomial distribution and show the frequencies of occurrence for 0,1,2 and 3 sixes. Evaluate the mean and the standard deviation of the distribution. 6

Solution First of all, the probability of success in obtaining a six with a single throw of a die is 1 6, and the corresponding probability of failure is 5 6. For a single throw of three dice, we require the expansion ( 1 6 + 5 ) 3 ( ) 1 3 ( ) 1 2 ( ( ) ( ) 5 1 5 2 ( ) 5 3 + 3 + 3 +, 6 6 6 6) 6 6 6 showing that the probabilities of 0,1,2 and 3 sixes are 125 216, 75 216, 15 216 and 1 216, respectively. Hence, in 216 throws of the three dice we may expect 0 sixes, 125 times; 1 six, 75 times; 2 sixes, 15 times and 3 sixes, once. The corresponding histogram is as follows: 125 Freq. 75 15 0 1 2 3 No. of sixes in 216 throws of 3 dice From the previous Theorem, the mean value is 3 1 6 = 1 2 and the standard deviation is 3 1 6 5 6 = 15 6. 7

19.6.3 EXERCISES 1. Four dice are rolled 81 times. If less than 5 on a die is considered to be a success, and everything else a failure, (a) draw the corresponding histogram for the expected frequencies of success; (b) determine the expected number of times of obtaining at least three successes among the four dice; (c) shade the area of the histogram which is a measure of the result in (c); (d) calculate the mean and the standard deviation of the frequency distribution in (a). 2. In a seed-viability test, 450 seeds were placed on a filter-paper in 90 rows of 5. The number of seeds that germinated in each row were counted, and the results were as follows: No. of seeds germinating per row 0 1 2 3 4 5 Observed Frequency of rows 0 1 11 30 38 10 If the germinating seeds were distributed, at random, among the rows, we would expect a binomial distribution with an index of n = 5. Determine (a) the average number of seeds germinating per row; (b) the probability of a single seed germinating; (c) the expected frequencies of rows for each number of seeds germinating; Draw the histogram for the expected frequencies and the histogram for the observed frequencies. 8

19.6.4 ANSWERS TO EXERCISES 1. (a) The histogram is as follows: 32 Freq. 24 16 8 1 0 1 2 3 4 No. of less than 5 s in 81 throws of 4 dice (b) Expected frequency of 3 or 4 successes = 32 + 16 = 48; (c) Shade the last two rectangles on the right of the histogram; (d) Mean = 4 2 3 = 8 3 and Standard Deviation = 4 2 3 1 3 = 2 2 3. 2. (a) Average number of seeds germinating per row is 325 = 3.50; 90 (b) Probability of a single seed germinating is 3.5 = 7 ; 5 10 (c) Expected frequencies for 0,1,2,3,4,5 seeds are 0.2187, 2.5515, 11.9070, 27.7830, 32.4135, 15.1263 respectively. (d) The histograms are as follows: 40 Estimated no. of rows 30 20 10 0 1 2 3 4 No. of seeds germinating 9

40 Observed no. of rows 30 20 10 0 1 2 3 4 No. of seeds germinating 10