CHAPTER 18 SAMPLING DISTRIBUTION MODELS STAT 203

Similar documents
Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

CHAPTER 7. Parameters are numerical descriptive measures for populations.

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li)

Chapter 18 Sampling Distribution Models

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Chapter 15 Sampling Distribution Models

Sampling Distribution Models. Chapter 17

Probability and Discrete Distributions

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

STAT 203 Chapter 18 Sampling Distribution Models

Discrete Distributions

Sampling Distribution: Week 6

REPEATED TRIALS. p(e 1 ) p(e 2 )... p(e k )

P (A) = P (B) = P (C) = P (D) =

The Components of a Statistical Hypothesis Testing Problem

CS 361: Probability & Statistics

EXAM. Exam #1. Math 3342 Summer II, July 21, 2000 ANSWERS

The area under a probability density curve between any two values a and b has two interpretations:

Chapter 18. Sampling Distribution Models /51

Lecture 8 Sampling Theory

Sections 5.1 and 5.2

Chapter 1 Review of Equations and Inequalities

Sampling Distribution Models. Central Limit Theorem

Each trial has only two possible outcomes success and failure. The possible outcomes are exactly the same for each trial.

Lecture 11 - Tests of Proportions

CS 361: Probability & Statistics

The Central Limit Theorem

Confidence Intervals. Confidence interval for sample mean. Confidence interval for sample mean. Confidence interval for sample mean

IV. The Normal Distribution

The normal distribution Mixed exercise 3

Binomial random variable

CISC 1100/1400 Structures of Comp. Sci./Discrete Structures Chapter 7 Probability. Outline. Terminology and background. Arthur G.

Lecture #16 Thursday, October 13, 2016 Textbook: Sections 9.3, 9.4, 10.1, 10.2

Lecture 20 Random Samples 0/ 13

Overview. Confidence Intervals Sampling and Opinion Polls Error Correcting Codes Number of Pet Unicorns in Ireland

Stephen Scott.

S2 QUESTIONS TAKEN FROM JANUARY 2006, JANUARY 2007, JANUARY 2008, JANUARY 2009

PubH 5450 Biostatistics I Prof. Carlin. Lecture 13

Probability and Statistics for Engineers

EDEXCEL S2 PAPERS MARK SCHEMES AVAILABLE AT:

Discrete Mathematics and Probability Theory Fall 2013 Vazirani Note 12. Random Variables: Distribution and Expectation

Notes slides from before lecture. CSE 21, Winter 2017, Section A00. Lecture 16 Notes. Class URL:

Stat 139 Homework 2 Solutions, Spring 2015

Chapter 6. Estimates and Sample Sizes

Discrete Mathematics and Probability Theory Fall 2014 Anant Sahai Note 15. Random Variables: Distributions, Independence, and Expectations

Inferential Statistics. Chapter 5

Chapter 6: SAMPLING DISTRIBUTIONS

CSE 103 Homework 8: Solutions November 30, var(x) = np(1 p) = P r( X ) 0.95 P r( X ) 0.

PRACTICE PROBLEMS FOR EXAM 2

X = X X n, + X 2

ECE 313: Hour Exam I

COVENANT UNIVERSITY NIGERIA TUTORIAL KIT OMEGA SEMESTER PROGRAMME: ECONOMICS

Inference for Proportions, Variance and Standard Deviation

Example. If 4 tickets are drawn with replacement from ,

10.2: The Chi Square Test for Goodness of Fit

ELEG 3143 Probability & Stochastic Process Ch. 2 Discrete Random Variables

Stat 101: Lecture 12. Summer 2006

Independence Solutions STAT-UB.0103 Statistics for Business Control and Regression Models

AP Stats MOCK Chapter 7 Test MC

Conditional Probability, Independence and Bayes Theorem Class 3, Jeremy Orloff and Jonathan Bloom

7 Random samples and sampling distributions

Senior Math Circles November 19, 2008 Probability II

Chapter 18. Sampling Distribution Models. Bin Zou STAT 141 University of Alberta Winter / 10

Sociology 6Z03 Topic 10: Probability (Part I)

University of California, Berkeley, Statistics 134: Concepts of Probability. Michael Lugo, Spring Exam 1

Bayesian Models in Machine Learning

CENTRAL LIMIT THEOREM (CLT)

Midterm Exam 1 (Solutions)

Chapter 2 Class Notes

Unit 22: Sampling Distributions

The Central Limit Theorem

Chapter 8: Confidence Intervals

BINOMIAL DISTRIBUTION

Expected Value - Revisited

Bernoulli Trials, Binomial and Cumulative Distributions

P (E) = P (A 1 )P (A 2 )... P (A n ).

Discrete Mathematics and Probability Theory Fall 2012 Vazirani Note 14. Random Variables: Distribution and Expectation

Central Limit Theorem and the Law of Large Numbers Class 6, Jeremy Orloff and Jonathan Bloom

Theoretical Foundations

Chapter 3. Estimation of p. 3.1 Point and Interval Estimates of p

An-Najah National University Faculty of Engineering Industrial Engineering Department. Course : Quantitative Methods (65211)

Probability Distributions

Discrete Mathematics for CS Spring 2007 Luca Trevisan Lecture 20

Statistics, Probability Distributions & Error Propagation. James R. Graham

CS 361: Probability & Statistics

Probability Distributions

STA Module 10 Comparing Two Proportions

MAT Mathematics in Today's World

Math/Stat 352 Lecture 10. Section 4.11 The Central Limit Theorem

Chapter 23. Inference About Means

HW on Ch Let X be a discrete random variable with V (X) = 8.6, then V (3X+5.6) is. V (3X + 5.6) = 3 2 V (X) = 9(8.6) = 77.4.

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

(A) Incorrect! A parameter is a number that describes the population. (C) Incorrect! In a Random Sample, not just a sample.

Exam Empirical Methods VU University Amsterdam, Faculty of Exact Sciences h, February 12, 2015

Discrete Mathematics and Probability Theory Spring 2016 Rao and Walrand Note 16. Random Variables: Distribution and Expectation

Ling 289 Contingency Table Statistics

Probability COMP 245 STATISTICS. Dr N A Heard. 1 Sample Spaces and Events Sample Spaces Events Combinations of Events...

Example 1: Dear Abby. Stat Camp for the MBA Program

Transcription:

1 CHAPTER 18 SAMPLING DISTRIBUTION MODELS STAT 203

Outline 2 Sampling Distribution for Proportions Sample Proportions The mean The standard deviation The Distribution Model Assumptions and Conditions Sampling Distribution for a Mean The mean, standard deviation and Model The Central Limit Theorem Assumptions and Conditions The effect of Sample Size

But First, an experiment 3 If you can please bring a coin to class. The experiment: Each student will flip a coin 20 times and count the number of heads. Let us describe the distribution of the sample of proportions of heads. What do we know

Two Proportions 4 In each of our little experiments there are two proportions: The true proportion, p, where 0 p 1: This can be the probability of success (as in flipping a coin). It can also be the proportion of a population which has a desired characteristic (e.g. proportion of BC residents who voted in the last federal election). The sample proportion: For a randomly selected sample, each case is deemed a success or a failure and the sample proportion is given as ˆp = # of Successes Sample Size n

What s Random and What s Fixed 5 The population proportion p is fixed. The sample proportion P^ is used to estimate the population proportion (the parameter). Before moving on to estimation, we consider the situation where we know the population proportion. The idea is to view our sample as a realization of a random sample. Hence, our Sample Statistic is a realization of a random experiment. It s value is one of many possible values it could have taken. The range of possible values the sample statistics can take on, along with the probability of each value is called the Sampling Distribution.

6 Sampling Distribution for Sample Proportions 1 In the coin example, we saw we could determine the list of possible values of the sample statistic and the probabilities for each. Outside the scope of this course is the name of that distribution called: the Binomial Distribution. Using this distribution is quite cumbersome and as it turns out, often not necessary. It turns out that the Normal Model is quite good at approximating the sampling distribution of sample proportion, provided the sample size is large enough.

7 Sampling Distribution for Sample Proportions 2 Knowing that the sampling distribution is Normal is only half the battle we need the distribution parameters and. Recall that to iden.fy which member of the Normal distribu.on we are working with, we need to iden.fy the mean and standard devia.on. It turns out these are and So the sampling distribu.on of is

About Spread 8 In this chapter we pretend we know the true population proportion p and work out probabilities for. In situations where we don t know p, we consequently don t know the standard deviation. In such cases we need to estimate the standard deviation as well. - Standard deviation of. The true spread. - Standard error. An estimate of the standard deviation of the sampling distribution. SE(ˆp) = ˆp(1 ˆp) n

Conditions for using a Normal 9 Approximation The sample is randomly drawn from the population The sampled values must be independent. Individuals are drawn without replacement from the population, so independence can never be achieved. But this assumption is well validated as long as the sample size is no greater than 10% of the population size. The sample size needs to be large. It is sufficient to verify that both np 10 and n(1-p) 10.

Example: Our Experiment 10 In our coin example, what s the probability of flipping a coin 20 times and being off by more than 0.25 from p? What is the probability of being within 0.05 of the true proportion? If we flipped a coin 19 times, what would the probability of getting the sample proportion agreeing with the true proportion?

Defects 11 A company known for its cheap toys is being investigated for security purposes. Of the toys they produce, 21% are defective! A sample 154 is taken and tested. a) Describe the sampling distribution model for the sample proportion by naming the model and telling its mean and standard deviation. Justify your answer. b) What is the probability that in this sample over 20% of the toys will be found to be defective?

Aliens 12 Aliens have come to abduct 100 humans (randomly). The humans will be able to choose between participating in their circus or being put in their galactic zoo. Due to ethics, they can t force a human against there choice. They need to have at least 70 people in the circus. It turns out that 66% of humans prefer the circus. What is the approximate probability that their sample will not meet their demands?

Quantitative Data 13 Proportions are used to summarize categorical variables and the Normal Model is useful for these. Means summarize quantitative variables and the Normal Model can be used for these as well. Once again, we need to determine the parameters of this Normal Model. Here we are looking for the parameters of the Sampling Distribution not the population of interest

Sampling Distribution of the Mean- 1 14 The sampling distribution of means is the distribution of all the possible random samples of size n that could be selected from a population. Suppose a random sample of n subjects is to be drawn from a population, and the observation on a subject (y) in the population follows a distribution with mean µ and standard deviation σ. The mean of the sampling distribution of means is represented by, and is equal to

Sampling Distribution of the Mean- 2 15 The standard deviation of the sampling distribution of means is represented and given by Equivalently, let y 1, y 2,, y n be a random sample from a population with mean µ and standard deviation σ. The set of sample means in repeated random samples of size n from this population has mean µ and standard deviation equal to. When σ is unknown, is estimated by substituting σ by the sample SD s. This is the standard error of :

16 Conditions for using a Normal Approximation The sample is randomly drawn from the population The sampled values must be independent. The sample size is no greater than 10% of the population size. The sample size needs to be large. The rule of thumb is n>30.

The Central Limit Theorem 17 Sit back and think of how powerful what we ve just done is. Regardless of the distribution of the data could even be categorical the distribution of the mean (and proportion) follows an approximately Normal Distribution if certain (simple) conditions are met. That s impressive. The result that allows us to claim this is the Central Limit Theorem (CLT). For large sample size, the sample mean follows an approximately Normal Distribution.

Sample Size 18 There are two advantages to having a large sample size. As sample size increases, the sampling distribution becomes more and more Normal. As the sample size increases the SD and SE both decrease. In other words, our results become more and more precise.

Commercial Aircrafts 20 The ages of U.S. commercial aircraft have a mean of 13.0 years and a standard deviation of 7.9 years (based on data from Aviation Data Services). The Federal Aviation Administration randomly selects 36 commercial aircrafts for special stress tests. (a) Describe the sampling distribution of the mean age of a sample of 36 aircrafts. (b) Find the probability that the mean age of this sample group is greater than 15.0 years. (c) Is the probability calculated in part (b) an exact or an approximate probability? Justify your answer.

Cola 21 A bottling company uses a filling machine to fill plastic bottles with cola. A bottle should contain 300 ml. In fact, the contents vary, with mean 302 ml and standard deviation 3 ml. (a) What is the probability that an individual bottle contains less than 300 ml? (b) What is the probability that the mean contents of bottles in six six-packs is less than 300 ml? (c) What is the probability that one or more bottles are under-filled?

When is the CLT not required? 22 If the distribution of the population you are trying to study is Normal, then the distribution of the sample mean is also Normal. In such cases, the requirement of Sample Size is dropped, but the requirement of independence and randomness are still required. We know that the Birth weights after a regular gestation period follow a Normal Distribution. Using Vital Statistics for the United States, the population mean is found to 6.5 lbs with SD 0.72 lbs. A sample of 15 babies in Holland leads to a sample mean of 6.95 lbs. What is the probability of observing a sample mean as high or higher if the distribution of birth weights in Holland is the same as in the US?