Chapter 18. Sampling Distribution Models. Bin Zou STAT 141 University of Alberta Winter / 10

Similar documents
Probability and Samples. Sampling. Point Estimates

Chapter 15 Sampling Distribution Models

Chapter 22. Comparing Two Proportions. Bin Zou STAT 141 University of Alberta Winter / 15

Chapter 7. Scatterplots, Association, and Correlation

The Central Limit Theorem

Chapter 5. Understanding and Comparing. Distributions

STATISTICS 141 Final Review

TEST 1 M3070 Fall 2003

Sampling Distribution: Week 6

QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100%

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Stat 139 Homework 2 Solutions, Spring 2015

7.1 Sampling Error The Need for Sampling Distributions

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

Chapter 4. Displaying and Summarizing. Quantitative Data

Midterm 1 and 2 results

MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS MEASURES OF CENTRAL TENDENCY. In an experiment, these are applied to the dependent variable (DV)

Post-exam 2 practice questions 18.05, Spring 2014

Population 1 Population 2

Two Sample Problems. Two sample problems

Stat 231 Exam 2 Fall 2013

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Sampling Distribution Models. Chapter 17

CHAPTER 7. Parameters are numerical descriptive measures for populations.

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

University of California, Berkeley, Statistics 131A: Statistical Inference for the Social and Life Sciences. Michael Lugo, Spring 2012

# of 6s # of times Test the null hypthesis that the dice are fair at α =.01 significance

4.2 Probability Models

Econ 250 Fall 2006 Due November 7 in Class Assignment 2: Probability. Discrete and Continuous Random Variables

Harvard University. Rigorous Research in Engineering Education

STAT Chapter 9: Two-Sample Problems. Paired Differences (Section 9.3)

EXAM 3 Math 1342 Elementary Statistics 6-7

Chapter 22. Comparing Two Proportions 1 /29

ch 6 review with answers typed out and explained.notebook January 30, 2018

Lab #12: Exam 3 Review Key

STAT100 Elementary Statistics and Probability

Answers Part A. P(x = 67) = 0, because x is a continuous random variable. 2. Find the following probabilities:

Chapter 22. Comparing Two Proportions 1 /30

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Chapter 6: SAMPLING DISTRIBUTIONS

SAMPLING DISTRIBUTIONS

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Sampling distributions and the Central Limit. Theorem. 17 October 2016

Cumulative Frequency & Frequency Density

Determining the Spread of a Distribution

UNIVERSITY OF TORONTO. Faculty of Arts and Science APRIL - MAY 2005 EXAMINATIONS STA 248 H1S. Duration - 3 hours. Aids Allowed: Calculator

value mean standard deviation

Determining the Spread of a Distribution

Topic 3: Sampling Distributions, Confidence Intervals & Hypothesis Testing. Road Map Sampling Distributions, Confidence Intervals & Hypothesis Testing

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8

, 0 x < 2. a. Find the probability that the text is checked out for more than half an hour but less than an hour. = (1/2)2

Math 10 - Compilation of Sample Exam Questions + Answers

Practice problems from chapters 2 and 3

6.2A Linear Transformations

ECN221 Exam 1 VERSION B Fall 2017 (Modules 1-4), ASU-COX VERSION B

This is a multiple choice and short answer practice exam. It does not count towards your grade. You may use the tables in your book.

Business Statistics: A Decision-Making Approach 6 th Edition. Chapter Goals

Chapter 6. Estimates and Sample Sizes

Sampling Distribution Models. Central Limit Theorem

Sections 6.1 and 6.2: The Normal Distribution and its Applications

Chapter 18. Sampling Distribution Models /51

STAT Chapter 13: Categorical Data. Recall we have studied binomial data, in which each trial falls into one of 2 categories (success/failure).

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Stats Review Chapter 6. Mary Stangler Center for Academic Success Revised 8/16

6 THE NORMAL DISTRIBUTION

Prince Sultan University STAT 101 Final Examination Spring Semester 2008, Term 082 Monday, June 29, 2009 Dr. Quazi Abdus Samad

Tribhuvan University Institute of Science and Technology 2065

Final Exam STAT On a Pareto chart, the frequency should be represented on the A) X-axis B) regression C) Y-axis D) none of the above

A multi-state model for the prognosis of non-mild acute pancreatitis

Chapter 3 Data Description

The Normal Distribution Review

Introduction to Estimation. Martina Litschmannová K210

Chapter 1 - Lecture 3 Measures of Location

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

( ) P A B : Probability of A given B. Probability that A happens

are the objects described by a set of data. They may be people, animals or things.

In this chapter, you will study the normal distribution, the standard normal, and applications associated with them.

Chapter 6 Continuous Probability Distributions

Comparison of Two Population Means

Unit 2. Describing Data: Numerical

Problem Set #5. Due: 1pm on Friday, Nov 16th

STAT 200 Chapter 1 Looking at Data - Distributions

Lecture 17. Ingo Ruczinski. October 26, Department of Biostatistics Johns Hopkins Bloomberg School of Public Health Johns Hopkins University

Chapter 18: Sampling Distribution Models

BIOS 6222: Biostatistics II. Outline. Course Presentation. Course Presentation. Review of Basic Concepts. Why Nonparametrics.

Recap: Ø Distribution Shape Ø Mean, Median, Mode Ø Standard Deviations

Section 9 1B: Using Confidence Intervals to Estimate the Difference ( p 1 p 2 ) in 2 Population Proportions p 1 and p 2 using Two Independent Samples

Null Hypothesis Significance Testing p-values, significance level, power, t-tests

WISE International Masters

The area under a probability density curve between any two values a and b has two interpretations:

Sections 3.4 and 3.5

1 Statistical inference for a population mean

Sampling Distributions

You have 3 hours to complete the exam. Some questions are harder than others, so don t spend too long on any one question.

Raquel Prado. Name: Department of Applied Mathematics and Statistics AMS-131. Spring 2010

Hypothesis testing: Steps

Exam 2 (KEY) July 20, 2009

Sampling Distributions: Central Limit Theorem

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

Transcription:

Chapter 18 Sampling Distribution Models Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 1 / 10

Population VS Sample Example 18.1 Suppose a total of 10,000 patients in a hospital and 7,000 of them like to play basketball. A sample of 200 patients is selected from this hospital, and 128 of them like to play basketball. Find the proportion of patients who like to play basketball in the population and in the sample. Population parameter is a numerical measure, e.g., mean, median, variance, etc, of the given population. Sample statistic is a summary measure calculated from a sample data set. Remark: sample statistic is a random variable, since its value varies from sample to sample. The distribution of a sample statistic is called sampling distribution. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 2 / 10

Sample Proportion Population parameter: proportion p. e.g., proportion of students who pass the final in a class. Sample statistic: proportion ˆp. e.g., instead of considering all students, we rather select a sample to investigate. Then, ˆp apparently depends on the chosen sample. Assume we select a sample of n students, and k of them passed the exam. Hence, ˆp = k n. (different k and/or n give different ˆp.) Question? Assume true population parameter p is known, what is the distribution of ˆp? (which is called sampling distribution) Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 3 / 10

Central Limit Theorem for Sample Proportion CLT Given p (population proportion) and n (sample size), the sampling distribution of ˆp is ( ) pq N p,, where q = 1 p, n that is, ˆp follows a normal distribution with mean p and standard deviation pq n. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 4 / 10

Assumptions and Conditions for CLT 1 Independence Assumption: The sampled values must be independent of each other. 2 Sample Size Assumption: The sample size, n must be large enough. 3 Randomization Condition: The sample should be a simple random sample of the population. 4 10% Condition: The sample size, n, must be no larger than 10% of the population. 5 Success/failure Condition: n p 10 and n q 10. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 5 / 10

Example 18.2 A study shows that the proportion of people in age 20 to 34 with IQ over 120 is about 0.35. We randomly choose a sample of 50 people aged between 20 and 34. What is the probability there are more than 30 people with IQ over 120? Check assumptions: independence & large sample and conditions: randomization; 10%; and success/failure. Identify: p = 0.35 and n = 50. (q = 1 p = 0.65) Sampling distribution: ˆp N(0.35, 0.35 0.65/50 = 0.06745). more than 30 people with IQ over 120 = ˆp 30 50 = 0.6. ( P(ˆp 0.6) = P Z = ˆp ) 0.35 0.6 0.35 0.06745 0.06745 = 3.71 = 0.0001. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 6 / 10

Sample Mean Sampling distribution for Mean When a simple random sample is drawn from a population with mean µ and standard deviation σ, its sample mean ȳ has a sampling distribution with the same mean µ but new standard deviation is given by σ n. If the population is normally distributed, sampling distribution is exactly normally distributed. If not, then sampling distribution is only approximately normally distributed. And the larger the sample size, the closer the approximation. Remark: when we estimate the standard deviation of a sampling distribution, we call it standard error. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 7 / 10

Assumptions and Conditions for CLT 1 Independence Assumption: The sampled values must be independent of each other. 2 Sample Size Assumption: The sample size, n must be large enough. 3 Randomization Condition: The sample should be a simple random sample of the population. 4 10% Condition: The sample size, n, must be no larger than 10% of the population. 5 Large Enough Sample Condition: n 30. Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 8 / 10

Example 18.3 The scores of students on the ACT college entrance exam has a normal distribution with µ = 18.6 and σ = 5.9. (a) What is the probability that one randomly chosen student scores 21 or higher? (b) Now take a simple random sample of 50 students who took the test. What are the mean and standard deviation of ȳ and describe the shape of its sampling distribution? (c) What is the probability that the sample mean is 21 or higher? Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 9 / 10

Example 18.4 The duration of a disease from the onset of symptoms until death ranges from 3 to 20 years. The mean is 8 years and the standard deviation is 4 years. Looking at the average duration for 30 randomly selected patients, calculate the mean and standard deviation of ȳ and describe the shape of its sampling distribution. What is the probability that the average duration of those 30 patients is less than 7 years? Bin Zou (bzou@ualberta.ca) STAT 141 University of Alberta Winter 2015 10 / 10