Chapter 8 Sampling Distributions Defn Defn

Similar documents
Probability and Samples. Sampling. Point Estimates

(a) (i) Use StatCrunch to simulate 1000 random samples of size n = 10 from this population.

Unit 2. Describing Data: Numerical

Essential Question: What are the standard intervals for a normal distribution? How are these intervals used to solve problems?

CHAPTER 7 THE SAMPLING DISTRIBUTION OF THE MEAN. 7.1 Sampling Error; The need for Sampling Distributions

4.12 Sampling Distributions 183

4.2 The Normal Distribution. that is, a graph of the measurement looks like the familiar symmetrical, bell-shaped

Chapter 3. Measuring data

Lecture 8: Chapter 4, Section 4 Quantitative Variables (Normal)

The Chi-Square Distributions

Describing distributions with numbers

The Chi-Square Distributions

Math 361. Day 3 Traffic Fatalities Inv. A Random Babies Inv. B

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

MAT Mathematics in Today's World

a table or a graph or an equation.

Lesson 19: Understanding Variability When Estimating a Population Proportion

Chapter 5: Exploring Data: Distributions Lesson Plan

Lecture 10/Chapter 8 Bell-Shaped Curves & Other Shapes. From a Histogram to a Frequency Curve Standard Score Using Normal Table Empirical Rule

Lecture 22/Chapter 19 Part 4. Statistical Inference Ch. 19 Diversity of Sample Proportions

Elementary Statistics

Unit 22: Sampling Distributions

Lecture 8 Continuous Random Variables

Data Analysis and Statistical Methods Statistics 651

4/1/2012. Test 2 Covers Topics 12, 13, 16, 17, 18, 14, 19 and 20. Skipping Topics 11 and 15. Topic 12. Normal Distribution

Lecture 3. The Population Variance. The population variance, denoted σ 2, is the sum. of the squared deviations about the population

Density Curves & Normal Distributions

Review. A Bernoulli Trial is a very simple experiment:

Section 3.4 Normal Distribution MDM4U Jensen

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N

Descriptive Univariate Statistics and Bivariate Correlation

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability

Inverse Normal Distribution and Sampling Distributions

and the Sample Mean Random Sample

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

1 MA421 Introduction. Ashis Gangopadhyay. Department of Mathematics and Statistics. Boston University. c Ashis Gangopadhyay

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Chapter 3: Examining Relationships

CHAPTER 1 Univariate data

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Section Linear Correlation and Regression. Copyright 2013, 2010, 2007, Pearson, Education, Inc.

Supporting Australian Mathematics Project. A guide for teachers Years 11 and 12. Probability and statistics: Module 25. Inference for means

1 Probability Distributions

Normal Random Variables

Normal Random Variables

SAMPLING DISTRIBUTIONS

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

Chapter 6: SAMPLING DISTRIBUTIONS

Statistics and Data Analysis in Geology

Probability Distributions

Sampling, Frequency Distributions, and Graphs (12.1)

The Normal Distribution. Chapter 6

Name Date Chiek Math 12

Ø Set of mutually exclusive categories. Ø Classify or categorize subject. Ø No meaningful order to categorization.

Unit 4 Probability. Dr Mahmoud Alhussami

Perhaps the most important measure of location is the mean (average). Sample mean: where n = sample size. Arrange the values from smallest to largest:

STA 291 Lecture 16. Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal

(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.

STAT 200 Chapter 1 Looking at Data - Distributions

Quantitative Bivariate Data

MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS MEASURES OF CENTRAL TENDENCY. In an experiment, these are applied to the dependent variable (DV)

Expected Value - Revisited

Probability Distribution for a normal random variable x:

AP Statistics Semester I Examination Section I Questions 1-30 Spend approximately 60 minutes on this part of the exam.

Essential Statistics Chapter 6

Chapter 2: Summarizing and Graphing Data

Theoretical Foundations

Probability. Hosung Sohn

CHAPTER 5: EXPLORING DATA DISTRIBUTIONS. Individuals are the objects described by a set of data. These individuals may be people, animals or things.

Quiz 2 covered materials in Chapter 5 Chapter 6 Chapter 7. Normal Probability Distribution. Continuous Probability. distribution 11/9/2010.

Lecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari

Chapter 5: Exploring Data: Distributions Lesson Plan

Chapter 5. Understanding and Comparing. Distributions

Distribution of sample means

Chapter 18 Sampling Distribution Models

Lecture 10: The Normal Distribution. So far all the random variables have been discrete.

FREQUENCY DISTRIBUTIONS AND PERCENTILES

Lecture 1: Descriptive Statistics

STA Module 8 The Sampling Distribution of the Sample Mean. Rev.F08 1

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Statistics lecture 3. Bell-Shaped Curves and Other Shapes

Single Sample Means. SOCY601 Alan Neustadtl

Chapter 15 Sampling Distribution Models

Inferential Statistics

Essential Question: How are the mean and the standard deviation determined from a discrete probability distribution?

Chapter 1. Looking at Data

Sections 6.1 and 6.2: The Normal Distribution and its Applications

MATH 1150 Chapter 2 Notation and Terminology

23.3. Sampling Distributions. Engage Sampling Distributions. Learning Objective. Math Processes and Practices. Language Objective

Statistics and parameters

CHAPTER 5 Probabilistic Features of the Distributions of Certain Sample Statistics

Probability Distributions

Introduction to Basic Statistics Version 2

(i) The mean and mode both equal the median; that is, the average value and the most likely value are both in the middle of the distribution.

The empirical ( ) rule

Concepts in Statistics

Chapter 3. Data Description

Finding Quartiles. . Q1 is the median of the lower half of the data. Q3 is the median of the upper half of the data

Lecture 6: Chapter 4, Section 2 Quantitative Variables (Displays, Begin Summaries)

Describing distributions with numbers

Transcription:

1 Chapter 8 Sampling Distributions Defn: Sampling error is the error resulting from using a sample to infer a population characteristic. Example: We want to estimate the mean amount of Pepsi-Cola in 12-oz. cans coming off an assembly line by choosing a random sample of 16 cans, and using the sample mean as an estimate of the mean for the population of cans. Suppose that we choose 100 random samples of size 16 and compute the sample mean for each of these samples. These 100 values of will differ from each other somewhat due to sampling error, but the values should all be close to 12-oz. Defn: For a random variable, and a given sample size n, the distribution of the variable, i.e., of all possible values of, is called the sampling distribution of the mean. This probability distribution is a set of pairs of numbers. In each pair, the first number is a possible value of the sample mean, and the second number is the probability of obtaining that value of the mean occur when we select a random sample from the population. Properties of the Sampling Distribution of the Mean: 1) For samples of size n, the expectation (mean) of, equals the expectation (mean) of. In other words,. 2) The possible values of cluster closer around the population mean for larger samples than for smaller samples. In other words, the larger the sample size, the smaller the sampling error. In particular, the standard deviation of the sampling distribution of the means,, will be smaller than the population standard

2 deviation, sample size.. In particular, we have n, where n is the Example (Continued): To visualize the concept of the sampling distribution of the mean, let us return to the above example. If we were to measure the amount of Pepsi-Cola in each 12-oz. can, we would obtain a list of values: Here, is the amount of Pepsi-Cola in the first can selected; is the amount of Pepsi-Cola in the second can selected; etc. If we were to construct a histogram for all of these values, that histogram would represent the population distribution. Assuming that the distribution of fill amounts is normal, the histogram would be a bell-shaped curve centered at and with spread given by some (relatively) small amount, such as Now, instead, let us consider the average fill amount per can for random samples of 16 cans. We select a random sample of 16 cans, measure the amount in each can, and calculate the sample mean. We do this repeatedly, and obtain a list of values: is the mean fill amount for the first sample of 16 cans; is the mean fill amount for the second sample of 16 cans; is the mean fill amount for the third sample of 16 cans; etc. If we were to construct a histogram for all of these values, that histogram would represent the sampling distribution of the mean for samples of size 16. It would be a bell-shaped curve centered at and with spread given by

3 Defn: The standard deviation of the sampling distribution of the mean is called the standard error of the mean. Property 2 says that for a given population, and a given random variable defined for the members of that population, the standard error of the mean is smaller for larger sample sizes. To visualize what this means, let s look at an example. Example: Consider the adult population of the United States. We have an IQ test that has been developed to assess the intelligence of individuals in this population. The distribution of IQ scores in the population is normal with mean µ = 100 and standard deviation σ = 15. This means that if we were to measure the IQ of each adult American, and do a histogram of the data, we would get a bell-shaped curve centered at µ = 100 and with standard deviation σ = 15. Now suppose that we select a simple random sample of size n = 4 from the population, administer the IQ test to each person in the sample, and find the sample mean. If we were to do this repeatedly, so that we obtain sample mean values for all possible samples of size n = 4, and do a histogram of those numbers, the histogram would be a bell-shaped curve centered at and with standard deviation Suppose that, instead, we consider all possible samples of size n =100 from the population, and for each sample, we obtain the sample mean IQ score. If we do a histogram of all of these numbers, the histogram will be a bell-shaped curve centered at and with standard deviation

4 These three curves are shown together in the graph below: The following theoretical result from probability theory is fundamental for our work in statistical inference. The Central Limit Theorem: For large (n 30) sample sizes, the random variable has an approximate normal distribution, with mean and standard deviation n. In Z other words, the random variable has an approximate standard normal distribution. This theorem holds regardless of the type of population distribution. The population distribution could be normal; it could be uniform (equally likely outcomes); it could be strongly positively skewed; it could be strongly negatively skewed. Regardless of the shape of the population distribution, the sampling distribution of the mean will be approximately normal. n

5 Example: p. 390, Exercise 20 Example: p. 390, Exercise 22 Example: p. 391, Exercise 27 The Sampling Distribution of the Sample Proportion Assume that we have a (large) population, which is divided into two subpopulations. In one subpopulation, each member possesses a certain characteristic; in the other subpopulation, each member does not possess this characteristic. Assume that the proportion of members of the entire population who possess the characteristic is p. We select a simple random sample of size n from the population. We are interested in the proportion of the members of the sample who possess the characteristic of interest. This proportion is called the sample proportion, denoted by The Central Limit Theorem tells us that, if the sample size is large, then a) The shape of the sampling distribution of is approximately normal, having mean p and standard deviation ( ), provided and ( )

6 b) Under the same conditions, the distribution of is approximately standard normal. ( ) Example: p. 399, Exercise 15. Example: p. 399, Exercise 21 Example: p. 399, Exercise 23