Chapter 18. Sampling Distribution Models /51

Size: px
Start display at page:

Download "Chapter 18. Sampling Distribution Models /51"

Transcription

1 Chapter 18 Sampling Distribution Models 1 /51

2 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51

3 3 /51

4 Objective Students calculate values of central 4 /51

5 The Central Limit Theorem for Sample Proportions In the real world it is very common to sample the population to estimate a proportion of the population demonstrating some characteristic. Rather than showing real repeated samples, imagine what would happen if we were to actually draw many samples. We know 9/10 dentists recommend Crest. Were only 10 dentists surveyed? Trump s approval rating is currently 38%. Did they ask every person in the U.S? How do they know how many people watched the Super Bowl? (Estimated million) 5 /51

6 The Central Limit Theorem for Sample Proportions If we only take one sample, what makes anyone think the sample is a good representation of the population. Can we trust the results actually reflect the proportion in the population? Suppose we could take many samples of the same size (n). What would we expect to see if we looked at a distribution of the sample proportions for these samples. In other words, we have a distribution of data values that are proportions taken from our many samples. We could then create a histogram of the proportions from many samples. Do you think this histogram might look like the histogram of all possible sample proportions from samples of size n? 6 /51

7 The Central Limit Theorem for Sample Proportions What would the histogram of all the sample proportions look like? That histogram of all possible sample proportions is called the sampling distribution of the sample proportions. What is the expected value of the proportion? What would this value be for our sampling distribution of sample proportions? One very nice attribute of sampling proportions is the fact that the standard deviation of the distribution is an artifact of the proportion. 7 /51

8 Modeling the Distribution of Sample Proportions We would expect the histogram of the sample proportions to be centered at the actual true proportion, p, in the population. As far as the shape of the histogram goes, we could simulate a bunch of random samples that we did not actually draw. You could do this with a random table or random number generator by assigning values appropriate to the proportion. It is a really fortunate turn that the histogram turns out to be unimodal, symmetric, and centered at p. More specifically, it is extremely fortuitous that a Normal model is juuuuust right for the histogram of sample proportions. 8 /51

9 Modeling the Distribution of Sample Proportions Modeling how sample proportions (and soon sample means) vary from sample to sample is the foundation of the methodologies we use in parametric statistics. A sampling distribution model for how a sample proportion varies from sample to sample provides the ability to quantify variation and determine the probability we would observe a sample proportion within a particular interval. To define a Normal model, we need the mean and standard deviation. For the distribution of sample proportions µ, the mean of the Normal Distribution, is equal to p, the true proportion in the population. 9 /51

10 Modeling the Distribution of Sample Proportions When working with proportions, knowing the mean (proportion) automatically gives us the standard deviation as well. In our case the standard deviation we will use is SD (X ) = pq n So, the distribution of the sample proportions is modeled with a probability model that is: N p, pq n 10/51

11 Modeling the Distribution of Sample Proportions This is the graph of the distribution of sample proportions N p, pq n 3 pq n 2 pq n 1 pq n p 1 pq n 2 pq n 3 pq n 11/51

12 The Central Limit Theorem for Sample Proportions Because we have a Normal model, we know that 95% of Normally distributed values are within two standard deviations of the mean. So we should not be surprised if 95% of various sampling polls gave results that were near the mean but varied above and below that by no more than two standard deviations. This is what we mean by sampling error. Sampling error is not really an error (mistake), it is just variability we would expect expect to see from one sample to another. A better phrase would be sampling variability. The Normal model gets better as a model for the distribution of sample proportions as the sample size gets bigger. 12/51

13 Assumptions and Conditions Like most models, the distribution of sample proportions is useful (valid) only when specific assumptions are true. There are two assumptions in the case of the model for the distribution of sample proportions: 1. Independence Assumption: The sampled values must be independent of each other. 2. Sample Size Assumption: The sample size, n, must be large enough. 13/51

14 Assumptions and Conditions We check that the assumptions are reasonable by checking conditions that provide information about the assumptions. The corresponding conditions to check before using the Normal to model the distribution of sample proportions are: Randomization Condition (independence) 10% Condition (independence) Success/Failure Condition (adequate sample size) 14/51

15 Assumptions and Conditions 1. Randomization Condition: The sample should be a simple random sample of the population (SRS) % Condition: the sample size, n, must be no larger than 10% of the population (N 10n). 3. Success/Failure Condition: The sample size has to be big enough so that both np (number of successes) and nq (number of failures) are at least 10 (np 10, nq 10). 15/51

16 A Sampling Distribution Model for a Proportion A proportion is now more than just a computation from a set of data. A proportion is now one example of a random variable quantity (statistic) that has a probability distribution. This distribution is called the sampling distribution model for proportions. We make decisions based on what we know about sampling distribution models, but we never actually create them. It would be kind of tough to do, as the model requires an infinite number of data values. We rarely take repeated samples from the same population and make a histogram. We can simulate them, but we know what the model would look like, so we never need to actually create it. 16/51

17 Sampling Distribution Let us be clear about what we are now discussing. We have a new distribution of values (statistics) taken from samples. This new distribution (model) is to be distinguished from a distribution of individual observations taken from a population. This new distribution currently consists of sample proportions denoted: p^ p-hat p 17/51

18 A Sampling Distribution Model for a Proportion Provided that the sampled values are independent and the sample size is sufficiently large, the sampling distribution of model with: P! is modeled by a Normal Mean: µ P! = P Standard deviation: SD P! ( ) = pq n N p, pq n 18/51

19 Example Suppose we find data that tells us 80% of cars on the interstate exceed the speed limit. What proportion of the next 50 cars that speed by will be exceeding the speed limit. The problem (parameter of interest): We want to find the distribution of the proportion of the next 50 cars that may be speeding on the highway. We are told 80% of all cars on the highway are speeding. Randomize Condition: The 50 cars can be considered a random sample and thus, representative sample of all cars on the highway. 10% Condition: 50 is certainly less than 10% of all cars on the highway. Success/Failure Condition: np = 50(0.80) = 40, nq = 50(0.20) = 10, both /51

20 Example Suppose we find data that tells us 80% of cars on the interstate exceed the speed limit. What proportion of the next 50 cars that speed by will be exceeding the speed limit. The sampling distribution model for the proportion of speeders in 50 cars: Mean: µ P! =.8 Standard deviation: SD P! ( ) = pq n =.8 i N p, pq n The model for P! is N(.8,.057) 20/51

21 Example The model for P! is N(.8,.057) Proportion of speeders in 50 cars Accordingly, we expect 68% of the samples of 50 cars to have proportions of speeders between and % of the samples to have 99.7% 95% 68% proportions between and % of the samples to have proportions between and /51

22 What About Quantitative Data? Proportions summarize categorical variables. It is binomial data, you do exhibit the characteristic or you do not exhibit the characteristic. The Normal sampling distribution model is extremely useful in evaluating sample proportion data. But can we do something similar with quantitative data (i.e. means)? Yeperdoo. Even better, not only can we use all of the same concepts, but almost the same model. 22/51

23 Simulating the Sampling Distribution of a Mean Like other statistics computed from a random sample, a sample mean also has a sampling distribution. A simulation will give us a sense as to what the sampling distribution of the sample mean might look like Let s start with a simulation of many tosses of a die. Using a die or your calculator roll a die 20 times and record the results on a dot plot on the board. 23/51

24 Simulating the Sampling Distribution of a Mean A histogram of the results might look like this: 150 freq How did we do? /51

25 Simulation the Sampling Distribution of the Mean Now we will take samples of size 5 rolls. Roll the die 5 times and calculate the mean of the 5 rolls. Repeat 10 times, record on the board. Complete questions 1-3 of the worksheet. 25/51

26 Means Averaging More Dice Throws Looking at the mean of two dice after a simulation of 10,000 tosses: The mean of three dice after a simulation of 10,000 tosses looks like: 26/51

27 Means Averaging Even More Dice Throws The mean of 5 dice after a simulation of 10,000 tosses looks like: The mean of 20 dice after 10,000 tosses looks like: What does this tell us? 27/51

28 Means What the Simulations Show As the sample size (number of dice) gets larger, each sample mean is more likely to be closer to the population mean. So, we see the shape continuing to compress around 3.5 And, you hopefully noticed that the sampling distribution of a mean becomes Normal with all possible samples of size n. 28/51

29 The Central Limit Theorem The sampling distribution of any mean becomes more nearly Normal as the sample size grows. It must be true that the observations are independent and random. This is true, no matter the shape of the population distribution as long as the sample is large enough. More on this soon. This Fundamental Theorem of Statistics is called the Central Limit Theorem. 29/51

30 The Central Limit Theorem The Central Limit Theorem is not intuitive and a tad bit weird: Not only does the histogram of the sample means get closer and closer to the Normal model as the sample size grows, but this is true regardless of the shape of the population distribution. The Central Limit Theorem works better (and faster) the more unimodal and symmetric the population. Additionally, the Central Limit Theorem works better for larger samples. To summarize, the more unimodal and symmetric the population, the more unimodal and symmetric the sampling distribution. The larger the sample sizes, the more unimodal and symmetric the sampling distribution. 30/51

31 The Central Limit Theorem The mean of a random sample is a random variable whose sampling distribution (the distribution of means) can be approximated by a Normal model. The larger the sample, the better the approximation will be. It is also intuitive that the standard deviation of the distribution of sample means will be smaller than the population standard deviation. A collection of means should be close to the population mean, and thus closer to each other. SD(X ) = σ n So, the distribution of the sample means (sampling distribution) is a probability model: N µ, σ n 31/51

32 Sampling Distribution We have introduced a new distribution that is to be differentiated from the distribution of population values. The new distribution, the sampling distribution (distribution of a sample statistic), is just that; a distribution of statistics (means). The new distribution, has a mean equal to the population parameter (μ) and a standard deviation (now called standard error) that is smaller than that of the original population distribution. 32/51

33 Assumptions and Conditions The CLT requires essentially the same assumptions we saw for modeling proportions: Random Assumption: The samples must be randomly selected. Independence Assumption: The individual values must be independent. Sample Size Assumption: The sample size must be sufficiently large. 33/51

34 Assumptions and Conditions Once again we cannot check these directly, but we can be reasonably confident about whether the Independence Assumption is appropriate. We can check some related conditions to verify the assumption: Randomization Condition: The data values must be sampled randomly. 10% Condition: When the sample is drawn without replacement, the sample size, n, should be no more than 10% of the population. Large Enough Sample Condition: The CLT doesn t tell us how large a sample we need. For now, you need to think about your sample size in the context of what you know about the population. 34/51

35 But Which Normal? The Central Limit Theorem ensures that the sampling distribution of a statistic (mean) is approximately Normal. But which Normal model? We have learned that for proportions, the sampling distribution is centered at the population proportion (p). For means, it s centered at the population mean (μ). All that remains is to define the standard deviations. 35/51

36 But Which Normal The Normal model for the sampling distribution of the proportion has a standard deviation equal to SD(P! ) = pq n = pq n The Normal model for the sampling distribution of the means of samples size n has a standard deviation equal to SD(X ) = σ n where σ is the population standard deviation. 36/51

37 About Variation The standard deviation of the sampling distribution decreases only with the square root of the sample size. Thus, the variability decreases as the sample size increases but only by a factor of a square root. To decrease standard deviation significantly we have to increase sample size a whole bunch. But we cannot forget that we are limited in sample size to 10% of the population. 37/51

38 Example SAT math scores have mean 541 and standard deviation 103. What about the mean of random samples of 20 students? (Note that the small sample is okay because we believe a Normal model applies to the population.) We are interested in the distribution of possible means from samples of SAT scores from 20 students. SAT scores have a mean of 541 and a standard deviation of 103, and since the SAT is standardized, it s reasonable to assume that the model for all SAT scores is Normal. What we are NOT discussing is a distribution of individual SAT scores. 38/51

39 Example SAT scores have mean 541 and standard deviation 103. What about the mean of random samples of 20 students? Random Sampling Condition: The 20 students were sampled randomly. Independence Assumption: It is reasonable to believe that the SAT scores of the 20 randomly sampled students will be mutually independent, as long as the students were not all from the same school. 10% Condition: 20 students represent less than 10% of all students. Under these conditions, the sampling distribution of y has a Normal model. Large enough: 20 students provide a small sample, but we believe the distribution of individual SAT scores is unimodal and symmetric so the sampling distribution of y has a Normal model, 39/51

40 Example Individual SAT scores have mean 541 and standard deviation 103. What about the mean of random samples of 20 students? SD X ( ) = σ n = N(541, ) 40/51

41 Example Mean SAT scores of 20 students SAT scores have mean 541 and standard deviation 103. N(541, ) 99.7% 95% According to the Normal model, we can expect 68% % of samples of 20 to have mean SAT scores between and % of samples to have means between and % of samples to have means between and Remember, we are not comparing the SAT scores of individual students but, rather, comparing SAT scores of samples of 20 students. 41/51

42 Example SAT scores have mean 541 and standard deviation 103. Mean SAT scores of 20 students 1.67% What is the probability that a sample of 20 students would have an aggregate mean SAT score of 590 or more? N(541, ) We meet the conditions for calculating the probability (listed previously). P(x 590) = normcdf(590, 10^99, 541, ) =.0167 P(z ) = normcdf(2.1275, 10^99, 0, 1) =.0167 Z = = The probability of a random sample of 20 students having a mean SAT score of 590 or better is approximately 1.67% 42/51

43 Individual SAT scores have mean 541 and standard deviation 103. What is the probability that an individual will have a score of 590 or better? Mean SAT scores 99.7% 95% 68% P(x 590) = normcdf(590, 10^99, 541, 103) =.3171 P(z.4757) = normcdf(.4757, 10^99, 0, 1) =.3171 Z = =.4757 The probability of an individual student having a mean 31.71% SAT score of 550 or better is approximately 31.7% Do you see the difference? /51

44 Another Example Speeds of cars on a highway have mean 52 mph and standard deviation of 6 mph, and are likely to be skewed to the right (a few very fast drivers). Describe what we might see in random samples of 50 cars. We are investigating the distribution of sample mean speeds from many samples of 50 cars on the highway. 44/51

45 Example 2 Random Sampling Condition: The 50 speeds were sampled randomly. Independence Assumption: It is reasonable to think that the speeds of the 50 randomly sampled cars will be mutually independent. 10% Condition: 50 cars represent less than 10% of all cars on the highway. Large enough: 50 cars is a large enough sample to ensure the sampling distribution is unimodal and symmetric even though the original distribution is skewed. 45/51

46 Example 2 The distribution of our sample (and of the population of cars being driven) is almost certainly positively skewed, but the Central Limit Theorem takes care of that because the sample size of 50 cars, is sufficiently large and overcomes the skewness in the population. Under these conditions, the sampling distribution of y has a Normal model, with mean 52 mph and standard deviation SD X ( ) = σ n = N(52, ) 46/51

47 Example 2 Mean speed of 50 cars N(52, ) 99.7% 95% According to the Normal model, we can expect 68% % of samples of 50 cars to have mean speeds between 51.2 and 52.9 mph 95% of samples to have means between 50.3 and 53.7 mph 99.7% of samples to have means between 49.5 and 54.6 mph. 47/51

48 Example What is the probability the next 50 cars will have an average speed of 50 mph or less? Mean speed of 50 cars N(52, ) n 0.92% The conditions for a normal model N(52, ) have been met. P(x 50) = normalcdf(-10^99, 50, 52, ) =.0092 P(z ) = normalcdf(-10^99, , 0, 1) =.0092 Z = = The probability of a random sample of 50 cars having a mean speed of 50 or lower is approximately 0.92% 48/51

49 Complete Response Once again, to ensure you completely answer a question, the following elements must be included in your response. n 1. Identify the parameter of study. n 2. You must determine that the sample was obtained randomly. n 3. Determine that the measures are independent of each other. n 4. Ensure the sample is not too large, n < 10% of population. n 5. Determine that the sample is sufficiently large such that the central limit theorem ensures the sampling distribution is sufficiently unimodal and symmetric, np 10 nq 10 for proportion, n > 30 for means. n 6. Calculate the SE showing the appropriate values. SD X n 7. A complete probability statement P(who) = how = what. ( ) = σ n SD p! ( ) = pq n n 8. If a normal model is involved; define the model, calculate any z scores (even if you do not use them), and an appropriately labeled curve. n 9. A concluding sentence in context. 49/51

50 The Real World and the Model World Be careful! Now we have two distributions to deal with. The first is the real world distribution of a sample from individual data, which we might display with a histogram. The second is the theoretical sampling distribution of the statistic, which we model with a Normal model based on the Central Limit Theorem. Don t confuse the two! 50/51

51 The Process Going Into the Sampling Distribution Model 51/51

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives STA 2023 Module 6 The Sampling Distributions Module Objectives In this module, we will learn the following: 1. Define sampling error and explain the need for sampling distributions. 2. Recognize that sampling

More information

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Sampling Distribution Models. Central Limit Theorem

Sampling Distribution Models. Central Limit Theorem Sampling Distribution Models Central Limit Theorem Thought Questions 1. 40% of large population disagree with new law. In parts a and b, think about role of sample size. a. If randomly sample 10 people,

More information

Chapter 15 Sampling Distribution Models

Chapter 15 Sampling Distribution Models Chapter 15 Sampling Distribution Models 1 15.1 Sampling Distribution of a Proportion 2 Sampling About Evolution According to a Gallup poll, 43% believe in evolution. Assume this is true of all Americans.

More information

STA Module 8 The Sampling Distribution of the Sample Mean. Rev.F08 1

STA Module 8 The Sampling Distribution of the Sample Mean. Rev.F08 1 STA 2023 Module 8 The Sampling Distribution of the Sample Mean Rev.F08 1 Module Objectives 1. Define sampling error and explain the need for sampling distributions. 2. Find the mean and standard deviation

More information

Chapter 18 Sampling Distribution Models

Chapter 18 Sampling Distribution Models Chapter 18 Sampling Distribution Models The histogram above is a simulation of what we'd get if we could see all the proportions from all possible samples. The distribution has a special name. It's called

More information

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions ACMS 20340 Statistics for Life Sciences Chapter 13: Sampling Distributions Sampling We use information from a sample to infer something about a population. When using random samples and randomized experiments,

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Chapter 23. Inference About Means

Chapter 23. Inference About Means Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Solving with Absolute Value

Solving with Absolute Value Solving with Absolute Value Who knew two little lines could cause so much trouble? Ask someone to solve the equation 3x 2 = 7 and they ll say No problem! Add just two little lines, and ask them to solve

More information

Chapter 22. Comparing Two Proportions 1 /29

Chapter 22. Comparing Two Proportions 1 /29 Chapter 22 Comparing Two Proportions 1 /29 Homework p519 2, 4, 12, 13, 15, 17, 18, 19, 24 2 /29 Objective Students test null and alternate hypothesis about two population proportions. 3 /29 Comparing Two

More information

From the Data at Hand to the World at Large

From the Data at Hand to the World at Large From the Data at Hand to the World at Large PART V Chapter 18 Sampling Distribution Models Chapter 19 Confidence Intervals for Proportions Chapter 20 Testing Hypotheses About Proportions Chapter 21 More

More information

COMP6053 lecture: Sampling and the central limit theorem. Jason Noble,

COMP6053 lecture: Sampling and the central limit theorem. Jason Noble, COMP6053 lecture: Sampling and the central limit theorem Jason Noble, jn2@ecs.soton.ac.uk Populations: long-run distributions Two kinds of distributions: populations and samples. A population is the set

More information

Solutions to In-Class Problems Week 14, Mon.

Solutions to In-Class Problems Week 14, Mon. Massachusetts Institute of Technology 6.042J/18.062J, Spring 10: Mathematics for Computer Science May 10 Prof. Albert R. Meyer revised May 10, 2010, 677 minutes Solutions to In-Class Problems Week 14,

More information

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N Probability Distributions Normal Probability Distribution N Chapter 6 Inference It was reported that the 2008 Super Bowl was watched by 97.5 million people. But how does anyone know that? They certainly

More information

Review of the Normal Distribution

Review of the Normal Distribution Sampling and s Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples s of the Mean Standard Error of the Mean The Central Limit Theorem Review of the Normal Distribution

More information

COMP6053 lecture: Sampling and the central limit theorem. Markus Brede,

COMP6053 lecture: Sampling and the central limit theorem. Markus Brede, COMP6053 lecture: Sampling and the central limit theorem Markus Brede, mb8@ecs.soton.ac.uk Populations: long-run distributions Two kinds of distributions: populations and samples. A population is the set

More information

Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University

Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University richamar@gvsu.edu Overview of Lesson In this activity students explore the properties of the distribution

More information

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table. MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at

More information

Chapter 3. Estimation of p. 3.1 Point and Interval Estimates of p

Chapter 3. Estimation of p. 3.1 Point and Interval Estimates of p Chapter 3 Estimation of p 3.1 Point and Interval Estimates of p Suppose that we have Bernoulli Trials (BT). So far, in every example I have told you the (numerical) value of p. In science, usually the

More information

STA Module 4 Probability Concepts. Rev.F08 1

STA Module 4 Probability Concepts. Rev.F08 1 STA 2023 Module 4 Probability Concepts Rev.F08 1 Learning Objectives Upon completing this module, you should be able to: 1. Compute probabilities for experiments having equally likely outcomes. 2. Interpret

More information

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67

Chapter 6. The Standard Deviation as a Ruler and the Normal Model 1 /67 Chapter 6 The Standard Deviation as a Ruler and the Normal Model 1 /67 Homework Read Chpt 6 Complete Reading Notes Do P129 1, 3, 5, 7, 15, 17, 23, 27, 29, 31, 37, 39, 43 2 /67 Objective Students calculate

More information

MAT Mathematics in Today's World

MAT Mathematics in Today's World MAT 1000 Mathematics in Today's World Last Time We discussed the four rules that govern probabilities: 1. Probabilities are numbers between 0 and 1 2. The probability an event does not occur is 1 minus

More information

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability

3/30/2009. Probability Distributions. Binomial distribution. TI-83 Binomial Probability Random variable The outcome of each procedure is determined by chance. Probability Distributions Normal Probability Distribution N Chapter 6 Discrete Random variables takes on a countable number of values

More information

Unit 4 Probability. Dr Mahmoud Alhussami

Unit 4 Probability. Dr Mahmoud Alhussami Unit 4 Probability Dr Mahmoud Alhussami Probability Probability theory developed from the study of games of chance like dice and cards. A process like flipping a coin, rolling a die or drawing a card from

More information

Chapter 24. Comparing Means

Chapter 24. Comparing Means Chapter 4 Comparing Means!1 /34 Homework p579, 5, 7, 8, 10, 11, 17, 31, 3! /34 !3 /34 Objective Students test null and alternate hypothesis about two!4 /34 Plot the Data The intuitive display for comparing

More information

Chapter 22. Comparing Two Proportions 1 /30

Chapter 22. Comparing Two Proportions 1 /30 Chapter 22 Comparing Two Proportions 1 /30 Homework p519 2, 4, 12, 13, 15, 17, 18, 19, 24 2 /30 3 /30 Objective Students test null and alternate hypothesis about two population proportions. 4 /30 Comparing

More information

Steve Smith Tuition: Maths Notes

Steve Smith Tuition: Maths Notes Maths Notes : Discrete Random Variables Version. Steve Smith Tuition: Maths Notes e iπ + = 0 a + b = c z n+ = z n + c V E + F = Discrete Random Variables Contents Intro The Distribution of Probabilities

More information

What is a random variable

What is a random variable OKAN UNIVERSITY FACULTY OF ENGINEERING AND ARCHITECTURE MATH 256 Probability and Random Processes 04 Random Variables Fall 20 Yrd. Doç. Dr. Didem Kivanc Tureli didemk@ieee.org didem.kivanc@okan.edu.tr

More information

What is proof? Lesson 1

What is proof? Lesson 1 What is proof? Lesson The topic for this Math Explorer Club is mathematical proof. In this post we will go over what was covered in the first session. The word proof is a normal English word that you might

More information

P (A) = P (B) = P (C) = P (D) =

P (A) = P (B) = P (C) = P (D) = STAT 145 CHAPTER 12 - PROBABILITY - STUDENT VERSION The probability of a random event, is the proportion of times the event will occur in a large number of repititions. For example, when flipping a coin,

More information

Chapter 6. Estimates and Sample Sizes

Chapter 6. Estimates and Sample Sizes Chapter 6 Estimates and Sample Sizes Lesson 6-1/6-, Part 1 Estimating a Population Proportion This chapter begins the beginning of inferential statistics. There are two major applications of inferential

More information

MA 1125 Lecture 33 - The Sign Test. Monday, December 4, Objectives: Introduce an example of a non-parametric test.

MA 1125 Lecture 33 - The Sign Test. Monday, December 4, Objectives: Introduce an example of a non-parametric test. MA 1125 Lecture 33 - The Sign Test Monday, December 4, 2017 Objectives: Introduce an example of a non-parametric test. For the last topic of the semester we ll look at an example of a non-parametric test.

More information

Senior Math Circles November 19, 2008 Probability II

Senior Math Circles November 19, 2008 Probability II University of Waterloo Faculty of Mathematics Centre for Education in Mathematics and Computing Senior Math Circles November 9, 2008 Probability II Probability Counting There are many situations where

More information

Chapter 1 Review of Equations and Inequalities

Chapter 1 Review of Equations and Inequalities Chapter 1 Review of Equations and Inequalities Part I Review of Basic Equations Recall that an equation is an expression with an equal sign in the middle. Also recall that, if a question asks you to solve

More information

Part 13: The Central Limit Theorem

Part 13: The Central Limit Theorem Part 13: The Central Limit Theorem As discussed in Part 12, the normal distribution is a very good model for a wide variety of real world data. And in this section we will give even further evidence of

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

13. Sampling distributions

13. Sampling distributions 13. Sampling distributions The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 13) Sampling distributions Parameter versus statistic Sampling

More information

THE SAMPLING DISTRIBUTION OF THE MEAN

THE SAMPLING DISTRIBUTION OF THE MEAN THE SAMPLING DISTRIBUTION OF THE MEAN COGS 14B JANUARY 26, 2017 TODAY Sampling Distributions Sampling Distribution of the Mean Central Limit Theorem INFERENTIAL STATISTICS Inferential statistics: allows

More information

Sequences and Series

Sequences and Series Sequences and Series What do you think of when you read the title of our next unit? In case your answers are leading us off track, let's review the following IB problems. 1 November 2013 HL 2 3 November

More information

Study and research skills 2009 Duncan Golicher. and Adrian Newton. Last draft 11/24/2008

Study and research skills 2009 Duncan Golicher. and Adrian Newton. Last draft 11/24/2008 Study and research skills 2009. and Adrian Newton. Last draft 11/24/2008 Inference about the mean: What you will learn Why we need to draw inferences from samples The difference between a population and

More information

PHYS 275 Experiment 2 Of Dice and Distributions

PHYS 275 Experiment 2 Of Dice and Distributions PHYS 275 Experiment 2 Of Dice and Distributions Experiment Summary Today we will study the distribution of dice rolling results Two types of measurement, not to be confused: frequency with which we obtain

More information

Chapter 26: Comparing Counts (Chi Square)

Chapter 26: Comparing Counts (Chi Square) Chapter 6: Comparing Counts (Chi Square) We ve seen that you can turn a qualitative variable into a quantitative one (by counting the number of successes and failures), but that s a compromise it forces

More information

Chapter 18 Summary Sampling Distribution Models

Chapter 18 Summary Sampling Distribution Models Uit 5 Itroductio to Iferece Chapter 18 Summary Samplig Distributio Models What have we leared? Sample proportios ad meas will vary from sample to sample that s samplig error (samplig variability). Samplig

More information

Basic Probability. Introduction

Basic Probability. Introduction Basic Probability Introduction The world is an uncertain place. Making predictions about something as seemingly mundane as tomorrow s weather, for example, is actually quite a difficult task. Even with

More information

Chapter 4: An Introduction to Probability and Statistics

Chapter 4: An Introduction to Probability and Statistics Chapter 4: An Introduction to Probability and Statistics 4. Probability The simplest kinds of probabilities to understand are reflected in everyday ideas like these: (i) if you toss a coin, the probability

More information

Chapter 15. Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 15. Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc. Chapter 15 Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc. The General Addition Rule When two events A and B are disjoint, we can use the addition rule for disjoint events from Chapter

More information

Conceptual Explanations: Simultaneous Equations Distance, rate, and time

Conceptual Explanations: Simultaneous Equations Distance, rate, and time Conceptual Explanations: Simultaneous Equations Distance, rate, and time If you travel 30 miles per hour for 4 hours, how far do you go? A little common sense will tell you that the answer is 120 miles.

More information

Chapter 18: Sampling Distribution Models

Chapter 18: Sampling Distribution Models Chapter 18: Sampling Distribution Models Suppose I randomly select 100 seniors in Scott County and record each one s GPA. 1.95 1.98 1.86 2.04 2.75 2.72 2.06 3.36 2.09 2.06 2.33 2.56 2.17 1.67 2.75 3.95

More information

Chapter 8. Linear Regression /71

Chapter 8. Linear Regression /71 Chapter 8 Linear Regression 1 /71 Homework p192 1, 2, 3, 5, 7, 13, 15, 21, 27, 28, 29, 32, 35, 37 2 /71 3 /71 Objectives Determine Least Squares Regression Line (LSRL) describing the association of two

More information

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O Halloran Sampling A. Basics 1. Ways to Describe Data Histograms Frequency Tables, etc. 2. Ways to Characterize

More information

MATH 56A SPRING 2008 STOCHASTIC PROCESSES

MATH 56A SPRING 2008 STOCHASTIC PROCESSES MATH 56A SPRING 008 STOCHASTIC PROCESSES KIYOSHI IGUSA Contents 4. Optimal Stopping Time 95 4.1. Definitions 95 4.. The basic problem 95 4.3. Solutions to basic problem 97 4.4. Cost functions 101 4.5.

More information

Chapter 5 Simplifying Formulas and Solving Equations

Chapter 5 Simplifying Formulas and Solving Equations Chapter 5 Simplifying Formulas and Solving Equations Look at the geometry formula for Perimeter of a rectangle P = L W L W. Can this formula be written in a simpler way? If it is true, that we can simplify

More information

CENTRAL LIMIT THEOREM (CLT)

CENTRAL LIMIT THEOREM (CLT) CENTRAL LIMIT THEOREM (CLT) A sampling distribution is the probability distribution of the sample statistic that is formed when samples of size n are repeatedly taken from a population. If the sample statistic

More information

value of the sum standard units

value of the sum standard units Stat 1001 Winter 1998 Geyer Homework 7 Problem 18.1 20 and 25. Problem 18.2 (a) Average of the box. (1+3+5+7)=4=4. SD of the box. The deviations from the average are,3,,1, 1, 3. The squared deviations

More information

5.3 Conditional Probability and Independence

5.3 Conditional Probability and Independence 28 CHAPTER 5. PROBABILITY 5. Conditional Probability and Independence 5.. Conditional Probability Two cubical dice each have a triangle painted on one side, a circle painted on two sides and a square painted

More information

Probability and Samples. Sampling. Point Estimates

Probability and Samples. Sampling. Point Estimates Probability and Samples Sampling We want the results from our sample to be true for the population and not just the sample But our sample may or may not be representative of the population Sampling error

More information

Probability and Independence Terri Bittner, Ph.D.

Probability and Independence Terri Bittner, Ph.D. Probability and Independence Terri Bittner, Ph.D. The concept of independence is often confusing for students. This brief paper will cover the basics, and will explain the difference between independent

More information

Problems from Probability and Statistical Inference (9th ed.) by Hogg, Tanis and Zimmerman.

Problems from Probability and Statistical Inference (9th ed.) by Hogg, Tanis and Zimmerman. Math 224 Fall 2017 Homework 1 Drew Armstrong Problems from Probability and Statistical Inference (9th ed.) by Hogg, Tanis and Zimmerman. Section 1.1, Exercises 4,5,6,7,9,12. Solutions to Book Problems.

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Math 140 Introductory Statistics Professor Silvia Fernández Chapter 2 Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Visualizing Distributions Recall the definition: The

More information

Math 140 Introductory Statistics

Math 140 Introductory Statistics Visualizing Distributions Math 140 Introductory Statistics Professor Silvia Fernández Chapter Based on the book Statistics in Action by A. Watkins, R. Scheaffer, and G. Cobb. Recall the definition: The

More information

( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of

( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of Factoring Review for Algebra II The saddest thing about not doing well in Algebra II is that almost any math teacher can tell you going into it what s going to trip you up. One of the first things they

More information

A Primer on Statistical Inference using Maximum Likelihood

A Primer on Statistical Inference using Maximum Likelihood A Primer on Statistical Inference using Maximum Likelihood November 3, 2017 1 Inference via Maximum Likelihood Statistical inference is the process of using observed data to estimate features of the population.

More information

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015

Probability and Inference. POLI 205 Doing Research in Politics. Populations and Samples. Probability. Fall 2015 Fall 2015 Population versus Sample Population: data for every possible relevant case Sample: a subset of cases that is drawn from an underlying population Inference Parameters and Statistics A parameter

More information

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc. Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny March 1 Patrick Breheny STA 580: Biostatistics I 1/23 Kerrich s experiment A South African mathematician named John Kerrich was visiting Copenhagen in 1940 when

More information

Chapter 8: An Introduction to Probability and Statistics

Chapter 8: An Introduction to Probability and Statistics Course S3, 200 07 Chapter 8: An Introduction to Probability and Statistics This material is covered in the book: Erwin Kreyszig, Advanced Engineering Mathematics (9th edition) Chapter 24 (not including

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters. Chapter 9: Sampling Distributions 9.1: Sampling Distributions IDEA: How often would a given method of sampling give a correct answer if it was repeated many times? That is, if you took repeated samples

More information

University of California, Berkeley, Statistics 134: Concepts of Probability. Michael Lugo, Spring Exam 1

University of California, Berkeley, Statistics 134: Concepts of Probability. Michael Lugo, Spring Exam 1 University of California, Berkeley, Statistics 134: Concepts of Probability Michael Lugo, Spring 2011 Exam 1 February 16, 2011, 11:10 am - 12:00 noon Name: Solutions Student ID: This exam consists of seven

More information

STA Module 10 Comparing Two Proportions

STA Module 10 Comparing Two Proportions STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare

More information

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge).

the probability of getting either heads or tails must be 1 (excluding the remote possibility of getting it to land on its edge). Probability One of the most useful and intriguing aspects of quantum mechanics is the Heisenberg Uncertainty Principle. Before I get to it however, we need some initial comments on probability. Let s first

More information

Discrete Mathematics and Probability Theory Fall 2014 Anant Sahai Note 15. Random Variables: Distributions, Independence, and Expectations

Discrete Mathematics and Probability Theory Fall 2014 Anant Sahai Note 15. Random Variables: Distributions, Independence, and Expectations EECS 70 Discrete Mathematics and Probability Theory Fall 204 Anant Sahai Note 5 Random Variables: Distributions, Independence, and Expectations In the last note, we saw how useful it is to have a way of

More information

Probability Year 9. Terminology

Probability Year 9. Terminology Probability Year 9 Terminology Probability measures the chance something happens. Formally, we say it measures how likely is the outcome of an event. We write P(result) as a shorthand. An event is some

More information

Stat 20 Midterm 1 Review

Stat 20 Midterm 1 Review Stat 20 Midterm Review February 7, 2007 This handout is intended to be a comprehensive study guide for the first Stat 20 midterm exam. I have tried to cover all the course material in a way that targets

More information

7.1: What is a Sampling Distribution?!?!

7.1: What is a Sampling Distribution?!?! 7.1: What is a Sampling Distribution?!?! Section 7.1 What Is a Sampling Distribution? After this section, you should be able to DISTINGUISH between a parameter and a statistic DEFINE sampling distribution

More information

PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper)

PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper) PageRank: The Math-y Version (Or, What To Do When You Can t Tear Up Little Pieces of Paper) In class, we saw this graph, with each node representing people who are following each other on Twitter: Our

More information

Do students sleep the recommended 8 hours a night on average?

Do students sleep the recommended 8 hours a night on average? BIEB100. Professor Rifkin. Notes on Section 2.2, lecture of 27 January 2014. Do students sleep the recommended 8 hours a night on average? We first set up our null and alternative hypotheses: H0: μ= 8

More information

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem.

Statistics 1. Edexcel Notes S1. Mathematical Model. A mathematical model is a simplification of a real world problem. Statistics 1 Mathematical Model A mathematical model is a simplification of a real world problem. 1. A real world problem is observed. 2. A mathematical model is thought up. 3. The model is used to make

More information

DISTRIBUTIONS USED IN STATISTICAL WORK

DISTRIBUTIONS USED IN STATISTICAL WORK DISTRIBUTIONS USED IN STATISTICAL WORK In one of the classic introductory statistics books used in Education and Psychology (Glass and Stanley, 1970, Prentice-Hall) there was an excellent chapter on different

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 14. From Randomness to Probability. Copyright 2012, 2008, 2005 Pearson Education, Inc. Chapter 14 From Randomness to Probability Copyright 2012, 2008, 2005 Pearson Education, Inc. Dealing with Random Phenomena A random phenomenon is a situation in which we know what outcomes could happen,

More information

We introduce one more operation on sets, perhaps the most important

We introduce one more operation on sets, perhaps the most important 11. The power set Please accept my resignation. I don t want to belong to any club that will accept me as a member. Groucho Marx We introduce one more operation on sets, perhaps the most important one:

More information

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score

Interpret Standard Deviation. Outlier Rule. Describe the Distribution OR Compare the Distributions. Linear Transformations SOCS. Interpret a z score Interpret Standard Deviation Outlier Rule Linear Transformations Describe the Distribution OR Compare the Distributions SOCS Using Normalcdf and Invnorm (Calculator Tips) Interpret a z score What is an

More information

Why should you care?? Intellectual curiosity. Gambling. Mathematically the same as the ESP decision problem we discussed in Week 4.

Why should you care?? Intellectual curiosity. Gambling. Mathematically the same as the ESP decision problem we discussed in Week 4. I. Probability basics (Sections 4.1 and 4.2) Flip a fair (probability of HEADS is 1/2) coin ten times. What is the probability of getting exactly 5 HEADS? What is the probability of getting exactly 10

More information

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2

Probability Experiments, Trials, Outcomes, Sample Spaces Example 1 Example 2 Probability Probability is the study of uncertain events or outcomes. Games of chance that involve rolling dice or dealing cards are one obvious area of application. However, probability models underlie

More information

Sampling Distributions. Introduction to Inference

Sampling Distributions. Introduction to Inference Sampling Distributions Introduction to Inference Parameter A parameter is a number that describes the population. A parameter always exists but in practice we rarely know it s value because we cannot examine

More information

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary)

Probability deals with modeling of random phenomena (phenomena or experiments whose outcomes may vary) Chapter 14 From Randomness to Probability How to measure a likelihood of an event? How likely is it to answer correctly one out of two true-false questions on a quiz? Is it more, less, or equally likely

More information

the time it takes until a radioactive substance undergoes a decay

the time it takes until a radioactive substance undergoes a decay 1 Probabilities 1.1 Experiments with randomness Wewillusethetermexperimentinaverygeneralwaytorefertosomeprocess that produces a random outcome. Examples: (Ask class for some first) Here are some discrete

More information

Mr. Stein s Words of Wisdom

Mr. Stein s Words of Wisdom Mr. Stein s Words of Wisdom I am writing this review essay for two tests the AP Stat exam and the Applied Stat BFT. The topics are more or less the same, so reviewing for the two tests should be a similar

More information

*Karle Laska s Sections: There is no class tomorrow and Friday! Have a good weekend! Scores will be posted in Compass early Friday morning

*Karle Laska s Sections: There is no class tomorrow and Friday! Have a good weekend! Scores will be posted in Compass early Friday morning STATISTICS 100 EXAM 3 Spring 2016 PRINT NAME (Last name) (First name) *NETID CIRCLE SECTION: Laska MWF L1 Laska Tues/Thurs L2 Robin Tu Write answers in appropriate blanks. When no blanks are provided CIRCLE

More information

Probability Year 10. Terminology

Probability Year 10. Terminology Probability Year 10 Terminology Probability measures the chance something happens. Formally, we say it measures how likely is the outcome of an event. We write P(result) as a shorthand. An event is some

More information

Chapter 6 Group Activity - SOLUTIONS

Chapter 6 Group Activity - SOLUTIONS Chapter 6 Group Activity - SOLUTIONS Group Activity Summarizing a Distribution 1. The following data are the number of credit hours taken by Math 105 students during a summer term. You will be analyzing

More information

UNIT NUMBER PROBABILITY 6 (Statistics for the binomial distribution) A.J.Hobson

UNIT NUMBER PROBABILITY 6 (Statistics for the binomial distribution) A.J.Hobson JUST THE MATHS UNIT NUMBER 19.6 PROBABILITY 6 (Statistics for the binomial distribution) by A.J.Hobson 19.6.1 Construction of histograms 19.6.2 Mean and standard deviation of a binomial distribution 19.6.3

More information

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability

ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER / Probability ACCESS TO SCIENCE, ENGINEERING AND AGRICULTURE: MATHEMATICS 2 MATH00040 SEMESTER 2 2017/2018 DR. ANTHONY BROWN 5.1. Introduction to Probability. 5. Probability You are probably familiar with the elementary

More information

Statistics 511 Additional Materials

Statistics 511 Additional Materials Sampling Distributions and Central Limit Theorem In previous topics we have discussed taking a single observation from a distribution. More accurately, we looked at the probability of a single variable

More information