Sampling Distribution Models. Central Limit Theorem

Size: px
Start display at page:

Download "Sampling Distribution Models. Central Limit Theorem"

Transcription

1 Sampling Distribution Models Central Limit Theorem

2 Thought Questions 1. 40% of large population disagree with new law. In parts a and b, think about role of sample size. a. If randomly sample 10 people, will exactly four (40%) disagree with law? Surprised if only two in sample disagreed? How about if none disagreed? b. If randomly sample 1000 people, will exactly 400 (40%) disagree with law? Surprised if only 200 in sample disagreed? How about if none disagreed?

3 The Diversity of Samples from the Same Population Working Backward from Samples to Populations Start with question about population. Collect a sample from the population, measure variable. Answer question of interest for sample. With statistics, determine how close such an answer, based on a sample, would tend to be from the actual answer for the population. Understanding Dissimilarity among Samples We need to understand what kind of differences we should expect to see in various samples from the same population

4 What to Expect of Sample Proportions A slice of the population 40% of population carry a certain gene Do Not Carry Gene =, Do Carry Gene = X

5 What to Expect of Sample Proportions Possible Samples Sample 1: Proportion with gene = 12/25 = 0.48 = 48% Sample 2: Proportion with gene = 9/25 = 0.36 = 36% Sample 3: Proportion with gene = 10/25 = 0.40 = 40% Sample 4: Proportion with gene = 7/25 = 0.28 = 28%

6 The Central Limit Theorem for Sample Proportions Rather than showing real repeated samples, imagine what would happen if we were to actually draw many samples. Now imagine what would happen if we looked at the sample proportions for these samples. The histogram we d get if we could see all the proportions from all possible samples is called the sampling distribution of the proportions. What would the histogram of all the sample proportions look like?

7 The Central Limit Theorem for Sample Proportions We would expect the histogram of the sample proportions to center at the true proportion, p, in the population. As far as the shape of the histogram goes, we can simulate a bunch of random samples that we didn t really draw. It turns out that the histogram is unimodal, symmetric, and centered at p. More specifically, it s an amazing and fortunate fact that a Normal model is just the right one for the histogram of sample proportions.

8 Imagine, Imagine, Imagine - Predicting Election Results Imagine a research organization (R.E.) wants to know if Candidate A would win the election if held next week. They completed a well-conducted poll of 100 randomly selected voters. Imagine several other research organizations (499 of them) also completed a well-conducted poll of 100 randomly selected voters to try and answer the same question. They all conduct the polls on the same day. Imagine that the answer they are looking for is 0.53, that is Candidate A would get 53% of the vote if the election were held on the day of the polls.

9 Predicting Election Results R.E. Results 1 52/100 = /100 = /100 = /100 = /100 =.59...and so on

10 Relative frequency 0.35 Results from 500 Polls of 100 voters Proportion in sample that would vote for Candidate A

11 Relative frequency 0.4 Results from 500 Polls of 400 voters Proportion in sample that would vote for Candidate A

12 Relative frequency Relative frequency Results from 500 Polls of 100 voters Proportion in sample that would vote for Candidate A Results from 500 Polls of 400 voters Proportion in sample that would vote for Candidate A

13 Relative frequency 0.3 Results from 500 Polls of 1000 voters Proportion in sample that would vote for Candidate A

14 Relative frequency Relative frequency 0.4 Results from 500 Polls of 400 voters 0.3 Results from 500 Polls of 1000 voters Proportion in sample that would vote for Candidate A Proportion in sample that would vote for Candidate A

15 The Central Limit Theorem for Sample Proportions Modeling how sample proportions vary from sample to sample is one of the most powerful ideas we ll see in this course. A sampling distribution model for how a sample proportion varies from sample to sample allows us to quantify that variation and how likely it is that we d observe a sample proportion in any particular interval. To use a Normal model, we need to specify its mean and standard deviation. We ll put µ, the mean of the Normal, at p.

16 The Central Limit Theorem for Sample Proportions When working with proportions, knowing the mean automatically gives us the standard deviation as well the standard deviation we will use is pq n So, the distribution of the sample proportions is modeled with a probability model that is N p, pq n

17 Sampling Distribution Model for Proportions Mean and Standard Deviation Let Y - Binom(n,p) where n is the number of trials and p is the probability of success.

18 The Central Limit Theorem for Sample Proportions A picture of what we just discussed is as follows:

19 The Central Limit Theorem for Sample Proportions Because we have a Normal model, for example, we know that 95% of Normally distributed values are within two standard deviations of the mean. So we should not be surprised if 95% of various polls gave results that were near the mean but varied above and below that by no more than two standard deviations. This is what we call sampling variability. The sample proportions varies from sample to sample. All possible sample proportions arrange themselves neatly under a normal curve.

20 What to Expect of Sample Proportions If numerous samples of the same size are taken from a population are taken, the frequency curve made from proportions from various samples will be approximately bell-shaped. In other words, the sampling distribution of possible sample proportions is Normal and centered at the true population proportion, with standard deviation (true proportion)(1 true proportion) sample size

21 What to Expect of Sample Proportions In reality, we don t know the true proportion The standard error(se) for the sampling distribution of possible sample proportions is (sample proportion)(1 sample proportion) sample size

22 Assumptions and Conditions 1. Randomization Condition: The sample should be a random sample of the population % Condition: the sample size, n, must be no larger than 10% of the population. 3. Success/Failure Condition:The sample size has to be big enough so that both np (number of successes) and nq (number of failures) are at least 10.

23 A Sampling Distribution Model for a Proportion A proportion is no longer just a computation from a set of data. It is now a random variable quantity that has a probability distribution. This distribution is called the sampling distribution model for proportions. Even though we depend on sampling distribution models, we never actually get to see them. We never actually take repeated samples from the same population and make a histogram. We only imagine or simulate them.

24 A Sampling Distribution Model for a Proportion Still, sampling distribution models are important because they act as a bridge from the real world of data to the imaginary world of the statistic and enable us to say something about the population when all we have is data from the real world.

25 The Sampling Distribution Model for a Proportion Provided that the sampled values are independent and the sample size is large enough, the sampling distribution of ˆp is modeled by a Normal model with Mean: Expected Value( ˆp )=p Standard deviation: SD( ˆp) pq n

26 What to Expect of Sample Proportions Example : Suppose 40% of all voters in U.S. favor candidate X. Pollsters take a sample of 2400 people. What sample proportion would be expected to favor candidate X? The sample proportion could be anything from a bell-shaped curve with mean 0.40 and standard deviation: (0.40)(1 0.40) = For our sample of 2400 people: 68% of sample proportions will be between 39% and 41% 95% of sample proportions will be between 38% and 42% 99.7% of sample proportions will be between 37% and 43%

27 Example: Do Americans Really Vote When They Say They Do? Reported in Time magazine (Nov 28, 1994): Telephone poll of 800 adults (2 days after election) 56% reported they had voted. Committee for Study of American Electorate stated only 39% of American adults had voted. Could it be the results of poll simply reflected a sample that, by chance, voted with greater frequency than general population?

28 Example: Do Americans Really Vote When They Say They Do? Suppose only 39% of American adults voted. We can expect sample proportions to be represented by a bell-shaped curve with mean 0.39 and standard deviation: (0.39)(1 0.39) = or 1.7% % of sample proportions will be between 37.3% and 40.7% 95% of sample proportions will be between 35.6% and 42.4% 99.7% of sample proportions will be between 33.9% and 44.1%

29 Question

30 Thought Questions 2. Mean weight of all women at large university is 135 pounds with a standard deviation of 10 pounds. a. Recalling Empirical Rule for bell-shaped curves, in what range would you expect 95% of women s weights to fall? b. If randomly sampled 10 women at university, how close do you think their average weight would be to 135 pounds? c. If sampled 1000 women, would you expect average weight to be closer to 135 pounds than for the sample of only 10 women?

31 What to Expect of Sample Means Example Want to estimate average weight loss for all who attend national weight-loss clinic for 10 weeks. Unknown to us, population mean weight loss is 8 pounds and standard deviation is 5 pounds. If weight losses are approximately bell-shaped, 95% of individual weight losses will fall between 2 (a gain of 2 pounds) and 18 pounds lost. Possible Samples (random samples of 25 people from this population) Sample 1: 1,1,2,3,4,4,4,5,6,7,7,7,8,8,9,9,11,11,13,13,14,14,15,16,16 Sample 2: 2, 2,0,0,3,4,4,4,5,5,6,6,8,8,9,9,9,9,9,10,11,12,13,13,16 Sample 3: 4, 4,2,3,4,5,7,8,8,9,9,9,9,9,10,10,11,11,11,12,12,13,14,16,18 Sample 4: 3, 3, 2,0,1,2,2,4,4,5,7,7,9,9,10,10,10,11,11,12,12,14,14,14,19

32 What to Expect of Sample Means Results: Sample 1: Mean = 8.32 pounds Sample 2: Mean = 6.76 pounds Sample 3: Mean = 8.48 pounds Sample 4: Mean = 7.16 pounds Each sample gave a different sample mean, but close to 8.

33 What to Expect of Sample Means Say the true population mean height is 68 inches and the population standard deviation is 3 inches

34 What to Expect of Sample Means Variation in sample means Say, the actual mean height of a population is 68 inches. First sample mean is 67.5 inches Second sample mean is 66 inches Third sample mean is 69 inches Sample means are (hopefully) close to the population mean But they are not identical Why? Due to sample variability Each sample is only a subset of the population

35 What to Expect of Sample Means : Population of measurements is bell-shaped, and a random sample of any size is measured.

36

37

38

39 What to Expect of Sample Means: Population of measurements of interest is not bell-shaped, but a large random sample is measured. Sample of size 30 is considered large, but if there are extreme outliers, better to have a larger sample. Population Mean is 21.76

40

41

42

43 What to Expect of Sample Means: Population of measurements of interest is not bell-shaped, but a large random sample is measured.

44 What to Expect of Sample Means If numerous samples or repetitions of the same size are taken, the frequency curve of means from various samples will be approximately bell-shaped. The mean for the sampling distribution of the sample mean is equal to the true population mean The standard deviation(sd) for the sampling distribution of the possible sample means is : population standard deviation sample size

45 What to Expect of Sample Means In reality, we won t know the population standard deviation The standard error(se) for the sampling distribution of the possible sample means is sample standard deviation sample size

46 The Central Limit Theorem: The Fundamental Theorem of Statistics The sampling distribution of any mean becomes closer to near Normal as the sample size grows. We don t even care about the shape of the population distribution as long as the sample size is large enough! The Fundamental Theorem of Statistics is called the Central Limit Theorem (CLT).

47 The Central Limit Theorem: The Fundamental Theorem of Statistics The CLT is surprising: Not only does the histogram of the sample means get closer and closer to the Normal distribution as the sample size grows, but this is true regardless of the shape of the population distribution. The CLT works better (and faster) the closer the population distribution is to a Normal itself. It also works better for larger samples.

48 The Central Limit Theorem: The Fundamental Theorem of Statistics Slide 18-48

49 The Fundamental Theorem of Statistics The Central Limit Theorem (CLT) The mean of a random sample is a random variable whose sampling distribution can be approximated by a Normal model. The larger the sample, the better the approximation will be.

50 Assumptions and Conditions The CLT requires essentially the same assumptions we saw for modeling proportions: Independence Assumption: The sampled values must be independent of each other. Sample Size Assumption: The sample size must be sufficiently large.

51 Assumptions and Conditions (cont.) We can t check these directly, but we can think about whether the Independence Assumption is plausible. We can also check some related conditions: Randomization Condition: The data values must be sampled randomly. 10% Condition: When the sample is drawn without replacement, the sample size, n, should be no more than 10% of the population. Large Enough Sample Condition: The CLT doesn t tell us how large a sample we need. For now, you need to think about your sample size in the context of what you know about the population.

52 Weight-Loss Example Weight-loss example, population mean and standard deviation were 8 pounds and 5 pounds, respectively, and we were taking random samples of size 25. Potential sample means represented by a bell-shaped curve with mean of 8 pounds and standard deviation: 5 = 1 pound 25 For our samples of 25 people: 68% of sample means will be between 7 and 9 pounds 95% of sample means will be between 6 and 10 pounds 99.7% of sample means will be between 5 and 11 pounds

53 Weight-Loss Example Increasing the Size of the Sample suppose a sample of 100 people instead of 25 was taken. Potential sample means still represented by a bell-shaped curve with mean of 8 pounds but standard deviation: 5 = 0.5 pounds 100 For our sample of 100 people: 68% of sample means will be between 7.5 and 8.5 pounds 95% of sample means will be between 7 and 9 pounds 99.7% of sample means will be between 6.5 and 9.5 pounds

54 Question Suppose that test scores on a particular exam have a mean of 77 and standard deviation of 5, and that they have a bellshaped curve. Suppose you randomly select a 1000 samples of size 100 from this population and calculate the sample mean test scores. Between what two values(sample means) would you expect 95% of these sample mean test scores to fall? A. 72 AND 82 B. 67 AND 87 C. 76 AND 78

55 Question

56 But Which Normal? The CLT says that the sampling distribution of any mean or proportion is approximately Normal. But which Normal model? For proportions, the sampling distribution is centered at the population proportion. For means, it s centered at the population mean. But what about the standard deviations?

57 But Which Normal? The Normal model for the sampling distribution of the mean has a standard deviation equal to SDy n where σ is the population standard deviation.

58 But Which Normal? The Normal model for the sampling distribution of the proportion has a standard deviation equal to SD ˆp pq n pq n

59 About Variation The standard deviation of the sampling distribution declines only with the square root of the sample size (the denominator contains the square root of n). Therefore, the variability decreases as the sample size increases. While we d always like a larger sample, the square root limits how much we can make a sample tell about the population. (This is an example of the Law of Diminishing Returns.)

60 The Real World and the Model World Be careful! Now we have two distributions to deal with. The first is the real world distribution of the sample, which we might display with a histogram. The second is the math world sampling distribution of the statistic, which we model with a Normal model based on the Central Limit Theorem. Don t confuse the two!

61 Sampling Distribution Models There are two basic truths about sampling distributions: 1. Sampling distributions arise because samples vary. Each random sample will have different cases and, so, a different value of the statistic. 2. Although we can always simulate a sampling distribution, the Central Limit Theorem saves us the trouble for means and proportions.

62 What Can Go Wrong? Don t confuse the sampling distribution with the distribution of the sample. When you take a sample, you look at the distribution of the values, usually with a histogram, and you may calculate summary statistics. The sampling distribution is an imaginary collection of the values that a statistic might have taken for all random samples the one you got and the ones you didn t get. Watch out for small samples from skewed populations. The more skewed the distribution, the larger the sample size we need for the CLT to work.

63 Summary Sample proportions and means will vary from sample to sample that s sampling error (sampling variability). Sampling variability may be unavoidable, but it is also predictable! In statistics, the concept of population parameter is theoretical Only God? knows the Truth? We try our best to find out what it is.

64 A Scientific Look at the Dangers of High Heels, NY Times, Jan., 2012 Not long ago, Neil J. Cronin, a postdoctoral researcher, and two of his colleagues at the Musculoskeletal Research Program at Griffith University in Queensland, Australia, were having coffee on the university s campus when they noticed a young woman tottering past in high heels. She looked quite uncomfortable and unstable, Dr. Cronin says.

65 A Scientific Look at the Dangers of High Heels, NY Times, Jan., 2012 Some observers, particularly women, might have winced in sympathy or, alternatively, wondered where she d bought stilettos. But the three researchers, men who study the biomechanics of walking, were struck instead by the scientific implications of her passage. We began to consider what might be happening at the muscle and tendon level in women who wear heels, Dr. Cronin says.

66 Study: Long-term use of high heeled shoes alters the neuromechanics of human walking

67 Study: Long-term use of high heeled shoes alters the neuromechanics of human walking

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives

STA Why Sampling? Module 6 The Sampling Distributions. Module Objectives STA 2023 Module 6 The Sampling Distributions Module Objectives In this module, we will learn the following: 1. Define sampling error and explain the need for sampling distributions. 2. Recognize that sampling

More information

Sampling Distribution Models. Chapter 17

Sampling Distribution Models. Chapter 17 Sampling Distribution Models Chapter 17 Objectives: 1. Sampling Distribution Model 2. Sampling Variability (sampling error) 3. Sampling Distribution Model for a Proportion 4. Central Limit Theorem 5. Sampling

More information

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc.

Chapter 18. Sampling Distribution Models. Copyright 2010, 2007, 2004 Pearson Education, Inc. Chapter 18 Sampling Distribution Models Copyright 2010, 2007, 2004 Pearson Education, Inc. Normal Model When we talk about one data value and the Normal model we used the notation: N(μ, σ) Copyright 2010,

More information

Chapter 18. Sampling Distribution Models /51

Chapter 18. Sampling Distribution Models /51 Chapter 18 Sampling Distribution Models 1 /51 Homework p432 2, 4, 6, 8, 10, 16, 17, 20, 30, 36, 41 2 /51 3 /51 Objective Students calculate values of central 4 /51 The Central Limit Theorem for Sample

More information

Chapter 15 Sampling Distribution Models

Chapter 15 Sampling Distribution Models Chapter 15 Sampling Distribution Models 1 15.1 Sampling Distribution of a Proportion 2 Sampling About Evolution According to a Gallup poll, 43% believe in evolution. Assume this is true of all Americans.

More information

STA Module 8 The Sampling Distribution of the Sample Mean. Rev.F08 1

STA Module 8 The Sampling Distribution of the Sample Mean. Rev.F08 1 STA 2023 Module 8 The Sampling Distribution of the Sample Mean Rev.F08 1 Module Objectives 1. Define sampling error and explain the need for sampling distributions. 2. Find the mean and standard deviation

More information

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions

ACMS Statistics for Life Sciences. Chapter 13: Sampling Distributions ACMS 20340 Statistics for Life Sciences Chapter 13: Sampling Distributions Sampling We use information from a sample to infer something about a population. When using random samples and randomized experiments,

More information

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc.

Chapter 7. Scatterplots, Association, and Correlation. Copyright 2010 Pearson Education, Inc. Chapter 7 Scatterplots, Association, and Correlation Copyright 2010 Pearson Education, Inc. Looking at Scatterplots Scatterplots may be the most common and most effective display for data. In a scatterplot,

More information

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li)

Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Carolyn Anderson & YoungShil Paek (Slide contributors: Shuai Wang, Yi Zheng, Michael Culbertson, & Haiyan Li) Department of Educational Psychology University of Illinois at Urbana-Champaign 1 Inferential

More information

Chapter 7 Summary Scatterplots, Association, and Correlation

Chapter 7 Summary Scatterplots, Association, and Correlation Chapter 7 Summary Scatterplots, Association, and Correlation What have we learned? We examine scatterplots for direction, form, strength, and unusual features. Although not every relationship is linear,

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Are data normally normally distributed?

Are data normally normally distributed? Standard Normal Image source Are data normally normally distributed? Sample mean: 66.78 Sample standard deviation: 3.37 (66.78-1 x 3.37, 66.78 + 1 x 3.37) (66.78-2 x 3.37, 66.78 + 2 x 3.37) (66.78-3 x

More information

Chapter 18: Sampling Distributions

Chapter 18: Sampling Distributions Chapter 18: Sampling Distributions All random variables have probability distributions, and as statistics are random variables, they too have distributions. The random phenomenon that produces the statistics

More information

Lecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari

Lecture 27. DATA 8 Spring Sample Averages. Slides created by John DeNero and Ani Adhikari DATA 8 Spring 2018 Lecture 27 Sample Averages Slides created by John DeNero (denero@berkeley.edu) and Ani Adhikari (adhikari@berkeley.edu) Announcements Questions for This Week How can we quantify natural

More information

Chapter 7 Sampling Distributions

Chapter 7 Sampling Distributions Statistical inference looks at how often would this method give a correct answer if it was used many many times. Statistical inference works best when we produce data by random sampling or randomized comparative

More information

P (E) = P (A 1 )P (A 2 )... P (A n ).

P (E) = P (A 1 )P (A 2 )... P (A n ). Lecture 9: Conditional probability II: breaking complex events into smaller events, methods to solve probability problems, Bayes rule, law of total probability, Bayes theorem Discrete Structures II (Summer

More information

Section 5.4. Ken Ueda

Section 5.4. Ken Ueda Section 5.4 Ken Ueda Students seem to think that being graded on a curve is a positive thing. I took lasers 101 at Cornell and got a 92 on the exam. The average was a 93. I ended up with a C on the test.

More information

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc.

Chapter 8. Linear Regression. Copyright 2010 Pearson Education, Inc. Chapter 8 Linear Regression Copyright 2010 Pearson Education, Inc. Fat Versus Protein: An Example The following is a scatterplot of total fat versus protein for 30 items on the Burger King menu: Copyright

More information

Chapter 8: Sampling Distributions. A survey conducted by the U.S. Census Bureau on a continual basis. Sample

Chapter 8: Sampling Distributions. A survey conducted by the U.S. Census Bureau on a continual basis. Sample Chapter 8: Sampling Distributions Section 8.1 Distribution of the Sample Mean Frequently, samples are taken from a large population. Example: American Community Survey (ACS) A survey conducted by the U.S.

More information

Lecture 8 Sampling Theory

Lecture 8 Sampling Theory Lecture 8 Sampling Theory Thais Paiva STA 111 - Summer 2013 Term II July 11, 2013 1 / 25 Thais Paiva STA 111 - Summer 2013 Term II Lecture 8, 07/11/2013 Lecture Plan 1 Sampling Distributions 2 Law of Large

More information

Last few slides from last time

Last few slides from last time Last few slides from last time Example 3: What is the probability that p will fall in a certain range, given p? Flip a coin 50 times. If the coin is fair (p=0.5), what is the probability of getting an

More information

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to:

STA Module 5 Regression and Correlation. Learning Objectives. Learning Objectives (Cont.) Upon completing this module, you should be able to: STA 2023 Module 5 Regression and Correlation Learning Objectives Upon completing this module, you should be able to: 1. Define and apply the concepts related to linear equations with one independent variable.

More information

From the Data at Hand to the World at Large

From the Data at Hand to the World at Large From the Data at Hand to the World at Large PART V Chapter 18 Sampling Distribution Models Chapter 19 Confidence Intervals for Proportions Chapter 20 Testing Hypotheses About Proportions Chapter 21 More

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion

Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion Section 7.1 How Likely are the Possible Values of a Statistic? The Sampling Distribution of the Proportion CNN / USA Today / Gallup Poll September 22-24, 2008 www.poll.gallup.com 12% of Americans describe

More information

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals

Chapter 8. Linear Regression. The Linear Model. Fat Versus Protein: An Example. The Linear Model (cont.) Residuals Chapter 8 Linear Regression Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Slide 8-1 Copyright 2007 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Fat Versus

More information

Exam #2 Results (as percentages)

Exam #2 Results (as percentages) Oct. 30 Assignment: Read Chapter 19 Try exercises 1, 2, and 4 on p. 424 Exam #2 Results (as percentages) Mean: 71.4 Median: 73.3 Soda attitudes 2015 In a Gallup poll conducted Jul. 8 12, 2015, 1009 adult

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

σ. We further know that if the sample is from a normal distribution then the sampling STAT 2507 Assignment # 3 (Chapters 7 & 8)

σ. We further know that if the sample is from a normal distribution then the sampling STAT 2507 Assignment # 3 (Chapters 7 & 8) STAT 2507 Assignment # 3 (Chapters 7 & 8) DUE: Sections E, F Section G Section H Monday, March 16, in class Tuesday, March 17, in class Wednesday, March 18, in class Last Name Student # First Name Your

More information

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation

AP Statistics. Chapter 6 Scatterplots, Association, and Correlation AP Statistics Chapter 6 Scatterplots, Association, and Correlation Objectives: Scatterplots Association Outliers Response Variable Explanatory Variable Correlation Correlation Coefficient Lurking Variables

More information

Week 11 Sample Means, CLT, Correlation

Week 11 Sample Means, CLT, Correlation Week 11 Sample Means, CLT, Correlation Slides by Suraj Rampure Fall 2017 Administrative Notes Complete the mid semester survey on Piazza by Nov. 8! If 85% of the class fills it out, everyone will get a

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny March 1 Patrick Breheny STA 580: Biostatistics I 1/23 Kerrich s experiment A South African mathematician named John Kerrich was visiting Copenhagen in 1940 when

More information

Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University

Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University Using Dice to Introduce Sampling Distributions Written by: Mary Richardson Grand Valley State University richamar@gvsu.edu Overview of Lesson In this activity students explore the properties of the distribution

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

Chapter 18 Sampling Distribution Models

Chapter 18 Sampling Distribution Models Chapter 18 Sampling Distribution Models The histogram above is a simulation of what we'd get if we could see all the proportions from all possible samples. The distribution has a special name. It's called

More information

Stat 20 Midterm 1 Review

Stat 20 Midterm 1 Review Stat 20 Midterm Review February 7, 2007 This handout is intended to be a comprehensive study guide for the first Stat 20 midterm exam. I have tried to cover all the course material in a way that targets

More information

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N

4/19/2009. Probability Distributions. Inference. Example 1. Example 2. Parameter versus statistic. Normal Probability Distribution N Probability Distributions Normal Probability Distribution N Chapter 6 Inference It was reported that the 2008 Super Bowl was watched by 97.5 million people. But how does anyone know that? They certainly

More information

EQ: What is a normal distribution?

EQ: What is a normal distribution? Unit 5 - Statistics What is the purpose EQ: What tools do we have to assess data? this unit? What vocab will I need? Vocabulary: normal distribution, standard, nonstandard, interquartile range, population

More information

MATH 1150 Chapter 2 Notation and Terminology

MATH 1150 Chapter 2 Notation and Terminology MATH 1150 Chapter 2 Notation and Terminology Categorical Data The following is a dataset for 30 randomly selected adults in the U.S., showing the values of two categorical variables: whether or not the

More information

Chapter 6 The Standard Deviation as a Ruler and the Normal Model

Chapter 6 The Standard Deviation as a Ruler and the Normal Model Chapter 6 The Standard Deviation as a Ruler and the Normal Model Overview Key Concepts Understand how adding (subtracting) a constant or multiplying (dividing) by a constant changes the center and/or spread

More information

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation.

Chapter 6. September 17, Please pick up a calculator and take out paper and something to write with. Association and Correlation. Please pick up a calculator and take out paper and something to write with. Sep 17 8:08 AM Chapter 6 Scatterplots, Association and Correlation Copyright 2015, 2010, 2007 Pearson Education, Inc. Chapter

More information

Chapter 8: Confidence Intervals

Chapter 8: Confidence Intervals Chapter 8: Confidence Intervals Introduction Suppose you are trying to determine the mean rent of a two-bedroom apartment in your town. You might look in the classified section of the newspaper, write

More information

Lab 5 for Math 17: Sampling Distributions and Applications

Lab 5 for Math 17: Sampling Distributions and Applications Lab 5 for Math 17: Sampling Distributions and Applications Recall: The distribution formed by considering the value of a statistic for every possible sample of a given size n from the population is called

More information

Data Presentation. Naureen Ghani. May 4, 2018

Data Presentation. Naureen Ghani. May 4, 2018 Data Presentation Naureen Ghani May 4, 2018 Data is only as good as how it is presented. How do you take hundreds or thousands of data points and create something a human can understand? This is a problem

More information

COMP6053 lecture: Sampling and the central limit theorem. Jason Noble,

COMP6053 lecture: Sampling and the central limit theorem. Jason Noble, COMP6053 lecture: Sampling and the central limit theorem Jason Noble, jn2@ecs.soton.ac.uk Populations: long-run distributions Two kinds of distributions: populations and samples. A population is the set

More information

COMP6053 lecture: Sampling and the central limit theorem. Markus Brede,

COMP6053 lecture: Sampling and the central limit theorem. Markus Brede, COMP6053 lecture: Sampling and the central limit theorem Markus Brede, mb8@ecs.soton.ac.uk Populations: long-run distributions Two kinds of distributions: populations and samples. A population is the set

More information

( ) P A B : Probability of A given B. Probability that A happens

( ) P A B : Probability of A given B. Probability that A happens A B A or B One or the other or both occurs At least one of A or B occurs Probability Review A B A and B Both A and B occur ( ) P A B : Probability of A given B. Probability that A happens given that B

More information

Math 10 - Compilation of Sample Exam Questions + Answers

Math 10 - Compilation of Sample Exam Questions + Answers Math 10 - Compilation of Sample Exam Questions + Sample Exam Question 1 We have a population of size N. Let p be the independent probability of a person in the population developing a disease. Answer the

More information

Solving with Absolute Value

Solving with Absolute Value Solving with Absolute Value Who knew two little lines could cause so much trouble? Ask someone to solve the equation 3x 2 = 7 and they ll say No problem! Add just two little lines, and ask them to solve

More information

Review of the Normal Distribution

Review of the Normal Distribution Sampling and s Normal Distribution Aims of Sampling Basic Principles of Probability Types of Random Samples s of the Mean Standard Error of the Mean The Central Limit Theorem Review of the Normal Distribution

More information

Chapter 6. Estimates and Sample Sizes

Chapter 6. Estimates and Sample Sizes Chapter 6 Estimates and Sample Sizes Lesson 6-1/6-, Part 1 Estimating a Population Proportion This chapter begins the beginning of inferential statistics. There are two major applications of inferential

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

DISTRIBUTIONS USED IN STATISTICAL WORK

DISTRIBUTIONS USED IN STATISTICAL WORK DISTRIBUTIONS USED IN STATISTICAL WORK In one of the classic introductory statistics books used in Education and Psychology (Glass and Stanley, 1970, Prentice-Hall) there was an excellent chapter on different

More information

Overview. Confidence Intervals Sampling and Opinion Polls Error Correcting Codes Number of Pet Unicorns in Ireland

Overview. Confidence Intervals Sampling and Opinion Polls Error Correcting Codes Number of Pet Unicorns in Ireland Overview Confidence Intervals Sampling and Opinion Polls Error Correcting Codes Number of Pet Unicorns in Ireland Confidence Intervals When a random variable lies in an interval a X b with a specified

More information

Simple Illustrations of Statistical Significance

Simple Illustrations of Statistical Significance ELECTRONOTES WEBNOTE 17 2/13/2014 ENWN-17 Simple Illustrations of Statistical Significance Recently, with regard to a local traffic study, I had occasion to wonder about how well the general public understood

More information

Read the text and then answer the questions.

Read the text and then answer the questions. 1 Read the text and then answer The young girl walked on the beach. What did she see in the water? Was it a dolphin, a shark, or a whale? She knew something was out there. It had an interesting fin. She

More information

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable

HOLLOMAN S AP STATISTICS BVD CHAPTER 08, PAGE 1 OF 11. Figure 1 - Variation in the Response Variable Chapter 08: Linear Regression There are lots of ways to model the relationships between variables. It is important that you not think that what we do is the way. There are many paths to the summit We are

More information

1. Create a scatterplot of this data. 2. Find the correlation coefficient.

1. Create a scatterplot of this data. 2. Find the correlation coefficient. How Fast Foods Compare Company Entree Total Calories Fat (grams) McDonald s Big Mac 540 29 Filet o Fish 380 18 Burger King Whopper 670 40 Big Fish Sandwich 640 32 Wendy s Single Burger 470 21 1. Create

More information

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table.

MA 1125 Lecture 15 - The Standard Normal Distribution. Friday, October 6, Objectives: Introduce the standard normal distribution and table. MA 1125 Lecture 15 - The Standard Normal Distribution Friday, October 6, 2017. Objectives: Introduce the standard normal distribution and table. 1. The Standard Normal Distribution We ve been looking at

More information

STA Module 4 Probability Concepts. Rev.F08 1

STA Module 4 Probability Concepts. Rev.F08 1 STA 2023 Module 4 Probability Concepts Rev.F08 1 Learning Objectives Upon completing this module, you should be able to: 1. Compute probabilities for experiments having equally likely outcomes. 2. Interpret

More information

Senior Math Circles November 19, 2008 Probability II

Senior Math Circles November 19, 2008 Probability II University of Waterloo Faculty of Mathematics Centre for Education in Mathematics and Computing Senior Math Circles November 9, 2008 Probability II Probability Counting There are many situations where

More information

Probability Distributions

Probability Distributions CONDENSED LESSON 13.1 Probability Distributions In this lesson, you Sketch the graph of the probability distribution for a continuous random variable Find probabilities by finding or approximating areas

More information

Assignment 3 Logic and Reasoning KEY

Assignment 3 Logic and Reasoning KEY Assignment 3 Logic and Reasoning KEY Print this sheet and fill in your answers. Please staple the sheets together. Turn in at the beginning of class on Friday, September 8. Recall this about logic: Suppose

More information

The Central Limit Theorem

The Central Limit Theorem The Central Limit Theorem Patrick Breheny September 27 Patrick Breheny University of Iowa Biostatistical Methods I (BIOS 5710) 1 / 31 Kerrich s experiment Introduction 10,000 coin flips Expectation and

More information

appstats27.notebook April 06, 2017

appstats27.notebook April 06, 2017 Chapter 27 Objective Students will conduct inference on regression and analyze data to write a conclusion. Inferences for Regression An Example: Body Fat and Waist Size pg 634 Our chapter example revolves

More information

Functions. If x 2 D, then g(x) 2 T is the object that g assigns to x. Writing the symbols. g : D! T

Functions. If x 2 D, then g(x) 2 T is the object that g assigns to x. Writing the symbols. g : D! T Functions This is the second of three chapters that are meant primarily as a review of Math 1050. We will move quickly through this material. A function is a way of describing a relationship between two

More information

Mr. Stein s Words of Wisdom

Mr. Stein s Words of Wisdom Mr. Stein s Words of Wisdom I am writing this review essay for two tests the AP Stat exam and the Applied Stat BFT. The topics are more or less the same, so reviewing for the two tests should be a similar

More information

Chapter 3. Estimation of p. 3.1 Point and Interval Estimates of p

Chapter 3. Estimation of p. 3.1 Point and Interval Estimates of p Chapter 3 Estimation of p 3.1 Point and Interval Estimates of p Suppose that we have Bernoulli Trials (BT). So far, in every example I have told you the (numerical) value of p. In science, usually the

More information

STA Module 10 Comparing Two Proportions

STA Module 10 Comparing Two Proportions STA 2023 Module 10 Comparing Two Proportions Learning Objectives Upon completing this module, you should be able to: 1. Perform large-sample inferences (hypothesis test and confidence intervals) to compare

More information

Simple Regression Model. January 24, 2011

Simple Regression Model. January 24, 2011 Simple Regression Model January 24, 2011 Outline Descriptive Analysis Causal Estimation Forecasting Regression Model We are actually going to derive the linear regression model in 3 very different ways

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters.

Statistic: a that can be from a sample without making use of any unknown. In practice we will use to establish unknown parameters. Chapter 9: Sampling Distributions 9.1: Sampling Distributions IDEA: How often would a given method of sampling give a correct answer if it was repeated many times? That is, if you took repeated samples

More information

CHAPTER 1: Preliminary Description of Errors Experiment Methodology and Errors To introduce the concept of error analysis, let s take a real world

CHAPTER 1: Preliminary Description of Errors Experiment Methodology and Errors To introduce the concept of error analysis, let s take a real world CHAPTER 1: Preliminary Description of Errors Experiment Methodology and Errors To introduce the concept of error analysis, let s take a real world experiment. Suppose you wanted to forecast the results

More information

Introducing Proof 1. hsn.uk.net. Contents

Introducing Proof 1. hsn.uk.net. Contents Contents 1 1 Introduction 1 What is proof? 1 Statements, Definitions and Euler Diagrams 1 Statements 1 Definitions Our first proof Euler diagrams 4 3 Logical Connectives 5 Negation 6 Conjunction 7 Disjunction

More information

Sampling Distributions. Introduction to Inference

Sampling Distributions. Introduction to Inference Sampling Distributions Introduction to Inference Parameter A parameter is a number that describes the population. A parameter always exists but in practice we rarely know it s value because we cannot examine

More information

Men. Women. Men. Men. Women. Women

Men. Women. Men. Men. Women. Women Math 203 Topics for second exam Statistics: the science of data Chapter 5: Producing data Statistics is all about drawing conclusions about the opinions/behavior/structure of large populations based on

More information

Hyperreal Numbers: An Elementary Inquiry-Based Introduction. Handouts for a course from Canada/USA Mathcamp Don Laackman

Hyperreal Numbers: An Elementary Inquiry-Based Introduction. Handouts for a course from Canada/USA Mathcamp Don Laackman Hyperreal Numbers: An Elementary Inquiry-Based Introduction Handouts for a course from Canada/USA Mathcamp 2017 Don Laackman MATHCAMP, WEEK 3: HYPERREAL NUMBERS DAY 1: BIG AND LITTLE DON & TIM! Problem

More information

Lesson 6-1: Relations and Functions

Lesson 6-1: Relations and Functions I ll bet you think numbers are pretty boring, don t you? I ll bet you think numbers have no life. For instance, numbers don t have relationships do they? And if you had no relationships, life would be

More information

Test 3 SOLUTIONS. x P(x) xp(x)

Test 3 SOLUTIONS. x P(x) xp(x) 16 1. A couple of weeks ago in class, each of you took three quizzes where you randomly guessed the answers to each question. There were eight questions on each quiz, and four possible answers to each

More information

Lecture 5. 1 Review (Pairwise Independence and Derandomization)

Lecture 5. 1 Review (Pairwise Independence and Derandomization) 6.842 Randomness and Computation September 20, 2017 Lecture 5 Lecturer: Ronitt Rubinfeld Scribe: Tom Kolokotrones 1 Review (Pairwise Independence and Derandomization) As we discussed last time, we can

More information

Elementary Statistics

Elementary Statistics Elementary Statistics Q: What is data? Q: What does the data look like? Q: What conclusions can we draw from the data? Q: Where is the middle of the data? Q: Why is the spread of the data important? Q:

More information

Preptests 55 Answers and Explanations (By Ivy Global) Section 4 Logic Games

Preptests 55 Answers and Explanations (By Ivy Global) Section 4 Logic Games Section 4 Logic Games Questions 1 6 There aren t too many deductions we can make in this game, and it s best to just note how the rules interact and save your time for answering the questions. 1. Type

More information

Chapter 23. Inference About Means

Chapter 23. Inference About Means Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that

More information

Chapter 5 Least Squares Regression

Chapter 5 Least Squares Regression Chapter 5 Least Squares Regression A Royal Bengal tiger wandered out of a reserve forest. We tranquilized him and want to take him back to the forest. We need an idea of his weight, but have no scale!

More information

Polling and sampling. Clement de Chaisemartin and Douglas G. Steigerwald UCSB

Polling and sampling. Clement de Chaisemartin and Douglas G. Steigerwald UCSB Polling and sampling. Clement de Chaisemartin and Douglas G. Steigerwald UCSB 1 What pollsters do Pollsters want to answer difficult questions: As of November 1 st 2016, what % of the Pennsylvania electorate

More information

Commentary. Regression toward the mean: a fresh look at an old story

Commentary. Regression toward the mean: a fresh look at an old story Regression toward the mean: a fresh look at an old story Back in time, when I took a statistics course from Professor G., I encountered regression toward the mean for the first time. a I did not understand

More information

Mean/Average Median Mode Range

Mean/Average Median Mode Range Normal Curves Today s Goals Normal curves! Before this we need a basic review of statistical terms. I mean basic as in underlying, not easy. We will learn how to retrieve statistical data from normal curves.

More information

Discrete Mathematics and Probability Theory Summer 2014 James Cook Midterm 1

Discrete Mathematics and Probability Theory Summer 2014 James Cook Midterm 1 CS 70 Discrete Mathematics and Probability Theory Summer 2014 James Cook Midterm 1 Thursday July 17, 2014, 12:40pm-2:00pm. Instructions: Do not turn over this page until the proctor tells you to. Don t

More information

CHAPTER 18 SAMPLING DISTRIBUTION MODELS STAT 203

CHAPTER 18 SAMPLING DISTRIBUTION MODELS STAT 203 1 CHAPTER 18 SAMPLING DISTRIBUTION MODELS STAT 203 Outline 2 Sampling Distribution for Proportions Sample Proportions The mean The standard deviation The Distribution Model Assumptions and Conditions Sampling

More information

QUADRATICS 3.2 Breaking Symmetry: Factoring

QUADRATICS 3.2 Breaking Symmetry: Factoring QUADRATICS 3. Breaking Symmetry: Factoring James Tanton SETTING THE SCENE Recall that we started our story of symmetry with a rectangle of area 36. Then you would say that the picture is wrong and that

More information

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests:

One sided tests. An example of a two sided alternative is what we ve been using for our two sample tests: One sided tests So far all of our tests have been two sided. While this may be a bit easier to understand, this is often not the best way to do a hypothesis test. One simple thing that we can do to get

More information

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math.

Regression, part II. I. What does it all mean? A) Notice that so far all we ve done is math. Regression, part II I. What does it all mean? A) Notice that so far all we ve done is math. 1) One can calculate the Least Squares Regression Line for anything, regardless of any assumptions. 2) But, if

More information

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc. Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,

More information

Introduction to Statistical Data Analysis Lecture 4: Sampling

Introduction to Statistical Data Analysis Lecture 4: Sampling Introduction to Statistical Data Analysis Lecture 4: Sampling James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1 / 30 Introduction

More information

Chapter 15. Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc.

Chapter 15. Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc. Chapter 15 Probability Rules! Copyright 2012, 2008, 2005 Pearson Education, Inc. The General Addition Rule When two events A and B are disjoint, we can use the addition rule for disjoint events from Chapter

More information

Discrete Distributions

Discrete Distributions Discrete Distributions STA 281 Fall 2011 1 Introduction Previously we defined a random variable to be an experiment with numerical outcomes. Often different random variables are related in that they have

More information

Homework 4 Solutions Math 150

Homework 4 Solutions Math 150 Homework Solutions Math 150 Enrique Treviño 3.2: (a) The table gives P (Z 1.13) = 0.1292. P (Z > 1.13) = 1 0.1292 = 0.8708. The table yields P (Z 0.18) = 0.571. (c) The table doesn t consider Z > 8 but

More information

STA 291 Lecture 16. Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal

STA 291 Lecture 16. Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal STA 291 Lecture 16 Normal distributions: ( mean and SD ) use table or web page. The sampling distribution of and are both (approximately) normal X STA 291 - Lecture 16 1 Sampling Distributions Sampling

More information

( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of

( )( b + c) = ab + ac, but it can also be ( )( a) = ba + ca. Let s use the distributive property on a couple of Factoring Review for Algebra II The saddest thing about not doing well in Algebra II is that almost any math teacher can tell you going into it what s going to trip you up. One of the first things they

More information

Direct Proofs. the product of two consecutive integers plus the larger of the two integers

Direct Proofs. the product of two consecutive integers plus the larger of the two integers Direct Proofs A direct proof uses the facts of mathematics and the rules of inference to draw a conclusion. Since direct proofs often start with premises (given information that goes beyond the facts of

More information