Central Limit Theorem Confidence Intervals Worked example #6. July 24, 2017

Size: px
Start display at page:

Download "Central Limit Theorem Confidence Intervals Worked example #6. July 24, 2017"

Transcription

1 Central Limit Theorem Confidence Intervals Worked example #6 July 24, 2017

2 10 8 Raw scores 6 4 Mean=71.4% Scaling is to add 3.6% to bring mean to 75% Scaled scores Mean = 75%

3 A B C D F Mean=71.4% (add 3.6%) Scaled Mean = 75%

4 Let's return to the Normal distribution f(x)

5 The area under the curve between two x values represent how much of the data is in that range. A B Area under curve between A and B Integral impossible, but numerical methods were used to create tables of areas under standard normal from A to B.

6 Values in the table are a values. Z a is the value Z such that there is a area to the left. Z 1-a is the value Z such that there is a area to the right. Z 1-a/2 is the value Z such that there is a/2 area to the right. Ex: Z /2 = Z = 1.96 Beware how different texts and tables use a, 1-a, a/2 and 1-a/2

7 Why do we care about areas under the Normal distribution? (1) Many populations exhibit a normal distribution Calculating areas allows us to predict how much of the population data is in a region if we have a sample mean and variance. - Lab #8

8 Recall: we transformed any normal distribution into the standard with 2 steps. (1) subtract mean (2) divide by standard deviation Consider individual data points Z-score of x = Z = (x-m)/s The Z-score allows comparison or standardization of data sets with different means and standard deviations.

9 Example, IQ: How to compare IQ test results from 2008 with 1996? Get test scores Compute Z-scores Compute an IQ score: IQ = Z Mean IQ = 100, Standard deviation = 15 ~ 66% population ~ 13% population ~ 2.5% population >130

10 Why do we care about areas under the Normal distribution? (2) Central Limit Theorem For large sample size the distribution of sample means will be normal, no matter what the actual population distribution is. - Lab #8 Distribution of data Distribution of means If we study means we can use normal distribution

11 Central limit theorem For large sample size the distribution of sample means will be normal, no matter what the actual population distribution is. Sample means ~ N(m, s 2 /n) Notice: s of the sample means is s/n 1/2 Note: the variance of the distribution of sample means depends on the size of the sample. Note: The ranges around the mean are confidence intervals, we are confident the true population mean lies within a region around our sample mean. Bigger region means more confidence but less utility

12 Central limit theorem For large sample size the distribution of sample means will be normal, no matter what the actual population distribution is. Distribution of the sample means Pop. mean

13 Central limit theorem The variance of the distribution of sample means depends on the size of the sample and population variance. which means that Standard error of the sample mean Distribution of samples with many values Distribution of samples with few values means

14 Central limit theorem The range around the sample mean is a confidence interval. We are x% confident the true population mean lies within a region around our sample mean that includes x% of the distribution. Distribution of sample means Distribution of sample means Very confident pop. mean is somewhere in this range Less confident pop. mean is somewhere in this range Tradeoff between certainty and utility Standard deviation: measures spread of sample or population data values Standard error: measures spread of potential of the population mean

15 Central limit theorem We are x% confident the true population mean lies within a region around our sample mean that includes x% of the distribution. Distribution is a normal distribution and we know how to calculate what % of area lies within a range defined by Z scores. Using the Normal distribution allows calculation of confidence intervals and quantitative statements about population mean from sample data. Since we usually don't know "s", we have to estimate it from "s". This means we can't just use Z scores we have to use t distribution (which includes uncertainty in our estimation of s via s). If we knew s, we could use this Z distribution Since s unknown, we have to use this t distribution

16 Central limit theorem We typically don't know m or s (if we did we wouldn't need to get a confidence interval because we would know m) so we use X to estimate m and s to estimate s. Sample means ~ N(X, s 2 /n) (if s is known, m unknown) Region Degree of confidence that pop. mean is in this region x ± Z /2 (s 2 /n) 1/2 68% x ± Z /2 (s 2 /n) 1/2 95% x ± Z /2 (s 2 /n) 1/2 99% Z /2 =1 Z /2 =1.96 Z /2 =2.575 Note: We use values from a Z distribution since s is known.

17 Central limit theorem We typically don't know m or s (if we did we wouldn't need to get a confidence interval because we would know m) so we use X to estimate m and s to estimate s. Sample means ~ t(x, s 2 /n) (if s is unknown, m unknown) Region Degree of confidence that pop. mean is in this region x ± t /2,df (s 2 /n) 1/2 68% x ± t /2,df (s 2 /n) 1/2 95% x ± t /2,df (s 2 /n) 1/2 99% t /2,df =varies t /2,df =varies t /2,df =varies Note: We have to use values from a t distribution since s is unknown. The "t distribution" is a bit wider than the Z to include our uncertainty in estimating s with s

18 Central limit theorem m known s known No sampling needed m unknown s known use Z distribution m unknown s unknown use t distribution Note: - As the sample size increases the t distribution becomes the Z distribution. Some naughty people forget the t distribution even exists... Caution: - Don't lose sight of the goal, to describe a region within which we are confident the population mean lies.

19 Central limit theorem The t distribution includes the uncertainty in estimate of s and is wider. Using the "t distribution" also requires us to specify the degrees of freedom (df) of the data, in this case df=n-1 t a,df you may also see t a/2,df & t 1-a/2,df in texts and tables For our t table the a refers to the area to the right. (our Z table shows area to the left) Tables often require interpolation if sample size not listed As df increases t distribution becomes the Z distribution

20 Central limit theorem Example: (1) s known = 10 Use Z distribution sample size = 20 df=na -1.96SE +1.96SE 95% confident pop. mean is somewhere in this range 95% CI, confidence interval (2) s unknown Use t distribution s = 10 sample size = 20 df=20-1= se SE 95% CI (3) s unknown Use t distribution s = 10 sample size = 101 df=101-1= se SE 95% CI

21 HANDOUT #6 Consider a medication that may increase the clotting time of patients taking it. We need reliable data on the usual clotting time (CT) of individuals not taking the medication in order to determine whether the medication is effective. We are unable to regularly measure the CT of every individual in the population of people not taking the medication so we will estimate the population parameter, mean clotting time (CT), with a sample from the general population. For the first set of questions we have access to detailed physiological data from the NIH and we know that the population standard deviation of CT values in humans is 3.2 seconds Sample data: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. What is the region in which there is a 95% chance that the true population mean CT lies? What is the region in which there is a 99% chance that the true population mean CT lies? If we have a larger sample: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. What is the region in which there is a 95% chance that the true population mean CT lies?

22 HANDOUT #6 note: we are interested in a/2 area to the right, but our Z table shows 1-a/2 area to the left. Consider a medication that may increase the clotting time of patients taking it. We need reliable data on the usual clotting time (CT) of individuals not taking the medication in order to determine whether the medication is effective. We are unable to regularly measure the CT of every individual in the population of people not taking the medication so we will estimate the population parameter, mean clotting time (CT), with a sample from the general population. For the first set of questions we have access to detailed physiological data from the NIH and we know that the population standard deviation of CT values in humans is 3.2 seconds Sample data: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. What is the region in which there is a 95% chance that the true population mean CT lies? Z 1-a/2 = Z /2 = Z = 1.96 from our Z table What is the region in which there is a 99% chance that the true population mean CT lies? If we have a larger sample: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. What is the region in which there is a 95% chance that the true population mean CT lies? sample mean=20, pop s = 3.2 X ± 1.96(s/n 1/2 ) 20 ± 1.96(3.2/12 1/2 ) 20 ± 1.81, {18.19,21.81} Z 1-a/2 = Z /2 = Z = from our Z table X ± 2.575(s/n 1/2 ) 20 ± 2.575(3.2/12 1/2 ) 20 ± 2.38, {17.62,22.38} Z 1-a/2 = Z /2 = Z = 1.96 from our Z table X ± 1.96(s/n 1/2 ) 20 ± 1.96(3.2/36 1/2 ) 20 ± 1.05, {18.95,21.05} note: with a larger sample size the confidence interval for the same degree of confidence is narrower because the SE is smaller.

23 HANDOUT #6 For the second set of questions we consider the more realistic situation in which we don't know the population standard deviation and have to estimate it from the sample data Sample data: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. What is the region in which there is a 95% chance that the true population mean CT lies? What is the region in which there is a 99% chance that the true population mean CT lies? If we have a larger sample: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. What is the region in which there is a 95% chance that the true population mean CT lies?

24 HANDOUT #6 note: instead of our table showing 1-a to the left like the Z table does, the t table shows values for a area to the right. For the second set of questions we consider the more realistic situation in which we don't know the population standard deviation and have to estimate it from the sample data Sample data: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. sample mean=20, sample s = What is the region in which there is a 95% chance that the true population mean CT lies? t a/2,df = t 0.05/2,11 = t 0.025,11 = from our t table X ± 2.201(s/n 1/2 ) 20 ± 2.201(3.133/12 1/2 ) 20 ± 1.99, {18.01,21.99} What is the region in which there is a 99% chance that the true population mean CT lies? t a/2 = t 0.01/2,11 = t 0.005,11 = from our t table X ± 3.106(s/n 1/2 ) 20 ± 3.106(3.133/12 1/2 ) 20 ± 2.81, {17.19,22.81} If we have a larger sample: 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19, 18, 20, 22, 23, 26, 17, 14, 22, 18, 21, 20, 19. What is the region in which there is a 95% chance that the true population mean CT lies? note: our t table does not have an entry for df=35 so we round down and use the df=30 value from our table. t a/2,df = t 0.05/2,35 = t 0.025,30 = from our t table X ± 2.042(s/n 1/2 ) 20 ± 2.042(3.04/36 1/2 ) 20 ± 1.04, {18.96,21.04} note: the SD is also smaller for the larger sample because of the relatively larger denominator; as n-1 increases it approaches n.

25

26

27

28

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

Chapter 8 - Statistical intervals for a single sample

Chapter 8 - Statistical intervals for a single sample Chapter 8 - Statistical intervals for a single sample 8-1 Introduction In statistics, no quantity estimated from data is known for certain. All estimated quantities have probability distributions of their

More information

Statistical Inference

Statistical Inference Chapter 14 Confidence Intervals: The Basic Statistical Inference Situation: We are interested in estimating some parameter (population mean, μ) that is unknown. We take a random sample from this population.

More information

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc.

Chapter 23. Inferences About Means. Monday, May 6, 13. Copyright 2009 Pearson Education, Inc. Chapter 23 Inferences About Means Sampling Distributions of Means Now that we know how to create confidence intervals and test hypotheses about proportions, we do the same for means. Just as we did before,

More information

The independent-means t-test:

The independent-means t-test: The independent-means t-test: Answers the question: is there a "real" difference between the two conditions in my experiment? Or is the difference due to chance? Previous lecture: (a) Dependent-means t-test:

More information

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples Objective Section 9.4 Inferences About Two Means (Matched Pairs) Compare of two matched-paired means using two samples from each population. Hypothesis Tests and Confidence Intervals of two dependent means

More information

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up?

Correlation. We don't consider one variable independent and the other dependent. Does x go up as y goes up? Does x go down as y goes up? Comment: notes are adapted from BIOL 214/312. I. Correlation. Correlation A) Correlation is used when we want to examine the relationship of two continuous variables. We are not interested in prediction.

More information

Confidence Intervals. - simply, an interval for which we have a certain confidence.

Confidence Intervals. - simply, an interval for which we have a certain confidence. Confidence Intervals I. What are confidence intervals? - simply, an interval for which we have a certain confidence. - for example, we are 90% certain that an interval contains the true value of something

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny April 6 Patrick Breheny University of Iowa to Biostatistics (BIOS 4120) 1 / 36 Our next several lectures will deal with two-sample inference for continuous

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 03 The Chi-Square Distributions Dr. Neal, Spring 009 The chi-square distributions can be used in statistics to analyze the standard deviation of a normally distributed measurement and to test the

More information

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t

Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t Lecture 26: Chapter 10, Section 2 Inference for Quantitative Variable Confidence Interval with t t Confidence Interval for Population Mean Comparing z and t Confidence Intervals When neither z nor t Applies

More information

The Chi-Square Distributions

The Chi-Square Distributions MATH 183 The Chi-Square Distributions Dr. Neal, WKU The chi-square distributions can be used in statistics to analyze the standard deviation σ of a normally distributed measurement and to test the goodness

More information

Measurement And Uncertainty

Measurement And Uncertainty Measurement And Uncertainty Based on Guidelines for Evaluating and Expressing the Uncertainty of NIST Measurement Results, NIST Technical Note 1297, 1994 Edition PHYS 407 1 Measurement approximates or

More information

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing

Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing Notes 3: Statistical Inference: Sampling, Sampling Distributions Confidence Intervals, and Hypothesis Testing 1. Purpose of statistical inference Statistical inference provides a means of generalizing

More information

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean

Interval estimation. October 3, Basic ideas CLT and CI CI for a population mean CI for a population proportion CI for a Normal mean Interval estimation October 3, 2018 STAT 151 Class 7 Slide 1 Pandemic data Treatment outcome, X, from n = 100 patients in a pandemic: 1 = recovered and 0 = not recovered 1 1 1 0 0 0 1 1 1 0 0 1 0 1 0 0

More information

Mathematical Notation Math Introduction to Applied Statistics

Mathematical Notation Math Introduction to Applied Statistics Mathematical Notation Math 113 - Introduction to Applied Statistics Name : Use Word or WordPerfect to recreate the following documents. Each article is worth 10 points and should be emailed to the instructor

More information

5 Basic Steps in Any Hypothesis Test

5 Basic Steps in Any Hypothesis Test 5 Basic Steps in Any Hypothesis Test Step 1: Determine hypotheses (H0 and Ha). H0: μ d = 0 (μ 1 μ 2 =0) Ha: μ d > 0 (μ 1 μ 2 >0) upper-sided Ha: : μ d 0 (μ 1 μ 2 0) two-sided Step 2: Verify necessary conditions,

More information

Business Statistics. Lecture 5: Confidence Intervals

Business Statistics. Lecture 5: Confidence Intervals Business Statistics Lecture 5: Confidence Intervals Goals for this Lecture Confidence intervals The t distribution 2 Welcome to Interval Estimation! Moments Mean 815.0340 Std Dev 0.8923 Std Error Mean

More information

Sampling Distributions: Central Limit Theorem

Sampling Distributions: Central Limit Theorem Review for Exam 2 Sampling Distributions: Central Limit Theorem Conceptually, we can break up the theorem into three parts: 1. The mean (µ M ) of a population of sample means (M) is equal to the mean (µ)

More information

Review. One-way ANOVA, I. What s coming up. Multiple comparisons

Review. One-way ANOVA, I. What s coming up. Multiple comparisons Review One-way ANOVA, I 9.07 /15/00 Earlier in this class, we talked about twosample z- and t-tests for the difference between two conditions of an independent variable Does a trial drug work better than

More information

Statistics Introductory Correlation

Statistics Introductory Correlation Statistics Introductory Correlation Session 10 oscardavid.barrerarodriguez@sciencespo.fr April 9, 2018 Outline 1 Statistics are not used only to describe central tendency and variability for a single variable.

More information

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests

1 Descriptive statistics. 2 Scores and probability distributions. 3 Hypothesis testing and one-sample t-test. 4 More on t-tests Overall Overview INFOWO Statistics lecture S3: Hypothesis testing Peter de Waal Department of Information and Computing Sciences Faculty of Science, Universiteit Utrecht 1 Descriptive statistics 2 Scores

More information

Ch. 7. One sample hypothesis tests for µ and σ

Ch. 7. One sample hypothesis tests for µ and σ Ch. 7. One sample hypothesis tests for µ and σ Prof. Tesler Math 18 Winter 2019 Prof. Tesler Ch. 7: One sample hypoth. tests for µ, σ Math 18 / Winter 2019 1 / 23 Introduction Data Consider the SAT math

More information

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran

Statistics and Quantitative Analysis U4320. Segment 5: Sampling and inference Prof. Sharyn O Halloran Statistics and Quantitative Analysis U4320 Segment 5: Sampling and inference Prof. Sharyn O Halloran Sampling A. Basics 1. Ways to Describe Data Histograms Frequency Tables, etc. 2. Ways to Characterize

More information

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size

Student s t-distribution. The t-distribution, t-tests, & Measures of Effect Size Student s t-distribution The t-distribution, t-tests, & Measures of Effect Size Sampling Distributions Redux Chapter 7 opens with a return to the concept of sampling distributions from chapter 4 Sampling

More information

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015

AMS7: WEEK 7. CLASS 1. More on Hypothesis Testing Monday May 11th, 2015 AMS7: WEEK 7. CLASS 1 More on Hypothesis Testing Monday May 11th, 2015 Testing a Claim about a Standard Deviation or a Variance We want to test claims about or 2 Example: Newborn babies from mothers taking

More information

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence

More information

Sampling distributions:

Sampling distributions: Sampling distributions: In Psychology we generally make inferences about populations on the basis of limited samples. We therefore need to know what relationship exists between samples and populations.

More information

Data analysis and Geostatistics - lecture VI

Data analysis and Geostatistics - lecture VI Data analysis and Geostatistics - lecture VI Statistical testing with population distributions Statistical testing - the steps 1. Define a hypothesis to test in statistics only a hypothesis rejection is

More information

Latent Trait Reliability

Latent Trait Reliability Latent Trait Reliability Lecture #7 ICPSR Item Response Theory Workshop Lecture #7: 1of 66 Lecture Overview Classical Notions of Reliability Reliability with IRT Item and Test Information Functions Concepts

More information

QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100%

QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 2013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% QUIZ 4 (CHAPTER 7) - SOLUTIONS MATH 119 SPRING 013 KUNIYUKI 105 POINTS TOTAL, BUT 100 POINTS = 100% 1) We want to conduct a study to estimate the mean I.Q. of a pop singer s fans. We want to have 96% confidence

More information

Today - SPSS and standard error - End of Midterm 1 exam material - T-scores

Today - SPSS and standard error - End of Midterm 1 exam material - T-scores Today - SPSS and standard error - End of Midterm 1 exam material - T-scores Previously, on StatsClass: The standard error is a measure of the typical amount that that a sample mean will be off from the

More information

Chapter 23. Inference About Means

Chapter 23. Inference About Means Chapter 23 Inference About Means 1 /57 Homework p554 2, 4, 9, 10, 13, 15, 17, 33, 34 2 /57 Objective Students test null and alternate hypotheses about a population mean. 3 /57 Here We Go Again Now that

More information

Chapter 5 Least Squares Regression

Chapter 5 Least Squares Regression Chapter 5 Least Squares Regression A Royal Bengal tiger wandered out of a reserve forest. We tranquilized him and want to take him back to the forest. We need an idea of his weight, but have no scale!

More information

= X+(z.95)SE(X) X-(z.95)SE(X)

= X+(z.95)SE(X) X-(z.95)SE(X) BIOSTATS 540 Fall 05 Introductory Biostatistics Page of 6 Unit 6 Estimation Homework #0 (Unit 6 Estimation, part of ) SOLUTIONS. Alzheimers disease has a poorer prognosis when it is diagnosed at a relatively

More information

Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe!

Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe! Statistical Analysis How do we know if it works? Group workbook: Cartoon from XKCD.com. Subscribe! http://www.xkcd.com/552/ Significant Concepts We structure the presentation and processing of data to

More information

Chapter 5 Confidence Intervals

Chapter 5 Confidence Intervals Chapter 5 Confidence Intervals Confidence Intervals about a Population Mean, σ, Known Abbas Motamedi Tennessee Tech University A point estimate: a single number, calculated from a set of data, that is

More information

Uncertainty: A Reading Guide and Self-Paced Tutorial

Uncertainty: A Reading Guide and Self-Paced Tutorial Uncertainty: A Reading Guide and Self-Paced Tutorial First, read the description of uncertainty at the Experimental Uncertainty Review link on the Physics 108 web page, up to and including Rule 6, making

More information

Question. z-scores. What We Will Cover in This Section. On which of the following tests did Pat do best compared to the other students?

Question. z-scores. What We Will Cover in This Section. On which of the following tests did Pat do best compared to the other students? z-scores 9/17/2003 P225 Z-scores 1 What We Will Cover in This Section What a z-score is. Computation. Properties. Assumptions. Uses 9/17/2003 P225 Z-scores 2 Question On which of the following tests did

More information

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION FOR SAMPLE OF RAW DATA (E.G. 4, 1, 7, 5, 11, 6, 9, 7, 11, 5, 4, 7) BE ABLE TO COMPUTE MEAN G / STANDARD DEVIATION MEDIAN AND QUARTILES Σ ( Σ) / 1 GROUPED DATA E.G. AGE FREQ. 0-9 53 10-19 4...... 80-89

More information

Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal.

Math 124: Modules Overall Goal. Point Estimations. Interval Estimation. Math 124: Modules Overall Goal. What we will do today s David Meredith Department of Mathematics San Francisco State University October 22, 2009 s 1 2 s 3 What is a? Decision support Political decisions s s Goal of statistics: optimize

More information

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS

T.I.H.E. IT 233 Statistics and Probability: Sem. 1: 2013 ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS ESTIMATION AND HYPOTHESIS TESTING OF TWO POPULATIONS In our work on hypothesis testing, we used the value of a sample statistic to challenge an accepted value of a population parameter. We focused only

More information

7.1 Basic Properties of Confidence Intervals

7.1 Basic Properties of Confidence Intervals 7.1 Basic Properties of Confidence Intervals What s Missing in a Point Just a single estimate What we need: how reliable it is Estimate? No idea how reliable this estimate is some measure of the variability

More information

Multiple Regression Analysis

Multiple Regression Analysis Multiple Regression Analysis y = β 0 + β 1 x 1 + β 2 x 2 +... β k x k + u 2. Inference 0 Assumptions of the Classical Linear Model (CLM)! So far, we know: 1. The mean and variance of the OLS estimators

More information

Probability and Samples. Sampling. Point Estimates

Probability and Samples. Sampling. Point Estimates Probability and Samples Sampling We want the results from our sample to be true for the population and not just the sample But our sample may or may not be representative of the population Sampling error

More information

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b).

Note that we are looking at the true mean, μ, not y. The problem for us is that we need to find the endpoints of our interval (a, b). Confidence Intervals 1) What are confidence intervals? Simply, an interval for which we have a certain confidence. For example, we are 90% certain that an interval contains the true value of something

More information

Using SPSS for One Way Analysis of Variance

Using SPSS for One Way Analysis of Variance Using SPSS for One Way Analysis of Variance This tutorial will show you how to use SPSS version 12 to perform a one-way, between- subjects analysis of variance and related post-hoc tests. This tutorial

More information

Typing Equations in MS Word 2010

Typing Equations in MS Word 2010 CM3215 Fundamentals of Chemical Engineering Laboratory Typing Equations in MS Word 2010 https://www.youtube.com/watch?v=cenp9mehtmy Professor Faith Morrison Department of Chemical Engineering Michigan

More information

Design of Experiments. Factorial experiments require a lot of resources

Design of Experiments. Factorial experiments require a lot of resources Design of Experiments Factorial experiments require a lot of resources Sometimes real-world practical considerations require us to design experiments in specialized ways. The design of an experiment is

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Repeated-Measures ANOVA 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region

More information

Confidence Intervals for the Sample Mean

Confidence Intervals for the Sample Mean Confidence Intervals for the Sample Mean As we saw before, parameter estimators are themselves random variables. If we are going to make decisions based on these uncertain estimators, we would benefit

More information

An inferential procedure to use sample data to understand a population Procedures

An inferential procedure to use sample data to understand a population Procedures Hypothesis Test An inferential procedure to use sample data to understand a population Procedures Hypotheses, the alpha value, the critical region (z-scores), statistics, conclusion Two types of errors

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

MS&E 226: Small Data

MS&E 226: Small Data MS&E 226: Small Data Lecture 15: Examples of hypothesis tests (v5) Ramesh Johari ramesh.johari@stanford.edu 1 / 32 The recipe 2 / 32 The hypothesis testing recipe In this lecture we repeatedly apply the

More information

Unit 9 ONE Sample Inference SOLUTIONS

Unit 9 ONE Sample Inference SOLUTIONS BIOSTATS 540 Fall 017 Introductory Biostatistics Solutions Unit 9 ONE Sample Inference SOLUTIONS 1. This exercise gives you practice calculating a confidence interval for the mean of a Normal distribution

More information

MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS MEASURES OF CENTRAL TENDENCY. In an experiment, these are applied to the dependent variable (DV)

MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS MEASURES OF CENTRAL TENDENCY. In an experiment, these are applied to the dependent variable (DV) MALLOY PSYCH 3000 MEAN & VARIANCE PAGE 1 STATISTICS Descriptive statistics Inferential statistics MEASURES OF CENTRAL TENDENCY In an experiment, these are applied to the dependent variable (DV) E.g., MEASURES

More information

Chapter 16 One-way Analysis of Variance

Chapter 16 One-way Analysis of Variance Chapter 16 One-way Analysis of Variance I am assuming that most people would prefer to see the solutions to these problems as computer printout. (I will use R and SPSS for consistency.) 16.1 Analysis of

More information

Statistics: revision

Statistics: revision NST 1B Experimental Psychology Statistics practical 5 Statistics: revision Rudolf Cardinal & Mike Aitken 29 / 30 April 2004 Department of Experimental Psychology University of Cambridge Handouts: Answers

More information

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals

Last two weeks: Sample, population and sampling distributions finished with estimation & confidence intervals Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last two weeks: Sample, population and sampling

More information

Two-Sample Inferential Statistics

Two-Sample Inferential Statistics The t Test for Two Independent Samples 1 Two-Sample Inferential Statistics In an experiment there are two or more conditions One condition is often called the control condition in which the treatment is

More information

Harvard University. Rigorous Research in Engineering Education

Harvard University. Rigorous Research in Engineering Education Statistical Inference Kari Lock Harvard University Department of Statistics Rigorous Research in Engineering Education 12/3/09 Statistical Inference You have a sample and want to use the data collected

More information

Hypothesis testing: Steps

Hypothesis testing: Steps Review for Exam 2 Hypothesis testing: Steps Exam 2 Review 1. Determine appropriate test and hypotheses 2. Use distribution table to find critical statistic value(s) representing rejection region 3. Compute

More information

Confidence Intervals with σ unknown

Confidence Intervals with σ unknown STAT 141 Confidence Intervals and Hypothesis Testing 10/26/04 Today (Chapter 7): CI with σ unknown, t-distribution CI for proportions Two sample CI with σ known or unknown Hypothesis Testing, z-test Confidence

More information

Two-sample inference: Continuous data

Two-sample inference: Continuous data Two-sample inference: Continuous data Patrick Breheny November 11 Patrick Breheny STA 580: Biostatistics I 1/32 Introduction Our next two lectures will deal with two-sample inference for continuous data

More information

Bio 183 Statistics in Research. B. Cleaning up your data: getting rid of problems

Bio 183 Statistics in Research. B. Cleaning up your data: getting rid of problems Bio 183 Statistics in Research A. Research designs B. Cleaning up your data: getting rid of problems C. Basic descriptive statistics D. What test should you use? What is science?: Science is a way of knowing.(anon.?)

More information

Lecture 2. Estimating Single Population Parameters 8-1

Lecture 2. Estimating Single Population Parameters 8-1 Lecture 2 Estimating Single Population Parameters 8-1 8.1 Point and Confidence Interval Estimates for a Population Mean Point Estimate A single statistic, determined from a sample, that is used to estimate

More information

Statistical Intervals (One sample) (Chs )

Statistical Intervals (One sample) (Chs ) 7 Statistical Intervals (One sample) (Chs 8.1-8.3) Confidence Intervals The CLT tells us that as the sample size n increases, the sample mean X is close to normally distributed with expected value µ and

More information

Confidence Intervals for Normal Data Spring 2018

Confidence Intervals for Normal Data Spring 2018 Confidence Intervals for Normal Data 18.05 Spring 2018 Agenda Exam on Monday April 30. Practice questions posted. Friday s class is for review (no studio) Today Review of critical values and quantiles.

More information

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals

Last week: Sample, population and sampling distributions finished with estimation & confidence intervals Past weeks: Measures of central tendency (mean, mode, median) Measures of dispersion (standard deviation, variance, range, etc). Working with the normal curve Last week: Sample, population and sampling

More information

Chapter Eight: Assessment of Relationships 1/42

Chapter Eight: Assessment of Relationships 1/42 Chapter Eight: Assessment of Relationships 1/42 8.1 Introduction 2/42 Background This chapter deals, primarily, with two topics. The Pearson product-moment correlation coefficient. The chi-square test

More information

Confidence intervals

Confidence intervals Confidence intervals We now want to take what we ve learned about sampling distributions and standard errors and construct confidence intervals. What are confidence intervals? Simply an interval for which

More information

The t-statistic. Student s t Test

The t-statistic. Student s t Test The t-statistic 1 Student s t Test When the population standard deviation is not known, you cannot use a z score hypothesis test Use Student s t test instead Student s t, or t test is, conceptually, very

More information

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p).

We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation, Y ~ BIN(n,p). Sampling distributions and estimation. 1) A brief review of distributions: We're in interested in Pr{three sixes when throwing a single dice 8 times}. => Y has a binomial distribution, or in official notation,

More information

Psychology 282 Lecture #4 Outline Inferences in SLR

Psychology 282 Lecture #4 Outline Inferences in SLR Psychology 282 Lecture #4 Outline Inferences in SLR Assumptions To this point we have not had to make any distributional assumptions. Principle of least squares requires no assumptions. Can use correlations

More information

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals

Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals Introduction to Statistical Data Analysis Lecture 5: Confidence Intervals James V. Lambers Department of Mathematics The University of Southern Mississippi James V. Lambers Statistical Data Analysis 1

More information

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8

Population Variance. Concepts from previous lectures. HUMBEHV 3HB3 one-sample t-tests. Week 8 Concepts from previous lectures HUMBEHV 3HB3 one-sample t-tests Week 8 Prof. Patrick Bennett sampling distributions - sampling error - standard error of the mean - degrees-of-freedom Null and alternative/research

More information

Estimating a Population Mean

Estimating a Population Mean Estimating a Population Mean MATH 130, Elements of Statistics I J. Robert Buchanan Department of Mathematics Fall 2017 Objectives At the end of this lesson we will be able to: obtain a point estimate for

More information

Lab #12: Exam 3 Review Key

Lab #12: Exam 3 Review Key Psychological Statistics Practice Lab#1 Dr. M. Plonsky Page 1 of 7 Lab #1: Exam 3 Review Key 1) a. Probability - Refers to the likelihood that an event will occur. Ranges from 0 to 1. b. Sampling Distribution

More information

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1

Notes for Week 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Notes for Wee 13 Analysis of Variance (ANOVA) continued WEEK 13 page 1 Exam 3 is on Friday May 1. A part of one of the exam problems is on Predictiontervals : When randomly sampling from a normal population

More information

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound

Z score indicates how far a raw score deviates from the sample mean in SD units. score Mean % Lower Bound 1 EDUR 8131 Chat 3 Notes 2 Normal Distribution and Standard Scores Questions Standard Scores: Z score Z = (X M) / SD Z = deviation score divided by standard deviation Z score indicates how far a raw score

More information

Section B. The Theoretical Sampling Distribution of the Sample Mean and Its Estimate Based on a Single Sample

Section B. The Theoretical Sampling Distribution of the Sample Mean and Its Estimate Based on a Single Sample This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike License. Your use of this material constitutes acceptance of that license and the conditions of use of materials on this

More information

Institute of Actuaries of India

Institute of Actuaries of India Institute of Actuaries of India Subject CT3 Probability & Mathematical Statistics May 2011 Examinations INDICATIVE SOLUTION Introduction The indicative solution has been written by the Examiners with the

More information

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.

Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio. Answers to Items from Problem Set 1 Item 1 Identify the scale of measurement most appropriate for each of the following variables. (Use A = nominal, B = ordinal, C = interval, D = ratio.) a. response latency

More information

STAT 285: Fall Semester Final Examination Solutions

STAT 285: Fall Semester Final Examination Solutions Name: Student Number: STAT 285: Fall Semester 2014 Final Examination Solutions 5 December 2014 Instructor: Richard Lockhart Instructions: This is an open book test. As such you may use formulas such as

More information

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College

ANOVA Situation The F Statistic Multiple Comparisons. 1-Way ANOVA MATH 143. Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College An example ANOVA situation Example (Treating Blisters) Subjects: 25 patients with blisters Treatments: Treatment A, Treatment

More information

We need to define some concepts that are used in experiments.

We need to define some concepts that are used in experiments. Chapter 0 Analysis of Variance (a.k.a. Designing and Analysing Experiments) Section 0. Introduction In Chapter we mentioned some different ways in which we could get data: Surveys, Observational Studies,

More information

STAT 201 Assignment 6

STAT 201 Assignment 6 STAT 201 Assignment 6 Partial Solutions 12.1 Research question: Do parents in the school district support the new education program? Parameter: p = proportion of all parents in the school district who

More information

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College

1-Way ANOVA MATH 143. Spring Department of Mathematics and Statistics Calvin College 1-Way ANOVA MATH 143 Department of Mathematics and Statistics Calvin College Spring 2010 The basic ANOVA situation Two variables: 1 Categorical, 1 Quantitative Main Question: Do the (means of) the quantitative

More information

2011 Pearson Education, Inc

2011 Pearson Education, Inc Statistics for Business and Economics Chapter 7 Inferences Based on Two Samples: Confidence Intervals & Tests of Hypotheses Content 1. Identifying the Target Parameter 2. Comparing Two Population Means:

More information

Chapter 8 Handout: Interval Estimates and Hypothesis Testing

Chapter 8 Handout: Interval Estimates and Hypothesis Testing Chapter 8 Handout: Interval Estimates and Hypothesis esting Preview Clint s Assignment: aking Stock General Properties of the Ordinary Least Squares (OLS) Estimation Procedure Estimate Reliability: Interval

More information

Descriptive Statistics

Descriptive Statistics Descriptive Statistics Once an experiment is carried out and the results are measured, the researcher has to decide whether the results of the treatments are different. This would be easy if the results

More information

+ Specify 1 tail / 2 tail

+ Specify 1 tail / 2 tail Week 2: Null hypothesis Aeroplane seat designer wonders how wide to make the plane seats. He assumes population average hip size μ = 43.2cm Sample size n = 50 Question : Is the assumption μ = 43.2cm reasonable?

More information

One-factor analysis of variance (ANOVA)

One-factor analysis of variance (ANOVA) One-factor analysis of variance (ANOVA) March 1, 2017 psych10.stanford.edu Announcements / Action Items Schedule update: final R lab moved to Week 10 Optional Survey 5 coming soon, due on Saturday Last

More information

Sociology 6Z03 Review II

Sociology 6Z03 Review II Sociology 6Z03 Review II John Fox McMaster University Fall 2016 John Fox (McMaster University) Sociology 6Z03 Review II Fall 2016 1 / 35 Outline: Review II Probability Part I Sampling Distributions Probability

More information

Treatment of Error in Experimental Measurements

Treatment of Error in Experimental Measurements in Experimental Measurements All measurements contain error. An experiment is truly incomplete without an evaluation of the amount of error in the results. In this course, you will learn to use some common

More information

Advanced Experimental Design

Advanced Experimental Design Advanced Experimental Design Topic Four Hypothesis testing (z and t tests) & Power Agenda Hypothesis testing Sampling distributions/central limit theorem z test (σ known) One sample z & Confidence intervals

More information

Announcements. Unit 7: Multiple linear regression Lecture 3: Confidence and prediction intervals + Transformations. Uncertainty of predictions

Announcements. Unit 7: Multiple linear regression Lecture 3: Confidence and prediction intervals + Transformations. Uncertainty of predictions Housekeeping Announcements Unit 7: Multiple linear regression Lecture 3: Confidence and prediction intervals + Statistics 101 Mine Çetinkaya-Rundel November 25, 2014 Poster presentation location: Section

More information

Topic 2 Measurement and Calculations in Chemistry

Topic 2 Measurement and Calculations in Chemistry Topic Measurement and Calculations in Chemistry Nature of Measurement Quantitative observation consisting of two parts. number scale (unit) Examples 0 grams 6.63 10 34 joule seconds The Fundamental SI

More information

Chapter Seven: Multi-Sample Methods 1/52

Chapter Seven: Multi-Sample Methods 1/52 Chapter Seven: Multi-Sample Methods 1/52 7.1 Introduction 2/52 Introduction The independent samples t test and the independent samples Z test for a difference between proportions are designed to analyze

More information

The Central Limit Theorem

The Central Limit Theorem - The Central Limit Theorem Definition Sampling Distribution of the Mean the probability distribution of sample means, with all samples having the same sample size n. (In general, the sampling distribution

More information