Stats Review Chapter 14. Mary Stangler Center for Academic Success Revised 8/16

Similar documents
Inferences for Regression

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Regression Analysis II

Assumptions, Diagnostics, and Inferences for the Simple Linear Regression Model with Normal Residuals

Measuring the fit of the model - SSR

Stats Review Chapter 3. Mary Stangler Center for Academic Success Revised 8/16

Chapter 14 Simple Linear Regression (A)

Two-Sample Inference for Proportions and Inference for Linear Regression

Visual interpretation with normal approximation

LAB 2. HYPOTHESIS TESTING IN THE BIOLOGICAL SCIENCES- Part 2

MBA 605, Business Analytics Donald D. Conant, Ph.D. Master of Business Administration

LECTURE 5. Introduction to Econometrics. Hypothesis testing

df=degrees of freedom = n - 1

The goodness-of-fit test Having discussed how to make comparisons between two proportions, we now consider comparisons of multiple proportions.

Inference for Regression

DATA IN SERIES AND TIME I. Several different techniques depending on data and what one wants to do

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

CBA4 is live in practice mode this week exam mode from Saturday!

7.2 One-Sample Correlation ( = a) Introduction. Correlation analysis measures the strength and direction of association between

STAT 512 MidTerm I (2/21/2013) Spring 2013 INSTRUCTIONS

EC2001 Econometrics 1 Dr. Jose Olmo Room D309

STA 101 Final Review

Psychology 282 Lecture #4 Outline Inferences in SLR

Inference for Regression Simple Linear Regression

Lecture 3: Inference in SLR

Econ 3790: Business and Economic Statistics. Instructor: Yogesh Uppal

A discussion on multiple regression models

Review: General Approach to Hypothesis Testing. 1. Define the research question and formulate the appropriate null and alternative hypotheses.

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Mathematical Notation Math Introduction to Applied Statistics

Midterm 2 - Solutions

Linear Correlation and Regression Analysis

GROUPED DATA E.G. FOR SAMPLE OF RAW DATA (E.G. 4, 12, 7, 5, MEAN G x / n STANDARD DEVIATION MEDIAN AND QUARTILES STANDARD DEVIATION

Ch Inference for Linear Regression

Chapter 24. Comparing Means. Copyright 2010 Pearson Education, Inc.

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

Probability and Statistics Notes

χ L = χ R =

Business Statistics. Lecture 10: Course Review

Correlation and Regression Analysis. Linear Regression and Correlation. Correlation and Linear Regression. Three Questions.

Correlation Analysis

Econometrics. 4) Statistical inference

STATISTICS 141 Final Review

Stat 5102 Final Exam May 14, 2015

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Chapter 12 - Lecture 2 Inferences about regression coefficient

Six Sigma Black Belt Study Guides

Lecture 30. DATA 8 Summer Regression Inference

Tables Table A Table B Table C Table D Table E 675

Midterm 2 - Solutions

appstats27.notebook April 06, 2017

Lecture Slides. Elementary Statistics. by Mario F. Triola. and the Triola Statistics Series

STA Module 11 Inferences for Two Population Means

STA Rev. F Learning Objectives. Two Population Means. Module 11 Inferences for Two Population Means

Chapter 9 Inferences from Two Samples

Introduction to Statistical Inference Lecture 8: Linear regression, Tests and confidence intervals

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

STAT420 Midterm Exam. University of Illinois Urbana-Champaign October 19 (Friday), :00 4:15p. SOLUTIONS (Yellow)

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

Chapter 27 Summary Inferences for Regression

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Inferences for Correlation

Inference with Simple Regression

Lectures 5 & 6: Hypothesis Testing

Simple Linear Regression for the Climate Data

Difference between means - t-test /25

Ecn Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman. Midterm 2. Name: ID Number: Section:

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Chapter 16. Simple Linear Regression and Correlation

STA Module 10 Comparing Two Proportions

Section 9.4. Notation. Requirements. Definition. Inferences About Two Means (Matched Pairs) Examples

Stat 529 (Winter 2011) Experimental Design for the Two-Sample Problem. Motivation: Designing a new silver coins experiment

Scatter plot of data from the study. Linear Regression

1. Define the following terms (1 point each): alternative hypothesis

Remedial Measures, Brown-Forsythe test, F test

Nature vs. nurture? Lecture 18 - Regression: Inference, Outliers, and Intervals. Regression Output. Conditions for inference.

Analysis of Variance

Simple Linear Regression

Chapter 8 Handout: Interval Estimates and Hypothesis Testing

y ˆ i = ˆ " T u i ( i th fitted value or i th fit)

Chapter 7 Comparison of two independent samples

Regression Analysis and Forecasting Prof. Shalabh Department of Mathematics and Statistics Indian Institute of Technology-Kanpur

COSC 341 Human Computer Interaction. Dr. Bowen Hui University of British Columbia Okanagan

Lecture 17: Small-Sample Inferences for Normal Populations. Confidence intervals for µ when σ is unknown

Hypothesis Testing for Var-Cov Components

PLSC PRACTICE TEST ONE

Classroom Activity 7 Math 113 Name : 10 pts Intro to Applied Stats

Chapter 24. Comparing Means

Scatter plot of data from the study. Linear Regression

Statistics for Managers using Microsoft Excel 6 th Edition

determine whether or not this relationship is.

Correlation and Regression

9-6. Testing the difference between proportions /20

1 Independent Practice: Hypothesis tests for one parameter:

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal

Mathematical Notation Math Introduction to Applied Statistics

Lecture 4: Statistical Hypothesis Testing

Chapter 23: Inferences About Means

Basic Business Statistics 6 th Edition

Transcription:

Stats Review Chapter 14 Revised 8/16

Note: This review is meant to highlight basic concepts from the course. It does not cover all concepts presented by your instructor. Refer back to your notes, unit objectives, handouts, etc. to further prepare for your exam. The questions are displayed on one slide followed by the answers are displayed in red on the next. This review is available in alternate formats upon request.

Standard Error of Estimate Below is a table of the population of St. Cloud from 1970-2010. Find the standard error of estimate of the following data where the years represent the years after 1970, that is 1970=0. Year 0 10 20 30 40 Population 39691 42566 48812 59107 65842

Standard Error of Estimate Below is a table of the population of St. Cloud from 1970-2010. Find the standard error of estimate of the following data where the years represent the years after 1970, that is 1970=0. Year 0 10 20 30 40 Population 39691 42566 48812 59107 65842 We need to use the formula s e = (y i y i ) 2 n 2 = residuals 2 Step 1: Find the least-squares regression line (see chapter 4 for help). For this data the least-squares regression line is y = 37435 + 688.4x Step 2: Obtain the predicted value for each year ( y) using the least squares- regression line Step 3i: Calculate the residuals (y i y i ) Step 3ii: Calculate the residuals squared, (y i y i ) 2 Year Population (y i ) Step 2: y Step 3i: (y i y i ) Step 3ii: (y i y i ) 2 0 39691 37435 + 688.4 0 = 37435 (39691-37435)=2256 (2256) 2 =5089536 10 42566 44319-1753 3073009 20 48812 51203-2391 5716881 30 59107 58087 1020 1040400 40 65842 64971 871 758641 n 2 Step 4: Add up the residuals squares, residuals 2 = 15678467 Step 5: Put it into the formula s e = (y i y i ) 2 n 2 = 15678467 =2286.08 5 2

Sample Standard Deviation of β 1, s β1 Find the sample standard deviation of β 1 from the previous problem s data.

Sample Standard Deviation of β 1, s β1 Find the sample standard deviation of β 1 from the previous problem s data. Option 1: Option 2: Step 1: Find the sample standard deviation of the x values (years), s x. Using technology (see chapter 3 on doing it by hand) we get s x =15.811 Step 2: Find s β1 using the formula s β1 = s e was found in the last problem s β1 = s e = 2286.08 n 1 s x 5 1 15.811 = 72. 29 s e n 1 s x Step 1: fill out the table below x (mean)=20 Step 2: Find the sum of the final column and take the square root. 400+100+0+100+400=1000, 1000 = 31.623. This means (x i x) 2 =31.623 Step 3: Put into the formula s β1 = s e was found in the last problem s β1 = s e = 2286.08 (x i x) 2 31.623 = 72. 79 s e (x i x) 2 Year, x i x (x i x i ) (x i x i ) 2 0 20 0-20=-20 ( 20) 2 =400 10 20-10 100 20 20 0 0 30 20 10 100 40 20 20 400

Testing the Significance of the Least-Squares Regression Model Below are is a table of the population of St. Cloud from 1970-2010. Find the standard error of estimate of the following data where the years represent the years after 1970, that is 1970=0. Assuming the residuals are normally distributed and the residuals are normally distributed with constant error variance, test whether a linear relationship exists between year and population with α=.05 significance level. Also find the confidence interval. Year 0 10 20 30 40 Population 39691 42566 48812 59107 65842

Testing the Significance of the Least-Squares Regression Model Below are is a table of the population of St. Cloud from 1970-2010. Find the standard error of estimate of the following data where the years represent the years after 1970, that is 1970=0. Assuming the residuals are normally distributed and the residuals are normally distributed with constant error variance, test whether a linear relationship exists between year and population with α=.05 significance level. Also find the confidence interval. Year 0 10 20 30 40 Population 39691 42566 48812 59107 65842 Step 1: Determine the null and alternative hypotheses. H 0 : β 1 = 0 (no linear relationship) H 1 : β 1 0 (linear relationship) It is a two-tailed test because we are seeing if there is a difference. Step 2: Select the level of significance. α=.05 Step 3: Compute the test statistic t 0 = b 1 s b 1 From the least-squares regression line we have b 1 =688.43 (the slope of the regression line) and from slide 21, s β1 = 55.998. t 0 = b 1 s b 1 = 688.43 72.29 = 9.52 with n-2=5-2=3 degrees of freedom Step 4: Using the p-value approach, reject H 0 if the p-value<α Using technology, we have a p-value of 0.0025..0025<.05, so reject H 0 Therefore there is a linear relationship between year and population exists.

Confidence Intervals for the Slope of the Regression Line Using the same data as before, find the 95% confidence interval.

Confidence Intervals for the Slope of the Regression Line Using the same data as before, find the 95% confidence interval. Step 1: Find b 1 From before, y = 37435 + 688.4x Step 2: Verify the conditions (see page 687) Conditions are met Step 3: Determine the critical value, t α/2 See chapter 9 t.05/2 = t.025 =3.18 with n-2=5-2=3 degrees of freedom Step 4: Compute the confidence interval using b 1 ± t α/2 688.43 ± 3.18 2286.08 31.623 688.43 ± 229.888 (458.36,918.5) s e (x i x) 2

Confidence Intervals for a Mean Response Researchers examined the mean surface temperature of a body of salt water and compared it to the mean coral growth (in mm/year). The data is listed below. Construct a 99% confidence interval for the predicted mean of coral reef growth whose temperature is 29.9. The least squares regression line is y= 11.7159-0.3036x. Surface Temp 29.7 29.9 30.2 30.2 30.5 30.7 30.9 Growth 2.63 2.58 2.68 2.6 2.48 2.38 2.26

Confidence Intervals for a Mean Response Researchers examined the mean surface temperature of a body of salt water and compared it to the mean coral growth (in mm/year). The data is listed below. Construct a 99% confidence interval for the predicted mean of coral reef growth whose temperature is 29.9. The least squares regression line is y= 11.7159-0.3036x. Surface Temp 29.7 29.9 30.2 30.2 30.5 30.7 30.9 Growth 2.63 2.58 2.68 2.6 2.48 2.38 2.26 Step 1: Find s e and (x i x) 2 using technology or step shown in the previous slides. Also find y for the given value. s e =0.0836 (x i x) 2 =1.1 (using 30.3 as x) y =11.7159-.3036(29.9)=2.638 Step 2: Determine the critical value, t α/2 See chapter 9 t.01/2 = t.005 =4.032 with n-2=7-2=5 degrees of freedom Step 3: Put into formula y ± tα 2 s e 1 + (x x) 2 n (x i 2.638 ± 4.032.083 1 7 x) 2 where x is the given value, 29.9 2.638 ±.1797 (2.4583,2.8177) + (29.9 30.3)2 1.1 With Technology the answer is (2.456139, 2.8181986)

Prediction Interval for an Individual Response about y Using the coral growth data from the previous slides, construct a 99% prediction interval for the predicted growth at a temperature of 29.9.

Prediction Interval for an Individual Response about y Using the coral growth data from the previous slides, construct a 99% prediction interval for the predicted growth at a temperature of 29.9. From the previous problem we know s e =0.0836 (x i x) 2 =1.1 y =11.7159-.3036(29.9)=2.638 t α/2 = t.01/2 = t.005 =4.032 with n-2=7-2=5 degrees of freedom Putting this into the formula y ± tα 2 s e 2.638 ± 4.032.083 1 + 1 7 + (29.9 30.3)2 1.1 2.638 ±.3798 (2.2582,3.0178) 1 + 1 + (x x) 2 n (x i x) 2 With Technology the answer is (2.2544944, 3.0198433)

Confidence and Prediction Intervals What is the difference between the prediction made in slide 12 and slide 14?

Confidence and Prediction Intervals What is the difference between the prediction made in slide 12 and slide 14? The confidence interval on slide 27 is an estimate on the mean coral growth for all bodies of salt water that have a temperature of 29.9. The prediction interval made on slide 29 is an estimate of the coral growth of one body of water whose temperature is 29.9. Remember confidence interval is for all while a prediction interval is for an individual.