Chapter 9. Correlation and Regression

Size: px
Start display at page:

Download "Chapter 9. Correlation and Regression"

Transcription

1 Chapter 9 Correlation and Regression

2 Lesson 9-1/9-2, Part 1 Correlation

3 Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths Year Boats in Ten Thousands Manatee Deaths

4 9-1 Overview This chapter introduces methods for making inferences based on sample data that come in pairs or bivariate data. Is there a relationship between the data? If so, what is the equation? Use that equation for prediction.

5 Correlation A correlation exists between two variables when one of them is related to the other in some way.

6 Scatter Diagram A scatterplot (or scatter diagram) is a graph in which the paired (x, y) sample data are plotted with a horizontal x-axis and a vertical y-axis. Each individual (x, y) pair is plotted as a single point.

7 Scatter Diagram Table 9-1: Registered Florida Craft (in tens of thousands) and Watercraft-Related Manatee Deaths Year x: Boats y: Manatee

8 Scatter Plots TI83

9 Manatee Deaths from Boats Scatter Plot Registered Boats (tens of thousands)

10 Dangers of Scatter Diagrams The horizontal or vertical scale of a scatter diagram can be set so that the scatter diagram misleads a reader.

11 Dangers of Scatter Diagrams Because of this danger you must use numerical summaries of paired data in conjunction with graphs in order to determine the type of relationship.

12 Positive Linear Correlations

13 Negative Linear Correlations

14 No Linear Correlations

15 Linear Correlation Coefficient The linear correlation coefficient (r ) measures the direction and strength of the linear relationship between paired x and y values in a sample.

16 Assumptions The sample of paired data (x, y) is a random sample. The pairs of (x, y) data have a bivariate normal distribution.

17 Notation for the Linear Correlation Coefficient n number of pairs of data present x the sum of all x-values 2 x x 2 the x-value should be squared and then those squares added. the sum of all x-values and the total then squared xy the x-value should be multiplied by its corresponding y-value r linear correlation coefficient for a sample ρ linear correlation coefficient for a population

18 r Formula for the Linear Correlation Coefficient n xy x y n x x n y y

19 Justification for r formula r x x y y n s s 1 x y x, y centriod Figure 9-7

20 Interpreting the Linear Correlation Coefficient 1) Linear correlation coefficient r is always between 1 and 1. 2) The closer r is to 1, the stronger the evidence of positive association between the two variables. 3) The closer r is to a 1, the stronger the evidence of negative relationship between two variables. 4) If r is closer to 0, does not rule out any strong relationship between x and y, there could still be a strong relationship but one that is not linear. 5) The units of x and y plays no role in the interpretation of r. 6) Correlation is strongly effected by outlying observations.

21 Example Page 511, #6 Use a scatter plot and the linear correlation r to determine whether there is a correlation between the two variables. x y r n xy x y n x x n y y x y x y x 2 y

22 Example Page 511, #6 r n xy x y n x x n y y x y xy x 2 y x = 16 y = 41 xy = 185 x = 70 x = 495

23 Example Page 511, #6 r n xy x y n x x n y y r 5(185) x = 16 y = 41 xy = 185 x 2 = 70 y 2 = 495

24 Example Page 511, #6 x = 16 y = 41 xy = 185 x 2 = 70 y 2 = 495

25 Example Page 511, #6

26 Example Page 511, #6 Using Table A-6, n = 5 and assuming = 0.05 Critical Value = 0.878; therefore r = indicates significant (positive) linear correlation.

27 Lesson 9-1/9-2, Part 2 Correlation

28 Interpreting the Linear Correlation Coefficient The absolute value of the computed value of r exceeds the value in Table A-6, conclude that there is significant linear correlation. Otherwise there is not significant evidence to support the conclusion of significant linear correlation.

29 1 r 1 Value of r does not change if all values of either variable are converted to a different scale. The r is not affected by the choice of x and y. Properties of the Linear Correlation Coefficient Interchange x and y and the value r will not change. r measures the strength of a linear relationship.

30 Interpreting r: Explain Variation The value of r 2 is the proportion of the variation in y that is explained by the linear relationship between x and y. r 2 is the square of the linear correlation coefficient r.

31 Example: Boats and Manatees A. Find the value of the linear correlation coefficient r. B. What proportion of the variation of the manatee deaths can be explained the variation in the number of boat registrations? Table 9-1: Registered Florida Craft (in tens of thousands) and Watercraft-Related Manatee Deaths Year x: Boats y: Manatee

32 Example: Boats and Manatees with r = 0.922, we get r 2 = We can conclude that (or about 85%) of the variation in manatee deaths can be explained by the linear relationship between the number of boat registrations and the number of manatee deaths from boats. This implies that 15% of the variation of manatee deaths cannot be explained by the number of boat registrations.

33 Hypothesis Test Method 1 We wish to determine whether there is a significant linear correlation between two variables. Let H o : = 0 No significant linear correlation Let H 1 : 0 Significant linear correlation Let H 1 : > 0 Significant positive linear correlation Let H 1 : < 0 Significant negative linear correlation

34 Test Statistic t r 1 r 2 n 2 Critical Values: Use table A-3 with df = n - 2

35 Example: Boats and Manatees Test the claim that there is a linear correlation between the number of registered boats and the number of manatee deaths from boats. Table 9-1: Registered Florida Craft (in tens of thousands) and Watercraft-Related Manatee Deaths Year x: Boats y: Manatee H o : = 0 H 1 : 0

36 Example: Boats and Manatees t r 2 1 r n p-value = 1.51E 4

37 Example: Boats and Manatees t 6.71 p value df n 2 8 Critical Values

38 Example: Boats and Manatees There is sufficient evidence to reject H o, since p-value = < = 0.05 and conclude that there is significant correlation between the number of registered boats and the number of manatee deaths from boats.

39 Hypothesis Test Method 2 We wish to determine whether there is a significant linear correlation between two variables. Test statistic is r Critical values refer to Table A-6 with no degrees of freedom.

40 Example Page 511, #2 Using data collected from the FBI and the Bureau of Alcohol, Tobacco, and Firearms, the number of registered automatic Weapons and the murder rate (in murders per 100,000 people) was obtained for each of eight randomly selected states. Statdisk was used to find the value of the linear correlation Coefficient is r = a) Is there a significant linear correlation between the number of registered automatic weapons and the murder rate? n r Use table A-6 assume = 0.05

41 Example Page 511, #2 critical values = Since the absolute value of is greater than there is significant positive correlation

42 Example Page 511, #2 b) What proportion of the variation in the murder rate can be explained by the linear relationship between the murder rate and the number of registered automatic weapons? r r (0.885) The proportion of the variation in murder rate that can be explained in terms of variation in registered automatic weapons is 78.3%.

43 Example Page 512, #8 Listed below are the numbers of fires (in thousands) and the acres that were burned (in millions) in 11 western states in year of the last decade (based on the data from USA Today. Is there a correlation? Fires Acres Burned

44 Example Page 512, #8 LinRegTTest Ho : ρ 0 p value H : ρ 0 1 α 0.05 r 2 r % There is sufficient evidence to fail to reject H o since the p-value > α and are unable to conclude that there is significant linear correlation between the number of fires and the numbers of acres burned.

45 Example Page 512, #8 The data were listed under a headline of Loggers seize on fires to argue for more cutting. Do the data support the argument that as loggers remove more trees, the risk of fire decreases because the forests are less dense? The results do not support any conclusion about the removal of trees affecting the risk of fires, because none of the variables addresses the removal/density of trees.

46 Common Errors Involving Correlation Causation: It is wrong to conclude that correlation implies causality. Averages: Averages suppress individual variation and may inflate the correlation coefficient. Linearity: There may be some relationship between x and y even when there is no significant linear correlation.

47 Example Page 515, #26 Describe the error in the stated conclusion. Given: There is significant linear correlation between personal income and years of education. Conclusion: More education causes a person s income to rise. The presence of a significant linear correlation does not necessarily mean that one of the variables causes the other. Correlation does not imply causality.

48 Example Page 515, #28 Describe the error in the stated conclusion. Given: There is significant linear correlation between state average tax burdens and state average incomes. Conclusion: There is significant linear correlation between Individual tax burdens and individual incomes rise. Averages tend to suppress variation among individuals, so a correlation among averages does not necessary mean that there is correlation among individuals.

49 Lesson 9-3, Part 1 Regression

50 Objective In lesson 9-2 we use a scatter diagram and linear correlation coefficient to indicate that a linear relation exists between two variables. In lesson 9-3 we are going to find a linear equation that describes the relation between the two variables.

51 Regression Equation The regression equation expresses a relationship between x (called the independent variable, predictor variable or explanatory variable), and y (called the dependent variable or response variable) y mx b The typical equation of a straight line is expressed in form yˆ bo b1x, where b o is the y-intercept and b 1 is the slope.

52 Assumptions We are investigating only linear relationships For each x-value, y is a random variable having a normal (bell-shaped) distribution. All of these y distributions have the same variance. For a given value of x, the distribution of y-values has a mean that lies on the regression line. Results are not seriously affected if departures from normal distributions and equal variances are not too extreme.

53 Regression Line Regression Equation Given a collection of paired data, the regression equation Algebraically describes the relationship between two variables. Regression Line yˆ b b x o 1 The graph of the regression equation is called the regression (or line of best fit, or least square line).

54 Notation for Regression Equation y-intercept of Regression equation Slope of regression equation Equation of the regression line Population Parameter Sample Statistic TI o b o a 1 b 1 b y = o + 1 x yˆ b b x o 1 y = a + bx

55 Formulas for b o and b 1 Formula 9-2 (Slope) b n xy x y n x x Formula 9-3 (y-intercept) b y b x o 1

56 Example: Boats and Manatee Table 9-1: Registered Florida Craft (in tens of thousands) and Watercraft-Related Manatee Deaths Year x: Boats y: Manatee Find the regression equation Enter the values in L1 and L2 and use LinRegTTest

57 Example: Boats and Manatee Find the equation of the regression line. y a bx yˆ x

58 Example: Boats and Manatee Regression Line

59 Example Page 527, #6 Find the equation of the regression line. Use the given data. x y Enter the data into L1 and L2 and use LinRegTTest

60 Example Page 527, #6 yˆ x

61 Using the Regression Equation for Predictions In predicting a value of y based on some given value of x If there is not a significant linear correlation, the best predicted y-value is ( y ). If there is significant linear correlation, the best predicated y-value is found by substituting the x-value into the regression equation.

62 Guidelines for Using The Regression Equation 1. If there is no linear correlation, don t use the regression to make predictions. 2. When using the regression equation for predictions, stay within the scope of the available sample data. 3. A regression equation on old data is not necessary valid now. 4. Don t make predictions about a population that is different from the population from which the sample data were drawn.

63 Example page 527, #2 In each of the following cases, find the best predicted value of y given that x = The given statistics are summarized from paired sample data. a. r y 8.00 n 30, and the equation of the regression line is yˆ x Step 1 Check for linear correlation Since the absolute value of is less than There is no linear correlation. yˆ y

64 Example page 527, #2 b. r y 8.00 n 30, and the equation of the regression line is yˆ x Step 1 Check for linear correlation Since the absolute value of is greater than There is linear correlation yˆ x yˆ (2.00) yˆ

65 Example Page 528, #8 Find the best predicted value for the number of acres burned given that there was 80 fires. Fires Acres Burned Enter data into L1 and L2 and use LinRegTTest to check for linear correlation.

66 Lesson 9-3, Part 2 Regression

67 Example Page 528, #8 Regression Equation: yˆ x P-value > α there is no significant linear correlation. Use y for predication when x 80.

68 Example Page 528, #8 Go to 2 nd STAT yˆ y

69 Interpreting the Regression Equation Marginal change is the amount that a variable changes when the other variable changes by exactly one unit. Outlier is a point lying far away from the other data points. Influential point strongly effects the graph of the regression line.

70 Example Page 530, #26 Refer to the sample data listed in Table 9-1. If we include another pair of values consisting of x = 120 (for 1,200,000 boats) and y = 10 (manatee deaths from boats), is this new point an outlier? Is it an influential point? Table 9-1: Registered Florida Craft (in tens of thousands) and Watercraft-Related Manatee Deaths Year x: Boats y: Manatee

71 Example Page 530, #26 Yes; the point is an outlier, since it is far from the other data points 120 is far from other boat values, and 10 is far from the other manatee death values. Yes; the point is an influential one, since it will dramatically alter the regression line the original regression line predicts yˆ (120) 159.4, and so the new regression line will have to change considerably to come close to (120, 10).

72 Residuals and Least-Square Property Residual For a sample paired (x, y) data, the difference between the observed sample y-value and y-value that is predicted by using the regression equation. Least-Square Property A straight line satisfies this property if the sum of the squares of the line residuals is the smallest sum possible. y yˆ

73 Example: Residuals x y yˆ 54x x yˆ

74 The Method Least-Square The line that best describes the relation between two variables is the one that makes the residuals as small as possible. The least-square regression line minimizes the square of the vertical distance between observed values and predicated values. Minimize (residuals) 2

75 Lesson 9-4, Part 1 Variation and Prediction Intervals

76 Different types of Variation We consider different types of variation that can be used for two major applications: To determine the proportion of the variation in y that can be explained by the linear relationship between x and y. To construct interval estimates of predicted y- values such intervals are called prediction intervals.

77 Total and Explained Deviations Total Deviation from the mean of the particular point (x, y) is the vertical distance y y which is the distance between the point (x, y) and the horizontal line passing through the sample mean y. Explained Deviation is the vertical distance ŷ y which is the distance between the predicted y-value and the horizontal line passing through the sample mean y.

78 Total and Explained Deviations Unexplained Deviation is the vertical distance y yˆ, which is the vertical distance between the point (x, y) and the regression line. The distance y yˆ, is also called residual, as defined by Lesson 9-3.

79 Deviations

80 Total Deviation (total deviation) = (explained deviation) + (unexplained deviation) y y ŷ y y yˆ y y 2 ŷ y 2 y yˆ 2 Formula 9-4

81 Coefficient of Determination The coefficient of determination measures the percentage of total variation in the y-value that is explained by the regression line. r 2 ŷ y y y 2 2 explained variation total variation

82 Example Page 538, #2 Use the value of the linear correlation coefficient r to find the coefficient of determination and the percentage of the total variation that can be explained by the linear relationship between the two variables. r r ( 0.6) 0.36 The portion of the total variation explained by the regression line is 36%

83 Standard Error of Estimate The standard error of estimate is a measure of the differences (or distances) between the observed sample y-values and the predicted y-values ŷ that are obtained using the regression equation. s e y yˆ y b y b xy n2 n2 2 2 o 1

84 Example Page 539, #10 Fourteen different second-year medical students took blood pressure measurements of the same patient and the results are listed below. Systolic Diastolic Enter the data into L1 and L2 and use LinRegTtest

85 Example Page 539, #10 2 A. Find the explained variation ŷ y. Step 1 Find the regression equation yˆ x

86 Example Page 539, #10 Step 2 Use L3 to find ŷ

87 Example Page 539, #10 Step 3 Use L4 to find y Use 2 nd Stat

88 Example Page 539, #10 Step 4 Use L4 to find ŷ y 2

89 Example Page 539, #10 Step 5 Find ŷ y 2 Explained Variation ŷ y Use 2 nd Stat

90 Example Page 539, #10 2 B. Find the unexplained variation y yˆ. 2 Step 1 Use L5 to find ( y yˆ )

91 Example Page 539, #10 Step 3 Find y yˆ 2 Unexplained Variation 2 y yˆ Use 2 nd Stat

92 Example Page 539, #10 C. Find the total variation y y 2 TV EV UV

93 Example Page 539, #10 D. Find the coefficient of determination. 2 r

94 Example Page 539, #10 E. Find the standard error of estimate s e

95 Lesson 9-4, Part 2 Variation and Prediction Intervals

96 Prediction Intervals Prediction Intervals are intervals constructed about the predicted value of y that are used to measure the accuracy of a single predicted value.

97 Predication Interval for an Individual y yˆ E y yˆ E E 2 1 n( x x) o t s 1 α e 2 n n x x 2 2 x o = the given value of x t /2 = has n 2 degree of freedom in table A-3

98 Example Page 540, #14 Refer to problem #10 and assume necessary conditions of normality and variance are met. Systolic Diastolic A. Find the predicted diastolic reading given that the systolic reading is 120. Enter the data into L1 and L2 and use LinRegTtest

99 Example Page 540, #14 Step 1 Find the regression equation yˆ x (120)

100 Example Page 540, #14 B. Find a 95% predication interval estimate of the diastolic reading given that the systolic reading is 120. Step 1 Identify the calculations needed for the margin of error formula n 14 x x x n x x

101 Example Page 540, #14 n 14 x x x n x x s e t ,0.025 E 2 1 n( x x) o t s 1 α e 2 n n x x ( )

102 Example Page 540, #14 Step 2 Use ŷ and margin of error yˆ E y yˆ E y y When x = 120, (78) we are 95% certain that the corresponding y-value is between and 98.22

103 Interpreting a Computer Display Minitab The regression equation is Nicotine = Tar Predictor Coef SE Coef T P Constant Tar S = R-Sq = 92.4% R-Sq(adj) = 92.1% Predicted values for New Observations New Obs Fit SE Fit 95.0% CI 95.0% PI (1.2107, ) (1.0731, )

104 Example Page 539, #6 and 8 6. Identify Total Variation What percent of the total variation in nicotine can be explained by the linear relationship between tar and nicotine? 92.4% 8. Finding the Prediction Interval For given tar amount 17mg. Identify the 95% confidence interval estimate of the amount of nicotine, and write a statement interpreting that interval. 1.1 < y < 1.4; we have 95% confidence that the limits 1.1 and 1.4 contain the true amount of nicotine.

105 Lesson 9-6 Modeling

106 Mathematical Modeling A mathematical model is a mathematical function that fits or describes real-world data

107 TI Generic Models y a bx 2 y ax bx c y a bln x y ab x y b ax 1 y c ae bx

108 TI Key Strokes Use the Stat key

109 Development of a Good Mathematics Model Look for a Pattern in the Graph Examine the graph of the plotted points and compare the basic pattern to the known generic graphs. Find and Compare Values of R 2 Select functions that result in larger values of R 2, because such larger values correspond to functions that better fit the observed points. Think Use common sense. Don t use a model that lead to predicted values known to be totally unrealistic.

110 Example Page 554, #2 Construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and considered only linear, quadratic, logarithmic, exponential and power models. x y

111 Example Page 554, #2 GRAPH Zoom #9:Zoomstat

112 Example Page 554, #2 Step 3 Decide which mathematical model Linear: yˆ 5x2

113 Example Page 554, #4 Construct a scatterplot and identify the mathematical model that best fits the given data. Assume that the model is to be used only for the scope of the given data, and considered only linear, quadratic, logarithmic, exponential and power models. x y

114 Example Page 554, #4 GRAPH Zoom #9:Zoomstat

115 Example Page 554, #4 Step 3 Decide which mathematical model Power: yˆ 2x 1 2

116 Example Page 555, #6 Refer to the annual high values of the Dow-Jones Industrial Average that are listed in Data Set 25 in Appendix B. What is the best predicted value for the year 2001? Given that the Actual high value in 2001 was 11,350, how good was the predicted value? What does the pattern suggest about the stock market for investment purposes?

117 Example Page 555, #6

118 Example Page 555, #6 Decide which mathematical model Exponential: yˆ x

119 Example Page 555, #6 What is the best predicted value for the year 2001? Given that the actual high value in 2001 was 11,350, how good was the predicted value? yˆ x 22 ˆ ( ) 12,262 y It misses by a fairly large amount.

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1

Lecture Slides. Elementary Statistics Tenth Edition. by Mario F. Triola. and the Triola Statistics Series. Slide 1 Lecture Slides Elementary Statistics Tenth Edition and the Triola Statistics Series by Mario F. Triola Slide 1 Chapter 10 Correlation and Regression 10-1 Overview 10-2 Correlation 10-3 Regression 10-4

More information

Chapter 10 Correlation and Regression

Chapter 10 Correlation and Regression Chapter 10 Correlation and Regression 10-1 Review and Preview 10-2 Correlation 10-3 Regression 10-4 Variation and Prediction Intervals 10-5 Multiple Regression 10-6 Modeling Copyright 2010, 2007, 2004

More information

23. Inference for regression

23. Inference for regression 23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence

More information

9. Linear Regression and Correlation

9. Linear Regression and Correlation 9. Linear Regression and Correlation Data: y a quantitative response variable x a quantitative explanatory variable (Chap. 8: Recall that both variables were categorical) For example, y = annual income,

More information

Chapter 10. Correlation and Regression. Lecture 1 Sections:

Chapter 10. Correlation and Regression. Lecture 1 Sections: Chapter 10 Correlation and Regression Lecture 1 Sections: 10.1 10. You will now be introduced to important methods for making inferences based on sample data that come in pairs. In the previous chapter,

More information

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2

Review 6. n 1 = 85 n 2 = 75 x 1 = x 2 = s 1 = 38.7 s 2 = 39.2 Review 6 Use the traditional method to test the given hypothesis. Assume that the samples are independent and that they have been randomly selected ) A researcher finds that of,000 people who said that

More information

INFERENCE FOR REGRESSION

INFERENCE FOR REGRESSION CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on

More information

Chapter 2: Looking at Data Relationships (Part 3)

Chapter 2: Looking at Data Relationships (Part 3) Chapter 2: Looking at Data Relationships (Part 3) Dr. Nahid Sultana Chapter 2: Looking at Data Relationships 2.1: Scatterplots 2.2: Correlation 2.3: Least-Squares Regression 2.5: Data Analysis for Two-Way

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

y n 1 ( x i x )( y y i n 1 i y 2

y n 1 ( x i x )( y y i n 1 i y 2 STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered

More information

Analysis of Bivariate Data

Analysis of Bivariate Data Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr&reg 2 Independent

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of

More information

Review of Regression Basics

Review of Regression Basics Review of Regression Basics When describing a Bivariate Relationship: Make a Scatterplot Strength, Direction, Form Model: y-hat=a+bx Interpret slope in context Make Predictions Residual = Observed-Predicted

More information

Lecture 18: Simple Linear Regression

Lecture 18: Simple Linear Regression Lecture 18: Simple Linear Regression BIOS 553 Department of Biostatistics University of Michigan Fall 2004 The Correlation Coefficient: r The correlation coefficient (r) is a number that measures the strength

More information

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation

Chapter 12 Summarizing Bivariate Data Linear Regression and Correlation Chapter 1 Summarizing Bivariate Data Linear Regression and Correlation This chapter introduces an important method for making inferences about a linear correlation (or relationship) between two variables,

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

CHAPTER 5 LINEAR REGRESSION AND CORRELATION

CHAPTER 5 LINEAR REGRESSION AND CORRELATION CHAPTER 5 LINEAR REGRESSION AND CORRELATION Expected Outcomes Able to use simple and multiple linear regression analysis, and correlation. Able to conduct hypothesis testing for simple and multiple linear

More information

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation

Scatterplots. 3.1: Scatterplots & Correlation. Scatterplots. Explanatory & Response Variables. Section 3.1 Scatterplots and Correlation 3.1: Scatterplots & Correlation Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The values of one variable appear on the horizontal

More information

Conditions for Regression Inference:

Conditions for Regression Inference: AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a

More information

Inference with Simple Regression

Inference with Simple Regression 1 Introduction Inference with Simple Regression Alan B. Gelder 06E:071, The University of Iowa 1 Moving to infinite means: In this course we have seen one-mean problems, twomean problems, and problems

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Bivariate Data Summary

Bivariate Data Summary Bivariate Data Summary Bivariate data data that examines the relationship between two variables What individuals to the data describe? What are the variables and how are they measured Are the variables

More information

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation

AP Statistics Unit 6 Note Packet Linear Regression. Scatterplots and Correlation Scatterplots and Correlation Name Hr A scatterplot shows the relationship between two quantitative variables measured on the same individuals. variable (y) measures an outcome of a study variable (x) may

More information

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000

Lecture 14. Analysis of Variance * Correlation and Regression. The McGraw-Hill Companies, Inc., 2000 Lecture 14 Analysis of Variance * Correlation and Regression Outline Analysis of Variance (ANOVA) 11-1 Introduction 11-2 Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA)

Lecture 14. Outline. Outline. Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) Outline Lecture 14 Analysis of Variance * Correlation and Regression Analysis of Variance (ANOVA) 11-1 Introduction 11- Scatter Plots 11-3 Correlation 11-4 Regression Outline 11-5 Coefficient of Determination

More information

Looking at data: relationships

Looking at data: relationships Looking at data: relationships Least-squares regression IPS chapter 2.3 2006 W. H. Freeman and Company Objectives (IPS chapter 2.3) Least-squares regression p p The regression line Making predictions:

More information

Warm-up Using the given data Create a scatterplot Find the regression line

Warm-up Using the given data Create a scatterplot Find the regression line Time at the lunch table Caloric intake 21.4 472 30.8 498 37.7 335 32.8 423 39.5 437 22.8 508 34.1 431 33.9 479 43.8 454 42.4 450 43.1 410 29.2 504 31.3 437 28.6 489 32.9 436 30.6 480 35.1 439 33.0 444

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Regression. Marc H. Mehlman University of New Haven

Regression. Marc H. Mehlman University of New Haven Regression Marc H. Mehlman marcmehlman@yahoo.com University of New Haven the statistician knows that in nature there never was a normal distribution, there never was a straight line, yet with normal and

More information

UNIT 12 ~ More About Regression

UNIT 12 ~ More About Regression ***SECTION 15.1*** The Regression Model When a scatterplot shows a relationship between a variable x and a y, we can use the fitted to the data to predict y for a given value of x. Now we want to do tests

More information

Multiple Regression Methods

Multiple Regression Methods Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret

More information

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters

Objectives Simple linear regression. Statistical model for linear regression. Estimating the regression parameters Objectives 10.1 Simple linear regression Statistical model for linear regression Estimating the regression parameters Confidence interval for regression parameters Significance test for the slope Confidence

More information

9 Correlation and Regression

9 Correlation and Regression 9 Correlation and Regression SW, Chapter 12. Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then retakes the

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1

Answer Key. 9.1 Scatter Plots and Linear Correlation. Chapter 9 Regression and Correlation. CK-12 Advanced Probability and Statistics Concepts 1 9.1 Scatter Plots and Linear Correlation Answers 1. A high school psychologist wants to conduct a survey to answer the question: Is there a relationship between a student s athletic ability and his/her

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information

Relationships Regression

Relationships Regression Relationships Regression BPS chapter 5 2006 W.H. Freeman and Company Objectives (BPS chapter 5) Regression Regression lines The least-squares regression line Using technology Facts about least-squares

More information

Correlation and Regression

Correlation and Regression A. The Basics of Correlation Analysis 1. SCATTER DIAGRAM A key tool in correlation analysis is the scatter diagram, which is a tool for analyzing potential relationships between two variables. One variable

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

Chapter 6: Exploring Data: Relationships Lesson Plan

Chapter 6: Exploring Data: Relationships Lesson Plan Chapter 6: Exploring Data: Relationships Lesson Plan For All Practical Purposes Displaying Relationships: Scatterplots Mathematical Literacy in Today s World, 9th ed. Making Predictions: Regression Line

More information

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output

y = a + bx 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation Review: Interpreting Computer Regression Output 12.1: Inference for Linear Regression Review: General Form of Linear Regression Equation y = a + bx y = dependent variable a = intercept b = slope x = independent variable Section 12.1 Inference for Linear

More information

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered)

Test 3 Practice Test A. NOTE: Ignore Q10 (not covered) Test 3 Practice Test A NOTE: Ignore Q10 (not covered) MA 180/418 Midterm Test 3, Version A Fall 2010 Student Name (PRINT):............................................. Student Signature:...................................................

More information

STAT Chapter 11: Regression

STAT Chapter 11: Regression STAT 515 -- Chapter 11: Regression Mostly we have studied the behavior of a single random variable. Often, however, we gather data on two random variables. We wish to determine: Is there a relationship

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice

AP Statistics Bivariate Data Analysis Test Review. Multiple-Choice Name Period AP Statistics Bivariate Data Analysis Test Review Multiple-Choice 1. The correlation coefficient measures: (a) Whether there is a relationship between two variables (b) The strength of the

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Prob/Stats Questions? /32

Prob/Stats Questions? /32 Prob/Stats 10.4 Questions? 1 /32 Prob/Stats 10.4 Homework Apply p551 Ex 10-4 p 551 7, 8, 9, 10, 12, 13, 28 2 /32 Prob/Stats 10.4 Objective Compute the equation of the least squares 3 /32 Regression A scatter

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

1) A residual plot: A)

1) A residual plot: A) 1) A residual plot: A) B) C) D) E) displays residuals of the response variable versus the independent variable. displays residuals of the independent variable versus the response variable. displays residuals

More information

Chapter 7 Linear Regression

Chapter 7 Linear Regression Chapter 7 Linear Regression 1 7.1 Least Squares: The Line of Best Fit 2 The Linear Model Fat and Protein at Burger King The correlation is 0.76. This indicates a strong linear fit, but what line? The line

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression OI CHAPTER 7 Important Concepts Correlation (r or R) and Coefficient of determination (R 2 ) Interpreting y-intercept and slope coefficients Inference (hypothesis testing and confidence

More information

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation

Chapter 14. Statistical versus Deterministic Relationships. Distance versus Speed. Describing Relationships: Scatterplots and Correlation Chapter 14 Describing Relationships: Scatterplots and Correlation Chapter 14 1 Statistical versus Deterministic Relationships Distance versus Speed (when travel time is constant). Income (in millions of

More information

BIVARIATE DATA data for two variables

BIVARIATE DATA data for two variables (Chapter 3) BIVARIATE DATA data for two variables INVESTIGATING RELATIONSHIPS We have compared the distributions of the same variable for several groups, using double boxplots and back-to-back stemplots.

More information

Chapter 5 Friday, May 21st

Chapter 5 Friday, May 21st Chapter 5 Friday, May 21 st Overview In this Chapter we will see three different methods we can use to describe a relationship between two quantitative variables. These methods are: Scatterplot Correlation

More information

appstats8.notebook October 11, 2016

appstats8.notebook October 11, 2016 Chapter 8 Linear Regression Objective: Students will construct and analyze a linear model for a given set of data. Fat Versus Protein: An Example pg 168 The following is a scatterplot of total fat versus

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Midterm 2 - Solutions

Midterm 2 - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis February 23, 2010 Instructor: John Parman Midterm 2 - Solutions You have until 10:20am to complete this exam. Please remember to put

More information

Chapters 9 and 10. Review for Exam. Chapter 9. Correlation and Regression. Overview. Paired Data

Chapters 9 and 10. Review for Exam. Chapter 9. Correlation and Regression. Overview. Paired Data Chapters 9 and 10 Review for Exam 1 Chapter 9 Correlation and Regression 2 Overview Paired Data is there a relationship if so, what is the equation use the equation for prediction 3 Definition Correlation

More information

Important note: Transcripts are not substitutes for textbook assignments. 1

Important note: Transcripts are not substitutes for textbook assignments. 1 In this lesson we will cover correlation and regression, two really common statistical analyses for quantitative (or continuous) data. Specially we will review how to organize the data, the importance

More information

IES 612/STA 4-573/STA Winter 2008 Week 1--IES 612-STA STA doc

IES 612/STA 4-573/STA Winter 2008 Week 1--IES 612-STA STA doc IES 612/STA 4-573/STA 4-576 Winter 2008 Week 1--IES 612-STA 4-573-STA 4-576.doc Review Notes: [OL] = Ott & Longnecker Statistical Methods and Data Analysis, 5 th edition. [Handouts based on notes prepared

More information

Examining Relationships. Chapter 3

Examining Relationships. Chapter 3 Examining Relationships Chapter 3 Scatterplots A scatterplot shows the relationship between two quantitative variables measured on the same individuals. The explanatory variable, if there is one, is graphed

More information

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania

Chapter 10. Regression. Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Chapter 10 Regression Understandable Statistics Ninth Edition By Brase and Brase Prepared by Yixun Shi Bloomsburg University of Pennsylvania Scatter Diagrams A graph in which pairs of points, (x, y), are

More information

Confidence Interval for the mean response

Confidence Interval for the mean response Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure.

Any of 27 linear and nonlinear models may be fit. The output parallels that of the Simple Regression procedure. STATGRAPHICS Rev. 9/13/213 Calibration Models Summary... 1 Data Input... 3 Analysis Summary... 5 Analysis Options... 7 Plot of Fitted Model... 9 Predicted Values... 1 Confidence Intervals... 11 Observed

More information

Chapter 27 Summary Inferences for Regression

Chapter 27 Summary Inferences for Regression Chapter 7 Summary Inferences for Regression What have we learned? We have now applied inference to regression models. Like in all inference situations, there are conditions that we must check. We can test

More information

Business Statistics. Lecture 10: Course Review

Business Statistics. Lecture 10: Course Review Business Statistics Lecture 10: Course Review 1 Descriptive Statistics for Continuous Data Numerical Summaries Location: mean, median Spread or variability: variance, standard deviation, range, percentiles,

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

Chapter 12 : Linear Correlation and Linear Regression

Chapter 12 : Linear Correlation and Linear Regression Chapter 1 : Linear Correlation and Linear Regression Determining whether a linear relationship exists between two quantitative variables, and modeling the relationship with a line, if the linear relationship

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

MBA Statistics COURSE #4

MBA Statistics COURSE #4 MBA Statistics 51-651-00 COURSE #4 Simple and multiple linear regression What should be the sales of ice cream? Example: Before beginning building a movie theater, one must estimate the daily number of

More information

2. Outliers and inference for regression

2. Outliers and inference for regression Unit6: Introductiontolinearregression 2. Outliers and inference for regression Sta 101 - Spring 2016 Duke University, Department of Statistical Science Dr. Çetinkaya-Rundel Slides posted at http://bit.ly/sta101_s16

More information

5.1 Bivariate Relationships

5.1 Bivariate Relationships Chapter 5 Summarizing Bivariate Data Source: TPS 5.1 Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables

More information

Math 52 Linear Regression Instructions TI-83

Math 52 Linear Regression Instructions TI-83 Math 5 Linear Regression Instructions TI-83 Use the following data to study the relationship between average hours spent per week studying and overall QPA. The idea behind linear regression is to determine

More information

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3.

Overview. 4.1 Tables and Graphs for the Relationship Between Two Variables. 4.2 Introduction to Correlation. 4.3 Introduction to Regression 3. 3.1-1 Overview 4.1 Tables and Graphs for the Relationship Between Two Variables 4.2 Introduction to Correlation 4.3 Introduction to Regression 3.1-2 4.1 Tables and Graphs for the Relationship Between Two

More information

Chapter 12: Multiple Regression

Chapter 12: Multiple Regression Chapter 12: Multiple Regression 12.1 a. A scatterplot of the data is given here: Plot of Drug Potency versus Dose Level Potency 0 5 10 15 20 25 30 0 5 10 15 20 25 30 35 Dose Level b. ŷ = 8.667 + 0.575x

More information

AMS 7 Correlation and Regression Lecture 8

AMS 7 Correlation and Regression Lecture 8 AMS 7 Correlation and Regression Lecture 8 Department of Applied Mathematics and Statistics, University of California, Santa Cruz Suumer 2014 1 / 18 Correlation pairs of continuous observations. Correlation

More information

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation?

Linear Regression. Linear Regression. Linear Regression. Did You Mean Association Or Correlation? Did You Mean Association Or Correlation? AP Statistics Chapter 8 Be careful not to use the word correlation when you really mean association. Often times people will incorrectly use the word correlation

More information

Describing Bivariate Relationships

Describing Bivariate Relationships Describing Bivariate Relationships Bivariate Relationships What is Bivariate data? When exploring/describing a bivariate (x,y) relationship: Determine the Explanatory and Response variables Plot the data

More information

11 Correlation and Regression

11 Correlation and Regression Chapter 11 Correlation and Regression August 21, 2017 1 11 Correlation and Regression When comparing two variables, sometimes one variable (the explanatory variable) can be used to help predict the value

More information

Ch Inference for Linear Regression

Ch Inference for Linear Regression Ch. 12-1 Inference for Linear Regression ACT = 6.71 + 5.17(GPA) For every increase of 1 in GPA, we predict the ACT score to increase by 5.17. population regression line β (true slope) μ y = α + βx mean

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. 12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed

More information

Inferences for linear regression (sections 12.1, 12.2)

Inferences for linear regression (sections 12.1, 12.2) Inferences for linear regression (sections 12.1, 12.2) Regression case history: do bigger national parks help prevent extinction? ex. area of natural reserves and extinction: 6 national parks in Tanzania

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

The Simple Linear Regression Model

The Simple Linear Regression Model The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate

More information

Chapter 14. Multiple Regression Models. Multiple Regression Models. Multiple Regression Models

Chapter 14. Multiple Regression Models. Multiple Regression Models. Multiple Regression Models Chapter 14 Multiple Regression Models 1 Multiple Regression Models A general additive multiple regression model, which relates a dependent variable y to k predictor variables,,, is given by the model equation

More information

Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model

Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model Start with review, some new definitions, and pictures on the white board. Assumptions in the Normal Linear Regression Model A1: There is a linear relationship between X and Y. A2: The error terms (and

More information

Sampling Distributions in Regression. Mini-Review: Inference for a Mean. For data (x 1, y 1 ),, (x n, y n ) generated with the SRM,

Sampling Distributions in Regression. Mini-Review: Inference for a Mean. For data (x 1, y 1 ),, (x n, y n ) generated with the SRM, Department of Statistics The Wharton School University of Pennsylvania Statistics 61 Fall 3 Module 3 Inference about the SRM Mini-Review: Inference for a Mean An ideal setup for inference about a mean

More information

Chapter 8. Linear Regression /71

Chapter 8. Linear Regression /71 Chapter 8 Linear Regression 1 /71 Homework p192 1, 2, 3, 5, 7, 13, 15, 21, 27, 28, 29, 32, 35, 37 2 /71 3 /71 Objectives Determine Least Squares Regression Line (LSRL) describing the association of two

More information

Simple Linear Regression Using Ordinary Least Squares

Simple Linear Regression Using Ordinary Least Squares Simple Linear Regression Using Ordinary Least Squares Purpose: To approximate a linear relationship with a line. Reason: We want to be able to predict Y using X. Definition: The Least Squares Regression

More information