Chapter 4: Regression Models

Size: px
Start display at page:

Download "Chapter 4: Regression Models"

Transcription

1 Sales volume of company 1 Textbook: pp Chapter 4: Regression Models Money spent on advertising

2 2 Learning Objectives After completing this chapter, students will be able to: Identify variables, visualise them in a scatter diagram, and use them in a regression model. Develop simple linear regression equations from sample data and interpret the slope and intercept. Calculate the coefficient of determination and the coefficient of correlation and interpret their meanings. List the assumptions used in regression and use residual plots to identify problems. Interpret the F test in a linear regression model.

3 3 Learning Objectives After completing this chapter, students will be able to: Use computer software for regression analysis. Develop a multiple regression model and use it for prediction purposes. Use dummy variables to model categorical data. Determine which variables should be included in a multiple regression model. Transform a nonlinear function into a linear one for use in regression. Understand and avoid mistakes commonly made in the use of regression analysis.

4 4 Introduction (1 of 2) Regression analysis very valuable tool for a manager o o Understand the relationship between variables Predict the value of one variable based on another variable Simple linear regression models have only two variables Multiple regression models have more than one independent variable

5 5 Introduction (2 of 2) Variable to be predicted is called the dependent variable or response variable o Value depends on the value of the independent variable(s) [Explanatory or predictor variable]

6 6 Scatter Diagram / Scatter Plot Scatter diagram or scatter plot often used to investigate the relationship between variables o Independent variable normally plotted on X axis o Dependent variable normally plotted on Y axis

7 7 Triple A Construction (1 of 6) Triple A Construction renovates old homes The dollar volume of renovation work is dependent on the area payroll Triple A Construction Company Sales and Local Payroll Economists have predicted the local area payroll to be $600 million next year, and Triple A wants to plan accordingly!

8 8 Triple A Construction (2 of 6) Scatter Diagram of Triple A Construction Company Data:

9 9 Simple Linear Regression (1 of 2) In any regression model, there is an implicit assumption (which can be tested) that a relationship exists between the variables! There is some random error that cannot be predicted! Where Y X = dependent variable (response) = independent variable (predictor or explanatory) β 0 = intercept (value of Y when X = 0) β 1 ε Y = slope of the regression line = e = random error X 0 1

10 10 Simple Linear Regression (2 of 2) True values for the slope and intercept are not known so they are estimated using sample data Yˆ b b X 0 1 Where Ŷ b 0 b 1 = predicted value of Y = estimate of β 0, based on sample results = estimate of β 1, based on sample results

11 11 Triple A Construction (3 of 6) Predict sales based on area payroll Y = Sales X = Area payroll The line in our Scatter Diagram minimises the errors Error = (Actual value) (Predicted value) e Y Yˆ Regression analysis minimises the sum of squared errors Least-squares regression

12 12 Triple A Construction (4 of 6) Formulas for simple linear regression, intercept and slope:

13 13 Triple A Construction (5 of 6) Regression Calculations for Triple A Construction:

14 14 Triple A Construction (6 of 6) Y X 0 1 Regression calculations: Therefore sales = (payroll) If the payroll next year is $600 million Y = (6) = 9.5 or $950,000

15 15 Measuring the Fit of the Regression Model (1 of 4) Regression models can be developed for any variables X and Y How helpful is the model in predicting Y? With average error positive and negative errors cancel each other out Three measures of variability o SST Total variability about the mean o SSE Variability about the regression line o SSR Total variability that is explained by the model

16 16 Measuring the Fit of the Regression Model (2 of 4) Sum of squares total Sum of squares error Sum of squares regression An important relationship: SST ( Y Y) SSE e ( Y Yˆ ) SSR ( Yˆ Y) SST SSR + SSE 2

17 Sum of squares total (SST) = total variability in Y; The SSE is much lower than SST. Using the regression line has reduced the variability in the sum of squares by = (SSR). This indicates how much of the total variability in Y is explained by the regression model! 17 Measuring the Fit of the Regression Model (3 of 4) Sum of Squares for Triple A Construction: ^ ^ ^ ^ ^ SST ( Y Y) The mean is compared to each value 2 Prediction for each observation is compared to the actual value. SSE e ( Y Yˆ ) 2 2 Total variability in Y explained by the regression model! SSR ( Yˆ Y) 2

18 18 Measuring the Fit of the Regression Model (4 of 4) Deviations from the Regression Line and from the Mean:

19 19 The SSR is the explained variability in Y; SSE is the unexplained variability in Y. Coefficient of Determination r 2 The proportion of the variability in Y explained by the regression equation o The coefficient of determination is r 2. 2 SSR SSE r 1 SST SST o For Triple A Construction r About 69% of the variability in Y is explained by the equation based on payroll (X) If every point in the sample were on the regression line (meaning all errors are 0), then 100% of the variability in Y could be explained by the regression equation, so r 2 = 1 and SEE = 0. The lowest possible value r 2 is 0, indicating that X explains 0% of the variability in Y. r 2 can range from 0 to 1. In developing regression equations, a good model will have an r 2 value close to 1.

20 20 Correlation Coefficient r An expression of the strength of the linear relationship o Always between +1 and 1 o The correlation coefficient is r r r 2 o For Triple A Construction: r

21 Four Values of the Correlation Coefficient 21

22 22 Assumptions of the Regression Model With certain assumptions about the errors, statistical tests can be performed to determine the model s usefulness: 1. Errors are independent 2. Errors are normally distributed 3. Errors have a mean of zero 4. Errors have a constant variance A plot of the residuals (errors) often highlights glaring violations of assumptions

23 When the errors (residuals) are plotted against the independent variable, the pattern should appear random. 23 Residual Plots (1 of 3) Pattern of Errors Indicating Randomness: Errors seem random and no discernible pattern is present.

24 24 Residual Plots (2 of 3) Nonconstant Error Variance: Error pattern in which the errors increase as X increases! Violation of constant variance assumption!

25 Patterns in the plot of the errors indicate problems with the assumptions or the model specification! 25 Residual Plots (3 of 3) Pattern of Errors Indicating Relationship Is Not Linear: Errors consistently increase at first, and then consistently decrease not a linear model!

26 26 Estimating the Variance (1 of 2) Errors are assumed to have a constant variance (σ 2 ), usually unknown o Estimated using the mean squared error (MSE), s 2 2 SSE s MSE Sum of squares due to error divided by n k the degrees of freedom 1 where n = number of observations in the sample k = number of independent variables

27 27 Estimating the Variance (2 of 2) For Triple A Construction: 2 SSE s MSE n k o Estimate the standard deviation, s o The standard error of the estimate or the standard deviation of the regression s MSE

28 28 Testing the Model for Significance (1 of 4) When the sample size is too small, you can get good values for MSE (mean squared error) and r 2 even if there is no relationship between the variables o Testing the model for significance helps determine if the values are meaningful o Performing a statistical hypothesis test

29 29 Testing the Model for Significance (2 of 4) We start with the general linear model: Y X 0 1 If β 1 = 0, the null hypothesis is that there is no relationship between X and Y The alternate hypothesis is that there is a linear relationship (β 1 0) If the null hypothesis can be rejected, we have proven there is a relationship We use the F statistic for this test

30 30 MSE (Mean Squared Error) MSR (Mean Square Regression) Testing the Model for Significance (3 of 4) The F statistic is based on the MSE and MSR MSR SSR k where k = number of independent variables in the model The F statistic is F MSR MSE Describes an F distribution with: o degrees of freedom for the numerator = df 1 = k o degrees of freedom for the denominator = df 2 = n k 1

31 31 Testing the Model for Significance (4 of 4) If there is very little error, MSE would be small and the F statistic would be large model is useful If the F statistic is large, the significance level (p-value) will be low unlikely would have occurred by chance When the F value is large, we can reject the null hypothesis and accept that there is a linear relationship between X and Y and the values of the MSE and r 2 are meaningful

32 MSE (Mean Squared Error) MSR (Mean Square Regression) 32 Steps in a Hypothesis Test (1 of 2) 1. Specify null and alternative hypotheses H H : : Select the level of significance (α) Common values are 0.01 and Calculate the value of the test statistic F MSR MSE

33 33 Steps in a Hypothesis Test (2 of 2) 4. Make a decision using one of the following methods a) Reject the null hypothesis if the test statistic is greater than the F value from the table in Appendix D. Otherwise, do not reject the null hypothesis Reject if F F df df 1 2 k calculated n k 1, df, df b) Reject the null hypothesis if the observed significance level, or p- value, is less than the level of significance (α). Otherwise, do not reject the null hypothesis: 1 2 p-value P( F calculated test statistic) Reject if p-value

34 34 Triple A Construction (1 of 3) Step 1 Step 2 Step 3 H 0 : β 1 = 0 (no linear relationship between X and Y) H 1 : β 1 0 (linear relationship exists between X and Y) Select α = 0.05 Calculate the value of the test statistic SSR MSR k 1 MSR F 9.09 MSE

35 35 Triple A Construction (2 of 3) Step 4 Reject the null hypothesis if the test statistic is greater than the F value in Appendix D (page 576) df 1 = k = 1 df 2 = n k 1 = = 4 The value of F associated with a 5% level of significance and with degrees of freedom 1 and 4 is: F 0.05,1,4 = 7.71 F calculated = 9.09 Reject H 0 because 9.09 > 7.71

36 36 Triple A Construction (3 of 3) F-Distribution for Triple A Construction Test for Significance: We can conclude there is a statistically significant relationship between X and Y The r 2 value of 0.69 means about 69% of the variability in sales (Y) is explained by local payroll (X)

37 37 Analysis of Variance (ANOVA) Table With software models, an ANOVA table is typically created that shows the observed significance level (p-value) for the calculated F value o This can be compared to the level of significance (α) to make a decision Analysis of Variance Table for Regression:

38 Because this probability is less than 0.05, we reject the null hypothesis of no linear relationship and conclude there is a linear relationship between X and Y. 38 ANOVA for Triple A Construction Excel 2016 Output for Triple A Construction Example: P(F > ) =

39 49 Multiple Regression Analysis (1 of 2) Extensions to the simple linear model Models with more than one independent variable Y = β 0 + β 1 X 1 + β 2 X β k X k + ε Where Y = dependent variable (response variable) X i = i th independent variable (predictor or explanatory variable) β 0 = intercept (value of Y when all X i = 0) β i = coefficient of the i th independent variable k = number of independent variables ε = random error

40 50 Multiple Regression Analysis (2 of 2) To estimate these values, a sample is taken the following equation developed ˆ k k Y b b X b X b X where Y = predicted value of Y b 0 = sample intercept (an estimate of β 0 ) b i = sample coefficient of the ith variable (an estimate of β i )

41 51 Jenny Wilson Realty (1 of 9) Develop a model to determine the suggested listing price for houses based on the size and age of the house where Y b 0 X 1 and X 2 b 1 and b 2 Yˆ b b X b X = predicted value of dependent variable (selling price) = Y intercept = value of the two independent variables (square footage and age) respectively = slopes for X 1 and X 2 respectively o Selects a sample of houses that have sold recently and records the data

42 Jenny Wilson Real Estate Data 52

43 53 PROGRAMME 4.4A Jenny Wilson Realty (2 of 9) Input Screen for Jenny Wilson Realty Multiple Regression in Excel 2016:

44 54 PROGRAMME 4.4B Jenny Wilson Realty (3 of 9) Excel 2016 Output Screen for Jenny Wilson Realty Multiple Regression Example: Yˆ b b X b X , X X 1 2

45 55 Evaluating the Multiple Regression Model (1 of 2) Similar to simple linear regression models The p-value for the F test and r 2 interpreted the same The hypothesis is different because there is more than one independent variable The F test is investigating whether all the coefficients are equal to 0 at the same time

46 56 Evaluating the Multiple Regression Model (2 of 2) To determine which independent variables are significant, tests are performed for each variable H : H1: 1 0 The test statistic is calculated and if the p-value is lower than the level of significance (α), the null hypothesis is rejected

47 57 Jenny Wilson Realty (4 of 9) Full model is statistically significant Useful in predicting selling price p-value for F test = r 2 = Are both variables significant? For X 1 (square footage) For a = 0.05, p-value = rejected For X 1 (age) For a = 0.05, p-value = rejected H H : : null hypothesis is null hypothesis is

48 58 Jenny Wilson Realty (5 of 9) Both square footage and age are helpful in predicting the selling price!

49 59 Binary or Dummy Variables Binary (or dummy or indicator) variables are special variables created for qualitative data A dummy variable is assigned a value of 1 if a particular condition is met and a value of 0 otherwise The number of dummy variables must equal one less than the number of categories of the qualitative variable

50 60 Jenny Wilson Realty (6 of 9) A better model can be developed if information about the condition of the property is included X 3 = 1 if house is in excellent condition = 0 otherwise X 4 = 1 if house is in mint condition = 0 otherwise Two dummy variables are used to describe the three categories of condition No variable is needed for good condition since if both X 3 and X 4 = 0, the house must be in good condition

51 61 PROGRAMME 4.5A Jenny Wilson Realty (7 of 9) Input Screen for Jenny Wilson Realty Example:

52 62 PROGRAMME 4.5B Jenny Wilson Realty (8 of 9) Output Screen for Jenny Wilson Realty Example with Dummy Variables in Excel 2016: Y ˆ 121, X 3,962 X + 33,162 X + 47,369 X

53 63 PROGRAMME 4.5B Jenny Wilson Realty (9 of 9) Output Screen for Jenny Wilson Realty Example with Dummy Variables in Excel 2016: Coefficient of determination, r 2 = Y ˆ 121, X 3,962 X + 33,162 X + 47,369 X

54 64 Model Building (1 of 5) The best model is a statistically significant model with a high r 2 and few variables As more variables are added to the model, the r 2 value increases For this reason, the adjusted r 2 value is often used to determine the usefulness of an additional variable The adjusted r 2 takes into account the number of independent variables in the model

55 65 Model Building (2 of 5) The formula for r 2 2 SSR SSE r 1 SST SST The formula for adjusted r 2 2 SSE / ( n k 1) Adjusted r 1 SST / ( n 1) As the number of variables increases, the adjusted r 2 gets smaller unless the increase due to the new variable is large enough to offset the change in k

56 66 Model Building (3 of 5) In general, if a new variable increases the adjusted r 2, it should probably be included in the model!

57 67 Model Building (4 of 5) Stepwise regression systematically adds or deletes independent variables A forward stepwise procedure puts the most significant variable in first, adds the next variable that will improve the model the most Backward stepwise regression begins with all the independent variables and deletes the least helpful

58 68 Model Building (5 of 5) In some cases variables contain duplicate information When two independent variables are correlated, they are said to be collinear When more than two independent variables are correlated, multicollinearity exists When multicollinearity is present, hypothesis tests for the individual coefficients are not valid but the model may still be useful

59 69 Nonlinear Regression In some situations, variables are not linear Transformations may be used to turn a nonlinear model into a linear model

60 70 Colonel Motors (1 of 6) Use regression analysis to improve fuel efficiency o Study the impact of weight on miles per gallon (MPG) Automobile Weight Versus MPG:

61 71 Colonel Motors (2 of 6) Linear Model for MPG Data: Yˆ b b 0 1X1

62 72 PROGRAMME 4.6 Colonel Motors (3 of 6) Excel 2016 Output with Linear Regression Model for MPG Data: Useful model Small F test for significance Good r 2 value Y ˆ X or MPG (weight in 1,000 lb.) 1

63 73 Colonel Motors (4 of 6) Nonlinear Model for MPG Data:

64 74 Colonel Motors (5 of 6) The nonlinear model is a quadratic model The easiest approach develop a new variable New model X 2 (weight) 2 Yˆ b b X b X

65 75 PROGRAMME 4.7 Colonel Motors (6 of 6) Excel 2016 Output with Nonlinear Regression Model for MPG Data: Y ˆ X 3.4 X 1 2 Improved model Small F test for significance Adjusted r 2 and r 2 both increased

66 76 Cautions and Pitfalls (1 of 2) If the assumptions are not met, the statistical test may not be valid Correlation does not necessarily mean causation Multicollinearity makes interpreting coefficients problematic, but the model may still be good Using a regression model beyond the range of X is questionable, as the relationship may not hold outside the sample data

67 77 Cautions and Pitfalls (2 of 2) A t-test for the intercept (b 0 ) may be ignored as this point is often outside the range of the model A linear relationship may not be the best relationship, even if the F test returns an acceptable value A nonlinear relationship can exist even if a linear relationship does not Even though a relationship is statistically significant it may not have any practical value

68 78 Homework --- Chapter 4 Please read Chapter 5 (pp )!

69 Parameters and Statistics 79

70 80 Least Squares Method The objective of the scatter diagram is to measure the strength and direction of the linear relationship. Both can be more easily judged by drawing a straight line through the data. Which line best describes the relationship between X and Y?

71 81 Least Squares Method We need an objective method of producing a straight line. The best line will be one that is closest to the points on the scatterplot The best line is one that minimises the total distance between itself and all the observed data points. Since we oftentimes use regression to predict values of Y from observed values of X, we choose to measure the distance vertically.

72 82 Least Squares Method We want to find the line that minimises the vertical distance between itself and the observed points on the scatterplot. So here we have 2 different lines that may describe the relationship between X and Y. To determine which one is best, we can find the vertical distances from each point to the line... So based on this, the line on the right is better than the line on the left in describing the relationship between X and Y. ***infinite number of lines***

73 83 Least Squares Method Recall, the slope-intercept equation for a line is expressed in these terms: y = mx + b Where: m is the slope of the line b is the y-intercept. If we have determined there is a linear relationship between two variables with covariance and the coefficient of correlation, can we determine a linear function of the relationship?

74 84 Least Squares Method We typically rewrite this line as: yˆ b0 b1 x Read as y-hat! --- Fitted regression line! where the slope, and the intercept, b 1 s s xy 2 x b0 y b1 x Read as b naught

75 Interpretation of the b 0, b 1 85

76 86 Some of the errors will be positive and some will be negative! The problem is that when we add positive and negative values, they tend to cancel each other out. Best line: least-squares, or regression line We can then define the error to be the difference between the coordinates and the prediction line. The coordinate of one point: (x i, y i ) Predicted value for given x i : Best line minimises y i y ˆi ˆ yi b0 b1 x i 2, the sum of the squared errors. Error = distance from one point to the line = Coordinate Prediction

77 87 Some of the errors will be positive and some will be negative! The problem is that when we add positive and negative values, they tend to cancel each other out. Best line: least-squares, or regression line When we square those error lines, we are literally making squares from those lines. We can visualise this as... So we want to find the regression line that minimises the sum of the areas of these error squares. For this regression line, the sum of the areas of the squares would look like this...

78 88 Let`s determine the best-fitted line for following data: b Least Squares Method 1 2 s s xy x b0 y b1 x

79 89 b Least Squares Method 1 2 s s xy x b0 y b1 x

80 90 b Least Squares Method 1 2 s s xy x b0 y b1 x

81 91 b Least Squares Method 1 2 s s xy x b0 y b1 x

82 92 b Least Squares Method 1 2 s s xy x b0 y b1 x Lines of best fit will pivot around the point which represents the mean of X and the mean of the Y variables!

83 93 b Least Squares Method 1 2 s s xy x b0 y b1 x sx 2 x i x n 1 2

84 94 b Least Squares Method 1 2 s s xy x b0 y b1 x

85 95 b Least Squares Method 1 2 s s xy x b0 y b1 x

86 96 b Least Squares Method 1 2 s s xy x b0 y b1 x

87 97 b Least Squares Method 1 2 s s xy x b0 y b1 x

88 98 b Least Squares Method 1 2 s s xy x b0 y b1 x

89 99 b Least Squares Method 1 2 s s xy x b0 y b1 x

90 100 b Least Squares Method 1 2 s s xy x b0 y b1 x sx 2 x i x n 1 2

91 101 b Least Squares Method 1 2 s s xy x b0 y b1 x sx 2 x i x n 1 2

92 102 b Least Squares Method 1 2 s s xy x b0 y b1 x sx 2 x i x n 1 2

93 103 b Least Squares Method 1 2 s s xy x b0 y b1 x sx 2 x i x n 1 2

94 104 b Least Squares Method 1 2 s s xy x b0 y b1 x sx 2 x i x n 1 2

95 105 b Least Squares Method 1 2 s s xy x b0 y b1 x

96 106 Line of Best Fit Only for medium to strong correlations...

97 Line of Best Fit 107

98 Line of Best Fit 108

99 109 r = Sample Coefficient of Correlation What line? r measures closeness of data to the best line. best? In terms of least squared error: How

100 110 Interpretation of the b 0, b 1, yˆ x i i In a fixed and variable costs model: yˆ x i i b 0 =9.95? Intercept: predicted value of y when x = 0. b 1 =2.25? Slope: predicted change in y when x increases by 1.

101 111 Interpretation of the b 0, b 1, yˆ x i i A simple example of a linear equation A company has fixed costs of $7,000 for plant and equipment and variable costs of $600 for each unit of output. What is total cost at varying levels of output? let x = units of output let C = total cost C = fixed cost plus variable cost = 7, x

102 112 Interpretation of the b 0, b 1, yˆ x i i b 1, slope, always has the same sign as r, the correlation coefficient but they measure different things! The sum of the errors (or residuals),, is always 0 i y i (zero). The line always passes through the point. y ˆ x, y

103 113 Coefficient of Determination When we introduced the coefficient of correlation we pointed out that except for 1, 0, and +1 we cannot precisely interpret its meaning. We can judge the coefficient of correlation in relation to its proximity to 1, 0, and +1 only. Fortunately, we have another measure that can be precisely interpreted. It is the coefficient of determination, which is calculated by squaring the coefficient of correlation. For this reason we denote it R 2. The coefficient of determination measures the amount of variation in the dependent variable that is explained by the variation in the independent variable.

104 114 Coefficient of Determination The coefficient of determination measures the amount of variation in the dependent variable that is explained by the variation in the independent variable. The coefficient of determination is R 2 = This tells us that 75.8% of the variation in electrical costs is explained by the number of tools. The remaining 24.2% is unexplained.

105 Least Squares Method --- R 2 115

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities Valua%on and pricing (November 5, 2013) LEARNING OBJECTIVES Lecture 7 Decision making (part 3) Regression theory Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com 1. List the steps

More information

Regression Models. Chapter 4

Regression Models. Chapter 4 Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Introduction Regression analysis

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Statistics for Managers using Microsoft Excel 6 th Edition

Statistics for Managers using Microsoft Excel 6 th Edition Statistics for Managers using Microsoft Excel 6 th Edition Chapter 13 Simple Linear Regression 13-1 Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

Chapter 16. Simple Linear Regression and dcorrelation

Chapter 16. Simple Linear Regression and dcorrelation Chapter 16 Simple Linear Regression and dcorrelation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Chapter 16. Simple Linear Regression and Correlation

Chapter 16. Simple Linear Regression and Correlation Chapter 16 Simple Linear Regression and Correlation 16.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X.

Estimating σ 2. We can do simple prediction of Y and estimation of the mean of Y at any value of X. Estimating σ 2 We can do simple prediction of Y and estimation of the mean of Y at any value of X. To perform inferences about our regression line, we must estimate σ 2, the variance of the error term.

More information

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006

Keller: Stats for Mgmt & Econ, 7th Ed July 17, 2006 Chapter 17 Simple Linear Regression and Correlation 17.1 Regression Analysis Our problem objective is to analyze the relationship between interval variables; regression analysis is the first tool we will

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES

Regression Models REVISED TEACHING SUGGESTIONS ALTERNATIVE EXAMPLES M04_REND6289_10_IM_C04.QXD 5/7/08 2:49 PM Page 46 4 C H A P T E R Regression Models TEACHING SUGGESTIONS Teaching Suggestion 4.1: Which Is the Independent Variable? We find that students are often confused

More information

Chapter 14 Simple Linear Regression (A)

Chapter 14 Simple Linear Regression (A) Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

Inference for Regression

Inference for Regression Inference for Regression Section 9.4 Cathy Poliak, Ph.D. cathy@math.uh.edu Office in Fleming 11c Department of Mathematics University of Houston Lecture 13b - 3339 Cathy Poliak, Ph.D. cathy@math.uh.edu

More information

Simple Linear Regression

Simple Linear Regression 9-1 l Chapter 9 l Simple Linear Regression 9.1 Simple Linear Regression 9.2 Scatter Diagram 9.3 Graphical Method for Determining Regression 9.4 Least Square Method 9.5 Correlation Coefficient and Coefficient

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

Business Statistics. Lecture 10: Correlation and Linear Regression

Business Statistics. Lecture 10: Correlation and Linear Regression Business Statistics Lecture 10: Correlation and Linear Regression Scatterplot A scatterplot shows the relationship between two quantitative variables measured on the same individuals. It displays the Form

More information

STA121: Applied Regression Analysis

STA121: Applied Regression Analysis STA121: Applied Regression Analysis Linear Regression Analysis - Chapters 3 and 4 in Dielman Artin Department of Statistical Science September 15, 2009 Outline 1 Simple Linear Regression Analysis 2 Using

More information

Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is

Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is Lecture 15 Multiple regression I Chapter 6 Set 2 Least Square Estimation The quadratic form to be minimized is Q = (Y i β 0 β 1 X i1 β 2 X i2 β p 1 X i.p 1 ) 2, which in matrix notation is Q = (Y Xβ) (Y

More information

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables.

Regression Analysis. BUS 735: Business Decision Making and Research. Learn how to detect relationships between ordinal and categorical variables. Regression Analysis BUS 735: Business Decision Making and Research 1 Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn how to estimate

More information

BNAD 276 Lecture 10 Simple Linear Regression Model

BNAD 276 Lecture 10 Simple Linear Regression Model 1 / 27 BNAD 276 Lecture 10 Simple Linear Regression Model Phuong Ho May 30, 2017 2 / 27 Outline 1 Introduction 2 3 / 27 Outline 1 Introduction 2 4 / 27 Simple Linear Regression Model Managerial decisions

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

Review of Statistics 101

Review of Statistics 101 Review of Statistics 101 We review some important themes from the course 1. Introduction Statistics- Set of methods for collecting/analyzing data (the art and science of learning from data). Provides methods

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company

Multiple Regression. Inference for Multiple Regression and A Case Study. IPS Chapters 11.1 and W.H. Freeman and Company Multiple Regression Inference for Multiple Regression and A Case Study IPS Chapters 11.1 and 11.2 2009 W.H. Freeman and Company Objectives (IPS Chapters 11.1 and 11.2) Multiple regression Data for multiple

More information

Ch14. Multiple Regression Analysis

Ch14. Multiple Regression Analysis Ch14. Multiple Regression Analysis 1 Goals : multiple regression analysis Model Building and Estimating More than 1 independent variables Quantitative( 量 ) independent variables Qualitative( ) independent

More information

STAT 350 Final (new Material) Review Problems Key Spring 2016

STAT 350 Final (new Material) Review Problems Key Spring 2016 1. The editor of a statistics textbook would like to plan for the next edition. A key variable is the number of pages that will be in the final version. Text files are prepared by the authors using LaTeX,

More information

STATISTICS 110/201 PRACTICE FINAL EXAM

STATISTICS 110/201 PRACTICE FINAL EXAM STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable

More information

Chapte The McGraw-Hill Companies, Inc. All rights reserved.

Chapte The McGraw-Hill Companies, Inc. All rights reserved. 12er12 Chapte Bivariate i Regression (Part 1) Bivariate Regression Visual Displays Begin the analysis of bivariate data (i.e., two variables) with a scatter plot. A scatter plot - displays each observed

More information

ST430 Exam 2 Solutions

ST430 Exam 2 Solutions ST430 Exam 2 Solutions Date: November 9, 2015 Name: Guideline: You may use one-page (front and back of a standard A4 paper) of notes. No laptop or textbook are permitted but you may use a calculator. Giving

More information

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal

Econ 3790: Business and Economics Statistics. Instructor: Yogesh Uppal Econ 3790: Business and Economics Statistics Instructor: Yogesh Uppal yuppal@ysu.edu Sampling Distribution of b 1 Expected value of b 1 : Variance of b 1 : E(b 1 ) = 1 Var(b 1 ) = σ 2 /SS x Estimate of

More information

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal

Econ 3790: Statistics Business and Economics. Instructor: Yogesh Uppal Econ 3790: Statistics Business and Economics Instructor: Yogesh Uppal Email: yuppal@ysu.edu Chapter 14 Covariance and Simple Correlation Coefficient Simple Linear Regression Covariance Covariance between

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM

Draft Proof - Do not copy, post, or distribute. Chapter Learning Objectives REGRESSION AND CORRELATION THE SCATTER DIAGRAM 1 REGRESSION AND CORRELATION As we learned in Chapter 9 ( Bivariate Tables ), the differential access to the Internet is real and persistent. Celeste Campos-Castillo s (015) research confirmed the impact

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression In simple linear regression we are concerned about the relationship between two variables, X and Y. There are two components to such a relationship. 1. The strength of the relationship.

More information

Variance Decomposition and Goodness of Fit

Variance Decomposition and Goodness of Fit Variance Decomposition and Goodness of Fit 1. Example: Monthly Earnings and Years of Education In this tutorial, we will focus on an example that explores the relationship between total monthly earnings

More information

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x).

Linear Regression. Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). Linear Regression Simple linear regression model determines the relationship between one dependent variable (y) and one independent variable (x). A dependent variable is a random variable whose variation

More information

CHAPTER EIGHT Linear Regression

CHAPTER EIGHT Linear Regression 7 CHAPTER EIGHT Linear Regression 8. Scatter Diagram Example 8. A chemical engineer is investigating the effect of process operating temperature ( x ) on product yield ( y ). The study results in the following

More information

CS 5014: Research Methods in Computer Science

CS 5014: Research Methods in Computer Science Computer Science Clifford A. Shaffer Department of Computer Science Virginia Tech Blacksburg, Virginia Fall 2010 Copyright c 2010 by Clifford A. Shaffer Computer Science Fall 2010 1 / 207 Correlation and

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

Chapter 12 - Part I: Correlation Analysis

Chapter 12 - Part I: Correlation Analysis ST coursework due Friday, April - Chapter - Part I: Correlation Analysis Textbook Assignment Page - # Page - #, Page - # Lab Assignment # (available on ST webpage) GOALS When you have completed this lecture,

More information

Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV)

Example: Forced Expiratory Volume (FEV) Program L13. Example: Forced Expiratory Volume (FEV) Example: Forced Expiratory Volume (FEV) Program L13 Relationships between two variables Correlation, cont d Regression Relationships between more than two variables Multiple linear regression Two numerical variables Linear or curved relationship?

More information

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017

Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 Variance Decomposition in Regression James M. Murray, Ph.D. University of Wisconsin - La Crosse Updated: October 04, 2017 PDF file location: http://www.murraylax.org/rtutorials/regression_anovatable.pdf

More information

Regression With a Categorical Independent Variable

Regression With a Categorical Independent Variable Regression ith a Independent Variable ERSH 8320 Slide 1 of 34 Today s Lecture Regression with a single categorical independent variable. Today s Lecture Coding procedures for analysis. Dummy coding. Relationship

More information

ECON 450 Development Economics

ECON 450 Development Economics ECON 450 Development Economics Statistics Background University of Illinois at Urbana-Champaign Summer 2017 Outline 1 Introduction 2 3 4 5 Introduction Regression analysis is one of the most important

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Data Analysis 1 LINEAR REGRESSION. Chapter 03

Data Analysis 1 LINEAR REGRESSION. Chapter 03 Data Analysis 1 LINEAR REGRESSION Chapter 03 Data Analysis 2 Outline The Linear Regression Model Least Squares Fit Measures of Fit Inference in Regression Other Considerations in Regression Model Qualitative

More information

Linear regression. Linear regression is a simple approach to supervised learning. It assumes that the dependence of Y on X 1,X 2,...X p is linear.

Linear regression. Linear regression is a simple approach to supervised learning. It assumes that the dependence of Y on X 1,X 2,...X p is linear. Linear regression Linear regression is a simple approach to supervised learning. It assumes that the dependence of Y on X 1,X 2,...X p is linear. 1/48 Linear regression Linear regression is a simple approach

More information

CHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics

CHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics CHAPTER 4 & 5 Linear Regression with One Regressor Kazu Matsuda IBEC PHBU 430 Econometrics Introduction Simple linear regression model = Linear model with one independent variable. y = dependent variable

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Correlation 1. December 4, HMS, 2017, v1.1

Correlation 1. December 4, HMS, 2017, v1.1 Correlation 1 December 4, 2017 1 HMS, 2017, v1.1 Chapter References Diez: Chapter 7 Navidi, Chapter 7 I don t expect you to learn the proofs what will follow. Chapter References 2 Correlation The sample

More information

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables

Regression Analysis. Regression: Methodology for studying the relationship among two or more variables Regression Analysis Regression: Methodology for studying the relationship among two or more variables Two major aims: Determine an appropriate model for the relationship between the variables Predict the

More information

3. Diagnostics and Remedial Measures

3. Diagnostics and Remedial Measures 3. Diagnostics and Remedial Measures So far, we took data (X i, Y i ) and we assumed where ɛ i iid N(0, σ 2 ), Y i = β 0 + β 1 X i + ɛ i i = 1, 2,..., n, β 0, β 1 and σ 2 are unknown parameters, X i s

More information

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS

THE ROYAL STATISTICAL SOCIETY 2008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS THE ROYAL STATISTICAL SOCIETY 008 EXAMINATIONS SOLUTIONS HIGHER CERTIFICATE (MODULAR FORMAT) MODULE 4 LINEAR MODELS The Society provides these solutions to assist candidates preparing for the examinations

More information

Introduction to Regression

Introduction to Regression Regression Introduction to Regression If two variables covary, we should be able to predict the value of one variable from another. Correlation only tells us how much two variables covary. In regression,

More information

Unit 10: Simple Linear Regression and Correlation

Unit 10: Simple Linear Regression and Correlation Unit 10: Simple Linear Regression and Correlation Statistics 571: Statistical Methods Ramón V. León 6/28/2004 Unit 10 - Stat 571 - Ramón V. León 1 Introductory Remarks Regression analysis is a method for

More information

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics

Chapter 13 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics Chapter 13 Student Lecture Notes 13-1 Department of Quantitative Methods & Information Sstems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analsis QMIS 0 Dr. Mohammad

More information

Multiple Regression. Peerapat Wongchaiwat, Ph.D.

Multiple Regression. Peerapat Wongchaiwat, Ph.D. Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model

More information

STAT Chapter 11: Regression

STAT Chapter 11: Regression STAT 515 -- Chapter 11: Regression Mostly we have studied the behavior of a single random variable. Often, however, we gather data on two random variables. We wish to determine: Is there a relationship

More information

Simple Linear Regression

Simple Linear Regression Simple Linear Regression ST 430/514 Recall: A regression model describes how a dependent variable (or response) Y is affected, on average, by one or more independent variables (or factors, or covariates)

More information

Chapter 14. Linear least squares

Chapter 14. Linear least squares Serik Sagitov, Chalmers and GU, March 5, 2018 Chapter 14 Linear least squares 1 Simple linear regression model A linear model for the random response Y = Y (x) to an independent variable X = x For a given

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

FAQ: Linear and Multiple Regression Analysis: Coefficients

FAQ: Linear and Multiple Regression Analysis: Coefficients Question 1: How do I calculate a least squares regression line? Answer 1: Regression analysis is a statistical tool that utilizes the relation between two or more quantitative variables so that one variable

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

Correlation. A statistics method to measure the relationship between two variables. Three characteristics

Correlation. A statistics method to measure the relationship between two variables. Three characteristics Correlation Correlation A statistics method to measure the relationship between two variables Three characteristics Direction of the relationship Form of the relationship Strength/Consistency Direction

More information

Linear Regression and Correlation

Linear Regression and Correlation Linear Regression and Correlation Chapter 13 McGraw-Hill/Irwin Copyright 2010 by The McGraw-Hill Companies, Inc. All rights reserved. GOALS 1. Understand and interpret the terms dependent and independent

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

Can you tell the relationship between students SAT scores and their college grades?

Can you tell the relationship between students SAT scores and their college grades? Correlation One Challenge Can you tell the relationship between students SAT scores and their college grades? A: The higher SAT scores are, the better GPA may be. B: The higher SAT scores are, the lower

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

Business Statistics. Lecture 9: Simple Regression

Business Statistics. Lecture 9: Simple Regression Business Statistics Lecture 9: Simple Regression 1 On to Model Building! Up to now, class was about descriptive and inferential statistics Numerical and graphical summaries of data Confidence intervals

More information

Multiple Linear Regression. Chapter 12

Multiple Linear Regression. Chapter 12 13 Multiple Linear Regression Chapter 12 Multiple Regression Analysis Definition The multiple regression model equation is Y = b 0 + b 1 x 1 + b 2 x 2 +... + b p x p + ε where E(ε) = 0 and Var(ε) = s 2.

More information

Simple Linear Regression: One Qualitative IV

Simple Linear Regression: One Qualitative IV Simple Linear Regression: One Qualitative IV 1. Purpose As noted before regression is used both to explain and predict variation in DVs, and adding to the equation categorical variables extends regression

More information

Unit 11: Multiple Linear Regression

Unit 11: Multiple Linear Regression Unit 11: Multiple Linear Regression Statistics 571: Statistical Methods Ramón V. León 7/13/2004 Unit 11 - Stat 571 - Ramón V. León 1 Main Application of Multiple Regression Isolating the effect of a variable

More information

REVIEW 8/2/2017 陈芳华东师大英语系

REVIEW 8/2/2017 陈芳华东师大英语系 REVIEW Hypothesis testing starts with a null hypothesis and a null distribution. We compare what we have to the null distribution, if the result is too extreme to belong to the null distribution (p

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Analysing data: regression and correlation S6 and S7

Analysing data: regression and correlation S6 and S7 Basic medical statistics for clinical and experimental research Analysing data: regression and correlation S6 and S7 K. Jozwiak k.jozwiak@nki.nl 2 / 49 Correlation So far we have looked at the association

More information

ST Correlation and Regression

ST Correlation and Regression Chapter 5 ST 370 - Correlation and Regression Readings: Chapter 11.1-11.4, 11.7.2-11.8, Chapter 12.1-12.2 Recap: So far we ve learned: Why we want a random sample and how to achieve it (Sampling Scheme)

More information

What is a Hypothesis?

What is a Hypothesis? What is a Hypothesis? A hypothesis is a claim (assumption) about a population parameter: population mean Example: The mean monthly cell phone bill in this city is μ = $42 population proportion Example:

More information

Lectures on Simple Linear Regression Stat 431, Summer 2012

Lectures on Simple Linear Regression Stat 431, Summer 2012 Lectures on Simple Linear Regression Stat 43, Summer 0 Hyunseung Kang July 6-8, 0 Last Updated: July 8, 0 :59PM Introduction Previously, we have been investigating various properties of the population

More information