MBA Statistics COURSE #4

Size: px
Start display at page:

Download "MBA Statistics COURSE #4"

Transcription

1 MBA Statistics COURSE #4 Simple and multiple linear regression What should be the sales of ice cream?

2 Example: Before beginning building a movie theater, one must estimate the daily number of people entering the building. How can we estimate it? There are 2 millions individuals in the city. 2

3 Possible solutions: One could realize a local market study. However it is often imprecise, specially for new projects. One could get data from similar projects in other cities. 3

4 City Attendance (x1000) What do you think? Can we do better? 4

5 Probably, taking into account the size of the city City Attendance (x1 000) Size (millions)

6 Case study: Ice Cream Sales The file icecream.xls contains pairs of data representing ice cream sales and temperature recorded that day, for 30 days. Is there a relation between temperature and sales? Can temperature be used to predict ice cream sales? If so what s the prediction when the temperature is 25? 6

7 Introduction One of the principle objectives of statistics is to explain the variability that we observe in data. Linear regression (or linear models) is a statistical tool MUCH USED to study the presence of a linear relation between a dependent variable Y (quantitative and continuous) and one or more independent variables X 1, X 2,, X p (qualitative and/or quantitative), called independent or explanatory variables. 7

8 For example, a manager could be interested in seeing if he could explain a good part of the variability that he observes in sales in his differents branches (dependant variable Y) in the last 12 months, by the area, number of employees, number of payed overtime hours, quality of customer service, number of promotions, etc. ( independent or explanatory variables). 8

9 A regression model can be used to answer one of the following three objectives: Describe data coming from non experimental studies i.e. we observe reality as it is. Examine the hypothesis (data coming from controled experimental studies). Predict (if we like to take risks!!). 9

10 Example: We are interested in knowing what are the important factors that influence or determine the value of a property and we want to build a model that would help us evaluate this value using certain factors. To do this, we have obtained the total value for a sample of 79 properties in a given region. The following variables have also been collected for each property: 10

11 Brief glimpse of the data file:house.xls # of square feet total land first outdoor heating OBS value value # of acres floor condition type Good NatGas Good NatGas Good Electric Average Electric Average NatGas Good Electric Excellnt Electric OBS # of # of # of completed # of non completed # of rooms bedroom bathrooms bathrooms fire-places GARAGE Garage NoGarage Garage Garage NoGarage Garage Garage 11

12 Is there a link between the total value and the different factors? Total Land

13 Total Total Acre Sq.Feet Total Total Rooms Bedroom 13

14 Total Total Completed Bathrooms Bathrooms Total Total Fire-place NoGarage Garage Garage 14

15 The Pearson correlation coefficient r is used to measure the intensity of the linear relation between two quantitative variables. The correlation coefficient r will take its values between -1 and 1. If a perfect linear relation exist between X and Y, then r = ±1 (r =1 if X and Y vary in the same direction and r = -1 if X varies in the opposite direction of Y). If r = 0, there is no linear link between X and Y. The more the r value furthers from 0 to get closer to ±1, the more the linear link intensity between X and Y becomes larger. 15

16 Y 6.5 * r = Y r = 1 31 * 6.0 * * 29 * 27 * 25 * 5.5 * * 23 * 21 * 19 * 5.0 * 17 * 15 * 13 * 4.5 * * * 11 * * * X X Y r = * * * * * * * * * * * X 16

17 Descriptive statistics Variable N Mean Median Sta.Deviation Minimum Maximum Total Land Acre Sq.Feet Rooms Bedrooms C.Bathro Bathro Fire-pl Pearson Correlation Coefficients Total Land Acre Sq.Feet Rooms Bedroom C.Bathro Bathro Land Acre Sq.Feet Rooms Bedrooms C.Bathro Bathro Fire-pl

18 BE CAREFULL!! it is important to interpret the correlation coefficient with the graph. r = in all cases below * * * * * * 10.0 * 8 * * * Y1 * Y2 * * 7.5 * * 6 * * * * 5.0 * 4 * * X X 15.0 Y * 12.5 * Y * * * * * 7.5 * 7.5 * * * * * * * * * * * * X X

19 Simple linear regression To describe a linear relation between two quantitative variables or to be able to predict Y for a given value of X, we use a regression line: Y = β 0 + β 1 X + ε Since any statistical model is only an approximation (we hope the best possible!!) and because the linear link is never perfect, in the model, there is always an error, noted ε. If there was a perfect linear relation between Y and X, the error term would always be equal to 0, and all the variability of Y would be explained by the independent variable X. 19

20 So, for a given value of X, we would like to estimate Y. Thus, with the help of the data sample we will estimate the regression model parameters β 0 and β 1 in order to minimize the residuals (errors) sum of squares. The squared correlation coefficient is called the coefficient of determination and the percentage of the variability of Y explained by X: R 2 = 1 - (n-2)/(n-1){s e /S y } 2, where S e is the standard deviation of the errors and S y is the standard deviation of Y. 20

21 We can also use the adjusted coefficient of determination to indicate the percentage of the variability of Y explained by X: R 2 ajusted = 1 - {S e /S y }2. 21

22 Simple linear regression example: MODEL 1. Regression Analysis The regression equation is Total = Sq.Feet Predictor Coef StDev T P Constant Sq.Feet S = R-Sq = 58.8% R-Sq(adj) = 58.2% Analysis of Variance Source DF SS MS F P Regression E E Residual Error E Total E+11 22

23 MODEL 2. The regression equation is : Total = Rooms Predictor Coef StDev T P Constant Rooms S = R-Sq = 39.3% R-Sq(adj) = 38.5% Analysis of Variance Source DF SS MS F P Regression E E Residual Error E Total E+11 MODEL 3. The regression equation is : Total = Bedrooms Predictor Coef StDev T P Constant Bedrooms S = R-Sq = 33.9% R-Sq(adj) = 33.1% Analysis of Variance Source DF SS MS F P Regression E E Residual Error E Total E+11 23

24 Model 1: total value = *( # of squared feet ). R 2 = 58.8%. Thus 58.8% of the variability of the total value is explained by the # of squared feet. Model 2: total value = *(# of rooms ). R 2 = 39.3%. Thus 39.3% of the variability of the total value is explained by the # of rooms. Model 3: total value = *(# of bedrooms ). R 2 = 33.9%. Thus 33.9% of the variability of the total value is explained by the # of bedrooms. 24

25 Which one of the 3 previous models would you choose and why? Model 1 because it has the largest value of R 2. 25

26 1-α confidence interval for the mean of the values of Y for a specific value of X: For model 1 and a value of X=1500 sq.ft we obtain the following point estimation : est. total value = *1500 = $ 95% confidence interval for the mean of the total value for properties of 1500 sq.ft : [ , ] as calculated by CI-regression.xls 26

27 1-α confidence interval for a new value of Y (prediction) being given a specific value of X: For model 1 and a value of X=1500 sq.ft we obtain the following point estimation : est.total value = *1500 = $ 95% confidence interval for a predicted total value when the area of the first floor is 1500 sq.ft : [59 742, ] The confidence interval for a predicted value is always larger than for the mean of the value of Y for a specific X. 27

28 Inference on regression model parameters: If there is no linear link between Y and X then β 1 = 0. So, we want to examine the following hypothesis : H 0 : β 1 = 0 vs H 1 : β 1 0 We will reject H 0 when the p-value is too small This test will be valid if the relation between X and Y is linear the data are independent the variance of Y is the same for every value of X. Y has a normal distribution for every value of X or the sample size n is large. 28

29 Multiple linear regression It is more likely possible that the variability of the dependent variable Y will be explained not only by one independent variable X, but rather by a linear combination of several independent variables X 1, X 2,, X p. In this case, the multiple regression model is given by: Y = β 0 + β 1 X 1 + β 2 X β p X p + ε Also, using the sample data, we will estimate the regression model parameters β 0, β 1,, β p in order to minimize the residuals (errors) sum of squares. 29

30 The multiple correlation coefficient R 2, also called the coefficient of determination, represents the percentage of the variability of Y explained by the independent variables X 1, X 2,, X p. In the model, when we add one or more independent variables, R 2 increases. The question is to know if R 2 increases to a significant degree. Note that we cannot have more independent variables in the model that there are observations in the sample. (general rule: n 5p). 30

31 Example: MODEL 1. The regression equation is Total = Land Acre Sq.Feet Rooms Bedroom C.Bathro Bathro Fire-pl. Predictor Coef StDev T P Constant Land Acre Sq.Feet Rooms Bedroom CBathro Bathro Fire-pl S = R-Sq = 88.9% R-Sq(adj) = 87.6% Analysis of Variance Source DF SS MS F P Regression E Residual Error Total E+11 31

32 MODEL 2 Regression Analysis The regression equation is Total = Land Acre Sq.Feet Bedroom C.bathro Bathro Predictor Coef StDev T P Constant Land Acre Sq.Feet Bedroom C.bathro Bathro S = R-Sq = 88.5% R-Sq(adj) = 87.6% Analysis of Variance Source DF SS MS F P Regression E Residual Error Total E+11 32

33 MODEL 3 Regression Analysis The regression equation is Total = Land Acre Sq.Feet C.bathro Bathro Predictor Coef StDev T P Constant Land Acre Sq.Feet C.bathro Bathro S = R-Sq = 88,3% R-Sq(adj) = 87,5% Analysis of Variance Source DF SS MS F P Regression E Residual Error Total E+11 33

34 Model without the area of the land ( # of acres ) because of the multicolinearity with the land value. MODEL 4 The regression equation is Total = Land Sq.Feet C.bathro Bathro Predictor Coef StDev T P Constant Land Sq.Feet C.bathro Bathro S = R-Sq = 87.0% R-Sq(adj) = 86.3% Analysis of Variance Source DF SS MS F P Regression E E Residual Error Total E+11 34

35 Which one of the 4 previous models would you choose and why? Probably model 4 because all the independent variables are significant at the 5% level (i.e. for each β in the model, p-value < 5%) and although R 2 is smaller, it is just marginally smaller. Moreover, all the model coefficients make«sense»! In model 1, the variables # of rooms and # of fire-place are not statistically significant at the 5% level (p-value > 5%). The variable # of bedrooms is at the limit with a p-value =

36 Which one of the 4 previous models would you choose and why?(continued) In model 2 the variable # of bedroom is not statistically significant at the 5% level. In model 3 (and the previous models), the variable # of acres coefficient is negative which is contrary to «common sense» and to what we observed in the scatter plot and the positive Pearson correlation coefficient (r = 0.608). In models 1 to 3, the negative coefficient for the variable # of acres is due to the fact that there is a strong linear relation between the value of the land and the area of the land (r = 0.918): multicolinearity problem. 36

37 Multicolinearity If two or more explanatory variables are strongly correlated (> 0.85 in absolute value), one says that there is multicolinearity. It has an influence on the estimation of parameters in the model. If two explanatory variables are highly correlated, then can get rid of one of these variables. Because of the strong correlation, the contribution of the other variable is not significant. The correlation between several pairs of variables can be calculated in Excel using correlation in the Data Analysis toolbox. 37

38 How can we choose a particular linear regression model among all the possible ones? There are several techniques: Step by step selection by adding one variable at a time, starting with the most significant one (stepwise, forward). Selection starting from the model in which all the variables are included and removing one variable at a time starting with the least significant (backward). Construct all possible models and choose the best subset of variables according to certain specific criteria (ex: adjusted R 2, C p de Mallow.) 38

39 Example of selection among the best subsets: Best Subsets Regression : Response is Total B C S e b B F q R d a a i L A f o r t t r a c e o o h h e Adj. n r e m o r r p Vars R-Sq R-Sq C-p s d e t s m o o l X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X 39

40 Selection of the model without the variable # of acres Best Subsets Regression : Response is Total B C S e b B F q R d a a i L f o r t t r a e o o h h e Adj. n e m o r r p Vars R-Sq R-Sq C-p s d t s m o o l X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X X 40

41 The selection of the best model is done according to the combination: The greatest value of R 2 adjusted for the number of variables in the model. The smallest value of C p. For the models with R 2 adjusted and comparable C p, we will choose the model which has the most «common sense» according to the experts in the field. For the models with R 2 adjusted and comparable C p, the model with the independent variables that are the easiest and least expensive to measure. The model validity. 41

42 1-α confidence interval for Y mean and a new value of Y (prediction) being given a specific value combination for X 1, X 2,, X p. For model 4 and property with a land= $, sq.ft = 1500, 2 completed bathrooms and 1 notcompleted, we obtain the following point estimation : est. total value = * * * *1 = $ 95% confidence interval for the mean of the total value: [ , ] 95% confidence interval for a total predicted value : [ , ] 42

43 Notes: For a 1500 sq.ft property, the multiple regression model gives a smaller 95% confidence intervals than the simple regression model. Therefore the addition of several other variables in the model helped to better explain the total value variability and to improve our estimations. If two or more independent variables are correlated we will say that there is multicolinearity. This can influence the value of the parameters in the model. Also, if two independent variables are strongly correlated then only one of the two variables would be included in the model, the other one bringing very little additional information. Certain conditions are required for the validity of the model and the corresponding inference (similar to the simple linear regression ). 43

44 Dummy variables How can one take into account qualitative information in a regression? Application: Test on two or more means 44

45 Trick If a qualitative variable takes two values, one defined one dummy variable taking values 0 or 1. Examples: Sex: 1 if male, 0 otherwise Garage: 1 if garage, 0 if not. 45

46 Trick (continued) More generally, if a qualitative variable can take m values, one defines (m-1) dummy variables all takong values 0 or 1. Example: Sex and job category (executive, white-collar, bue-collar) X 1 = 1 if male, 0 otherwise. X 2 = 1 si exe, 0 otherwise. X 3 = 1 si w-c, 0 otherwise. 46

47 Example One wants to explain the salary of an employee (Y) with the following variables: sex, job category and experience. X 1 = 1 if male, 0 otherwise. X 2 = 1 if exe, 0 otherwise. X 3 = 1 if w-c, 0 otherwise. X 4 = years of experience. 47

48 Example (continued) Regression model: Y = β 0 + β 1 X 1 + β 2 X 2 + β 3 X 3 + β 4 X 4 + ε Question: Interpret β 0, β 1, β 2, β 3, β 4. How do know if women have a smaller salary? 48

49 P-value for one-tailed tests in Excel. The evaluation of the p-value of a one-tailed test hypothesis H 1 is not given in general, only the p-value of a two-tailed test. For example, in regression, Excel calculates the p-value P corresponding to H 0 : β i = 0 vs H 1 : β i 0. How can we calculate the p-value correponding to one-tailed hypotheses H 1? 49

50 Rules : P: p-value for the two-tailed test. If H 1 is of the form β i > 0 and b i >0, then the p-value of the right-tailed is P/2. Otherwise it is 1- P/2. If H 1 is of the form β i < 0 and b i <0, then the p-value of the right-tailed is P/2. Otherwise it is 1- P/2. In other words, the one-tailed p-value is half of the two-tailed p- value when the estimated coefficient has the same sign as the coefficient in H 1. Otherwise, it is 1- p-value /2. 50

51 Question: One wants to know if having a garage increase the total value of the property. The hypotheses to be tested should be: H 0 : β garage 0 vs H 1 : β garage >0 Since b garage = > 0, the p-value corresponding to H 1 : β garage > 0 is /2 = < The anwser is yes because we accept H 1. Does the decision depend on coding? 51

52 If the dummy is defined by 0 if there is a garage and 1 otherwise, we would have got: Totale = ,83 Terrain + 47,2 Pied SbainsC Sbains Garage Predictor Coef StDev T P Constant ,08 0,000 Terrain 1,8342 0,1892 9,69 0,000 Pied2 47,175 7,013 6,73 0,000 SbainsC ,54 0,001 Sbains ,63 0,001 Garage ,01 0,058 S = R-Sq = 87,6% R-Sq(adj) = 86,8% 52

53 In that case, the right choice for hypotheses would have been: H 0 : β garage 0 vs H 1 : β garage <0 The corresponding p-value stays = 0.058/2 because b garage = < 0 has the same sign as β garage in H 1. 53

54 Comparison of several means Suppose one wants to compare the respective means of a quantitative variable Y for two groups: µ 1 = mean of group 1, µ 2 = mean of group 2. One can use regression by defining X = 1 for group 1, and X= 0 for group 2. In this case, β = µ 1 µ 2. 54

55 Hypothesis H 1 : µ 1 > µ 2 correspond to H 1 : β > 0. Hypothesis H 1 : µ 1 < µ 2 correspond to H 1 : β < 0. Hypothesis H 1 : µ 1 µ 2 correspond to H 1 : β 0. 55

56 Example A manager has some doubts on the (positive) effects of a course in order to improve the speed a given task is performed by employees. To confirm his belief, he asked a technician to choose at random 10 employees and to measure the time (hours) to complete a task. Then the same employees attend the course. After the course the employees had to realize a similar task. The results are summarized in the following table: manager.xls 56

57 Questions: a) Should the company maintain the formation program? Take α = 5%. b) The technician in charge of the measurements forget to identify employees on the measurements form. What is the conclusion using that data set? Unfortunately, case b is based on a real case. 57

58 Solution For situation a), data are paired and we have to check if the differences «Before After» are significantly positive. The p-value is < 0.05 = α. One accepts H 1 and the manager conclude that the program should be maintained. 58

59 In the second case, data are not paired. One can use regression with Y = time of execution, and X = 1 for measurements before the course and X = 0 for measurements after the course. In that case, the right choice for H 1 is: H 1 : β > 0 Results are given by: Coefficients Standard Error t Stat P-value Intercept E-19 X

60 Since H 1 : β > 0 (which is equivalent to H 1 : µ before > µ after ), and b = > 0, the p-value is 0.201/2 = > One accepts H 0, so the formation program should nt be maintained. This is a very good example of the consequence of the greater variability for two samples compared to a paired sample. 60

61 Remark: Comparing several means If one needs to compare the means of k group, for some variable Y, one can use also regression. For i=1, 2,, k-1, set: X i = 1 for group i, 0 otherwise. Then β 0 = mean of group k = µ k and β i = µ i - µ k, 1 i k-1. 61

62 Therefore, the regression test where H 0 is given by H 0 : β 1 = β 2 =... = β k-1 = 0 is equivalent to a test where H 0 is given by H 0 : µ 1 = µ 2 =... = µ k If H 0 is rejected, then we conclude that at least two means are different. 62

STATISTICS 110/201 PRACTICE FINAL EXAM

STATISTICS 110/201 PRACTICE FINAL EXAM STATISTICS 110/201 PRACTICE FINAL EXAM Questions 1 to 5: There is a downloadable Stata package that produces sequential sums of squares for regression. In other words, the SS is built up as each variable

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

The simple linear regression model discussed in Chapter 13 was written as

The simple linear regression model discussed in Chapter 13 was written as 1519T_c14 03/27/2006 07:28 AM Page 614 Chapter Jose Luis Pelaez Inc/Blend Images/Getty Images, Inc./Getty Images, Inc. 14 Multiple Regression 14.1 Multiple Regression Analysis 14.2 Assumptions of the Multiple

More information

Chapter 14 Multiple Regression Analysis

Chapter 14 Multiple Regression Analysis Chapter 14 Multiple Regression Analysis 1. a. Multiple regression equation b. the Y-intercept c. $374,748 found by Y ˆ = 64,1 +.394(796,) + 9.6(694) 11,6(6.) (LO 1) 2. a. Multiple regression equation b.

More information

Correlation & Simple Regression

Correlation & Simple Regression Chapter 11 Correlation & Simple Regression The previous chapter dealt with inference for two categorical variables. In this chapter, we would like to examine the relationship between two quantitative variables.

More information

STAT 212 Business Statistics II 1

STAT 212 Business Statistics II 1 STAT 1 Business Statistics II 1 KING FAHD UNIVERSITY OF PETROLEUM & MINERALS DEPARTMENT OF MATHEMATICAL SCIENCES DHAHRAN, SAUDI ARABIA STAT 1: BUSINESS STATISTICS II Semester 091 Final Exam Thursday Feb

More information

28. SIMPLE LINEAR REGRESSION III

28. SIMPLE LINEAR REGRESSION III 28. SIMPLE LINEAR REGRESSION III Fitted Values and Residuals To each observed x i, there corresponds a y-value on the fitted line, y = βˆ + βˆ x. The are called fitted values. ŷ i They are the values of

More information

(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house.

(4) 1. Create dummy variables for Town. Name these dummy variables A and B. These 0,1 variables now indicate the location of the house. Exam 3 Resource Economics 312 Introductory Econometrics Please complete all questions on this exam. The data in the spreadsheet: Exam 3- Home Prices.xls are to be used for all analyses. These data are

More information

Multiple Regression Examples

Multiple Regression Examples Multiple Regression Examples Example: Tree data. we have seen that a simple linear regression of usable volume on diameter at chest height is not suitable, but that a quadratic model y = β 0 + β 1 x +

More information

SMAM 314 Practice Final Examination Winter 2003

SMAM 314 Practice Final Examination Winter 2003 SMAM 314 Practice Final Examination Winter 2003 You may use your textbook, one page of notes and a calculator. Please hand in the notes with your exam. 1. Mark the following statements True T or False

More information

Model Building Chap 5 p251

Model Building Chap 5 p251 Model Building Chap 5 p251 Models with one qualitative variable, 5.7 p277 Example 4 Colours : Blue, Green, Lemon Yellow and white Row Blue Green Lemon Insects trapped 1 0 0 1 45 2 0 0 1 59 3 0 0 1 48 4

More information

INFERENCE FOR REGRESSION

INFERENCE FOR REGRESSION CHAPTER 3 INFERENCE FOR REGRESSION OVERVIEW In Chapter 5 of the textbook, we first encountered regression. The assumptions that describe the regression model we use in this chapter are the following. We

More information

Chapter 9. Correlation and Regression

Chapter 9. Correlation and Regression Chapter 9 Correlation and Regression Lesson 9-1/9-2, Part 1 Correlation Registered Florida Pleasure Crafts and Watercraft Related Manatee Deaths 100 80 60 40 20 0 1991 1993 1995 1997 1999 Year Boats in

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Ch 13 & 14 - Regression Analysis

Ch 13 & 14 - Regression Analysis Ch 3 & 4 - Regression Analysis Simple Regression Model I. Multiple Choice:. A simple regression is a regression model that contains a. only one independent variable b. only one dependent variable c. more

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

Confidence Interval for the mean response

Confidence Interval for the mean response Week 3: Prediction and Confidence Intervals at specified x. Testing lack of fit with replicates at some x's. Inference for the correlation. Introduction to regression with several explanatory variables.

More information

Homework 1 Solutions

Homework 1 Solutions Homework 1 Solutions January 18, 2012 Contents 1 Normal Probability Calculations 2 2 Stereo System (SLR) 2 3 Match Histograms 3 4 Match Scatter Plots 4 5 Housing (SLR) 4 6 Shock Absorber (SLR) 5 7 Participation

More information

Multiple Linear Regression

Multiple Linear Regression Andrew Lonardelli December 20, 2013 Multiple Linear Regression 1 Table Of Contents Introduction: p.3 Multiple Linear Regression Model: p.3 Least Squares Estimation of the Parameters: p.4-5 The matrix approach

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line

Inference for Regression Inference about the Regression Model and Using the Regression Line Inference for Regression Inference about the Regression Model and Using the Regression Line PBS Chapter 10.1 and 10.2 2009 W.H. Freeman and Company Objectives (PBS Chapter 10.1 and 10.2) Inference about

More information

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3

Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details. Section 10.1, 2, 3 Inference for Regression Inference about the Regression Model and Using the Regression Line, with Details Section 10.1, 2, 3 Basic components of regression setup Target of inference: linear dependency

More information

Examination paper for TMA4255 Applied statistics

Examination paper for TMA4255 Applied statistics Department of Mathematical Sciences Examination paper for TMA4255 Applied statistics Academic contact during examination: Anna Marie Holand Phone: 951 38 038 Examination date: 16 May 2015 Examination time

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018

Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Marquette University Executive MBA Program Statistics Review Class Notes Summer 2018 Chapter One: Data and Statistics Statistics A collection of procedures and principles

More information

Models with qualitative explanatory variables p216

Models with qualitative explanatory variables p216 Models with qualitative explanatory variables p216 Example gen = 1 for female Row gpa hsm gen 1 3.32 10 0 2 2.26 6 0 3 2.35 8 0 4 2.08 9 0 5 3.38 8 0 6 3.29 10 0 7 3.21 8 0 8 2.00 3 0 9 3.18 9 0 10 2.34

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Basic Business Statistics 6 th Edition

Basic Business Statistics 6 th Edition Basic Business Statistics 6 th Edition Chapter 12 Simple Linear Regression Learning Objectives In this chapter, you learn: How to use regression analysis to predict the value of a dependent variable based

More information

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. B203: Quantitative Methods Answer all questions from part I. Answer two question from part II.a, and one question from part II.b. Part I: Compulsory Questions. Answer all questions. Each question carries

More information

Analysis of Bivariate Data

Analysis of Bivariate Data Analysis of Bivariate Data Data Two Quantitative variables GPA and GAES Interest rates and indices Tax and fund allocation Population size and prison population Bivariate data (x,y) Case corr&reg 2 Independent

More information

23. Inference for regression

23. Inference for regression 23. Inference for regression The Practice of Statistics in the Life Sciences Third Edition 2014 W. H. Freeman and Company Objectives (PSLS Chapter 23) Inference for regression The regression model Confidence

More information

Business 320, Fall 1999, Final

Business 320, Fall 1999, Final Business 320, Fall 1999, Final name You may use a calculator and two cheat sheets. You have 3 hours. I pledge my honor that I have not violated the Honor Code during this examination. Obvioiusly, you may

More information

School of Mathematical Sciences. Question 1

School of Mathematical Sciences. Question 1 School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 8 and Assignment 7 Solutions Question 1 Figure 1: The residual plots do not contradict the model assumptions of normality, constant

More information

27. SIMPLE LINEAR REGRESSION II

27. SIMPLE LINEAR REGRESSION II 27. SIMPLE LINEAR REGRESSION II The Model In linear regression analysis, we assume that the relationship between X and Y is linear. This does not mean, however, that Y can be perfectly predicted from X.

More information

Section 3: Simple Linear Regression

Section 3: Simple Linear Regression Section 3: Simple Linear Regression Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Regression: General Introduction

More information

Six Sigma Black Belt Study Guides

Six Sigma Black Belt Study Guides Six Sigma Black Belt Study Guides 1 www.pmtutor.org Powered by POeT Solvers Limited. Analyze Correlation and Regression Analysis 2 www.pmtutor.org Powered by POeT Solvers Limited. Variables and relationships

More information

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues

Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Trendlines Simple Linear Regression Multiple Linear Regression Systematic Model Building Practical Issues Overfitting Categorical Variables Interaction Terms Non-linear Terms Linear Logarithmic y = a +

More information

Multiple Regression: Chapter 13. July 24, 2015

Multiple Regression: Chapter 13. July 24, 2015 Multiple Regression: Chapter 13 July 24, 2015 Multiple Regression (MR) Response Variable: Y - only one response variable (quantitative) Several Predictor Variables: X 1, X 2, X 3,..., X p (p = # predictors)

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

School of Mathematical Sciences. Question 1. Best Subsets Regression

School of Mathematical Sciences. Question 1. Best Subsets Regression School of Mathematical Sciences MTH5120 Statistical Modelling I Practical 9 and Assignment 8 Solutions Question 1 Best Subsets Regression Response is Crime I n W c e I P a n A E P U U l e Mallows g E P

More information

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang

Project Report for STAT571 Statistical Methods Instructor: Dr. Ramon V. Leon. Wage Data Analysis. Yuanlei Zhang Project Report for STAT7 Statistical Methods Instructor: Dr. Ramon V. Leon Wage Data Analysis Yuanlei Zhang 77--7 November, Part : Introduction Data Set The data set contains a random sample of observations

More information

SMAM 314 Exam 42 Name

SMAM 314 Exam 42 Name SMAM 314 Exam 42 Name Mark the following statements True (T) or False (F) (10 points) 1. F A. The line that best fits points whose X and Y values are negatively correlated should have a positive slope.

More information

Stat 501, F. Chiaromonte. Lecture #8

Stat 501, F. Chiaromonte. Lecture #8 Stat 501, F. Chiaromonte Lecture #8 Data set: BEARS.MTW In the minitab example data sets (for description, get into the help option and search for "Data Set Description"). Wild bears were anesthetized,

More information

Correlation and Regression

Correlation and Regression Correlation and Regression Dr. Bob Gee Dean Scott Bonney Professor William G. Journigan American Meridian University 1 Learning Objectives Upon successful completion of this module, the student should

More information

FREC 608 Guided Exercise 9

FREC 608 Guided Exercise 9 FREC 608 Guided Eercise 9 Problem. Model of Average Annual Precipitation An article in Geography (July 980) used regression to predict average annual rainfall levels in California. Data on the following

More information

Mathematics for Economics MA course

Mathematics for Economics MA course Mathematics for Economics MA course Simple Linear Regression Dr. Seetha Bandara Simple Regression Simple linear regression is a statistical method that allows us to summarize and study relationships between

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Multiple linear regression S6

Multiple linear regression S6 Basic medical statistics for clinical and experimental research Multiple linear regression S6 Katarzyna Jóźwiak k.jozwiak@nki.nl November 15, 2017 1/42 Introduction Two main motivations for doing multiple

More information

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities

Bayesian Analysis LEARNING OBJECTIVES. Calculating Revised Probabilities. Calculating Revised Probabilities. Calculating Revised Probabilities Valua%on and pricing (November 5, 2013) LEARNING OBJECTIVES Lecture 7 Decision making (part 3) Regression theory Olivier J. de Jong, LL.M., MM., MBA, CFD, CFFA, AA www.olivierdejong.com 1. List the steps

More information

ECO220Y Simple Regression: Testing the Slope

ECO220Y Simple Regression: Testing the Slope ECO220Y Simple Regression: Testing the Slope Readings: Chapter 18 (Sections 18.3-18.5) Winter 2012 Lecture 19 (Winter 2012) Simple Regression Lecture 19 1 / 32 Simple Regression Model y i = β 0 + β 1 x

More information

Final Exam Bus 320 Spring 2000 Russell

Final Exam Bus 320 Spring 2000 Russell Name Final Exam Bus 320 Spring 2000 Russell Do not turn over this page until you are told to do so. You will have 3 hours minutes to complete this exam. The exam has a total of 100 points and is divided

More information

10. Alternative case influence statistics

10. Alternative case influence statistics 10. Alternative case influence statistics a. Alternative to D i : dffits i (and others) b. Alternative to studres i : externally-studentized residual c. Suggestion: use whatever is convenient with the

More information

Applied Regression Analysis. Section 2: Multiple Linear Regression

Applied Regression Analysis. Section 2: Multiple Linear Regression Applied Regression Analysis Section 2: Multiple Linear Regression 1 The Multiple Regression Model Many problems involve more than one independent variable or factor which affects the dependent or response

More information

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators

x3,..., Multiple Regression β q α, β 1, β 2, β 3,..., β q in the model can all be estimated by least square estimators Multiple Regression Relating a response (dependent, input) y to a set of explanatory (independent, output, predictor) variables x, x 2, x 3,, x q. A technique for modeling the relationship between variables.

More information

Hypothesis testing. Data to decisions

Hypothesis testing. Data to decisions Hypothesis testing Data to decisions The idea Null hypothesis: H 0 : the DGP/population has property P Under the null, a sample statistic has a known distribution If, under that that distribution, the

More information

Topic 10 - Linear Regression

Topic 10 - Linear Regression Topic 10 - Linear Regression Least squares principle Hypothesis tests/confidence intervals/prediction intervals for regression 1 Linear Regression How much should you pay for a house? Would you consider

More information

y n 1 ( x i x )( y y i n 1 i y 2

y n 1 ( x i x )( y y i n 1 i y 2 STP3 Brief Class Notes Instructor: Ela Jackiewicz Chapter Regression and Correlation In this chapter we will explore the relationship between two quantitative variables, X an Y. We will consider n ordered

More information

1 A Review of Correlation and Regression

1 A Review of Correlation and Regression 1 A Review of Correlation and Regression SW, Chapter 12 Suppose we select n = 10 persons from the population of college seniors who plan to take the MCAT exam. Each takes the test, is coached, and then

More information

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore What is Multiple Linear Regression Several independent variables may influence the change in response variable we are trying to study. When several independent variables are included in the equation, the

More information

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal

Business Statistics. Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220. Dr. Mohammad Zainal Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Introduction to Linear Regression and Correlation Analysis QMIS 220 Dr. Mohammad Zainal Chapter Goals After completing

More information

STAT 3A03 Applied Regression With SAS Fall 2017

STAT 3A03 Applied Regression With SAS Fall 2017 STAT 3A03 Applied Regression With SAS Fall 2017 Assignment 2 Solution Set Q. 1 I will add subscripts relating to the question part to the parameters and their estimates as well as the errors and residuals.

More information

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY

EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY EXAMINATIONS OF THE ROYAL STATISTICAL SOCIETY HIGHER CERTIFICATE IN STATISTICS, 2011 MODULE 4 : Linear models Time allowed: One and a half hours Candidates should answer THREE questions. Each question

More information

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question

More information

Ordinary Least Squares Regression Explained: Vartanian

Ordinary Least Squares Regression Explained: Vartanian Ordinary Least Squares Regression Explained: Vartanian When to Use Ordinary Least Squares Regression Analysis A. Variable types. When you have an interval/ratio scale dependent variable.. When your independent

More information

Multiple Regression Methods

Multiple Regression Methods Chapter 1: Multiple Regression Methods Hildebrand, Ott and Gray Basic Statistical Ideas for Managers Second Edition 1 Learning Objectives for Ch. 1 The Multiple Linear Regression Model How to interpret

More information

Simple Linear Regression: A Model for the Mean. Chap 7

Simple Linear Regression: A Model for the Mean. Chap 7 Simple Linear Regression: A Model for the Mean Chap 7 An Intermediate Model (if the groups are defined by values of a numeric variable) Separate Means Model Means fall on a straight line function of the

More information

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box.

(ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. FINAL EXAM ** Two different ways to submit your answer sheet (i) Use MS-Word and place it in a drop-box. (ii) Scan your answer sheets INTO ONE FILE only, and submit it in the drop-box. Deadline: December

More information

Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables

Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables 26.1 S 4 /IEE Application Examples: Multiple Regression An S 4 /IEE project was created to improve the 30,000-footlevel metric

More information

(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.)

(1) The explanatory or predictor variables may be qualitative. (We ll focus on examples where this is the case.) Introduction to Analysis of Variance Analysis of variance models are similar to regression models, in that we re interested in learning about the relationship between a dependent variable (a response)

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

Section 5: Dummy Variables and Interactions

Section 5: Dummy Variables and Interactions Section 5: Dummy Variables and Interactions Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Example: Detecting

More information

ANOVA - analysis of variance - used to compare the means of several populations.

ANOVA - analysis of variance - used to compare the means of several populations. 12.1 One-Way Analysis of Variance ANOVA - analysis of variance - used to compare the means of several populations. Assumptions for One-Way ANOVA: 1. Independent samples are taken using a randomized design.

More information

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist

regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist regression analysis is a type of inferential statistics which tells us whether relationships between two or more variables exist sales $ (y - dependent variable) advertising $ (x - independent variable)

More information

1 A Non-technical Introduction to Regression

1 A Non-technical Introduction to Regression 1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in

More information

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept

Interactions. Interactions. Lectures 1 & 2. Linear Relationships. y = a + bx. Slope. Intercept Interactions Lectures 1 & Regression Sometimes two variables appear related: > smoking and lung cancers > height and weight > years of education and income > engine size and gas mileage > GMAT scores and

More information

Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear

Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear Regression analysis is a tool for building mathematical and statistical models that characterize relationships between variables Finds a linear relationship between: - one independent variable X and -

More information

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis

STAT 3900/4950 MIDTERM TWO Name: Spring, 2015 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis STAT 3900/4950 MIDTERM TWO Name: Spring, 205 (print: first last ) Covered topics: Two-way ANOVA, ANCOVA, SLR, MLR and correlation analysis Instructions: You may use your books, notes, and SPSS/SAS. NO

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

Inferences for linear regression (sections 12.1, 12.2)

Inferences for linear regression (sections 12.1, 12.2) Inferences for linear regression (sections 12.1, 12.2) Regression case history: do bigger national parks help prevent extinction? ex. area of natural reserves and extinction: 6 national parks in Tanzania

More information

determine whether or not this relationship is.

determine whether or not this relationship is. Section 9-1 Correlation A correlation is a between two. The data can be represented by ordered pairs (x,y) where x is the (or ) variable and y is the (or ) variable. There are several types of correlations

More information

Regression Analysis. BUS 735: Business Decision Making and Research

Regression Analysis. BUS 735: Business Decision Making and Research Regression Analysis BUS 735: Business Decision Making and Research 1 Goals and Agenda Goals of this section Specific goals Learn how to detect relationships between ordinal and categorical variables. Learn

More information

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23

2.4.3 Estimatingσ Coefficient of Determination 2.4. ASSESSING THE MODEL 23 2.4. ASSESSING THE MODEL 23 2.4.3 Estimatingσ 2 Note that the sums of squares are functions of the conditional random variables Y i = (Y X = x i ). Hence, the sums of squares are random variables as well.

More information

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises

LINEAR REGRESSION ANALYSIS. MODULE XVI Lecture Exercises LINEAR REGRESSION ANALYSIS MODULE XVI Lecture - 44 Exercises Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur Exercise 1 The following data has been obtained on

More information

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments.

Analysis of Covariance. The following example illustrates a case where the covariate is affected by the treatments. Analysis of Covariance In some experiments, the experimental units (subjects) are nonhomogeneous or there is variation in the experimental conditions that are not due to the treatments. For example, a

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

LI EAR REGRESSIO A D CORRELATIO

LI EAR REGRESSIO A D CORRELATIO CHAPTER 6 LI EAR REGRESSIO A D CORRELATIO Page Contents 6.1 Introduction 10 6. Curve Fitting 10 6.3 Fitting a Simple Linear Regression Line 103 6.4 Linear Correlation Analysis 107 6.5 Spearman s Rank Correlation

More information

Analyzing Lines of Fit

Analyzing Lines of Fit 4.5 Analyzing Lines of Fit Essential Question How can you analytically find a line of best fit for a scatter plot? Finding a Line of Best Fit Work with a partner. The scatter plot shows the median ages

More information

In order to carry out a study on employees wages, a company collects information from its 500 employees 1 as follows:

In order to carry out a study on employees wages, a company collects information from its 500 employees 1 as follows: INTRODUCTORY ECONOMETRICS Dpt of Econometrics & Statistics (EA3) University of the Basque Country UPV/EHU OCW Self Evaluation answers Time: 21/2 hours SURNAME: NAME: ID#: Specific competences to be evaluated

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Chapter 14 Simple Linear Regression (A)

Chapter 14 Simple Linear Regression (A) Chapter 14 Simple Linear Regression (A) 1. Characteristics Managerial decisions often are based on the relationship between two or more variables. can be used to develop an equation showing how the variables

More information

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS

CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS CHAPTER 5 FUNCTIONAL FORMS OF REGRESSION MODELS QUESTIONS 5.1. (a) In a log-log model the dependent and all explanatory variables are in the logarithmic form. (b) In the log-lin model the dependent variable

More information

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6

STA 108 Applied Linear Models: Regression Analysis Spring Solution for Homework #6 STA 8 Applied Linear Models: Regression Analysis Spring 011 Solution for Homework #6 6. a) = 11 1 31 41 51 1 3 4 5 11 1 31 41 51 β = β1 β β 3 b) = 1 1 1 1 1 11 1 31 41 51 1 3 4 5 β = β 0 β1 β 6.15 a) Stem-and-leaf

More information

Conditions for Regression Inference:

Conditions for Regression Inference: AP Statistics Chapter Notes. Inference for Linear Regression We can fit a least-squares line to any data relating two quantitative variables, but the results are useful only if the scatterplot shows a

More information

12.12 MODEL BUILDING, AND THE EFFECTS OF MULTICOLLINEARITY (OPTIONAL)

12.12 MODEL BUILDING, AND THE EFFECTS OF MULTICOLLINEARITY (OPTIONAL) 12.12 Model Building, and the Effects of Multicollinearity (Optional) 1 Although Excel and MegaStat are emphasized in Business Statistics in Practice, Second Canadian Edition, some examples in the additional

More information

Two-Variable Analysis: Simple Linear Regression/ Correlation

Two-Variable Analysis: Simple Linear Regression/ Correlation Two-Variable Analysis: Simple Linear Regression/ Correlation 1 Topics I. Scatter Plot (X-Y Graph) II. III. Simple Linear Regression Correlation, R IV. Assessing Model Accuracy, R 2 V. Regression Abuses

More information

STAT 7030: Categorical Data Analysis

STAT 7030: Categorical Data Analysis STAT 7030: Categorical Data Analysis 5. Logistic Regression Peng Zeng Department of Mathematics and Statistics Auburn University Fall 2012 Peng Zeng (Auburn University) STAT 7030 Lecture Notes Fall 2012

More information

This document contains 3 sets of practice problems.

This document contains 3 sets of practice problems. P RACTICE PROBLEMS This document contains 3 sets of practice problems. Correlation: 3 problems Regression: 4 problems ANOVA: 8 problems You should print a copy of these practice problems and bring them

More information