Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Similar documents
Chapter 15 Student Lecture Notes 15-1

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Chapter 13: Multiple Regression

Statistics for Economics & Business

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Statistics for Business and Economics

Basic Business Statistics, 10/e

Statistics II Final Exam 26/6/18

Chapter 11: Simple Linear Regression and Correlation

Chapter 15 - Multiple Regression

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Chapter 14 Simple Linear Regression

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

x i1 =1 for all i (the constant ).

STATISTICS QUESTIONS. Step by Step Solutions.

Comparison of Regression Lines

Lecture 6: Introduction to Linear Regression

Economics 130. Lecture 4 Simple Linear Regression Continued

Negative Binomial Regression

Biostatistics 360 F&t Tests and Intervals in Regression 1

Correlation and Regression

Scatter Plot x

Introduction to Regression

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Statistics MINITAB - Lab 2

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

NANYANG TECHNOLOGICAL UNIVERSITY SEMESTER I EXAMINATION MTH352/MH3510 Regression Analysis

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Topic 7: Analysis of Variance

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Chapter 8 Indicator Variables

Dummy variables in multiple variable regression model

Polynomial Regression Models

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

is the calculated value of the dependent variable at point i. The best parameters have values that minimize the squares of the errors

STAT 3340 Assignment 1 solutions. 1. Find the equation of the line which passes through the points (1,1) and (4,5).

Chapter 10. What is Regression Analysis? Simple Linear Regression Analysis. Examples

4 Analysis of Variance (ANOVA) 5 ANOVA. 5.1 Introduction. 5.2 Fixed Effects ANOVA

Regression. The Simple Linear Regression Model

/ n ) are compared. The logic is: if the two

Linear Regression Analysis: Terminology and Notation

where I = (n x n) diagonal identity matrix with diagonal elements = 1 and off-diagonal elements = 0; and σ 2 e = variance of (Y X).

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

18. SIMPLE LINEAR REGRESSION III

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

a. (All your answers should be in the letter!

CHAPTER 6 GOODNESS OF FIT AND CONTINGENCY TABLE PREPARED BY: DR SITI ZANARIAH SATARI & FARAHANIM MISNI

28. SIMPLE LINEAR REGRESSION III

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

17 - LINEAR REGRESSION II

January Examinations 2015

PubH 7405: REGRESSION ANALYSIS. SLR: INFERENCES, Part II

STAT 3008 Applied Regression Analysis

Learning Objectives for Chapter 11

Chapter 14 Simple Linear Regression Page 1. Introduction to regression analysis 14-2

Chapter 9: Statistical Inference and the Relationship between Two Variables

x = , so that calculated

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Regression Analysis. Regression Analysis

β0 + β1xi and want to estimate the unknown

Tests of Single Linear Coefficient Restrictions: t-tests and F-tests. 1. Basic Rules. 2. Testing Single Linear Coefficient Restrictions

experimenteel en correlationeel onderzoek

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Professor Chris Murray. Midterm Exam

Lecture 16 Statistical Analysis in Biomaterials Research (Part II)

Answers Problem Set 2 Chem 314A Williamsen Spring 2000

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

Continuous vs. Discrete Goods

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

Soc 3811 Basic Social Statistics Third Midterm Exam Spring 2010

CHAPTER 8. Exercise Solutions

Statistics Chapter 4

BOOTSTRAP METHOD FOR TESTING OF EQUALITY OF SEVERAL MEANS. M. Krishna Reddy, B. Naveen Kumar and Y. Ramu

The SAS program I used to obtain the analyses for my answers is given below.

REGRESSION ANALYSIS II- MULTICOLLINEARITY

Lecture 6 More on Complete Randomized Block Design (RBD)

Chapter 16 Student Lecture Notes 16-1

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Introduction to Analysis of Variance (ANOVA) Part 1

STAT 511 FINAL EXAM NAME Spring 2001

University of California at Berkeley Fall Introductory Applied Econometrics Final examination

Lecture 3 Stat102, Spring 2007

( )( ) [ ] [ ] ( ) 1 = [ ] = ( ) 1. H = X X X X is called the hat matrix ( it puts the hats on the Y s) and is of order n n H = X X X X.

STAT 405 BIOSTATISTICS (Fall 2016) Handout 15 Introduction to Logistic Regression

Outline. EEC 686/785 Modeling & Performance Evaluation of Computer Systems. Lecture k r Factorial Designs with Replication

Reduced slides. Introduction to Analysis of Variance (ANOVA) Part 1. Single factor

Chapter 5 Multilevel Models

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

Transcription:

Statstcs for Managers Usng Mcrosoft Excel/SPSS Chapter 14 Multple Regresson Models 1999 Prentce-Hall, Inc. Chap. 14-1

Chapter Topcs The Multple Regresson Model Contrbuton of Indvdual Independent Varables Coeffcent of Determnaton Categorcal Explanatory Varables Transformaton of Varables Model Buldng 1999 Prentce-Hall, Inc. Chap. 14-2

Populaton Y-ntercept The Multple Regresson Model Relatonshp between 1 dependent & 2 or more ndependent varables s a lnear functon Populaton slopes Random Error Y X X 0 1 1 2 2 p X p Ŷ b b X b X 0 1 1 2 2 b p X p e Dependent (Response) varable for sample Independent (Explanatory) varables for sample model 1999 Prentce-Hall, Inc. Chap. 14-3

Sample Multple Regresson Model Y Y b0 b1 X1 b2x 2 b p X p e e X 2 X 1 Ŷ b b X b X 0 1 1 2 2 b p X p 1999 Prentce-Hall, Inc. Chap. 14-4

Multple Regresson Model: Example Develop a model for estmatng heatng ol used for a sngle famly home n the month of January based on average temperature and amount of nsulaton n nches. O l (G a l) T e m p ( 0 F) In su la to n 275.30 40 3 363.80 27 3 164.30 40 10 40.80 73 6 94.30 64 6 230.90 34 6 366.70 9 6 300.60 8 10 237.80 23 10 121.40 63 3 31.40 65 10 203.50 41 6 441.10 21 3 323.00 38 3 52.50 58 10 1999 Prentce-Hall, Inc. Chap. 14-5

Sample Regresson Model: Example Ŷ b b X b X 0 1 1 2 2 Excel Output b p X C o e ffc e n ts I n te r c e p t 5 6 2. 1 5 1 0 0 9 2 X V a r a b l e 1-5. 4 3 6 5 8 0 5 8 8 X V a r a b l e 2-2 0. 0 1 2 3 2 0 6 7 p Ŷ 562. 151 5. 437X 20. 012X 1 2 For each degree ncrease n temperature, the average amount of For each ncrease n one nch heatng ol used s decreased by of nsulaton, the use of heatng 5.437 gallons, holdng nsulaton ol s decreased by 20.012 constant. gallons, holdng temperature constant. 1999 Prentce-Hall, Inc. Chap. 14-6

Usng The Model to Make Predctons Estmate the average amount of heatng ol used for a home f the average temperature s 30 0 and the nsulaton s 6 nches. Ŷ 562. 151 5. 437X 20. 012X 1 2 562. 151 5. 437 30 20. 012 6 278. 969 The estmated heatng ol used s 278.97 gallons 1999 Prentce-Hall, Inc. Chap. 14-7

Coeffcent of Multple Determnaton Excel Output R eg resso n S tatstcs M u lt p le R 0. 9 8 2 6 5 4 7 5 7 R S q u a re 0. 9 6 5 6 1 0 3 7 1 A d ju s t e d R S q u a re 0. 9 5 9 8 7 8 7 6 6 S t a n d a rd E rro r 2 6. 0 1 3 7 8 3 2 3 O b s e rva t o n s 15 2 r Y, 12 Adjusted r 2 SSR SST reflects the number of explanatory varables and sample sze s smaller than r 2 1999 Prentce-Hall, Inc. Chap. 14-8

Resdual Plots Resduals Vs Y May need to transform Y varable Resduals Vs X 1 May need to transform X 1 varable Resduals Vs X 2 May need to transform X 2 varable Resduals Vs Tme May have autocorrelaton 1999 Prentce-Hall, Inc. Chap. 14-9

Resduals Resdual Plots: Example 60 Tem perature R esdual P lot Excel Output 40 20 Insulaton R esdual P lot 0-2 0 0 20 40 60 80-4 0-6 0 No Dscernable Pattern 0 2 4 6 8 10 12 1999 Prentce-Hall, Inc. Chap. 14-10

Testng for Overall Sgnfcance Shows f there s a lnear relatonshp between all of the X varables together and Y Use F test Statstc Hypotheses: H 0 : 1 = 2 = = p = 0 (No lnear relatonshp) H 1 : At least one 0 ( At least one ndependent varable affects Y) 1999 Prentce-Hall, Inc. Chap. 14-11

Test for Overall Sgnfcance Excel Output: Example A N O V A df SS MS F S gnfcance F R e g re sso n 2 2 2 8 0 1 4.6 1 1 4 0 0 7.3 1 6 8.4 7 1 2 0 2 8 1.6 5 4 1 1 E -0 9 R e sd u a l 12 8 1 2 0.6 0 3 6 7 6.7 1 6 9 T o ta l 14 2 3 6 1 3 5.2 p = 2, the number of explanatory varables n - 1 p value MRS MSE = F Test Statstc 1999 Prentce-Hall, Inc. Chap. 14-12

Test for Overall Sgnfcance Example Soluton H 0 : 1 = 2 = = p = 0 H 1 : At least one I 0 a =.05 df = 2 and 12 Crtcal Value(s): a = 0.05 0 3.89 F Test Statstc: F 168.47 (Excel Output) Decson: Reject at a = 0.05 Concluson: There s evdence that At least one ndependent varable affects Y 1999 Prentce-Hall, Inc. Chap. 14-13

Test for Sgnfcance: Indvdual Varables Shows f there s a lnear relatonshp between the varable X and Y Use t test Statstc Hypotheses: H 0 : = 0 (No lnear relatonshp) H 1 : 0 (Lnear relatonshp between X and Y) 1999 Prentce-Hall, Inc. Chap. 14-14

t Test Statstc Excel Output: Example C o e ffc e n ts S ta n d a rd E rro r t Test Statstc for X 1 (Temperature) t S ta t I n te r c e p t 5 6 2. 1 5 1 0 0 9 2 1. 0 9 3 1 0 4 3 3 2 6. 6 5 0 9 4 X V a r a b l e 1-5. 4 3 6 5 8 0 6 0. 3 3 6 2 1 6 1 6 7-1 6. 1 6 9 9 X V a r a b l e 2-2 0. 0 1 2 3 2 1 2. 3 4 2 5 0 5 2 2 7-8. 5 4 3 1 3 t Test Statstc for X 2 (Insulaton) 1999 Prentce-Hall, Inc. Chap. 14-15

t Test : Example Soluton Does temperature have a sgnfcant effect on monthly consumpton of heatng ol? Test at a = 0.05. H 0 : 1 = 0 H 1 : 1 0 df = 12 Crtcal Value(s): Reject H 0 Reject H 0.025.025-2.1788 0 2.1788 Z Test Statstc: t Test Statstc = -16.1699 Decson: Reject H 0 at a = 0.05 Concluson: There s evdence of a sgnfcant effect of temperature on ol consumpton. 1999 Prentce-Hall, Inc. Chap. 14-16

Confdence Interval Estmate For The Slope Provde the 95% confdence nterval for the populaton slope 1 (the effect of temperature on ol consumpton). b t n S 1 p1 b 1 Coeffcents Lower 95% Upper 95% Inte rce pt 562.151009 516.1930837 608.108935 X V a ra ble 1-5.4365806-6.169132673-4.7040285 X V a ra ble 2-20.012321-25.11620102-14.90844-6.169 1-4.704 The average consumpton of ol s reduced by between 4.7 gallons to 6.17 gallons per each ncrease of 1 0 F. 1999 Prentce-Hall, Inc. Chap. 14-17

Testng Portons of Model Contrbuton of One X to Model (holdng all others constant) Denote by SSR(X all varables except ) 2 r Y 1.2 = Coeffcent of partal determnaton of X 1 wth Y holdng X 2 constant Evaluate Separate Models Useful n Selectng Independent Varables 1999 Prentce-Hall, Inc. Chap. 14-18

Testng Portons of Model: SSR Contrbuton of X 1 gven X 2 has been ncluded: SSR(X 1 X 2 ) = SSR(X 1 and X 2 ) - SSR(X 2 ) From ANOVA secton of regresson for Ŷ b0 b1 X1 b2x 2 From ANOVA secton of regresson for Ŷ b0 b2x 2 1999 Prentce-Hall, Inc. Chap. 14-19

Partal F Test For Contrbuton of X Hypotheses: H 0 : Varable X does not sgnfcantly mprove the model gven all others ncluded H 1 : Varable X sgnfcantly mproves the model gven all others ncluded Test Statstc: F = Wth df = 1 and (n - p -1) SSR( X all others ) MSE 1999 Prentce-Hall, Inc. Chap. 14-20

Coeffcent of Partal Determnaton 2 SSR( X1 X 2 ) Y 1 2 SST SSR( X1 and X 2 ) SSR( X1 X 2 r. ) From ANOVA secton of regresson for Ŷ b0 b1 X1 b2x 2 From ANOVA secton of regresson for Ŷ b0 b2x 2 1999 Prentce-Hall, Inc. Chap. 14-21

Testng Portons of Model: Example Test at the a =.05 level to determne f the varable of average temperature sgnfcantly mproves the model gven that nsulaton s ncluded. 1999 Prentce-Hall, Inc. Chap. 14-22

Testng Portons of Model: Example H 0 : X 1 does not mprove model (X 2 ncluded) H 1 : X 1 does mprove model A N O V A F SSR( a =.05, df = 1 and 12 Crtcal Value = 4.75 (For X 1 and X 2 ) A N O V A (For X 2 ) SS SS X 1 MSE X 2 ) MS R e g re sso n 2 2 8 0 1 4.6 2 6 3 1 1 4 0 0 7.3 1 3 R e sd u a l 8 1 2 0.6 0 3 0 1 6 6 7 6.7 1 6 9 1 8 T o ta l 2 3 6 1 3 5.2 2 9 3 R e g r e s s o n 5 1 0 7 6. 4 7 R e s d u a l 1 8 5 0 5 8. 8 T o t a l 2 3 6 1 3 5. 2 228, 015 51, 076 676, 717 Concluson: Reject H 0. X 1 does mprove model = 261.47 1999 Prentce-Hall, Inc. Chap. 14-23

Curvlnear Regresson Model Relatonshp between 1 response varable and 2 or more explanatory varable s a polynomal functon Useful when scatter dagram ndcates non-lnear relatonshp Curvlnear model: Y 0 X 1 The second explanatory varable s the square of the 1st. 1999 Prentce-Hall, Inc. Chap. 14-24 1 X 2 2 1

Curvlnear Regresson Model Curvlnear models may be consdered when scatter dagram takes on the followng shapes: Y Y Y Y X 1 X 1 2 > 0 2 > 0 2 < 0 2 < 0 X 1 X 1 2 = the coeffcent of the quadratc term 1999 Prentce-Hall, Inc. Chap. 14-25

Testng for Sgnfcance: Curvlnear Model Testng for Overall Relatonshp Smlar to test for lnear model F test statstc = MSR MSE Testng the Curvlnear Effect Compare curvlnear model 2 Y X X 0 1 wth the lnear model Y X 1 0 1 1 2 1 1999 Prentce-Hall, Inc. Chap. 14-26

Dummy-Varable Models Categorcal Varable Involved (dummy varable) wth 2 Levels: yes or no, on or off, male or female, Coded 0 or 1 Intercepts Dfferent Assumes Equal Slopes Regresson Model has Same Form: Y X X X 0 1 1 2 2 p p 1999 Prentce-Hall, Inc. Chap. 14-27

Dummy-Varable Models Assumpton Gven: Ŷ b0 b1x 1 b2x 2 Y = Assessed Value of House X 1 = Square footage of House X 2 = Desrablty of Neghborhood = Desrable (X 2 = 1) Ŷ b0 b1x 1 b2( 1) ( b0 b2 ) b1x 1 Undesrable (X 2 = 0) Ŷ b0 b1x 1 b2( 0) b0 b1x 1 0 f undesrable 1 f desrable Same slopes 1999 Prentce-Hall, Inc. Chap. 14-28

Dummy-Varable Models Assumpton Y (Assessed Value) Intercepts dfferent b 0 + b 2 b 0 Same slopes X 1 (Square footage) 1999 Prentce-Hall, Inc. Chap. 14-29

Evaluatng Presence of Interacton Hypothesze Interacton Between Pars of Independent Varables Contans 2-way Product Terms Y 0 1X1 2X 2 3X1 X 2 Hypotheses: H 0 : 3 = 0 (No nteracton between X 1 and X 2 H 1 : 3 0 (X 1 nteracts wth X 2 ) 1999 Prentce-Hall, Inc. Chap. 14-30

Usng Transformatons For Non-lnear Models that Volate Lnear Regresson Assumptons Determne Type of Transformaton From Scatter Dagram Requres Data Transformaton Ether or Both Independent and Dependent Varables May be Transformed 1999 Prentce-Hall, Inc. Chap. 14-31

Square Root Transformaton Y X X 0 1 1 2 2 Y 1 > 0 1 < 0 X 1 Smlarly for X 2 Transforms one of above model to one that appears lnear. Often used to overcome heteroscedastcty. 1999 Prentce-Hall, Inc. Chap. 14-32

Logarthmc Transformaton Y ln( X ) 0 1 1 2 ln( X 2 ) Y 1 > 0 Smlarly for X 2 1 < 0 X 1 Transformed from an orgnal multplcatve model 1999 Prentce-Hall, Inc. Chap. 14-33

Exponental Transformaton Orgnal Model Y e 0 1X 1 2X 2 Y 1 > 0 Smlarly for X 2 1 < 0 X 1 Transformed nto: lny 0 1X1 2X 2 ln 1 1999 Prentce-Hall, Inc. Chap. 14-34

Collnearty Hgh Correlaton Between Explanatory Varables Coeffcents Measure Combned Effect No New Informaton Provded Leads to Unstable Coeffcents Dependng on the explanatory varables VIF Used to Measure Collnearty VIF 1 j 1 R 2 j, 1999 Prentce-Hall, Inc. Chap. 14-35 2 j R = Coeffcent of Multple Determnaton of X j wth all the others

Model Buldng Goal s to Develop Model wth Fewest Explanatory Varables Easer to nterpret Lower probablty of collnearty Stepwse Regresson Procedure Provde lmted evaluaton of alternatve models Best-Subset Approach Uses the C p Statstc Selects model wth small C p near p+1 1999 Prentce-Hall, Inc. Chap. 14-36

Model Buldng Flowchart Choose X 1,X 2, X k Run Regresson to fnd VIFs Any VIF>5? No Run Subsets Regresson to Obtan best models n terms of C p Remove Varable wth Hghest VIF Yes Yes More than One? No Remove ths X Do Complete Analyss Add Curvlnear Term and/or Transform Varables as Indcated Perform Predctons 1999 Prentce-Hall, Inc. Chap. 14-37

Chapter Summary Presented The Multple Regresson Model Consdered Contrbuton of Indvdual Independent Varables Dscussed Coeffcent of Determnaton Addressed Categorcal Explanatory Varables Consdered Transformaton of Varables Dscussed Model Buldng 1999 Prentce-Hall, Inc. Chap. 14-38