Econometrics. 5) Dummy variables

Size: px
Start display at page:

Download "Econometrics. 5) Dummy variables"

Transcription

1 30C00200 Econometrics 5) Dummy variables Timo Kuosmanen Professor, Ph.D.

2 Today s topics Qualitative factors as explanatory variables Binary qualitative factors Dummy variables and their interpretation Categorical variables Dummy variable trap Modeling ordinal scale data by dummy variables Slope dummies and their interpretation F-test of joint significance

3 Quantitative variables Thus far, our regressors (x) have been quantitative, interval or ratio-scale variables Examples Size of apartment in m 2 Year of construction Number of bedrooms

4 Qualitative factors In many cases, the dependent variable is influenced by qualitative factors Examples: Is there elevator in the building? Yes / no Is there sauna in the apartment? Yes / no Is there a balcony in the apartment? Yes / no

5 Dummy variables The qualitative factors can be coded numerically by using binary {0,1} numbers Example: Elevator = {1 if there is elevator; 0 if not} Sauna = {1 if there is sauna; 0 if not} Balcony = {1 if there is balcony; 0 if not} Such binary variables can be directly inserted to the regression model as explanatory variables (x), referred to as dummy variables.

6 Interpretation of dummy variables Example: Hedonic model with Elevator dummy: Price = β 1 + β 2 Size + + δ Elevator + ε i δ is the effect of elevator on price For apartments without elevator, the intercept is β 1 For apartments with elevator, the intercept is β 1 + δ

7 Example: hedonic model with elevator dummy SUMMARY OUTPUT Regression Statistics Multiple R 0,90764 R Square 0,82381 Adjusted R Square 0, Standard Error 80551,73 Observations 67 ANOVA df SS MS F Significance F Regression 4 1,88E+12 4,7E+11 72, ,12E-22 Residual 62 4,02E+11 6,49E+09 Total 66 2,28E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept , ,42 2, , , size m2 7023,05 858,5364 8, ,93E , ,24 nr. Bedrooms , ,87-3, , ,2 Age -2553,42 558,2156-4, ,34E , ,57 Elevator 24616, ,74 1, , , ,46

8 Categorical variables Some qualitative factors have more than two possible values Example: Districts in Espoo Tapiola Espoonlahti Kivenlahti Leppävaara Olari Such qualitative factors with many categories can be coded numerically by using multiple dummy variables: D1 = {1 if district=tapiola; 0 otherwise} D2 = {1 if district=espoonlahti; 0 otherwise} D3 = {1 if district=kivenlahti; 0 otherwise}

9 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53206,42 Observations 642 Example: hedonic model with a larger sample ANOVA df SS MS F Significance F Regression 14 5,67E+12 4,05E ,0872 1,7E-184 Residual 627 1,77E+12 2,83E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 85388, ,18 4, ,8E , Size m2 4137, , , ,52E , ,29 Bedrooms ,7 5323,126-6,1794 1,16E ,4 Age -2584,75 151, ,096 4,51E , ,85 Elevator 6216, ,772 1, , , ,94 1st floor -9137, ,723-1, , ,7 2187,183 Top floor 9576, ,709 1, , , ,19 Energy class A-C 33899, ,676 3, , , ,58 Condition satisfactory -2680, ,14-0, , , ,98 Condition good 18120, ,51 1, , , ,51 Loc. Espoonlahti , ,91-1, , ,5 6219,35 Loc. Kivenlahti ,2 8488,473-2, , ,5-2072,91 Loc. Leppävaara -4761, ,203-0, , ,7 7915,191 Loc. Olari -714, ,031-0, , , ,94 Loc. Tapiola ,6 7110,665 17, ,58E ,2

10 Categorical variables When modeling qualitative factors with q different categories, we need to introduce q-1 dummy variables, and leave one level as a reference category, represented by the common intercept β 1 Other effects are expressed relative to the reference category D1 = {1 if district=tapiola; 0 otherwise} D2 = {1 if district=espoonlahti; 0 otherwise} D3 = {1 if district=kivenlahti; 0 otherwise} Price = β 1 + β 2 Size + + δ 1 D1 + δ 2 D2 + δ 3 D3 + ε i Intercept β 1 represent the regression equation for other districts of Espoo Parameter δ 1 captures the effect of Tapiola district

11 Dummy variable trap What if we included separate dummy for all categories? Example: D1 = {1 if district=tapiola; 0 otherwise} D2 = {1 if district=espoonlahti; 0 otherwise} D3 = {1 if district=kivenlahti; 0 otherwise} D6 = {1 if district= other ; 0 otherwise} Price = β 1 + β 2 Size + + δ 1 D1 + δ 2 D2 + δ 3 D3 + + δ 6 D6 + ε i But then, the effect of other districts is β 1 + δ 6 We could freely increase β 1 if we simultaneously decrease all δ parameters. Hence, the model is unidentified: the problem is referred to as the dummy variable trap.

12 Ordinal variables We can similarly use dummy variables for modeling ordinal variables. Example: condition of apartment 1) weak 2) satisfactory 3) good Variables measured on ordinal scale can be modeled using dummy variables, taking one of the levels (e.g. weak) as the reference category.

13 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53206,42 Observations 642 Example: hedonic model with a larger sample ANOVA df SS MS F Significance F Regression 14 5,67E+12 4,05E ,0872 1,7E-184 Residual 627 1,77E+12 2,83E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 85388, ,18 4, ,8E , Size m2 4137, , , ,52E , ,29 Bedrooms ,7 5323,126-6,1794 1,16E ,4 Age -2584,75 151, ,096 4,51E , ,85 Elevator 6216, ,772 1, , , ,94 1st floor -9137, ,723-1, , ,7 2187,183 Top floor 9576, ,709 1, , , ,19 Energy class A-C 33899, ,676 3, , , ,58 Condition satisfactory -2680, ,14-0, , , ,98 Condition good 18120, ,51 1, , , ,51 Loc. Espoonlahti , ,91-1, , ,5 6219,35 Loc. Kivenlahti ,2 8488,473-2, , ,5-2072,91 Loc. Leppävaara -4761, ,203-0, , ,7 7915,191 Loc. Olari -714, ,031-0, , , ,94 Loc. Tapiola ,6 7110,665 17, ,58E ,2

14 Quantitative dummy variables? Sometimes quantitative variables are modelled as dummy variables as well. Example: number of bedrooms in the hedonic model We implictly assumed that the change from 1 room apartment to 2 room apartment has the same effect on price as the change from 2 rooms to 3 rooms, etc. To capture the differences, we could use dummy variables: Assume the reference category is 3 bedrooms or more D1 = {1 if apartment has 1 bedroom; 0 otherwise} D2 = {1 if apartment has 2 bedrooms; 0 otherwise}

15 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53524,35 Observations 642 ANOVA df SS MS F Significance F Regression 15 5,65E+12 3,77E ,5378 5E-182 Residual 626 1,79E+12 2,86E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 23887, ,41 1, , , ,68 Size m2 3560, , , ,27E , ,866 1Bedroom 61666, ,29 5, ,18E , ,16 2Bedroom 18828, , , , ,58 Age -2643,66 153, ,257 7,07E ,5-2342,83 Elevator 7326, ,629 1, , , ,46 1st floor ,9 5827,307-2, , ,3-765,43 Top floor 7664, ,457 1, , , ,48 Energy class A-C 36051, ,075 3, , , ,35 Condition satisfactory -1955, ,55-0, , , ,75 Condition good 21884, ,75 1, , , ,63 Loc. Espoonlahti , ,76-1, , ,7 2839,068 Loc. Kivenlahti ,9 8549,151-2, , ,3-2548,37 Loc. Leppävaara -2237, ,609-0, , , ,86 Loc. Olari -352, ,148-0,0437 0, , ,36 Loc. Tapiola ,8 7170,01 17, ,46E ,

16 Slope dummies Thus far, we have assumed the dummy variables shift the regression line via the intercept, but do not affect the slopes. Dummy variables can also be used for modeling the effect on the slopes of quantitative variables (slope dummies). Example: the number of bedrooms can influence the marginal value of size Price = β 1 + β 2 Size + β 3 D1 Size + β 4 D2 Size + + ε i D1 = {1 if apartment has 1 bedroom; 0 otherwise} D2 = {1 if apartment has 2 bedrooms; 0 otherwise}

17 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53914,47 Observations 642 ANOVA df SS MS F Significance F Regression 15 5,63E+12 3,75E ,0393 4,6E-180 Residual 626 1,82E+12 2,91E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 47765, ,87 2, , , ,99 Size m2 3279, , , ,02E , ,91 Size X 1Bedroom 1295, ,891 4, ,53E , ,825 Size x 2Bedroom 143, ,5412 1, , , ,9625 Age -2626,49 153, ,0579 7,27E , ,12 Elevator 7783, ,02 1, , , ,83 1st floor ,3 5868,643-1,9615 0, ,27122 Top floor 7934, ,085 1, , , ,71 Energy class A-C 37274, ,31 3, , , ,18 Condition satisfactory -1648, ,8-0, , , ,82 Condition good 21748, ,1 1, , , ,34 Loc. Espoonlahti , ,61-1, , ,2 2368,139 Loc. Kivenlahti ,4 8612,746-2, , ,7-1879,98 Loc. Leppävaara -2373, ,595-0, , , ,3 Loc. Olari -116, ,084-0, , , ,15 Loc. Tapiola ,1 7201,74 17, ,17E , ,6

18 Dummies or separate regressions? Consider a model of single regressor x and a single dummy D: y = β 1 + β 2 x + δ D + γ D x + ε where δ is the coefficient of intercept dummy and γ is the coefficient of slope dummy. The model could be equivalently estimated in parts: split the sample in two subsamples (D=0, D=1), and run two separate regressions: For D=0: For D=1 y = β 1 + β 2 x + ε y = β 3 + β 4 x + ε [Note: β 3 = β 1 + δ; β 4 = β 2 + γ]

19 Dummies or separate regressions? Partitioning subsamples (D=0, D=1) to separate regression models is easy to implement, especially if there are many regressors. Pooling the sub-samples (D=0, D=1) in the same model enables us to test for significance of coefficients δ and γ by t-test. We can use the F-test for testing joint significance of the parameters δ and γ (see the next lecture)

20 Qualitative factors as dependent variables It is also possible to model binary {0,1} variables and ordinary variables as dependent variables. For example, regress variable y = elevator dummy on explanatory variables x (e.g., age, number of floors, etc.) Such models can be estimated using Probit and Logit approaches (based on maximum likelihood).

21 F-test of joint significance Statistical significance of the parameters β 1, β 2 can be tested by using t-test (one- and two-sided tests) p-value (two-sided tests only) confidence intervals (two-sided tests, 95% significance) In the multiple regression model, it is possible that some of the regressors are significant while others are not It is useful to test if the model as a whole is significant

22 SUMMARY OUTPUT Regression Statistics Multiple R 0, R Square 0, Adjusted R Square 0, Standard Error 53206,42 Observations 642 ANOVA df SS MS F Significance F Regression 14 5,67E+12 4,05E ,0872 1,7E-184 Residual 627 1,77E+12 2,83E+09 Total 641 7,45E+12 Coefficients Standard Error t Stat P-value Lower 95% Upper 95% Intercept 85388, ,18 4, ,8E , Size m2 4137, , , ,52E , ,29 Bedrooms ,7 5323,126-6,1794 1,16E ,4 Age -2584,75 151, ,096 4,51E , ,85 Elevator 6216, ,772 1, , , ,94 1st floor -9137, ,723-1, , ,7 2187,183 Top floor 9576, ,709 1, , , ,19 Energy class A-C 33899, ,676 3, , , ,58 Condition Satisfactory -2680, ,14-0, , , ,98 Condition Good 18120, ,51 1, , , ,51 Loc. Espoonlahti , ,91-1, , ,5 6219,35 Loc. Kivenlahti ,2 8488,473-2, , ,5-2072,91 Loc. Leppävaara -4761, ,203-0, , ,7 7915,191 Loc. Olari -714, ,031-0, , , ,94 22 Loc. Tapiola ,6 7110,665 17, ,58E ,2

23 F-test of joint significance Multiple regression model: y = β 1 + β 2 x 2 + β 3 x β K x K + ε H 0 : β 2 = β 3 = = β K =0 (true model: y = β 1 + ε) H 1 : β k 0 for some k = 2,,K Note: in the special case of a single regression model: y = β 1 + β 2 x 2 + ε H 0 : β 2 = 0 H 1 : β 2 0

24 F-test of joint significance Empirical fit of the estimated model measured by the coefficient of determination (R 2 ) When H 0 (y = β 1 + ε) is true, R 2 is equal to zero. Is R 2 of the estimated model significantly greater than zero?

25 Test statistic: F-test of joint significance ESS K R K F RSS n K R n K 2 / ( 1) / ( 1) 2 / ( ) (1 ) / ( ) If H0 holds, then the test statistic follows the F-distribution with (K-1) and (n-k) degrees of freedom: F[(K-1), (n-k)] In Excel: =FINV(prob.; df1; df2)

26 F-test of joint significance Plots of the density function of F distribution at different values of df

27 F-test of joint significance Critical values of the F distribution at different sample sizes n, single regressor case, 5% significance level: n critical F 20 4, , , , , , , ,843

28 F-test of joint significance In the single regression case, we could test for the hypothesis H 0 : β 2 = 0 using the F-test or the t-test. Which one is more powerful? In fact, the two tests are equivalent (See 2.12 for details) F 2 R / ( K 1) ( b ) (1 R ) / ( n K) s. e.( b ) t 2 Also the critical values of F-distribution are obtained as the squared values of the critical t

29 Next time Wed 7 March Topic: Model specification

Econometrics. 7) Endogeneity

Econometrics. 7) Endogeneity 30C00200 Econometrics 7) Endogeneity Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Common types of endogeneity Simultaneity Omitted variables Measurement errors

More information

Econometrics. 4) Statistical inference

Econometrics. 4) Statistical inference 30C00200 Econometrics 4) Statistical inference Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Confidence intervals of parameter estimates Student s t-distribution

More information

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics

MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES. Business Statistics MULTIPLE REGRESSION ANALYSIS AND OTHER ISSUES Business Statistics CONTENTS Multiple regression Dummy regressors Assumptions of regression analysis Predicting with regression analysis Old exam question

More information

Categorical Predictor Variables

Categorical Predictor Variables Categorical Predictor Variables We often wish to use categorical (or qualitative) variables as covariates in a regression model. For binary variables (taking on only 2 values, e.g. sex), it is relatively

More information

Chapter 14 Student Lecture Notes 14-1

Chapter 14 Student Lecture Notes 14-1 Chapter 14 Student Lecture Notes 14-1 Business Statistics: A Decision-Making Approach 6 th Edition Chapter 14 Multiple Regression Analysis and Model Building Chap 14-1 Chapter Goals After completing this

More information

CHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics

CHAPTER 4 & 5 Linear Regression with One Regressor. Kazu Matsuda IBEC PHBU 430 Econometrics CHAPTER 4 & 5 Linear Regression with One Regressor Kazu Matsuda IBEC PHBU 430 Econometrics Introduction Simple linear regression model = Linear model with one independent variable. y = dependent variable

More information

Basic Business Statistics, 10/e

Basic Business Statistics, 10/e Chapter 4 4- Basic Business Statistics th Edition Chapter 4 Introduction to Multiple Regression Basic Business Statistics, e 9 Prentice-Hall, Inc. Chap 4- Learning Objectives In this chapter, you learn:

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

ECON 4230 Intermediate Econometric Theory Exam

ECON 4230 Intermediate Econometric Theory Exam ECON 4230 Intermediate Econometric Theory Exam Multiple Choice (20 pts). Circle the best answer. 1. The Classical assumption of mean zero errors is satisfied if the regression model a) is linear in the

More information

Chapter 3 Multiple Regression Complete Example

Chapter 3 Multiple Regression Complete Example Department of Quantitative Methods & Information Systems ECON 504 Chapter 3 Multiple Regression Complete Example Spring 2013 Dr. Mohammad Zainal Review Goals After completing this lecture, you should be

More information

Chapter 7 Student Lecture Notes 7-1

Chapter 7 Student Lecture Notes 7-1 Chapter 7 Student Lecture Notes 7- Chapter Goals QM353: Business Statistics Chapter 7 Multiple Regression Analysis and Model Building After completing this chapter, you should be able to: Explain model

More information

1 A Non-technical Introduction to Regression

1 A Non-technical Introduction to Regression 1 A Non-technical Introduction to Regression Chapters 1 and Chapter 2 of the textbook are reviews of material you should know from your previous study (e.g. in your second year course). They cover, in

More information

Analisi Statistica per le Imprese

Analisi Statistica per le Imprese , Analisi Statistica per le Imprese Dip. di Economia Politica e Statistica 4.3. 1 / 33 You should be able to:, Underst model building using multiple regression analysis Apply multiple regression analysis

More information

New York University Department of Economics. Applied Statistics and Econometrics G Spring 2013

New York University Department of Economics. Applied Statistics and Econometrics G Spring 2013 New York University Department of Economics Applied Statistics and Econometrics G31.1102 Spring 2013 Text: Econometric Analysis, 7 h Edition, by William Greene (Prentice Hall) Optional: A Guide to Modern

More information

Final Exam - Solutions

Final Exam - Solutions Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your

More information

Lab 07 Introduction to Econometrics

Lab 07 Introduction to Econometrics Lab 07 Introduction to Econometrics Learning outcomes for this lab: Introduce the different typologies of data and the econometric models that can be used Understand the rationale behind econometrics Understand

More information

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression

Chapter 14 Student Lecture Notes Department of Quantitative Methods & Information Systems. Business Statistics. Chapter 14 Multiple Regression Chapter 14 Student Lecture Notes 14-1 Department of Quantitative Methods & Information Systems Business Statistics Chapter 14 Multiple Regression QMIS 0 Dr. Mohammad Zainal Chapter Goals After completing

More information

1.) Fit the full model, i.e., allow for separate regression lines (different slopes and intercepts) for each species

1.) Fit the full model, i.e., allow for separate regression lines (different slopes and intercepts) for each species Lecture notes 2/22/2000 Dummy variables and extra SS F-test Page 1 Crab claw size and closing force. Problem 7.25, 10.9, and 10.10 Regression for all species at once, i.e., include dummy variables for

More information

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning

SMA 6304 / MIT / MIT Manufacturing Systems. Lecture 10: Data and Regression Analysis. Lecturer: Prof. Duane S. Boning SMA 6304 / MIT 2.853 / MIT 2.854 Manufacturing Systems Lecture 10: Data and Regression Analysis Lecturer: Prof. Duane S. Boning 1 Agenda 1. Comparison of Treatments (One Variable) Analysis of Variance

More information

" M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2

 M A #M B. Standard deviation of the population (Greek lowercase letter sigma) σ 2 Notation and Equations for Final Exam Symbol Definition X The variable we measure in a scientific study n The size of the sample N The size of the population M The mean of the sample µ The mean of the

More information

Inferences for Regression

Inferences for Regression Inferences for Regression An Example: Body Fat and Waist Size Looking at the relationship between % body fat and waist size (in inches). Here is a scatterplot of our data set: Remembering Regression In

More information

Rockefeller College University at Albany

Rockefeller College University at Albany Rockefeller College University at Albany PAD 705 Handout: Suggested Review Problems from Pindyck & Rubinfeld Original prepared by Professor Suzanne Cooper John F. Kennedy School of Government, Harvard

More information

22s:152 Applied Linear Regression

22s:152 Applied Linear Regression 22s:152 Applied Linear Regression Chapter 7: Dummy Variable Regression So far, we ve only considered quantitative variables in our models. We can integrate categorical predictors by constructing artificial

More information

Chapter 4. Regression Models. Learning Objectives

Chapter 4. Regression Models. Learning Objectives Chapter 4 Regression Models To accompany Quantitative Analysis for Management, Eleventh Edition, by Render, Stair, and Hanna Power Point slides created by Brian Peterson Learning Objectives After completing

More information

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit

LECTURE 6. Introduction to Econometrics. Hypothesis testing & Goodness of fit LECTURE 6 Introduction to Econometrics Hypothesis testing & Goodness of fit October 25, 2016 1 / 23 ON TODAY S LECTURE We will explain how multiple hypotheses are tested in a regression model We will define

More information

Ch 2: Simple Linear Regression

Ch 2: Simple Linear Regression Ch 2: Simple Linear Regression 1. Simple Linear Regression Model A simple regression model with a single regressor x is y = β 0 + β 1 x + ɛ, where we assume that the error ɛ is independent random component

More information

A discussion on multiple regression models

A discussion on multiple regression models A discussion on multiple regression models In our previous discussion of simple linear regression, we focused on a model in which one independent or explanatory variable X was used to predict the value

More information

Multiple Regression. Peerapat Wongchaiwat, Ph.D.

Multiple Regression. Peerapat Wongchaiwat, Ph.D. Peerapat Wongchaiwat, Ph.D. wongchaiwat@hotmail.com The Multiple Regression Model Examine the linear relationship between 1 dependent (Y) & 2 or more independent variables (X i ) Multiple Regression Model

More information

Section 5: Dummy Variables and Interactions

Section 5: Dummy Variables and Interactions Section 5: Dummy Variables and Interactions Carlos M. Carvalho The University of Texas at Austin McCombs School of Business http://faculty.mccombs.utexas.edu/carlos.carvalho/teaching/ 1 Example: Detecting

More information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information

Regression with Qualitative Information. Part VI. Regression with Qualitative Information Part VI Regression with Qualitative Information As of Oct 17, 2017 1 Regression with Qualitative Information Single Dummy Independent Variable Multiple Categories Ordinal Information Interaction Involving

More information

Introduction to Linear Regression Analysis

Introduction to Linear Regression Analysis Introduction to Linear Regression Analysis Samuel Nocito Lecture 1 March 2nd, 2018 Econometrics: What is it? Interaction of economic theory, observed data and statistical methods. The science of testing

More information

SIMPLE REGRESSION ANALYSIS. Business Statistics

SIMPLE REGRESSION ANALYSIS. Business Statistics SIMPLE REGRESSION ANALYSIS Business Statistics CONTENTS Ordinary least squares (recap for some) Statistical formulation of the regression model Assessing the regression model Testing the regression coefficients

More information

Regression Models. Chapter 4. Introduction. Introduction. Introduction

Regression Models. Chapter 4. Introduction. Introduction. Introduction Chapter 4 Regression Models Quantitative Analysis for Management, Tenth Edition, by Render, Stair, and Hanna 008 Prentice-Hall, Inc. Introduction Regression analysis is a very valuable tool for a manager

More information

Lecture 3: Multiple Regression

Lecture 3: Multiple Regression Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u

More information

The Multiple Regression Model

The Multiple Regression Model Multiple Regression The Multiple Regression Model Idea: Examine the linear relationship between 1 dependent (Y) & or more independent variables (X i ) Multiple Regression Model with k Independent Variables:

More information

Confidence Intervals, Testing and ANOVA Summary

Confidence Intervals, Testing and ANOVA Summary Confidence Intervals, Testing and ANOVA Summary 1 One Sample Tests 1.1 One Sample z test: Mean (σ known) Let X 1,, X n a r.s. from N(µ, σ) or n > 30. Let The test statistic is H 0 : µ = µ 0. z = x µ 0

More information

22s:152 Applied Linear Regression. Take random samples from each of m populations.

22s:152 Applied Linear Regression. Take random samples from each of m populations. 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

Lecture 2 Simple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: Chapter 1

Lecture 2 Simple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: Chapter 1 Lecture Simple Linear Regression STAT 51 Spring 011 Background Reading KNNL: Chapter 1-1 Topic Overview This topic we will cover: Regression Terminology Simple Linear Regression with a single predictor

More information

Inference for the Regression Coefficient

Inference for the Regression Coefficient Inference for the Regression Coefficient Recall, b 0 and b 1 are the estimates of the slope β 1 and intercept β 0 of population regression line. We can shows that b 0 and b 1 are the unbiased estimates

More information

Econometrics. 9) Heteroscedasticity and autocorrelation

Econometrics. 9) Heteroscedasticity and autocorrelation 30C00200 Econometrics 9) Heteroscedasticity and autocorrelation Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Heteroscedasticity Possible causes Testing for

More information

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA

22s:152 Applied Linear Regression. There are a couple commonly used models for a one-way ANOVA with m groups. Chapter 8: ANOVA 22s:152 Applied Linear Regression Chapter 8: ANOVA NOTE: We will meet in the lab on Monday October 10. One-way ANOVA Focuses on testing for differences among group means. Take random samples from each

More information

MBA Statistics COURSE #4

MBA Statistics COURSE #4 MBA Statistics 51-651-00 COURSE #4 Simple and multiple linear regression What should be the sales of ice cream? Example: Before beginning building a movie theater, one must estimate the daily number of

More information

1 Correlation and Inference from Regression

1 Correlation and Inference from Regression 1 Correlation and Inference from Regression Reading: Kennedy (1998) A Guide to Econometrics, Chapters 4 and 6 Maddala, G.S. (1992) Introduction to Econometrics p. 170-177 Moore and McCabe, chapter 12 is

More information

Statistics and Quantitative Analysis U4320

Statistics and Quantitative Analysis U4320 Statistics and Quantitative Analysis U3 Lecture 13: Explaining Variation Prof. Sharyn O Halloran Explaining Variation: Adjusted R (cont) Definition of Adjusted R So we'd like a measure like R, but one

More information

2 Regression Analysis

2 Regression Analysis FORK 1002 Preparatory Course in Statistics: 2 Regression Analysis Genaro Sucarrat (BI) http://www.sucarrat.net/ Contents: 1 Bivariate Correlation Analysis 2 Simple Regression 3 Estimation and Fit 4 T -Test:

More information

The multiple regression model; Indicator variables as regressors

The multiple regression model; Indicator variables as regressors The multiple regression model; Indicator variables as regressors Ragnar Nymoen University of Oslo 28 February 2013 1 / 21 This lecture (#12): Based on the econometric model specification from Lecture 9

More information

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists

More information

Econometrics I Lecture 3: The Simple Linear Regression Model

Econometrics I Lecture 3: The Simple Linear Regression Model Econometrics I Lecture 3: The Simple Linear Regression Model Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 32 Outline Introduction Estimating

More information

Introduction To Logistic Regression

Introduction To Logistic Regression Introduction To Lecture 22 April 28, 2005 Applied Regression Analysis Lecture #22-4/28/2005 Slide 1 of 28 Today s Lecture Logistic regression. Today s Lecture Lecture #22-4/28/2005 Slide 2 of 28 Background

More information

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the

More information

Lecture 18 Miscellaneous Topics in Multiple Regression

Lecture 18 Miscellaneous Topics in Multiple Regression Lecture 18 Miscellaneous Topics in Multiple Regression STAT 512 Spring 2011 Background Reading KNNL: 8.1-8.5,10.1, 11, 12 18-1 Topic Overview Polynomial Models (8.1) Interaction Models (8.2) Qualitative

More information

Regression Analysis II

Regression Analysis II Regression Analysis II Measures of Goodness of fit Two measures of Goodness of fit Measure of the absolute fit of the sample points to the sample regression line Standard error of the estimate An index

More information

Hypothesis Testing hypothesis testing approach

Hypothesis Testing hypothesis testing approach Hypothesis Testing In this case, we d be trying to form an inference about that neighborhood: Do people there shop more often those people who are members of the larger population To ascertain this, we

More information

Statistics in medicine

Statistics in medicine Statistics in medicine Lecture 4: and multivariable regression Fatma Shebl, MD, MS, MPH, PhD Assistant Professor Chronic Disease Epidemiology Department Yale School of Public Health Fatma.shebl@yale.edu

More information

Analysis of Variance

Analysis of Variance Statistical Techniques II EXST7015 Analysis of Variance 15a_ANOVA_Introduction 1 Design The simplest model for Analysis of Variance (ANOVA) is the CRD, the Completely Randomized Design This model is also

More information

Linear Regression with one Regressor

Linear Regression with one Regressor 1 Linear Regression with one Regressor Covering Chapters 4.1 and 4.2. We ve seen the California test score data before. Now we will try to estimate the marginal effect of STR on SCORE. To motivate these

More information

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information.

STA441: Spring Multiple Regression. This slide show is a free open source document. See the last slide for copyright information. STA441: Spring 2018 Multiple Regression This slide show is a free open source document. See the last slide for copyright information. 1 Least Squares Plane 2 Statistical MODEL There are p-1 explanatory

More information

ECON 5350 Class Notes Functional Form and Structural Change

ECON 5350 Class Notes Functional Form and Structural Change ECON 5350 Class Notes Functional Form and Structural Change 1 Introduction Although OLS is considered a linear estimator, it does not mean that the relationship between Y and X needs to be linear. In this

More information

Inference for Regression Simple Linear Regression

Inference for Regression Simple Linear Regression Inference for Regression Simple Linear Regression IPS Chapter 10.1 2009 W.H. Freeman and Company Objectives (IPS Chapter 10.1) Simple linear regression p Statistical model for linear regression p Estimating

More information

( ), which of the coefficients would end

( ), which of the coefficients would end Discussion Sheet 29.7.9 Qualitative Variables We have devoted most of our attention in multiple regression to quantitative or numerical variables. MR models can become more useful and complex when we consider

More information

Chapter 13. Multiple Regression and Model Building

Chapter 13. Multiple Regression and Model Building Chapter 13 Multiple Regression and Model Building Multiple Regression Models The General Multiple Regression Model y x x x 0 1 1 2 2... k k y is the dependent variable x, x,..., x 1 2 k the model are the

More information

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B

Problems. Suppose both models are fitted to the same data. Show that SS Res, A SS Res, B Simple Linear Regression 35 Problems 1 Consider a set of data (x i, y i ), i =1, 2,,n, and the following two regression models: y i = β 0 + β 1 x i + ε, (i =1, 2,,n), Model A y i = γ 0 + γ 1 x i + γ 2

More information

Simple logistic regression

Simple logistic regression Simple logistic regression Biometry 755 Spring 2009 Simple logistic regression p. 1/47 Model assumptions 1. The observed data are independent realizations of a binary response variable Y that follows a

More information

Lab 10 - Binary Variables

Lab 10 - Binary Variables Lab 10 - Binary Variables Spring 2017 Contents 1 Introduction 1 2 SLR on a Dummy 2 3 MLR with binary independent variables 3 3.1 MLR with a Dummy: different intercepts, same slope................. 4 3.2

More information

STATISTICS 141 Final Review

STATISTICS 141 Final Review STATISTICS 141 Final Review Bin Zou bzou@ualberta.ca Department of Mathematical & Statistical Sciences University of Alberta Winter 2015 Bin Zou (bzou@ualberta.ca) STAT 141 Final Review Winter 2015 1 /

More information

EC312: Advanced Econometrics Problem Set 3 Solutions in Stata

EC312: Advanced Econometrics Problem Set 3 Solutions in Stata EC312: Advanced Econometrics Problem Set 3 Solutions in Stata Nicola Limodio www.nicolalimodio.com N.Limodio1@lse.ac.uk The data set AIRQ contains observations for 30 standard metropolitan statistical

More information

Lecture-1: Introduction to Econometrics

Lecture-1: Introduction to Econometrics Lecture-1: Introduction to Econometrics 1 Definition Econometrics may be defined as 2 the science in which the tools of economic theory, mathematics and statistical inference is applied to the analysis

More information

Lecture 9: Linear Regression

Lecture 9: Linear Regression Lecture 9: Linear Regression Goals Develop basic concepts of linear regression from a probabilistic framework Estimating parameters and hypothesis testing with linear models Linear regression in R Regression

More information

STAT5044: Regression and Anova. Inyoung Kim

STAT5044: Regression and Anova. Inyoung Kim STAT5044: Regression and Anova Inyoung Kim 2 / 47 Outline 1 Regression 2 Simple Linear regression 3 Basic concepts in regression 4 How to estimate unknown parameters 5 Properties of Least Squares Estimators:

More information

Regression With a Categorical Independent Variable: Mean Comparisons

Regression With a Categorical Independent Variable: Mean Comparisons Regression With a Categorical Independent Variable: Mean Lecture 16 March 29, 2005 Applied Regression Analysis Lecture #16-3/29/2005 Slide 1 of 43 Today s Lecture comparisons among means. Today s Lecture

More information

Generalized Linear Models

Generalized Linear Models York SPIDA John Fox Notes Generalized Linear Models Copyright 2010 by John Fox Generalized Linear Models 1 1. Topics I The structure of generalized linear models I Poisson and other generalized linear

More information

Chapter 4: Regression Models

Chapter 4: Regression Models Sales volume of company 1 Textbook: pp. 129-164 Chapter 4: Regression Models Money spent on advertising 2 Learning Objectives After completing this chapter, students will be able to: Identify variables,

More information

Christopher Dougherty London School of Economics and Political Science

Christopher Dougherty London School of Economics and Political Science Introduction to Econometrics FIFTH EDITION Christopher Dougherty London School of Economics and Political Science OXFORD UNIVERSITY PRESS Contents INTRODU CTION 1 Why study econometrics? 1 Aim of this

More information

Linear Regression With Special Variables

Linear Regression With Special Variables Linear Regression With Special Variables Junhui Qian December 21, 2014 Outline Standardized Scores Quadratic Terms Interaction Terms Binary Explanatory Variables Binary Choice Models Standardized Scores:

More information

Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur

Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur Econometric Modelling Prof. Rudra P. Pradhan Department of Management Indian Institute of Technology, Kharagpur Module No. # 01 Lecture No. # 28 LOGIT and PROBIT Model Good afternoon, this is doctor Pradhan

More information

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION

LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION LAB 5 INSTRUCTIONS LINEAR REGRESSION AND CORRELATION In this lab you will learn how to use Excel to display the relationship between two quantitative variables, measure the strength and direction of the

More information

Ch 3: Multiple Linear Regression

Ch 3: Multiple Linear Regression Ch 3: Multiple Linear Regression 1. Multiple Linear Regression Model Multiple regression model has more than one regressor. For example, we have one response variable and two regressor variables: 1. delivery

More information

Correlation Analysis

Correlation Analysis Simple Regression Correlation Analysis Correlation analysis is used to measure strength of the association (linear relationship) between two variables Correlation is only concerned with strength of the

More information

ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES

ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES ECONOMETRIC MODEL WITH QUALITATIVE VARIABLES How to quantify qualitative variables to quantitative variables? Why do we need to do this? Econometric model needs quantitative variables to estimate its parameters

More information

Logistic Regression. Continued Psy 524 Ainsworth

Logistic Regression. Continued Psy 524 Ainsworth Logistic Regression Continued Psy 524 Ainsworth Equations Regression Equation Y e = 1 + A+ B X + B X + B X 1 1 2 2 3 3 i A+ B X + B X + B X e 1 1 2 2 3 3 Equations The linear part of the logistic regression

More information

The regression model with one fixed regressor cont d

The regression model with one fixed regressor cont d The regression model with one fixed regressor cont d 3150/4150 Lecture 4 Ragnar Nymoen 27 January 2012 The model with transformed variables Regression with transformed variables I References HGL Ch 2.8

More information

Stat 5101 Lecture Notes

Stat 5101 Lecture Notes Stat 5101 Lecture Notes Charles J. Geyer Copyright 1998, 1999, 2000, 2001 by Charles J. Geyer May 7, 2001 ii Stat 5101 (Geyer) Course Notes Contents 1 Random Variables and Change of Variables 1 1.1 Random

More information

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS

MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level

More information

Sleep data, two drugs Ch13.xls

Sleep data, two drugs Ch13.xls Model Based Statistics in Biology. Part IV. The General Linear Mixed Model.. Chapter 13.3 Fixed*Random Effects (Paired t-test) ReCap. Part I (Chapters 1,2,3,4), Part II (Ch 5, 6, 7) ReCap Part III (Ch

More information

Lecture 10 Multiple Linear Regression

Lecture 10 Multiple Linear Regression Lecture 10 Multiple Linear Regression STAT 512 Spring 2011 Background Reading KNNL: 6.1-6.5 10-1 Topic Overview Multiple Linear Regression Model 10-2 Data for Multiple Regression Y i is the response variable

More information

General Linear Model (Chapter 4)

General Linear Model (Chapter 4) General Linear Model (Chapter 4) Outcome variable is considered continuous Simple linear regression Scatterplots OLS is BLUE under basic assumptions MSE estimates residual variance testing regression coefficients

More information

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression

Chapter Learning Objectives. Regression Analysis. Correlation. Simple Linear Regression. Chapter 12. Simple Linear Regression Chapter 12 12-1 North Seattle Community College BUS21 Business Statistics Chapter 12 Learning Objectives In this chapter, you learn:! How to use regression analysis to predict the value of a dependent

More information

MFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables

MFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables MFin Econometrics I Session 5: F-tests for goodness of fit, Non-linearity and Model Transformations, Dummy variables Thilo Klein University of Cambridge Judge Business School Session 5: Non-linearity,

More information

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course.

* Tuesday 17 January :30-16:30 (2 hours) Recored on ESSE3 General introduction to the course. Name of the course Statistical methods and data analysis Audience The course is intended for students of the first or second year of the Graduate School in Materials Engineering. The aim of the course

More information

Question 1 carries a weight of 25%; Question 2 carries 20%; Question 3 carries 20%; Question 4 carries 35%.

Question 1 carries a weight of 25%; Question 2 carries 20%; Question 3 carries 20%; Question 4 carries 35%. UNIVERSITY OF EAST ANGLIA School of Economics Main Series PGT Examination 017-18 ECONOMETRIC METHODS ECO-7000A Time allowed: hours Answer ALL FOUR Questions. Question 1 carries a weight of 5%; Question

More information

Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables

Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables Chapter 26 Multiple Regression, Logistic Regression, and Indicator Variables 26.1 S 4 /IEE Application Examples: Multiple Regression An S 4 /IEE project was created to improve the 30,000-footlevel metric

More information

Research Design - - Topic 13a Split Plot Design with Either a Continuous or Categorical Between Subjects Factor 2008 R.C. Gardner, Ph.D.

Research Design - - Topic 13a Split Plot Design with Either a Continuous or Categorical Between Subjects Factor 2008 R.C. Gardner, Ph.D. esearch Design - - Topic 3a Split Plot Design with Either a ontinuous or ategorical etween Subjects Factor 8.. Gardner Ph.D. General Description Example Purpose Two ategorical Factors Using Multiple egression

More information

Chapter 3: Multiple Regression. August 14, 2018

Chapter 3: Multiple Regression. August 14, 2018 Chapter 3: Multiple Regression August 14, 2018 1 The multiple linear regression model The model y = β 0 +β 1 x 1 + +β k x k +ǫ (1) is called a multiple linear regression model with k regressors. The parametersβ

More information

FinQuiz Notes

FinQuiz Notes Reading 10 Multiple Regression and Issues in Regression Analysis 2. MULTIPLE LINEAR REGRESSION Multiple linear regression is a method used to model the linear relationship between a dependent variable

More information

Chapter 8: Regression Models with Qualitative Predictors

Chapter 8: Regression Models with Qualitative Predictors Chapter 8: Regression Models with Qualitative Predictors Some predictors may be binary (e.g., male/female) or otherwise categorical (e.g., small/medium/large). These typically enter the regression model

More information

Regression With a Categorical Independent Variable

Regression With a Categorical Independent Variable Regression With a Categorical Independent Variable Lecture 15 March 17, 2005 Applied Regression Analysis Lecture #15-3/17/2005 Slide 1 of 29 Today s Lecture» Today s Lecture» Midterm Note» Example Regression

More information

Making sense of Econometrics: Basics

Making sense of Econometrics: Basics Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/

More information

CRP 272 Introduction To Regression Analysis

CRP 272 Introduction To Regression Analysis CRP 272 Introduction To Regression Analysis 30 Relationships Among Two Variables: Interpretations One variable is used to explain another variable X Variable Independent Variable Explaining Variable Exogenous

More information

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.

Contest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2. Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round

More information

Multiple Regression Analysis

Multiple Regression Analysis 1 OUTLINE Analysis of Data and Model Hypothesis Testing Dummy Variables Research in Finance 2 ANALYSIS: Types of Data Time Series data Cross-Sectional data Panel data Trend Seasonal Variation Cyclical

More information