Dummy variables in multiple variable regression model

Similar documents
Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors

January Examinations 2015

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test

β0 + β1xi. You are interested in estimating the unknown parameters β

Lecture 6: Introduction to Linear Regression

Chapter 8 Indicator Variables

Chapter 13: Multiple Regression

Marginal Effects in Probit Models: Interpretation and Testing. 1. Interpreting Probit Coefficients

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Interpreting Slope Coefficients in Multiple Linear Regression Models: An Example

ECON 351* -- Note 23: Tests for Coefficient Differences: Examples Introduction. Sample data: A random sample of 534 paid employees.

Econometrics of Panel Data

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Multinomial logit regression

x i1 =1 for all i (the constant ).

Linear Regression Analysis: Terminology and Notation

ECONOMICS 351* -- Stata 10 Tutorial 6. Stata 10 Tutorial 6

Sociology 470. Bivariate Regression. Extra Points. Regression. Liying Luo Job talk on Thursday 11/3 at Pond 302

JAB Chain. Long-tail claims development. ASTIN - September 2005 B.Verdier A. Klinger

Statistics for Business and Economics

β0 + β1xi. You are interested in estimating the unknown parameters β

NUMERICAL DIFFERENTIATION

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

The Geometry of Logit and Probit

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Chapter 15 - Multiple Regression

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Midterm Examination. Regression and Forecasting Models

CALCULUS CLASSROOM CAPSULES

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

ACTM State Calculus Competition Saturday April 30, 2011

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

Tests of Exclusion Restrictions on Regression Coefficients: Formulation and Interpretation

Lecture 19. Endogenous Regressors and Instrumental Variables

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

Exercise 1 The General Linear Model : Answers

Correlation and Regression

SIMPLE LINEAR REGRESSION

( ) 1/ 2. ( P SO2 )( P O2 ) 1/ 2.

Chapter 11: Simple Linear Regression and Correlation

Kernel Methods and SVMs Extension

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

Limited Dependent Variables

Chapter 8 Multivariate Regression Analysis

III. Econometric Methodology Regression Analysis

Gravitational Acceleration: A case of constant acceleration (approx. 2 hr.) (6/7/11)

This column is a continuation of our previous column

University of California at Berkeley Fall Introductory Applied Econometrics Final examination

Mathematics Intersection of Lines

e i is a random error

Now we relax this assumption and allow that the error variance depends on the independent variables, i.e., heteroskedasticity

2016 Wiley. Study Session 2: Ethical and Professional Standards Application

PhD/MA Econometrics Examination. January PART A (Answer any TWO from Part A)

Supplementary Notes for Chapter 9 Mixture Thermodynamics

STAT 3008 Applied Regression Analysis

Open Systems: Chemical Potential and Partial Molar Quantities Chemical Potential

(c) Pongsa Pornchaiwiseskul, Faculty of Economics, Chulalongkorn University

between standard Gibbs free energies of formation for products and reactants, ΔG! R = ν i ΔG f,i, we

Negative Binomial Regression

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

Sociology 301. Bivariate Regression. Clarification. Regression. Liying Luo Last exam (Exam #4) is on May 17, in class.

Physics 4B. Question and 3 tie (clockwise), then 2 and 5 tie (zero), then 4 and 6 tie (counterclockwise) B i. ( T / s) = 1.74 V.

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

A NOTE ON CES FUNCTIONS Drago Bergholt, BI Norwegian Business School 2011

Section 8.3 Polar Form of Complex Numbers

Chapter 9: Statistical Inference and the Relationship between Two Variables

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Lecture 4 Hypothesis Testing

a. (All your answers should be in the letter!

Statistics MINITAB - Lab 2

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Statistics for Economics & Business

Inductance Calculation for Conductors of Arbitrary Shape

ECONOMETRICS - FINAL EXAM, 3rd YEAR (GECO & GADE)

x yi In chapter 14, we want to perform inference (i.e. calculate confidence intervals and perform tests of significance) in this setting.

Let p z be the price of z and p 1 and p 2 be the prices of the goods making up y. In general there is no problem in grouping goods.

Introduction to Regression

Comparison of Regression Lines

Financing Innovation: Evidence from R&D Grants

Sociology 301. Bivariate Regression II: Testing Slope and Coefficient of Determination. Bivariate Regression. Calculating Expected Values

Important Instructions to the Examiners:

Scatter Plot x

4.3 Poisson Regression

Lectures - Week 4 Matrix norms, Conditioning, Vector Spaces, Linear Independence, Spanning sets and Basis, Null space and Range of a Matrix

Professor Chris Murray. Midterm Exam

Department of Statistics University of Toronto STA305H1S / 1004 HS Design and Analysis of Experiments Term Test - Winter Solution

Introduction to Econometrics (3 rd Updated Edition, Global Edition) Solutions to Odd-Numbered End-of-Chapter Exercises: Chapter 13

Soc 3811 Basic Social Statistics Third Midterm Exam Spring 2010

CHAPTER 8. Exercise Solutions

Chapter 4: Regression With One Regressor

Credit Card Pricing and Impact of Adverse Selection

Computation of Higher Order Moments from Two Multinomial Overdispersion Likelihood Models

Predictive Analytics : QM901.1x Prof U Dinesh Kumar, IIMB. All Rights Reserved, Indian Institute of Management Bangalore

Transcription:

WESS Econometrcs (Handout ) Dummy varables n multple varable regresson model. Addtve dummy varables In the prevous handout we consdered the followng regresson model: y x 2x2 k xk,, 2,, n and we nterpreted the coeffcents by partally dfferentatng the dependent varable wth respect to each explanatory varable y x Change n y Unt ncrease n x Holdng all else constant However, n many cases we do not have actual data for many varables that we want to nclude nto a regresson model. For example, n an equaton for wage we may want to allow for the possblty that females appeared to be pad less money for the same job. In whch case we may want to specfy a model: ln( w ) School Female () 2 where, School = Number of years of schoolng, and Female f female, 0 f male. The varable Female s known as an addtve dummy varable and has the effect of vertcally shftng the regresson lne. In all models wth dummy varables the best way to proceed s wrte out the model for each of the categores to whch the dummy varable relates. So n our case the categorcal varable would be gender (whch has two categores Males and Females). For Males (when Females=0), we have from (): ln( w ) School (2a) Whereas for females (when Females=), we have from () ln( w ) School that s, 2 ln( w ) ( ) School (2b) 2 and so the ntercept s n equaton (2a) and s ( 2) n equaton (2b). So the two lnes are parallel (as they have the same slope coeffcents), but they cross the vertcal axs at dfferent ponts. If we take expected values of equaton (2a) we get: E ln( w M ) School and takng expected values of equaton (2b) we get F E ln( w) ( 2) School Now the dfference between these two expressons for the same value of School s:

WESS Econometrcs (Handout ) ln( ) F M E w E ln( w) 2 so 2 s of the change n the expected value of ln(w) for a female compared to a male for a gven value of School. Pctorally ths s shown n Fgure, where the dashed lne represents the regresson lne for males and the sold lne that for females. Fgure : The role of an addtve dummy varable.6.5. ln(w).3 b 3.2. 3.9 2 3 5 6 7 8 School If nstead of estmatng equaton (), we estmated ln( w ) School Male (3) 0 2 where Male f male, 0 f female, as Male Female then ln( w ) School ( Female ) 0 2 ln( w ) School Female 0 2 2 ln( w ) ( ) School Female () 0 2 2 Comparng equaton () wth equaton () as the varables on the left and rght hand sde of the equaton are dentcal the coeffcents must be dentcal, n whch case:,, 2 2 0 2 and so by usng Male n place of Female n equaton (), all that happens s the coeffcent on the Male dummy wll be mnus that on the Female dummy and the coeffcent on the ntercept wll shft accordngly to allow for the fact that a Female s now the default case. For the categorcal varable gender (n whch there are two categores), you nclude dummy varable n the regresson model (n our example Female), so that Male s the 2

WESS Econometrcs (Handout ) default and the coeffcent on Female s the change n the expected value of the dependent varable (for gven values of the other varables) for females relatve to males. In general, f there s a categorcal varable wth s categores, then you nclude s- dummy varables and the omtted category s the default and the coeffcent on any ncluded category s then measured relatve to the default (omtted) category. 2. Multplcatve dummy varables In the equaton for wage we may also want to allow for the possblty that female pay, relatve to male pay, vares wth the level of schoolng. In whch case we may want to specfy a model: ln( w ) School Female ( Female School ) (5) 2 3 where School = Number of years of schoolng and Female f person s female, 0 f male. The varable Female s known as an addtve dummy varable and ( Female School ) s known as a multplcatve dummy varable. The multplcatve dummy varable has the effect of rotatng the regresson lne, so for Males (when Females=0) and we have from (5): ln( w ) School (6) Whereas for females (when Females=) and we have from (5) ln( w ) School School that s, 2 3 ln( w ) ( ) ( ) School (7) 2 3 and so the ntercept s and the coeffcent on School s n equaton (6) and the ntercept s ( 2) and the coeffcent on School s ( 3) n equaton (7). So 2 s the change n the expected value of ln(w) for a female compared to a male for School=0, whle, ln( w ) School ln( w ) School All else constant and Male ( ) 3 All else constant and Female and 3 s the ncrease n the sem-elastcty of wages wth respect to an extra year of schoolng for females compared to males. Pctorally ths s shown n Fgure 2, where 3

WESS Econometrcs (Handout ) the dashed lne represents the regresson lne for males and the sold lne that for females. Fgure 2: The role of an multplcatve dummy varable.6.5. b.3 ln(w) b +b 3.2. 3.9 2 3 5 6 7 8 School As gender has two categores, you can only nclude multplcatve dummy varable n our regresson model ( Female School ), so that Male s the default and the coeffcent on Female School s the change n the coeffcent on School (for gven values of the other varables) for females compared to males. In general, f there s a categorcal varable wth s categores, then you nclude s- multplcatve dummy varables (multpled by School) and the omtted category s the default and the coeffcent on any of the multplcatve dummy varables s measured relatve to the default (omtted) category. 3. Interactve dummy varables In the equaton for wage we may also want to allow for the possblty that two dummy varables could be multpled together, for example, to allow n the model females to be pad less than males and that the dfferental be dfferent for non-whtes (non-wh). In whch case we may want to specfy a model: ln( w ) Female non wh ( Female non wh ) (8) 2 3 In whch we have alternatve cases. Male-whte ln( w ).0.0.0.0 2 3

WESS Econometrcs (Handout ) E[ln( w )] 2. Female-whte ln( w )..0..0 E[ln( w )] 3. Male-Non-whte 2 3 ln( w ).0..0. E[ln( w )] 2 2 3. Female-Non-whte ln( w ).... 2 3 (9a) (9b) (9c) E[ln( w )] 2 3 (9d) Comparng equaton (9b) to equaton (9a): E w E w F, Wh M, Wh ln( ) ln( ) (0a) that s, = Proportonate change n expected wages for females compared to males, gven they are a whte ndvdual. Comparng equaton (9c) to equaton (9a): E w E w M, NonWh M, Wh ln( ) ln( ) 2 (0b) That s, 3 = Proportonate change n expected wages for Whte ndvduals compared to Non-whte ndvduals gven they are males. Comparng equaton (9d) to equaton (9c): E w E w F, NonWh M, NonWh ln( ) ln( ) ( 3) (0c) that s, 2 = Proportonate change n expected wages for females compared to males, gven they are a non-whte ndvdual Comparng equaton (0c) to equaton (0a) ln( ) F, Non Wh ln( ) M, Non Wh ln( ) F, Wh ln( ) M, Wh E w E w E w E w 3 (0d) That s, = Addtonal proportonate change n expected wages for females (compared to males) who are a non-whte ndvdual compared to a whte ndvdual. 5

WESS Econometrcs (Handout ) Alternatvely we could wrte ths as: Male Female Gender effect Whte 2 Non-whte 2 2 3 3 3 Ethncty effect 2 2 3 3 Ths s an example of a dfference-n-dfference model as the coeffcent on the nteracton term s nterpreted as the dfference between two dfferences (see equaton (0d)). Suppose we know that average salares lsted below: Male Female Gender effect Whte 5.09 2.58-2.5 Non-whte.92 2.50-2.2 0.09 Ethncty effect -0.7-0.08 0.09 Then usng these numbers and estmatng equaton (8) by OLS would yeld: a 5.09, b 2.5, b2 0.7 and b 3 0.09. 6