Addressing Alternative. Multiple Regression Spring 2012

Similar documents
Addressing Alternative Explanations: Multiple Regression

[The following data appear in Wooldridge Q2.3.] The table below contains the ACT score and college GPA for eight college students.

Interpreting Slope Coefficients in Multiple Linear Regression Models: An Example

ECON 351* -- Note 23: Tests for Coefficient Differences: Examples Introduction. Sample data: A random sample of 534 paid employees.

Introduction to Dummy Variable Regressors. 1. An Example of Dummy Variable Regressors

DO NOT OPEN THE QUESTION PAPER UNTIL INSTRUCTED TO DO SO BY THE CHIEF INVIGILATOR. Introductory Econometrics 1 hour 30 minutes

β0 + β1xi. You are interested in estimating the unknown parameters β

Reminder: Nested models. Lecture 9: Interactions, Quadratic terms and Splines. Effect Modification. Model 1

β0 + β1xi. You are interested in estimating the unknown parameters β

Interval Estimation in the Classical Normal Linear Regression Model. 1. Introduction

x i1 =1 for all i (the constant ).

Lecture 6: Introduction to Linear Regression

January Examinations 2015

Chapter 9: Statistical Inference and the Relationship between Two Variables

Comparison of Regression Lines

Topic 7: Analysis of Variance

Chapter 15 - Multiple Regression

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Lecture 9: Linear regression: centering, hypothesis testing, multiple covariates, and confounding

Systems of Equations (SUR, GMM, and 3SLS)

Department of Quantitative Methods & Information Systems. Time Series and Their Components QMIS 320. Chapter 6

Statistics for Economics & Business

Module Contact: Dr Susan Long, ECO Copyright of the University of East Anglia Version 1

β0 + β1xi and want to estimate the unknown

Y = β 0 + β 1 X 1 + β 2 X β k X k + ε

Statistics for Business and Economics

Chap 10: Diagnostics, p384

Question 1 carries a weight of 25%; question 2 carries 20%; question 3 carries 25%; and question 4 carries 30%.

Regression Analysis. Regression Analysis

BIO Lab 2: TWO-LEVEL NORMAL MODELS with school children popularity data

Chapter 2 - The Simple Linear Regression Model S =0. e i is a random error. S β2 β. This is a minimization problem. Solution is a calculus exercise.

Chapter 8 Indicator Variables

Statistics for Managers Using Microsoft Excel/SPSS Chapter 14 Multiple Regression Models

Linear regression. Regression Models. Chapter 11 Student Lecture Notes Regression Analysis is the

Introduction to Regression

Basically, if you have a dummy dependent variable you will be estimating a probability.

Resource Allocation and Decision Analysis (ECON 8010) Spring 2014 Foundations of Regression Analysis

The Geometry of Logit and Probit

Political Science 552

/ n ) are compared. The logic is: if the two

Chapter 15 Student Lecture Notes 15-1

Chapter 5: Hypothesis Tests, Confidence Intervals & Gauss-Markov Result

Marginal Effects of Explanatory Variables: Constant or Variable? 1. Constant Marginal Effects of Explanatory Variables: A Starting Point

Chapter 13: Multiple Regression

since [1-( 0+ 1x1i+ 2x2 i)] [ 0+ 1x1i+ assumed to be a reasonable approximation

STATISTICS QUESTIONS. Step by Step Solutions.

Lecture 4 Hypothesis Testing

Negative Binomial Regression

e i is a random error

Soc 3811 Basic Social Statistics Third Midterm Exam Spring 2010

Statistics for Managers Using Microsoft Excel/SPSS Chapter 13 The Simple Linear Regression Model and Correlation

Correlation and Regression

The Ordinary Least Squares (OLS) Estimator

1. Inference on Regression Parameters a. Finding Mean, s.d and covariance amongst estimates. 2. Confidence Intervals and Working Hotelling Bands

University of California at Berkeley Fall Introductory Applied Econometrics Final examination

17 - LINEAR REGRESSION II

Midterm Examination. Regression and Forecasting Models

Correlation and Regression. Correlation 9.1. Correlation. Chapter 9

Basic Business Statistics, 10/e

The SAS program I used to obtain the analyses for my answers is given below.

Outline. Zero Conditional mean. I. Motivation. 3. Multiple Regression Analysis: Estimation. Read Wooldridge (2013), Chapter 3.

Here is the rationale: If X and y have a strong positive relationship to one another, then ( x x) will tend to be positive when ( y y)

Lecture 3 Stat102, Spring 2007

Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur

Lecture Notes for STATISTICAL METHODS FOR BUSINESS II BMGT 212. Chapters 14, 15 & 16. Professor Ahmadi, Ph.D. Department of Management

Professor Chris Murray. Midterm Exam

Econ107 Applied Econometrics Topic 3: Classical Model (Studenmund, Chapter 4)

Lab 4: Two-level Random Intercept Model

STAT 3008 Applied Regression Analysis

Chapter 8 Multivariate Regression Analysis

Biostatistics 360 F&t Tests and Intervals in Regression 1

ECONOMICS 351*-A Mid-Term Exam -- Fall Term 2000 Page 1 of 13 pages. QUEEN'S UNIVERSITY AT KINGSTON Department of Economics

ECON Introductory Econometrics. Lecture 5: OLS with One Regressor: Hypothesis Tests

The Multiple Classical Linear Regression Model (CLRM): Specification and Assumptions. 1. Introduction

Tests of Exclusion Restrictions on Regression Coefficients: Formulation and Interpretation

Scientific Question Determine whether the breastfeeding of Nepalese children varies with child age and/or sex of child.

Economics 130. Lecture 4 Simple Linear Regression Continued

Lecture 2: Prelude to the big shrink

Econ Statistical Properties of the OLS estimator. Sanjaya DeSilva

Chapter 12 Analysis of Covariance

Chapter 3. Two-Variable Regression Model: The Problem of Estimation

Biostatistics. Chapter 11 Simple Linear Correlation and Regression. Jing Li

UNIVERSITY OF TORONTO Faculty of Arts and Science. December 2005 Examinations STA437H1F/STA1005HF. Duration - 3 hours

PBAF 528 Week Theory Is the variable s place in the equation certain and theoretically sound? Most important! 2. T-test

( )( ) [ ] [ ] ( ) 1 = [ ] = ( ) 1. H = X X X X is called the hat matrix ( it puts the hats on the Y s) and is of order n n H = X X X X.

Empirical Application of Simple Regression (Chapter 2)

LINEAR REGRESSION ANALYSIS. MODULE VIII Lecture Indicator Variables

Properties of Least Squares

Scatter Plot x

Yarine Fawaz ECONOMETRICS I

a. (All your answers should be in the letter!

If we apply least squares to the transformed data we obtain. which yields the generalized least squares estimator of β, i.e.,

Statistics MINITAB - Lab 2

ISQS 6348 Final Open notes, no books. Points out of 100 in parentheses. Y 1 ε 2

) is violated, so that V( instead. That is, the variance changes for at least some observations.

Chapter 14 Simple Linear Regression

Lecture 24: Partial correlation, multiple regression, and correlation

MODULE TWO, PART THREE: ENDOGENEITY, INSTRUMENTAL VARIABLES AND TWO-STAGE LEAST SQUARES IN ECONOMIC EDUCATION RESEARCH USING STATA

Linear correlation and linear regression

Econ107 Applied Econometrics Topic 9: Heteroskedasticity (Studenmund, Chapter 10)

Transcription:

Addressng Alternatve Explanatons: Multple Regresson 7.87 Sprng 0

Dd Clnton hurt Gore example Dd Clnton hurt Gore n the 000 electon? Treatment s not lkng Bll Clnton How would you test ths?

Bvarate regresson of Gore thermometer on Clnton thermometer Clnton thermometer

Dd Clnton hurt Gore example What alternatve explanatons would you need to address? Nonrandom selecton nto the treatment group (dslkng Clnton from many sources Let s address one source: party dentfcaton How could we do ths? Matchng: compare Democrats who lke or don t lke Clnton; do the same for Republcans and ndependents Multvarate regresson: control for partsanshp statstcally Also called multple regresson, Ordnary Least Squares (OLS Presentaton below s ntutve

Democratc pcture Clnton thermometer

Independent d pcture Clnton thermometer

Republcan pcture Clnton thermometer

Combned data pcture Clnton thermometer

Combned data pcture wth regresson: bas! Clnton thermometer

Combned data pcture wth true regresson lnes overlad Clnton thermometer

Temptng yet wrong normalzatons Subtract the Gore therm. from the avg. Gore therm. score Clnton thermometer Subtract the Clnton therm. from the avg. Clnton therm. score Clnton thermometer

3D Relatonshp

3D Lnear Relatonshp

3D Relatonshp: Clnton 00 50 0

3D Relatonshp: party Rep Ind Dem

The Lnear Relatonshp between Three Varables Gore thermometer Clnton thermometer Party ID Y 0,, STATA: reg y x x reg gore clnton party3

Multvarate slope coeffcents Bvarate estmate: Multvarate estmate: ˆ ˆ B M Clnton effect (on Gore n bvarate (B regresson cov(, Y var( vs. cov(, Y ˆ M cov(, - var( var( Are Gore and Party ID related? Clnton effect (on Gore n multvarate (M regresson ˆ ˆ B M ˆ cov(, var( Are Clnton and Party ID related? M When does? Obvously, when 0 s Clnton thermometer, s PID, and Y s Gore thermometer

Th Sl C ff t The Slope Coeffcents n n n n Y Y,,, and ( ( ˆ - ( ( ˆ n n n n,, ( ( n n n n Y Y,,, ( ( ( ˆ - ( ( ( ˆ,, ( ( s Clnton thermometer, s PID, and Y s Gore thermometer

The Slope Coeffcents More Smply cov(, Y ˆ cov(, - var( var( ˆ cov(, Y ˆ cov(, - var( var( ˆ and s Clnton thermometer, s PID, and Y s Gore thermometer

The Matrx form y x, x, x k, y x, x, x k, y n x,n x,n x k,n ( y

The Output t. reg gore clnton party3 Source SS df MS Number of obs = 745 -------------+------------------------------ F(, 74 = 048.04 Model 696.9 34630.955 Prob > F = 0.0000 Resdual 5964.934 74 300.0949 R-squared = 0.546 -------------+------------------------------ Adj R-squared = 0.5456 Total 56.84 744 660.68053 Root MSE = 7.37 ------------------------------------------------------------------------------ gore Coef. Std. Err. t P> t [95% Conf. Interval] -------------+---------------------------------------------------------------- clnton.5875.07595 9. 0.000.4777776.5467975 party3 5.77053.5594846 0.3 0.000 4.6739 6.867856 _cons 8.699.0547 7.9 0.000 6.686 30.649 ------------------------------------------------------------------------------ Interpretaton of clnton effect: Holdng constant party dentfcaton a one- Interpretaton of clnton effect: Holdng constant party dentfcaton, a onepont ncrease n the Clnton feelng thermometer s assocated wth a.5 ncrease n the Gore thermometer.

Separate regressons ( ( (3 Intercept 3. 55.9 8.6 Clnton 0.6 -- 0.5 Party -- 5.7 5.8

Is the Clnton effect causal? That s, should we be convnced that negatve feelngs about Clnton really hurt Gore? No! The regresson analyss has only ruled out lnear nonrandom selecton on party ID. Nonrandom selecton nto the treatment could occur from Varables other than party ID, or Reverse causaton, that s, feelngs about Gore nfluencng feelngs about Clnton. Addtonally, the regresson analyss may not have entrely ruled out nonrandom selecton even on party ID because t may have assumed the wrong functonal form. E.g., what f nonrandom selecton on strong Republcan/strong Democrat, but not on weak partsans

Other approaches to addressng confoundng effects? Experments Dfference-n-dfferences n desgns Others?

Summary: Why we control Address alternatve explanatons by removng confoundng effects Improve effcency

Why dd the Clnton Coeffcent change from 0.6 to 0.5. corr gore clnton party, cov (obs=745 gore clnton party3 -------------+--------------------------- gore 660.68 clnton 549.993 883.8 party3 3.7008 6.905.8735

The Calculatons l ˆ B ˆ cov( gore, clnton var( clnton 549.993993 883.8 0.67 ˆ cov( gore, clnton var( clnton M ˆ M 549.993 6.905 5.7705 883.8 883.8 0.67 0.05 0.5 cov( clnton, party var( clnton. corr gore clnton party,cov (obs=745 gore clnton party3 -------------+--------------------------- gore 660.68 clnton 549.993 883.8 party3 3.7008 6.905.8735

D Drnkng and dgreek klfe Example Why s there a correlaton between lvng n a fraternty/sororty y house and drnkng? Greek organzatons often emphasze socal gatherngs g that have alcohol. The effect s beng n the Greek organzaton tself, not the house. There s somethng about the House envronment tself.

Dependent varable: Tmes Drnkng n Past 30 Days

. nfx age 0- resdence 6 greek 4 screen 0 tmespast30 03 howmuchpast30 04 gpa 78-79 studyng 8 tmeshs 35 howmuchhs 36 socalzng 83 stwgt 99 475-493 weght99 494-5 usng da388.dat,clear (438 observatons read. recode tmespast30 tmeshs (=0 (=.5 (3=4 (4=7.5 (5=4.5 (6=9.5 (7=45 (tmespast30: 657 changes made (tmeshs: 07 changes made. replace tmespast30=0 f screen<=3 (463 real changes made

. tab tmespast30 tmespast30 Freq. Percent Cum. ------------+----------------------------------- 0 4,65 33.37 33.37.5,737 9.64 53.0 4,653 9.03 7.04 7.5,854 3.30 85.34 4.5,648.8 97.7 9.5 350.5 99.68 45 45 0.3 00.00 ------------+----------------------------------- Total 3,939 00.00

Key explanatory varables Lve n fraternty/sororty house Indcator varable (dummy varable Coded f lve n, 0 otherwse Member of fraternty/sororty Indcator varable (dummy varable Coded f member, 0 otherwse

Three Regressons Dependent varable: number of tmes drnkng n past 30 days Lve n frat/sor house (ndcator varable 4.44 ---.6 (0.35 (0.38 Member of frat/sor (ndcator varable ---.88 (0.6.44 (0.8 Intercept 454 4.54 47 4.7 47 4.7 (0.56 (0.059 (0.059 R.0.03.05 N 3,876 3,876 3,876 N t St d d th C B t l What s the substantve nterpretaton of the coeffcents? Note: Standard errors n parentheses. Corr. Between lvng n frat/sor house and beng a member of a Greek organzaton s.4

The Pcture Lvng n frat house =0.9 M ˆ =.6 Drnks k per 30 days Y Member of fraternty y M ˆ =.44

Accountng for the total t effect ˆ B ˆ M ˆ M Total effect = Drect effect + ndrect effect Lvng n frat house =0.9 M ˆ =.6 Drnks per 30 days Y Member of fraternty M ˆ =.44

Accountng for the effects of frat house lvng and Greek membershp on drnkng Effect Total Drect Indrect Member of.88.44 0.44 Greek org. (85% (5% Lve n frat/ 4.44.6.8 sor. house (5% (49%