Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 18th Class 7/2/10

Similar documents
SIMULTANEOUS EQUATION MODEL

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Simultaneous Equation Models Learning Objectives Introduction Introduction (2) Introduction (3) Solving the Model structural equations

Chapter 14. Simultaneous Equations Models Introduction

Econometrics. 7) Endogeneity

Wooldridge, Introductory Econometrics, 3d ed. Chapter 16: Simultaneous equations models. An obvious reason for the endogeneity of explanatory

ECON 4160: Econometrics-Modelling and Systems Estimation Lecture 9: Multiple equation models II

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 17th Class 7/1/10

08 Endogenous Right-Hand-Side Variables. Andrius Buteikis,

Rockefeller College University at Albany

Lecture 12. Functional form

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Dealing With Endogeneity

ECON 4160, Spring term 2015 Lecture 7

The regression model with one stochastic regressor (part II)

Simultaneous Equation Models (Book Chapter 5)

ECO375 Tutorial 8 Instrumental Variables

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Econ Review Set 2 - Answers

Problem Set # 1. Master in Business and Quantitative Methods

Single-Equation GMM: Endogeneity Bias

Club Convergence: Some Empirical Issues

Simultaneous Equation Models

Econ 510 B. Brown Spring 2014 Final Exam Answers


Econ 11: Intermediate Microeconomics. Preliminaries

1 Motivation for Instrumental Variable (IV) Regression

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Instrumental Variables, Simultaneous and Systems of Equations

ECONOMETRICS HONOR S EXAM REVIEW SESSION

WISE International Masters

8. Instrumental variables regression

Lecture Notes Part 7: Systems of Equations

Financial Econometrics

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 12th Class 6/23/10

Handout 12. Endogeneity & Simultaneous Equation Models

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Econometrics 2, Class 1

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

Chapter 2: simple regression model

Keynesian Macroeconomic Theory

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

Short T Panels - Review

An explanation of Two Stage Least Squares

Time Series Methods. Sanjaya Desilva

Rockefeller College University at Albany

14.32 Final : Spring 2001

Multivariate Models. Christopher Ting. Christopher Ting. April 19,

Applied Health Economics (for B.Sc.)

Econometrics Part Three

Theoretical premises of the Keynesian approach

Department of Agricultural Economics. PhD Qualifier Examination. May 2009

The linear regression model: functional form and structural breaks

Handout 11: Measurement Error

11. Further Issues in Using OLS with TS Data

11. Simultaneous-Equation Models

Economics 308: Econometrics Professor Moody

Multiple Linear Regression CIVL 7012/8012

Econometric Analysis of Cross Section and Panel Data

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Motivation Non-linear Rational Expectations The Permanent Income Hypothesis The Log of Gravity Non-linear IV Estimation Summary.

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Measurement Error. Often a data set will contain imperfect measures of the data we would ideally like.

Write your identification number on each paper and cover sheet (the number stated in the upper right hand corner on your exam cover).

Lecture 8: Aggregate demand and supply dynamics, closed economy case.

Solutions to Problem Set 5 (Due December 4) Maximum number of points for Problem set 5 is: 62. Problem 9.C3

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Multivariate forecasting with VAR models

APPENDIX 1 BASIC STATISTICS. Summarizing Data

Simple Regression Model (Assumptions)

ECON 4160: Econometrics-Modelling and Systems Estimation Lecture 7: Single equation models

Econometrics Problem Set 11

Economics 241B Estimation with Instruments

Exercises for Chapter 7

An Introduction to Parameter Estimation

Econ107 Applied Econometrics

ECON Introductory Econometrics. Lecture 16: Instrumental variables

Lecture 6: Dynamic Models

University of Pittsburgh Department of Economics Econ 1720: Advanced Macroeconomics Handout 3

ECO 310: Empirical Industrial Organization Lecture 2 - Estimation of Demand and Supply

slides chapter 3 an open economy with capital


The Simple Linear Regression Model

Dynamic Panel Data Models

Simultaneous equation model (structural form)

For logistic growth, we have

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

The Augmented Solow Model Revisited

Mgmt 469. Causality and Identification

ACE 564 Spring Lecture 8. Violations of Basic Assumptions I: Multicollinearity and Non-Sample Information. by Professor Scott H.

6. Assessing studies based on multiple regression

statistical sense, from the distributions of the xs. The model may now be generalized to the case of k regressors:

Bristol Business School

ECNS 561 Multiple Regression Analysis

Instrumental Variables Estimation in Stata

Macroeconomics Field Exam. August 2007

Instrumental Variables and the Problem of Endogeneity

Linear Regression. Junhui Qian. October 27, 2014

Tutorial 1: Linear Algebra

Transcription:

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis 18th Class 7/2/10 Out of the air a voice without a face Proved by statistics that some cause was just In tones as dry and level as the place. --W.H. Auden handouts: --practice test (first two hours worth) --short answers for practice test --essay questions --test next Wed. --project due by 6:00 next Fri. as an email attachment final s format: open-book for the problems, closed-book for the essays should allocate an hour or less to the essays, two hours to the problems go over first problem on problem set #11 [see handout] recall we were going through cases of departure from the assumptions regarding error term structure that lead to OLS being the preferred method review Ch. 24 briefly: predicting a series from itself and from other lagged and contemporaneous series if mean of error terms does not appear to be zero, may need to try a nonlinear fit instead, or use an instrumental variables method (Ch. 25) Ch. 25: addresses the problem of simultaneous equation bias often in economics, each single relationship/equation is part of a bigger model, so some of the X variables in any one equation are Y variables in other equations. Ignoring these interrelationships leads to biased estimates of parameters for single equations. 18-1

consider the macroeconomic model: consumption = C 0 + C 1 (national income) national income = consumption + investment we will write this in statistical form as: Y = α + βx + e X = Y + I (no error term because this is a definition) Note that by assumption, I is exogenous and is therefore statistically independent of e. X and Y, however, are endogenous and are thus influenced by both I and e. This model is mathematically complete; i.e., the two equations in this system can be solved for the two unknowns/endogenous variables. However, in an equation in a simultaneous system, regressors that are not predetermined (X in this case) are not independent of the error term (e). In other words, the assumption underlying OLS that X is statistically independent of e is violated. In this model, X and e are positively correlated: when X is large, e tends to be positive; when X is small, e tends to be negative. Therefore, β is biased upward. In general, we can determine the direction of simultaneous equation bias on each affected coefficient. you can see this by solving for Y and X in terms of α, β, I, and e: for the definition equation: X = α + βx + e + I so X = α 1-β + ( 1 1-β )I + e 1-β so since X is a positive function of e, the correlation between X and e must be positive this means that if e is large, Y is larger and therefore so is X and if e is small, Y is smaller and therefore so is X 18-2

this violates the assumption of OLS that X and e are uncorrelated (which is the assumption that the mean of e is zero) How to remedy the problem of simultaneous equation bias? Use the technique of instrumental variables (IV). The trick is to find a new variable V that has the following properties: 1) V should be uncorrelated with e 2) V should be highly correlated with X We can calculate the covariance of V and Y: s VY = (V - V )(Y - Y ) n - 1 consider our basic regression model: Y = α + βx + e and express the variables in deviation form (relative to the mean), which causes the constant to drop out: (Y - Y ) = β(x - X ) + (e - e ) note the mean e is not equal to zero if there is simultaneous equation bias, so the last term does not reduce to e. Then multiply both sides by (V - V ): (V - V )(Y - Y ) = β(v - V )(X - X ) + (V - V )(e - e ) sum both sides from 1 to n: (V - V )(Y - Y ) = β (V - V )(X - X ) + (V V)(e e) 18-3

and divide through by (n - 1): s VY = βs VX + s Ve so we have transformed the equation relating variables into an equation relating covariances. Now we can rearrange this to show the formula for β: β = s VY s VX - s Ve s VX define the first term on the right as: b V = s VY s VX if the last term on the right is small, then b V is a good estimator of β. b V is the instrumental variable (IV) estimator of β. This term will in fact be small if the two desirable properties of V listed above are met (so s Ve approaches 0, causing the estimator to be consistent; and s VX is large if V and X are highly correlated, again reducing the last term through increasing the size of the denominator) Note that if V = X, the IV estimator is the OLS estimator (in other words, you are using X as an instrumental variable) and it will be a good estimator in the single equation model where σ Ve (σ Xe ) is 0 by assumption However, in the simultaneous equation system, using X as an instrument causes the estimator to be inconsistent because X and e are correlated ( σ Xe 0). However, exogenous variables will yield consistent estimates because they are uncorrelated with e. They can still be better or worse instruments depending on their correlation with X. I in our model above passes this criterion well, so we would use the estimator for β: b I = s IY s IX 18-4

This can be generalized to use a vector of instruments to construct the instrumental variable estimator for any given β. The technique of two stage least squares (2SLS) does exactly this; to estimate any single equation in a system which contains a mix of exogenous and endogenous regressors: 1) (first stage) regress endogenous regressors on all the exogenous variables in the sytem, using OLS. Then the predicted value of the endogenous regressor in each case will be independent of e by construction. 2) (second stage) use the predicted values of the endogenous variables contained in the equation and the exogenous variables as instruments to estimate the single equation. This yields a system of n equations in n unknowns, where n = the number of parameters to be estimated in the equation. This system can then be solved for estimates of the parameters. IV is a much more general technique however. Note that in the example on p. 732 in the book, all the X and Y variables could not enter into each equation. Consider the identification problem in general: an equation will be unidentified (incapable of being estimated) if the number of slope parameters to be estimated in the equation exceeds the number of exogenous variables appearing anywhere in the system. do problem 25-8: 3 exogenous variables, so the 2nd and 5th equations cannot be estimated a graphical example: the two-equation model of a market: supply and demand; it is really a three-equation model, where the equilibrium condition reduces the number of variables to be determined to 2 (Q and P): Q S = b 0 + b 1 P + b 2 W Q D = c 0 + c 1 P + c 2 I 18-5

Here both the supply and demand curves can be identified because there are 2 exogenous variables which cause shifts in the curves when they are graphed in P-Q space. But consider a simpler model: Q S = b 0 + b 1 P + b 2 W Q D = c 0 + c 1 P We can only identify the line that does not move; movements in the other line trace out the stationary line; in this case only the demand curve is identified [show graph] Note that identification is different from mathematical completeness, where one needs as many equations as unknowns in order to solve for the unknowns. Therefore, in both of these market models, if one adds the equilibrium condition: Q S = Q D = Q one can solve for Q and P in terms of the parameters (as functions of the parameters). But if we have data on Q, P, I, and W, only in the first model can all the parameters be estimated individually. so we see modeling has a number of issues: what is exog., what is endog., are there enough equations to solve for the endog, what variables should be in each equation, are all the equations identified? 18-6