Simultaneous Equation Models (Book Chapter 5)

Similar documents
Course Notes: Statistical and Econometric Methods

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

ECONOMETRIC THEORY. MODULE XVII Lecture - 43 Simultaneous Equations Models

SIMULTANEOUS EQUATION MODEL

Instrumental Variables, Simultaneous and Systems of Equations

Bootstrap Approach to Comparison of Alternative Methods of Parameter Estimation of a Simultaneous Equation Model

Econ 300/QAC 201: Quantitative Methods in Economics/Applied Data Analysis. 18th Class 7/2/10

Statistical and Econometric Methods for Transportation Data Analysis

ECON 4160: Econometrics-Modelling and Systems Estimation Lecture 9: Multiple equation models II

Econometric Methods and Applications II Chapter 2: Simultaneous equations. Econometric Methods and Applications II, Chapter 2, Slide 1

Econometrics Summary Algebraic and Statistical Preliminaries

Short Questions (Do two out of three) 15 points each

Lecture 4: Heteroskedasticity

Econometrics II - EXAM Answer each question in separate sheets in three hours

Econometrics - 30C00200

Statistical and Econometric Methods for Transportation Data Analysis

Financial Econometrics

ECON 497: Lecture 4 Page 1 of 1

Economics 582 Random Effects Estimation

A primer on Structural VARs

Dealing With Endogeneity

Econometric Analysis of Cross Section and Panel Data

. a m1 a mn. a 1 a 2 a = a n

A Joint Tour-Based Model of Vehicle Type Choice and Tour Length

Econometrics. 7) Endogeneity

Econ 510 B. Brown Spring 2014 Final Exam Answers

We begin by thinking about population relationships.

Introduction to Econometrics Final Examination Fall 2006 Answer Sheet

7. GENERALIZED LEAST SQUARES (GLS)

A Non-Parametric Approach of Heteroskedasticity Robust Estimation of Vector-Autoregressive (VAR) Models

The Estimation of Simultaneous Equation Models under Conditional Heteroscedasticity

Robustness of Simultaneous Estimation Methods to Varying Degrees of Correlation Between Pairs of Random Deviates

1 Outline. 1. Motivation. 2. SUR model. 3. Simultaneous equations. 4. Estimation

Title. Description. var intro Introduction to vector autoregressive models

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

plim W 0 " 1 N = 0 Prof. N. M. Kiefer, Econ 620, Cornell University, Lecture 16.

Simultaneous Equations and Weak Instruments under Conditionally Heteroscedastic Disturbances

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

The impact of residential density on vehicle usage and fuel consumption*

Multivariate Models. Christopher Ting. Christopher Ting. April 19,

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

The Linear Regression Model

The BLP Method of Demand Curve Estimation in Industrial Organization

Linear Regression. Junhui Qian. October 27, 2014

2. Linear regression with multiple regressors

Statistical and Econometric Methods for Transportation Data Analysis

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Ordinary Least Squares Regression

1 Introduction to Generalized Least Squares

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

Econometric Methods. Prediction / Violation of A-Assumptions. Burcu Erdogan. Universität Trier WS 2011/2012

A Course on Advanced Econometrics

A NOTE ON THE EFFECT OF THE MULTICOLLINEARITY PHENOMENON OF A SIMULTANEOUS EQUATION MODEL

4.8 Instrumental Variables

Rockefeller College University at Albany

PhD/MA Econometrics Examination. January, 2015 PART A. (Answer any TWO from Part A)

ECON 366: ECONOMETRICS II. SPRING TERM 2005: LAB EXERCISE #10 Nonspherical Errors Continued. Brief Suggested Solutions

Time Series Methods. Sanjaya Desilva

Estimating and Testing the US Model 8.1 Introduction

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

Economics 308: Econometrics Professor Moody

Econometrics of Panel Data

Heteroskedasticity and Autocorrelation

Instrumental Variables and Two-Stage Least Squares

Quantitative Analysis of Financial Markets. Summary of Part II. Key Concepts & Formulas. Christopher Ting. November 11, 2017

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Nonstationary Panels

Lecture Notes Part 7: Systems of Equations

IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors

WISE MA/PhD Programs Econometrics Instructor: Brett Graham Spring Semester, Academic Year Exam Version: A

Statistical and Econometric Methods for Transportation Data Analysis

The Case Against JIVE

Lesson 17: Vector AutoRegressive Models

Multiple Equation GMM with Common Coefficients: Panel Data

Introduction to Econometrics Midterm Examination Fall 2005 Answer Key

Topic 6: Non-Spherical Disturbances

Handout 12. Endogeneity & Simultaneous Equation Models

Instrumental variables estimation using heteroskedasticity-based instruments

Structural VAR Models and Applications

Exogeneity and Causality

Statistics 910, #5 1. Regression Methods

Intermediate Econometrics

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

8. Instrumental variables regression

Topic 7: Heteroskedasticity

Multicollinearity. Filippo Ferroni 1. Course in Econometrics and Data Analysis Ieseg September 22, Banque de France.

Simple Linear Regression Model & Introduction to. OLS Estimation

Truncation and Censoring

Linear Models in Econometrics


Generalized Method of Moment

Appendix A: The time series behavior of employment growth

An Introduction to Structural Equation Modeling with the sem Package for R 1. John Fox

11. Simultaneous-Equation Models

Exogeneity tests and weak-identification

Next is material on matrix rank. Please see the handout

Lecture 4: Multivariate Regression, Part 2

Simple Linear Regression: The Model

Transcription:

Simultaneous Equation Models (Book Chapter 5) Interrelated equations with continuous dependent variables: Utilization of individual vehicles (measured in kilometers driven) in multivehicle households Interrelation between travel time from home to an activity and the duration of the activity Interrelation of average vehicle speeds by lane with the vehicle speeds in adjacent lanes. Problem: Estimation of equation systems by the ordinary least squares (OLS) violates a key OLS assumption in that a correlation between regressors and distrubances will be present because not all independent variables are fixed in random samples (violation of A5).

Overview of the simultaneous equations problem Consider annual vehicle utilization equations (one for each vehicle) in twovehicle households of the following linear form: u = β Z + α X + λu + ε 1 1 1 1 1 2 1 u = β Z + α X + λu + ε 2 2 2 2 2 1 2 Where: u 1 is the kilometers per year that vehicle 1 is driven, u 2 is the kilometers per year that vehicle 2 is driven, Z 1 and Z 2 are vectors of vehicle attributes (for vehicles 1 and 2 respectively), X is a vector of household characteristics, β's, α's, are vectors of estimable parameters, λ's are estimable scalars, and ε's are disturbance terms.

To satisfy regression assumption A5, the value of the dependent variable (left-hand side variable) must not influence the value of an independent variable (right-hand side). This is not the case in these equations because in the first equation the independent variable u 2 varies as the dependent variable u 1 varies, and in the second equation, the independent variable u 1 varies as the dependent variable u 2 varies. Thus, u 2 and u 1 are said to be endogenous variables in Equations 5.1 and 5.2 respectively.

Reduced Form and the Identification Problem Reduced form solution: solving two equations and two unknowns to arrive at reduced forms. Substituting second equation into the first in the previous example: [ ] u = β Z + α X + λ β Z + α X + λ u + ε + ε 1 1 1 1 1 2 2 2 2 1 2 1 rearranging, β α + λα λβ λε + ε u = Z + X + Z + 1 1 1 2 1 2 1 2 1 1 1 2 1 λ1λ2 1 λλ 1 2 1 λλ 1 2 1 λλ 1 2 and similarly substituting first equation for u 1 in the second equation gives,

β α + λα λ β λε + ε u = Z + X + Z + 2 2 2 1 2 1 2 1 2 2 2 1 1 λ2λ1 1 λλ 2 1 1 λλ 2 1 1 λλ 2 1 Because the endogenous variables u 1 and u 2 are replaced by their exogenous determinants, the equations cand be estimated using ordinary least squares (OLS) as, where, u1= az 1 1+ bx 1 + cz 1 2+ξ 1, and u2 = a2z2 + b2x + c2z1+ξ 2, β α a ; b + λα λ β λε = = ; c = ; = + ε ξ 1 1 1 2 1 2 1 2 1 1 1 1 1 1 λ1λ2 1 λλ 1 2 1 λλ 1 2 1 λλ 1 2

β α + λα λ β λε + ε a = ; b = ; c = ; ξ = λ λ λ λ λ λ λ λ. 2 2 2 1 2 1 2 1 2 2 2 2 1 1 2 1 1 2 1 1 2 1 1 2 1 OLS estimation of these reduced form models (Equations 5.6 and 5.7) is called indirect least squares (ILS).

Problem: While estimated reduced form models are readily used for forecasting purposes, if inferences are to be drawn from the model system, the underlying parameters need to be determined. Unfortunately, uncovering the underlying parameters, (the β's, α's, and λ's) in reduced form models is problematic because either too little or too much information is often available. For example, note that above equations provide two possible solutions for β 1, ( ) β = a 1 λλ and β = 1 1 1 2 1 ( λ λ ) c 1 2 2 1 λ 2.

The Identification Problem In some instances, it may be impossible to determine the underlying parameters. In these cases, the modeling system is said to be unidentified. In cases where exactly one equation solves the underlying parameters, the model system is said to be exactly identified. When more than one equation solves the underlying parameters (as shown in Equation 5.10), the model system is said to be over identified.

Order Condition Determines an equation to be identified if the number of all variables excluded from an equation in an equation system is greater than or equal to the number of endogenous variables in the equation system minus one. For example, in the first equation in the original equation system above, the number of elements in the vector Z 2, which is an exogenous vector excluded from the equation, must be greater than or equal to one because there are two endogenous variables in the equation system (u 1 and u 2 ). Simultaneous Equation Estimation Two modeling alternatives: single-equations estimation methods and systems estimation methods.

The distinction between the two is that systems methods consider all of the parameter restrictions (caused by over identification) in the entire equation system and account for possible contemporaneous (cross-equation) correlation of disturbance terms. Because system estimation approaches are able to utilize more information (parameter restrictions and contemporaneous correlation), they produce variance-covariance matrices that are at worst equal to, and in most cases smaller than those produced by single-equation methods (resulting in lower standard errors and higher t-statistics for estimated model parameters).

Single equation methods 1) Indirect least squares (ILS) Applies ordinary least squares to the reduced form models. Consistent but not unbiased 2) Instrumental variables (IV) Uses an instrument (a variable that is highly correlated with the endogenous variable it replaces, but is not correlated to the disturbance term) to estimate individual equations Consistent but not unbiased.

3) Two-stage least squares (2SLS) Approach finds the best instrument for endogenous variables. Stage 1 regresses each endogenous variable on all exogenous variables. Stage 2 uses regression-estimated values from stage 1 as instruments, and estimates equations with ordinary least squares. Consistent but not unbiased. Generally better small sample properties than ILS or IV.

4) Limited Information Maximum Likelihood (LIML) Uses maximum likelihood to estimate reduced form models. Can incorporate parameter restrictions in over identified equations. Consistent but not unbiased. Has same asymptotic variance-covariance matrix as 2SLS. System equation methods 1) Three Stage Least Squares (3SLS) Stage 1 gets 2SLS estimates of the model system. Stage 2 uses the 2SLS estimates to compute residuals to determine crossequation correlations. Stage 3 uses generalized least squares (GLS) to estimate model parameters. Consistent and more efficient than single-equation estimation methods.

2) Full Information Maximum Likelihood (FIML) Similar to LIML but accounts for contemporaneous correlation of disturbances in the likelihood function. Consistent and more efficient than single-equation estimation methods. Has same asymptotic variance-covariance matrix as 3SLS. A note on generalized least squares estimation Ordinary least squares (OLS) assumptions are that disturbance terms have equal variances and are not correlated. Generalized least squares (GLS) is used to relax these OLS assumptions. Under OLS assumptions, in matrix notation, ( T ) 2 Ε εε = σ I

where: E(.) denotes expected value, ε is an n 1 column vector of equation disturbance terms (where n is the total number of observations in the data), T ε is the 1 n transpose of ε, σ 2 is the disturbance term variance, and I is the n n identity matrix, 1 0. 0 0 1. 0 I =.... 0 0. 1 When heteroskedasticity is present, Ε ( εε ) T =. Ω, where Ω is n n matrix,

2 1 2 2 2 0 0 0 0 0 0 n....... σ σ Ω σ =. For disturbance-term correlation, ( ) 2 T = Ε εε σ Ω, where 1 2 1 2 1 1 1 N N N N....... = ρ ρ ρ ρ Ω ρ ρ

Recall that in ordinary least squares, parameters are estimated from, where: T ( ) 1 ˆ X X X T = Y β, ˆβ is an p 1 column vector (where p is the number of parameters), X is an n p matrix of data, T X is the transpose of X, and Y is an n 1 column vector. Using Ω, Equation 5A.5 is rewritten as, T ( ) 1 ˆ 1 T 1 = X X X Y β Ω Ω. The most difficult aspect of GLS estimation is obtaining an estimate of the Ω matrix. In 3SLS, it is estimated using the initial 2SLS parameter estimates.

3SLS application (Manering 1983) u = β Z + α X + λu + ε 1 1 1 1 1 2 1 u = β Z + α X + λu + ε 2 2 2 2 2 1 2 Data: Two Vehicle Households September to November 1979 A total of 272 households Monthly usage in miles

Variable Parameter Estimate t-statistic Constant 1205.14 10.28 Scrappage probability (= $250/vehicle value) Cost per mile divided by income (fuel price in cents/mpg/income) Age of principal driver indicator (1 if principal driver age is less than 50, 0 otherwise) -1150.63-5.82-102.23-2.21 123.51 1.62

Female indicator (1 if principal drive is female, 0 otherwise) Urban indicator (1 if household resides in an urabn area, 0 otherwise) -130.93-1.87-154.44-2.89 Use of other vehicle -0.172-1.18 R 2 = 0.2395 and 0.2611 Percent change resulting from doubling of fuel prices: -12.9% ignoring substutution effects. -11.3% accounting for within-household substution