Economics 620, Lecture 20: Generalized Method of Moment (GMM)

Similar documents
Economics 620, Lecture 18: Nonlinear Models

Economics 620, Lecture 9: Asymptotics III: Maximum Likelihood Estimation

Economics 620, Lecture 7: Still More, But Last, on the K-Varable Linear Model

Economics 620, Lecture 15: Introduction to the Simultaneous Equations Model

Suggested Solution for PS #5

Economics 620, Lecture 19: Introduction to Nonparametric and Semiparametric Estimation

Economics 620, Lecture 14: Lags and Dynamics

Chapter 1. GMM: Basic Concepts

A Course on Advanced Econometrics

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Economics 620, Lecture 4: The K-Varable Linear Model I

We begin by thinking about population relationships.

Notes on Generalized Method of Moments Estimation

Economics 620, Lecture 8: Asymptotics I

Notes on Asymptotic Theory: Convergence in Probability and Distribution Introduction to Econometric Theory Econ. 770

Lecture Notes on Measurement Error

Simple Estimators for Semiparametric Multinomial Choice Models

Introduction to Estimation Methods for Time Series models Lecture 2

Introduction: structural econometrics. Jean-Marc Robin

13 Endogeneity and Nonparametric IV

Chapter 2. GMM: Estimating Rational Expectations Models

Economics 620, Lecture 13: Time Series I

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

Generalized Method of Moments (GMM) Estimation

LECTURE 18: NONLINEAR MODELS

plim W 0 " 1 N = 0 Prof. N. M. Kiefer, Econ 620, Cornell University, Lecture 16.

Instrumental Variables and Two-Stage Least Squares

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Chapter 2. Dynamic panel data models

Economics 241B Estimation with Instruments

Microeconometrics: Clustering. Ethan Kaplan

Econ 583 Final Exam Fall 2008

GMM and SMM. 1. Hansen, L Large Sample Properties of Generalized Method of Moments Estimators, Econometrica, 50, p

2014 Preliminary Examination

GMM estimation of spatial panels

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

LECTURE 10: NEYMAN-PEARSON LEMMA AND ASYMPTOTIC TESTING. The last equality is provided so this can look like a more familiar parametric test.

Economics 620, Lecture 2: Regression Mechanics (Simple Regression)

Introduction to the Simultaneous Equations Model

1 Outline. 1. Motivation. 2. SUR model. 3. Simultaneous equations. 4. Estimation

7 Semiparametric Methods and Partially Linear Regression

Econometrics Midterm Examination Answers

Fluctuations from the Semicircle Law Lecture 1

Lecture 4: Linear panel models

Quantitative Techniques - Lecture 8: Estimation

GMM, HAC estimators, & Standard Errors for Business Cycle Statistics

11. Further Issues in Using OLS with TS Data

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

Econ 623 Econometrics II Topic 2: Stationary Time Series

EC212: Introduction to Econometrics Review Materials (Wooldridge, Appendix)

STAT 100C: Linear models

GMM - Generalized method of moments

Economics 241B Review of Limit Theorems for Sequences of Random Variables

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture 6: Dynamic panel models 1

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Multiple Equation GMM with Common Coefficients: Panel Data

PSC 504: Instrumental Variables

Asymptotics for Nonlinear GMM

The Kernel Trick, Gram Matrices, and Feature Extraction. CS6787 Lecture 4 Fall 2017

Discussion of Sensitivity and Informativeness under Local Misspecification

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford.

Empirical Methods in Applied Microeconomics

Asymptotic Theory. L. Magee revised January 21, 2013

Analogy Principle. Asymptotic Theory Part II. James J. Heckman University of Chicago. Econ 312 This draft, April 5, 2006

Estimating Deep Parameters: GMM and SMM

Single-Equation GMM: Endogeneity Bias

Generalized Method of Moment

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22

Stochastic Processes. Monday, November 14, 11

ECON 616: Lecture 1: Time Series Basics

1 Class Organization. 2 Introduction

Chapter 1: A Brief Review of Maximum Likelihood, GMM, and Numerical Tools. Joan Llull. Microeconometrics IDEA PhD Program

The Asymptotic Variance of Semi-parametric Estimators with Generated Regressors

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Sampling Distributions and Asymptotics Part II

L03. PROBABILITY REVIEW II COVARIANCE PROJECTION. NA568 Mobile Robotics: Methods & Algorithms

Causal Inference Lecture Notes: Causal Inference with Repeated Measures in Observational Studies

Computation Of Asymptotic Distribution. For Semiparametric GMM Estimators. Hidehiko Ichimura. Graduate School of Public Policy

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Lecture Notes Part 7: Systems of Equations

Strength and weakness of instruments in IV and GMM estimation of dynamic panel data models

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria

LECTURE 11: GENERALIZED LEAST SQUARES (GLS) In this lecture, we will consider the model y = Xβ + ε retaining the assumption Ey = Xβ.

Contents. University of York Department of Economics PhD Course 2006 VAR ANALYSIS IN MACROECONOMICS. Lecturer: Professor Mike Wickens.

Economics 620, Lecture 4: The K-Variable Linear Model I. y 1 = + x 1 + " 1 y 2 = + x 2 + " 2 :::::::: :::::::: y N = + x N + " N

ECON 4160: Econometrics-Modelling and Systems Estimation Lecture 7: Single equation models

Module 9: Stationary Processes

Econ 582 Fixed Effects Estimation of Panel Data

Missing dependent variables in panel data models

On the optimal weighting matrix for the GMM system estimator in dynamic panel data models

Specification testing in panel data models estimated by fixed effects with instrumental variables

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Business Economics BUSINESS ECONOMICS. PAPER No. : 8, FUNDAMENTALS OF ECONOMETRICS MODULE No. : 3, GAUSS MARKOV THEOREM

Transcription:

Economics 620, Lecture 20: Generalized Method of Moment (GMM) Nicholas M. Kiefer Cornell University Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 1 / 16

Key: Set sample moments equal to theoretical moments and solve parameters. Generalized moments: Expectations of functions Eg. E(y ) = 0 set 1 n P (yi ^) = 0, ^ = y Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 2 / 16

Regression E(y X ) = 0 set 1 n P (yi X i ) = 0? Too many solutions Suppose we group into K groups Solve simultaneously Illustrates arbitrariness of choice of moment conditions A better moment condition: E(X 0 (y X )) = 0 Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 3 / 16

solve a GMM estimator! X (y X ^) = 0 for ^ = (X 0 X ) 1 X 0 y Often we have overidentifying restrictions E(W 0 (y X )) = 0 W : n p, p > k Then W 0 (y X ^) = 0 is p linear equations in K unknonws. Write W 0 y = W 0 X + " and do GLS. Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 4 / 16

If Then V (y) = 2 I, V (") = 2 W 0 W. ^ GLS = ^ GMM = (X 0 W (W 0 W ) 1 W 0 X ) 1 X 0 W (W 0 W ) 1 W 0 y Which is the solution to min(y X ^) 0 W (W 0 W ) 1 W 0 (y X ^). With V (y) = V : ^ = (X 0 W (W 0 VW ) 1 W 0 X ) 1 W (W 0 VW ) 1 W 0 y: Solution to min(y x ^) 0 W (W 0 VW ) 1 W 0 (y X ^) Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 5 / 16

Distribution Theory Assume W 0 " p n! N(0; W 0 VW ) plausible? Then ^ + (X 0 W (W 0 VW ) 1 X 0 W ) 1 X 0 W (W 0 VW ) 1 W 0 " so p n(^ )~N(0; (X 0 W (W 0 VW ) 1 W 0 X ) 1 ) The trick is choosing the moments (or instruments) Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 6 / 16

More cannot hurt. Asymptotically we have: Instead use: (Fewer conditions) Then E(W 0 (y X )) = 0 E(A 0 W 0 (y X )) = 0 A : m p, m < p V ( A ) = [X 0 WA(A 0 W 0 VWA) 1 A 0 W 0 X ] V (^) 1 V ( A ) 1 = X 0 W [(W 0 VW ) 1 A(A 0 W 0 VWA) 1 A 0 ]W 0 X Letting CC 0 = (W 0 VW ) 1 V (^) 1 V ( A ) 1 = X 0 WC[I C 1 A(A 0 C 0 1 C 1 A) 1 A 0 C 0 1 ]C 0 W 0 X p.s.d., so V ( A ) V (^) Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 7 / 16 1

Note that more may not help if conditions are chosen right. (W 0 = X 0 for OLS) Look more closely at the case V (y) = I : If we let W = [XZ] ^ = (X 0 W (W 0 W ) 1 W 0 X ) 1 X 0 W (W 0 W ) 1 W 0 y = (X 0 X ) 1 X 0 y (after a little work) First factor is (X 0 X ) 1 Note W (W 0 W ) 1 W 0 X is the matrix of predicted values from the regression of X on W (namely, X itself) Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 8 / 16

Nonlinear Models E(W 0 f ()) = 0, W : n p, f (): n 1, : k 1 Same principle as linear: Let F = f : n k and E( 0 ) = V Solving by nonlinear GLS minimizes f 0 W (W 0 VW ) 1 W 0 f and p n(^ )~N(0; n[f 0 W (W 0 VW ) 1 W 0 F ] 1 ) choice of instruments? 1 F Try W = V Then V (^) = [F 0 V 1 F (F 0 V 1 VV 1 F ) 1 F 0 V 1 F ] 1 = [F 0 V 1 F ] 1 Smallest in this class (why?) (note there are only k of these) Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 9 / 16

Generalization We have only considered covariances so far. Consider G: N p where we let each column represent di erent functions of data. We have N observations on each function, sothe moment conditions become E(f ti (y t ; )) = 0, i = 1; :::; k t = 1; :::; N: Where earlier we had speci ed f it (y; ) = w it f it Now GLS can be applied: Resulting in min 1 0 GAG 0 1 Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 10 / 16

That is, the moment conditions are 1 0 G = 0 and A is a p.d. weighting matrix. For e ciency, A should be equal to (well, proportional to) V (1 0 G) 1 (this is E(FF 0 ) in earlier notation). To develop intuition for this, consider the linear case where G has elements f ti = w ti (y t x t ) And the 1 0 just sums over observations, so V (1 0 G) is just (W 0 W ): Note 1 0 G is taking the place of (y X ) 0 W : Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 11 / 16

Asymptotic Distribution of GMM Write Q = 1 0 GAG 0 1, the function to be minimized to calculate the GMM estimator. Taking the Taylor expansion of the rst order condition Q = 0 0 = Q () + Q ( ) and just as in the ML case, we solve for the vector of estimation errors as ( ) = (Q ) 1 Q and apply a LLN to the second derivative matrix and a CLT to the scores. Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 12 / 16

The notation can get cumbersome here, but let g ij = the derivative of the ith column of 1 0 G with respect to j, and let g = fg ij g be the associated matrix. Then V (n 1=2 ( )) = (g 0 Ag) 1 g 0 AEFF 0 Ag(g 0 Ag) 1 : This comes from evaluating the derivatives, bringing in the scaling factors in n and generally simplifying. The result should look familiar. Note that when A = EFF 0, the formula simpli es to As in the usual GLS case! V (n 1=2 ( )) = (g 0 Ag) 1 : Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 13 / 16

Questions: How many moments to use? Note the FOC only use k... Nice analogy with 2SLS - with lots of IV, still with 2SLS, we reduce to the just identi ed case by using the optimal linear combination of instruments. Discussion? In fact k moments are su cient for e ciency if there are k parameters. What are the k moments? The real use for GMM is when the LF is too complicated or unknown (better, not plausibly known). Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 14 / 16

Conditional GMM Basically generates more moment conditions. When we take X 0 (y X ) = 0, we are imposing that the error is uncorrelated with X. But the property may be stronger, e.g. that E((y X )jx ) = 0. This implies that EX 0 (y X ) = 0 but also that any other function of X is uncorrelated with (y X ). This comes up a lot in RE modeling and provides a source of lots of moment conditions. Note tradeo between introduction of noise and gains from using more moments. Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 15 / 16

Conclusion GMM requires fewer assumptions than ML Can be somewhat arbitrary Can be very ine cient relative to ML Professor N. M. Kiefer (Cornell University) Lecture 20: GMM 16 / 16