Notes on Generalized Method of Moments Estimation

Similar documents
ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

A Course on Advanced Econometrics

Chapter 1. GMM: Basic Concepts

13 Endogeneity and Nonparametric IV

We begin by thinking about population relationships.

GMM-based inference in the AR(1) panel data model for parameter values where local identi cation fails

Chapter 2. Dynamic panel data models

On GMM Estimation and Inference with Bootstrap Bias-Correction in Linear Panel Data Models

Chapter 2. GMM: Estimating Rational Expectations Models

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Simple Estimators for Semiparametric Multinomial Choice Models

GMM estimation of spatial panels

Instrumental Variables. Ethan Kaplan

Speci cation of Conditional Expectation Functions

Generalized Method of Moments: I. Chapter 9, R. Davidson and J.G. MacKinnon, Econometric Theory and Methods, 2004, Oxford.

2014 Preliminary Examination

Economics 620, Lecture 20: Generalized Method of Moment (GMM)

11. Bootstrap Methods

The Case Against JIVE

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Time Series Models and Inference. James L. Powell Department of Economics University of California, Berkeley

Strength and weakness of instruments in IV and GMM estimation of dynamic panel data models

E ciency Gains by Modifying GMM Estimation in Linear Models under Heteroskedasticity

Suggested Solution for PS #5

Instrumental Variables and GMM: Estimation and Testing. Steven Stillman, New Zealand Department of Labour

LSQ. Function: Usage:

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

Day 2A Instrumental Variables, Two-stage Least Squares and Generalized Method of Moments

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Dynamic Panels. Chapter Introduction Autoregressive Model

1. The Multivariate Classical Linear Regression Model

Generalized Method of Moments Estimation

Simple Estimators for Monotone Index Models

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

Instrumental Variables and Two-Stage Least Squares

Introduction: structural econometrics. Jean-Marc Robin

R = µ + Bf Arbitrage Pricing Model, APM

On the optimal weighting matrix for the GMM system estimator in dynamic panel data models

ECONOMETRICS II (ECO 2401) Victor Aguirregabiria. Spring 2018 TOPIC 4: INTRODUCTION TO THE EVALUATION OF TREATMENT EFFECTS

ORTHOGONALITY TESTS IN LINEAR MODELS SEUNG CHAN AHN ARIZONA STATE UNIVERSITY ABSTRACT

Inference about Clustering and Parametric. Assumptions in Covariance Matrix Estimation

the error term could vary over the observations, in ways that are related

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

GMM based inference for panel data models

Contents. University of York Department of Economics PhD Course 2006 VAR ANALYSIS IN MACROECONOMICS. Lecturer: Professor Mike Wickens.

GMM Estimation with Noncausal Instruments

This chapter reviews properties of regression estimators and test statistics based on

Birkbeck Working Papers in Economics & Finance

Economics 241B Estimation with Instruments

Microeconometrics: Clustering. Ethan Kaplan

Robust Con dence Intervals in Nonlinear Regression under Weak Identi cation

Likelihood Ratio Based Test for the Exogeneity and the Relevance of Instrumental Variables

Econometrics I. Ricardo Mora

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

Economics 620, Lecture 7: Still More, But Last, on the K-Varable Linear Model

Economic modelling and forecasting

The Impact of a Hausman Pretest on the Size of a Hypothesis Test: the Panel Data Case

Testing Overidentifying Restrictions with Many Instruments and Heteroskedasticity

Markov-Switching Models with Endogenous Explanatory Variables. Chang-Jin Kim 1

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria

When is it really justifiable to ignore explanatory variable endogeneity in a regression model?

Specification Test for Instrumental Variables Regression with Many Instruments

The BLP Method of Demand Curve Estimation in Industrial Organization

Univariate Panel Data Models and GMM Estimators: An Exploration Using Real and Simulated Data

ECON 594: Lecture #6

Generalized Method of Moments (GMM) Estimation

Spatial panels: random components vs. xed e ects

Ross (1976) introduced the Arbitrage Pricing Theory (APT) as an alternative to the CAPM.

Exercises (in progress) Applied Econometrics Part 1

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Introductory Econometrics

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

GMM - Generalized method of moments

Pseudo panels and repeated cross-sections

GMM Based Tests for Locally Misspeci ed Models

Single Equation Linear GMM

Econometrics of Panel Data

An estimate of the long-run covariance matrix, Ω, is necessary to calculate asymptotic

The Forward Premium is Still a Puzzle Appendix

Lecture 4: Linear panel models

Discriminating between (in)valid external instruments and (in)valid exclusion restrictions

OLSQ. Function: Usage:

Parametric Inference on Strong Dependence

Judging contending estimators by simulation: tournaments in dynamic panel data models

ERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Robust Standard Errors in Transformed Likelihood Estimation of Dynamic Panel Data Models with Cross-Sectional Heteroskedasticity

Bootstrap-Based Improvements for Inference with Clustered Errors

Testing Weak Convergence Based on HAR Covariance Matrix Estimators

ECONOMET RICS P RELIM EXAM August 24, 2010 Department of Economics, Michigan State University

Robust Standard Errors in Transformed Likelihood Estimation of Dynamic Panel Data Models with Cross-Sectional Heteroskedasticity

A Conditional-Heteroskedasticity-Robust Con dence Interval for the Autoregressive Parameter

Birkbeck Economics MSc Economics, PGCert Econometrics MSc Financial Economics Autumn 2009 ECONOMETRICS Ron Smith :

Heteroskedasticity. Part VII. Heteroskedasticity

7 Semiparametric Methods and Partially Linear Regression

A CONSISTENT SPECIFICATION TEST FOR MODELS DEFINED BY CONDITIONAL MOMENT RESTRICTIONS. Manuel A. Domínguez and Ignacio N. Lobato 1

Nonparametric Identi cation of Regression Models Containing a Misclassi ed Dichotomous Regressor Without Instruments

Transcription:

Notes on Generalized Method of Moments Estimation c Bronwyn H. Hall March 1996 (revised February 1999) 1. Introduction These notes are a non-technical introduction to the method of estimation popularized by Hansen and others, commonly referred to as Generalized Method of Moments. Theoretical issues are mentioned but not discussed thoroughly and no proofs are given (nor are assumptions fully stated): see Hansen (1982), Chamberlain (1987), and the texts by Ruud (2000) and Davidson and MacKinnon (1993) for details on these. The emphasis here is on explication of the basic method and notes on how to perform this kind of estimation in TSP. 2. Method of Moments This is a consistent, but not generally e cient estimator. Assume the data y = (y 1 ;y 2 ; ::::; y n ) are generated by a probability distribution indexed by a parameter vector µ with k elements. In the method of moments, µ is estimated by computing k sample moments of y, setting them equal to population moments derived from the assumed probability distribution, and solving for µ. Because sample moments are generally consistent estimators of population moments (under some regularity conditions), b µ will be consistent for µ (if it exists). 2.1. Example 1: Sample Mean The population moment is ¹, the expectation of y and the sample moment is the sample mean of y. y is assumed to be independently and identically distributed

with mean ¹ and nite variance. Then we can estimate ¹ by setting it equal to thesample rstmoment: b¹ = 1 n 2.2. Example 2: Gamma Distribution nx y i The data y are assumed to be generated according to the probability density f(y i )= p e y i y p 1 i (p) There are two parameters to be estimated, and p. The rst two population moments are the following: E(y i )=p= and V (y i )=p= 2 The rst two sample moments are the mean y and the variance s 2 y. We set these equal to the population moments and solve for and p: b = y=s 2 y and bp = y 2 =s 2 y These estimates are consistent if the sample variance of y is nite; for example, p lim b = p lim y=p lim s 2 y = E(Y )=V (Y )= 3. Generalized Method of Moments - Theory In the previous two examples, we had exactly as many moment conditions as parameters, so we simply solved the k equation in k unknowns to obtain our estimates. What happens when we have more moment conditions than parameters? How can we combine them optimally? That is what the GMM estimator does. It estimates the parameter vector by minimizing the sum of squares of the di erences between the population moments and the sample moments, using the variance of the moments as a metric. This is the minimum variance estimator in the class of estimators that use these moment conditions. Here is some notation: Write the theoretical moment condition as E[m j (y i; µ)] = 0 j =1; :::; J and k < J elements in µ 2

Then the sample moments of the data are m j (µ) = 1 n nx m j (y i; µ) The set of J equations m j (µ) =0will not have a unique solution for µ in general. Now de ne The GMM estimator is then m(µ) =[m j (µ); m 2 (µ); :::; m J (µ)] bµ =argmin µ m(µ) 0 W 1 m(µ) where W is the asymptotic variance of m(µ) or an estimate. This estimator is similar to, but not the same as, a generalized least squares (GLS) estimator. When W is either the true variance or a consistent estimate of it, b µ is consistent and asymptotically e cient for µ, in the class of estimators that use only the information in the moment conditions (for example, there may be a more e cient ML estimator if the data y are assumed to be generated by a speci c probability distribution - see Davidson and MacKinnon, pp. 597-600 for details). That is, if W is positive de nite, p lim m =0, and certain regularity conditions hold, p lim b µ = µ Theasymptoticvarianceofthisestimatoris (µ) =[G 0 W 1 G] 1 where G is the matrix of derivatives of the moment conditions with respect to the parameters; the jth row of G is given by Comments: G j = @m j @µ 0 ² When J = k, the choice of W does not matter, since m 0 at the solution. ² When J<k, µ is not identi ed. 3

² Under the null that the moment restrictions hold, the criterion function evaluated at the estimated b µ has the chi-squared distribution with degrees of freedom equal to the number of independent moment conditions (J) less the number of estimated parameters (k). This is the basis of the omnibus speci cation test called the J-test or the Sargan test. TSP reports the value of this test statistic and its p-value in the GMM output. ² m j (y i; µ) can be an arbitrary nonlinear function of data and parameters. 3.1. Example 3: Instrumental Variable Estimation with Homoskedastic Disturbances The model is the usual linear model: y i = X i + e i i =1; :::; n V ar(e i )=¾ 2 E[e i jx i ] 6= 0but E[e i jz i ]=0for a set of J instruments Z i Assume J>k. Then the J moment conditions are m = Z 0 e =0 We can estimate the variance of m by the following: W = Var(Z 0 e)=e[z 0 ee 0 Z] E[Z 0 e]e[e 0 Z]=E[Z 0 (¾ 2 I)Z] =¾ 2 Z 0 Z Therefore the GMM estimator using these moment conditions, is given by b IV =argmine 0 Z(Z 0 Z) 1 Z 0 e Note two things: we have dropped ¾ 2 because it is a scalar and does not a ect the minimization 1 and the resulting criterion function is the instrumental variable criterion function. The solution to this minimization problem has a rst order condition given by 1 This is not strictly true in nite samples. If we are estimating ¾ 2 along with the rest of the model, because the estimate will depend on b IV, it will make a di erence if we include it in the criterion function. If we do include it, and we update it continuously during the iterations, the resulting estimator will be LIML rather than IV; they are asymptotically equivalent but may di er in nite samples. 4

@e 0 @ Z(Z0 Z) 1 Z 0 e = X 0 Z(Z 0 Z) 1 Z 0 (y Xb IV )=0 which yields the IV estimator: b IV =[X 0 Z(Z 0 Z) 1 Z 0 X] 1 X 0 Z(Z 0 Z) 1 Z 0 y An estimate of the variance of this estimator is given by (b IV )=[G 0 c W 1 G] 1 =[X 0 Z(¾ 2 Z 0 Z) 1 Z 0 X] 1 This is the estimator computed by INST (TSLS) in TSP for linear two stage least squares or instrumental variables. In general, the model can be nonlinear with additive disturbances, so that e = h(y; X; ). In that case the estimator is computed by iterative methods and thevarianceis (b IV )=[ @e 0 Z(¾ 2 Z 0 Z) 1 Z 0 @e @ @ ] 1 This is the estimator computed by LSQ in TSP for nonlinear two stage least squares. 3.2. Example 4: Three Stage Least Squares with Heteroskedastic Disturbances (the Hansen-Singleton Estimator) This is the estimator commonly known in the econometric literature as GMM. It is usually applied in situations where there are several equations with correlated disturbances in the model (e.g., multiple assets as in Hansen-Singleton, or multiple time periods for panel data), and where it would be inappropriate to impose homoskedasticity on the disturbances. For simplicity, I discuss the case where the same set of instruments are available for all equations. See Appendix B to Mairesse and Hall (1996) for details on what to do when you want to use di erent instruments in di erent equations (use the MASK option in GMM in TSP). The moment conditions for this estimator can be written m i = z i e i 5

where z i is a vector of z s for the ith rm (including all time periods) and e i is a vector of disturbances for the ith rm. The data for each rm are assumed to be independently but not identically distributed conditional on the z s. The moment conditions for estimation are therefore nx m = (z i e i ) Thevarianceofm is Var[ P n z i e i ] and this can be approximated by the following expression (given the conditional independence of the rms): W = 1 nx i e i )(z i e i ) n (z 0 = 1 nx z i zi 0 e i e 0 i n In general, this formula allows for heteroskedasticity of the e s, but when they are homoskedastic, it reduces to the following: E[W jz] = 1 nx z i zi 0 E[e i e 0 n ijz] = 1 n Z0 Z where is the covariance matrix of the disturbances e. Thus one can show that GMM with the assumption of homoskedasticity for this model is the same as three-stage least squares estimation. In TSP, it can be done using 3SLS (LSQ) by listing the z s in the instrument list and de ning the equations of the model using FRML. However, under heteroskedasticity, the variance of the e s does not factor out of this expression, and we obtain the proper GMM estimator. Although estimation is straightforward (use the GMM procedure), several di culties arise: ² Estimates of the variance W basedonfourthmomentsconvergeslowly,so you may need quite large samples to get consistent estimates. In nite samples, three-stage type estimates may be preferred if the amount of heteroskedasticity is not too large. ² Like 2SLS and 3SLS, nite sample estimates are biased toward OLS (estimates without instruments), but in this case nite may be fairly large. ² As in the case of 2SLS and 3SLS, nite sample estimates are sensitive to the choice of normalization (which, if any, endogenous variable has its coe cient set to unity) unless the estimate of W used is continuously updated during iteration. A LIML-type estimator is available that may be preferred in this case. 6

4. References Chamberlain, G. (1987). Asymptotic E ciency in Estimation with Conditional Moment Restrictions, Journal of Econometrics 34: 305-334. Davidson, Russell, and James G. MacKinnon (1993). Estimation and Inference in Econometrics, New York: Oxford University Press. Hall, Bronwyn H., and Clint Cummins (1999). TSP Version 4.5 User s Manual, Palo Alto, CA: TSP International. Hansen, L. P. (1982). Large Sample Properties of Generalized Method of Moments Estimators, Econometrica 50: 1029-1054. Mairesse, J., and B. H. Hall (1996). Estimating the Productivity of Research and Development in French and United States Manufacturing Firms, in B. van Ark and K. Wagner (eds.), International Productivity Di erences: Measurement and Explanations, Amsterdam: Elsevier Science. Ruud, Paul A. (2000). An Introduction to Classical Econometric Theory, Oxford: Oxford University Press. 7