A Note on the Correlated Random Coefficient Model. Christophe Kolodziejczyk

Similar documents
4.8 Instrumental Variables

Conditions for the Existence of Control Functions in Nonseparable Simultaneous Equations Models 1

Single-Equation GMM: Endogeneity Bias

The relationship between treatment parameters within a latent variable framework

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Lecture Notes on Measurement Error

Finite Sample Performance of A Minimum Distance Estimator Under Weak Instruments

Ability Bias, Errors in Variables and Sibling Methods. James J. Heckman University of Chicago Econ 312 This draft, May 26, 2006

Chapter 6: Endogeneity and Instrumental Variables (IV) estimator

Control Functions in Nonseparable Simultaneous Equations Models 1

( ) : : (CUHIES2000), Heckman Vytlacil (1999, 2000, 2001),Carneiro, Heckman Vytlacil (2001) Carneiro (2002) Carneiro, Heckman Vytlacil (2001) :

Chapter 2. Dynamic panel data models

Problem Set #6: OLS. Economics 835: Econometrics. Fall 2012

A Practical Test for Strict Exogeneity in Linear Panel Data Models with Fixed Effects

Unconditional Quantile Regression with Endogenous Regressors

Discrete Dependent Variable Models

Estimation of Dynamic Nonlinear Random E ects Models with Unbalanced Panels.

Weak Stochastic Increasingness, Rank Exchangeability, and Partial Identification of The Distribution of Treatment Effects

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Economics 472. Lecture 10. where we will refer to y t as a m-vector of endogenous variables, x t as a q-vector of exogenous variables,

Econometric Analysis of Cross Section and Panel Data

Limited Dependent Variables and Panel Data

Panel Threshold Regression Models with Endogenous Threshold Variables

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Flexible Estimation of Treatment Effect Parameters

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Pseudo panels and repeated cross-sections

1. Basic Model of Labor Supply

Large Sample Properties & Simulation

1 Estimation of Persistent Dynamic Panel Data. Motivation

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

Policy-Relevant Treatment Effects

Exercise sheet 6 Models with endogenous explanatory variables

Introduction to Linear Regression Analysis Interpretation of Results

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

Four Parameters of Interest in the Evaluation. of Social Programs. James J. Heckman Justin L. Tobias Edward Vytlacil

More on Roy Model of Self-Selection

Instrumental Variables

Econ 673: Microeconometrics Chapter 12: Estimating Treatment Effects. The Problem

Bias Correction Methods for Dynamic Panel Data Models with Fixed Effects

INFERENCE APPROACHES FOR INSTRUMENTAL VARIABLE QUANTILE REGRESSION. 1. Introduction

On GMM Estimation and Inference with Bootstrap Bias-Correction in Linear Panel Data Models

Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates

SINGLE-STEP ESTIMATION OF A PARTIALLY LINEAR MODEL

Lecture Notes Part 7: Systems of Equations

ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008

Seemingly Unrelated Regressions in Panel Models. Presented by Catherine Keppel, Michael Anreiter and Michael Greinecker

More on Specification and Data Issues

Dynamic Panel of Count Data with Initial Conditions and Correlated Random Effects. : Application for Health Data. Sungjoo Yoon

Econometrics. 7) Endogeneity

Lecture 8: Instrumental Variables Estimation

Recitation Notes 6. Konrad Menzel. October 22, 2006

DEPARTMENT OF ECONOMICS

Empirical approaches in public economics

Introduction: structural econometrics. Jean-Marc Robin

Simplified Implementation of the Heckman Estimator of the Dynamic Probit Model and a Comparison with Alternative Estimators

Instrumental Variables

Introduction to Linear Regression Analysis

A Monte Carlo Comparison of Various Semiparametric Type-3 Tobit Estimators

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

Econometrics of Panel Data

Linear dynamic panel data models

Chapter 1 Introduction. What are longitudinal and panel data? Benefits and drawbacks of longitudinal data Longitudinal data models Historical notes

Exercise Sheet 4 Instrumental Variables and Two Stage Least Squares Estimation

Computation Of Asymptotic Distribution. For Semiparametric GMM Estimators. Hidehiko Ichimura. Graduate School of Public Policy

Online Appendix to Yes, But What s the Mechanism? (Don t Expect an Easy Answer) John G. Bullock, Donald P. Green, and Shang E. Ha

Robustness of Logit Analysis: Unobserved Heterogeneity and Misspecified Disturbances

x i = 1 yi 2 = 55 with N = 30. Use the above sample information to answer all the following questions. Show explicitly all formulas and calculations.

Dealing With Endogeneity

12E016. Econometric Methods II 6 ECTS. Overview and Objectives

Has the Family Planning Policy Improved the Quality of the Chinese New. Generation? Yingyao Hu University of Texas at Austin

Intermediate Econometrics

Estimation of a Local-Aggregate Network Model with. Sampled Networks

Additional Material for Estimating the Technology of Cognitive and Noncognitive Skill Formation (Cuttings from the Web Appendix)

Chapter 6 Stochastic Regressors

Econometrics of Panel Data

Final Exam. Economics 835: Econometrics. Fall 2010

Markov-Switching Models with Endogenous Explanatory Variables. Chang-Jin Kim 1

Part VII. Accounting for the Endogeneity of Schooling. Endogeneity of schooling Mean growth rate of earnings Mean growth rate Selection bias Summary

Econometrics of Panel Data

α version (only brief introduction so far)

Identification of Regression Models with Misclassified and Endogenous Binary Regressor

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Econometrics Master in Business and Quantitative Methods

ECON Introductory Econometrics. Lecture 16: Instrumental variables

Econometrics (60 points) as the multivariate regression of Y on X 1 and X 2? [6 points]

Classical Measurement Error with Several Regressors

ECONOMETRICS. Bruce E. Hansen. c2000, 2001, 2002, 2003, University of Wisconsin

Recitation Notes 5. Konrad Menzel. October 13, 2006

Changes in the Transitory Variance of Income Components and their Impact on Family Income Instability

Econometrics II. Nonstandard Standard Error Issues: A Guide for the. Practitioner

GMM estimation of spatial panels

Endogenous Treatment Effects for Count Data Models with Endogenous Participation or Sample Selection

Labor Supply and the Two-Step Estimator

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Omitted Variable Biases of OLS and Spatial Lag Models

Estimation of Panel Data Models with Binary Indicators when Treatment Effects are not Constant over Time. Audrey Laporte a,*, Frank Windmeijer b

Transcription:

CAM Centre for Applied Microeconometrics Department of Economics University of Copenhagen http://www.econ.ku.dk/cam/ A Note on the Correlated Random Coefficient Model Christophe Kolodziejczyk 2006-10 The activities of CAM are financed by a grant from The Danish National Research Foundation

A Note on the Correlated Random Coecient Model Christophe Kolodziejczyk Center for Applied Microeconometrics, Department of Economics, University of Copenhagen Abstract In this note we derive the bias of the OLS estimator for a correlated random coecient model with one random coecient, but which is correlated with a binary variable. We provide set-identication to the parameters of interest of the model. We also show how to reduce the bias of the estimator. Keywords: Correlated random coecient model, Discrete choice, Bias, Cross-section data JEL Codes: C13, C21 I thank Martin Browning and Ramses Abul Naga for their very useful suggestion. I am responsible for any error. 1

A Note on the Correlated Random Coecient Model Christophe Kolodziejczyk 1159 words 1 Introduction Many individual economic decisions involve private information which by denition are not available to the researcher. Unobserved heterogeneity may inuence decisions over the lifecycle and both the timing of these events and the size of these idiosyncratic parameters will be intimately linked. This poses some methodological problems for the estimation of models where we have unobserved heterogeneity correlated with variables that reect past decisions. Often we will have to estimate correlated random coecient models. For relevant examples of this phenomenon, one can refer to Card [2] and Browning and Lechene [1]. Card, for example, shows with a life-cycle model that the number of years of schooling depends not on only on the ability of the individuals but also on their specic returns to education. If the returns to schooling vary among the population we will have to estimate an earnings equation where the idiosyncratic returns to education are correlated with the number of schooling years. If we estimate such a model by OLS, we will get biased estimate because people with lower returns will tend to invest less in education. The econometrics literature has recently been interested in the treatment of correlated random coecient models. Card proposes instrumental variables estimators for his model. Heckman and Vitlacyl [3] and Wooldridge [5], [6] provide identifying assumptions if one is willing to use instrumental variables techniques. Unfortunately, the assumptions proposed might be dicult to fulll in the presence of endogenous binary variables. It seems on the other hand to be fruitful to investigate the size of the bias of traditional estimators and provide bounds to the parameters of interest. The idea of this note is to derive the bias of the OLS estimator of a correlated random coecient model. The model we investigate involves one correlated random coecient and 2

a binary variable associated with it. We use this latter characteristic of the model and the economic predictions to provide set-identication of the parameters of interest. The results provided in this paper may be useful for practitioners as these kind of models appear frequently in economics. In section 2, we present the model which consists of a demand equation conditioned on a binary variable, and we discuss the identication of the model with instrumental variables. In section 4, we derive the bias of the OLS estimator. We show that we can give upper or lowerbounds to the parameters of interest of the model. We also show that we can reduce the bias. Section 5 gives some concluding comments. 2 A Correlated Random Coecient model In this section, we discuss the estimation of a correlated random coecient model (CRCM) with cross-section data. We consider the case where there is only one correlated coecient. Moreover, the random coecient is correlated with a discrete variable. The aim of the estimation is to recover the structural parameters of the model, particularly the mean of the random coecient. Consider for example the following model c w;i = + x i + i h i + i (1) with E [ i j 1; x i ; h i ; i h i ] = 0 and h i is a discrete endogenous variable. Equation (1) is a conditional demand equation where c w i is work-related expenses, x i is total expenditures and h i the work status (see Kolodziejczyk [4] for further details on this model). Since i varies among the population, we can write i = + v i (2) with E ( i ) = and E (v i ) = 0: Since v i is correlated with h i, then the OLS estimator of a regression of c w i on x i and h i will be biased: 3

To see this we can rewrite the model as c w;i = + x i + h i +! i + i ; (3) where we have dened! i v i h i. The variable h i is endogenous, and we assume it can be expressed by the following reduced form equation h i = 1 [g (v i ; q 0 i) > 0] (4) where g () is some non-linear function and q i is a vector of explanatory variables. Equation (4) summarizes the comparison of the utility level obtained in both states and the net gain of taking action h = 1 depends on v i. We assume that that the net gain of action h = 1 g is monotonically decreasing with v i, i.e. @g=@v i < 0: (5) What are the assumptions needed to identify? Heckman and Vitlacyl [3] and Wooldridge [5], [6] nd identifying assumptions for linear versions of this model which allow to use instrumental variables techniques. In order to estimate the model with standard instrumental variables techniques, we have to assume the covariance between the xed costs and the decision to retire is constant. Rewrite the model as i = z 0 i + v i h i = w 0 i h + e i where z i and w i are instruments for i and h i respectively. We need the following assumption to identify E [v i e i j z i ; w i ] = k (6) where k is a constant. Heckman and Vytlacil (op. cit.) propose GMM estimators. Unfor- 4

tunately, because of the non-linearity of the model assumption (6) will probably not hold. Since h i is a non-linear function, the model will be nonseparable. Furthermore, it might be dicult in some context to nd good instruments. Therefore, we prefer to consider the OLS estimator of this random coecient model, compute its bias and try to see whether we can sign it. 3 Bias of the OLS estimator In this section we derive the bias of the OLS estimator of model (1). We rewrite equation (3) as c w;i = x i + h i +! i + i, (7) where the stars denotes variables in deviations from their mean. We assume x i is measured without error and we dene x 0 i (x i ; h i ) and 0 (; ) : We maintain the assumption that E [j x; h;!] = E [] = 0: The probability limit of the OLS estimator is equal to plim OLS = E (x 0 x) 1 E (x 0!) where E (x 0 x) = 0 B @ V (x) cov (x; h) cov (x; h) V (h) 1 C A E (x 0!) = 0 B @ cov (x;!) cov (h;!) 1 C A : 5

The elements of these two matrices can be rewritten as V (h) = P (h = 1) (1 P (h = 1)) ; cov (x; h) = [E (x) E (xj h = 1)] P (h = 1) ; cov (x;!) = fcov (x; vj h = 1) [E (x) E (xj h = 1)] E (vj h = 1)g P (h = 1) ; and cov (h;!) = E (vj h = 1) (1 P (h = 1)) P (h = 1) which result from the following probability limits plim N 1 =N = P (h = 1) ; plim 1 X x 2 N i = E x 2 ; plim 1 X vi h i = E (vj h = 1) ; N 1 plim 1 X xi h i N 1 = E (xj h = 1) ; and plim 1 X vi x i h i N 1 = E (v 0 xj h = 1) where we have dened N as the number of observations and N 1 as the number of observations where h = 1: Using the fact that E (x) = E (xj h = 1) P (h = 1) + E (xj h = 0) P (h = 0) ; the bias can be rewritten as plim OLS = cov (x; vj h = 1) Pr (h = 1) P (h = 0) V (x) P (h = 0) [E (x) E (xj h = 1)] 2 P (h = 1) (8) and cov (x; vj h = 1) [E (x) E (xj h = 1)] Pr (h = 1) plim OLS = E (vj h = 1) + V (x) P (h = 0) [E (x) E (xj h = 1)] 2 P (h = 1) (9) 6

We make the following additional assumptions: E (xj h = 1) > E (xj h = 0) (10) and cov [vxj h = 1] > 0 (11) We also assume that v is symmetrically distributed around its mean. Proposition 1 Under assumptions (5),(10) and (11), the asymptotic biases of OLS and OLS are respectively positive and negative. Proof. By the Cauchy-Schwarz inequality we have V (x) [E (xj h = 0) E (xj h = 1)] 2 P (h = 0) P (h = 1) 0: (12) Under assumptions (11), the right hand-side of (8) is positive. Furthermore, under assumptions (5) and the symmetry assumption for v i we have E [vj h = 1] < 0. In addition with assumption (10) this implies that the right hand-side of (9) is negative. These conditions imply that the OLS estimator of will be downward biased and will give a lower bound to this parameter. The OLS estimator of will be upward biased. The OLS estimator cannot identify and, but we can give a lower bound to and an upper bound to : If we estimate the demand system on the population where h = 0, we will obtain an estimator of, which we denote by 0 OLS. Since we no longer have a correlated random coecient model, this estimator is consistent. Then we can rewrite equation (9) as plim OLS [E (x) E (xj h = 1)] Pr (h = 0) plim OLS 0 OLS = + E (vj h = 1) : (13) All the objects on the left-hand side of equation (13) are identied and can be estimated consistently. This allows a reduction of the asymptotic bias of OLS by using the following 7

estimator OLS N (x x 1 ) OLS 0 OLS N N 1 where x = N P 1 N i=1 x P i and x 1 = N1 1 N i=1 x ih i. 4 Conclusion In this note we have discussed the estimation of a correlated random coecient model with a unique random coecient correlated with a binary variable. We have shown that the instrumental variables methods will unlikely give consistent estimates of the parameters of interest. Therefore we have derived the bias of the OLS estimator of the model and provided some set identication, i.e. we were able to give upper-bounds and lower-bounds to these parameters. Furthermore the economics of the problem helped us to provide an estimator whose bias is lower. This may provide useful information for practitioners who would like to estimate such models which often appear in economics. References [1] Browning, Martin, Lechene, Valerie (2003), "Children and Demand: Direct and Non- Direct Eects", Review of Economics of the Household 1, 9-31. [2] Card, David (2001), "Estimating the Return to Schooling: Progress on Some Persistent Econometric Problems", Econometrica 69, 1127-1160 [3] Heckman, James, Vytlacil, Edward (1998), "Instrumental Variables Methods for the Correlated Coecient Model: Estimating the Average Rate of Return to Schooling When the Return is Correlated with Schooling", Journal of Human Resources 30, 974, 987. [4] Kolodziejczyk, Christophe (2005), "Retirement, Fixed Costs to Work and Homeproduction: an empirical analysis.", Mimeo 8

[5] Wooldridge, Jerey, "Econometric Analysis of Cross-section and Panel Data", Cambridge, MIT Press, 2002 [6] Wooldridge, Jerey M., "Further results on instrumental variables estimation of average treatment eects in the correlated random coecient model", Economics Letters 79, 185-191. 9