Limited Dependent Variables and Panel Data

Similar documents
-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model

Non-linear panel data modeling

Testing for time-invariant unobserved heterogeneity in nonlinear panel-data models

Econometric Analysis of Cross Section and Panel Data

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Women. Sheng-Kai Chang. Abstract. In this paper a computationally practical simulation estimator is proposed for the twotiered

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Limited Dependent Variables and Panel Data

Duration dependence and timevarying variables in discrete time duration models

Departamento de Economía Universidad de Chile

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

Identification and Estimation of Nonlinear Dynamic Panel Data. Models with Unobserved Covariates

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

(a) (3 points) Construct a 95% confidence interval for β 2 in Equation 1.

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Applied Microeconometrics (L5): Panel Data-Basics

A Model of Human Capital Accumulation and Occupational Choices. A simplified version of Keane and Wolpin (JPE, 1997)

Applied Health Economics (for B.Sc.)

Identification and Estimation of Nonlinear Dynamic Panel Data. Models with Unobserved Covariates

A nonparametric test for path dependence in discrete panel data

Limited Dependent Variable Models II

Christopher Dougherty London School of Economics and Political Science

Economics 582 Random Effects Estimation

Truncation and Censoring

A dynamic model for binary panel data with unobserved heterogeneity admitting a n-consistent conditional estimator

Models of Qualitative Binary Response

Estimation in the Fixed Effects Ordered Logit Model. Chris Muris (SFU)

Intermediate Econometrics

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

WISE International Masters

Econometric Analysis of Panel Data. Final Examination: Spring 2013

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

xtunbalmd: Dynamic Binary Random E ects Models Estimation with Unbalanced Panels

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Simplified Implementation of the Heckman Estimator of the Dynamic Probit Model and a Comparison with Alternative Estimators

Econometric Analysis of Panel Data. Final Examination: Spring 2018

Introduction to Econometrics

Generalized Dynamic Panel Data Models with Random Effects for Cross-Section and Time

ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria

Testing Random Effects in Two-Way Spatial Panel Data Models

Discrete Choice Modeling

Econometrics of Panel Data

Short Questions (Do two out of three) 15 points each

Fixed Effects Models for Panel Data. December 1, 2014

Industrial Organization II (ECO 2901) Winter Victor Aguirregabiria. Problem Set #1 Due of Friday, March 22, 2013

Next, we discuss econometric methods that can be used to estimate panel data models.

Economics 520. Lecture Note 19: Hypothesis Testing via the Neyman-Pearson Lemma CB 8.1,

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Lecture 6: Discrete Choice: Qualitative Response

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

Estimation of some nonlinear panel data models with both. time-varying and time-invariant explanatory variables

Linear Regression With Special Variables


Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Wage Inequality, Labor Market Participation and Unemployment

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Sample Selection. In a UW Econ Ph.D. Model. April 25, Kevin Kasberg and Alex Stevens ECON 5360 Dr. Aadland

Testing for Regime Switching: A Comment

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

1. The Multivariate Classical Linear Regression Model

8 Nominal and Ordinal Logistic Regression

Specification Test on Mixed Logit Models

Exam D0M61A Advanced econometrics

Introduction to General and Generalized Linear Models

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Econ 673: Microeconometrics

Semiparametric Identification in Panel Data Discrete Response Models

Ordered Response and Multinomial Logit Estimation

Economics 2450A: Public Economics Section 8: Optimal Minimum Wage and Introduction to Capital Taxation

New York University Department of Economics. Applied Statistics and Econometrics G Spring 2013

Notes on Heterogeneity, Aggregation, and Market Wage Functions: An Empirical Model of Self-Selection in the Labor Market

Diagnostics of Linear Regression

Missing dependent variables in panel data models

Partial effects in fixed effects models

Standard Errors & Confidence Intervals. N(0, I( β) 1 ), I( β) = [ 2 l(β, φ; y) β i β β= β j

Econometrics of Panel Data

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

Estimating Panel Data Models in the Presence of Endogeneity and Selection

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Discrete panel data. Michel Bierlaire

Commuting paradox revisited - Compensation for commutes in two-earner households? Konferenz Verkehrsökonomik und -politik 2017

G. S. Maddala Kajal Lahiri. WILEY A John Wiley and Sons, Ltd., Publication

Econ 510 B. Brown Spring 2014 Final Exam Answers

Munich Lecture Series 2 Non-linear panel data models: Binary response and ordered choice models and bias-corrected fixed effects models

IDENTIFICATION OF THE BINARY CHOICE MODEL WITH MISCLASSIFICATION

Stock Sampling with Interval-Censored Elapsed Duration: A Monte Carlo Analysis

Group comparisons in logit and probit using predicted probabilities 1

Logistic regression: Why we often can do what we think we can do. Maarten Buis 19 th UK Stata Users Group meeting, 10 Sept. 2015

ARIMA Modelling and Forecasting

Classification. Chapter Introduction. 6.2 The Bayes classifier

Lecture 1. Behavioral Models Multinomial Logit: Power and limitations. Cinzia Cirillo

AGEC 661 Note Fourteen

Transcription:

and Panel Data June 24 th, 2009

Structure 1 2

Many economic questions involve the explanation of binary variables, e.g.: explaining the participation of women in the labor market explaining retirement decisions studying the effect of unemployment on the level of satisfaction (see Winkelmann & Winkelmann (1998)) are mainly used in cross-sectional panels (large N, small T ), i.e. for microeconometric questions.

Introduction Binary choice models are formulated in terms of a latent variable: e.g. the choice of participating in the labor market (the variable of interest) depends on whether the offered wage is greater than the unobserved reservation wage of the given person (the latent variable) Formally, y it = { 1 if y it = x it β + u it > 0 0 if y it = x it β + u it 0 where x it is a vector of explanatory variables for person i at time t, where β is a vector of coefficients to be estimated and where u it is a error which will be specified below.

Assumptions and Problems of the Fixed Effects Estimator Assume that u it = µ i + ν it, where: µ i is an unknown parameter to be estimated ν it is a remainder error which is assumed to be i.i.d. across individuals and time with a symmetric cumulative distribution function F ( ) The incidental parameters problem: In the linear case, the estimators for µ i are not consistent if N but T remains fixed. Nevertheless, β can be estimated consistently. With qualitative data, the estimators for β and µ i are however not independent both µ i and β are inconsistent in cross-sectional panels

Maximum Likelihood Estimation The log-likelihood function is given by: log L (β, µ 1,..., µ N ) = y it log [Pr {y it = 1}] + i,t + i,t (1 y it ) log [Pr {y it = 0}] where Pr {y it = 1} = Pr {y it > 0} = Pr { µ i + x itβ + ν it > 0 } = Pr { ν it < µ i + x itβ } = F ( µ i + x itβ ) For small T and large N, maximizing the log-likelihood function yields however inconsistent estimators!

Conditional Maximum Likelihood Estimation In order to obtain a consistent estimator also for small T, a sufficient statistic t i for µ i is needed: Conditional on t i, the likelihood contribution of individual i does not depend any longer on µ i, but only on β, i.e. the density is such that f (y i1,..., y it t i, µ i, β) = f (y i1,..., y it t i, β) Maximizing the conditional log-likelihood function based on f (y i1,..., y it t i, β) with respect to β yields a consistent estimator for β. However: For the probit model (remainder error normally distributed), there is no sufficient statistic for µ i!!! It is not possible to estimate β consistently in cross-sectional panels!

The Fixed Effects Logit Model - An Example For the fixed effects logit model, a sufficient statistic for µ i is t i = t y it. (Chamberlain (1980)) Let T = 2: If t i = 0, there is only one possible event, namely y i1 = 0 and y i2 = 0. (Similarly, for t i = 2) Hence, individuals who do not change status over the time horizon do not enter the conditional log-likelihood function. It is sufficient to consider individuals with t i = 1 with the possible events (y i1, y i2 ) = (0, 1) and (y i1, y i2 ) = (1, 0): Using Bayes rule, it holds that: Pr {(0, 1) t 1 = 1, µ i, β} = Pr {(0, 1) µ i, β} Pr {(0, 1) µ i, β} + Pr {(1, 0) µ i, β} Moreover, Pr {(0, 1) µ i, β} = Pr {y i1 = 0 µ i, β} Pr {y i2 = 1 µ i, β}

The Fixed Effects Logit Model - An Example In the logit model it holds that and correspondingly: Pr {y it = 1 µ i, β} = exp {µ i + x it β} 1 + exp { µ i + x it β} Pr {y it = 0 µ i, β} = 1 exp {µ i + x it β} 1 + exp { µ i + x it β} = 1 1 + exp { µ i + x it β} Inserting this into the expression for Pr {(0, 1) t i = 1, µ i, β} yields: Pr {(0, 1) t i = 1, µ i, β} = exp { (x i2 x i1 ) β } 1 + exp { (x i2 x i1 ) β } and correspondingly: Pr {(1, 0) t i = 1, µ i, β} = 1 Pr {(0, 1) t i = 1, µ i, β}

The Fixed Effects Logit Model - An Example Hence, in the case of T = 2 the fixed effects logit model is equivalent to a simple cross-sectional logit model using an endogenous variable ỹ i which is one if y it changes from 0 to 1 between t = 1 and t = 2 and zero if y it changes from 1 to 0 and the difference x i2 x i1 as the explanatory variable

Testing for Fixed Effects Testing fixed individual effects versus no fixed individual effects can be done using a Hausmann test: The logit estimator ignoring individual effects is consistent and efficient under the null hypothesis of no individual effects and inconsistent under the alternative The fixed effects logit estimator is consistent under both hypotheses, but inefficient if there are no individual effects Under the null hypothesis the test statistic is χ 2 K distributed

Assumptions and Problems of the Random Effects Estimator Usually, a probit specification is used for the random effects estimator, i.e. it is assumed that u it = µ i + ν it, where µ i is i.i.d. with N ( 0, σ 2 µ) and where νit is i.i.d. with N ( 0, σ 2 ν). Computational Problems: Because of the individual random effect, the observations from individual i are correlated over time, i.e. E (u it u is ) = σ 2 µ for t s In order to obtain the probability contribution of any individual i, T -dimensional integrals are necessary this is not feasible if T is large

Assumptions and Problems of the Random Effects Estimator Conditioning on the individual effects, this reduces to a one dimensional integral: = = f (y i1,..., y it x i1,..., x it, β) f (y i1,..., y it x i1,..., x it, µ i, β) f (µ i ) dµ i [ ] f (y it x it, µ i, β) f (µ i ) dµ i t which can be approximated through numerical integration (e.g. Gaussian quadrature procedure)

Simulation Estimation Unfeasible integrals can be replaced by unbiased Monte Carlo probability simulators (method of simulated moments (MSM) estimator) Therefore, it is necessary to simulate M T potential choice sequences for each individual, where M is the number of possible choices each individual faces in any period also not feasible if T is large The MSM estimator is asymptotically as efficient as maximum likelihood

or Unobserved Heterogeneity Frequently, it is observed that people with a long history of unemployment are less likely to find a job. This might have two reasons: : People who are unemployed for a long time loose their skills or become discouraged from searching a job Unobserved Heterogeneity: There is a selection mechanism present, meaning that people who share certain characteristics are those who are long-term unemployed

Testing for Test for γ = 0 in the model y it = x it β + γy it 1 + µ i + ν it If γ = 0 is rejected, this can either be due to true state dependence or serially correlated remainder errors ν it In order to find out whether true state dependence is present, it is possible to condition on lagged values of the explanatory variables without conditioning on the lagged state If conditioning on the lagged explanatory variables does not change the probability of y it = 1, there is no true state dependence in the data If conditioning on the lagged explanatory variables changes the probability of y it = 1, there is true state dependence in the data