Limited Dependent Variables and Panel Data

Similar documents
Non-linear panel data modeling

Partial effects in fixed effects models

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Limited Dependent Variables and Panel Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

ECON 594: Lecture #6

Review of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.

Workshop for empirical trade analysis. December 2015 Bangkok, Thailand

ERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data

Econometrics Lecture 5: Limited Dependent Variable Models: Logit and Probit

Econometrics II. Seppo Pynnönen. Spring Department of Mathematics and Statistics, University of Vaasa, Finland

Binary Choice Models Probit & Logit. = 0 with Pr = 0 = 1. decision-making purchase of durable consumer products unemployment

Truncation and Censoring

Econometric Analysis of Cross Section and Panel Data

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

Wageningen Summer School in Econometrics. The Bayesian Approach in Theory and Practice

Applied Econometrics Lecture 1

Binary Dependent Variables

Single-level Models for Binary Responses

Linear Regression With Special Variables

Chapter 9 Regression with a Binary Dependent Variable. Multiple Choice. 1) The binary dependent variable model is an example of a

ECONOMETRICS HONOR S EXAM REVIEW SESSION

Lecture 3.1 Basic Logistic LDA

Tobit and Selection Models

Economics 671: Applied Econometrics Department of Economics, Finance and Legal Studies University of Alabama

ECON Introductory Econometrics. Lecture 11: Binary dependent variables

Modeling Binary Outcomes: Logit and Probit Models

Gibbs Sampling in Latent Variable Models #1

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

ECONOMETFUCS FIELD EXAM Michigan State University May 11, 2007

Max. Likelihood Estimation. Outline. Econometrics II. Ricardo Mora. Notes. Notes

Gibbs Sampling in Endogenous Variables Models

PS 271B: Quantitative Methods II Lecture Notes

Munich Lecture Series 2 Non-linear panel data models: Binary response and ordered choice models and bias-corrected fixed effects models

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Applied Health Economics (for B.Sc.)

Generalized Linear Models Introduction

Lecture notes to Chapter 11, Regression with binary dependent variables - probit and logit regression

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

Lecture 12: Application of Maximum Likelihood Estimation:Truncation, Censoring, and Corner Solutions

Discrete Choice Modeling

Jeffrey M. Wooldridge Michigan State University

Lecture 8 Panel Data

SIMPLE SOLUTIONS TO THE INITIAL CONDITIONS PROBLEM IN DYNAMIC, NONLINEAR PANEL DATA MODELS WITH UNOBSERVED HETEROGENEITY

i (x i x) 2 1 N i x i(y i y) Var(x) = P (x 1 x) Var(x)

Specification Tests in Panel Data Models Using Artificial Regressions

Estimation in the Fixed Effects Ordered Logit Model. Chris Muris (SFU)

Econometrics of Panel Data

Stat 5101 Lecture Notes

Sample Selection. In a UW Econ Ph.D. Model. April 25, Kevin Kasberg and Alex Stevens ECON 5360 Dr. Aadland

Latent Variable Models for Binary Data. Suppose that for a given vector of explanatory variables x, the latent

Identification in Discrete Choice Models with Fixed Effects

Panel data methods for policy analysis

2. We care about proportion for categorical variable, but average for numerical one.

Limited Dependent Panel Data Models: A comparative Analysis of classical and Bayesian Inference among Econometric Packages

AGEC 661 Note Fourteen

Econometrics of Panel Data

Limited Dependent Variable Models II

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model

Chap 2. Linear Classifiers (FTH, ) Yongdai Kim Seoul National University

Microeconometrics. C. Hsiao (2014), Analysis of Panel Data, 3rd edition. Cambridge, University Press.

Panel Models for Non-Continuous Dependent Variables

Using the Delta Method to Construct Confidence Intervals for Predicted Probabilities, Rates, and Discrete Changes 1

xtunbalmd: Dynamic Binary Random E ects Models Estimation with Unbalanced Panels

Bootstrapping a conditional moments test for normality after tobit estimation

Applied Economics. Regression with a Binary Dependent Variable. Department of Economics Universidad Carlos III de Madrid

MLE and GMM. Li Zhao, SJTU. Spring, Li Zhao MLE and GMM 1 / 22

Semiparametric Generalized Linear Models

Chapter 11. Regression with a Binary Dependent Variable

Comparing groups using predicted probabilities

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

Maximum Likelihood Estimation

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2016 Instructor: Victor Aguirregabiria

Econometrics I Lecture 3: The Simple Linear Regression Model

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Winter 2014 Instructor: Victor Aguirregabiria

A dynamic model for binary panel data with unobserved heterogeneity admitting a n-consistent conditional estimator

STA216: Generalized Linear Models. Lecture 1. Review and Introduction

Lecture 2: Categorical Variable. A nice book about categorical variable is An Introduction to Categorical Data Analysis authored by Alan Agresti

What s New in Econometrics? Lecture 14 Quantile Methods

Discrete Choice Modeling

NELS 88. Latent Response Variable Formulation Versus Probability Curve Formulation

A Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

MLE for a logit model

Machine Learning Linear Classification. Prof. Matteo Matteucci

Lecture 11/12. Roy Model, MTE, Structural Estimation

Problem Set 10: Panel Data

Lecture 4: Linear panel models

Bayesian Inference in GLMs. Frequentists typically base inferences on MLEs, asymptotic confidence

covariance between any two observations

Labor Supply and the Two-Step Estimator

ECONOMETRICS II (ECO 2401S) University of Toronto. Department of Economics. Spring 2013 Instructor: Victor Aguirregabiria

Commuting paradox revisited - Compensation for commutes in two-earner households? Konferenz Verkehrsökonomik und -politik 2017

The Multilevel Logit Model for Binary Dependent Variables Marco R. Steenbergen

FNCE 926 Empirical Methods in CF

Binary Models with Endogenous Explanatory Variables

,..., θ(2),..., θ(n)

CENSORED DATA AND CENSORED NORMAL REGRESSION

Rewrap ECON November 18, () Rewrap ECON 4135 November 18, / 35

Transcription:

Limited Dependent Variables and Panel Data Logit, Probit and Friends Benjamin Bittschi Sebastian Koch

Outline Binary dependent variables Logit Fixed Effects Models Probit Random Effects Models Censored dependent variables Empirical application timber supply

Binary dependent variables Y it =1: event happens Y it =0: event does not happen P it : i s probability for a 1 or zero at time t P it = Pr[y it =1] = E(y it x it ) = F (x it ß) Linear probability model: F (x it ß)= x it ßor F(w)=w > usual panel methods apply but ŷ it is not guaranteed to lie in the unit interval

Binary dependent variables II Solution to the unity problem : Logistic or Normal c.d.f that constrain F (w) between 0 and 1. Logit or w e F( w) = 1+ e Probit Model w w u 1 2 F( w) = e 2π 2 du = φ( w)

Binary dependent variables III Example: Neoclassical labour supply function Worker participation in the labour force: y it = 1 if y it * > 0 y it = 0 if y it * 0 (y it *: Δ offered wage / unobserved reservation wage) where: y it *= x it ß+u it, so that Pr[y it =1] = Pr[y it *>0] = Pr[u it > x it ß]= F (x it ß)

F.E. Model I Incidental parameters problem F.E. Model: y it *= x it ß+μ i +ν it with Pr[y it =1] = Pr[y it *>0] = Pr[ν it > x it ß μ i ] = F(x it ß+μ i ) μ i and ß unknown parameters and as N > for fixed T, μ i increase with N > thus μ i cannot be consistently estimated for fixed T > incidental parameters problem Neyman, Scott (1948) Lancaster (2000)

F.E. Model II In the linear panel data model ßwas estimated consistently by getting rid of μ i > within transformation Possible because mle of ßand μ i are asymptotically independent => not longer the case for qualitative independent variables with fixed T Hsiao (2003: 194f) shows for a simple model with N > and T=2: plim (N > ) ß=2ß Greene (2004) shows: N=1000, T=2, 200 reps, 100% bias N=1000, T=10, 200 reps, 16% bias N=1000, T=20, 200 reps, 6.9% bias

F.E. Model III Solution: Find a minimal sufficient statistic for μ i T For the logit model this is according to Chamberlain (1980). t= 1 y it Therefore: Maximizing conditional likelihood function: N L c = Pr(y i1,,y it ) > conditional logit estimates for ß i= 1 For F.E. probit models: T t= 1 y it We cannot find simple functions for the parameters of interest that are independent of the parameters μ i. There does NOT exist a consistent estimator of ßfor the fixed effects probit models.

RE models R.E. probit model u it = μ i + ν it μ i IIN(0,σ² μ ) ν it IIN(0,σ² ν ) Independent of each other and of x it Since E(u it u is )= σ² μ for t s, the joint likelihood of (y 1t,,y Nt ) can no longer be written as the product of the marginal likelihoods of the y it. => complication of the derivation of maximum likelihood which will now involve T dimensional integrals. L i = Pr[ yi 1, yi2,..., yit / X ] f ( ui 1, ui2,..., uit ) dui 1dui2... duit =...

RE models II Maximization of L i w.r.t. to ßand σ μ infeasible if T is big. Trick: Write the joint density function as a product of the conditional density and the marginal density of μ i. By conditioning on the individual effects, the T dimensional integral reduces to: f T = i1, ui2,..., uit ) f1( uit µ i ) f2( µ i ) t= 1 ( u dµ i

Part II: applied But before: Tobit Censored Truncated Corner Solution Count Data Tobit II Hurdle model

Tobit Models: Censored, truncated and corner solution models Source: Long (1997)

Concept of latent variable Distribution of y* given x in the binary response model Source: Long (1997)

Regression without censoring Source: Long (1997)

Regression with Censoring and Truncation Source: Long (1997)

Application: Timber Supply T = 60 months N = 12.000 members 720.000 observations (95% of obs.: supply = 0) Variable of interest: Timber supply Possibly influenced by: price (main assortment) wind storm (dummy) forest land (in ha) structure (agrar vs forest) sex, age of supplier seasons (dummy) etc. (not yet available)

Overview: aggreated monthly supply

Tobit and Tobit II Tobit: (Corner Solution Model censored model) Tobit II: (Two step or hurdle model) xttobit: xtlogit: (or xtprobit) xtreg: Interpretation? > coefficients of y* not y Interpretation? > latent variable, odds ratio, standard Source: Baltagi (2008)

TOBIT: xttobit

Interpretation Tobit Tobit: (Corner solution model censored model) Interpretation: For a unit change in x k, there is an expected change of ß k units in y*, holding all other variables constant. In the above case: A 1% increase in price, yields in an expected 11.2 % increase in the propensity to supply ( Price elasticity of supply) Source: Long (1997)

Tobit and Tobit II Tobit: (Corner Solution Model censored model) Tobit II: (Two step or hurdle model) xttobit: xtlogit: (or xtprobit) xtreg: Interpretation? > coefficients of y* not y Interpretation? > latent variable, odds ratio, standard Source: Baltagi (2008)

TOBIT II: xtlogit

TOBIT II: xtlogit, odds ratio

Tobit II: Interpretation: xtlogit: For a unit increase in x k, there is an expected logit change by ß k, holding all other variables constant. > Meaning not intuitive. Better: Odds Ratio: e (ßk * delta) For each additional unit (delta =1) in x k, the odds are expected to change by a factor of e (ßk * delta), holding all other variables constant. I.e. in the above case: An increase in price by 10 Euro (delta =10), increases the odds of becoming a supplier by a factor of 1.69. (= 1.054 10 when looking at the xtlogit odds ratio coefficents or exp( 0.052 *10) when looking at the xtlogit output.) xtreg: xtreg interpretation is straight forward. Be aware that the regression only includes those forest owners that indeed do supply. (reduced sample). Source: Long(2008)

TOBIT II: xtreg (re)

TOBIT II : xtreg (re) case 2: forest land < 50 ha

Thanks!

BACKUP SLIDES

TOBIT II: xtprobit

Rho Source: Stata help Source: Kunst