-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model

Similar documents
Limited Dependent Variables and Panel Data

Non-linear panel data modeling

Simplified Implementation of the Heckman Estimator of the Dynamic Probit Model and a Comparison with Alternative Estimators

Question 1 [17 points]: (ch 11)

Description Remarks and examples Reference Also see

Confidence intervals for the variance component of random-effects linear models

point estimates, standard errors, testing, and inference for nonlinear combinations

Jeffrey M. Wooldridge Michigan State University

xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models

ECON 594: Lecture #6

Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois

leebounds: Lee s (2009) treatment effects bounds for non-random sample selection for Stata

Treatment interactions with nonexperimental data in Stata

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Exploring Marginal Treatment Effects

CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.

(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections

Testing methodology. It often the case that we try to determine the form of the model on the basis of data

Estimation of Dynamic Nonlinear Random E ects Models with Unbalanced Panels.

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

QIC program and model selection in GEE analyses

ERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data

Econometric Analysis of Panel Data. Final Examination: Spring 2013

Binary Dependent Variables

Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see

Answer all questions from part I. Answer two question from part II.a, and one question from part II.b.

raise Coef. Std. Err. z P> z [95% Conf. Interval]

Estimating the Fractional Response Model with an Endogenous Count Variable

Problem set - Selection and Diff-in-Diff

GMM Estimation in Stata

Dynamic Panels. Chapter Introduction Autoregressive Model

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

xtunbalmd: Dynamic Binary Random E ects Models Estimation with Unbalanced Panels

Partial effects in fixed effects models

Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!

Multivariate probit regression using simulated maximum likelihood

David M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata

Model and Working Correlation Structure Selection in GEE Analyses of Longitudinal Data

Women. Sheng-Kai Chang. Abstract. In this paper a computationally practical simulation estimator is proposed for the twotiered

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Handout 12. Endogeneity & Simultaneous Equation Models

Extended regression models using Stata 15

Control Function and Related Methods: Nonlinear Models

Missing dependent variables in panel data models

Econometrics of Panel Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Econometrics II Censoring & Truncation. May 5, 2011

Parametric and semiparametric estimation of ordered response models with sample selection and individual-specific thresholds

Practice exam questions

Fortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7

Short T Panels - Review

Problem Set 10: Panel Data

Marginal effects and extending the Blinder-Oaxaca. decomposition to nonlinear models. Tamás Bartus

Obtaining Critical Values for Test of Markov. Regime Switching

A Journey to Latent Class Analysis (LCA)

Very simple marginal effects in some discrete choice models *

Econometric Analysis of Cross Section and Panel Data

Instrumental Variables, Simultaneous and Systems of Equations

Dealing With and Understanding Endogeneity

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

1 The basics of panel data

Panel Data Seminar. Discrete Response Models. Crest-Insee. 11 April 2008

Generalized method of moments estimation in Stata 11

Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins-

ECON Introductory Econometrics. Lecture 16: Instrumental variables

Lecture 3.1 Basic Logistic LDA

Syllabus. By Joan Llull. Microeconometrics. IDEA PhD Program. Fall Chapter 1: Introduction and a Brief Review of Relevant Tools

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

2. We care about proportion for categorical variable, but average for numerical one.

Panel data methods for policy analysis

The Simulation Extrapolation Method for Fitting Generalized Linear Models with Additive Measurement Error

Estimating Markov-switching regression models in Stata

EC327: Advanced Econometrics, Spring 2007

Lab 07 Introduction to Econometrics

Understanding the multinomial-poisson transformation

Implementing Conditional Tests with Correct Size in the Simultaneous Equations Model

Final Exam. Question 1 (20 points) 2 (25 points) 3 (30 points) 4 (25 points) 5 (10 points) 6 (40 points) Total (150 points) Bonus question (10)

Boolean logit and probit in Stata

Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part)

Fixed and Random Effects Models: Vartanian, SW 683

Selection endogenous dummy ordered probit, and selection endogenous dummy dynamic ordered probit models

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48

Introductory Econometrics. Lecture 13: Hypothesis testing in the multiple regression model, Part 1

An explanation of Two Stage Least Squares

Applied Quantitative Methods II

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

Homework Solutions Applied Logistic Regression

Monday 7 th Febraury 2005

1. The Multivariate Classical Linear Regression Model

Chapter 11. Regression with a Binary Dependent Variable

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 54

Topic 10: Panel Data Analysis

Obtaining Critical Values for Test of Markov Regime Switching

A simple alternative to the linear probability model for binary choice models with endogenous regressors

dqd: A command for treatment effect estimation under alternative assumptions

FTE Employment before FTE Employment after

Problem Set 5 ANSWERS

Comments on: Panel Data Analysis Advantages and Challenges. Manuel Arellano CEMFI, Madrid November 2006

Transcription:

-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model Mark B. Stewart University of Warwick January 2006 1 The model The latent equation for the random effects dynamic probit model to be considered is specified as y it = γy it 1 + x 0 itβ + α i + u it (1) (i =1,...,N; t =2,...,T), where yit isthelatentdependentvariable,x it is a vector of exogenous explanatory variables, α i are (unobserved) individual-specific random effects, and the u it are assumed to be distributed N(0,σ 2 u). The observed binary outcome variable is defined as: ½ 1 if y y it = it 0 0 else (2) The subscript i indexes individuals and the subscript t indexes time periods. N is taken to be large, but T is typically small and regarded as fixed. Even when the errors u it are assumed serially independent, the composite error term, v it = α i + u it, will be correlated over time due to the individual specific time invariant α i terms. The individual-specific random effects specification adopted implies equi-correlation between the v it in any two (different) time periods: λ = Corr(v it,v is )= σ2 α σ 2 α + σ 2 u t, s =2,...,T; t 6= s (3) The standard (uncorrelated) random effects model also assumes α i uncorrelated with x it. Alternatively, adopting the Mundlak Chamberlain approach, correlation Address for correspondence: Mark Stewart, Economics Department, University of Warwick, Coventry CV4 7AL, UK. Tel: (44/0)-24-7653-3043. Fax: (44/0)-24-7652-3032. E-mail: Mark.Stewart@warwick.ac.uk 1

between α i and the observed characteristics in the model can be allowed for by assuming a relationship between α and either the time means of the x-variables or a combination of their lags and leads, e.g.: α i = x 0 ia + ζ i,whereζ i iid Normal and independent of x it and u it for all i, t. In terms of implementation, this simply has the effect of adding time means or lags and leads to the set of explanatory variables. To simplify notation the original form (1) will be used here with the understanding that these additional terms are subsumed into the x-vector in the case of the correlated random effects model. Since y is a binary variable, a normalization is required. A convenient one is that σ 2 u =1. Since u it is normally distributed, the transition probability for individual i at time t, givenα i,isgivenby P [y it x it,y it 1,α i ]=Φ [(γy it 1 + x 0 itβ + α i )(2y it 1)]. (4) where Φ is the cumulative distribution function of a standard normal. Estimation of the model requires an assumption about the initial observations, y i1, andinparticularabouttheirrelationshipwiththeα i. The assumption giving rise to the simplest form of model for estimation would be to take the initial conditions, y i1, to be exogenous. This would be appropriate if the start of the process coincided with the start of the observation period for each individual, but this is typically not the case. Under this assumption a standard Random Effects Probit program (such as xtprobit) can be used, since the likelihood can be decomposed into two independent factors and the joint probability for t =2,...,T maximized without reference to that for t =1. However, if the initial conditions are correlated with the α i,aswould be expected in most situations, this estimator will be inconsistent and will tend to overestimate γ and hence overstate the extent of state dependence. The approach to the initial conditions problem proposed by Heckman (1981) involves specifying a linearized approximation to the reduced form equation for the initial value of the latent variable: y i1 = z 0 i1π + η i (5) (i =1,...,N), where z i1 is a vector of exogenous instruments (for example pre-sample variables) and includes x i1,andη i is correlated with α i, but uncorrelated with u it for t 2. Using an orthogonal projection, it can be written as: η i = θα i + u i1 (6) (θ >0), with α i and u i1 independent of one another. It is also assumed that u i1 satisfies the same distributional assumptions as u it for t =2,...,T. (Any change in error variance will also be captured in θ.) The linearized reduced form for the latent variable for the initial time period is therefore specified as y i1 = z 0 i1π + θα i + u i1 (7) (i =1,...,N). The joint probability of the observed binary sequence for individual i, givenα i, in the Heckman approach is thus: Φ [(z 0 i1π + θα i )(2y i1 1)] TY Φ [(γy it 1 + x 0 itβ + α i )(2y it 1)]. (8) t=2 2

Hence for a random sample of individuals the likelihood to be maximized is given by Y Z ( ) TY Φ [(zi1π 0 + θσ α α )(2y i1 1)] Φ [(γy it 1 + x 0 itβ + σ α α )(2y it 1)] df (α ) i α t=2 (9) where F is the distribution function of α = α/σ α. Under the normalization used, σ α = p λ/(1 λ). With α taken to be normally distributed, the integral over α can be evaluated using Gaussian Hermite quadrature. The program redprob provides this Maximum Likelihood estimator. See Stewart (2005) for an application of the estimator in the context of an investigation of the dynamics of the conditional probability of unemployment. 2 The redprob command 2.1 Syntax redprob depvar varlist (varlist init )[if exp] [in range] [,i(varname) t(varname) quadrat(#) from(matname) ] The lagged dependent variable must be constructed by the user and must appear as the first variable in varlist. It is the user s responsibility to ensure that both this variable and depvar are binary 0/1 variables. varlist should additionally contain the variables in x. varlist init should contain the variables in z. 2.2 Options i(varname) specifies the variable name that contains the cross-section identifier, corresponding to index i. t(varname) specifiesthevariablenamethatcontainsthetime-seriesidentifier, corresponding to index t. quadrat(#) specifies the number of Gaussian Hermite quadrature points in the evaluation of the integral. from(matname) specifies a matrix containing starting values for the parameters of the model. Use this option to check that a global maximum has been found. Also use to reduce required number of iterations or to restart a previously halted run. The default uses apooled probit for t 2 and separate probit for the initial period reduced form. 2.3 Sample output To give an example of the use of the command and the output produced, this section uses the union data (http://www.stata-press.com/data/r9/union.dta) used in [R] xtprobit to model the probability of union membership. The data are for US young women and are from the NLSY. A subsample of the dataset is used here: (1) only data from 1978 onwards are used, (2) the data for 1983 are dropped, and (3) only those individuals observed in each of the remaining 6 waves are kept: drop if year<78 drop if year==83 by idcode: gen nwav=_n 3

keep if nwav==6 This gives a balanced panel with N = 799 individuals observed in each of T =6 waves and hence a sample size of NT =4, 794. In the example here the observations for 85 and 87 are implicitly treated as if they were for 84 and 86 respectively, which would give 6 waves at regular 2-year intervals. In addition to the lagged dependent variable, the model used includes age (age in current year), grade (years of schooling completed), and south (1 if resident in south). These variables are contained in x in the specification of the model given above. The vector z additionally contains the variable not_smsa (1 if living outside a standard metropolitan statistical area). This variable has a significant negative effect on the probability of union membership in the initial period reduced form, whether estimatedasaseparateprobitoraspartofthefullmodel. The output from the redprob programisasfollows:. sort idcode year. by idcode: gen tper = _n. by idcode: gen Lunion = union[_n-1] (799 missing values generated). redprob union Lunion age grade south (age grade south not_smsa), /* > */ i(idcode) t(tper) quadrat(24) from(bstart1) (output deleted) Random-Effects Dynamic Probit Model Number of obs = 4794 Wald chi2(4) = 82.04 Log likelihood = -1860.2152 Prob > chi2 = 0.0000 ------------------------------------------------------------------------------ union Coef. Std. Err. z P> z [95% Conf. Interval] union Lunion.6344453.098323 6.45 0.000.4417357.8271548 age -.0285863.009217-3.10 0.002 -.0466512 -.0105213 grade -.0539267.0269226-2.00 0.045 -.106694 -.0011595 south -.4883257.1238919-3.94 0.000 -.7311494 -.245502 _cons.5632897.4798547 1.17 0.240 -.3772082 1.503788 rfper1 age.0081099.0238047 0.34 0.733 -.0385464.0547663 grade -.0064163.0340848-0.19 0.851 -.0732214.0603888 south -.7260671.1650785-4.40 0.000-1.049615 -.4025192 not_smsa -.4152246.1644004-2.53 0.012 -.7374435 -.0930057 _cons -.9597118.8413652-1.14 0.254-2.608757.6893338 /logitrho.8453732.163999 5.15 0.000.5239411 1.166805 /ltheta -.1461108.1267408-1.15 0.249 -.3945182.1022966 rho.6995957.0344663 20.30 0.000.6280689.7625671 theta.864062.1095119 7.89 0.000.6740047 1.107712 ------------------------------------------------------------------------------ 2.4 Comments It is instructive to compare the estimates from redprob with those from using other estimators. Estimation of the corresponding model as a simple pooled probit, i.e. ignoring the cross-correlation between the composite error term in different time periods for the same individual, gives an estimate of γ of 1.88, much larger than that here. However it is important to note that the random effects models and the pooled probit model use different normalizations. The random effects models use a normalization of σ 2 u =1, while the pooled probit estimator uses σ 2 v =1. When comparing 4

them with pooled probit estimates, random effects model estimates therefore need to be multiplied by an estimate of σ u /σ v = 1 λ. Use of the Stata xtprobit command allows individual-specific effects in the equation, but takes the initial condition to be exogenous. This results in a considerable reduction in the estimate of γ compared with pooled probit, to 1.15. Allowing for the different normalizations, the scaled estimate of the coefficient on lagged union membership is 0.79, less than half the pooled probit estimate. Using the Heckman estimator in the program redprob, which allows for the endogeneity of the initial conditions, results in a further reduction in the estimate of γ compared to the xtprobit estimates, from 1.15 to 0.63, a reduction by almost half. The rescaled estimate declines from 0.79 to 0.35. It is therefore less than half the xtprobit estimate and less than one fifth the pooled probit estimate. The estimated effects of the x-variables are all greater (in absolute value) with the xtprobit random effects estimator than the pooled probit estimator, and greater again when the endogeneity of the initial conditions is allowed for with the redprob estimator. Exogeneity of the initial conditions in the random effects model can be viewed as resulting from imposing θ = 0 on the model estimated by redprob. Testing this hypothesis must allow for the fact that it is on the boundary of the parameter space. Never-the-less it is clear that in the example considered here the exogeneity hypothesis is strongly rejected. 3 References Heckman, J.J. (1981), The incidental parameters problem and the problem of initial conditions in estimating a discrete time - discrete data stochastic process, in C.F. Manski and D. McFadden (eds.). Structural Analysis of Discrete Data with Econometric Applications. Cambridge: MIT Press. Stewart, M.B. (2005), The inter-related dynamics of unemployment and low-wage employment, forthcoming Journal of Applied Econometrics. 5