Bias in Conditional and Unconditional Fixed Effects Logit Estimation: a Correction * Tom Coupé

Similar documents
Robust estimation based on the first- and third-moment restrictions of the power transformation model

ACE 562 Fall Lecture 5: The Simple Linear Regression Model: Sampling Properties of the Least Squares Estimators. by Professor Scott H.

R t. C t P t. + u t. C t = αp t + βr t + v t. + β + w t

Econ107 Applied Econometrics Topic 7: Multicollinearity (Studenmund, Chapter 8)

ESTIMATION OF DYNAMIC PANEL DATA MODELS WHEN REGRESSION COEFFICIENTS AND INDIVIDUAL EFFECTS ARE TIME-VARYING

Diebold, Chapter 7. Francis X. Diebold, Elements of Forecasting, 4th Edition (Mason, Ohio: Cengage Learning, 2006). Chapter 7. Characterizing Cycles

A Specification Test for Linear Dynamic Stochastic General Equilibrium Models

ACE 562 Fall Lecture 4: Simple Linear Regression Model: Specification and Estimation. by Professor Scott H. Irwin

How to Deal with Structural Breaks in Practical Cointegration Analysis

Regression with Time Series Data

Robert Kollmann. 6 September 2017

Generalized Least Squares

(10) (a) Derive and plot the spectrum of y. Discuss how the seasonality in the process is evident in spectrum.

GMM - Generalized Method of Moments

Dynamic Models, Autocorrelation and Forecasting

ACE 562 Fall Lecture 8: The Simple Linear Regression Model: R 2, Reporting the Results and Prediction. by Professor Scott H.

Financial Econometrics Jeffrey R. Russell Midterm Winter 2009 SOLUTIONS

Introduction D P. r = constant discount rate, g = Gordon Model (1962): constant dividend growth rate.

The General Linear Test in the Ridge Regression

Licenciatura de ADE y Licenciatura conjunta Derecho y ADE. Hoja de ejercicios 2 PARTE A

Testing for a Single Factor Model in the Multivariate State Space Framework

Problem Set 5. Graduate Macro II, Spring 2017 The University of Notre Dame Professor Sims

Physics 235 Chapter 2. Chapter 2 Newtonian Mechanics Single Particle

Vehicle Arrival Models : Headway

ACE 564 Spring Lecture 7. Extensions of The Multiple Regression Model: Dummy Independent Variables. by Professor Scott H.

Notes on Kalman Filtering

Lecture 2-1 Kinematics in One Dimension Displacement, Velocity and Acceleration Everything in the world is moving. Nothing stays still.

Modeling and Forecasting Volatility Autoregressive Conditional Heteroskedasticity Models. Economic Forecasting Anthony Tay Slide 1

On Measuring Pro-Poor Growth. 1. On Various Ways of Measuring Pro-Poor Growth: A Short Review of the Literature

Explaining Total Factor Productivity. Ulrich Kohli University of Geneva December 2015

13.3 Term structure models

Wednesday, November 7 Handout: Heteroskedasticity

STATE-SPACE MODELLING. A mass balance across the tank gives:

Econ Autocorrelation. Sanjaya DeSilva

Cointegration and Implications for Forecasting

DEPARTMENT OF STATISTICS

Let us start with a two dimensional case. We consider a vector ( x,

Comparing Means: t-tests for One Sample & Two Related Samples

Kriging Models Predicting Atrazine Concentrations in Surface Water Draining Agricultural Watersheds

Chapter Floating Point Representation

Wednesday, December 5 Handout: Panel Data and Unobservable Variables

1. Diagnostic (Misspeci cation) Tests: Testing the Assumptions

Distribution of Estimates

Solutions to Odd Number Exercises in Chapter 6

(a) Set up the least squares estimation procedure for this problem, which will consist in minimizing the sum of squared residuals. 2 t.

Lecture 33: November 29

Matlab and Python programming: how to get started

DEPARTMENT OF ECONOMICS AND FINANCE COLLEGE OF BUSINESS AND ECONOMICS UNIVERSITY OF CANTERBURY CHRISTCHURCH, NEW ZEALAND

Ensamble methods: Bagging and Boosting

FITTING EQUATIONS TO DATA

Biol. 356 Lab 8. Mortality, Recruitment, and Migration Rates

Measurement Error 1: Consequences Page 1. Definitions. For two variables, X and Y, the following hold: Expectation, or Mean, of X.

Exponential Weighted Moving Average (EWMA) Chart Under The Assumption of Moderateness And Its 3 Control Limits

Simulation-Solving Dynamic Models ABE 5646 Week 2, Spring 2010

20. Applications of the Genetic-Drift Model

Two Coupled Oscillators / Normal Modes

Properties of Autocorrelated Processes Economics 30331

Lecture 2 April 04, 2018

You must fully interpret your results. There is a relationship doesn t cut it. Use the text and, especially, the SPSS Manual for guidance.

Random Walk with Anti-Correlated Steps

A new flexible Weibull distribution

The Effect of Nonzero Autocorrelation Coefficients on the Distributions of Durbin-Watson Test Estimator: Three Autoregressive Models

Unit Root Time Series. Univariate random walk

Almost Unbiased Estimator for Estimating Population Mean Using Known Value of Some Population Parameter(s)

Ensamble methods: Boosting

Article from. Predictive Analytics and Futurism. July 2016 Issue 13

Inventory Analysis and Management. Multi-Period Stochastic Models: Optimality of (s, S) Policy for K-Convex Objective Functions

Lecture 3: Exponential Smoothing

3.1 More on model selection

State-Space Models. Initialization, Estimation and Smoothing of the Kalman Filter

Final Spring 2007

Chapter 5. Heterocedastic Models. Introduction to time series (2008) 1

A note on spurious regressions between stationary series

Chapter 11. Heteroskedasticity The Nature of Heteroskedasticity. In Chapter 3 we introduced the linear model (11.1.1)

PENALIZED LEAST SQUARES AND PENALIZED LIKELIHOOD

Hypothesis Testing in the Classical Normal Linear Regression Model. 1. Components of Hypothesis Tests

CONFIDENCE LIMITS AND THEIR ROBUSTNESS

Inventory Control of Perishable Items in a Two-Echelon Supply Chain

1 Differential Equation Investigations using Customizable

Computer Simulates the Effect of Internal Restriction on Residuals in Linear Regression Model with First-order Autoregressive Procedures

0.1 MAXIMUM LIKELIHOOD ESTIMATION EXPLAINED

Georey E. Hinton. University oftoronto. Technical Report CRG-TR February 22, Abstract

Hamilton- J acobi Equation: Weak S olution We continue the study of the Hamilton-Jacobi equation:

STRUCTURAL CHANGE IN TIME SERIES OF THE EXCHANGE RATES BETWEEN YEN-DOLLAR AND YEN-EURO IN

Air Traffic Forecast Empirical Research Based on the MCMC Method

t is a basis for the solution space to this system, then the matrix having these solutions as columns, t x 1 t, x 2 t,... x n t x 2 t...

Estimation Uncertainty

Distribution of Least Squares

4.1 Other Interpretations of Ridge Regression

23.2. Representing Periodic Functions by Fourier Series. Introduction. Prerequisites. Learning Outcomes

Lecture 5. Time series: ECM. Bernardina Algieri Department Economics, Statistics and Finance

Chapter 16. Regression with Time Series Data

Vectorautoregressive Model and Cointegration Analysis. Time Series Analysis Dr. Sevtap Kestel 1

Some Basic Information about M-S-D Systems

Y 0.4Y 0.45Y Y to a proper ARMA specification.

A Point Optimal Test for the Null of Near Integration. A. Aznar and M. I. Ayuda 1. University of Zaragoza

Multi-scale 2D acoustic full waveform inversion with high frequency impulsive source

3.1.3 INTRODUCTION TO DYNAMIC OPTIMIZATION: DISCRETE TIME PROBLEMS. A. The Hamiltonian and First-Order Conditions in a Finite Time Horizon

Outline. lse-logo. Outline. Outline. 1 Wald Test. 2 The Likelihood Ratio Test. 3 Lagrange Multiplier Tests

The consumption-based determinants of the term structure of discount rates: Corrigendum. Christian Gollier 1 Toulouse School of Economics March 2012

Transcription:

Bias in Condiional and Uncondiional Fixed Effecs Logi Esimaion: a Correcion * Tom Coupé Economics Educaion and Research Consorium, Naional Universiy of Kyiv Mohyla Academy Address: Vul Voloska 10, 04070 Kyiv, Ukraine Email: coupe@eerc.kiev.ua Tel: 38-044 492-8012 * I hank he referees and Ehan Kaz for helpful commens and he Economics Research and Oureach Cener (EROC), Kyiv for financial suppor. The STATA do-file is available on he Poliical Analysis Web sie. 1

Absrac In a recen paper published in his journal, Kaz (2001) compares he bias in condiional and uncondiional fixed effecs logi esimaion using Mone Carlo Simulaion. This noe shows ha while Kaz s (2001) specificaion has wrong fixed effecs (in he sense ha he fixed effecs are he same for all individuals), his conclusions sill hold if I correc his specificaion (so ha he fixed effecs do differ over individuals). This noe also illusraes he danger, when using logi, of including dummies when no fixed effecs are presen. 2

1. Inroducion In a recen paper published in his journal, Kaz (2001) sudies he esimaion of he binary choice logi model when panel daa are available, using Mone Carlo Simulaion. He specifies he relaionship beween he binary variable y and X as follows (1) P(y i, exp( γ + αi + xi, β) = 1γ, αi, β ) = 1+ exp( γ + α + x β) i i, Where γ is a consan, α i an individual fixed effec and β he coefficien of he explicaive variable. Daa are available for many individuals i and ime periods, so I have a panel daa se-up. Kaz hen aims o compare wo ways of conrolling for fixed effecs, using uncondiional maximum likelihood bu including individual specific dummies and esimaing he model by condiional maximum likelihood. Condiional maximum likelihood is known o give consisen esimaes of β (Chamberlain, 1980) bu does no provide esimaes of he individual fixed effecs α i, which are needed if one wans o compue saisics like marginal effecs. The uncondiional-wih-dummies esimaor provides esimaes of he individual fixed effecs α i bu has been proven o lead o inconsisency in β due o he incidenal parameer problem for =2, Abrevaya (1997) has shown ha he uncondiional - wih - dummies esimaor of β equals wo imes he condiional maximum likelihood esimaor of β. Mone Carlo simulaions lead Kaz o conclude ha: The implicaion for applied researchers is ha he condiional esimaor is always safe when T<20 and ha he uncondiional esimaor is safe when 16 T<20. When 8<T<16, he bias in he uncondiional esimaor is small and may be accepable o he researcher bu when T is close o wo, he bias is subsanial. Even in his case, however, he bias akes a predicable direcion and magniude. 3

Unforunaely, Kaz s Mone Carlo simulaions are no wha hey preend o be. On p. 381, Kaz wries: I conrolled p by adjusing he consan erm γ, and I se α i o 1 for all i 1. This is a crucial misake. Indeed, if all α i s ake he same value, here is basically no fixed effec anymore. Tha is, he subscrip i can be removed from α i and he new consan erm γ =γ+α brings us o a simple logi model wihou fixed effecs. (2) P(y i, exp( γ' + xi, β) = 1,γ', β) = 1+ exp( γ' + x β) i, Hence, in Kaz s paper he rue model has no fixed effecs in conras o wha Kaz claims 2,3. In Table 1, I esimae he se-up of Kaz s paper using condiional logi and uncondiional logi wih dummies, and hen add a hird esimaion mehod, uncondiional logi wihou dummies (ha is, simple logi) 4. Noe ha he real underlying model is hree imes a simple logi, bu only he hird esimaion mehod realizes his. [Table 1 Here] Resuls for he condiional logi and he uncondiional logi wih dummies are similar o Kaz s resuls. Imporan o noe, however, is column 3, a simple logi gives resuls similar o he condiional esimaor even for small and hence, is much beer han he uncondiional wih dummies esimaor for small. Hence, in Kaz s se-up i is never useful o use he uncondiional wih dummies specificaion. Noe ha Kaz s sudy is an ineresing illusraion of he fac ha in he logi case, including irrelevan variables as regressors can cause biased esimaes, even if hese irrelevan variables are no correlaed wih he rue explicaive variables indeed, including individual specific 4

dummies, which by consrucion are uncorrelaed wih he (random) X, in he case where here are NO fixed effecs will cause a bias in he esimaes of he coefficien of X. I menioned before ha for =2, Abrevaya (1997) has shown ha he uncondiional - wih - dummies esimaor of β equals wo imes he condiional maximum likelihood esimaor of β. In our case, I found exacly he same resul, he coefficien of he dummy-variables specificaion is double he coefficien of he condiional specificaion hence, he bias ha I cause by including he dummies has nohing o do wih he presence of fixed effecs bu is solely due o he mehod of esimaion. To correc Kaz s sudy I generaed α i from N(0,1) insead of seing all α i s o one. To ge fixed effecs, I furher need a correlaion beween X and he fixed effecs. I propose he 5 following specificaion ha models X as a funcion of α i. 2 (3) x i = ( ai + ε i ),ε i : N(0,1) 2 In his way, I have a correlaion beween x i and α i ha is independen of ime T, while x i is sandard normally disribued 6,7. [Table 2 Here] Our conclusion from Table 2 is essenially he same as Kaz s conclusions: from =8, he uncondiional logi wih dummies esimaors gives resuls ha on average are quie close o he rue β. Bu noe ha in conras o Table 1, here, he use of simple logi (column 3) is clearly inferior as one would expec since he simple logi compleely ignores he fixed effecs. 5

2. Conclusions This noe shows ha while Kaz s (2001) specificaion has no meaningful fixed effecs, his conclusions sill hold if I modify his specificaion such as o incorporae fixed effecs hence, when one has a panel daa se wih a relaively large number of ime periods, using dummies o ake ino accoun he fixed effecs is no unreasonable. However, his noe also poins ou he danger, when using logi, of including dummies when no fixed effecs are presen. 6

3. References Abrevaya, Jason. 1997. The Equivalence of Two Esimaors of he Fixed-Effecs Logi Model. Economics Leers 55:41-43. Chamberlain, Gary. 1980. Analysis of Covariance wih Qualiaive Daa. Review of Economic Sudies 47:225-238. Greene, William. 2004. The Behaviour of he Maximum Likelihood Esimaor of Limied Dependen Variable Models in he Presence of Fixed Effecs. Economerics Journal 7: 98-119 Kaz, Ehan. 2001. Bias in Condiional and Uncondiional Fixed Effecs Logi Esimaion. Poliical Analysis 9:379-384. 7

4. Endnoes 1 Ialics added. 2 Of course, one could claim ha here are fixed effecs, bu hey jus happen o be he same. However, in such case he fixed effecs are meaningless. 3 This misake is also presen in he compuer program wrien by Kaz and posed on he Poliical Analysis websie (hp://polmeh.wusl.edu/pa/vol9no4.hml), hough in a slighly differen form. In he compuer program, he fixed effec of he firs individual is dropped, hence he formula is P(y i, exp( γ + αi + xi, β) = 1γ, αi, β ) = 1+ exp( γ + α + x β) i i, wih α i =1 for all i>1 and zero for α 1. Hence in his case, I jus make a difference beween he fixed effec of he firs individual (γ) and he fixed effecs of all oher individuals (γ+ α i ). 4 The parameers are he same as for column 1 in Table 2 of Kaz (2001): β=0.5, γ=-1 5 I could generae α i from N(0,1) and combine his wih a randomly generaed X. This leads o wha economiss would call a random effec since by consrucion here is no correlaion beween he individual specific effecs and he explicaive variable X. The esimaion procedure in his case would be o use he random effecs esimaor, raher han including dummies o capure he fixed effecs. 6 Insead, one could follow Greene (2004) by generaing individual specific effecs ha are correlaed wih X as follows: 8

α i = T xi + a, a N(0,1) i i However, his specificaion implies ha he correlaion beween he fixed effecs and he explicaive variable will decrease as T grows since he influence of an individual x i on he average decreases wih. Hence, using his specificaion one canno disinguish beween he effec of increasing and decreasing he correlaion beween he fixed effecs and he explicaive variable. 7 Noe ha his implies ha I use a sandard normal disribuion for X raher han a uniform disribuion (which was used by Kaz). The disribuion of he X generally, however, has no influence on he resuls (see e.g. Greene (2004)). 9

5. Tables Table 1: Kaz s specificaion under differen esimaion mehods N=100 Condiional Logi Uncondiional Logi wih dummies Uncondiional Logi wihou dummies =2 0.472 (0.032) 0.944 (0.064) 0.488 (0.022) =4 0.520 (0.017) 0.696 (0.023) 0.514 (0.015) =8 0.506 (0.012) 0.579 (0.014) 0.504 (0.011) =12 0.505 (0.095) 0.551 (0.010) 0.498 (0.009) =16 0.495 (0.082) 0.529 (0.008) 0.500 (0.008) Based on 500 Mone Carlo simulaions. Parameers: p=0.56, β=0.5, γ =-1. Mone Carlo sandard errors in brackes. 10

Table 2: Fixed Effec Specificaion under differen esimaion mehods N=100 Condiional Logi Uncondiional Logi wih dummies Uncondiional Logi wihou dummies T=2 0. 535 (0.020) 1.070 (0.041) 1.122 (0.009) T=4 0.502 (0.009) 0.675 (0.013) 1.145 (0.006) T=8 0.508 (0.005) 0.583 (0.006) 1.030 (0.004) =12 0.501 (0.005) 0.549 (0.005) 1.183 (0.003) =16 0.496 (0.004) 0.529 (0.004) 1.165 (0.003) Based on 500 Mone Carlo simulaions. Parameers: β=0.5, γ =-1. Mone Carlo sandard errors in brackes. 11