Heteroskedasticity in Panel Data

Similar documents
Heteroskedasticity in Panel Data

Cross Sectional Time Series: The Normal Model and Panel Corrected Standard Errors

Dynamic panel data methods

Advanced Quantitative Methods: panel data

Time-Series Cross-Section Analysis

Lecture 4: Heteroskedasticity

Journal of Statistical Software

Econometrics of Panel Data

Multiple Equation GMM with Common Coefficients: Panel Data

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Heteroskedasticity and Autocorrelation Consistent Standard Errors

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

HETEROSKEDASTICITY, TEMPORAL AND SPATIAL CORRELATION MATTER

MFE Financial Econometrics 2018 Final Exam Model Solutions

Volatility. Gerald P. Dwyer. February Clemson University

Economics 582 Random Effects Estimation

GMM, HAC estimators, & Standard Errors for Business Cycle Statistics

Econ 582 Fixed Effects Estimation of Panel Data

Model Mis-specification

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Circle the single best answer for each multiple choice question. Your choice should be made clearly.

Econometrics of Panel Data

Vector Autoregressive Model. Vector Autoregressions II. Estimation of Vector Autoregressions II. Estimation of Vector Autoregressions I.

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Ordinary Least Squares Regression

But Wait, There s More! Maximizing Substantive Inferences from TSCS Models Online Appendix

An estimate of the long-run covariance matrix, Ω, is necessary to calculate asymptotic

Topic 6: Non-Spherical Disturbances

Multiple Regression Analysis

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS.

Model Assumptions; Predicting Heterogeneity of Variance

Econometrics - 30C00200

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Vector autoregressions, VAR

Econ 424 Time Series Concepts

The Size and Power of Four Tests for Detecting Autoregressive Conditional Heteroskedasticity in the Presence of Serial Correlation

Applied Econometrics (QEM)

Lecture 2: Univariate Time Series

Non-linear panel data modeling

Inference with Heteroskedasticity

Note: The primary reference for these notes is Enders (2004). An alternative and more technical treatment can be found in Hamilton (1994).

TAKEHOME FINAL EXAM e iω e 2iω e iω e 2iω

Statistics 910, #5 1. Regression Methods

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

1 The Multiple Regression Model: Freeing Up the Classical Assumptions

2. An Introduction to Moving Average Models and ARMA Models

Clustering as a Design Problem

Problem Set 2 Solution Sketches Time Series Analysis Spring 2010

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 54

ECONOMICS 210C / ECONOMICS 236A MONETARY HISTORY

Single Equation Linear GMM with Serially Correlated Moment Conditions

Econ 423 Lecture Notes: Additional Topics in Time Series 1

ECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48

The Simple Linear Regression Model

An overview of applied econometrics

Define y t+h t as the forecast of y t+h based on I t known parameters. The forecast error is. Forecasting

Instrumental Variables

Autocorrelation. Think of autocorrelation as signifying a systematic relationship between the residuals measured at different points in time

LECTURE 11. Introduction to Econometrics. Autocorrelation

Lecture 6a: Unit Root and ARIMA Models

Vector Autoregression

Economics 583: Econometric Theory I A Primer on Asymptotics

Next, we discuss econometric methods that can be used to estimate panel data models.

CHAPTER 6: SPECIFICATION VARIABLES

Dynamic Regression Models (Lect 15)

Robust Standard Errors to Spatial and Time Dependence in Aggregate Panel Models

Heteroscedasticity. Jamie Monogan. Intermediate Political Methodology. University of Georgia. Jamie Monogan (UGA) Heteroscedasticity POLS / 11

ECON/FIN 250: Forecasting in Finance and Economics: Section 8: Forecast Examples: Part 1

ECONOMETRICS HONOR S EXAM REVIEW SESSION

Topic 4 Unit Roots. Gerald P. Dwyer. February Clemson University

Non-Spherical Errors

Weighted Least Squares

Scenario 5: Internet Usage Solution. θ j

Serial Correlation. Edps/Psych/Stat 587. Carolyn J. Anderson. Fall Department of Educational Psychology

Regression of Time Series

x 21 x 22 x 23 f X 1 X 2 X 3 ε

POLSCI 702 Non-Normality and Heteroskedasticity

Econometrics of financial markets, -solutions to seminar 1. Problem 1

Appendix to The Life-Cycle and the Business-Cycle of Wage Risk - Cross-Country Comparisons

Chapter 8: Model Diagnostics

Single Equation Linear GMM with Serially Correlated Moment Conditions

Contents. Part I Statistical Background and Basic Data Handling 5. List of Figures List of Tables xix

Applied Statistics and Econometrics

Heteroskedasticity. in the Error Component Model

Introduction to Econometrics. Heteroskedasticity

Multivariate Time Series: VAR(p) Processes and Models

Topic 7: Heteroskedasticity

UNIVERSITY OF OSLO DEPARTMENT OF ECONOMICS

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Introduction to Regression Analysis. Dr. Devlina Chatterjee 11 th August, 2017

Topic 4: Model Specifications

Lecture 8 Panel Data

Like shoes and shirt, one size does not fit all: Evidence on time series cross-section estimators and specifications from Monte Carlo experiments

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Heteroskedasticity. y i = β 0 + β 1 x 1i + β 2 x 2i β k x ki + e i. where E(e i. ) σ 2, non-constant variance.

Appendix A: The time series behavior of employment growth

AGEC 621 Lecture 16 David Bessler

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

STAT Financial Time Series

Transcription:

Essex Summer School in Social Science Data Analysis Panel Data Analysis for Comparative Research Heteroskedasticity in Panel Data Christopher Adolph Department of Political Science and Center for Statistics and the Social Sciences University of Washington, Seattle

Review of heteroskedasticity Recall that in cross-sectional LS, heteroskedasticity is assumed away if present, biases our standard errors We noted two approaches Model the heteroskedasticity directly with an appropriate ML model, or Less optimally, continue to use the wrong method (LS), but try to correct the se s; these are known as Huber-White, sandwich, or robust standard errors How do these approaches transfer to the time series context? to panel data?

Dynamic heteroskedasticity As with cross-sectional models, we can model heteroskedasticity directly One possibility is to let heteroskedasticity evolve dynamically We can let heteroskedasticity be (sort-of) ARMA, under the name GARCH Generalized Autoregressive Conditional Heteroskedasticity: y t = µ t + ε t ε t f N ( 0, σ 2 t ) where P Q µ t = α + x t β + y t p φ p + ε t q θ q p=1 q=1 C D σt 2 = exp (η + z t γ) + σt cλ 2 c + c=1 d=1 ε 2 t dξ d In words, y t is an ARMA(P, Q)-GARCH(C, D) distributed time-series (Of course, we could leave out x and/or z if we wanted)

Dynamic heteroskedasticity where ( ) y t = µ t + ε t ε t f N 0, σ 2 t P Q µ t = α + x t β + y t p φ p + ε t q θ q p=1 q=1 σ 2 t = exp (η + z t γ) + C σt cλ 2 c + c=1 D d=1 ε 2 t dξ d Models like the above are workhorses of financial forecasting Can estimated by ML as usual In R, garch() in the tseries package does GARCH May have to look around a bit for ARMA-GARCH

Dynamic and panel heteroskedasticity Panel data allows for more complex forms of heteroskedasticity and serial correlation than cross-sectional data For example Serial correlation: E(ε is ε it ) = σ st 0 (Reduced/eliminated by appropriate ARMA specification) Contemporaneous correlation: E(ε it ε jt ) = σ ij 0 (Globally reduced by year fixed effects, but what about pairs of correlated units?) Panel heteroskedasticity: E(ε 2 is ) = E(ε2 it ) = σ2 i, but σ2 i σ2 j (How to fix?) Dynamic heteroskedasticity: σit 2 = f(σ2 i,t k ) (How to fix?)

Dynamic and panel heteroskedasticity Consider a Panel ARMA-GARCH model: y it = µ it + ε it ε it f N ( 0, σ 2 it ) µ it = α i + τ t + x it β + P y i,t p φ p + p=1 σ 2 it = exp (η i + ζ t + z it γ) + Q ε i,t q θ q q=1 C σi,t cλ 2 c + c=1 Suppose we estimated y it as a function of µ it, but ignored the structure of the error term That is, we estimate a panel ARMA or GMM, but assume ε it is homoskedastic and serially uncorrelated, conditional on the covariates and lags in µ it D d=1 ε 2 i,t dξ d

Dynamic and panel heteroskedasticity If we ignore this: σ 2 it = exp (η i + ζ t + z it γ) + C σi,t cλ 2 c + c=1 D d=1 ε 2 i,t dξ d We would miss three non-standard features of the error variance-covariance: Panel heteroskedasticity, from η i, the unit random effect in the variance function Contemporaneous correlation, from ζ t, the time random effect in the variance function Conditional heteroskedasticity: λ i and ξ i make the variance time dependent Thankfully, few reasonable models are this complex

Dynamic and panel heteroskedasticity Suppose we think this AR(1) with panel heteroskedasticity is appropriate: ( ) y it = µ it + ε it ε it f N 0, σ 2 i µ it = α i + x it β + y i,t p φ σ 2 i = exp (η i ) Only source of heteroskedasticity is now η i : panel heteroskedasticity, not dynamic heteroskedasticity We could switch this to contemporaneous correlation, by swapping ζ t for η i Roughly the model Beck & Katz advocate as a baseline for comparative politics Suggest estimating by LS then correcting se s for omission of η i & contemp corr This procedure yields panel-corrected standard errors, PCSEs What are they, and how do we compute them?

What would we do if we had a plain-vanilla cross-sectional regression and suspected or detected heteroskedasticity? Recall the standard errors from LS are the square roots of the diagonal elements of ˆV( ˆβ) = ˆσ 2 (X X) 1 So if σ 2 varies by i, these will be badly estimated Instead, of the usual σ 2 estimator, we could use the residual from each observation as a robust estimate of its variance: ˆσ 2 i = ˆε 2 i A heteroskedasticity robust formula for the Var-Cov matrix follows: ˆV( ˆβ) = (X X) 1 ( i ˆε i x ix i )(X X) 1 The standard errors of our parameters (β s) are the square roots of the diagonal of this matrix

Review: Adjusting standard errors for heteroskedasticity ˆV( ˆβ) = (X X) 1 ( i ˆε i x ix i )(X X) 1 SE s calculated from this equation are known by many names: Huber-White standard errors robust standard errors sandwich standard errors heteroskedastcity consistent standard errors If you have a single time series, Newey-West standard errors generalize this concept to include robustness to serial correlation For panel data there are many further options, leading to a vast literature exploring refinements to this basic concept

Panel-corrected standard errors To calculate panel-corrected standard errors, we need to estimate the correct variance-covariance matrix Why not just use Huber-White? That would ignore panel structure, which is inefficient if we know how that structure affects heteroskedasticity: ˆσ 2 it = ˆε 2 it Beck and Katz s panel correction produces sharper estimates of ˆσ it 2 strength across the observations from a single unit: by borrowing ˆσ 2 it = ˆσ 2 i = 1 T (ˆε2 i,1 + ˆε 2 i,2 + + ˆε 2 i,t ) Beck and Katz s panel correction also accounts for contemporaneous correlations across units: ˆσ i,j = 1 T (ˆε i,1ˆε j,1 + ˆε i,2ˆε j,2 + + ˆε i,t ˆε j,t ) Note the above will work better if T is large relative to N

Panel-corrected standard errors Building this intuition out into a variance-covariance matrix involves a bit of algebra To make PCSEs, suppose the variance-covariance matrix Ω is N T N T block-diagonal with an N N matrix Σ of contemporaneous covariances on diagonal In other words, allow for unit or contemporaneous heteroskedaticity that stays the same over time Visualizing this large matrix is tricky Note that NT NT block-diagonal means we are ordering the observations first by time, then by unit (reverse of our usual practice)

Panel-corrected standard errors Ω NT NT = σ 2 ε 1 σε 1,ε i σε 1,ε N 0 0 0 0 0 0 σε i,ε 1 σ 2 ε i σε i,ε N 0 0 0 0 0 0 σε N,ε 1 σε N,ε i σ 2 ε 0 0 0 0 0 0 N 0 0 0 σ 2 ε 1 σε 1,ε i σε 1,ε N 0 0 0 0 0 0 σε i,ε 1 σε 2 i σε i,ε N 0 0 0 0 0 0 σε N,ε 1 σε N,ε i σε 2 0 0 0 N 0 0 0 0 0 0 σ 2 ε 1 σε 1,ε i σε 1,ε N 0 0 0 0 0 0 σε i,ε 1 σε 2 i σε i,ε N 0 0 0 0 0 0 σε N,ε 1 σε N,ε i σε 2 N

Panel-corrected standard errors Instead, suppose Ω is NT NT block-diagonal with an N N matrix Σ of contemporaneous covariances on diagonal In other words, allow for unit or contemporaneous heteroskedaticity that stays the same over time Beck and Katz (1995) estimate Σ using LS residuals e i,t : ˆΣ i,j = T t=1 e i,t e j,t T And then use ˆΣ to construct the covariance matrix

Panel-corrected standard errors Monte Carlo experiments show panel-corrected standard errors are correct unless contemporaneous correlation is very high or T is small relative to N (Note: alternative is to estimate random effects in variance by ML) Beck and Katz suggest using LS with PCSEs and lagged DVs as a baseline model Most practioners think fixed effects should also be used Most important: getting the right lag structure & including FEs where appropriate PCSEs (or other var-cov correction) is a second-order concern In R, package pcse will calculate PCSEs for a linear regression Even easier: In the plm package, vcovbk() will produce a panel corrected var-cov matrix from a plm object If N is large relative to T, consider the Driscoll and Kraay alternative, vcovscc()

Panel-corrected standard errors: Application Let s apply Beck-Katz PCSEs to our panel ARIMA/plm example: we ll replace the usual variance-covariance matrix with the panel corrected variance covariance matrix We must make this substitution manually after estimation to get corrected standard errors, confidence intervals, and var-cov matrices: 1 to print the summary() of a plm model Example: summary(plmres, vcov=vcovbk(plmres)) 2 to use the coeftest() function on a plm model Example: coeftest(plmres, vcov=vcovbk(plmres)) 3 to simulate parameters with mvrnorm() for computing counterfactuals Example: mvrnorm(10000, coef(plmres), vcovbk(plmres))

Model RE FE FE-pcse FE ME Education it 2396 7556 7556 8668 8475 559 1216 1348 1245 1456 Democracy it 11094 1290 1290 2615 363 3463 4769 5058 4797 5522 Oil-Producer it 2689 4484 GDP i,t 1 023 015 015 017 020 002 002 002 GDP i,t 2 012 002 σ α 014 30910 Fixed effects x x x x Random effects x x N 113 113 113 113 113 T 328 328 328 228 328 observed N T 2794 2794 2794 2741 2794 AIC 43376 42112 LM test p-value <0001 <0001 0131

Expected change in constant GDP ($ pc) 15000 5000 0 5000 15000 Fixed effects ARIMA(1,1,0) 1 11 21 31 41 Time Recall the fixed effects results These are uncorrected for panel heteroskedasticity or contemporaneous correlation

Expected change in constant GDP ($ pc) 15000 5000 0 5000 15000 Fixed effects ARIMA(1,1,0), pcse 1 11 21 31 41 Time Panel correction usually makes little difference in long T small N contexts But in short T, robust standard errors can be quite important

Heteroskedastic and serial correlation consistent Var-Cov In the PCSEs approach, the focus is on panel heteroskedasticity It is assumed that serial correlation has been adequately modeled and purged A reasonable check when we have a few dozen periods of data, though similar in most cases to either ordinary SEs or White SEs But what if we have a low T? We might be more worried about residual serial correlation (and don t have practical access to ARMA diagnostics or fitting) Now there is more need for a correction to the variance covariance that corrects for observed error correlation across units and across periods Arellano (1987) provides a heteroskastic and autocorrelation consistent variance-covariance matrix: in plm, vcovhc() Use the same commands as above, but with vcovhc() instead of vcovbk() Particularly important to correct with panel GMM estimators

Change in Packs pc, +1 years 30 20 10 0 Change in Packs pc, +3 years 40 30 20 10 0 Linear / no trend Linear / year effects Log Log / no trend Log Log / year effects 30 20 10 0 40 30 20 10 0 Our prior results for cigarette taxes used the Arellano heteroskedastic and serial correlation consistent var-cov matrix What would happen if we had used the ordinary, homoskedastic var-cov matrix to compute CIs?

Change in Packs pc, +1 years 30 20 10 0 Change in Packs pc, +3 years 40 30 20 10 0 Linear / no trend Linear / year effects Log Log / no trend Log Log / year effects 30 20 10 0 40 30 20 10 0 The effects sizes are mostly unchanged: adjustments to standard errors affect CIs, not point estimates But the CIs are radically different under the traditional var-cov estimator Far too small (invisible even!) for the misspecified linear models And too large for the more correctly specified log-log models!

Change in Packs pc, +1 years 30 20 10 0 Change in Packs pc, +3 years 40 30 20 10 0 Linear / no trend Linear / year effects Log Log / no trend Log Log / year effects 30 20 10 0 40 30 20 10 0 Just as panel GMM point estimate are sensitive to assumptions, so are the standard errors Use caution, and prefer vcovhc() to vcov() in PGMM models Be sure to check which var-cov matrix your functions are using: the default may be wrong!