α version (only brief introduction so far)

Similar documents
Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018

Applied Quantitative Methods II

Econ 582 Fixed Effects Estimation of Panel Data

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Week 2: Pooling Cross Section across Time (Wooldridge Chapter 13)

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Controlling for Time Invariant Heterogeneity

Applied Microeconometrics (L5): Panel Data-Basics

Fixed Effects Models for Panel Data. December 1, 2014

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Panel data methods for policy analysis

EC327: Advanced Econometrics, Spring 2007

Linear Panel Data Models

Econometrics of Panel Data

Econometrics of Panel Data

Final Exam. Economics 835: Econometrics. Fall 2010

Econometrics. Week 11. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Econometrics I Lecture 3: The Simple Linear Regression Model

Empirical Application of Panel Data Regression

Advanced Econometrics

Econometrics of Panel Data

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Wooldridge, Introductory Econometrics, 4th ed. Chapter 2: The simple regression model

Dealing With Endogeneity

Introductory Econometrics

A Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation

Short T Panels - Review

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Simultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser

What s New in Econometrics? Lecture 14 Quantile Methods

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Topic 10: Panel Data Analysis

Econometrics of Panel Data

Specification testing in panel data models estimated by fixed effects with instrumental variables

Multiple Equation GMM with Common Coefficients: Panel Data

New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation

Heteroskedasticity. Part VII. Heteroskedasticity

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Econometrics I Lecture 7: Dummy Variables

Linear Models in Econometrics

10 Panel Data. Andrius Buteikis,

Jeffrey M. Wooldridge Michigan State University

Lecture 10: Panel Data

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

Lecture 8: Instrumental Variables Estimation

Deriving Some Estimators of Panel Data Regression Models with Individual Effects

Gov 2000: 13. Panel Data and Clustering

Econometric Analysis of Cross Section and Panel Data

Applied Quantitative Methods II

Panel Data: Linear Models

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

Longitudinal Data Analysis. RatSWD Nachwuchsworkshop Vorlesung von Josef Brüderl 25. August, 2009

Panel Data: Fixed and Random Effects

Exogenous Treatment and Endogenous Factors: Vanishing of Omitted Variable Bias on the Interaction Term

An overview of applied econometrics

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Econometrics of Panel Data

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Fortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7

LECTURE 11. Introduction to Econometrics. Autocorrelation

Notes on Panel Data and Fixed Effects models

Panel Data Model (January 9, 2018)

Introduction to Panel Data Analysis

Non-linear panel data modeling

Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares

Chapter 2: simple regression model

A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

ECON The Simple Regression Model

4 Instrumental Variables Single endogenous variable One continuous instrument. 2

Instrumental Variables

ADVANCED ECONOMETRICS I. Course Description. Contents - Theory 18/10/2017. Theory (1/3)

1 The basics of panel data

CORRELATED RANDOM EFFECTS MODELS WITH UNBALANCED PANELS

Estimating Panel Data Models in the Presence of Endogeneity and Selection

Internal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.

Chapter 9: The Regression Model with Qualitative Information: Binary Variables (Dummies)

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

The regression model with one stochastic regressor (part II)

Problem 13.5 (10 points)

Beyond the Target Customer: Social Effects of CRM Campaigns

Econometrics of Panel Data

Dynamic Panel Data Models

centeris paribus. w w partial effect E(y w, c)á w. abil. c =(exper, abil) exper

Dynamic Panel Data Ch 1. Reminder on Linear Non Dynamic Models

Applied Econometrics Lecture 1

econstor Make Your Publications Visible.

Econometrics - 30C00200

When Should We Use Linear Fixed Effects Regression Models for Causal Inference with Longitudinal Data?

Economics 582 Random Effects Estimation

Course Econometrics I

Econometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

Environmental Econometrics

Transcription:

Econometrics I KS Module 8: Panel Data Econometrics Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: June 18, 2018 α version (only brief introduction so far) Alexander Ahammer (JKU) Module 8: Panel data 1 / 10

Now we consider different data dimensions, cont d Finally, we shall give a very brief introduction into panel data econometrics. Time restrictions prevent us from going into too much detail. Panel econometrics are a crucial component of the subsequent courses, Econometrics II (treatment evaluation) and Microeconometrics. Here we discuss some very basic ideas, What is time series data? What is pooled OLS, what are fixed effects and random effects? How can we estimate these models? We cannot cover a detailed discussion of (1) the underlying assumptions behind these models, and (2) their advantages and disadvantages. In case you have to use panel methods before the subsequent courses start, make sure to read Wooldridge (ch. 13 & 14) in great detail. Alexander Ahammer (JKU) Module 8: Panel data 2 / 10

The 2-period model Panel data have a cross-sectional and a time series component you observe the same individuals across time. Consider the simplest framework; you follow each individual i = 1,..., n for T = 2 periods; t = 1 and t = 2. You are interested in the relationship between y and x. Simply ignoring the panel component would lead to the pooled OLS model, y it = β 0 + β 1 x it + u it, i = 1,..., n; t = 1, 2. (1) where, ideally, you would adjust your standard errors to account for the autocorrelation amongst the u it s (see Wooldridge, p. 483; Computing Standard Errors Robust to Serial Correlation and Heteroskedasticity of Unknown Form ). Mechanics and coefficient interpretation are similar to cross-sectional OLS. Alexander Ahammer (JKU) Module 8: Panel data 3 / 10

The 2-period model The beauty of panel data is that it allows you to account for unobserved heterogeneity that stays constant over time among individuals. Consider a model where unobserved factors affecting the dependent variable consist of two types: those that are constant and those that vary over time, y it = β 0 + β 1 x it + ψ i + u it (2) xit and u it vary over time, ψ i is time-invariant. Additionally, you may want to allow for different intercepts depending on whether you are in period 1 or period 2, y it = β 0 + δ1{t = 2} + β 1x it + ψ i + u it (3) In (2), the variable ψ i captures all unobserved, time-constant factors that affect y it. In applied work, ψ i is typically referred to as a fixed effect. Alexander Ahammer (JKU) Module 8: Panel data 4 / 10

The 2-period model First-differencing How do we estimate the model in 2? Using pooled OLS is inappropriate, because we have to assume that Cov(ψ i, u it ) = 0. Otherwise, we obtain inconsistent estimates. We can easily allow for Cov(ψ i, u it ) 0. For a cross-sectional observation i n, we can write the two periods as y i2 = (β 0 + δ) + β 1 x i2 + ψ i + u i2 for t = 2 (4) y i1 = β 0 + β 1 x i1 + ψ i + u i1 for t = 1 (5) Subtracting (4) from (5), we get (y i2 y i1 ) = δ + β 1 (x i2 x i1 ) + (u i2 u i1 ) (6) or y i = δ + β 1 x i + u i (7) Alexander Ahammer (JKU) Module 8: Panel data 5 / 10

The 2-period model First-differencing First-differenced equation y i = δ + β 1 x i + u i (7) In (7), is the change from period 1 to period 2. Importantly, ψ i does not appear in this equation anymore. This is called first-differencing. = We differenced-out the unobserved heterogeneity. Key identifying assumption: Cov( u i, x i ) = 0. This assumption holds if the idiosyncratic error at each time t, u it, is uncorrelated with the explanatory variable in both time periods. This is a variant of the strict exogenenity assumption read Wooldridge (ch. 13, esp. pp. 481 483) for more information, also on other underlying assumptions. Alexander Ahammer (JKU) Module 8: Panel data 6 / 10

The 2-period model First-differencing Advantages of first-differencing Suppose you want to estimate returns to schooling. The following model for t = 1, 2 holds in the population, wage it = β 0 + β 1 educ it + β 2 exper it + β 3 female i + β 4 ability i + u it (8) If you first-difference the model, you get wage i = δ + β 1 educ i + β 2 exper i + u it (9) where δ is the difference in intercepts between t 1 und t 2. Notice that ability i disappears, because it doesn t have a t subscript? This applies to all time-invariant variables. In such a model, unobserved variables x that are correlated with u only cause problems if they are time-variant. All x you can safely assume to be time-invariant are differenced-out. Disadvantage: You cannot identify parameters of observable, time-invariant variables such as female i. Alexander Ahammer (JKU) Module 8: Panel data 7 / 10

Fixed effects model Consider a general fixed effects model, y it = β 1 x it + ψ i + u it, i = 1,..., n; t = 1,..., T (10) First differencing works great with T = 2. The easiest method to get rid of ψ i with large T is to demean. For each i, define the average of equation (10) over time as ȳ i = β 1 x i + ψ i + ū i (11) where, e.g., ȳ i = T 1 T t=1 x it, and so on. Because ψ i is fixed over time, it appears in both (10) and (11). If we subtract one from the other, we obtain (y it ȳ i ) = β 1 (x it x i ) + (u it ū i ) (12) which is called demeaning or within transforming data. You can show that demeaning leads to an equivalent estimate of ˆβ 1 as including a dummy variable for each i in model (10). Alexander Ahammer (JKU) Module 8: Panel data 8 / 10

Random effects model Consider the same unobserved effects model as before, y it = β 0 + β 1 x it + ψ i + u it (13) where we include an intercept in order to be able to assume that E(ψ i ) = 0, without loss of generality. The goal of fixed effects estimation is to get rid of ψ i, because we assume it to be correlated with the x it. If we assume ψ i is uncorrelated with each explanatory variable, Cov(x itk, ψ i ) = 0 for all t = 1,..., T and all variables k = 1,..., K, we arrive at the random effects model. It has the advantage of allowing to estimate also coefficients of time-varying variables, but requires a different stricter set of assumptions. Thus, it is hardly used in applied Econometrics nowadays. Check Wooldridge (ch. 14) for more information. Alexander Ahammer (JKU) Module 8: Panel data 9 / 10

Literature Main reference: Wooldridge, J. M. (2015). Introductory Econometrics: A Modern Approach, 5th ed., South Western College Publishing. Alexander Ahammer (JKU) Module 8: Panel data 10 / 10