Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Size: px
Start display at page:

Download "Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data"

Transcription

1 Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible to model dynamics

2 Examples - individual earnings - household expenditures - firm investment - sector productivity - regional migration - country income per capita, or growth rates

3 Dimensions of the panel are important for asymptotic properties of different estimators. Large N, small T often found in microeconomic data. Longer T more common with aggregate data. Semi-asymptotic results let one dimension become large with the other held fixed. Our emphasis will be on the case where N with T fixed, more relevant for microeconometric applications. Reliance on any asymptotic results is hazardous if neither N nor T is large.

4 Static linear model y it = x it β + w i γ + (η i + v it ) for i = 1,..., N and t = 1,..., T x it = (x 1it,..., x Kit ), β = β 1., w i = (w 1i,..., w Gi ), γ = γ 1. β K γ G 1 K K 1 1 G G 1 y it, η i, and v it scalars Observed y it, x it, w i. Unobserved η i, v it.

5 Stack T observations for each individual y i = X i β + W i γ + (η i j T + v i ) for i = 1,..., N y i = y i1. y it, X i = x 1i1... x Ki x 1iT... x KiT, W i = w 1i... w Gi..... w 1i... w Gi, η i j T = η i. η i, v i = v i1. v it T 1 T K T G T 1 T 1 j T is a T 1 column vector with each element equal to one

6 Then stack over N individuals y = Xβ + W γ + (η + v) y = y 1. y N, X = X 1. X N, W = W 1. W N, η = η 1 j T. η N j T, v = v 1. v N NT 1 NT K NT G NT 1 NT 1

7 Special case: no time-invariant explanatory variables (G = 0) y = Xβ + (η + v) y = y 1. y N, X = X 1. X N, η = η 1 j T. η N j T, v = v 1. v N NT 1 NT K NT 1 NT 1 Two important assumptions that we maintain throughout:

8 y i = X i β + (η i j T + v i ) Cross-sectional independence: Observations on (y i, X i ) are independent over i = 1,..., N Slope parameter homogeneity: The parameters in β are common to all i = 1,..., N The form of unobserved heterogeneity that we address relates to the individualspecific intercept terms (η i ) in our linear model relating y it to x it (known as fixed effects or random effects, depending on whether they are assumed to be correlated or uncorrelated with the explanatory variables in x it )

9 y = Xβ + (η + v) = Xβ + u u = η + v; u it = η i + v it Ordinary least squares β OLS = (X X) 1 X y Assumption (x it predetermined) E[x it v it ] = 0 Properties of β OLS then depend on E[x it η i ].

10 Assumption (uncorrelated individual effects, or random effects ) E[x it η i ] = 0 Then β OLS is a consistent estimator of β as N or as T (or both). E[x it v it ] = 0 and E[x it η i ] = 0 = E[x it u it ] = 0. OLS would be consistent under these assumptions for a single cross-section. Panel dimension is thus not critical here for consistency. OLS is not effi cient in the panel setting, unless σ 2 η = var(η i ) = 0 (and v it iid(0, σ 2 v)).

11 Assumption (correlated individual effects, or fixed effects ) E[x it η i ] 0 Then β OLS is an inconsistent estimator of β as N or as T (or both). E[x it v it ] = 0 and E[x it η i ] 0 = E[x it u it ] 0. OLS using the panel (pooled OLS) is subject to the same kind of omitted variable bias as OLS in a single cross-section. Repeated observations do not change this; but do allow us to transform the model in order to construct consistent estimators.

12 Panel data is most useful when we suspect that cross-section regression results would be biased, due to (relevant and correlated) omitted variables. Particularly if it is plausible that important omitted variables are timeinvariant (or vary little over the sample period). And the explanatory variables of interest and the dependent variable vary over time.

13 Examples: Do high investment countries tend to have higher per capita income because investment raises income, or because factors like good governance or favorable geography raise both investment and income? Do high R&D firms tend to have higher TFP because R&D raises TFP, or because good managers both invest in R&D and (independently) achieve high TFP?

14 Classical panel data estimators Assumption (strict exogeneity) E[x it v is ] = 0 for all s, t This assumption is crucial for asymptotic properties in the case where N with T fixed, although not in the case where T. Strict exogeneity rules out feedback from past v is shocks to current x it. Hence rules out lagged dependent variables.

15 Assumption (error components) E[η i ] = E[v it ] = E[η i v it ] = 0 Assumption (serially uncorrelated shocks) E[v it v is ] = 0 for s t Assumption (homoskedasticity) E[η 2 i] = σ 2 η E[v 2 it] = σ 2 v

16 For the case of uncorrelated individual effects, ineffi ciency of pooled OLS reflects the serial correlation in u it = η i + v it due to the presence of the time-invariant individual effects (η i ). u it = η i + v it u i,t 1 = η i + v i,t 1 Under the classical assumptions E[u it u i,t 1 ] = E[η 2 i] = σ 2 η And E[u 2 it] = E[η 2 i] + E[v 2 it] = σ 2 η + σ 2 v

17 So And σ 2 η + σ 2 v σ 2 η σ 2 η σ 2 E[u i u η σ 2 η + σ 2 v... σ 2 η i] = = Ω i T T σ 2 η σ 2 η... σ 2 η + σ 2 v Ω i Ω i 0 E[uu ] = = Ω NT NT 0 0 Ω i

18 Generalised Least Squares Under the classical assumptions, the GLS (or random effects ) estimator is consistent and effi cient if E[x it η i ] = 0 β GLS = (X Ω 1 X) 1 X Ω 1 y NB. Consistency requires all the explanatory variables to be uncorrelated with the individual effects. If E[x it η i ] 0, β GLS is inconsistent as N with T fixed.

19 β GLS can be obtained using OLS on the transformed model y it = x itβ + u it where y it = y it (1 θ)y i and θ 2 = σ 2 v σ 2 v + T σ 2 η, y i = 1 T T s=1 y is This transformation is known as theta-differencing.

20 Feasible GLS uses consistent estimates of σ 2 η and σ 2 v to obtain a consistent estimate of θ. These can be obtained using residuals from the Within Groups and Between Groups estimators (to be discussed below). Feasible GLS is asymptotically equivalent to true GLS for this model. Hence feasible GLS is asymptotically effi cient under the classical assumptions, when E[x it η i ] = 0.

21 y it = y it (1 θ)y i θ 2 = σ 2 v σ 2 v + T σ 2 η For σ 2 η = 0, θ = 1 and y it = y it. Special case where OLS is effi cient. As T, θ 0 and y it = y it y i. In this case GLS coincides with the simpler Within Groups estimator (discussed below), and estimation of θ becomes redundant.

22 Within Groups Within transformation T Key property ỹ it = y it y i, y i = 1 T s=1 y is η i = η i so that η i = η i η i = 0 Example of a transformation that eliminates time-invariant variables. Notice that theta-differencing does not eliminate the time-invariant individuals effects (η i ) from the error term for θ 0 (η i = η i (1 θ)η i = θη i ) - hence we require E[x it η i ] = 0 for GLS to be consistent.

23 Transformed model ỹ it = x it β + ṽ it The Within Groups (or fixed effects ) estimator is OLS on this transformed model β W G = ( X 1 X) X ỹ Under classical assumptions, β W G is consistent, both for E[x it η i ] = 0 and for E[x it η i ] 0 - since the time-invariant individuals effects (η i ) are eliminated from the error term by the within transformation, we do not require E[x it η i ] = 0 for Within Groups to be consistent.

24 The Within Groups estimator is thus consistent in the case where some or all of the explanatory variables are correlated with this unobserved heterogeneity. In some contexts this is a key advantage, relative to cross-section OLS, pooled OLS or GLS. This illustrates how we can construct consistent estimates using panel data in settings where cross-section OLS would be subject to omitted variables bias.

25 But note that this advantage comes at a price. As N with T fixed, β W G is less effi cient than β GLS in the case where E[x it η i ] = 0. β W G is only effi cient (under classical assumptions) in the special case where all the explanatory variables are correlated with η i. Moreover any observed time-invariant explanatory variables are also eliminated by the transformation, so the Within Groups estimator does not identify the γ parameters in the more general model y it = x it β + w i γ + (η i + v it )

26 This illustrates that repeated observations (i.e. panel data) are most useful when the variables of interest change over time - repeated observations are less useful when the variables of interest remain constant over time. For example, panel data is less successful in controlling for unobserved ability if we want to estimate the effect of schooling on earnings - years of schooling remain constant for most people once they leave fulltime education and join the labour force.

27 More generally, the Within Groups parameter estimates are likely to be imprecise if there is only limited time-series ( within ) variation. The Within Groups estimator of β can also be obtained by including a set of N dummy variables, for each individual y it = η 1 D 1i η N D Ni + x it β + v it and using OLS on this model (D 1i = 1 for the observations on individual 1, and zero otherwise) Hence Within Groups is also called Least Squares Dummy Variables (LSDV).

28 Note that, in the case where N with T fixed, consistency depends on the strict exogeneity assumption. x it = x it 1 T (x i x it ) ṽ it = v it 1 T (v i v it ) Hence E[ x it ṽ it ] = 0 requires E[x it v is ] = 0 for all s, t unless T. This motivated the development of alternative estimators for dynamic panel data models, that are consistent as N for fixed T, in the presence of (e.g.) lagged dependent variables.

29 Other estimators Between Groups Between Groups is OLS on the cross-section equation y i = x i β + (η i + v i ) i = 1,..., N Consistency requires E[x it η i ] = 0. Between Groups is not effi cient - only used to obtain an estimate of σ 2 η when implementing feasible GLS.

30 First-differenced OLS OLS on the first-differenced equations y it = x it β + v it for i = 1,..., N and t = 2,..., T where y it = y it y i,t 1 First-differencing is another transformation that eliminates the time-invariant individual effects ( η i = η i η i = 0). Consistency requires E[ x it v it ] = 0 - this is implied by (but weaker than) strict exogeneity.

31 Within Groups is more effi cient than first-differenced OLS under classical assumptions, i.e. v it iid(0, σ 2 v), serially uncorrelated and homoskedastic. First-differenced OLS is more effi cient if v it is a random walk, i.e. v it = v i,t 1 + ε it with ε it iid(0, σ 2 ε) s.t. v it is serially uncorrelated. First-differenced OLS (but not Within Groups) would also be consistent as N with T fixed in cases where we suspect feedback from second lags or longer lags of v is onto x it, but not from the first lag (v i,t 1 ) onto x it - i.e. where any feedback takes two or more periods to influence x it.

32 Calculating the feasible GLS estimator y it = x itβ + u it y it = y it (1 θ)y i θ 2 = σ 2 v σ 2 v + T σ 2 η σ 2 v can be estimated consistently using the Within Groups residuals ṽit = ỹ it x it βw G σ 2 v = ṽ ṽ N(T 1) K

33 Notice that although we have N T observations and K parameters, we have only N(T 1) K degrees of freedom for the Within Groups estimator - we estimate N parameters when we estimate the individual means (y i for i = 1,..., N) used to construct the within transformation - or, equivalently, when we estimate coeffi cients on the N individual dummy variables in the Least Squares Dummy Variables representation - this is also relevant when we estimate the asymptotic variance of the Within Groups estimator using avar( β W G ) = σ 2 v( X X) 1

34 σ 2 η can then be estimated consistently using the Between Groups residuals û i = ( η i + v i ) = y i x i βbg σ 2 u = ( σ 2 η + 1 T σ2 v) = û û N K and then σ 2 η = σ 2 u 1 T σ2 v

35 Testing for correlated individual effects With fixed T, it is useful to test whether some of the included explanatory variables are correlated with the unobserved individual effects. β W G is consistent whether the individual effects are correlated with the included regressors, or not. β GLS (and β BG ) is consistent only if the individual effects are uncorrelated with all the included regressors; biased and inconsistent otherwise. Estimates should be similar if η i is uncorrelated with all the included regressors; but different if η i is correlated with any of the included regressors.

36 Hausman test q = β W G β GLS h = q [avar( q)] 1 q a χ 2 (K) under the null hypothesis that E[x it η i ] = 0. avar( q) = avar( β W G ) avar( β GLS ) An equivalent test can be based on q = β W G β BG. These tests require the classical assumptions, under which the FGLS estimator is effi cient relative to the Within estimator under the null. Versions robust to heteroskedasticity are now available.

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE

Chapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over

More information

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance V (u i x i ) = σ 2 is common to all observations i = 1,..., In many applications, we may suspect

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

Topic 10: Panel Data Analysis

Topic 10: Panel Data Analysis Topic 10: Panel Data Analysis Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Introduction Panel data combine the features of cross section data time series. Usually a panel

More information

Short T Panels - Review

Short T Panels - Review Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of

More information

Advanced Econometrics

Advanced Econometrics Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate

More information

the error term could vary over the observations, in ways that are related

the error term could vary over the observations, in ways that are related Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance Var(u i x i ) = σ 2 is common to all observations i = 1,..., n In many applications, we may

More information

Econ 582 Fixed Effects Estimation of Panel Data

Econ 582 Fixed Effects Estimation of Panel Data Econ 582 Fixed Effects Estimation of Panel Data Eric Zivot May 28, 2012 Panel Data Framework = x 0 β + = 1 (individuals); =1 (time periods) y 1 = X β ( ) ( 1) + ε Main question: Is x uncorrelated with?

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within

More information

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 6 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 21 Recommended Reading For the today Advanced Panel Data Methods. Chapter 14 (pp.

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled

More information

Dealing With Endogeneity

Dealing With Endogeneity Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics

More information

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63 1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:

More information

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS.

y it = α i + β 0 ix it + ε it (0.1) The panel data estimators for the linear model are all standard, either the application of OLS or GLS. 0.1. Panel Data. Suppose we have a panel of data for groups (e.g. people, countries or regions) i =1, 2,..., N over time periods t =1, 2,..., T on a dependent variable y it and a kx1 vector of independent

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

Econometrics Homework 4 Solutions

Econometrics Homework 4 Solutions Econometrics Homework 4 Solutions Question 1 (a) General sources of problem: measurement error in regressors, omitted variables that are correlated to the regressors, and simultaneous equation (reverse

More information

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 3 Jakub Mućk Econometrics of Panel Data Meeting # 3 1 / 21 Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2

More information

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague

Econometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage

More information

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid

Applied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid Applied Economics Panel Data Department of Economics Universidad Carlos III de Madrid See also Wooldridge (chapter 13), and Stock and Watson (chapter 10) 1 / 38 Panel Data vs Repeated Cross-sections In

More information

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley Panel Data Models James L. Powell Department of Economics University of California, Berkeley Overview Like Zellner s seemingly unrelated regression models, the dependent and explanatory variables for panel

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

Dynamic Panel Data Workshop. Yongcheol Shin, University of York University of Melbourne

Dynamic Panel Data Workshop. Yongcheol Shin, University of York University of Melbourne Dynamic Panel Data Workshop Yongcheol Shin, University of York University of Melbourne 10-12 June 2014 2 Contents 1 Introduction 11 11 Models For Pooled Time Series 12 111 Classical regression model 13

More information

Applied Quantitative Methods II

Applied Quantitative Methods II Applied Quantitative Methods II Lecture 10: Panel Data Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 10 VŠE, SS 2016/17 1 / 38 Outline 1 Introduction 2 Pooled OLS 3 First differences 4 Fixed effects

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

Regression with time series

Regression with time series Regression with time series Class Notes Manuel Arellano February 22, 2018 1 Classical regression model with time series Model and assumptions The basic assumption is E y t x 1,, x T = E y t x t = x tβ

More information

10 Panel Data. Andrius Buteikis,

10 Panel Data. Andrius Buteikis, 10 Panel Data Andrius Buteikis, andrius.buteikis@mif.vu.lt http://web.vu.lt/mif/a.buteikis/ Introduction Panel data combines cross-sectional and time series data: the same individuals (persons, firms,

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 4 Jakub Mućk Econometrics of Panel Data Meeting # 4 1 / 30 Outline 1 Two-way Error Component Model Fixed effects model Random effects model 2 Non-spherical

More information

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system:

Panel Data Exercises Manuel Arellano. Using panel data, a researcher considers the estimation of the following system: Panel Data Exercises Manuel Arellano Exercise 1 Using panel data, a researcher considers the estimation of the following system: y 1t = α 1 + βx 1t + v 1t. (t =1,..., T ) y Nt = α N + βx Nt + v Nt where

More information

Multiple Equation GMM with Common Coefficients: Panel Data

Multiple Equation GMM with Common Coefficients: Panel Data Multiple Equation GMM with Common Coefficients: Panel Data Eric Zivot Winter 2013 Multi-equation GMM with common coefficients Example (panel wage equation) 69 = + 69 + + 69 + 1 80 = + 80 + + 80 + 2 Note:

More information

Notes on Panel Data and Fixed Effects models

Notes on Panel Data and Fixed Effects models Notes on Panel Data and Fixed Effects models Michele Pellizzari IGIER-Bocconi, IZA and frdb These notes are based on a combination of the treatment of panel data in three books: (i) Arellano M 2003 Panel

More information

Panel Data Model (January 9, 2018)

Panel Data Model (January 9, 2018) Ch 11 Panel Data Model (January 9, 2018) 1 Introduction Data sets that combine time series and cross sections are common in econometrics For example, the published statistics of the OECD contain numerous

More information

1 Estimation of Persistent Dynamic Panel Data. Motivation

1 Estimation of Persistent Dynamic Panel Data. Motivation 1 Estimation of Persistent Dynamic Panel Data. Motivation Consider the following Dynamic Panel Data (DPD) model y it = y it 1 ρ + x it β + µ i + v it (1.1) with i = {1, 2,..., N} denoting the individual

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 4 Jakub Mućk Econometrics of Panel Data Meeting # 4 1 / 26 Outline 1 Two-way Error Component Model Fixed effects model Random effects model 2 Hausman-Taylor

More information

EC327: Advanced Econometrics, Spring 2007

EC327: Advanced Econometrics, Spring 2007 EC327: Advanced Econometrics, Spring 2007 Wooldridge, Introductory Econometrics (3rd ed, 2006) Chapter 14: Advanced panel data methods Fixed effects estimators We discussed the first difference (FD) model

More information

Lecture 10: Panel Data

Lecture 10: Panel Data Lecture 10: Instructor: Department of Economics Stanford University 2011 Random Effect Estimator: β R y it = x itβ + u it u it = α i + ɛ it i = 1,..., N, t = 1,..., T E (α i x i ) = E (ɛ it x i ) = 0.

More information

Introduction to Econometrics. Heteroskedasticity

Introduction to Econometrics. Heteroskedasticity Introduction to Econometrics Introduction Heteroskedasticity When the variance of the errors changes across segments of the population, where the segments are determined by different values for the explanatory

More information

Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part)

Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part) Séminaire d Analyse Economique III (LECON2486) Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part) Frédéric Docquier & Sara Salomone IRES UClouvain

More information

Panel Data: Linear Models

Panel Data: Linear Models Panel Data: Linear Models Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Panel Data: Linear Models 1 / 45 Introduction Outline What

More information

Economics 582 Random Effects Estimation

Economics 582 Random Effects Estimation Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0

More information

Linear dynamic panel data models

Linear dynamic panel data models Linear dynamic panel data models Laura Magazzini University of Verona L. Magazzini (UniVR) Dynamic PD 1 / 67 Linear dynamic panel data models Dynamic panel data models Notation & Assumptions One of the

More information

Lecture 6: Dynamic panel models 1

Lecture 6: Dynamic panel models 1 Lecture 6: Dynamic panel models 1 Ragnar Nymoen Department of Economics, UiO 16 February 2010 Main issues and references Pre-determinedness and endogeneity of lagged regressors in FE model, and RE model

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Econometric Analysis of Cross Section and Panel Data

Econometric Analysis of Cross Section and Panel Data Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND

More information

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL

INTRODUCTION TO BASIC LINEAR REGRESSION MODEL INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect

More information

Topic 7: Heteroskedasticity

Topic 7: Heteroskedasticity Topic 7: Heteroskedasticity Advanced Econometrics (I Dong Chen School of Economics, Peking University Introduction If the disturbance variance is not constant across observations, the regression is heteroskedastic

More information

Controlling for Time Invariant Heterogeneity

Controlling for Time Invariant Heterogeneity Controlling for Time Invariant Heterogeneity Yona Rubinstein July 2016 Yona Rubinstein (LSE) Controlling for Time Invariant Heterogeneity 07/16 1 / 19 Observables and Unobservables Confounding Factors

More information

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research Linear models Linear models are computationally convenient and remain widely used in applied econometric research Our main focus in these lectures will be on single equation linear models of the form y

More information

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1 PANEL DATA RANDOM AND FIXED EFFECTS MODEL Professor Menelaos Karanasos December 2011 PANEL DATA Notation y it is the value of the dependent variable for cross-section unit i at time t where i = 1,...,

More information

System GMM estimation of Empirical Growth Models

System GMM estimation of Empirical Growth Models System GMM estimation of Empirical Growth Models ELISABETH DORNETSHUMER June 29, 2007 1 Introduction This study based on the paper "GMM Estimation of Empirical Growth Models" by Stephan Bond, Anke Hoeffler

More information

Linear Panel Data Models

Linear Panel Data Models Linear Panel Data Models Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania October 5, 2009 Michael R. Roberts Linear Panel Data Models 1/56 Example First Difference

More information

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models

Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models Applied Econometrics Lecture 3: Introduction to Linear Panel Data Models Måns Söderbom 4 September 2009 Department of Economics, Universy of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,

More information

Lecture 7: Dynamic panel models 2

Lecture 7: Dynamic panel models 2 Lecture 7: Dynamic panel models 2 Ragnar Nymoen Department of Economics, UiO 25 February 2010 Main issues and references The Arellano and Bond method for GMM estimation of dynamic panel data models A stepwise

More information

Non-linear panel data modeling

Non-linear panel data modeling Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna December 11, 2012 Outline Heteroskedasticity

More information

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43

Panel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43 Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression

More information

Cluster-Robust Inference

Cluster-Robust Inference Cluster-Robust Inference David Sovich Washington University in St. Louis Modern Empirical Corporate Finance Modern day ECF mainly focuses on obtaining unbiased or consistent point estimates (e.g. indentification!)

More information

We can relax the assumption that observations are independent over i = firms or plants which operate in the same industries/sectors

We can relax the assumption that observations are independent over i = firms or plants which operate in the same industries/sectors Cluster-robust inference We can relax the assumption that observations are independent over i = 1, 2,..., n in various limited ways One example with cross-section data occurs when the individual units

More information

Lecture 8 Panel Data

Lecture 8 Panel Data Lecture 8 Panel Data Economics 8379 George Washington University Instructor: Prof. Ben Williams Introduction This lecture will discuss some common panel data methods and problems. Random effects vs. fixed

More information

Instrumental Variables

Instrumental Variables Università di Pavia 2010 Instrumental Variables Eduardo Rossi Exogeneity Exogeneity Assumption: the explanatory variables which form the columns of X are exogenous. It implies that any randomness in the

More information

Motivation for multiple regression

Motivation for multiple regression Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope

More information

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE

1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE 1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +

More information

Specification testing in panel data models estimated by fixed effects with instrumental variables

Specification testing in panel data models estimated by fixed effects with instrumental variables Specification testing in panel data models estimated by fixed effects wh instrumental variables Carrie Falls Department of Economics Michigan State Universy Abstract I show that a handful of the regressions

More information

Dynamic Panels. Chapter Introduction Autoregressive Model

Dynamic Panels. Chapter Introduction Autoregressive Model Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this

More information

Fixed Effects Models for Panel Data. December 1, 2014

Fixed Effects Models for Panel Data. December 1, 2014 Fixed Effects Models for Panel Data December 1, 2014 Notation Use the same setup as before, with the linear model Y it = X it β + c i + ɛ it (1) where X it is a 1 K + 1 vector of independent variables.

More information

Testing Random Effects in Two-Way Spatial Panel Data Models

Testing Random Effects in Two-Way Spatial Panel Data Models Testing Random Effects in Two-Way Spatial Panel Data Models Nicolas Debarsy May 27, 2010 Abstract This paper proposes an alternative testing procedure to the Hausman test statistic to help the applied

More information

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008

A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008 A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II Jeff Wooldridge IRP Lectures, UW Madison, August 2008 5. Estimating Production Functions Using Proxy Variables 6. Pseudo Panels

More information

Iris Wang.

Iris Wang. Chapter 10: Multicollinearity Iris Wang iris.wang@kau.se Econometric problems Multicollinearity What does it mean? A high degree of correlation amongst the explanatory variables What are its consequences?

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Endogeneity b) Instrumental

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Sensitivity of GLS estimators in random effects models

Sensitivity of GLS estimators in random effects models of GLS estimators in random effects models Andrey L. Vasnev (University of Sydney) Tokyo, August 4, 2009 1 / 19 Plan Plan Simulation studies and estimators 2 / 19 Simulation studies Plan Simulation studies

More information

Efficiency of repeated-cross-section estimators in fixed-effects models

Efficiency of repeated-cross-section estimators in fixed-effects models Efficiency of repeated-cross-section estimators in fixed-effects models Montezuma Dumangane and Nicoletta Rosati CEMAPRE and ISEG-UTL January 2009 Abstract PRELIMINARY AND INCOMPLETE Exploiting across

More information

Panel Data: Fixed and Random Effects

Panel Data: Fixed and Random Effects Short Guides to Microeconometrics Fall 2016 Kurt Schmidheiny Unversität Basel Panel Data: Fixed and Random Effects 1 Introduction In panel data, individuals (persons, firms, cities, ) are observed at several

More information

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity

LECTURE 10. Introduction to Econometrics. Multicollinearity & Heteroskedasticity LECTURE 10 Introduction to Econometrics Multicollinearity & Heteroskedasticity November 22, 2016 1 / 23 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists

More information

Linear Regression with Time Series Data

Linear Regression with Time Series Data Econometrics 2 Linear Regression with Time Series Data Heino Bohn Nielsen 1of21 Outline (1) The linear regression model, identification and estimation. (2) Assumptions and results: (a) Consistency. (b)

More information

ECON 497: Lecture Notes 10 Page 1 of 1

ECON 497: Lecture Notes 10 Page 1 of 1 ECON 497: Lecture Notes 10 Page 1 of 1 Metropolitan State University ECON 497: Research and Forecasting Lecture Notes 10 Heteroskedasticity Studenmund Chapter 10 We'll start with a quote from Studenmund:

More information

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables

Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.

More information

Appendix A: The time series behavior of employment growth

Appendix A: The time series behavior of employment growth Unpublished appendices from The Relationship between Firm Size and Firm Growth in the U.S. Manufacturing Sector Bronwyn H. Hall Journal of Industrial Economics 35 (June 987): 583-606. Appendix A: The time

More information

Lecture 4: Linear panel models

Lecture 4: Linear panel models Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)

More information

Simple Linear Regression Model & Introduction to. OLS Estimation

Simple Linear Regression Model & Introduction to. OLS Estimation Inside ECOOMICS Introduction to Econometrics Simple Linear Regression Model & Introduction to Introduction OLS Estimation We are interested in a model that explains a variable y in terms of other variables

More information

Freeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94

Freeing up the Classical Assumptions. () Introductory Econometrics: Topic 5 1 / 94 Freeing up the Classical Assumptions () Introductory Econometrics: Topic 5 1 / 94 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions needed for derivations

More information

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)

Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) 1 2 Panel Data Panel data is obtained by observing the same person, firm, county, etc over several periods. Unlike the pooled cross sections,

More information

Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"

Ninth ARTNeT Capacity Building Workshop for Trade Research Trade Flows and Trade Policy Analysis Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric

More information

Week 2: Pooling Cross Section across Time (Wooldridge Chapter 13)

Week 2: Pooling Cross Section across Time (Wooldridge Chapter 13) Week 2: Pooling Cross Section across Time (Wooldridge Chapter 13) Tsun-Feng Chiang* *School of Economics, Henan University, Kaifeng, China March 3, 2014 1 / 30 Pooling Cross Sections across Time Pooled

More information

α version (only brief introduction so far)

α version (only brief introduction so far) Econometrics I KS Module 8: Panel Data Econometrics Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: June 18, 2018 α version (only brief introduction so far) Alexander

More information

Econ 510 B. Brown Spring 2014 Final Exam Answers

Econ 510 B. Brown Spring 2014 Final Exam Answers Econ 510 B. Brown Spring 2014 Final Exam Answers Answer five of the following questions. You must answer question 7. The question are weighted equally. You have 2.5 hours. You may use a calculator. Brevity

More information

Lecture 4: Heteroskedasticity

Lecture 4: Heteroskedasticity Lecture 4: Heteroskedasticity Econometric Methods Warsaw School of Economics (4) Heteroskedasticity 1 / 24 Outline 1 What is heteroskedasticity? 2 Testing for heteroskedasticity White Goldfeld-Quandt Breusch-Pagan

More information

Applied Statistics and Econometrics

Applied Statistics and Econometrics Applied Statistics and Econometrics Lecture 6 Saul Lach September 2017 Saul Lach () Applied Statistics and Econometrics September 2017 1 / 53 Outline of Lecture 6 1 Omitted variable bias (SW 6.1) 2 Multiple

More information

Chapter 2. Dynamic panel data models

Chapter 2. Dynamic panel data models Chapter 2. Dynamic panel data models School of Economics and Management - University of Geneva Christophe Hurlin, Université of Orléans University of Orléans April 2018 C. Hurlin (University of Orléans)

More information

Econometrics (60 points) as the multivariate regression of Y on X 1 and X 2? [6 points]

Econometrics (60 points) as the multivariate regression of Y on X 1 and X 2? [6 points] Econometrics (60 points) Question 7: Short Answers (30 points) Answer parts 1-6 with a brief explanation. 1. Suppose the model of interest is Y i = 0 + 1 X 1i + 2 X 2i + u i, where E(u X)=0 and E(u 2 X)=

More information

Econometrics - 30C00200

Econometrics - 30C00200 Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N

Econ 836 Final Exam. 2 w N 2 u N 2. 2 v N 1) [4 points] Let Econ 836 Final Exam Y Xβ+ ε, X w+ u, w N w~ N(, σi ), u N u~ N(, σi ), ε N ε~ Nu ( γσ, I ), where X is a just one column. Let denote the OLS estimator, and define residuals e as e Y X.

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning

Økonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning Økonomisk Kandidateksamen 2004 (I) Econometrics 2 Rettevejledning This is a closed-book exam (uden hjælpemidler). Answer all questions! The group of questions 1 to 4 have equal weight. Within each group,

More information

Instrumental Variables and Two-Stage Least Squares

Instrumental Variables and Two-Stage Least Squares Instrumental Variables and Two-Stage Least Squares Generalised Least Squares Professor Menelaos Karanasos December 2011 Generalised Least Squares: Assume that the postulated model is y = Xb + e, (1) where

More information

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity

OSU Economics 444: Elementary Econometrics. Ch.10 Heteroskedasticity OSU Economics 444: Elementary Econometrics Ch.0 Heteroskedasticity (Pure) heteroskedasticity is caused by the error term of a correctly speciþed equation: Var(² i )=σ 2 i, i =, 2,,n, i.e., the variance

More information

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler

Basic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as

More information

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity

Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the

More information

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han

Econometrics Honor s Exam Review Session. Spring 2012 Eunice Han Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity

More information