HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA?

Size: px
Start display at page:

Download "HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA?"

Transcription

1 HOW IS GENERALIZED LEAST SQUARES RELATED TO WITHIN AND BETWEEN ESTIMATORS IN UNBALANCED PANEL DATA? ERIK BIØRN Department of Economics University of Oslo P.O. Box 1095 Blindern 0317 Oslo Norway March Abstract: For a random effects panel data regression model in the unbalanced case we demonstrate that the Generalized Least Squares (GLS) estimator can be expressed as a (matrix) weighted average of estimators which utilize the within and the between individual variation. Two between estimators which generalizes the familiar estimator for the balanced case one of which depending on the variance components are involved. The regression s intercept needs specific treatment. We finally define an estimator which contains the GLS and the within and between estimators for balanced and unbalanced panel data as special cases. Keywords: Panel Data. Unbalanced panels. Missing data. Generalized Least Squares. Within estimation. Between estimation JEL classification: C13 C3

2 1 Introduction It is well known that for balanced panel data with random individual specific effects the Ordinary (OLS) and the Generalized Least Squares (GLS) estimators of the slope coefficient vector can be interpreted as (matrix) weighted averages of the estimators which utilize only the within individual and only the between individual variation in the data set often denoted as within and between estimators [see Hsiao (003 section 3.3.). Since unbalanced situations are more common than balanced in particular when using micro data due to entry or exit of respondents non-response rotation designs etc. the interest of this relationship is somewhat limited. There exists a growing literature on the GLS estimation of random individual effects models in unbalanced situations [see e.g. Biørn (1981) and Baltagi (1985). The question of whether and possibly how this estimator can be related to estimators which can be interpreted as within and between estimators has not been addressed in this literature. In this note the latter question is addressed. We demonstrate that a weighting relationship for the GLS with unbalanced panel data and random individual effects similar to that in the balanced case exists provided that the within and the between variation in the data are redefined in a suitable way. In deriving this relationship we demonstrate that specific attention must be given to the intercept term of the equation. Finally we present a general estimator which contains the GLS the OLS the within individual and the between individual estimators for balanced and unbalanced situations as special cases and which can be and easily implemented for computer programming. Model OLS and GLS estimators Consider a one-way error components regression model for unbalanced panel data in which individual i (i 1... N) is observed in T i periods and let t denote the observation number (which differs from the calendar period if the starting period of the individuals differ or if gaps occur in the time series of some of them): (1) y it x it β + k + ϵ it ϵ it α i + u it i 1... N α i IID(0 σα) u it IID(0 σ ) α i u it x it t 1... T i where x it is a (row) vector of regressors β its (column) vector of coefficients α i a random effect u it a disturbance and denotes orthogonal to. Let y i (y i1... y i ) y (y 1... y N ) X i (x i1... x it i ) X (X 1... X N) etc. and let I p be the p dimensional identity matrix and e p the (p 1) vector of ones. The number of observations i.e. the number of rows in y and X is n N i1 T i. The model can then be rewritten in the more compact form () y Xβ + e n k + ϵ ϵ [e T 1 α 1... e T N α N + u E(ϵ) 0 n1 E(ϵϵ ) Ω diag(ω 1... Ω N ) 1

3 where diag denotes block-diagonalization (3) Ω i E(ϵ i ϵ i) σ α e e T i + σ I σ K + (σ + T i σ α)j i 1... N and J p 1 p e pe p K p I p J p (p 1... ). Since J p and K p are idempotent and have orthogonal columns we simply have (4) Ω 1 i K T i σ + J σ + T i σ α σ (K + θ i J ) where (5) θ i var(ū i ) var( ε i ) σ σ +T i σ α 1 i 1... N. We use the following notation for the within the between and the total covariation in arbitrary matrices Z and Q constructed in the same way as X: W ZQ N i1 t1 (z it z i ) (q it q i ) Z diag(k T1... K TN )Q B ZQ N i1 T i( z i z) ( q i q) Z diag(j T1... J TN )Q Z J n Q G ZQ N i1 t1 (z it z) (q it q) W ZQ + B ZQ Z (I n J n )Q where z i 1 T i t1 z it and z 1 n N i1 t1 z it 1 n N i1 T i z i. Also we let X i (X i. e ) and X ( X 1... X N). As a consequence of not including the intercept in β and a column of ones in X we will [unlike e.g. Baltagi (008 section 9.) specify the intercept explicitly in the formulae. This will be essential for the purpose of defining the generalized between covariation and the two versions of the between estimators and interpreting the GLS estimator in terms of estimators which exploit the within and between variation. The OLS and GLS estimators of (β k) are respectively [ βols ( X X) [ 1 ( X X 1 [ y) ix i X ie X (6) iy i e X i e e e y i kols [ W XX + T i x i x i x i x i n 1 [ W XY + T i x i ȳ i ȳ i [ βgls (7) kgls ( X Ω 1 i X) 1 ( X Ω 1 i y) [ X iω 1 1 [ i X i X iω 1 i e X e Ω 1 i X i e Ω 1 i e [ W XX + 1 [ Φ i x i x i Φi x i Φi x i Φi iω 1 i y i y i e Ω 1 i W XY + Φ i x i ȳ i Φi ȳ i

4 where (8) Φ i var(u it) var( ε i ) T iσ σ +T i σ α θ i T i > < 1 (1 1 T i )σ > < σ α. The last equality in (7) follows from (4). Since the formula for the partitioned inverse [see Lütkepohl (1996 section 3.5.3) implies [ 1 [ A b Q Q b ( (9) c b c b c Q b c Qb c + 1 Q A b ) 1 b c c where A is a symmetric matrix b a row vector and c a scalar the estimators (6) and (7) can be rewritten as (10) β OLS [W XX + T i ( x i x) ( x i x) 1 [W XY + T i ( x i x) (ȳ i ȳ) kols ȳ x β OLS (11) β GLS [W XX + Φ i ( x i x) ( x i x) 1 [W XY + Φ i ( x i x) (ȳ i ȳ) kgls ȳ x β GLS where in general z z i z Φi z i Φi. In the unbalanced case we have: [1 The global means entering the OLS and the GLS formulae differ ( z z). [ The GLS estimators depend on (θ 1... θ N ). 3 Within and Between estimator first version The within estimator corresponding to β OLS and the between estimators corresponding to β OLS and k OLS obtained by running OLS on respectively (1) with the (k+α i )s considered as N unknown constants and on (1) ȳ i T i x i β + T i k + T i ϵ i i 1... N are (13) (14) β W W 1 XX W XY β B B 1 XX B XY kb ȳ x β B The disturbances in (1) T i ϵ i T i (α i + ū i ) are homoskedastic in the absence of individual effect (α i 0 θ i 1 Φ i T i ) and heteroskedastic with var( T i ϵ i ) σ /θ i otherwise. 3

5 From (10) (13) and (14) it follows for any panel design that (15) βols G 1 XX G XY (W XX +B XX ) 1 (W XX βw +B XX βb ). For a balanced design (T i T ) we have θ i θ σ Φ σ +T σα i Φ T σ σ +T σα (11) (13) and (14) yield (16) βgls (W XX +θb XX ) 1 (W XX βw +θb XX βb ). i so that We seek a relationship for the unbalanced case which can be interpreted as a generalization of the latter. This will involve modified between estimators. 4 Between estimator second version. A general class Let v i be an arbitrary weight and considered the following generalization of (1): (17) vi ȳ i v i x i β + v i k + v i ϵ i i 1... N with var( v i ϵ i ) v i Φ i σ. Running OLS gives a set of generalized between estimators: [ [ β 1 [ (18) B(v) vi x kb (v) i x i vi x i vi x i ȳ i vi x i vi vi ȳ i where v (v 1... v N ). Let further (19) z(v) vi z i vi BZQ (v) v i [ z i z(v) [ q i q(v) allowing (18) to be written in a form similar to (14) when we again use (9) (0) β B(v) B XX (v) 1 BXY (v) k B (v) ỹ(v) x(v)β B(v). Let now T (T 1... T N ) and Φ (Φ 1... Φ N ) so that we can write z z(t ) z z(φ) B ZQ B ZQ (T ). As a special case let in (0) v Φ which gives (1) β B β B(Φ) B XX (Φ) 1 BXY (Φ) kb k B (Φ) ỹ(φ) x(φ)β B(Φ) ȳ x β B. Proposition: Let for arbitrary vectors (z it q it ) W ZQ N i1 t1 (z it z i ) (q it q i ) B ZQ T i ( z i z) ( q i q) B ZQ (v) v i [ z i z(v) [ q i q(v) 4

6 where z(v)[ v i z i /[ v i z z(t ). Define corresponding estimators of β as β W W 1 XX W XY β B B 1 XX B XY β B(v) B XX (v) 1 BXY (v). Let (λ W λ B ) be non-negative scalars and consider the class of estimators β (λ W λ B v) [λ W W XX +λ B BXX (v) 1 [λ W W XX βw +λ B BXX (v)β B(v) being homogeneous in (λ W λ B ) and v of degree zero. Specifically β OLS β (1 1 T ) β B β (0 1 T ) β B(T ) β W β (1 0 T ) β (1 0 v) irrespective of v β B β (0 1 Φ) β B(Φ) β GLS β (1 1 Φ). [A β OLS β W β B β B β GLS are for suitable choices of v matrix weighted averages of β W β B(v). [B β OLS β GLS are in general not matrix weighted averages of β W β B. In practical applications the θ i s (Φ i s) have to be estimated which requires estimation of σ and σ α. We find that if the disturbances ϵ it were known the following estimators would be unbiased for the variance components: () σ W ϵϵ n N σ α n[(n N)B ϵϵ (N 1)W ϵϵ (n N)(n m) where n N i1 T i m N i1 T i. [For a balanced panel n NT m NT. The proof follows from the fact that and hence which implies W uu N i1 t1 u it N i1 T iū i B uu N i1 T iū i nū B αα N i1 T iα i nᾱ E(ū i ) 1 T i σ E(ū ) 1 n σ E(ᾱ ) m n σ α E(W ϵϵ ) E(W uu ) (n N)σ E(B uu ) (N 1)σ E(B αα ) E(B ϵϵ ) E(B uu ) + E(B αα ) ( n m ) σ n α (3) σ E(W ϵϵ) n N σ α n[(n N)E(B ϵϵ) (N 1)E(W ϵϵ ). (n N)(n m) 5

7 This we can verify that () give unbiased estimators: E( σ ) σ E( σ α) σ α. [See also Searle Casella and McCulloch (199 Section 3.6) and Biørn (004 Section 3). In practice the ϵ it disturbances are replaced by consistent residuals. The derived parameters θ i and Φ i can therefore be estimated by (4) (5) σ θ i σ +T i σ α Φ i T i σ σ +T i σ α (n m)w ϵϵ (n m)w ϵϵ + T i [(n N)B ϵϵ (N 1)W ϵϵ T i (n m)w ϵϵ (n m)w ϵϵ + T i [(n N)B ϵϵ (N 1)W ϵϵ to obtain feasible Between and GLS estimators by replacing Φ by Φ. 5 Conclusion Our conclusions then are the following: 1. If we define a modified between estimator of β given by (18) by choosing the weight vector v such that the weighted equation in individual means (16) gets disturbances with variance σ we obtain the Between estimator for the unbalanced panel data set β B β B(Φ). The prescription ensuring this is v Φ. [A Feasible Between estimator is obtained for Φ Φ as obtained from (5). The GLS estimator for the unbalanced case can be interpreted as a matrix weighted mean of β W and β B β B(Φ) with weights depending on X. [Replacing Φ by Φ we get the Feasible GLS estimator. Unlike the OLS estimator for the unbalanced case it cannot however in general be interpreted as a matrix weighted mean of β W and β B β B(T ). 3. Since in the balanced case (Φ θ T ) we have B XX (Φ) θb XX β B(Φ) β B(T ) β B β B the familiar decomposition given by (16) holds. References Baltagi B.H. (1985): Pooling Cross-Sections with Unequal me-series Lengths. Economics Letters Baltagi B.H. (008): Econometric Analysis of Panel Data. Chichester: Wiley. Biørn E. (1981): Estimating Economic Relations from Incomplete Cross-Section/me-Series Data. Journal of Econometrics Biørn E. (004): Regression Systems for Unbalanced Panel Data: A Stepwise Maximum Likelihood Procedure. Journal of Econometrics Hsiao C. (003): Analysis of Panel Data. Cambridge: Cambridge University Press Lütkepohl H. (1996): Handbook of Matrices. Chichester: Wiley. Searle S.R. Casella G. and McCulloch C.E. (199): Variance Components. New York: Wiley. 6

Simple Linear Regression: The Model

Simple Linear Regression: The Model Simple Linear Regression: The Model task: quantifying the effect of change X in X on Y, with some constant β 1 : Y = β 1 X, linear relationship between X and Y, however, relationship subject to a random

More information

Regression systems for unbalanced panel data: a stepwise maximum likelihood procedure

Regression systems for unbalanced panel data: a stepwise maximum likelihood procedure Journal of Econometrics 1 (004) 81 91 www.elsevier.com/locate/econbase Regression systems for unbalanced panel data: a stepwise maximum likelihood procedure Erik Birn Department of Economics, University

More information

Quantitative Analysis of Financial Markets. Summary of Part II. Key Concepts & Formulas. Christopher Ting. November 11, 2017

Quantitative Analysis of Financial Markets. Summary of Part II. Key Concepts & Formulas. Christopher Ting. November 11, 2017 Summary of Part II Key Concepts & Formulas Christopher Ting November 11, 2017 christopherting@smu.edu.sg http://www.mysmu.edu/faculty/christophert/ Christopher Ting 1 of 16 Why Regression Analysis? Understand

More information

Time Invariant Variables and Panel Data Models : A Generalised Frisch- Waugh Theorem and its Implications

Time Invariant Variables and Panel Data Models : A Generalised Frisch- Waugh Theorem and its Implications Time Invariant Variables and Panel Data Models : A Generalised Frisch- Waugh Theorem and its Implications Jaya Krishnakumar No 2004.01 Cahiers du département d économétrie Faculté des sciences économiques

More information

the error term could vary over the observations, in ways that are related

the error term could vary over the observations, in ways that are related Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance Var(u i x i ) = σ 2 is common to all observations i = 1,..., n In many applications, we may

More information

Heteroskedasticity. in the Error Component Model

Heteroskedasticity. in the Error Component Model Heteroskedasticity in the Error Component Model Baltagi Textbook Chapter 5 Mozhgan Raeisian Parvari (0.06.010) Content Introduction Cases of Heteroskedasticity Adaptive heteroskedastic estimators (EGLS,

More information

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data

Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna October 16, 2013 Outline Introduction Simple

More information

LECTURE 2 LINEAR REGRESSION MODEL AND OLS

LECTURE 2 LINEAR REGRESSION MODEL AND OLS SEPTEMBER 29, 2014 LECTURE 2 LINEAR REGRESSION MODEL AND OLS Definitions A common question in econometrics is to study the effect of one group of variables X i, usually called the regressors, on another

More information

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley

Panel Data Models. James L. Powell Department of Economics University of California, Berkeley Panel Data Models James L. Powell Department of Economics University of California, Berkeley Overview Like Zellner s seemingly unrelated regression models, the dependent and explanatory variables for panel

More information

Topic 10: Panel Data Analysis

Topic 10: Panel Data Analysis Topic 10: Panel Data Analysis Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Introduction Panel data combine the features of cross section data time series. Usually a panel

More information

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018

Econometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018 Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 4 Jakub Mućk Econometrics of Panel Data Meeting # 4 1 / 30 Outline 1 Two-way Error Component Model Fixed effects model Random effects model 2 Non-spherical

More information

The exact bias of S 2 in linear panel regressions with spatial autocorrelation SFB 823. Discussion Paper. Christoph Hanck, Walter Krämer

The exact bias of S 2 in linear panel regressions with spatial autocorrelation SFB 823. Discussion Paper. Christoph Hanck, Walter Krämer SFB 83 The exact bias of S in linear panel regressions with spatial autocorrelation Discussion Paper Christoph Hanck, Walter Krämer Nr. 8/00 The exact bias of S in linear panel regressions with spatial

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled

More information

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1

PANEL DATA RANDOM AND FIXED EFFECTS MODEL. Professor Menelaos Karanasos. December Panel Data (Institute) PANEL DATA December / 1 PANEL DATA RANDOM AND FIXED EFFECTS MODEL Professor Menelaos Karanasos December 2011 PANEL DATA Notation y it is the value of the dependent variable for cross-section unit i at time t where i = 1,...,

More information

Ma 3/103: Lecture 24 Linear Regression I: Estimation

Ma 3/103: Lecture 24 Linear Regression I: Estimation Ma 3/103: Lecture 24 Linear Regression I: Estimation March 3, 2017 KC Border Linear Regression I March 3, 2017 1 / 32 Regression analysis Regression analysis Estimate and test E(Y X) = f (X). f is the

More information

Nonstationary Panels

Nonstationary Panels Nonstationary Panels Based on chapters 12.4, 12.5, and 12.6 of Baltagi, B. (2005): Econometric Analysis of Panel Data, 3rd edition. Chichester, John Wiley & Sons. June 3, 2009 Agenda 1 Spurious Regressions

More information

INTRODUCTORY ECONOMETRICS

INTRODUCTORY ECONOMETRICS INTRODUCTORY ECONOMETRICS Lesson 2b Dr Javier Fernández etpfemaj@ehu.es Dpt. of Econometrics & Statistics UPV EHU c J Fernández (EA3-UPV/EHU), February 21, 2009 Introductory Econometrics - p. 1/192 GLRM:

More information

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)

Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model

More information

1 Estimation of Persistent Dynamic Panel Data. Motivation

1 Estimation of Persistent Dynamic Panel Data. Motivation 1 Estimation of Persistent Dynamic Panel Data. Motivation Consider the following Dynamic Panel Data (DPD) model y it = y it 1 ρ + x it β + µ i + v it (1.1) with i = {1, 2,..., N} denoting the individual

More information

ECON The Simple Regression Model

ECON The Simple Regression Model ECON 351 - The Simple Regression Model Maggie Jones 1 / 41 The Simple Regression Model Our starting point will be the simple regression model where we look at the relationship between two variables In

More information

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research

Linear models. Linear models are computationally convenient and remain widely used in. applied econometric research Linear models Linear models are computationally convenient and remain widely used in applied econometric research Our main focus in these lectures will be on single equation linear models of the form y

More information

Ordinary Least Squares Regression

Ordinary Least Squares Regression Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section

More information

Introductory Econometrics

Introductory Econometrics Based on the textbook by Wooldridge: : A Modern Approach Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna December 11, 2012 Outline Heteroskedasticity

More information

Panel Data Model (January 9, 2018)

Panel Data Model (January 9, 2018) Ch 11 Panel Data Model (January 9, 2018) 1 Introduction Data sets that combine time series and cross sections are common in econometrics For example, the published statistics of the OECD contain numerous

More information

Advanced Econometrics

Advanced Econometrics Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate

More information

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional

Heteroskedasticity. We now consider the implications of relaxing the assumption that the conditional Heteroskedasticity We now consider the implications of relaxing the assumption that the conditional variance V (u i x i ) = σ 2 is common to all observations i = 1,..., In many applications, we may suspect

More information

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator

Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator Bootstrapping Heteroskedasticity Consistent Covariance Matrix Estimator by Emmanuel Flachaire Eurequa, University Paris I Panthéon-Sorbonne December 2001 Abstract Recent results of Cribari-Neto and Zarkos

More information

Econometrics I Lecture 3: The Simple Linear Regression Model

Econometrics I Lecture 3: The Simple Linear Regression Model Econometrics I Lecture 3: The Simple Linear Regression Model Mohammad Vesal Graduate School of Management and Economics Sharif University of Technology 44716 Fall 1397 1 / 32 Outline Introduction Estimating

More information

The Simple Regression Model. Simple Regression Model 1

The Simple Regression Model. Simple Regression Model 1 The Simple Regression Model Simple Regression Model 1 Simple regression model: Objectives Given the model: - where y is earnings and x years of education - Or y is sales and x is spending in advertising

More information

Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models

Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models Some Monte Carlo Evidence for Adaptive Estimation of Unit-Time Varying Heteroscedastic Panel Data Models G. R. Pasha Department of Statistics, Bahauddin Zakariya University Multan, Pakistan E-mail: drpasha@bzu.edu.pk

More information

Applied Microeconometrics (L5): Panel Data-Basics

Applied Microeconometrics (L5): Panel Data-Basics Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics

More information

In the bivariate regression model, the original parameterization is. Y i = β 1 + β 2 X2 + β 2 X2. + β 2 (X 2i X 2 ) + ε i (2)

In the bivariate regression model, the original parameterization is. Y i = β 1 + β 2 X2 + β 2 X2. + β 2 (X 2i X 2 ) + ε i (2) RNy, econ460 autumn 04 Lecture note Orthogonalization and re-parameterization 5..3 and 7.. in HN Orthogonalization of variables, for example X i and X means that variables that are correlated are made

More information

Review of Econometrics

Review of Econometrics Review of Econometrics Zheng Tian June 5th, 2017 1 The Essence of the OLS Estimation Multiple regression model involves the models as follows Y i = β 0 + β 1 X 1i + β 2 X 2i + + β k X ki + u i, i = 1,...,

More information

Economics 582 Random Effects Estimation

Economics 582 Random Effects Estimation Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0

More information

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley

Models, Testing, and Correction of Heteroskedasticity. James L. Powell Department of Economics University of California, Berkeley Models, Testing, and Correction of Heteroskedasticity James L. Powell Department of Economics University of California, Berkeley Aitken s GLS and Weighted LS The Generalized Classical Regression Model

More information

Introduction to Econometrics Final Examination Fall 2006 Answer Sheet

Introduction to Econometrics Final Examination Fall 2006 Answer Sheet Introduction to Econometrics Final Examination Fall 2006 Answer Sheet Please answer all of the questions and show your work. If you think a question is ambiguous, clearly state how you interpret it before

More information

CLUSTER EFFECTS AND SIMULTANEITY IN MULTILEVEL MODELS

CLUSTER EFFECTS AND SIMULTANEITY IN MULTILEVEL MODELS HEALTH ECONOMICS, VOL. 6: 439 443 (1997) HEALTH ECONOMICS LETTERS CLUSTER EFFECTS AND SIMULTANEITY IN MULTILEVEL MODELS RICHARD BLUNDELL 1 AND FRANK WINDMEIJER 2 * 1 Department of Economics, University

More information

Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC

Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC A Simple Approach to Inference in Random Coefficient Models March 8, 1988 Marcia Gumpertz and Sastry G. Pantula Department of Statistics North Carolina State University Raleigh, NC 27695-8203 Key Words

More information

Reference: Davidson and MacKinnon Ch 2. In particular page

Reference: Davidson and MacKinnon Ch 2. In particular page RNy, econ460 autumn 03 Lecture note Reference: Davidson and MacKinnon Ch. In particular page 57-8. Projection matrices The matrix M I X(X X) X () is often called the residual maker. That nickname is easy

More information

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley

Review of Classical Least Squares. James L. Powell Department of Economics University of California, Berkeley Review of Classical Least Squares James L. Powell Department of Economics University of California, Berkeley The Classical Linear Model The object of least squares regression methods is to model and estimate

More information

No 19/2016 Panel data estimators and aggregation Erik Biørn

No 19/2016 Panel data estimators and aggregation Erik Biørn MEMORADUM o 19/2016 Panel data estimators and aggregation Erik Biørn ISS: 0809-8786 Department of Economics University of Oslo his series is published by the University of Oslo Department of Economics

More information

Topic 6: Non-Spherical Disturbances

Topic 6: Non-Spherical Disturbances Topic 6: Non-Spherical Disturbances Our basic linear regression model is y = Xβ + ε ; ε ~ N[0, σ 2 I n ] Now we ll generalize the specification of the error term in the model: E[ε] = 0 ; E[εε ] = Σ = σ

More information

Introduction to Estimation Methods for Time Series models. Lecture 1

Introduction to Estimation Methods for Time Series models. Lecture 1 Introduction to Estimation Methods for Time Series models Lecture 1 Fulvio Corsi SNS Pisa Fulvio Corsi Introduction to Estimation () Methods for Time Series models Lecture 1 SNS Pisa 1 / 19 Estimation

More information

Seemingly Unrelated Regressions in Panel Models. Presented by Catherine Keppel, Michael Anreiter and Michael Greinecker

Seemingly Unrelated Regressions in Panel Models. Presented by Catherine Keppel, Michael Anreiter and Michael Greinecker Seemingly Unrelated Regressions in Panel Models Presented by Catherine Keppel, Michael Anreiter and Michael Greinecker Arnold Zellner An Efficient Method of Estimating Seemingly Unrelated Regressions and

More information

Sensitivity of GLS estimators in random effects models

Sensitivity of GLS estimators in random effects models of GLS estimators in random effects models Andrey L. Vasnev (University of Sydney) Tokyo, August 4, 2009 1 / 19 Plan Plan Simulation studies and estimators 2 / 19 Simulation studies Plan Simulation studies

More information

Peter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8

Peter Hoff Linear and multilinear models April 3, GLS for multivariate regression 5. 3 Covariance estimation for the GLM 8 Contents 1 Linear model 1 2 GLS for multivariate regression 5 3 Covariance estimation for the GLM 8 4 Testing the GLH 11 A reference for some of this material can be found somewhere. 1 Linear model Recall

More information

Linear Regression. Junhui Qian. October 27, 2014

Linear Regression. Junhui Qian. October 27, 2014 Linear Regression Junhui Qian October 27, 2014 Outline The Model Estimation Ordinary Least Square Method of Moments Maximum Likelihood Estimation Properties of OLS Estimator Unbiasedness Consistency Efficiency

More information

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data

Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)

More information

ON VARIANCE COVARIANCE COMPONENTS ESTIMATION IN LINEAR MODELS WITH AR(1) DISTURBANCES. 1. Introduction

ON VARIANCE COVARIANCE COMPONENTS ESTIMATION IN LINEAR MODELS WITH AR(1) DISTURBANCES. 1. Introduction Acta Math. Univ. Comenianae Vol. LXV, 1(1996), pp. 129 139 129 ON VARIANCE COVARIANCE COMPONENTS ESTIMATION IN LINEAR MODELS WITH AR(1) DISTURBANCES V. WITKOVSKÝ Abstract. Estimation of the autoregressive

More information

1 Introduction to Generalized Least Squares

1 Introduction to Generalized Least Squares ECONOMICS 7344, Spring 2017 Bent E. Sørensen April 12, 2017 1 Introduction to Generalized Least Squares Consider the model Y = Xβ + ɛ, where the N K matrix of regressors X is fixed, independent of the

More information

Regression and Statistical Inference

Regression and Statistical Inference Regression and Statistical Inference Walid Mnif wmnif@uwo.ca Department of Applied Mathematics The University of Western Ontario, London, Canada 1 Elements of Probability 2 Elements of Probability CDF&PDF

More information

Applied Econometrics (QEM)

Applied Econometrics (QEM) Applied Econometrics (QEM) The Simple Linear Regression Model based on Prinicples of Econometrics Jakub Mućk Department of Quantitative Economics Jakub Mućk Applied Econometrics (QEM) Meeting #2 The Simple

More information

ANALYSIS OF PANEL DATA MODELS WITH GROUPED OBSERVATIONS. 1. Introduction

ANALYSIS OF PANEL DATA MODELS WITH GROUPED OBSERVATIONS. 1. Introduction Tatra Mt Math Publ 39 (2008), 183 191 t m Mathematical Publications ANALYSIS OF PANEL DATA MODELS WITH GROUPED OBSERVATIONS Carlos Rivero Teófilo Valdés ABSTRACT We present an iterative estimation procedure

More information

ECON3150/4150 Spring 2015

ECON3150/4150 Spring 2015 ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2

More information

The Two-way Error Component Regression Model

The Two-way Error Component Regression Model The Two-way Error Component Regression Model Badi H. Baltagi October 26, 2015 Vorisek Jan Highlights Introduction Fixed effects Random effects Maximum likelihood estimation Prediction Example(s) Two-way

More information

Least Squares Estimation-Finite-Sample Properties

Least Squares Estimation-Finite-Sample Properties Least Squares Estimation-Finite-Sample Properties Ping Yu School of Economics and Finance The University of Hong Kong Ping Yu (HKU) Finite-Sample 1 / 29 Terminology and Assumptions 1 Terminology and Assumptions

More information

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018

Econometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018 Econometrics I KS Module 1: Bivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: March 12, 2018 Alexander Ahammer (JKU) Module 1: Bivariate

More information

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept,

Linear Regression. In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, Linear Regression In this problem sheet, we consider the problem of linear regression with p predictors and one intercept, y = Xβ + ɛ, where y t = (y 1,..., y n ) is the column vector of target values,

More information

Modelling Multi-dimensional Panel Data: A Random Effects Approach

Modelling Multi-dimensional Panel Data: A Random Effects Approach Modelling Multi-dimensional Panel Data: A Random Effects Approach Laszlo Balazsi Badi H. Baltagi Laszlo Matyas Daria Pus Preliminary version, please do not quote February 5, 06 Abstract The last two decades

More information

10 Panel Data. Andrius Buteikis,

10 Panel Data. Andrius Buteikis, 10 Panel Data Andrius Buteikis, andrius.buteikis@mif.vu.lt http://web.vu.lt/mif/a.buteikis/ Introduction Panel data combines cross-sectional and time series data: the same individuals (persons, firms,

More information

Multivariate Regression Analysis

Multivariate Regression Analysis Matrices and vectors The model from the sample is: Y = Xβ +u with n individuals, l response variable, k regressors Y is a n 1 vector or a n l matrix with the notation Y T = (y 1,y 2,...,y n ) 1 x 11 x

More information

Title. Description. var intro Introduction to vector autoregressive models

Title. Description. var intro Introduction to vector autoregressive models Title var intro Introduction to vector autoregressive models Description Stata has a suite of commands for fitting, forecasting, interpreting, and performing inference on vector autoregressive (VAR) models

More information

Heteroskedasticity. Part VII. Heteroskedasticity

Heteroskedasticity. Part VII. Heteroskedasticity Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least

More information

Simple Linear Regression Model & Introduction to. OLS Estimation

Simple Linear Regression Model & Introduction to. OLS Estimation Inside ECOOMICS Introduction to Econometrics Simple Linear Regression Model & Introduction to Introduction OLS Estimation We are interested in a model that explains a variable y in terms of other variables

More information

Test of hypotheses with panel data

Test of hypotheses with panel data Stochastic modeling in economics and finance November 4, 2015 Contents 1 Test for poolability of the data 2 Test for individual and time effects 3 Hausman s specification test 4 Case study Contents Test

More information

Large Sample Properties of Estimators in the Classical Linear Regression Model

Large Sample Properties of Estimators in the Classical Linear Regression Model Large Sample Properties of Estimators in the Classical Linear Regression Model 7 October 004 A. Statement of the classical linear regression model The classical linear regression model can be written in

More information

The Simple Regression Model. Part II. The Simple Regression Model

The Simple Regression Model. Part II. The Simple Regression Model Part II The Simple Regression Model As of Sep 22, 2015 Definition 1 The Simple Regression Model Definition Estimation of the model, OLS OLS Statistics Algebraic properties Goodness-of-Fit, the R-square

More information

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63

Panel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63 1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:

More information

The Multiple Regression Model Estimation

The Multiple Regression Model Estimation Lesson 5 The Multiple Regression Model Estimation Pilar González and Susan Orbe Dpt Applied Econometrics III (Econometrics and Statistics) Pilar González and Susan Orbe OCW 2014 Lesson 5 Regression model:

More information

Econometrics Multiple Regression Analysis: Heteroskedasticity

Econometrics Multiple Regression Analysis: Heteroskedasticity Econometrics Multiple Regression Analysis: João Valle e Azevedo Faculdade de Economia Universidade Nova de Lisboa Spring Semester João Valle e Azevedo (FEUNL) Econometrics Lisbon, April 2011 1 / 19 Properties

More information

Next is material on matrix rank. Please see the handout

Next is material on matrix rank. Please see the handout B90.330 / C.005 NOTES for Wednesday 0.APR.7 Suppose that the model is β + ε, but ε does not have the desired variance matrix. Say that ε is normal, but Var(ε) σ W. The form of W is W w 0 0 0 0 0 0 w 0

More information

Weighted Least Squares

Weighted Least Squares Weighted Least Squares The standard linear model assumes that Var(ε i ) = σ 2 for i = 1,..., n. As we have seen, however, there are instances where Var(Y X = x i ) = Var(ε i ) = σ2 w i. Here w 1,..., w

More information

Homoskedasticity. Var (u X) = σ 2. (23)

Homoskedasticity. Var (u X) = σ 2. (23) Homoskedasticity How big is the difference between the OLS estimator and the true parameter? To answer this question, we make an additional assumption called homoskedasticity: Var (u X) = σ 2. (23) This

More information

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model

Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Restricted Maximum Likelihood in Linear Regression and Linear Mixed-Effects Model Xiuming Zhang zhangxiuming@u.nus.edu A*STAR-NUS Clinical Imaging Research Center October, 015 Summary This report derives

More information

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7

MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 MA 575 Linear Models: Cedric E. Ginestet, Boston University Midterm Review Week 7 1 Random Vectors Let a 0 and y be n 1 vectors, and let A be an n n matrix. Here, a 0 and A are non-random, whereas y is

More information

GENERALIZED ESTIMATORS OF STATIONARY RANDOM-COEFFICIENTS PANEL DATA MODELS: ASYMPTOTIC AND SMALL SAMPLE PROPERTIES

GENERALIZED ESTIMATORS OF STATIONARY RANDOM-COEFFICIENTS PANEL DATA MODELS: ASYMPTOTIC AND SMALL SAMPLE PROPERTIES GENERALIZED ESTIMATORS OF STATIONARY RANDOM-COEFFICIENTS PANEL DATA MODELS: ASYMPTOTIC AND SMALL SAMPLE PROPERTIES Authors: Mohamed Reda Abonazel Department of Applied Statistics and Econometrics, Institute

More information

1. The OLS Estimator. 1.1 Population model and notation

1. The OLS Estimator. 1.1 Population model and notation 1. The OLS Estimator OLS stands for Ordinary Least Squares. There are 6 assumptions ordinarily made, and the method of fitting a line through data is by least-squares. OLS is a common estimation methodology

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 3 Jakub Mućk Econometrics of Panel Data Meeting # 3 1 / 21 Outline 1 Fixed or Random Hausman Test 2 Between Estimator 3 Coefficient of determination (R 2

More information

Econometrics Master in Business and Quantitative Methods

Econometrics Master in Business and Quantitative Methods Econometrics Master in Business and Quantitative Methods Helena Veiga Universidad Carlos III de Madrid Models with discrete dependent variables and applications of panel data methods in all fields of economics

More information

Lecture 3: Multiple Regression

Lecture 3: Multiple Regression Lecture 3: Multiple Regression R.G. Pierse 1 The General Linear Model Suppose that we have k explanatory variables Y i = β 1 + β X i + β 3 X 3i + + β k X ki + u i, i = 1,, n (1.1) or Y i = β j X ji + u

More information

Lecture 9 SLR in Matrix Form

Lecture 9 SLR in Matrix Form Lecture 9 SLR in Matrix Form STAT 51 Spring 011 Background Reading KNNL: Chapter 5 9-1 Topic Overview Matrix Equations for SLR Don t focus so much on the matrix arithmetic as on the form of the equations.

More information

Intermediate Econometrics

Intermediate Econometrics Intermediate Econometrics Markus Haas LMU München Summer term 2011 15. Mai 2011 The Simple Linear Regression Model Considering variables x and y in a specific population (e.g., years of education and wage

More information

Introduction to Econometrics Midterm Examination Fall 2005 Answer Key

Introduction to Econometrics Midterm Examination Fall 2005 Answer Key Introduction to Econometrics Midterm Examination Fall 2005 Answer Key Please answer all of the questions and show your work Clearly indicate your final answer to each question If you think a question is

More information

Econometrics Summary Algebraic and Statistical Preliminaries

Econometrics Summary Algebraic and Statistical Preliminaries Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L

More information

Lecture 7: Dynamic panel models 2

Lecture 7: Dynamic panel models 2 Lecture 7: Dynamic panel models 2 Ragnar Nymoen Department of Economics, UiO 25 February 2010 Main issues and references The Arellano and Bond method for GMM estimation of dynamic panel data models A stepwise

More information

Specification Test for Instrumental Variables Regression with Many Instruments

Specification Test for Instrumental Variables Regression with Many Instruments Specification Test for Instrumental Variables Regression with Many Instruments Yoonseok Lee and Ryo Okui April 009 Preliminary; comments are welcome Abstract This paper considers specification testing

More information

Testing Random Effects in Two-Way Spatial Panel Data Models

Testing Random Effects in Two-Way Spatial Panel Data Models Testing Random Effects in Two-Way Spatial Panel Data Models Nicolas Debarsy May 27, 2010 Abstract This paper proposes an alternative testing procedure to the Hausman test statistic to help the applied

More information

Econometric Methods for Panel Data Part I

Econometric Methods for Panel Data Part I Econometric Methods for Panel Data Part I Robert M. Kunst University of Vienna February 011 1 Introduction The word panel is derived from Dutch and originally describes a rectangular board. In econometrics,

More information

Panel data can be defined as data that are collected as a cross section but then they are observed periodically.

Panel data can be defined as data that are collected as a cross section but then they are observed periodically. Panel Data Model Panel data can be defined as data that are collected as a cross section but then they are observed periodically. For example, the economic growths of each province in Indonesia from 1971-2009;

More information

2. Linear regression with multiple regressors

2. Linear regression with multiple regressors 2. Linear regression with multiple regressors Aim of this section: Introduction of the multiple regression model OLS estimation in multiple regression Measures-of-fit in multiple regression Assumptions

More information

Motivation for multiple regression

Motivation for multiple regression Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope

More information

Econometrics of Panel Data

Econometrics of Panel Data Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao

More information

The multiple regression model; Indicator variables as regressors

The multiple regression model; Indicator variables as regressors The multiple regression model; Indicator variables as regressors Ragnar Nymoen University of Oslo 28 February 2013 1 / 21 This lecture (#12): Based on the econometric model specification from Lecture 9

More information

Lecture 10: Panel Data

Lecture 10: Panel Data Lecture 10: Instructor: Department of Economics Stanford University 2011 Random Effect Estimator: β R y it = x itβ + u it u it = α i + ɛ it i = 1,..., N, t = 1,..., T E (α i x i ) = E (ɛ it x i ) = 0.

More information

Variable Selection and Model Building

Variable Selection and Model Building LINEAR REGRESSION ANALYSIS MODULE XIII Lecture - 37 Variable Selection and Model Building Dr. Shalabh Department of Mathematics and Statistics Indian Institute of Technology Kanpur The complete regression

More information

Econ 510 B. Brown Spring 2014 Final Exam Answers

Econ 510 B. Brown Spring 2014 Final Exam Answers Econ 510 B. Brown Spring 2014 Final Exam Answers Answer five of the following questions. You must answer question 7. The question are weighted equally. You have 2.5 hours. You may use a calculator. Brevity

More information