Introduction to Panel Data Analysis
|
|
- Gerard Holland
- 6 years ago
- Views:
Transcription
1 Introduction to Panel Data Analysis Youngki Shin Department of Economics Statistics and Data Series at Western November 21, / 40
2 Motivation More observations mean more information. 2 / 40
3 Motivation More observations mean more information. More observations with a certain structure mean much more information: pooled cross sections and panel data 2 / 40
4 Motivation More observations mean more information. More observations with a certain structure mean much more information: pooled cross sections and panel data How can we extract additional information from pooled cross sections or panel data? 2 / 40
5 Example Effect of an Incinerator on Housing Prices With cross-sectional data in 1981, we have rprice = 101, , nearinc (3, 093.0) (5, ) n = / 40
6 Example Effect of an Incinerator on Housing Prices With cross-sectional data in 1981, we have rprice = 101, , nearinc n = 142 (3, 093.0) (5, ) With another cross-sectional data in 1978 when there were no incinerator, we have rprice = 82, , nearinc n = 179 (2, ) (4, ) 3 / 40
7 Example Effect of an Incinerator on Housing Prices With cross-sectional data in 1981, we have rprice = 101, , nearinc n = 142 (3, 093.0) (5, ) With another cross-sectional data in 1978 when there were no incinerator, we have rprice = 82, , nearinc n = 179 (2, ) (4, ) Therefore, the true effect of the incinerator in not 30, but 30, ( 18, ) = 11, / 40
8 Outline Data Structure Policy Evaluation with Pooled Cross Sections Three Approaches in Panel Data Estimation First Difference (FD) Estimator Fixed Effect (FE) Estimator Random Effect (RE) Estimator Empirical Application: Smoking on Birth Outcomes Concluding Remarks 4 / 40
9 Outline Data Structure Policy Evaluation with Pooled Cross Sections Three Approaches in Panel Data Estimation First Difference (FD) Estimator Fixed Effect (FE) Estimator Random Effect (RE) Estimator Empirical Application: Smoking on Birth Outcomes Concluding Remarks 5 / 40
10 Data Structure (cont.) A set of pooled cross sections is obtained by sampling randomly from a large population at different time points. A (typical) panel data set follow the same individuals over time. For example, consider that I sample three individuals from this room at two time points: Time Pooled Panel t=1 John, Jane, Evelyn Eric, Andrew, Rachel t=2 Kyle, Justin, Lisa Eric, Andrew, Rachel 6 / 40
11 Data Structure (cont.) A Snapshot of Data Table: Pooled Data year rprice nearinc y Table: Panel Data id year inf unem / 40
12 Data Structure (cont.) There are also very useful panel structures other than the individual-time combination. 1 Twins data: i is for twins id, and t is for the individual among the specific twins. Control for unobserved generic factors. 2 School data: students sampled from many schools (or classrooms). Then, i is for school id, and t is for the student in school i. 8 / 40
13 Data Structure (cont.) Examples of pooled cross sections: Current Population Survey (CPS), USA Examples of a panel data: Labor Market Activity Survey (LMAS), Canada Panel Study of Income Dynamics (PSID), USA National Longitudinal Survey of Youth (NLSY), USA A time series of provincial (or country) level data. ex) inflation and unemployment rate of 50 countries in It is usually easier to collect pooled cross sections than to do panel data. 9 / 40
14 Outline Data Structure Policy Evaluation with Pooled Cross Sections Three Approaches in Panel Data Estimation First Difference (FD) Estimator Fixed Effect (FE) Estimator Random Effect (RE) Estimator Empirical Application: Smoking on Birth Outcomes Concluding Remarks 10 / 40
15 Policy Evaluation with Pooled Cross Sections Difference-in-Difference Estimator Terminology: Treatment Group: those who are affected by a policy (a treatment) Control Group: those who are not. The object of policy evaluation is to measure the (mean) difference of outcomes between the treatment group and the control group. This measure is also called the average treatment effect. Consider that you are testing the effect of a new drug. How can you design the experiment? Randomization. Recall the incinerator and housing prices example. Is randomization possible? 11 / 40
16 Policy Evaluation with Pooled Cross Sections Difference-in-Difference Estimator Consider an example of a drug test: blprs i = β 0 + β 1 treat i + u i If you randomized the control/treatment groups well, i.e. Cov(treat i, u i ) = 0, then you can estimate the effect of the drug by a single cross section. In policy evaluation in social sciences, treat i and u i are easily correlated: log(wage i ) = β 0 + β 1 jbtrn i + u i 12 / 40
17 Policy Evaluation with Pooled Cross Sections Difference-in-Difference Estimator Pooled cross sections help us to evaluate the policy effect correctly by measuring the difference twice (before and after the policy implementation.) Recall the two regressions in the incinerator example: rprice = γ 0 + γ 1 nearinc + u in years 1978 and 1981 ˆδ 1 = ˆγ 1,81 ˆγ 1,78 = ( rprice 81,nr rprice 81,fr ) ( rprice78,nr rprice 78,fr ) If perfectly randomized, the second term is 0. This estimator is called the Difference-in-Difference estimator. 13 / 40
18 Policy Evaluation with a Pooled Cross Section Difference-in-Difference Estimator The effect can be estimated just by a single regression with some dummy variable. rprice = β 0 + δ 0 y81 + β 1 nearinc + δ 1 y81 nearinc + u This result is not intuitive. Just follow the logic: Before (y81 = 0) After (y81 = 1) After-Before Control (nearinc = 0) β 0 β 0 + δ 0 δ 0 Treatment (nearinc = 1) β 0 + β 1 β 0 + δ 0 + β 1 + δ 1 δ 0 + δ 1 Treatment-Control β 1 β 1 + δ 1 δ 1 Therefore, δ 1 in the above regression gives the same estimate of the Difference-in-Difference estimator. 14 / 40
19 Outline Data Structure Policy Evaluation with Pooled Cross Sections Three Approaches in Panel Data Estimation First Difference (FD) Estimator Fixed Effect (FE) Estimator Random Effect (RE) Estimator Empirical Application: Smoking on Birth Outcomes Concluding Remarks 15 / 40
20 Panel Data and the First Difference (FD) Estimator In panel data, we follow the same individual over time. This specific structure enables us to conduct a better analysis. Specifically, we can control for certain types of omitted variables called unobserved heterogeneity. Let us think about some examples: log(wage it ) = β 0 + δ 0 d2 t + β 1 educ it + a i + u it }{{} v it Notation: now we have two subscripts, i and t. Both a i and u it are unobservables called a fixed effect and an idiosyncratic error, respectively. 16 / 40
21 Panel Data and the First Difference (FD) Estimator For simplicity, consider two periods model: y it = β 0 + δ 0 d2 t + β 1 x it + a i + u it t = 1, 2. The pooled OLS does not work well since a i is usually correlated with x it, i.e. Cov(v it, x it ) 0. A simple solution is the First-Difference (FD) estimator. y i2 = (β 0 + δ 0 ) + β 1 x i2 + a i + u i2 t = 2 y i1 = β 0 + β 1 x i1 + a i + u i1 t = 1 Taking a difference gives y i2 y i1 = δ 0 + β 1 (x i2 x i1 ) + (u i2 u i1 ) or y i = δ 0 + β 1 x i + u i. 17 / 40
22 Panel Data and the First Difference (FD) Estimator The (pooled) OLS works in the new regression, 1 u i and x i are uncorrelated; 2 x i has some variation. y i = δ 0 + β 1 x i + u i, if The second condition is violated if x it does not change over time: ex) gender, race, etc.. Then, x i = 0. Even in the wage equation example, log(wage it ) = β 0 + δ 0 d2 t + β 1 educ it + a i + u it, Most working population do not increase the years of educ. 18 / 40
23 Panel Data and the First Difference (FD) Estimator More than Two Time Periods When panel data contain more than two time periods, we can still apply the FD estimator to control for unobserved heterogeneity. The sufficient condition for the estimator to be valid is This condition is violated when Cov(x it, u is ) = 0 for all t and s. 1 Future regressors react to the past dependent variable (feedback); 2 Regressors contain a lagged dependent variable; 3 An important (i.e. related to x it ) time-varying regressor is omitted. Take differences with adjacent time periods and run the following regression when t = 1, 2, and 3: y it = α 0 + α 3 d3 t + β 1 x it + u it for t = 2, / 40
24 Additional Remarks on FD Estimator Due to the expansion over the time dimension, serial correlation may arise. Also, we cannot exclude the heteroskedasticity problem. Since we use the OLS estimator, we can apply the White correction or the HAC estimation method as before. 20 / 40
25 Fixed Effect Estimator Consider a simple error component model again: y it = β 1 x it + a i + u it, t = 1,..., T and i = 1,..., n. We assume that the idiosyncratic error u it is innocuous in the sense: E(u it X i ) = 0 or E(u it x it ) = 0. However, the individual fixed effect a i could be arbitrarily correlated with x it. We have already known that the FD estimator cancels out the unobserved heterogeneity a i. 21 / 40
26 Fixed Effect Estimator There is a different way to cancel out unobserved heterogeneity. First, fix the individual i and take an average over time: ȳ i = β 1 x i + a i + ū i. where ȳ i = 1 T T y it, x i = 1 T t=1 T x it, and ū i = 1 T t=1 T u it. t=1 The point is ā i = 1 T T a i = 1 T Ta i = a i. t=1 22 / 40
27 Fixed Effect Estimator Now, take a difference between two equations: y it = β 1 x it + a i + u it, t = 1, 2,..., T. ȳ i = β 1 x i + a i + ū i. Then, what we have is y it ȳ i = β 1 (x it x i ) + (u it ū i ), t = 1, 2,..., T or ÿ it = β 1 ẍ it + ü it, t = 1, 2,..., T. We may apply the pooled OLS on the last equation. 23 / 40
28 Fixed Effect Estimator The FE estimator uses information from within group (i) variation: ÿ i1 = y i1 ȳ i ÿ i2 = y i2 ȳ i. ÿ it = y it ȳ i For this reason, the FE estimator is also called within estimator. This can be readily extended to a multiple regression model: ÿ it = β 1 ẍ 1it + β 2 ẍ 2it β k ẍ kit + ü it 24 / 40
29 Fixed Effect Estimator FD vs. FE If T = 2, the FD estimator and the FE estimator are identical: ( ) yi1 + y i2 ÿ i2 y i2 ȳ i = y i2 = y 1 y y i2. Therefore, ÿ i2 = β 1 ẍ it + ü it 1 2 y i2 = β x i u i2 y i2 = β 1 x i2 + u i2 However, they are different in a finite sample if T > 2. Unless there is a unit root (or severe serial correlation) problem, you would better use the FE estimator. 25 / 40
30 Random Effect Estimator In the random effect model: we assume that y it = β 0 + β 1 x it + a i + u it, Cov(x it, a i ) = 0. Then, we come back to the nice world where we don t need to cancel out a i. Just use the pooled OLS? 26 / 40
31 Random Effect Estimator In the random effect model: we assume that y it = β 0 + β 1 x it + a i + u it, Cov(x it, a i ) = 0. Then, we come back to the nice world where we don t need to cancel out a i. Just use the pooled OLS? No. There is a serial correlation problem. 26 / 40
32 Random Effect Estimator Serial Correlation in the RE model We have two components in the error term: v it = a i + u it Suppose that u it is totally innocuous again: Cov(a i, u it ) = Cov(u it, u is ) = 0 for t s. Now, we calculate Corr(v it, v is ) and show that it is not zero: Var(v it ) = Var(a i + u it ) = σ 2 a + σ 2 u Cov(v it, v is ) = E ((a i + u it )(a i + u is )) = E(a 2 i + a i u is + a i u it + u it u is ) = E(a 2 i ) = σ 2 a 27 / 40
33 Random Effect Estimator Serial Correlation in the RE model Therefore, Corr(v it, v is ) = σ2 a σ 2 a + σ 2 u 0 Any inference based on the pooled OLS would be incorrect. However, we know how to fix this problem. Do GLS! We want to transform the original model into ỹ it = β 0 + β 1 x it + ṽ it where ṽ it does not have the serial correlation anymore. 28 / 40
34 Random Effect Estimator We multiplied ρ and took a difference when there is a AR(1) serial correlation. In this case, we multiply and take a difference as [ σu 2 λ = 1 σu 2 + T σa 2 ] (1/2) y it λȳ i = β 0 (1 λ) + β 1 (x it λ x i ) + v it λ v i We can show that ṽ it (= v it λ v i ) is not serially correlated. The λ should be estimated by ˆλ. This specific GLS estimator is called the Random Effect (RE) estimator. 29 / 40
35 Random Effect Estimator The RE estimator is something between the pooled OLS and the FE estimator. Note that in Equation: y it λȳ i = β 0 (1 λ) + β 1 (x it λ x i ) + v it λ v i, it becomes the pooled OLS when λ = 0, and does the FE estimator when λ = 1. The λ is always between 0 and 1 in the RE model. As T, the FE and RE estimators are equivalent since λ / 40
36 Random Effect Estimator RE vs. FE If you believe that there is obvious endogenous fixed factor, a i, in your model, you should use the FE estimator. Otherwise, the RE estimator will tell you more: non time-varying regressors, efficiency etc. Keep in mind that the RE estimator is not even consistent if Cov(x it, a i ) 0. We can test whether Cov(x it, a i ) = 0 or not. 31 / 40
37 Random Effect Estimator Hausman Test The idea of the Hausman test is simple. The null hypothesis is H 0 :Cov(x it, a i ) = 0 H 1 :Cov(x it, a i ) 0 Under H 0, both RE and FE are consistent: p p β RE β, βfe β. Thus, we can expect that β RE β FE. However, under H 1, only β FE is consistent. Therefore, we reject H 0 if the difference between β RE and β FE is large enough. 32 / 40
38 Outline Data Structure Policy Evaluation with Pooled Cross Sections Three Approaches in Panel Data Estimation First Difference (FD) Estimator Fixed Effect (FE) Estimator Random Effect (RE) Estimator Empirical Application: Smoking on Birth Outcomes Concluding Remarks 33 / 40
39 Empirical Application: Smoking on Birth Outcomes Infants born to women who smoke during pregnancy have a lower average birthweight... Low birthweight is associated with increased risk for neonatal, perinatal, and infant morbidity and mortality. (Women and Smoking: A Report of the Surgeon General, 2001, requoted from Abrevaya (2006)) 34 / 40
40 Empirical Application: Smoking on Birth Outcomes The direct medical costs: According to the estimates of Lewit et al. (1995), the low-birthweight (LBW) infants (less than 10% of births) account for more than 1/3 of health care costs during the first year of life. The long-term costs: Hack et al. (1995) find that LBW babies have developmental problems in cognition, attention and neuromotor functioning that persist until adolescence. (Abrevaya (2006)) 35 / 40
41 Empirical Application: Smoking on Birth Outcomes How to Estimate The OLS estimates would be biased into the negative direction due to endogeneity. IV estimation? 492 Comparison between J. ABREVAYA OLS and IV estimates from Abrevaya (2006) 36 / 40
42 Empirical Application: Smoking on Birth Outcomes The fixed-effect (FE) estimation can be used if panel data are available. Abrevaya (2006) constructed a pseudo panel data set and showed that the FE estimate is smaller than that of the OLS. y ib = x ib β + γs ib + c i + u ib where i is Mom s id and b is the order of a baby from Mom i. The estimation results for γ by OLS and FE are (3.20) (4.75), respectively. 37 / 40
43 Concluding Remarks 38 / 40
44 Concluding Remarks Pooled cross sections are very similar to a single cross section, but observations across different time points help evaluate the correct policy effect. 38 / 40
45 Concluding Remarks Pooled cross sections are very similar to a single cross section, but observations across different time points help evaluate the correct policy effect. Extra information contained in panel data enables us to control for the individual fixed effect by FD and FE estimators. 38 / 40
46 Concluding Remarks Pooled cross sections are very similar to a single cross section, but observations across different time points help evaluate the correct policy effect. Extra information contained in panel data enables us to control for the individual fixed effect by FD and FE estimators. If the fixed effect is not correlated with regressors, we can apply RE estimator, which is a GLS estimator. 38 / 40
47 Concluding Remarks Pooled cross sections are very similar to a single cross section, but observations across different time points help evaluate the correct policy effect. Extra information contained in panel data enables us to control for the individual fixed effect by FD and FE estimators. If the fixed effect is not correlated with regressors, we can apply RE estimator, which is a GLS estimator. Panel data are not restricted to the individual-time structure. 38 / 40
48 Stata Commands Load the data set filename.dta. First, we need to set an id variable and a time variable. Check the relevant variable names. xtset id time. Now type xtsum. The command for the FE estimator is xtreg dep x1 x2 x3,..., fe The command for the RE estimator is xtreg dep x1 x2 x3,..., re 39 / 40
Econometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 6 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 21 Recommended Reading For the today Advanced Panel Data Methods. Chapter 14 (pp.
More informationLecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)
Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) 1 2 Panel Data Panel data is obtained by observing the same person, firm, county, etc over several periods. Unlike the pooled cross sections,
More informationApplied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid
Applied Economics Panel Data Department of Economics Universidad Carlos III de Madrid See also Wooldridge (chapter 13), and Stock and Watson (chapter 10) 1 / 38 Panel Data vs Repeated Cross-sections In
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 10: Panel Data Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 10 VŠE, SS 2016/17 1 / 38 Outline 1 Introduction 2 Pooled OLS 3 First differences 4 Fixed effects
More informationDealing With Endogeneity
Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares
Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit
More informationLecture 4: Linear panel models
Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)
More informationLinear Panel Data Models
Linear Panel Data Models Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania October 5, 2009 Michael R. Roberts Linear Panel Data Models 1/56 Example First Difference
More informationWeek 2: Pooling Cross Section across Time (Wooldridge Chapter 13)
Week 2: Pooling Cross Section across Time (Wooldridge Chapter 13) Tsun-Feng Chiang* *School of Economics, Henan University, Kaifeng, China March 3, 2014 1 / 30 Pooling Cross Sections across Time Pooled
More informationRepeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data
Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible
More informationEC327: Advanced Econometrics, Spring 2007
EC327: Advanced Econometrics, Spring 2007 Wooldridge, Introductory Econometrics (3rd ed, 2006) Chapter 14: Advanced panel data methods Fixed effects estimators We discussed the first difference (FD) model
More informationEconometrics. Week 4. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 4 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 23 Recommended Reading For the today Serial correlation and heteroskedasticity in
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationEconometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage
More informationInstrumental Variables and the Problem of Endogeneity
Instrumental Variables and the Problem of Endogeneity September 15, 2015 1 / 38 Exogeneity: Important Assumption of OLS In a standard OLS framework, y = xβ + ɛ (1) and for unbiasedness we need E[x ɛ] =
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationPanel data methods for policy analysis
IAPRI Quantitative Analysis Capacity Building Series Panel data methods for policy analysis Part I: Linear panel data models Outline 1. Independently pooled cross sectional data vs. panel/longitudinal
More informationOrdinary Least Squares Regression
Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section
More information1 Correlation between an independent variable and the error
Chapter 7 outline, Econometrics Instrumental variables and model estimation 1 Correlation between an independent variable and the error Recall that one of the assumptions that we make when proving the
More informationINTRODUCTION TO BASIC LINEAR REGRESSION MODEL
INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect
More informationNotes on Panel Data and Fixed Effects models
Notes on Panel Data and Fixed Effects models Michele Pellizzari IGIER-Bocconi, IZA and frdb These notes are based on a combination of the treatment of panel data in three books: (i) Arellano M 2003 Panel
More informationEconometrics Homework 4 Solutions
Econometrics Homework 4 Solutions Question 1 (a) General sources of problem: measurement error in regressors, omitted variables that are correlated to the regressors, and simultaneous equation (reverse
More informationBasic Regressions and Panel Data in Stata
Developing Trade Consultants Policy Research Capacity Building Basic Regressions and Panel Data in Stata Ben Shepherd Principal, Developing Trade Consultants 1 Basic regressions } Stata s regress command
More informationLECTURE 11. Introduction to Econometrics. Autocorrelation
LECTURE 11 Introduction to Econometrics Autocorrelation November 29, 2016 1 / 24 ON PREVIOUS LECTURES We discussed the specification of a regression equation Specification consists of choosing: 1. correct
More informationLecture 5: Omitted Variables, Dummy Variables and Multicollinearity
Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the
More informationEconometrics Review questions for exam
Econometrics Review questions for exam Nathaniel Higgins nhiggins@jhu.edu, 1. Suppose you have a model: y = β 0 x 1 + u You propose the model above and then estimate the model using OLS to obtain: ŷ =
More informationECONOMETRICS HONOR S EXAM REVIEW SESSION
ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes
More informationEmpirical Application of Simple Regression (Chapter 2)
Empirical Application of Simple Regression (Chapter 2) 1. The data file is House Data, which can be downloaded from my webpage. 2. Use stata menu File Import Excel Spreadsheet to read the data. Don t forget
More informationCHAPTER 6: SPECIFICATION VARIABLES
Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero
More informationChapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE
Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over
More informationEmpirical Application of Panel Data Regression
Empirical Application of Panel Data Regression 1. We use Fatality data, and we are interested in whether rising beer tax rate can help lower traffic death. So the dependent variable is traffic death, while
More informationEc1123 Section 7 Instrumental Variables
Ec1123 Section 7 Instrumental Variables Andrea Passalacqua Harvard University andreapassalacqua@g.harvard.edu November 16th, 2017 Andrea Passalacqua (Harvard) Ec1123 Section 7 Instrumental Variables November
More informationFixed Effects Models for Panel Data. December 1, 2014
Fixed Effects Models for Panel Data December 1, 2014 Notation Use the same setup as before, with the linear model Y it = X it β + c i + ɛ it (1) where X it is a 1 K + 1 vector of independent variables.
More informationEcon 582 Fixed Effects Estimation of Panel Data
Econ 582 Fixed Effects Estimation of Panel Data Eric Zivot May 28, 2012 Panel Data Framework = x 0 β + = 1 (individuals); =1 (time periods) y 1 = X β ( ) ( 1) + ε Main question: Is x uncorrelated with?
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Friday, June 5, 009 Examination time: 3 hours
More information10 Panel Data. Andrius Buteikis,
10 Panel Data Andrius Buteikis, andrius.buteikis@mif.vu.lt http://web.vu.lt/mif/a.buteikis/ Introduction Panel data combines cross-sectional and time series data: the same individuals (persons, firms,
More informationECON Introductory Econometrics. Lecture 16: Instrumental variables
ECON4150 - Introductory Econometrics Lecture 16: Instrumental variables Monique de Haan (moniqued@econ.uio.no) Stock and Watson Chapter 12 Lecture outline 2 OLS assumptions and when they are violated Instrumental
More informationMotivation for multiple regression
Motivation for multiple regression 1. Simple regression puts all factors other than X in u, and treats them as unobserved. Effectively the simple regression does not account for other factors. 2. The slope
More informationTopic 10: Panel Data Analysis
Topic 10: Panel Data Analysis Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Introduction Panel data combine the features of cross section data time series. Usually a panel
More informationEcon 1123: Section 5. Review. Internal Validity. Panel Data. Clustered SE. STATA help for Problem Set 5. Econ 1123: Section 5.
Outline 1 Elena Llaudet 2 3 4 October 6, 2010 5 based on Common Mistakes on P. Set 4 lnftmpop = -.72-2.84 higdppc -.25 lackpf +.65 higdppc * lackpf 2 lnftmpop = β 0 + β 1 higdppc + β 2 lackpf + β 3 lackpf
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled
More informationHomework Set 2, ECO 311, Fall 2014
Homework Set 2, ECO 311, Fall 2014 Due Date: At the beginning of class on October 21, 2014 Instruction: There are twelve questions. Each question is worth 2 points. You need to submit the answers of only
More informationMetrics Honors Review
Metrics Honors Review petertu@fas.harvard.edu Harvard University Department of Economics 26 March 2015 : Logistics Exam Date: Wednesday, April 8 from 3-6pm in Sever Hall 113 The exam covers material from
More informationECO375 Tutorial 9 2SLS Applications and Endogeneity Tests
ECO375 Tutorial 9 2SLS Applications and Endogeneity Tests Matt Tudball University of Toronto Mississauga November 23, 2017 Matt Tudball (University of Toronto) ECO375H5 November 23, 2017 1 / 33 Hausman
More informationJob Training Partnership Act (JTPA)
Causal inference Part I.b: randomized experiments, matching and regression (this lecture starts with other slides on randomized experiments) Frank Venmans Example of a randomized experiment: Job Training
More informationOne Economist s Perspective on Some Important Estimation Issues
One Economist s Perspective on Some Important Estimation Issues Jere R. Behrman W.R. Kenan Jr. Professor of Economics & Sociology University of Pennsylvania SRCD Seattle Preconference on Interventions
More informationEC402 - Problem Set 3
EC402 - Problem Set 3 Konrad Burchardi 11th of February 2009 Introduction Today we will - briefly talk about the Conditional Expectation Function and - lengthily talk about Fixed Effects: How do we calculate
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationEconometrics of causal inference. Throughout, we consider the simplest case of a linear outcome equation, and homogeneous
Econometrics of causal inference Throughout, we consider the simplest case of a linear outcome equation, and homogeneous effects: y = βx + ɛ (1) where y is some outcome, x is an explanatory variable, and
More informationGov 2000: 13. Panel Data and Clustering
Gov 2000: 13. Panel Data and Clustering Matthew Blackwell Harvard University mblackwell@gov.harvard.edu November 28, 2016 Where are we? Where are we going? Up until now: the linear regression model, its
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao
More informationEconometrics Honor s Exam Review Session. Spring 2012 Eunice Han
Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity
More informationEconometrics in a nutshell: Variation and Identification Linear Regression Model in STATA. Research Methods. Carlos Noton.
1/17 Research Methods Carlos Noton Term 2-2012 Outline 2/17 1 Econometrics in a nutshell: Variation and Identification 2 Main Assumptions 3/17 Dependent variable or outcome Y is the result of two forces:
More informationApplied Health Economics (for B.Sc.)
Applied Health Economics (for B.Sc.) Helmut Farbmacher Department of Economics University of Mannheim Autumn Semester 2017 Outlook 1 Linear models (OLS, Omitted variables, 2SLS) 2 Limited and qualitative
More informationPBAF 528 Week 8. B. Regression Residuals These properties have implications for the residuals of the regression.
PBAF 528 Week 8 What are some problems with our model? Regression models are used to represent relationships between a dependent variable and one or more predictors. In order to make inference from the
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)
More informationIV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors
IV Estimation and its Limitations: Weak Instruments and Weakly Endogeneous Regressors Laura Mayoral IAE, Barcelona GSE and University of Gothenburg Gothenburg, May 2015 Roadmap Deviations from the standard
More informationInternal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.
Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationLinear Models in Econometrics
Linear Models in Econometrics Nicky Grant At the most fundamental level econometrics is the development of statistical techniques suited primarily to answering economic questions and testing economic theories.
More informationMultiple Equation GMM with Common Coefficients: Panel Data
Multiple Equation GMM with Common Coefficients: Panel Data Eric Zivot Winter 2013 Multi-equation GMM with common coefficients Example (panel wage equation) 69 = + 69 + + 69 + 1 80 = + 80 + + 80 + 2 Note:
More informationCapital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part)
Séminaire d Analyse Economique III (LECON2486) Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part) Frédéric Docquier & Sara Salomone IRES UClouvain
More informationExercise sheet 6 Models with endogenous explanatory variables
Exercise sheet 6 Models with endogenous explanatory variables Note: Some of the exercises include estimations and references to the data files. Use these to compare them to the results you obtained with
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 2 Jakub Mućk Econometrics of Panel Data Meeting # 2 1 / 26 Outline 1 Fixed effects model The Least Squares Dummy Variable Estimator The Fixed Effect (Within
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationMultiple Regression Analysis: Heteroskedasticity
Multiple Regression Analysis: Heteroskedasticity y = β 0 + β 1 x 1 + β x +... β k x k + u Read chapter 8. EE45 -Chaiyuth Punyasavatsut 1 topics 8.1 Heteroskedasticity and OLS 8. Robust estimation 8.3 Testing
More informationRockefeller College University at Albany
Rockefeller College University at Albany PAD 705 Handout: Simultaneous quations and Two-Stage Least Squares So far, we have studied examples where the causal relationship is quite clear: the value of the
More informationEconomics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 2 Fall 2016 Instructor: Martin Farnham
Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 2 Fall 2016 Instructor: Martin Farnham Last name (family name): First name (given name): Student ID
More informationEmpirical approaches in public economics
Empirical approaches in public economics ECON4624 Empirical Public Economics Fall 2016 Gaute Torsvik Outline for today The canonical problem Basic concepts of causal inference Randomized experiments Non-experimental
More informationPanel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63
1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:
More informationEconomics 582 Random Effects Estimation
Economics 582 Random Effects Estimation Eric Zivot May 29, 2013 Random Effects Model Hence, the model can be re-written as = x 0 β + + [x ] = 0 (no endogeneity) [ x ] = = + x 0 β + + [x ] = 0 [ x ] = 0
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3
More informationIntroduction to Linear Regression Analysis Interpretation of Results
Introduction to Linear Regression Analysis Interpretation of Results Samuel Nocito Lecture 2 March 8th, 2018 Lecture 1 Summary Why and how we use econometric tools in empirical research. Ordinary Least
More informationØkonomisk Kandidateksamen 2004 (I) Econometrics 2. Rettevejledning
Økonomisk Kandidateksamen 2004 (I) Econometrics 2 Rettevejledning This is a closed-book exam (uden hjælpemidler). Answer all questions! The group of questions 1 to 4 have equal weight. Within each group,
More informationHeteroskedasticity. Part VII. Heteroskedasticity
Part VII Heteroskedasticity As of Oct 15, 2015 1 Heteroskedasticity Consequences Heteroskedasticity-robust inference Testing for Heteroskedasticity Weighted Least Squares (WLS) Feasible generalized Least
More informationFortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7
Fortin Econ 495 - Econometric Review 1 Contents 1 Panel Data Methods 2 1.1 Fixed Effects......................... 2 1.1.1 Dummy Variables Regression............ 7 1.1.2 First Differencing Methods.............
More informationPanel Data: Linear Models
Panel Data: Linear Models Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini Laura Magazzini (@univr.it) Panel Data: Linear Models 1 / 45 Introduction Outline What
More informationECO375 Tutorial 8 Instrumental Variables
ECO375 Tutorial 8 Instrumental Variables Matt Tudball University of Toronto Mississauga November 16, 2017 Matt Tudball (University of Toronto) ECO375H5 November 16, 2017 1 / 22 Review: Endogeneity Instrumental
More informationEMERGING MARKETS - Lecture 2: Methodology refresher
EMERGING MARKETS - Lecture 2: Methodology refresher Maria Perrotta April 4, 2013 SITE http://www.hhs.se/site/pages/default.aspx My contact: maria.perrotta@hhs.se Aim of this class There are many different
More informationPhD/MA Econometrics Examination January 2012 PART A
PhD/MA Econometrics Examination January 2012 PART A ANSWER ANY TWO QUESTIONS IN THIS SECTION NOTE: (1) The indicator function has the properties: (2) Question 1 Let, [defined as if using the indicator
More informationEconometrics Summary Algebraic and Statistical Preliminaries
Econometrics Summary Algebraic and Statistical Preliminaries Elasticity: The point elasticity of Y with respect to L is given by α = ( Y/ L)/(Y/L). The arc elasticity is given by ( Y/ L)/(Y/L), when L
More information1 The Multiple Regression Model: Freeing Up the Classical Assumptions
1 The Multiple Regression Model: Freeing Up the Classical Assumptions Some or all of classical assumptions were crucial for many of the derivations of the previous chapters. Derivation of the OLS estimator
More informationα version (only brief introduction so far)
Econometrics I KS Module 8: Panel Data Econometrics Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: June 18, 2018 α version (only brief introduction so far) Alexander
More informationMULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS
MULTIPLE REGRESSION AND ISSUES IN REGRESSION ANALYSIS Page 1 MSR = Mean Regression Sum of Squares MSE = Mean Squared Error RSS = Regression Sum of Squares SSE = Sum of Squared Errors/Residuals α = Level
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationAdvanced Econometrics
Based on the textbook by Verbeek: A Guide to Modern Econometrics Robert M. Kunst robert.kunst@univie.ac.at University of Vienna and Institute for Advanced Studies Vienna May 16, 2013 Outline Univariate
More informationLeast Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates
Least Squares Estimation of a Panel Data Model with Multifactor Error Structure and Endogenous Covariates Matthew Harding and Carlos Lamarche January 12, 2011 Abstract We propose a method for estimating
More information1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE
1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +
More information8. Instrumental variables regression
8. Instrumental variables regression Recall: In Section 5 we analyzed five sources of estimation bias arising because the regressor is correlated with the error term Violation of the first OLS assumption
More informationCasuality and Programme Evaluation
Casuality and Programme Evaluation Lecture V: Difference-in-Differences II Dr Martin Karlsson University of Duisburg-Essen Summer Semester 2017 M Karlsson (University of Duisburg-Essen) Casuality and Programme
More informationLecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)
Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 4 Jakub Mućk Econometrics of Panel Data Meeting # 4 1 / 26 Outline 1 Two-way Error Component Model Fixed effects model Random effects model 2 Hausman-Taylor
More informationIntroduction to Econometrics
Introduction to Econometrics STAT-S-301 Experiments and Quasi-Experiments (2016/2017) Lecturer: Yves Dominicy Teaching Assistant: Elise Petit 1 Why study experiments? Ideal randomized controlled experiments
More informationOutline. Nature of the Problem. Nature of the Problem. Basic Econometrics in Transportation. Autocorrelation
1/30 Outline Basic Econometrics in Transportation Autocorrelation Amir Samimi What is the nature of autocorrelation? What are the theoretical and practical consequences of autocorrelation? Since the assumption
More informationCluster-Robust Inference
Cluster-Robust Inference David Sovich Washington University in St. Louis Modern Empirical Corporate Finance Modern day ECF mainly focuses on obtaining unbiased or consistent point estimates (e.g. indentification!)
More informationPractice exam questions
Practice exam questions Nathaniel Higgins nhiggins@jhu.edu, nhiggins@ers.usda.gov 1. The following question is based on the model y = β 0 + β 1 x 1 + β 2 x 2 + β 3 x 3 + u. Discuss the following two hypotheses.
More informationMgmt 469. Causality and Identification
Mgmt 469 Causality and Identification As you have learned by now, a key issue in empirical research is identifying the direction of causality in the relationship between two variables. This problem often
More informationEconomics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Fall 2016 Instructor: Martin Farnham
Economics 345: Applied Econometrics Section A01 University of Victoria Midterm Examination #2 Version 1 SOLUTIONS Fall 2016 Instructor: Martin Farnham Last name (family name): First name (given name):
More informationExam D0M61A Advanced econometrics
Exam D0M61A Advanced econometrics 19 January 2009, 9 12am Question 1 (5 pts.) Consider the wage function w i = β 0 + β 1 S i + β 2 E i + β 0 3h i + ε i, where w i is the log-wage of individual i, S i is
More information