Introduction to Regression Models for Panel Data Analysis. Indiana University Workshop in Methods February 13, Professor Patricia A.
|
|
- Roger Fleming
- 6 years ago
- Views:
Transcription
1 Introduction to Regression Models for Panel Data Analysis Indiana Universy Workshop in Methods February 13, 2015 Professor Patricia A. McManus Panel Data Analysis February 2015
2 What are Panel Data? Panel data are a type of longudinal data, or data collected at different points in time. Three main types of longudinal data: Time series data. Many observations (large t) on as few as one un (small N). Examples: stock price trends, aggregate national statistics. Pooled cross sections. Two or more independent samples of many uns (large N) drawn from the same population at different time periods: o General Social Surveys o US Decennial Census extracts o Current Population Surveys* Panel data. Two or more observations (small t) on many uns (large N). o Panel surveys of households and individuals (PSID, NLSY, ANES) o Data on organizations and firms at different time points o Aggregated regional data over time This workshop is a basic introduction to the analysis of panel data. In particular, I will cover the linear error components model. WIM Panel Data Analysis October 2015 Page 1
3 Why Analyze Panel Data? We are interested in describing change over time o social change, e.g. changing attudes, behaviors, social relationships o individual growth or development, e.g. life-course studies, child development, career trajectories, school achievement o occurrence (or non-occurrence) of events We want superior estimates trends in social phenomena o Panel models can be used to inform policy e.g. health, obesy o Multiple observations on each un can provide superior estimates as compared to cross-sectional models of association We want to estimate causal models o Policy evaluation o Estimation of treatment effects WIM Panel Data Analysis October 2015 Page 2
4 What kind of data are required for panel analysis? Basic panel methods require at least two waves of measurement. Consider student GPAs and job hours during two semesters of college. One way to organize the panel data is to create a single record for each combination of un and time period: StudentID Semester Female HSGPA GPA JobHrs Notice that the data include: o A time-invariant unique identifier for each un (StudentID) o A time-varying outcome (GPA) o An indicator for time (Semester). Panel datasets can include other time-varying or time-invariant variables WIM Panel Data Analysis October 2015 Page 3
5 An alternative way to structure the data is to keep all the measures related to each student in a single record. This is sometimes called wide format. StudentID Female HSGPA GPA5 JobHrs5 GPA6 JobHrs o Why are there two variables for GPA and JobHrs? o Why is there only one variable for gender and high school GPA? o Where is the indicator for time? WIM Panel Data Analysis October 2015 Page 4
6 Estimation Techniques for Panel Models We can wre a simple panel equation predicting GPA from hours worked: GPA = b + TERM b + HSGPA b + JOB b + v 0 T H J General Linear Model is the foundation of linear panel model estimation o Ordinary Least Squares (OLS) o Weighted least squares (WLS) o Generalized least squares (GLS) Least-squares estimation of panel models typically entails three steps: (a) Data transformation or first-stage estimation (b) Estimation of the parameters using Ordinary Least Squares (c) Estimation of the variance-covariance matrix of the estimates (VCE) Parameter estimates are sometimes refined using eratively reweighted least squares (IRLS), a maximum likelihood estimator. WIM Panel Data Analysis October 2015 Page 5
7 Basic Questions for the Panel Analyst What s the story you want to tell? Is this a descriptive analysis? Less worry, fewer controls are usually better. Is this an attempt at causal analysis using observational data? Careful specification AND theory are essential. How does time matter? Some analyses, e.g. difference-in-difference analysis associates time wh an event (before and after) Some analyses may be interested in growth trajectories. Panel analysis may be appropriate even if time is irrelevant. Panel models using cross-sectional data collected at fixed periods of time generally use dummy variables for each time period in a two-way specification wh fixed-effects for time. Are the data up to the demands of the analysis? Panel analysis is data-intensive. Two waves are a bare minimum Can you perform the necessary specification tests? How will you address panel attrion? WIM Panel Data Analysis October 2015 Page 6
8 Review of the Classical Linear Regression Model y = b + x b + x b + + x b + u, i=1,2,3, N i 0 1i 1 2i 2... ki k i Where we assume that the linear model is correct and: E u x, x,.., x = 0 Covariates are Exogenous: ( ) Uncorrelated errors: ( ) i 1i 2i ki Cov u, u = 0 Homoskedastic errors: ( ) ( ) 2 i j Var u = Var y x, x,..., x = s i i 1i 2i ki If assumptions do not hold, OLS estimates are BIASED and/or INEFFICIENT Biased - Expected value of parameter estimate is different from true. o Consistency. If an estimator is unbiased, or if the bias shrinks as the sample size increases, is CONSISTENT Inefficient - An estimator is inefficient if an alternative estimator converges more rapidly on the true coefficients as sample size increases. o Estimators that explo all available information are more efficient WIM Panel Data Analysis October 2015 Page 7
9 OLS Bias Due to Endogeney Omted Variable Bias o Intervening variables, selectivy Measurement Error in the Covariates Simultaney Bias o Feedback loops o Omted variables Conventional regression-based strategies to address endogeney bias Instrumental Variables estimation Structural Equations Models Propensy score estimation Fixed effects panel models WIM Panel Data Analysis October 2015 Page 8
10 Illustration of Whin-un correlation. Peak-flow Measurements Wright Peak Flow measures Subject ID Wright Peak #1 Wright Peak #2 WIM Panel Data Analysis October 2015 Page 9
11 OLS Inefficiency due to Correlated Errors Many data structures are susceptible to error correlation: Hierarchical data sample multiple individuals from each un, e.g. household members, employees in firms, multiple pupils from each school. Multistage probabily samples often incorporate cluster-based sampling designs wh errors that may be correlated whin clusters. Repeated observations data often show whin-un error correlation. Time series data often have errors that are serially correlated, that is, correlated over time. Panel data have errors that can be correlated whin un (e.g. individuals), whin period. Conventional regression-based strategies to address correlated errors Cluster-consistent covariance matrix estimator to adjust standard errors. Generalized Least Squares instead of OLS to explo correlation structure. Generalized Estimation Equations (GEE) Mixed Effects Estimators for multilevel models WIM Panel Data Analysis October 2015 Page 10
12 Linear Panel Data Model (LPM) Suppose the data are on each cross-section un over T time periods: y y = x ' b, u,1,1 t1,1 = x ' b, u,2,2 t2,1 ::: y = x ' b, u it, it, T it,, t=1,2,,t We can express this concisely using y i to represent the vector of individual outcomes for person i across all time periods: y = X b + u i i i, where y ' i = y,1, y,2,..., yit For comparison, begin wh two conventional OLS linear regression models, one for each period. Note that the variables female highgpa (HS GPA) are time-invariant. WIM Panel Data Analysis October 2015 Page 11
13 OLS Results for each term: Term 5 GPA Term 6 GPA Estimate SE t-stat Estimate SE t-stat Intercept jobhrs female highgpa Pooled OLS Results for both terms: Term 5&6 GPA Term 5&6 GPA (Clustered SE) Estimate SE t-stat Estimate SE t-stat Intercept jobhrs female highgpa term WIM Panel Data Analysis October 2015 Page 12
14 Linear Unobserved Effects Panel Data Model Motivation: Unobserved heterogeney Suppose we have a model wh an unobserved, time-constant variable c: y = c + x c + x c + + x c + c + u k k Where u is uncorrelated wh all explanatory variables in x. Because c is unobserved is absorbed into the error term, so we can wre the model as follows: y = c + x c + x c + + x c + v v = c + u k k The error term v consists of two components, an idiosyncratic component u and an unobserved heterogeney component c. WIM Panel Data Analysis October 2015 Page 13
15 OLS Estimation of the Error Components Model If the unobserved heterogeney c i is correlated wh one or more of the explanatory variables, OLS parameter estimates are biased and inconsistent. If the unobserved heterogeney c is uncorrelated wh the explanatory variables in x i, OLS is unbiased even in a single cross-section. If we have more than one observation on any un, the errors will be correlated and OLS estimates will be inefficient y = c, x c, x c,..., x c, v i, k k i,1 i1 i1 i1 y = c, x c, x c,..., x c, v i, k k i,2 v = c, u i,1 i i,1 v = c, u i,2 i i,2 cov(v, v ) ¹ 0 i,1 i,2 i2 i2 i2 WIM Panel Data Analysis October 2015 Page 14
16 Unobserved Heterogeney in Panel Data Suppose the data are on each cross-section un over T time periods. This is an unobserved effects model (UEM), also called the error components model. We can wre the model for each time period: y = x c + c + u i1 i1 i i1 y = x c + c + u i2 i2 i i2 y = x c + c + u it it i it, Where there are T observations on outcome y for person i, x is a vector of explanatory variables measured at time t, c i is unobserved in all periods but constant over time u is a time-varying idiosyncratic error Define v = c i + u as the compose error. WIM Panel Data Analysis October 2015 Page 15
17 Consistent estimation of the Error Components Model wh Pooled OLS If we assume no contemporaneous correlation of the errors and the explanatory variables, pooled OLS estimation is consistent: ' E ( x u ) = 0 and ' i E( x c ) = 0, t=1,2,,t Efficient estimation of the Error Components Model wh Pooled OLS Even if estimation is consistent, pooled OLS may not be efficient. One strategy is to combine pooled OLS wh cluster-consistent standard errors. Panel methods over OLS to explo OR remove unobserved heterogeney. In the next sections, we consider the dominant approaches to estimation of the error components panel model: fixed effects and random effects. WIM Panel Data Analysis October 2015 Page 16
18 Just a few panel data examples: Wage penalty for motherhood Men s wage premium for heterosexual marriage Effect of regulation of nursing pay on hospal qualy Effect of Incarceration on wages and income inequaly Effect of parental divorce on mental health over life-course Determinants of Death Penalty in US states Effect of Democracy on Human Capal and Economic Growth WIM Panel Data Analysis October 2015 Page 17
19 Fixed Effects Methods for Panel Data Suppose the unobserved effect c i is correlated wh the covariates. Example: Motherhood wage penalty We observe that mothers earn less than other women, cet par. b ˆ KIDS = in a log wage model suggests that each addional OLS child reduces mothers hourly wages by about 8% But if women who are less oriented towards work are also more likely to have more children, omting work orientations from the model will bias the coefficient on children. Fixed-effects methods transform the model to remove c i b ˆ KIDS = FE estimates a persistent but much smaller penalty. FE WIM Panel Data Analysis October 2015 Page 18
20 Caution: Fixed effects has some disadvantages FE is not a panacea for all sources of endogeney bias. time-varying unobserved effects time-varying measurement error simultaney or feedback loops All time-constant effects are removed. No estimation of effects of race, gender, birth order, etc. Poor estimates if ltle variation (e.g. education in adulthood) FE trades consistency for efficiency. FE uses only whin-un change, ignores between-un variation. Parameter estimates may be imprecise, standard errors large. Despe limations, FE is an indispensable tool in the panel analyst s toolbox. WIM Panel Data Analysis October 2015 Page 19
21 Fixed Effects Transformation - the Whin Estimator Suppose we have the UEM model: ' = + i + y x c c u, t=1,2,,t For each un, average this equation over all time periods t: ' i = xic + i + i y c u Subtract the whin-un average from each observation on that un: ' ' ( ) ( ) ( ) y - y = x - x c + c - c + u -u, t=1,2,,t i i i i i This is the fixed effects transformation. We can wre as: y, ' = xb + u where ci - ci = 0 and y = y -y, x i = x -xi, u = u -ui and x does not contain an intercept term. WIM Panel Data Analysis October 2015 Page 20
22 The fixed-effects estimator, also called the whin estimator, applies pooled OLS to the transformed equation: -1-1 N N N T N T ' ' ' ' FE = å XX i i å Xy i i = åå xx åå xy çi= 1 çi= 1 çi= 1 t= 1 çi= 1 t= 1 æ ö æ ö æ ö æ ö ˆ b è ø è ø è ø è ø Recall the student GPA Data: StudentID Semester Female HSGPA GPA JobHrs After applying the fixed-effects transform, the demeaned (mean-centered) data: StudentID Semester CFemale CHSGPA CGPA CJobHrs WIM Panel Data Analysis October 2015 Page 21
23 Fixed Effects Dummy Variables Regression Up to now, we ve treated the unobservables c i as random variables: ' = xc + i + y c u An alternative approach is to treat c i as a fixed parameter for each un. In this case, we can use dummy variables regression to estimate c i. Step one: Create a dummy variable for each of sample un i Step two: Substute the vector of N-1 dummies for c i : ' = g g2 + 3 g gn + y x b d d dn u, (where the intercept g 1 estimates the effect when d 1=1) Step three: Estimate the equation using pooled OLS. The fixed effects dummy variables (FEDV) estimator produces precisely the same coefficient vector and standard errors as the FE estimator. WIM Panel Data Analysis October 2015 Page 22
24 Why not Just Use a Lagged Dependent Variable? Source: David Johnson. Journal of Marriage and Family, Vol. 67, No. 4 (Nov., 2005), pp WIM Panel Data Analysis October 2015 Page 23
25 Random Effects Methods If we can assume that the unobserved heterogeney will not bias the estimates: Fixed effects methods are inefficient. They throw away information. Pooled OLS is inefficient because does not explo the autocorrelation in the compose error term. Random effects methods use feasible GLS estimation (RE FGLS) to explo whin-cluster correlation Random effects estimation is more efficient than FE or OLS The random effects assumption of no bias due to c i is more stringent Ec ( x,..., x ) = Ec ( ) = 0 i i1 it i WIM Panel Data Analysis October 2015 Page 24
26 A Conventional FGLS Random Effects Estimator Assume the errors are correlated whin each un Assume the errors are uncorrelated across uns Assume the variance in the compose errors is equal to the sum of the variances in the unobserved effectc i and the idiosyncratic error u i : v = u + c s s s RE strategy: If v u c s = s + s, find estimators such that v = u + c sˆ sˆ sˆ WIM Panel Data Analysis October 2015 Page 25
27 Practical Feature of Random Effects Estimation Recall that the fixed effects whin estimator essentially transforms the data by centering each variable on the un-specific mean. OLS is then performed on the fully demeaned transformed data. The random effects estimator essentially transforms the data by partially demeaning each variable. Instead of subtracting the entire un-specific mean, only part of the mean is subtracted. The demeaning factor lis between 0 and 1, wh the specific value based on the variance components estimation. WIM Panel Data Analysis October 2015 Page 26
28 RE Results compared to pooled OLS Results for two terms: RE Term 5&6 GPA OLS Term 5&6 GPA Estimate SE z-stat Estimate SE t-stat Intercept jobhrs female highgpa term RE Results for six terms: Terms 1-6 GPA (FE, N=400) Estimate SE Intercept jobhrs female highgpa term WIM Panel Data Analysis October 2015 Page 27
29 Random Effects or Fixed Effects - How to decide? Hausman test for the Exogeney of the Unobserved Error Component If the unobserved effects are exogenous, the FE and RE are asymptotically equivalent. This suggests the null hypothesis for the Hausman test: H bˆ = b ˆ, 0 : RE FE where b ˆRE and b ˆFE are coefficient vectors for the time-varying explanatory variables, excluding the time variables. If the null hypothesis is rejected, we conclude that RE is inconsistent, and the FE model is preferred. If the null hypothesis cannot be rejected, random effects is preferred because is a more efficient estimator. WIM Panel Data Analysis October 2015 Page 28
30 Conventional Hausman Test in Stata:. xtreg gpa job sex highgpa,fe. estimates store fe. xtreg gpa job sex highgpa,re. estimates store re. hausman fe re ---- Coefficients ---- (b) (B) (b-b) sqrt(diag(v_b-v_b)) fe re Difference S.E job b = consistent under Ho and Ha; obtained from xtreg B = inconsistent under Ha, efficient under Ho; obtained from xtreg Test: Ho: difference in coefficients not systematic chi2(1) = (b-b)'[(v_b-v_b)^(-1)](b-b) = Prob>chi2 = We reject the null and conclude the fixed effects estimator is appropriate. WIM Panel Data Analysis October 2015 Page 29
31 An Alternative Hausman Test and FE/RE Hybrid Methods The Correlated Random Effects Model (Allison 2005,2009; Wooldridge 2005) Suppose we have both time-varying and time-invariant covariates: ' ' = x + zi + i + y c d c u, t=1,2,,t Add a vector of whin-un means for the time-varying covariates: ' ' ' = xc + zid + xiy + i + y c u Estimate using the random effects estimator and the result is a hybrid Alternative Hausman: If a Wald test for the joint statistical significance of the coefficient estimates in x rejects the null, FE is preferred The coefficient vector b yields the fixed effects estimates The coefficient vector x produces the between estimates The coefficient vector dproduces estimates for the time-invariante covariates. Interpret wh caution: these might still be correlated wh the unobserved error. WIM Panel Data Analysis October 2015 Page 30
32 Interpretation of Results from the Error Components Model Since the UEM model is derived as a levels model, coefficients can be interpreted much the same as interpretations of a conventional OLS model, but there are nuances: For example, suppose we estimate the relationship between marriage and men s wages, b ˆ MARRIED 0.05 in every model. Pooled OLS cross-section coefficients contain information about average differences between uns. Ey [ x ] = x c + c i This is a population-averaged effect. On average, married men earn 5% more than men who are not married. This says nothing about the causal effect of marriage on men s earnings. WIM Panel Data Analysis October 2015 Page 31
33 RE/FE/FD estimate average effects whin uns. If the unobserved effects are exogenous these are asymptotically equivalent to the population averaged effect. Ey [ x, c ] = x c i On average, entering marriage increases men s earnings by 5%. RE coefficients represent average change whin uns, estimated from all uns whether they experience change or not. FE coefficients represent average changes whin uns, only for uns that did experience change This is akin to a treatment effect among the treated. On average, men who married increased their earnings by 5%. WIM Panel Data Analysis October 2015 Page 32
34 Best Practices Theorize the model What exactly does this unobserved heterogeney represent? Why would you expect to be correlated / uncorrelated wh the regressors? Is likely there is endogeney due to time-varying unobserved heterogeney or feedback from the idiosyncratic error to the next wave of covariates? Specification Testing for Panel Analysis - Interval/Continuous Outcomes Always neglected but formal test for unobserved effect can be useful. Optional: Obtain intraclass correlation coefficient (ICC) as indicator of the extent of whin-un clustering. This is a descriptive statistic, not a test. Specification test(s) for strict exogeney Hausman-type specification test for RE vs. FE Test for serial correlation in the idiosyncratic errors WIM Panel Data Analysis October 2015 Page 33
35 Extensions FE Models wh Time-Invariant Predictors Interactions between time and covariate Panel Models for Categorical Outcomes Fixed effects log and random effects log for binary outcomes Fixed and random effects Poisson models can be used for count outcomes. Population averaged models can be estimated using General Estimation Equations (GEE). Dynamic panel models i.e. lagged dependent variable as a covariate: GPA = b, GPA b, TERM b, HSGPA b, JOB b, v 0 i, t-1 GPA T H J GLM models for instrumental variables (IV) estimation Generalized Method of Moments (GMM) is used for some dynamic panel models because allows a flexible specification of the instruments WIM Panel Data Analysis October 2015 Page 34
Applied Econometrics. Lecture 3: Introduction to Linear Panel Data Models
Applied Econometrics Lecture 3: Introduction to Linear Panel Data Models Måns Söderbom 4 September 2009 Department of Economics, Universy of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,
More informationApplied Microeconometrics (L5): Panel Data-Basics
Applied Microeconometrics (L5): Panel Data-Basics Nicholas Giannakopoulos University of Patras Department of Economics ngias@upatras.gr November 10, 2015 Nicholas Giannakopoulos (UPatras) MSc Applied Economics
More informationProblem Set 10: Panel Data
Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005
More informationSpecification testing in panel data models estimated by fixed effects with instrumental variables
Specification testing in panel data models estimated by fixed effects wh instrumental variables Carrie Falls Department of Economics Michigan State Universy Abstract I show that a handful of the regressions
More informationLecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)
Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) 1 2 Panel Data Panel data is obtained by observing the same person, firm, county, etc over several periods. Unlike the pooled cross sections,
More informationDeriving Some Estimators of Panel Data Regression Models with Individual Effects
Deriving Some Estimators of Panel Data Regression Models wh Individual Effects Megersa Tadesse Jirata 1, J. Cheruyot Chelule 2, R. O. Odhiambo 3 1 Pan African Universy Instute of Basic Sciences, Technology
More informationPanel Data. March 2, () Applied Economoetrics: Topic 6 March 2, / 43
Panel Data March 2, 212 () Applied Economoetrics: Topic March 2, 212 1 / 43 Overview Many economic applications involve panel data. Panel data has both cross-sectional and time series aspects. Regression
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationLongitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois
Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationTopic 10: Panel Data Analysis
Topic 10: Panel Data Analysis Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Introduction Panel data combine the features of cross section data time series. Usually a panel
More informationFixed and Random Effects Models: Vartanian, SW 683
: Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:
More informationECONOMETRICS HONOR S EXAM REVIEW SESSION
ECONOMETRICS HONOR S EXAM REVIEW SESSION Eunice Han ehan@fas.harvard.edu March 26 th, 2013 Harvard University Information 2 Exam: April 3 rd 3-6pm @ Emerson 105 Bring a calculator and extra pens. Notes
More informationEC327: Advanced Econometrics, Spring 2007
EC327: Advanced Econometrics, Spring 2007 Wooldridge, Introductory Econometrics (3rd ed, 2006) Chapter 14: Advanced panel data methods Fixed effects estimators We discussed the first difference (FD) model
More informationLongitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 2017, Boston, Massachusetts
Longitudinal Data Analysis Using SAS Paul D. Allison, Ph.D. Upcoming Seminar: October 13-14, 217, Boston, Massachusetts Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More informationEstimation of Panel Data Models with Binary Indicators when Treatment Effects are not Constant over Time. Audrey Laporte a,*, Frank Windmeijer b
Estimation of Panel ata Models wh Binary Indicators when Treatment Effects are not Constant over Time Audrey Laporte a,*, Frank Windmeijer b a epartment of Health Policy, Management and Evaluation, Universy
More informationINTRODUCTION TO BASIC LINEAR REGRESSION MODEL
INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 10: Panel Data Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 10 VŠE, SS 2016/17 1 / 38 Outline 1 Introduction 2 Pooled OLS 3 First differences 4 Fixed effects
More informationRepeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data
Panel data Repeated observations on the same cross-section of individual units. Important advantages relative to pure cross-section data - possible to control for some unobserved heterogeneity - possible
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)
More informationEconometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 6 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 21 Recommended Reading For the today Advanced Panel Data Methods. Chapter 14 (pp.
More informationApplied Economics. Panel Data. Department of Economics Universidad Carlos III de Madrid
Applied Economics Panel Data Department of Economics Universidad Carlos III de Madrid See also Wooldridge (chapter 13), and Stock and Watson (chapter 10) 1 / 38 Panel Data vs Repeated Cross-sections In
More informationEconometrics Honor s Exam Review Session. Spring 2012 Eunice Han
Econometrics Honor s Exam Review Session Spring 2012 Eunice Han Topics 1. OLS The Assumptions Omitted Variable Bias Conditional Mean Independence Hypothesis Testing and Confidence Intervals Homoskedasticity
More informationmultilevel modeling: concepts, applications and interpretations
multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models
More informationPlease discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!
Econometrics - Exam May 11, 2011 1 Exam Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Problem 1: (15 points) A researcher has data for the year 2000 from
More informationIntroduction to Panel Data Analysis
Introduction to Panel Data Analysis Youngki Shin Department of Economics Email: yshin29@uwo.ca Statistics and Data Series at Western November 21, 2012 1 / 40 Motivation More observations mean more information.
More informationEconometrics. 8) Instrumental variables
30C00200 Econometrics 8) Instrumental variables Timo Kuosmanen Professor, Ph.D. http://nomepre.net/index.php/timokuosmanen Today s topics Thery of IV regression Overidentification Two-stage least squates
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Per Pettersson-Lidbom Number of creds: 7,5 creds Date of exam: Thursday, January 15, 009 Examination
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationEcon 1123: Section 5. Review. Internal Validity. Panel Data. Clustered SE. STATA help for Problem Set 5. Econ 1123: Section 5.
Outline 1 Elena Llaudet 2 3 4 October 6, 2010 5 based on Common Mistakes on P. Set 4 lnftmpop = -.72-2.84 higdppc -.25 lackpf +.65 higdppc * lackpf 2 lnftmpop = β 0 + β 1 higdppc + β 2 lackpf + β 3 lackpf
More informationWe begin by thinking about population relationships.
Conditional Expectation Function (CEF) We begin by thinking about population relationships. CEF Decomposition Theorem: Given some outcome Y i and some covariates X i there is always a decomposition where
More informationChapter 15 Panel Data Models. Pooling Time-Series and Cross-Section Data
Chapter 5 Panel Data Models Pooling Time-Series and Cross-Section Data Sets of Regression Equations The topic can be introduced wh an example. A data set has 0 years of time series data (from 935 to 954)
More informationDealing With Endogeneity
Dealing With Endogeneity Junhui Qian December 22, 2014 Outline Introduction Instrumental Variable Instrumental Variable Estimation Two-Stage Least Square Estimation Panel Data Endogeneity in Econometrics
More informationLecture 4: Linear panel models
Lecture 4: Linear panel models Luc Behaghel PSE February 2009 Luc Behaghel (PSE) Lecture 4 February 2009 1 / 47 Introduction Panel = repeated observations of the same individuals (e.g., rms, workers, countries)
More informationPanel data methods for policy analysis
IAPRI Quantitative Analysis Capacity Building Series Panel data methods for policy analysis Part I: Linear panel data models Outline 1. Independently pooled cross sectional data vs. panel/longitudinal
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 6 Jakub Mućk Econometrics of Panel Data Meeting # 6 1 / 36 Outline 1 The First-Difference (FD) estimator 2 Dynamic panel data models 3 The Anderson and Hsiao
More informationEC402 - Problem Set 3
EC402 - Problem Set 3 Konrad Burchardi 11th of February 2009 Introduction Today we will - briefly talk about the Conditional Expectation Function and - lengthily talk about Fixed Effects: How do we calculate
More informationProblem Set # 1. Master in Business and Quantitative Methods
Problem Set # 1 Master in Business and Quantitative Methods Contents 0.1 Problems on endogeneity of the regressors........... 2 0.2 Lab exercises on endogeneity of the regressors......... 4 1 0.1 Problems
More informationMaking sense of Econometrics: Basics
Making sense of Econometrics: Basics Lecture 4: Qualitative influences and Heteroskedasticity Egypt Scholars Economic Society November 1, 2014 Assignment & feedback enter classroom at http://b.socrative.com/login/student/
More informationShort T Panels - Review
Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of
More informationIntermediate Econometrics
Intermediate Econometrics Markus Haas LMU München Summer term 2011 15. Mai 2011 The Simple Linear Regression Model Considering variables x and y in a specific population (e.g., years of education and wage
More informationReview of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.
Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More information1 Motivation for Instrumental Variable (IV) Regression
ECON 370: IV & 2SLS 1 Instrumental Variables Estimation and Two Stage Least Squares Econometric Methods, ECON 370 Let s get back to the thiking in terms of cross sectional (or pooled cross sectional) data
More informationWISE International Masters
WISE International Masters ECONOMETRICS Instructor: Brett Graham INSTRUCTIONS TO STUDENTS 1 The time allowed for this examination paper is 2 hours. 2 This examination paper contains 32 questions. You are
More informationIntroduction to Econometrics
Introduction to Econometrics T H I R D E D I T I O N Global Edition James H. Stock Harvard University Mark W. Watson Princeton University Boston Columbus Indianapolis New York San Francisco Upper Saddle
More informationEconomics 308: Econometrics Professor Moody
Economics 308: Econometrics Professor Moody References on reserve: Text Moody, Basic Econometrics with Stata (BES) Pindyck and Rubinfeld, Econometric Models and Economic Forecasts (PR) Wooldridge, Jeffrey
More informationEconometrics Homework 4 Solutions
Econometrics Homework 4 Solutions Question 1 (a) General sources of problem: measurement error in regressors, omitted variables that are correlated to the regressors, and simultaneous equation (reverse
More informationCHAPTER 6: SPECIFICATION VARIABLES
Recall, we had the following six assumptions required for the Gauss-Markov Theorem: 1. The regression model is linear, correctly specified, and has an additive error term. 2. The error term has a zero
More informationCRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.
CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear
More informationApplied Econometrics (MSc.) Lecture 3 Instrumental Variables
Applied Econometrics (MSc.) Lecture 3 Instrumental Variables Estimation - Theory Department of Economics University of Gothenburg December 4, 2014 1/28 Why IV estimation? So far, in OLS, we assumed independence.
More informationSimultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser
Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor
More informationECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes
ECON 4551 Econometrics II Memorial University of Newfoundland Panel Data Models Adapted from Vera Tabakova s notes 15.1 Grunfeld s Investment Data 15.2 Sets of Regression Equations 15.3 Seemingly Unrelated
More informationBasic econometrics. Tutorial 3. Dipl.Kfm. Johannes Metzler
Basic econometrics Tutorial 3 Dipl.Kfm. Introduction Some of you were asking about material to revise/prepare econometrics fundamentals. First of all, be aware that I will not be too technical, only as
More information4 Instrumental Variables Single endogenous variable One continuous instrument. 2
Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................
More informationLinear Panel Data Models
Linear Panel Data Models Michael R. Roberts Department of Finance The Wharton School University of Pennsylvania October 5, 2009 Michael R. Roberts Linear Panel Data Models 1/56 Example First Difference
More informationWooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares
Wooldridge, Introductory Econometrics, 4th ed. Chapter 15: Instrumental variables and two stage least squares Many economic models involve endogeneity: that is, a theoretical relationship does not fit
More informationCasuality and Programme Evaluation
Casuality and Programme Evaluation Lecture V: Difference-in-Differences II Dr Martin Karlsson University of Duisburg-Essen Summer Semester 2017 M Karlsson (University of Duisburg-Essen) Casuality and Programme
More informationGLS and FGLS. Econ 671. Purdue University. Justin L. Tobias (Purdue) GLS and FGLS 1 / 22
GLS and FGLS Econ 671 Purdue University Justin L. Tobias (Purdue) GLS and FGLS 1 / 22 In this lecture we continue to discuss properties associated with the GLS estimator. In addition we discuss the practical
More informationFixed Effects Models for Panel Data. December 1, 2014
Fixed Effects Models for Panel Data December 1, 2014 Notation Use the same setup as before, with the linear model Y it = X it β + c i + ɛ it (1) where X it is a 1 K + 1 vector of independent variables.
More informationInternal vs. external validity. External validity. This section is based on Stock and Watson s Chapter 9.
Section 7 Model Assessment This section is based on Stock and Watson s Chapter 9. Internal vs. external validity Internal validity refers to whether the analysis is valid for the population and sample
More informationECON 482 / WH Hong Binary or Dummy Variables 1. Qualitative Information
1. Qualitative Information Qualitative Information Up to now, we assume that all the variables has quantitative meaning. But often in empirical work, we must incorporate qualitative factor into regression
More information4. Nonlinear regression functions
4. Nonlinear regression functions Up to now: Population regression function was assumed to be linear The slope(s) of the population regression function is (are) constant The effect on Y of a unit-change
More informationSTOCKHOLM UNIVERSITY Department of Economics Course name: Empirical Methods Course code: EC40 Examiner: Lena Nekby Number of credits: 7,5 credits Date of exam: Saturday, May 9, 008 Examination time: 3
More informationEconometrics - 30C00200
Econometrics - 30C00200 Lecture 11: Heteroskedasticity Antti Saastamoinen VATT Institute for Economic Research Fall 2015 30C00200 Lecture 11: Heteroskedasticity 12.10.2015 Aalto University School of Business
More informationFinal Exam - Solutions
Ecn 102 - Analysis of Economic Data University of California - Davis March 19, 2010 Instructor: John Parman Final Exam - Solutions You have until 5:30pm to complete this exam. Please remember to put your
More informationCh 7: Dummy (binary, indicator) variables
Ch 7: Dummy (binary, indicator) variables :Examples Dummy variable are used to indicate the presence or absence of a characteristic. For example, define female i 1 if obs i is female 0 otherwise or male
More informationPhD/MA Econometrics Examination January 2012 PART A
PhD/MA Econometrics Examination January 2012 PART A ANSWER ANY TWO QUESTIONS IN THIS SECTION NOTE: (1) The indicator function has the properties: (2) Question 1 Let, [defined as if using the indicator
More informationPanel Data Models. Chapter 5. Financial Econometrics. Michael Hauser WS17/18 1 / 63
1 / 63 Panel Data Models Chapter 5 Financial Econometrics Michael Hauser WS17/18 2 / 63 Content Data structures: Times series, cross sectional, panel data, pooled data Static linear panel data models:
More informationChapter 6. Panel Data. Joan Llull. Quantitative Statistical Methods II Barcelona GSE
Chapter 6. Panel Data Joan Llull Quantitative Statistical Methods II Barcelona GSE Introduction Chapter 6. Panel Data 2 Panel data The term panel data refers to data sets with repeated observations over
More information4 Instrumental Variables Single endogenous variable One continuous instrument. 2
Econ 495 - Econometric Review 1 Contents 4 Instrumental Variables 2 4.1 Single endogenous variable One continuous instrument. 2 4.2 Single endogenous variable more than one continuous instrument..........................
More informationApplied Econometrics Lecture 1
Lecture 1 1 1 Università di Urbino Università di Urbino PhD Programme in Global Studies Spring 2018 Outline of this module Beyond OLS (very brief sketch) Regression and causality: sources of endogeneity
More informationInstrumental Variables
Università di Pavia 2010 Instrumental Variables Eduardo Rossi Exogeneity Exogeneity Assumption: the explanatory variables which form the columns of X are exogenous. It implies that any randomness in the
More informationIV Estimation WS 2014/15 SS Alexander Spermann. IV Estimation
SS 2010 WS 2014/15 Alexander Spermann Evaluation With Non-Experimental Approaches Selection on Unobservables Natural Experiment (exogenous variation in a variable) DiD Example: Card/Krueger (1994) Minimum
More informationLecture-1: Introduction to Econometrics
Lecture-1: Introduction to Econometrics 1 Definition Econometrics may be defined as 2 the science in which the tools of economic theory, mathematics and statistical inference is applied to the analysis
More information10 Panel Data. Andrius Buteikis,
10 Panel Data Andrius Buteikis, andrius.buteikis@mif.vu.lt http://web.vu.lt/mif/a.buteikis/ Introduction Panel data combines cross-sectional and time series data: the same individuals (persons, firms,
More informationBasic Regressions and Panel Data in Stata
Developing Trade Consultants Policy Research Capacity Building Basic Regressions and Panel Data in Stata Ben Shepherd Principal, Developing Trade Consultants 1 Basic regressions } Stata s regress command
More informationAnswers to Problem Set #4
Answers to Problem Set #4 Problems. Suppose that, from a sample of 63 observations, the least squares estimates and the corresponding estimated variance covariance matrix are given by: bβ bβ 2 bβ 3 = 2
More informationOrdinary Least Squares Regression
Ordinary Least Squares Regression Goals for this unit More on notation and terminology OLS scalar versus matrix derivation Some Preliminaries In this class we will be learning to analyze Cross Section
More informationEconometrics. Week 8. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 8 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 25 Recommended Reading For the today Instrumental Variables Estimation and Two Stage
More informationIntermediate Econometrics
Intermediate Econometrics Heteroskedasticity Text: Wooldridge, 8 July 17, 2011 Heteroskedasticity Assumption of homoskedasticity, Var(u i x i1,..., x ik ) = E(u 2 i x i1,..., x ik ) = σ 2. That is, the
More informationRockefeller College University at Albany
Rockefeller College University at Albany PAD 705 Handout: Simultaneous quations and Two-Stage Least Squares So far, we have studied examples where the causal relationship is quite clear: the value of the
More informationCapital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part)
Séminaire d Analyse Economique III (LECON2486) Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part) Frédéric Docquier & Sara Salomone IRES UClouvain
More informationA Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and
More informationWHAT IS HETEROSKEDASTICITY AND WHY SHOULD WE CARE?
1 WHAT IS HETEROSKEDASTICITY AND WHY SHOULD WE CARE? For concreteness, consider the following linear regression model for a quantitative outcome (y i ) determined by an intercept (β 1 ), a set of predictors
More informationTopic 7: Heteroskedasticity
Topic 7: Heteroskedasticity Advanced Econometrics (I Dong Chen School of Economics, Peking University Introduction If the disturbance variance is not constant across observations, the regression is heteroskedastic
More informationAnswer Key: Problem Set 6
: Problem Set 6 1. Consider a linear model to explain monthly beer consumption: beer = + inc + price + educ + female + u 0 1 3 4 E ( u inc, price, educ, female ) = 0 ( u inc price educ female) σ inc var,,,
More informationApplied Quantitative Methods II
Applied Quantitative Methods II Lecture 4: OLS and Statistics revision Klára Kaĺıšková Klára Kaĺıšková AQM II - Lecture 4 VŠE, SS 2016/17 1 / 68 Outline 1 Econometric analysis Properties of an estimator
More informationFortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7
Fortin Econ 495 - Econometric Review 1 Contents 1 Panel Data Methods 2 1.1 Fixed Effects......................... 2 1.1.1 Dummy Variables Regression............ 7 1.1.2 First Differencing Methods.............
More informationLecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16)
Lecture: Simultaneous Equation Model (Wooldridge s Book Chapter 16) 1 2 Model Consider a system of two regressions y 1 = β 1 y 2 + u 1 (1) y 2 = β 2 y 1 + u 2 (2) This is a simultaneous equation model
More informationWarwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation
Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory
More informationEconomics 241B Estimation with Instruments
Economics 241B Estimation with Instruments Measurement Error Measurement error is de ned as the error resulting from the measurement of a variable. At some level, every variable is measured with error.
More information1. You have data on years of work experience, EXPER, its square, EXPER2, years of education, EDUC, and the log of hourly wages, LWAGE
1. You have data on years of work experience, EXPER, its square, EXPER, years of education, EDUC, and the log of hourly wages, LWAGE You estimate the following regressions: (1) LWAGE =.00 + 0.05*EDUC +
More informationThe Simple Linear Regression Model
The Simple Linear Regression Model Lesson 3 Ryan Safner 1 1 Department of Economics Hood College ECON 480 - Econometrics Fall 2017 Ryan Safner (Hood College) ECON 480 - Lesson 3 Fall 2017 1 / 77 Bivariate
More informationProblem Set 5 ANSWERS
Economics 20 Problem Set 5 ANSWERS Prof. Patricia M. Anderson 1, 2 and 3 Suppose that Vermont has passed a law requiring employers to provide 6 months of paid maternity leave. You are concerned that women
More informationContest Quiz 3. Question Sheet. In this quiz we will review concepts of linear regression covered in lecture 2.
Updated: November 17, 2011 Lecturer: Thilo Klein Contact: tk375@cam.ac.uk Contest Quiz 3 Question Sheet In this quiz we will review concepts of linear regression covered in lecture 2. NOTE: Please round
More informationLecture 5: Omitted Variables, Dummy Variables and Multicollinearity
Lecture 5: Omitted Variables, Dummy Variables and Multicollinearity R.G. Pierse 1 Omitted Variables Suppose that the true model is Y i β 1 + β X i + β 3 X 3i + u i, i 1,, n (1.1) where β 3 0 but that the
More informationJeffrey M. Wooldridge Michigan State University
Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional
More informationα version (only brief introduction so far)
Econometrics I KS Module 8: Panel Data Econometrics Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: June 18, 2018 α version (only brief introduction so far) Alexander
More informationUsing Instrumental Variables to Find Causal Effects in Public Health
1 Using Instrumental Variables to Find Causal Effects in Public Health Antonio Trujillo, PhD John Hopkins Bloomberg School of Public Health Department of International Health Health Systems Program October
More informationECON3150/4150 Spring 2015
ECON3150/4150 Spring 2015 Lecture 3&4 - The linear regression model Siv-Elisabeth Skjelbred University of Oslo January 29, 2015 1 / 67 Chapter 4 in S&W Section 17.1 in S&W (extended OLS assumptions) 2
More information