Jeffrey M. Wooldridge Michigan State University
|
|
- Prosper Lucas
- 5 years ago
- Views:
Transcription
1 Fractional Response Models with Endogenous Explanatory Variables and Heterogeneity Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Fractional Probit with Heteroskedasticity 3. Fractional Probit with an Endogenous Explanatory Variable 4. Linear Unobserved Effects Models with Unbalanced Panels 5. Nonlinear UE Models with Unbalanced Panels 1
2 1. Introduction A fractional response y satisfies 0 y 1, possibly with P y 0 1orP y 1 0 (or both). Assume y is the variable we would like to explain in terms of covariates, x x 1,...,x K. (No data censoring, but y may be a corner solution.) Focus here is on mean response. If x is exogenous, goal is to estimate E y x. 2
3 Can always use a linear model for E y x, but it is at best an approximation. Papke and Wooldridge (1996, Journal of Applied Econometrics): Model E y x using models of the form G x for 0 G nonindex forms). 1(or 3
4 So-called fractional response models (fractional probit, fractional logit) easily estimated using glm, and robust inference is trivial (and very important: MLE standard errors are too large). For panel data, can use xtgee. Papke and Wooldridge (2008, Journal of Econometrics) show how to use correlated random effects approaches to estimate fractional response models for panel data. But for balanced panels. Wooldridge (2005, Rothenberg Festschrift; 2010, MIT Press) considers models with continuous endogenous explanatory variables (EEVs). Proposes two-step control function approach. 4
5 Papke and Wooldridge (2008): heterogeneity and continuous EEV. Combination of CRE and control function methods for fractional probit. But balanced panel, and only two-step estimators. What if we want a one-step quasi-mle (which simplifies inference and may have better finite-sample properties)? So y 1 is a fractional response and y 2 a continuous EEV. Wooldridge (2011, unpublished) shows that the ivprobit log-likelihood identifies the (scaled) parameters under correct specification of E y 1 y 2, z 1, a 1 where a 1 is the omitted variable. (and y 2 follows classical linear model). 5
6 What if y 1 is a fractional response and y 2 abinaryeev?two-step forbidden regression is not valid. Wooldridge (2011) shows the biprobit log likelihood identifies the (scaled) parameters if E y 1 y 2, z 1, a 1 is correctly specified (and y 2 follows a probit). Neither ivprobit nor biprobit allow y 1 to be a fractional response. Neither does cmp (Roodman, 2009). Bottom line: Many existing Stata commands could be used to estimate flexible fractional response models allowing for endogeneity and unbalanced panel by removing the data checks on the response variable. 6
7 2. Fractional Probit with Heteroskedasticity Let x x 1, x 2,...,x K. Fractional probit model is E y x 0 x 0 1 x 1... K x K Might want more flexibility. If P y 0 0, could use a two-part model. But can directly make model for E y x more flexible, for example, E y x 0 x exp z /2 where z 1 M is a function of x 1, x 2,...,x K that does not include a constant. 7
8 The j and h are consistently estimated using the Bernoulli quasi-mle if E y x is correctly specified. As usual, need to use robust inference because y is not binary. (The conditional mean may be misspecified, anyway.) Ideally, just type hetprobit y x1... x2, het(z1 z2... zm), robust But y is turned into a binary response. Can easily test H 0 : 0 with robust Wald statistic. 8
9 clear capture program drop frac_het program frac_het version 11 args llf xb zg quietly replace llf $ML_y1*log(normal( xb *exp(- zg ))) /// (1 - $ML_y1)*log(1 - normal( xb *exp(- zg ))) end ml model lf frac_het (prate mrate ltotemp age sole) /// (mrate ltotemp age sole, nocons), vce(robust) ml max 9
10 Number of obs 4075 Wald chi2(4) 152. Log pseudolikelihood Prob chi Robust prate Coef. Std. Err. z P z [95% Conf. Interval eq1 mrate ltotemp age sole _cons eq2 mrate ltotemp age sole
11 . test [eq2] ( 1) [eq2]mrate 0 ( 2) [eq2]ltotemp 0 ( 3) [eq2]age 0 ( 4) [eq2]sole 0 chi2( 4) Prob chi * Usual fractional probit (could use glm): capture program drop frac_probit program frac_probit version 11 args llf xb quietly replace llf $ML_y1*log(normal( xb )) /// (1 - $ML_y1)*log(1 - normal( xb )) end ml model lf frac_probit (prate mrate ltotemp age sole), vce(robust) ml max 11
12 Number of obs 4075 Wald chi2(4) 695. Log pseudolikelihood Prob chi Robust prate Coef. Std. Err. z P z [95% Conf. Interval mrate ltotemp age sole _cons
13 Should do a comparison of average partial effects between ordinary fractional probit and heteroskedastic fractional probit. The hetprobit quasi-mle is needed for nonlinear CRE panel models with unbalanced panels. 13
14 3. Fractional Probit with an Endogenous Explanatory Variable Adapted from Wooldridge (2011, unpublished). Set up endogeneity as an omitted variable problem, and start by assuming y 2 is continuous: E y 1 z, y 2, a 1 x 1 1 a 1. y 2 z 2 v 2, where x 1 is a general nonlinear function of z 1, y 2, a 1 is an omitted factor thought to be correlated with y 2 but independent of the exogenous variables z. 14
15 The average partial effects in this model are obtained from the average structural function (ASF): ASF x 1 E a1 x 1 1 a 1 x 1 a1 where a1 1 / 1 2 a1 1/2. Happily, these are precisely the parameters that are identified. If a 1, v 2 is jointly normal, a two-step control function method is valid (Wooldridge, 2005). Note that the distribution of y 1 is not further restricted. 15
16 (i) Regress y i2 on z i and obtain the residuals, v i2. (ii) Use probit of y i1 on x i1, v i2 to estimate parameters with different scales, say e1 and e1. (Can implement as a generalized linear model. ) The average structural function (ASF) is consistently estimated as N ASF y 2, z 1 N 1 i 1 x 1 e1 e1 v i2, and this can be used to obtain APEs with respect to y 2 or z 1 (Wooldridge, 2005). 16
17 What about a quasi-liml approach? Can show that E y 1 y 2, z x 1 r1 1 / 2 y 2 z /2 and so we can plug this mean function into the Bernoulli quasi-log likelihood. This gives q 1 y 1, y 2, z, 1, 2. Identify 2 and 2 using the Gaussian QLL, which gives q 2 y 2, z, 2. The same objective function we get for MLE with y 1 binary can be used when y 1 is fractional continuous or otherwise. In other words, ivprobit could be easily modified and use robust inference. 17
18 A similar argument holds when y 2 is binary and follows a probit model: y 2 1 z 2 v 2 0 v 2 z ~ Normal 0, 1 Can show that E y 1 y 2, z has the same form as the response probability in the so-called bivariate probit model. 18
19 For example, E y 1 y 2 1, z z 2 x 1 r1 1 v /2 dv 2 So for q 2 y 2, z, 2 we use the usual probit log-likelihood. For q 1 y 1, y 2, z, 1, 2 we use the Bernoulli QLL associated with bivariate probit. So if y 1 were allowed to be fractional, biprobit with a robust option could be used. 19
20 4. Linear Unobserved Effects Models with Unbalanced Panels Model for a random draw i has T potential time periods: E u it x i1,...,x it, c i 0. y it x it c i u it, t 1,...,T Given access to a balanced random sample, the zero conditional mean assumption is sufficient for FE to be consistent (as N, T fixed) and N -asymptotically normal, provided all elements of x it have some time variation. 20
21 Let s it : t 1,...,T be a sequence of selection indicators : s it 1 if and only if observation i, t is used. These are generally outcomes of random variables. T The number of time periods available for unit i is T i r 1 s ir ; this is properly viewed as random. 21
22 Fixed Effects on the Unbalanced Panel The time-demeaned data uses a different number of time periods for different i. Let T 1 ÿ it y it T i r 1 T 1 ẍ it x it T i r 1 s ir y ir s ir x ir 22
23 The FE estimator is then FE N N 1 i 1 T t 1 1 s it ẍ it ẍ it N N 1 i 1 T t 1 s it ẍ it ÿ it, A sufficient condition for consistency of FE on the unbalanced panel is an extension of the usual strict exogeneity assumption: E u it x i, s i, c i 0, t 1,...,T s i s i1,...,s it 23
24 Both the covariates and selection are strictly exogenous conditional on c i. Rules out selection in any time period depending on the shocks in any time period. That is, the condition is generally violated if Cov s ir, u it 0 for any r, t pair. Importantly, it allows s it to depend on c i in an unrestricted way. xtreg allows unbalanced panels and properly computes standard errors and test statistics. 24
25 Random Effects on the Unbalanced Panel The quasi-time-demeaning value for unit i is i T i c2 / u2 1/2. Now define y it y it iȳ i 1 T where ȳ i T i r 1 s ir y ir, and similarly for x it. Then, RE is POLS of y it on x it using the s it 1 data points. 25
26 Useful equivalence result (Wooldridge, 2010, unpublished). Define T 1 x i T i r 1 s ir x ir and consider either POLS or RE estimation of the following equation on the unbalanced panel: y it x it x i v it Then POLS RE FE. Generally, POLS RE 26
27 Must be careful in constructing x i; only use periods where all variables are observed (s it 1). Must now include the time averages of year dummies because these are no longer constants in an unbalanced panel. Same result holds when add any other time-constant covariates. Implies that the CRE is robust even with unbalanced panels. Basis for robust Hausman test. H 0 : 0. Use RE with all time-constant controls included. 27
28 Heterogeneous Slopes Suppose the population model is E y it x i, a i, b i a i x it b i, so, in the population, x it : t 1,...,T is strictly exogenous conditional on a i, b i. Define a i c i, b i d i and write y it x it c i x it d i u it where E u it x i, a i, b i E u it x i, c i, d i 0 for all t. 28
29 Assume that selection may be related to x i, a i, b i but not the idiosyncratic shocks: E u it x i, a i, b i, s i 0, t 1,...,T. Multiply population equation by the selection indicator: s i y it s it s it x it s it c i s it x it d i s it u it Find an estimating equation by conditioning on s it, s it x it : t 1,...,T. 29
30 Let h i h it : t 1,...,T s it, s it x it : t 1,...,T and consider E s i y it h i s it s it x it s it E c i h i s it x it E d i h i and then make assumptions concerning E c i h i and E d i h i. We might choose w i T i, x i as the exchangeable functions satisfying E c i h i E c i w i, E d i h i E d i w i. 30
31 A flexible specification with g ir 1 T i r : T E c i T i, x i r 1 T r g ir r r 1 T T E d i T i, x i g ir r r r 1 r 1 g ir x i r r g ir x i r I K r, where the r are the expected values of x i given r time periods observed and r is the fraction of observations with r time periods: r E x i T i r, r E 1 T i r 31
32 This formulation is identical to running separate regressions for each T i : y it on 1, x it, x i, x i r x it,fors it 1 where r N 1 r N i 1 1 T i r x i and N r is the number of observations with T i r. The coefficient on x it, r, is the APE given T i r. Average these across r to obtain the overall APE. Cannot identify the APE for T i 1. 32
33 A simple test of the null that the r do not change. Augmented equation is y it x it 1 T i 2 x it T i T 1 x it T 1 c i u it where the base group is T i T. Use FE on the unbalanced panel and obtain a fully robust test of H 0 : 2 0,..., T 1 0 This is like a Chow test where the slopes are allowed to differ by the number of available time periods for each unit. 33
34 . use meap94_98. xtset schid year. egen tobs sum(1), by(schid). tab tobs tobs Freq. Percent Cum , , , Total 7, gen tobs4 tobs 4. gen tobs3 tobs 3. gen tobs3_lavgrexp tobs3*lavgrexp. gen tobs4_lavgrexp tobs4*lavgrexp 34
35 . xtreg math4 lavgrexp lunch lenrol y95 y96 y97 y98, fe cluster(distid)... (Std. Err. adjusted for 467 clusters in distid Robust math4 Coef. Std. Err. t P t [95% Conf. Interval lavgrexp lunch lenrol y y y y _cons sigma_u sigma_e rho (fraction of variance due to u_i)
36 . xtreg math4 lavgrexp tobs3_lavgrexp tobs4_lavgrexp lunch lenrol y95 y96 y97 y98, fe cluster(distid)... (Std. Err. adjusted for 467 clusters in distid Robust math4 Coef. Std. Err. t P t [95% Conf. Interval lavgrexp tobs3_lavg~p tobs4_lavg~p lunch lenrol y y y y _cons sigma_u sigma_e rho (fraction of variance due to u_i)
37 . test tobs3_lavgrexp tobs4_lavgrexp ( 1) tobs3_lavgrexp 0 ( 2) tobs4_lavgrexp 0 F( 2, 466) 2.37 Prob F * Might get away with using the pooled equations. 37
38 5. Nonlinear UE Models with Unbalanced Panels Adapted from Wooldridge (2010, unpublished). Interested in E y it x it, c i, where 0 y it 1 and c i is unobserved heterogeneity. (Binary response as special case.) Again, unbalanced panel. Assume strictly exogenous covariates conditional on c i and ignorable selection: E y it x i, c i, s i E y it x it, c i, t 1,...,T. 38
39 Do not model serial correlation. Make inference robust. Specify models for D c i s it, s it x it : t 1,...,T. Let w i be a vector of known functions of s it, s it x it : t 1,...,T that act as sufficient statistics, so that D c i s it, s it x it : t 1,...,T D c i w i 39
40 For simplicity, take E y it x i, c i E y it x it, c i x it c i, t 1,...,T where x it can include time dummies or other aggregate time variables. Assume that selection is conditionally ignorable for all t, that is, E y it x i, c i, s i E y it x i, c i. 40
41 All that is left is to specify a model for D c i w i for suitably chosen functions w i of s it, s it x it : t 1,...,T. Simplest is the time average on the selected periods, x i, and the number of time periods, T i. A specification linear in x i but with intercept and slopes different for each T i is T E c i w i r 1 T r 1 T i r r 1 1 T i r x i r 41
42 At a minimum, should let the variance of c i change with T i : Var c i w i exp T 1 r 1 1 T i r r If we also maintain that D c i w i is normal, then we obtain the following: E y it x it, w i, s it 1 x T it r 1 T r g ir r 1 g ir x i r 1/2 g ir r T exp r 2 where g ir 1 T i r. No difficulty in adding g ir x i for r 1,...,T to the variance function. 42
43 Can use heteroskedastic probit software provided the response variable can be fractional. The explanatory variables at time t are 1, x it, g i1,...,g it, g i1 x i,...,g it x i and the explanatory variables in the variance are simply the dummy variables g i2,...,g it, or also add g i1 x i,...,g it x i. Might want to impose restrictions, such as constant slopes on x i. 43
44 The average partial effects are easy to obtain from the estimated average structural function : N ASF x t N 1 i 1 x T t r 1 T rg ir r 1 T exp r 2 g ir r g ir x i r 1/2, where the coefficients with ^ are from the pooled heteroskedastic fractional probit estimation. The functions of T i, x i are averaged out, leaving the result a function of x t. Take derivatives or changes with respect to x tj. 44
45 . use meap94_98 xtset schid year panel variable: time variable: delta: schid (unbalanced) year, 1994 to 1998, but with gaps 1 unit. tab tobs number of time periods Freq. Percent Cum , , , Total 7, gen tobs3 tobs 3. gen tobs4 tobs 4. replace math4 math4/100 (7150 real changes made) 45
46 . capture program drop frac_het.. program frac_het 1. version args llf xb zg 3. quietly replace llf $ML_y1*log(normal( xb *exp(- zg ))) /// (1 - $ML_y1)*log(1 - normal( xb *exp(- zg ))) 4. end.. end of do-file. ml model lf frac_het (math4 lavgrexp lunch lenrol y95 y96 y97 y98 lavgrexpb lunchb lenrolb y95b y96b y97b y98b tobs3 tobs4) (tobs3 tobs4, nocons), vce(cluster schid). ml max 46
47 Log pseudolikelihood Prob chi (Std. Err. adjusted for 1683 clusters in schid Robust math4 Coef. Std. Err. z P z [95% Conf. Interval eq1 lavgrexp lunch lenrol y y y y lavgrexpb lunchb lenrolb y95b y96b y97b y98b tobs tobs _cons eq2 tobs tobs
48 . ml model lf frac_probit (math4 lavgrexp lunch lenrol y95 y96 y97 y98 lavgrexpb lunchb lenrolb y95b y96b y97b y98b tobs3 tobs4), vce(cluster schid. ml max Log pseudolikelihood Prob chi (Std. Err. adjusted for 1683 clusters in schid Robust math4 Coef. Std. Err. z P z [95% Conf. Interval lavgrexp lunch lenrol y y y y lavgrexpb lunchb lenrolb y95b y96b y97b y98b tobs tobs _cons
CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M.
CRE METHODS FOR UNBALANCED PANELS Correlated Random Effects Panel Data Models IZA Summer School in Labor Economics May 13-19, 2013 Jeffrey M. Wooldridge Michigan State University 1. Introduction 2. Linear
More informationfhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity
fhetprob: A fast QMLE Stata routine for fractional probit models with multiplicative heteroskedasticity Richard Bluhm May 26, 2013 Introduction Stata can easily estimate a binary response probit models
More informationThoughts on Heterogeneity in Econometric Models
Thoughts on Heterogeneity in Econometric Models Presidential Address Midwest Economics Association March 19, 2011 Jeffrey M. Wooldridge Michigan State University 1 1. Introduction Much of current econometric
More informationA Course in Applied Econometrics Lecture 14: Control Functions and Related Methods. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 14: Control Functions and Related Methods Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. Linear-in-Parameters Models: IV versus Control Functions 2. Correlated
More informationNew Developments in Econometrics Lecture 16: Quantile Estimation
New Developments in Econometrics Lecture 16: Quantile Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. Review of Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile
More informationControl Function and Related Methods: Nonlinear Models
Control Function and Related Methods: Nonlinear Models Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. General Approach 2. Nonlinear
More informationCORRELATED RANDOM EFFECTS MODELS WITH UNBALANCED PANELS
CORRELATED RANDOM EFFECTS MODELS WITH UNBALANCED PANELS Jeffrey M. Wooldridge Department of Economics Michigan State University East Lansing, MI 48824-1038 wooldri1@msu.edu July 2009 I presented an earlier
More informationWhat s New in Econometrics? Lecture 14 Quantile Methods
What s New in Econometrics? Lecture 14 Quantile Methods Jeff Wooldridge NBER Summer Institute, 2007 1. Reminders About Means, Medians, and Quantiles 2. Some Useful Asymptotic Results 3. Quantile Regression
More informationEconometric Analysis of Cross Section and Panel Data
Econometric Analysis of Cross Section and Panel Data Jeffrey M. Wooldridge / The MIT Press Cambridge, Massachusetts London, England Contents Preface Acknowledgments xvii xxiii I INTRODUCTION AND BACKGROUND
More informationEmpirical Application of Panel Data Regression
Empirical Application of Panel Data Regression 1. We use Fatality data, and we are interested in whether rising beer tax rate can help lower traffic death. So the dependent variable is traffic death, while
More informationBinary Dependent Variables
Binary Dependent Variables In some cases the outcome of interest rather than one of the right hand side variables - is discrete rather than continuous Binary Dependent Variables In some cases the outcome
More informationQuantitative Methods Final Exam (2017/1)
Quantitative Methods Final Exam (2017/1) 1. Please write down your name and student ID number. 2. Calculator is allowed during the exam, but DO NOT use a smartphone. 3. List your answers (together with
More informationPlease discuss each of the 3 problems on a separate sheet of paper, not just on a separate page!
Econometrics - Exam May 11, 2011 1 Exam Please discuss each of the 3 problems on a separate sheet of paper, not just on a separate page! Problem 1: (15 points) A researcher has data for the year 2000 from
More informationA Course in Applied Econometrics Lecture 18: Missing Data. Jeff Wooldridge IRP Lectures, UW Madison, August Linear model with IVs: y i x i u i,
A Course in Applied Econometrics Lecture 18: Missing Data Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. When Can Missing Data be Ignored? 2. Inverse Probability Weighting 3. Imputation 4. Heckman-Type
More informationLecture 3 Linear random intercept models
Lecture 3 Linear random intercept models Example: Weight of Guinea Pigs Body weights of 48 pigs in 9 successive weeks of follow-up (Table 3.1 DLZ) The response is measures at n different times, or under
More informationMonday 7 th Febraury 2005
Monday 7 th Febraury 2 Analysis of Pigs data Data: Body weights of 48 pigs at 9 successive follow-up visits. This is an equally spaced data. It is always a good habit to reshape the data, so we can easily
More informationFortin Econ Econometric Review 1. 1 Panel Data Methods Fixed Effects Dummy Variables Regression... 7
Fortin Econ 495 - Econometric Review 1 Contents 1 Panel Data Methods 2 1.1 Fixed Effects......................... 2 1.1.1 Dummy Variables Regression............ 7 1.1.2 First Differencing Methods.............
More information2. We care about proportion for categorical variable, but average for numerical one.
Probit Model 1. We apply Probit model to Bank data. The dependent variable is deny, a dummy variable equaling one if a mortgage application is denied, and equaling zero if accepted. The key regressor is
More informationpoint estimates, standard errors, testing, and inference for nonlinear combinations
Title xtreg postestimation Postestimation tools for xtreg Description The following postestimation commands are of special interest after xtreg: command description xttest0 Breusch and Pagan LM test for
More informationA Course in Applied Econometrics Lecture 7: Cluster Sampling. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 7: Cluster Sampling Jeff Wooldridge IRP Lectures, UW Madison, August 2008 1. The Linear Model with Cluster Effects 2. Estimation with a Small Number of roups and
More informationDynamic Panels. Chapter Introduction Autoregressive Model
Chapter 11 Dynamic Panels This chapter covers the econometrics methods to estimate dynamic panel data models, and presents examples in Stata to illustrate the use of these procedures. The topics in this
More informationNinth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis"
Ninth ARTNeT Capacity Building Workshop for Trade Research "Trade Flows and Trade Policy Analysis" June 2013 Bangkok, Thailand Cosimo Beverelli and Rainer Lanz (World Trade Organization) 1 Selected econometric
More informationSimultaneous Equations with Error Components. Mike Bronner Marko Ledic Anja Breitwieser
Simultaneous Equations with Error Components Mike Bronner Marko Ledic Anja Breitwieser PRESENTATION OUTLINE Part I: - Simultaneous equation models: overview - Empirical example Part II: - Hausman and Taylor
More informationLab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p )
Lab 3: Two levels Poisson models (taken from Multilevel and Longitudinal Modeling Using Stata, p. 376-390) BIO656 2009 Goal: To see if a major health-care reform which took place in 1997 in Germany was
More informationExam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h.
Exam ECON3150/4150: Introductory Econometrics. 18 May 2016; 09:00h-12.00h. This is an open book examination where all printed and written resources, in addition to a calculator, are allowed. If you are
More informationGeneralized linear models
Generalized linear models Christopher F Baum ECON 8823: Applied Econometrics Boston College, Spring 2016 Christopher F Baum (BC / DIW) Generalized linear models Boston College, Spring 2016 1 / 1 Introduction
More informationmultilevel modeling: concepts, applications and interpretations
multilevel modeling: concepts, applications and interpretations lynne c. messer 27 october 2010 warning social and reproductive / perinatal epidemiologist concepts why context matters multilevel models
More informationLongitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 2017, Chicago, Illinois
Longitudinal Data Analysis Using Stata Paul D. Allison, Ph.D. Upcoming Seminar: May 18-19, 217, Chicago, Illinois Outline 1. Opportunities and challenges of panel data. a. Data requirements b. Control
More information1 The basics of panel data
Introductory Applied Econometrics EEP/IAS 118 Spring 2015 Related materials: Steven Buck Notes to accompany fixed effects material 4-16-14 ˆ Wooldridge 5e, Ch. 1.3: The Structure of Economic Data ˆ Wooldridge
More informationLecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook)
Lecture 9: Panel Data Model (Chapter 14, Wooldridge Textbook) 1 2 Panel Data Panel data is obtained by observing the same person, firm, county, etc over several periods. Unlike the pooled cross sections,
More informationNew Developments in Econometrics Lecture 11: Difference-in-Differences Estimation
New Developments in Econometrics Lecture 11: Difference-in-Differences Estimation Jeff Wooldridge Cemmap Lectures, UCL, June 2009 1. The Basic Methodology 2. How Should We View Uncertainty in DD Settings?
More informationRecent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data
Recent Advances in the Field of Trade Theory and Policy Analysis Using Micro-Level Data July 2012 Bangkok, Thailand Cosimo Beverelli (World Trade Organization) 1 Content a) Classical regression model b)
More informationProblem Set 5 ANSWERS
Economics 20 Problem Set 5 ANSWERS Prof. Patricia M. Anderson 1, 2 and 3 Suppose that Vermont has passed a law requiring employers to provide 6 months of paid maternity leave. You are concerned that women
More informationProblem Set 10: Panel Data
Problem Set 10: Panel Data 1. Read in the data set, e11panel1.dta from the course website. This contains data on a sample or 1252 men and women who were asked about their hourly wage in two years, 2005
More informationSpecification Tests in Unbalanced Panels with Endogeneity.
Specification Tests in Unbalanced Panels with Endogeneity. Riju Joshi Jeffrey M. Wooldridge June 22, 2017 Abstract This paper develops specification tests for unbalanced panels with endogenous explanatory
More informationApplied Econometrics. Lecture 3: Introduction to Linear Panel Data Models
Applied Econometrics Lecture 3: Introduction to Linear Panel Data Models Måns Söderbom 4 September 2009 Department of Economics, Universy of Gothenburg. Email: mans.soderbom@economics.gu.se. Web: www.economics.gu.se/soderbom,
More informationExercices for Applied Econometrics A
QEM F. Gardes-C. Starzec-M.A. Diaye Exercices for Applied Econometrics A I. Exercice: The panel of households expenditures in Poland, for years 1997 to 2000, gives the following statistics for the whole
More informationECON 594: Lecture #6
ECON 594: Lecture #6 Thomas Lemieux Vancouver School of Economics, UBC May 2018 1 Limited dependent variables: introduction Up to now, we have been implicitly assuming that the dependent variable, y, was
More informationAnalyzing Proportions
Institut für Soziologie Eberhard Karls Universität Tübingen www.maartenbuis.nl The problem A proportion is bounded between 0 and 1, this means that: the effect of explanatory variables tends to be non-linear,
More informationA simple alternative to the linear probability model for binary choice models with endogenous regressors
A simple alternative to the linear probability model for binary choice models with endogenous regressors Christopher F Baum, Yingying Dong, Arthur Lewbel, Tao Yang Boston College/DIW Berlin, U.Cal Irvine,
More informationDealing With and Understanding Endogeneity
Dealing With and Understanding Endogeneity Enrique Pinzón StataCorp LP October 20, 2016 Barcelona (StataCorp LP) October 20, 2016 Barcelona 1 / 59 Importance of Endogeneity Endogeneity occurs when a variable,
More informationECON 4551 Econometrics II Memorial University of Newfoundland. Panel Data Models. Adapted from Vera Tabakova s notes
ECON 4551 Econometrics II Memorial University of Newfoundland Panel Data Models Adapted from Vera Tabakova s notes 15.1 Grunfeld s Investment Data 15.2 Sets of Regression Equations 15.3 Seemingly Unrelated
More informationOutline. Linear OLS Models vs: Linear Marginal Models Linear Conditional Models. Random Intercepts Random Intercepts & Slopes
Lecture 2.1 Basic Linear LDA 1 Outline Linear OLS Models vs: Linear Marginal Models Linear Conditional Models Random Intercepts Random Intercepts & Slopes Cond l & Marginal Connections Empirical Bayes
More informationxtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models
xtdpdqml: Quasi-maximum likelihood estimation of linear dynamic short-t panel data models Sebastian Kripfganz University of Exeter Business School, Department of Economics, Exeter, UK UK Stata Users Group
More informationLecture 3.1 Basic Logistic LDA
y Lecture.1 Basic Logistic LDA 0.2.4.6.8 1 Outline Quick Refresher on Ordinary Logistic Regression and Stata Women s employment example Cross-Over Trial LDA Example -100-50 0 50 100 -- Longitudinal Data
More informationLongitudinal Data Analysis. RatSWD Nachwuchsworkshop Vorlesung von Josef Brüderl 25. August, 2009
Longitudinal Data Analysis RatSWD Nachwuchsworkshop Vorlesung von Josef Brüderl 25. August, 2009 Longitudinal Data Analysis Traditional definition Statistical methods for analyzing data with a time dimension
More informationEconometrics Homework 4 Solutions
Econometrics Homework 4 Solutions Computer Question (Optional, no need to hand in) (a) c i may capture some state-specific factor that contributes to higher or low rate of accident or fatality. For example,
More informationDavid M. Drukker Italian Stata Users Group meeting 15 November Executive Director of Econometrics Stata
Estimating the ATE of an endogenously assigned treatment from a sample with endogenous selection by regression adjustment using an extended regression models David M. Drukker Executive Director of Econometrics
More informationCapital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part)
Séminaire d Analyse Economique III (LECON2486) Capital humain, développement et migrations: approche macroéconomique (Empirical Analysis - Static Part) Frédéric Docquier & Sara Salomone IRES UClouvain
More informationUniversity of California at Berkeley Fall Introductory Applied Econometrics Final examination. Scores add up to 125 points
EEP 118 / IAS 118 Elisabeth Sadoulet and Kelly Jones University of California at Berkeley Fall 2008 Introductory Applied Econometrics Final examination Scores add up to 125 points Your name: SID: 1 1.
More informationEconometrics I KS. Module 1: Bivariate Linear Regression. Alexander Ahammer. This version: March 12, 2018
Econometrics I KS Module 1: Bivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: March 12, 2018 Alexander Ahammer (JKU) Module 1: Bivariate
More informationINTRODUCTION TO BASIC LINEAR REGRESSION MODEL
INTRODUCTION TO BASIC LINEAR REGRESSION MODEL 13 September 2011 Yogyakarta, Indonesia Cosimo Beverelli (World Trade Organization) 1 LINEAR REGRESSION MODEL In general, regression models estimate the effect
More informationPanel data methods for policy analysis
IAPRI Quantitative Analysis Capacity Building Series Panel data methods for policy analysis Part I: Linear panel data models Outline 1. Independently pooled cross sectional data vs. panel/longitudinal
More informationLecture#12. Instrumental variables regression Causal parameters III
Lecture#12 Instrumental variables regression Causal parameters III 1 Demand experiment, market data analysis & simultaneous causality 2 Simultaneous causality Your task is to estimate the demand function
More informationNon-linear panel data modeling
Non-linear panel data modeling Laura Magazzini University of Verona laura.magazzini@univr.it http://dse.univr.it/magazzini May 2010 Laura Magazzini (@univr.it) Non-linear panel data modeling May 2010 1
More informationBasic Regressions and Panel Data in Stata
Developing Trade Consultants Policy Research Capacity Building Basic Regressions and Panel Data in Stata Ben Shepherd Principal, Developing Trade Consultants 1 Basic regressions } Stata s regress command
More informationFixed and Random Effects Models: Vartanian, SW 683
: Vartanian, SW 683 Fixed and random effects models See: http://teaching.sociology.ul.ie/dcw/confront/node45.html When you have repeated observations per individual this is a problem and an advantage:
More informationHomework Solutions Applied Logistic Regression
Homework Solutions Applied Logistic Regression WEEK 6 Exercise 1 From the ICU data, use as the outcome variable vital status (STA) and CPR prior to ICU admission (CPR) as a covariate. (a) Demonstrate that
More informationUsing generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins-
Using generalized structural equation models to fit customized models without programming, and taking advantage of new features of -margins- Isabel Canette Principal Mathematician and Statistician StataCorp
More information(a) Briefly discuss the advantage of using panel data in this situation rather than pure crosssections
Answer Key Fixed Effect and First Difference Models 1. See discussion in class.. David Neumark and William Wascher published a study in 199 of the effect of minimum wages on teenage employment using a
More informationNonlinear Econometric Analysis (ECO 722) : Homework 2 Answers. (1 θ) if y i = 0. which can be written in an analytically more convenient way as
Nonlinear Econometric Analysis (ECO 722) : Homework 2 Answers 1. Consider a binary random variable y i that describes a Bernoulli trial in which the probability of observing y i = 1 in any draw is given
More informationWarwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation
Warwick Economics Summer School Topics in Microeconometrics Instrumental Variables Estimation Michele Aquaro University of Warwick This version: July 21, 2016 1 / 31 Reading material Textbook: Introductory
More informationESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics
ESTIMATING AVERAGE TREATMENT EFFECTS: REGRESSION DISCONTINUITY DESIGNS Jeff Wooldridge Michigan State University BGSE/IZA Course in Microeconometrics July 2009 1. Introduction 2. The Sharp RD Design 3.
More informationDescription Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see
Title stata.com hausman Hausman specification test Description Quick start Menu Syntax Options Remarks and examples Stored results Methods and formulas Acknowledgment References Also see Description hausman
More informationConfidence intervals for the variance component of random-effects linear models
The Stata Journal (2004) 4, Number 4, pp. 429 435 Confidence intervals for the variance component of random-effects linear models Matteo Bottai Arnold School of Public Health University of South Carolina
More informationEstimating the Fractional Response Model with an Endogenous Count Variable
Estimating the Fractional Response Model with an Endogenous Count Variable Estimating FRM with CEEV Hoa Bao Nguyen Minh Cong Nguyen Michigan State Universtiy American University July 2009 Nguyen and Nguyen
More informationShort T Panels - Review
Short T Panels - Review We have looked at methods for estimating parameters on time-varying explanatory variables consistently in panels with many cross-section observation units but a small number of
More information1. Overview of the Basic Model
IRP Lectures Madison, WI, August 2008 Lectures 3 & 4, Monday, August 4, 11:15-12:30 and 1:30-2:30 Linear Panel Data Models These notes cover some recent topics in linear panel data models. They begin with
More informationDifference-in-Differences Estimation
Difference-in-Differences Estimation Jeff Wooldridge Michigan State University Programme Evaluation for Policy Analysis Institute for Fiscal Studies June 2012 1. The Basic Methodology 2. How Should We
More informationEconometrics for PhDs
Econometrics for PhDs Amine Ouazad April 2012, Final Assessment - Answer Key 1 Questions with a require some Stata in the answer. Other questions do not. 1 Ordinary Least Squares: Equality of Estimates
More informationReview of Panel Data Model Types Next Steps. Panel GLMs. Department of Political Science and Government Aarhus University.
Panel GLMs Department of Political Science and Government Aarhus University May 12, 2015 1 Review of Panel Data 2 Model Types 3 Review and Looking Forward 1 Review of Panel Data 2 Model Types 3 Review
More information-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model
-redprob- A Stata program for the Heckman estimator of the random effects dynamic probit model Mark B. Stewart University of Warwick January 2006 1 The model The latent equation for the random effects
More informationEconometrics I KS. Module 2: Multivariate Linear Regression. Alexander Ahammer. This version: April 16, 2018
Econometrics I KS Module 2: Multivariate Linear Regression Alexander Ahammer Department of Economics Johannes Kepler University of Linz This version: April 16, 2018 Alexander Ahammer (JKU) Module 2: Multivariate
More informationTopic 10: Panel Data Analysis
Topic 10: Panel Data Analysis Advanced Econometrics (I) Dong Chen School of Economics, Peking University 1 Introduction Panel data combine the features of cross section data time series. Usually a panel
More informationLecture 7: OLS with qualitative information
Lecture 7: OLS with qualitative information Dummy variables Dummy variable: an indicator that says whether a particular observation is in a category or not Like a light switch: on or off Most useful values:
More informationERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data
ERSA Training Workshop Lecture 5: Estimation of Binary Choice Models with Panel Data Måns Söderbom Friday 16 January 2009 1 Introduction The methods discussed thus far in the course are well suited for
More informationLecture 8 Panel Data
Lecture 8 Panel Data Economics 8379 George Washington University Instructor: Prof. Ben Williams Introduction This lecture will discuss some common panel data methods and problems. Random effects vs. fixed
More informationLecture 8: Instrumental Variables Estimation
Lecture Notes on Advanced Econometrics Lecture 8: Instrumental Variables Estimation Endogenous Variables Consider a population model: y α y + β + β x + β x +... + β x + u i i i i k ik i Takashi Yamano
More informationProblem set - Selection and Diff-in-Diff
Problem set - Selection and Diff-in-Diff 1. You want to model the wage equation for women You consider estimating the model: ln wage = α + β 1 educ + β 2 exper + β 3 exper 2 + ɛ (1) Read the data into
More informationGeneralized method of moments estimation in Stata 11
Generalized method of moments estimation in Stata 11 David M. Drukker StataCorp Stata Conference Washington, DC 2009 1 / 27 Outline 1 A quick introduction to GMM 2 gmm examples Ordinary least squares Two-stage
More informationGMM Estimation in Stata
GMM Estimation in Stata Econometrics I Department of Economics Universidad Carlos III de Madrid Master in Industrial Economics and Markets 1 Outline Motivation 1 Motivation 2 3 4 2 Motivation 3 Stata and
More informationStata tip 63: Modeling proportions
The Stata Journal (2008) 8, Number 2, pp. 299 303 Stata tip 63: Modeling proportions Christopher F. Baum Department of Economics Boston College Chestnut Hill, MA baum@bc.edu You may often want to model
More informationEconometrics of Panel Data
Econometrics of Panel Data Jakub Mućk Meeting # 1 Jakub Mućk Econometrics of Panel Data Meeting # 1 1 / 31 Outline 1 Course outline 2 Panel data Advantages of Panel Data Limitations of Panel Data 3 Pooled
More informationECONOMETRICS FIELD EXAM Michigan State University May 9, 2008
ECONOMETRICS FIELD EXAM Michigan State University May 9, 2008 Instructions: Answer all four (4) questions. Point totals for each question are given in parenthesis; there are 00 points possible. Within
More informationMeasurement Error. Often a data set will contain imperfect measures of the data we would ideally like.
Measurement Error Often a data set will contain imperfect measures of the data we would ideally like. Aggregate Data: (GDP, Consumption, Investment are only best guesses of theoretical counterparts and
More informationEC327: Advanced Econometrics, Spring 2007
EC327: Advanced Econometrics, Spring 2007 Wooldridge, Introductory Econometrics (3rd ed, 2006) Chapter 14: Advanced panel data methods Fixed effects estimators We discussed the first difference (FD) model
More informationPanel Data: Fixed and Random Effects
Short Guides to Microeconometrics Fall 2016 Kurt Schmidheiny Unversität Basel Panel Data: Fixed and Random Effects 1 Introduction In panel data, individuals (persons, firms, cities, ) are observed at several
More information8. Nonstandard standard error issues 8.1. The bias of robust standard errors
8.1. The bias of robust standard errors Bias Robust standard errors are now easily obtained using e.g. Stata option robust Robust standard errors are preferable to normal standard errors when residuals
More informationCourse Econometrics I
Course Econometrics I 3. Multiple Regression Analysis: Binary Variables Martin Halla Johannes Kepler University of Linz Department of Economics Last update: April 29, 2014 Martin Halla CS Econometrics
More informationCENSORED DATA AND CENSORED NORMAL REGRESSION
CENSORED DATA AND CENSORED NORMAL REGRESSION Data censoring comes in many forms: binary censoring, interval censoring, and top coding are the most common. They all start with an underlying linear model
More informationGreene, Econometric Analysis (7th ed, 2012)
EC771: Econometrics, Spring 2012 Greene, Econometric Analysis (7th ed, 2012) Chapters 2 3: Classical Linear Regression The classical linear regression model is the single most useful tool in econometrics.
More informationA Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II. Jeff Wooldridge IRP Lectures, UW Madison, August 2008
A Course in Applied Econometrics Lecture 4: Linear Panel Data Models, II Jeff Wooldridge IRP Lectures, UW Madison, August 2008 5. Estimating Production Functions Using Proxy Variables 6. Pseudo Panels
More information****Lab 4, Feb 4: EDA and OLS and WLS
****Lab 4, Feb 4: EDA and OLS and WLS ------- log: C:\Documents and Settings\Default\Desktop\LDA\Data\cows_Lab4.log log type: text opened on: 4 Feb 2004, 09:26:19. use use "Z:\LDA\DataLDA\cowsP.dta", clear.
More informationGreene, Econometric Analysis (6th ed, 2008)
EC771: Econometrics, Spring 2010 Greene, Econometric Analysis (6th ed, 2008) Chapter 17: Maximum Likelihood Estimation The preferred estimator in a wide variety of econometric settings is that derived
More informationMicroeconometrics (PhD) Problem set 2: Dynamic Panel Data Solutions
Microeconometrics (PhD) Problem set 2: Dynamic Panel Data Solutions QUESTION 1 Data for this exercise can be prepared by running the do-file called preparedo posted on my webpage This do-file collects
More informationMultivariate probit regression using simulated maximum likelihood
Multivariate probit regression using simulated maximum likelihood Lorenzo Cappellari & Stephen P. Jenkins ISER, University of Essex stephenj@essex.ac.uk 1 Overview Introduction and motivation The model
More informationSpecification testing in panel data models estimated by fixed effects with instrumental variables
Specification testing in panel data models estimated by fixed effects wh instrumental variables Carrie Falls Department of Economics Michigan State Universy Abstract I show that a handful of the regressions
More informationExtended regression models using Stata 15
Extended regression models using Stata 15 Charles Lindsey Senior Statistician and Software Developer Stata July 19, 2018 Lindsey (Stata) ERM July 19, 2018 1 / 103 Introduction Common problems in observational
More informationECON2228 Notes 10. Christopher F Baum. Boston College Economics. cfb (BC Econ) ECON2228 Notes / 48
ECON2228 Notes 10 Christopher F Baum Boston College Economics 2014 2015 cfb (BC Econ) ECON2228 Notes 10 2014 2015 1 / 48 Serial correlation and heteroskedasticity in time series regressions Chapter 12:
More informationEconometrics. Week 6. Fall Institute of Economic Studies Faculty of Social Sciences Charles University in Prague
Econometrics Week 6 Institute of Economic Studies Faculty of Social Sciences Charles University in Prague Fall 2012 1 / 21 Recommended Reading For the today Advanced Panel Data Methods. Chapter 14 (pp.
More information